Python | Pandas Index.duplicated()

Python | Pandas Index.get_duplicates()

Last Updated : 17 Dec, 2018

Python is a great language for doing data analysis, primarily because of the fantastic ecosystem of data-centric python packages. Pandas is one of those packages and makes importing and analyzing data much easier.

Pandas Index.get_duplicates() function extract duplicated index elements. This function returns a sorted list of index elements which appear more than once in the Index.

Syntax: Index.get_duplicates()

Returns : List of duplicated indexes.

Example #1: Use Index.get_duplicates() function to find all the duplicate values in the Index.

# importing pandas as pd 
import pandas as pd 
  
# Creating the Index 
idx = pd.Index(['Labrador', 'Beagle', 'Labrador', 
                    'Lhasa', 'Husky', 'Beagle']) 
  
# Print the Index 
idx 

Output :

let’s find out all the duplicate values in the Index.

# print the duplicated values. 
idx.get_duplicates() 

Output :

As we can see in the output, the Index.get_duplicates() function has returned all the values which are having more than one occurrence in the Index.

Example #2: Use Index.get_duplicates() function to find all the duplicate in the Index. The Index also contains NaN values.

# importing pandas as pd 
import pandas as pd 
  
# Creating the Index 
idx = pd.Index(['Labrador', 'Beagle', None, 'Labrador', 
             'Lhasa', 'Husky', 'Beagle', None, 'Koala']) 
  
# Print the Index 
idx 

Output :

As we can see in the output we are having some missing values. Lets see how the Index.get_duplicates() function treats them.

# print the duplicate values in Index 
idx.get_duplicates() 

Output :

The occurrence of missing values more than once has been treated as duplicates.

Python | Pandas Index.duplicated()

S

Shubham__Ranjan

News

Improve

Article Tags :

Practice Tags :

python

Similar Reads

Python | Pandas Index.get_duplicates()

Python is a great language for doing data analysis, primarily because of the fantastic ecosystem of data-centric python packages. Pandas is one of those packages and makes importing and analyzing data much easier. Pandas Index.get_duplicates() function extract duplicated index elements. This functio

Python | Pandas Index.duplicated()

The Index.duplicated() method in Pandas is a powerful tool for identifying duplicate values within an index. It returns a boolean array where duplicates are marked as True based on the specified criteria and False denotes unique values or the first occurrence of duplicates. This method is especially

Python | Pandas Index.drop_duplicates()

Pandas Index.drop_duplicates() function return Index with duplicate values removed in Python. Syntax of Pandas Index.drop_duplicates() Syntax: Index.drop_duplicates(labels, errors='raise') Parameters : keep : {â€˜firstâ€™, â€˜lastâ€™, False} â€˜firstâ€™ : Drop duplicates except for the first occurrence.(default

Python | Pandas Index.data

Pandas Index is an immutable ndarray implementing an ordered, sliceable set. It is the basic object which stores the axis labels for all pandas objects. Pandas Index.data attribute return the data pointer of the underlying data of the given Index object. Syntax: Index.data Parameter : None Returns :

Python | Pandas DatetimeIndex.date

Python is a great language for doing data analysis, primarily because of the fantastic ecosystem of data-centric python packages. Pandas is one of those packages and makes importing and analyzing data much easier. Pandas DatetimeIndex.date attribute outputs an Index object containing the date values

Python | Pandas Index.flags

Pandas Index is an immutable ndarray implementing an ordered, sliceable set. It is the basic object which stores the axis labels for all pandas objects. Pandas Index.flags attribute return the status of all the flags for the given Index object. Syntax: Index.flags Parameter : None Returns : status o

Python | Pandas Index.equals()

Python is a great language for doing data analysis, primarily because of the fantastic ecosystem of data-centric python packages. Pandas is one of those packages and makes importing and analyzing data much easier. Pandas Index.equals() function determine if two Index objects contains the same elemen

Python | Pandas Index.identical()

Python is a great language for doing data analysis, primarily because of the fantastic ecosystem of data-centric python packages. Pandas is one of those packages and makes importing and analyzing data much easier. Pandas Index.identical() function determine if two Index objects contains the same ele

Python | Pandas Index.asof()

Python is a great language for doing data analysis, primarily because of the fantastic ecosystem of data-centric python packages. Pandas is one of those packages and makes importing and analyzing data much easier. Pandas Index.asof() function returns return the label from the index, or, if not prese

Python | Pandas Index.drop()

Python is a great language for doing data analysis, primarily because of the fantastic ecosystem of data-centric python packages. Pandas is one of those packages and makes importing and analyzing data much easier. Pandas Index.drop() function make new Index with passed list of labels deleted. The fu