In pandas, I am using drop_duplicates(subset='Full Name', inplace=True) to remove duplicate records. However, I would like to view the removed records for manual inspection. How can I do this?
1 Answer
Use duplicated with the parameter keep=False to flag all duplicates (every row whose Full Name appears more than once, including the first occurrence), then filter with boolean indexing:
import pandas as pd

df = pd.DataFrame({'Full Name': list('abcdac'),
                   'Some val': [4, 5, 4, 5, 5, 4]})
print(df)
  Full Name  Some val
0         a         4
1         b         5
2         c         4
3         d         5
4         a         5
5         c         4
# keep=False marks every occurrence of a repeated name, including the first
df = df[df.duplicated(subset='Full Name', keep=False)]
print(df)
  Full Name  Some val
0         a         4
2         c         4
4         a         5
5         c         4
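If you only want the rows that drop_duplicates would actually discard, rather than every member of each duplicate group, a minimal sketch is to rely on the default keep='first' of duplicated. The variable names mask, removed and deduped are illustrative, and this assumes you call drop_duplicates with its default keep as in the question:

```python
import pandas as pd

df = pd.DataFrame({'Full Name': list('abcdac'),
                   'Some val': [4, 5, 4, 5, 5, 4]})

# duplicated() defaults to keep='first', so the mask is True only for the
# later occurrences, i.e. the rows drop_duplicates would discard.
mask = df.duplicated(subset='Full Name')

removed = df[mask]    # rows to inspect manually
deduped = df[~mask]   # same rows as df.drop_duplicates(subset='Full Name')

print(removed)
```

Building the mask once, before dropping anything, lets you keep both the removed rows and the de-duplicated frame without calling drop_duplicates in place.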
 
    
    
        jezrael
        
 
    