I have two columns of data (say id and master_id) in R. Every value in id should also be present in master_id, but I suspect that is not the case, and I want to identify which values are erroneous. I cannot simply inspect the data by eye, as it has on the order of 100k entries.
How do I go about finding the erroneous values?
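
One possible approach, as a minimal sketch: it assumes the two columns live in a data frame called df (a hypothetical name, with made-up values used purely for illustration). Base R's setdiff() and %in% should both scale comfortably to data of this size.

```r
# Hypothetical example data; `df` and its values are invented for illustration.
df <- data.frame(
  id        = c(1, 2, 3, 7, 9),
  master_id = c(1, 2, 3, 4, 5)
)

# Distinct values of id that never appear anywhere in master_id
missing_ids <- setdiff(df$id, df$master_id)

# Full rows whose id has no match in master_id
bad_rows <- df[!(df$id %in% df$master_id), ]

print(missing_ids)
print(bad_rows)
```

setdiff() returns each offending value once, while the %in% filter keeps every offending row, which helps if the same bad id occurs several times.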