I have two dataframes that have the following columns : Phone, Email and Name
Dataframe1 has 20k in length, whereas dataframe2 has 1k length. I would like to fill the blanks in the Phone column in dataframe1 with the phone numbers in dataframe2 using the email as a match index between the two dataframes.
What is the best way to do this? I have tried combine_frist() and Merge() but combine_first() returns the value in the same row rather than the value that matches the email address. Merge() resulted in the same thing.
Am I wrong to think I need to set email as an index and then map phones to that index? I feel like this is correct but I simply do not know how to do this. Any help is appreciated! Thank you :)
Example :
In [1] 
import pandas as pd
df1 = pd.DataFrame({'Phone': [1, NaN, 3, 4, 5, NaN, 7], 
                   'Name': ['Bob', 'Jon', 'Iris', 'Jacob','Donald','Beatrice','Jane'],
                   'Email': ['bob@gmail.com','jon@gmail.com','iris@gmail.com','jacob@gmail.com','donald@gmail.com','beatrice@gmail.com','jane@gmail.cm'})
df2 = pd.DataFrame({'Phone': [2, 1, 3, 5],
                  'Name': ['Jon', 'Bob', 'Donald'],
                  'Email': ['jon@gmail.com','bob@gmail.com', 'donald@gmail.com'})
In [2]: df1 
Out [2]:
Phone  Name  Email
1      Bob   bob@gmail.com
NaN    Jon   jon@gmail.com
3      Iris  iris@gmail.com
4      Jac   jacob@gmail.com
5      Don   donald@gmail.com
NaN    Bea   beatrice@gmail.com
7      Jane  jane@gmail.com
x 20000 len
In [3]: df2
Out [3]:
Phone  Name  Email
2      Jon   jon@gmail.com
1      Bob   bob@gmail.com
6      Bea   beatrice@gmail.com
5      Don   donald@gmail.com
x 1100 len
What I've tried
In [4]: df3 = pd.merge(df1,df2, on="Email", how="left")
Out [4]:
Phone  Name  Email
1      Bob   bob@gmail.com
1      Jon   jon@gmail.com
3      Iris  iris@gmail.com
4      Jac   jacob@gmail.com
5      Don   donald@gmail.com
NaN    Bea   beatrice@gmail.com
7      Jane  jane@gmail.com
In [5]: df3 = df1.combine_first(df2)
Out [5]:
Phone  Name  Email
1      Bob   bob@gmail.com
1      Jon   jon@gmail.com
3      Iris  iris@gmail.com
4      Jac   jacob@gmail.com
5      Don   donald@gmail.com
NaN    Bea   beatrice@gmail.com
7      Jane  jane@gmail.com
What I would like it to look like:
In [6]: df3
Out [6]
1      Bob   bob@gmail.com
2      Jon   jon@gmail.com
3      Iris  iris@gmail.com
4      Jac   jacob@gmail.com
5      Don   donald@gmail.com
6      Bea   beatrice@gmail.com
7      Jane  jane@gmail.com
 
     
    