I have a dataset with 1,00,000 records. I need to find euclidean distance matrix for this dataset. It should create 1,00,000*1,00,000 matrix. In python we have squareform(pdist(x)). As i cannot perform the same function on the rdd, How to do it on spark platform in python?
Asked
Active
Viewed 225 times
3
Munindra Naidu
- 31
- 1