To reduce privacy risk, I want to scrub the SSNs from my input, but in a way that preserves uniqueness: the same SSN should always map to the same replacement, and different SSNs should map to different replacements. For example, given 111-11-1119, 111-11-1119, and 111-11-1118, I could replace 111-11-1119 with 1 and 111-11-1118 with 2.
What's the best way to do that?
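
To make the goal concrete, here is a minimal sketch of the numbering I have in mind. It assumes the input is a list of text lines and that SSNs always appear in the dashed NNN-NN-NNNN form; the names `SSN_PATTERN` and `scrub_ssns` are made up for illustration, not taken from any library.

```python
import re

# Assumption: SSNs always appear in the dashed NNN-NN-NNNN form.
SSN_PATTERN = re.compile(r"\b\d{3}-\d{2}-\d{4}\b")


def scrub_ssns(lines):
    """Replace each distinct SSN with a sequential number (1, 2, ...).

    The same SSN always gets the same number, so uniqueness is preserved.
    """
    ids = {}  # maps each SSN to its replacement number, in order of first appearance

    def replace(match):
        ssn = match.group(0)
        if ssn not in ids:
            ids[ssn] = len(ids) + 1
        return str(ids[ssn])

    return [SSN_PATTERN.sub(replace, line) for line in lines]


print(scrub_ssns(["111-11-1119", "111-11-1119", "111-11-1118"]))
```

The `ids` dictionary lives only inside the function call, so the mapping back to the real SSNs is discarded once the scrubbed lines are returned.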
