Predicting Social Security Numbers from Public Data
Proceedings of the National Academy of Sciences, 106(27), 10975--10980 (2009)
6 Pages Posted: 6 Jan 2019
Date Written: 2009
Information about an individual’s place and date of birth can be exploited to predict his or her Social Security number (SSN). Using only publicly available information, we observed a correlation between individuals’ SSNs and their birth data and found that for younger cohorts the correlation allows statistical inference of private SSNs. The inferences are made possible by the public availability of the Social Security Administration’s Death Master File and the widespread accessibility of personal information from multiple sources, such as data brokers or profiles on social networking sites. Our results highlight the unexpected privacy consequences of the complex interactions among multiple data sources in modern information economies and quantify privacy risks associated with information revelation in public forums.
Keywords: identity theft, online social networks, privacy, statistical reidentification
Suggested Citation: Suggested Citation