Don't Get Duped: Fraud through Duplication in Public Opinion Surveys
Statistical Journal of the IAOS, Forthcoming
28 Pages. Posted: 20 Mar 2015. Last revised: 23 Feb 2016
Date Written: December 12, 2015
Abstract
Fraud in survey research can take many forms, and one common form is the duplication of valid interviews. Duplicating a valid interview has several advantages for the fabricator: expected relationships between the variables will hold across the data set and, if done across a number of interviews, the approach can evade many standard fraud-detection techniques, such as straight-lining analysis and the application of Benford's law. In this paper, we consider the likelihood of encountering near duplicates in survey data, suggest methods to fingerprint suspicious observations, report on our analysis of over 1,000 publicly available survey datasets, and argue that nearly one in five widely used country-year surveys from major international data sets have exact or near duplicates in excess of 5% of observations.
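To make the idea of flagging near duplicates concrete, the following is a minimal sketch, not the paper's actual procedure. It assumes each interview is a row of coded answers of equal length, scores each row by the highest share of answers it shares with any other row, and flags rows at or above a cutoff. The function names and the 0.85 default threshold are illustrative assumptions, not values taken from the abstract.

```python
from typing import List, Sequence


def max_share_matching(rows: Sequence[Sequence[object]]) -> List[float]:
    """For each row, return the highest share of identical answers
    it has in common with any other row (0.0 if there is no other row).

    Assumes all rows are non-empty and have the same number of variables.
    """
    n = len(rows)
    best = [0.0] * n
    for i in range(n):
        for j in range(i + 1, n):
            a, b = rows[i], rows[j]
            if not a or len(a) != len(b):
                raise ValueError("rows must be non-empty and of equal length")
            # Count variables on which the two interviews agree exactly.
            same = sum(1 for x, y in zip(a, b) if x == y)
            share = same / len(a)
            # The comparison is symmetric, so update both rows.
            if share > best[i]:
                best[i] = share
            if share > best[j]:
                best[j] = share
    return best


def flag_near_duplicates(
    rows: Sequence[Sequence[object]], threshold: float = 0.85
) -> List[int]:
    """Return indices of rows whose best match meets the threshold.

    The default threshold is a hypothetical choice for illustration.
    """
    scores = max_share_matching(rows)
    return [i for i, s in enumerate(scores) if s >= threshold]


# Toy data: rows 0 and 1 are exact duplicates, row 2 differs on one answer.
interviews = [
    [1, 2, 3, 4],
    [1, 2, 3, 4],
    [1, 2, 3, 5],
    [4, 3, 2, 1],
]
scores = max_share_matching(interviews)
flagged = flag_near_duplicates(interviews)
```

The pairwise comparison is quadratic in the number of interviews, which is acceptable for a single country-year survey but would need a faster approach for very large files. The share of flagged rows can then be compared against a benchmark such as the 5% of observations mentioned above.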
Keywords: survey research, duplicates, near duplicates, survey methodology, curb stoning, falsification, fraud, surveys