94 Pages Posted: 27 Jun 2014 Last revised: 28 Nov 2014
Date Written: July 6, 2014
This article reviews available text analysis approaches and their limited success generating rich, transparent databases at scale. After elucidating the precise difficulties faced by previous projects and methods, the article introduces a new approach to text analysis featuring innovative procedures and open source tools to optimally combine the efforts of researchers, crowds, and algorithms. The article describes the first substantive project deploying the approach, imagines other projects on theoretical/empirical frontiers enabled by the approach, and closes with promising implications for social science when researchers are able to generate larger, more complex, transparent databases faster.
Keywords: text analysis, crowdsourcing, machine learning, event analysis, crowd work, content analysis, algorithms
Suggested Citation: Suggested Citation
Adams, Nicholas, Researchers to Crowds to Algorithms: Building Large, Complex, and Transparent Databases from Text in the Age of Data Science (July 6, 2014). Available at SSRN: https://ssrn.com/abstract=2459325 or http://dx.doi.org/10.2139/ssrn.2459325