Text As Data

63 Pages Posted: 17 Mar 2017 Last revised: 14 Jul 2018

See all articles by Matthew Gentzkow

Matthew Gentzkow

Stanford University

Bryan T. Kelly

Yale SOM; AQR Capital Management, LLC; National Bureau of Economic Research (NBER)

Matt Taddy

University of Chicago

Multiple version iconThere are 2 versions of this paper

Date Written: February 15, 2017


An ever increasing share of human interaction, communication, and culture is recorded as digital text. We provide an introduction to the use of text as an input to economic research. We discuss the features that make text different from other forms of data, offer a practical overview of relevant statistical methods, and survey a variety of applications.

Suggested Citation

Gentzkow, Matthew and Kelly, Bryan T. and Taddy, Matt, Text As Data (February 15, 2017). Available at SSRN: https://ssrn.com/abstract=2934001 or http://dx.doi.org/10.2139/ssrn.2934001

Matthew Gentzkow

Stanford University ( email )

Bryan T. Kelly (Contact Author)

Yale SOM ( email )

135 Prospect Street
P.O. Box 208200
New Haven, CT 06520-8200
United States

AQR Capital Management, LLC ( email )

Greenwich, CT
United States

National Bureau of Economic Research (NBER) ( email )

1050 Massachusetts Avenue
Cambridge, MA 02138
United States

Matt Taddy

University of Chicago ( email )

Do you have negative results from your research you’d like to share?

Paper statistics

Abstract Views
PlumX Metrics