Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries

47 Pages Posted: 20 Dec 2016 Last revised: 29 Mar 2017

Alexandra Olteanu

Independent

Carlos Castillo

Eurecat Technology Center of Catalonia

Fernando Diaz

Microsoft Research

Emre Kiciman

Microsoft Research

Date Written: December 20, 2016

Abstract

Social data in digital form, which includes user-generated content, expressed or implicit relationships between people, and behavioral traces, are at the core of many popular applications and platforms, driving the research agenda of many researchers. The promises of social data are many, including understanding "what the world thinks" about a social issue, brand, product, celebrity, or other entity, as well as enabling better decision-making in a variety of fields including public policy, healthcare, and economics. Many academics and practitioners have warned against the naïve usage of social data. There are biases and inaccuracies occurring at the source of the data, but also introduced during processing. There are methodological limitations and pitfalls, as well as ethical boundaries and unexpected consequences that are often overlooked. This survey recognizes the rigor with which these issues are addressed by different researchers varies across a wide range. We present a framework for identifying a broad variety of menaces in the research and practices around social data use.

Keywords: Social media, user-generated content, behavioral traces, data biases, evaluation, ethical challanges

Suggested Citation

Olteanu, Alexandra and Castillo, Carlos and Diaz, Fernando and Kiciman, Emre, Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries (December 20, 2016). Available at SSRN: https://ssrn.com/abstract=2886526

Carlos Castillo

Eurecat Technology Center of Catalonia ( email )

Diagonal 177
9th Floor
Barcelona, Barcelona 08018
Spain

Fernando Diaz

Microsoft Research ( email )

641 Avenue of Americas
New York, NY 10011
United States

Emre Kiciman

Microsoft Research ( email )

Building 99
Redmond, WA
United States

HOME PAGE: http://research.microsoft.com/~emrek/

Paper statistics

Downloads
387
Rank
61,074
Abstract Views
2,326