When Conservatives See Red but Liberals Feel Blue: Why Labeler-Characteristic Bias Matters for Data Annotation

60 Pages Posted: 13 Sep 2023

See all articles by Nora Webb Williams

Nora Webb Williams

University of Illinois at Urbana-Champaign

Andreu Casas

Vrije Universiteit Amsterdam, Department of Communication Science

Kevin Aslett

NYU - Center for Social Media and Politics

John D. Wilkerson

University of Washington

Date Written: August 14, 2023

Abstract

Human annotation of data, including text and image materials, is a bedrock of political science research. Yet we often overlook how the identities of our annotators may systematically affect their labels. We call the sensitivity of labels to annotator identity "labeler-characteristic bias" (LCB). We demonstrate the persistence and risks of LCB for downstream analyses in two examples, first with image data from the United States and second with text data from the Netherlands. In both examples we observe significant differences in annotations based on annotator gender and political identity. After laying out a general typology of annotator biases and their relationship to inter-rater reliability, we provide suggestions and solutions for how to handle LCB. The first step to addressing LCB is to recruit a diverse labeler corps and test for LCB. Where LCB is found, solutions are modeling subgroup effects or generating composite labels based on target population demographics.

Keywords: human annotation, labeler characteristic bias, inter-rater reliability, text as data, images as data

Suggested Citation

Webb Williams, Nora and Casas, Andreu and Aslett, Kevin and Wilkerson, John D., When Conservatives See Red but Liberals Feel Blue: Why Labeler-Characteristic Bias Matters for Data Annotation (August 14, 2023). Available at SSRN: https://ssrn.com/abstract=4540742 or http://dx.doi.org/10.2139/ssrn.4540742

Nora Webb Williams (Contact Author)

University of Illinois at Urbana-Champaign ( email )

702 S. Wright Street
Urbana, IL 61801
United States

Andreu Casas

Vrije Universiteit Amsterdam, Department of Communication Science ( email )

De Boelelaan 1105
Amsterdam, 1081HV
Netherlands

Kevin Aslett

NYU - Center for Social Media and Politics ( email )

Bobst Library, E-resource Acquisitions
20 Cooper Square 3rd Floor
New York, NY 10003-711
United States

John D. Wilkerson

University of Washington ( email )

Seattle, WA 98195
United States

Do you have negative results from your research you’d like to share?

Paper statistics

Downloads
29
Abstract Views
127
PlumX Metrics