Generalizability: Machine Learning and Humans-in-the-Loop

RESEARCH HANDBOOK ON BIG DATA LAW (Roland Vogl, ed., Edward Elgar, 2020 Forthcoming)

NYU School of Law, Public Law Research Paper No. 20-7

21 Pages Posted: 10 Jul 2019 Last revised: 15 Jun 2020

See all articles by John Nay

John Nay

Stanford University - CodeX - Center for Legal Informatics; New York University (NYU)

Katherine J. Strandburg

New York University School of Law

Date Written: December 17, 2019

Abstract

Automated decision tools, which increasingly rely on machine learning (ML), are used in decision systems that permeate our lives. Examples range from high-stakes decision systems for offering credit, university admissions and employment, to decision systems serving advertising. Here, we consider data-driven tools that attempt to predict likely behavior of individuals. The debate about ML-based decision-making has spawned an important multi-disciplinary literature, which has focused primarily on fairness, accountability, and transparency. We have been struck, however, by the lack of attention to generalizability in the scholarly and policy discourse about whether and how to incorporate automated decision tools into decision systems.

This chapter explores the relationship between generalizability and the division of labor between humans and machines in decision systems. An automated decision tool is generalizable to the extent that it produces outputs that are as correct as the outputs it produced on the data used to create it. The generalizability of a ML model depends on the training process, data availability, and the underlying predictability of the outcome that it models. Ultimately, whether a tool’s generalizability is adequate for a particular decision system depends on how it is deployed, usually in conjunction with human adjudicators. Taking generalizability explicitly into account highlights important aspects of decision system design, as well as important normative trade-offs, that might otherwise be missed.

Keywords: machine learning, artificial intelligence, prediction, validation, automated decision-making, policy, rules, standards, big data, law, legal

Suggested Citation

Nay, John and Strandburg, Katherine J., Generalizability: Machine Learning and Humans-in-the-Loop (December 17, 2019). RESEARCH HANDBOOK ON BIG DATA LAW (Roland Vogl, ed., Edward Elgar, 2020 Forthcoming), NYU School of Law, Public Law Research Paper No. 20-7, Available at SSRN: https://ssrn.com/abstract=3417436 or http://dx.doi.org/10.2139/ssrn.3417436

John Nay (Contact Author)

Stanford University - CodeX - Center for Legal Informatics ( email )

HOME PAGE: http://law.stanford.edu/directory/john-nay/

New York University (NYU) ( email )

Bobst Library, E-resource Acquisitions
20 Cooper Square 3rd Floor
New York, NY 10003-711
United States

HOME PAGE: http://nyu.edu

Katherine J. Strandburg

New York University School of Law ( email )

40 Washington Square South
New York, NY 10012-1099
United States

HOME PAGE: http://rb.gy/no3i9t

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
454
Abstract Views
2,557
Rank
130,355
PlumX Metrics