Legalbench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

2023 Conference on Neural Information Processing Systems, Datasets and Benchmarks Track

Osgoode Legal Studies Research Paper No. 4583531

143 Pages Posted: 6 Dec 2023

See all articles by Neel Guha

Neel Guha

Stanford University

Julian Nyarko

Stanford Law School

Daniel E. Ho

Stanford Law School

Christopher Ré

Stanford University

Adam Chilton

University of Chicago - Law School

Aditya Narayana

Maxime Tools

Alex Chohlas-Wood

Stanford University - Department of Management Science & Engineering

Austin Peters

Stanford University

Brandon Waldon

Georgetown University

Daniel Rockmore

Dartmouth College - Department of Mathematics; Dartmouth College - Department of Computer Science

Diego A. Zambrano

Stanford University

Dmitry Talisman

Maxime Tools

Enam Hoque

LawBeta

Faiz Surani

University of California, Santa Barbara

Frank Fagan

South Texas College of Law Houston; EDHEC Augmented Law Institute

Galit Sarfaty

University of Toronto - Faculty of Law

Gregory M. Dickinson

University of Nebraska College of Law; Stanford Law School

Haggai Porat

Harvard University, Harvard Law School; Tel Aviv University School of Economics

Jason Hegland

Stanford Law School

Jessica Wu

Stanford University

Joe Nudell

Stanford University

Joel Niklaus

University of Bern - Faculty of Science; Institute of Computer Science

John Nay

Stanford University - CodeX - Center for Legal Informatics; New York University (NYU)

Jonathan H. Choi

University of Southern California; University of Southern California Gould School of Law

Kevin Tobia

Georgetown University Law Center; Georgetown University - Department of Philosophy

Margaret Hagan

Stanford Legal Design Lab; Stanford Law School

Megan Ma

Stanford University - Stanford Codex Center

Michael A. Livermore

University of Virginia School of Law

Nikon Rasumov-Rahe

Maxime Tools

Nils Holzenberger

Institut Polytechnique de Paris

Noam Kolt

Hebrew University of Jerusalem

Peter Henderson

Princeton University - Center for Information Technology Policy; Princeton University - Princeton School of Public and International Affairs; Princeton University - Program in Law & Public Policy; Princeton University - Department of Computer Science

Sean Rehaag

Centre for Refugee Studies, Refugee Law Lab & Osgoode Hall Law School, York University

Sharad Goel

Harvard University

Shang Gao

Casetext

Spencer Williams

California Western School of Law

Sunny Gandhi

Indiana University Bloomington

Tom Zur

Harvard Law School

Varun Iyer

Independent

Zehua Li

Stanford University

Date Written: September 26, 2023

Abstract

The advent of large language models (LLMs) and their adoption by the legal community has given rise to the question: what types of legal reasoning can LLMs perform? To enable greater study of this question, we present LegalBench: a collaboratively constructed legal reasoning benchmark consisting of 162 tasks covering six different types of legal reasoning. LegalBench was built through an interdisciplinary process, in which we collected tasks designed and hand-crafted by legal professionals. Because these subject matter experts took a leading role in construction, tasks either measure legal reasoning capabilities that are practically useful, or measure reasoning skills that lawyers find interesting. To enable cross-disciplinary conversations about LLMs in the law, we additionally show how popular legal frameworks for describing legal reasoning—which distinguish between its many forms—correspond to LegalBench tasks, thus giving lawyers and LLM developers a common vocabulary. This paper describes LegalBench, presents an empirical evaluation of 20 open-source and commercial LLMs, and illustrates the types of research explorations LegalBench enables.

Keywords: legal practice, law and technology, large language models, artificial intelligence, empirical legal methods, machine learning

Suggested Citation

Guha, Neel and Nyarko, Julian and Ho, Daniel E. and Ré, Christopher and Chilton, Adam and Narayana, Aditya and Chohlas-Wood, Alex and Peters, Austin and Waldon, Brandon and Rockmore, Daniel and Zambrano, Diego and Talisman, Dmitry and Hoque, Enam and Surani, Faiz and Fagan, Frank and Sarfaty, Galit and Dickinson, Gregory M. and Porat, Haggai and Hegland, Jason and Wu, Jessica and Nudell, Joe and Niklaus, Joel and Nay, John and Choi, Jonathan H. and Tobia, Kevin and Hagan, Margaret and Ma, Megan and Livermore, Michael A. and Rasumov-Rahe, Nikon and Holzenberger, Nils and Kolt, Noam and Henderson, Peter and Rehaag, Sean and Goel, Sharad and Gao, Shang and Williams, Spencer and Gandhi, Sunny and Zur, Tom and Iyer, Varun and Li, Zehua, Legalbench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models (September 26, 2023). 2023 Conference on Neural Information Processing Systems, Datasets and Benchmarks Track, Osgoode Legal Studies Research Paper No. 4583531, Available at SSRN: https://ssrn.com/abstract=4583531 or http://dx.doi.org/10.2139/ssrn.4583531

Neel Guha (Contact Author)

Stanford University ( email )

Stanford, CA
United States

Julian Nyarko

Stanford Law School ( email )

559 Nathan Abbott Way
Stanford, CA 94305
United States

Daniel E. Ho

Stanford Law School ( email )

559 Nathan Abbott Way
Stanford, CA 94305-8610
United States
650-723-9560 (Phone)

HOME PAGE: http://dho.stanford.edu

Christopher Ré

Stanford University ( email )

Stanford, CA 94305
United States

Adam Chilton

University of Chicago - Law School ( email )

1111 E. 60th St.
Chicago, IL 60637
United States

HOME PAGE: http://www.adamchilton.org

Aditya Narayana

Maxime Tools ( email )

Alex Chohlas-Wood

Stanford University - Department of Management Science & Engineering ( email )

473 Via Ortega
Stanford, CA 94305-9025
United States

Austin Peters

Stanford University

Brandon Waldon

Georgetown University ( email )

Washington, DC 20057
United States

Daniel Rockmore

Dartmouth College - Department of Mathematics ( email )

United States

Dartmouth College - Department of Computer Science ( email )

United States

Diego Zambrano

Stanford University ( email )

Stanford, CA 94305
United States

Dmitry Talisman

Maxime Tools

Enam Hoque

LawBeta

Faiz Surani

University of California, Santa Barbara ( email )

South Hall 5504
Santa Barbara, CA 93106
United States

HOME PAGE: http://faizsurani.com

Frank Fagan

South Texas College of Law Houston

1303 San Jacinto Street
Houston, TX 77002
United States

EDHEC Augmented Law Institute

Roubaix, 59057
France

Galit Sarfaty

University of Toronto - Faculty of Law ( email )

78 and 84 Queen's Park
Toronto, Ontario M5S 2C5
Canada

Gregory M. Dickinson

University of Nebraska College of Law ( email )

PO Box 830902
Lincoln, NE 68583-0902
United States

Stanford Law School ( email )

559 Nathan Abbott Way
Stanford, CA 94305-8610
United States

Haggai Porat

Harvard University, Harvard Law School ( email )

Tel Aviv University School of Economics ( email )

Tel Aviv
Israel

Jason Hegland

Stanford Law School ( email )

559 Nathan Abbott Way
Stanford, CA 94305-8610
United States

Jessica Wu

Stanford University ( email )

Stanford, CA 94305
United States

Joe Nudell

Stanford University ( email )

Joel Niklaus

University of Bern - Faculty of Science ( email )

Bern, Bern
Switzerland

Institute of Computer Science ( email )

Switzerland

John Nay

Stanford University - CodeX - Center for Legal Informatics ( email )

HOME PAGE: http://law.stanford.edu/directory/john-nay/

New York University (NYU) ( email )

Bobst Library, E-resource Acquisitions
20 Cooper Square 3rd Floor
New York, NY 10003-711
United States

HOME PAGE: http://nyu.edu

Jonathan H. Choi

University of Southern California ( email )

2250 Alcazar Street
Los Angeles, CA 90089
United States

University of Southern California Gould School of Law ( email )

699 Exposition Blvd.
Los Angeles, CA 90089
United States

Kevin Tobia

Georgetown University Law Center ( email )

600 New Jersey Avenue, NW
Washington, DC 20001
United States

HOME PAGE: http://www.law.georgetown.edu/faculty/kevin-tobia/

Georgetown University - Department of Philosophy

37th and O Streets, N.W.
Washington, DC 20007
United States

Margaret Hagan

Stanford Legal Design Lab ( email )

559 Nathan Abbott Way
Stanford, CA 94305
United States

HOME PAGE: http://margarethagan.com

Stanford Law School ( email )

559 Nathan Abbott Way
Stanford, CA 94305-8610
United States

Megan Ma

Stanford University - Stanford Codex Center ( email )

559 Nathan Abbott Way
Stanford, CA 94305-8610
United States

Michael A. Livermore

University of Virginia School of Law ( email )

Nikon Rasumov-Rahe

Maxime Tools

Nils Holzenberger

Institut Polytechnique de Paris ( email )

Noam Kolt

Hebrew University of Jerusalem

HOME PAGE: http://www.noamkolt.com/

Peter Henderson

Princeton University - Center for Information Technology Policy ( email )

C231A E-Quad
Olden Street
Princeton, NJ 08540
United States

Princeton University - Princeton School of Public and International Affairs ( email )

Princeton University
Princeton, NJ 08544-1021
United States

Princeton University - Program in Law & Public Policy ( email )

Wallace Hall
Princeton, NJ 08544
United States

Princeton University - Department of Computer Science ( email )

35 Olden Street
Princeton, NJ 08540
United States

Sean Rehaag

Centre for Refugee Studies, Refugee Law Lab & Osgoode Hall Law School, York University ( email )

4700 Keele Street
Toronto, Ontario M3J 1P3
Canada

HOME PAGE: http://www.osgoode.yorku.ca/rehaag-sean/

Sharad Goel

Harvard University ( email )

1875 Cambridge Street
Cambridge, MA 02138
United States

Shang Gao

Casetext ( email )

United States

HOME PAGE: http://casetext.com

Spencer Williams

California Western School of Law ( email )

225 Cedar Street
San Diego, CA 92101
United States

Sunny Gandhi

Indiana University Bloomington ( email )

Dept of Biology
100 South Indiana Ave.
Bloomington, IN 47405
United States

Tom Zur

Harvard Law School ( email )

Varun Iyer

Independent

Zehua Li

Stanford University ( email )

Stanford, CA 94305
United States

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
304
Abstract Views
1,392
Rank
203,285
PlumX Metrics