ChatGPT May Pass the Bar Exam Soon, but Has a Long Way to Go for the LexGLUE Benchmark

5 Pages Posted: 15 Mar 2023

Date Written: March 10, 2023

Abstract

Following the hype around OpenAI's ChatGPT conversational agent, the last straw in the recent development of Large Language Models (LLMs) that demonstrate emergent unprecedented zero-shot capabilities, we audit the latest OpenAI's GPT-3.5 model, 'gpt-3.5-turbo', the first available ChatGPT model, in the LexGLUE benchmark in a zero-shot fashion providing examples in a templated instruction-following format. The results indicate that ChatGPT achieves an average micro-F1 score of 49.0% across LexGLUE tasks, surpassing the baseline guessing rates. Notably, the model performs exceptionally well in some datasets, achieving micro-F1 scores of 62.8% and 70.1% in the ECtHR B and LEDGAR datasets, respectively. The code base and model predictions are available at https://github.com/coastalcph/zeroshot_lexglue.

Keywords: LegalNLP, LexGLUE, ChatGPT

Suggested Citation

Chalkidis, Ilias, ChatGPT May Pass the Bar Exam Soon, but Has a Long Way to Go for the LexGLUE Benchmark (March 10, 2023). Available at SSRN: https://ssrn.com/abstract=4385460 or http://dx.doi.org/10.2139/ssrn.4385460

Ilias Chalkidis (Contact Author)

University of Copenhagen ( email )

Universitetsparken 1
Copenhagen, København DK-2100
Denmark

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
303
Abstract Views
1,334
Rank
210,314
PlumX Metrics