Automated Claim Matching with Large Language Models: Empowering Fact-Checkers in the Fight Against Misinformation

15 Pages Posted: 25 Nov 2023

See all articles by Eun Cheol Choi

Eun Cheol Choi

University of Southern California, Annenberg School for Communication and Journalism

Emilio Ferrara

University of Southern California - Information Sciences Institute

Multiple version iconThere are 2 versions of this paper

Date Written: October 26, 2023

Abstract

In today's digital era, the rapid spread of misinformation poses threats to public well-being and societal trust. As online misinformation proliferates, manual verification by fact checkers becomes increasingly challenging. We introduce FACT-GPT (Fact-checking Augmentation with Claim matching Task-oriented Generative Pre-trained Transformer), a framework designed to automate the claim matching phase of fact-checking using Large Language Models (LLMs). This framework identifies new social media content that either supports or contradicts claims previously debunked by fact-checkers. Our approach employs GPT-4 to generate a labeled dataset consisting of simulated social media posts. This data set serves as a training ground for fine-tuning more specialized LLMs. We evaluated FACT-GPT on an extensive dataset of social media content related to public health. The results indicate that our fine-tuned LLMs rival the performance of larger pre-trained LLMs in claim matching tasks, aligning closely with human annotations. This study achieves three key milestones: it provides an automated framework for enhanced fact-checking; demonstrates the potential of LLMs to complement human expertise; offers public resources, including datasets and models, to further research and applications in the fact-checking domain.

Suggested Citation

Choi, Eun Cheol and Ferrara, Emilio, Automated Claim Matching with Large Language Models: Empowering Fact-Checkers in the Fight Against Misinformation (October 26, 2023). Available at SSRN: https://ssrn.com/abstract=4614239 or http://dx.doi.org/10.2139/ssrn.4614239

Eun Cheol Choi

University of Southern California, Annenberg School for Communication and Journalism ( email )

3502 Watt Way, Suite 304
Los Angeles, CA 90089
United States

Emilio Ferrara (Contact Author)

University of Southern California - Information Sciences Institute ( email )

United States

HOME PAGE: http://emilio.ferrara.name

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
70
Abstract Views
436
Rank
562,508
PlumX Metrics