The Challenge of Using LLMs to Simulate Human Behavior: A Causal Inference Perspective

27 Pages Posted: 8 Dec 2023 Last revised: 13 Mar 2024

George Gui

Columbia University - Columbia Business School, Marketing

Olivier Toubia

Columbia University - Columbia Business School, Marketing

Date Written: December 1, 2023

Abstract

Large Language Models (LLMs) have demonstrated impressive potential to simulate human behavior. Using a causal inference framework, we empirically and theoretically analyze the challenges of conducting LLM-simulated experiments, and explore potential solutions. In the context of demand estimation, we show that variations in the treatment included in the prompt (e.g., price of focal product) can cause variations in unspecified confounding factors (e.g., price of competitors, historical prices, outside temperature), introducing endogeneity and yielding implausibly flat demand curves. We propose a theoretical framework suggesting this endogeneity issue generalizes to other contexts and won't be fully resolved by merely improving the training data. Unlike real experiments where researchers assign pre-existing units across conditions, LLMs simulate units based on the entire prompt, which includes the description of the treatment. Therefore, due to associations in the training data, the characteristics of individuals and environments simulated by the LLM can be affected by the treatment assignment. We explore two potential solutions. The first specifies all contextual variables that affect both treatment and outcome, which we demonstrate to be challenging for a general-purpose LLM. The second explicitly specifies the source of treatment variation in the prompt given to the LLM (e.g., by informing the LLM that the store is running an experiment). While this approach only allows the estimation of a conditional average treatment effect that depends on the specific experimental design, it provides valuable directional results for exploratory analysis.
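The endogeneity mechanism described above can be illustrated with a toy simulation. This is a minimal sketch under stated assumptions, not the authors' procedure and not an LLM experiment: the linear demand form, the coefficients `b` and `c`, the price ranges, and the names `ols_slope` and `simulate` are all hypothetical choices made for illustration. It compares a setting where an unspecified competitor price moves with the focal price (as it might when simulated units are generated from associations in training data) against one where the focal price is varied independently, as in a real experiment.

```python
import random


def ols_slope(x, y):
    """Slope from a simple least-squares regression of y on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    var = sum((xi - mean_x) ** 2 for xi in x)
    return cov / var


def simulate(confounded, n=20000, seed=0):
    """Estimate the demand slope with respect to the focal price.

    Assumed (hypothetical) demand: q = 10 - b*p + c*p_comp + noise.
    The competitor price p_comp is never included in the regression,
    mirroring a confounder left unspecified in the prompt.
    """
    rng = random.Random(seed)
    b, c = 2.0, 1.5  # assumed own-price and competitor-price effects
    prices, demand = [], []
    for _ in range(n):
        p = rng.uniform(1.0, 5.0)
        if confounded:
            # Competitor price tracks the focal price.
            p_comp = p + rng.gauss(0.0, 0.3)
        else:
            # Focal price varies independently of the competitor price.
            p_comp = 3.0 + rng.gauss(0.0, 0.3)
        q = 10.0 - b * p + c * p_comp + rng.gauss(0.0, 0.5)
        prices.append(p)
        demand.append(q)
    return ols_slope(prices, demand)


slope_confounded = simulate(confounded=True)
slope_randomized = simulate(confounded=False)
print(f"confounded slope: {slope_confounded:.2f}")
print(f"randomized slope: {slope_randomized:.2f}")
```

Under these assumptions the randomized slope should land near the true own-price effect of -2, while the confounded slope should land near -b + c = -0.5, a much flatter curve. The flattening arises only because the omitted competitor price co-moves with the focal price, which is the pattern the abstract attributes to LLM-simulated experiments.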

Keywords: LLM; GPT; Causal Inference; Human Behavior Simulation; Endogeneity

JEL Classification: C8; D8; M3; C5

Suggested Citation

Gui, George and Toubia, Olivier, The Challenge of Using LLMs to Simulate Human Behavior: A Causal Inference Perspective (December 1, 2023). Columbia Business School Research Paper No. 4650172, Available at SSRN: https://ssrn.com/abstract=4650172 or http://dx.doi.org/10.2139/ssrn.4650172

George Gui (Contact Author)

Columbia University - Columbia Business School, Marketing

New York, NY 10027
United States

Olivier Toubia

Columbia University - Columbia Business School, Marketing

New York, NY 10027
United States
