Measuring Readability with Language Predictability: A Large Language Model Approach

63 Pages Posted: 12 Apr 2024

See all articles by Amy Zang

Amy Zang

Hong Kong University of Science and Technology - Department of Accounting

Jiexin Zheng

Hong Kong University of Science & Technology (HKUST)

Rong Zheng

Hong Kong University of Science and Technology - Business School - Department of Information Systems, Business Statistics and Operations Management

Date Written: March 15, 2024

Abstract

We propose a new readability measure, the language predictability score (LPS), to assess the processing costs associated with comprehending corporate disclosure text. Our measure has strong theoretical roots and experimental support in psycholinguistics and cognitive science. Unlike the three most commonly used readability measures (i.e., the Fog index, the Bog index, and file size), the LPS measures readability based on the context of words and the target audience of the text. We use the large language models GPT-2 and BERT—which, after pre-training and fine-tuning, imitate the language ability of investors—to estimate the LPSs of a large sample of management discussion and analysis (MD&As) of annual reports. In validity tests, we show that the LPS identifies incoherent text as less readable and boilerplate content as more readable. In the main tests, we show that, after controlling for firm fixed effects, the LPSs of MD&As are significantly associated with post-filing stock return volatility as well as dispersion and accuracy in analysts’ earnings forecasts and that the LPS outperforms the Fog index, the Bog index, and file size in explaining analysts’ processing costs.

Keywords: textual analysis, large language model, processing cost, corporate disclosure, readability, language predictability

JEL Classification: G18, M41, G14, D83, G24

Suggested Citation

Zang, Amy and Zheng, Jiexin and Zheng, Rong, Measuring Readability with Language Predictability: A Large Language Model Approach (March 15, 2024). HKUST Business School Research Paper No. 2024-153, Available at SSRN: https://ssrn.com/abstract=4764707 or http://dx.doi.org/10.2139/ssrn.4764707

Amy Zang (Contact Author)

Hong Kong University of Science and Technology - Department of Accounting ( email )

LSK Business School Building
Clear Water Bay, Kowloon
Hong Kong
+852-23587561 (Phone)

HOME PAGE: http://www.AmyZang.org

Jiexin Zheng

Hong Kong University of Science & Technology (HKUST)

Rong Zheng

Hong Kong University of Science and Technology - Business School - Department of Information Systems, Business Statistics and Operations Management ( email )

Clear Water Bay
Kowloon
Hong Kong

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
427
Abstract Views
1,137
Rank
144,227
PlumX Metrics