Trillion Dollar Words: A New Financial Dataset, Task & Market Analysis
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL) 2023
Georgia Tech Scheller College of Business Research Paper No. 4447632
14 Pages Posted: 26 May 2023
Date Written: May 13, 2023
Abstract
Monetary policy pronouncements by Federal Open Market Committee (FOMC) are a major driver of financial market returns. We construct the largest tokenized and annotated dataset of FOMC speeches, meeting minutes, and press conference transcripts in order to understand how monetary policy influences financial markets. In this study, we develop a novel task of hawkish-dovish classification and benchmark various pre-trained language models on the proposed dataset. Using the best-performing model (RoBERTa-large), we construct a measure of monetary policy stance for the FOMC document release days. To evaluate the constructed measure, we study its impact on the treasury market, stock market, and macroeconomic indicators.
Keywords: FOMC, ChatGPT, LLMs, Dataset
Suggested Citation: Suggested Citation