Textual Information and IPO Underpricing: A Machine Learning Approach
49 Pages Posted: 29 Dec 2020
Date Written: October 27, 2020
Abstract
This study examines the predictive power of textual information from S-1 filings in explaining IPO underpricing. Our empirical approach differs from previous research, as we utilize several machine learning algorithms to predict whether an IPO will be underpriced, or not. We analyze a large sample of 2,481 U.S. IPOs from 1997 to 2016, and we find that textual information can effectively complement traditional financial variables in terms of prediction accuracy. In fact, models that use both textual data and financial variables as inputs have superior performance compared to models using a single type of input. We attribute our findings to the fact that textual information can reduce the ex-ante valuation uncertainty of IPO firms, thus leading to more accurate estimates.
Keywords: Initial Public Offerings, First-Day Returns, Machine Learning, Natural Language Processing
JEL Classification: C63, G12, G14, G40
Suggested Citation: Suggested Citation