Textual Information and IPO Underpricing: A Machine Learning Approach

The Journal of Financial Data Science, volume 5, issue 2, 2023[10.3905/jfds.2023.1.121]

Posted: 8 Oct 2024

See all articles by Apostolos G. Katsafados

Apostolos G. Katsafados

Athens University of Economics and Business - Department of Accounting and Finance; Bank of Greece

George N. Leledakis

Athens University of Economics and Business, School of Business, Department of Accounting and Finance

Emmanouil G. Pyrgiotakis

University of Essex - Essex Business School

Ion Androutsopoulos

Athens University of Economics and Business

Ilias Chalkidis

University of Copenhagen

Emmanouel Fergadiotis

Athens University of Economics and Business

Date Written: October 10, 2022

Abstract

This study examines the predictive power of textual information from S-1 filings in explaining IPO underpricing. The author's approach differs from previous research, as they utilize several machine learning algorithms to predict whether an IPO will be underpriced or not, as well as the magnitude of the underpricing. Using a sample of 2,481 U.S. IPOs, they find that textual information can effectively complement financial variables in terms of prediction accuracy, since models that use both sources of data produce more accurate estimates. In particular, the model with the best performance using only financial variables achieves 67.5% accuracy while the best model with both textual and financial data appears a substantial improvement (6.1%). Also, the usage of sophisticated machine learning models drives an increase in the predictive accuracy compared to the traditional logistic regression model (2.5%). The authors attribute the findings to the fact that textual information can reduce the ex-ante valuation uncertainty of IPO firms. Finally, they create a portfolio of IPOs based on the out-of-sample machine learning predictions, which remarkably achieves 27.90% average returns. Their portfolio achieves extraordinary abnormal returns in various time dimensions (both in the short and long run), achieving up to 30% better yield than the benchmark.

Keywords: Initial public offerings, First-day returns, Natural language processing, Machine learning

JEL Classification: C63, G12, G14, G40

Suggested Citation

Katsafados, Apostolos G. and Leledakis, George N. and Pyrgiotakis, Emmanouil G. and Androutsopoulos, Ion and Chalkidis, Ilias and Fergadiotis, Emmanouel, Textual Information and IPO Underpricing: A Machine Learning Approach (October 10, 2022). The Journal of Financial Data Science, volume 5, issue 2, 2023[10.3905/jfds.2023.1.121], Available at SSRN: https://ssrn.com/abstract=4963076 or http://dx.doi.org/10.3905/jfds.2023.1.121

Apostolos G. Katsafados

Athens University of Economics and Business - Department of Accounting and Finance ( email )

76 Patission Street
GR-104 34 Athens
Greece

Bank of Greece ( email )

21 E. Venizelos Avenue
GR 102 50 Athens
Greece

George N. Leledakis (Contact Author)

Athens University of Economics and Business, School of Business, Department of Accounting and Finance ( email )

76 Patission Str.
Athens, 104 34
Greece
+30 210 8203 459 (Phone)
+30 210 8228 816 (Fax)

HOME PAGE: http://www.aueb.gr/en/faculty_page/leledakis-georgios

Emmanouil G. Pyrgiotakis

University of Essex - Essex Business School ( email )

Wivenhoe Park
Colchester, CO4 3SQ
United Kingdom

Ion Androutsopoulos

Athens University of Economics and Business ( email )

76 Patission Street
Athens, 104 34
Greece

Ilias Chalkidis

University of Copenhagen ( email )

Universitetsparken 1
Copenhagen, København DK-2100
Denmark

Emmanouel Fergadiotis

Athens University of Economics and Business ( email )

76 Patission Street
Athens, 104 34
Greece

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Abstract Views
90
PlumX Metrics