Predicting Litigation Risk via Machine Learning
45 Pages Posted: 11 Dec 2020
Date Written: December 1, 2020
Abstract
We demonstrate the value of machine learning in accounting through a detailed examination of litigation risk, an important and frequently used estimate in the literature. We evaluate a comprehensive set of twelve machine learning techniques and benchmark their performance against the logistic regression models in Kim and Skinner (2012). These models improve the prediction of litigation risk, with hourglass-shaped and convolutional neural networks the most effective. The improvements are substantial, and are driven by increased precision, the most salient attribute of litigation estimates in the accounting literature. We also produce firm-year litigation risk estimates for use in future research from a convolutional neural network model that uses recursive feature elimination on a pool of 68 possible parameters. Overall, our results suggest that the joint consideration of economically-meaningful predictors and machine learning techniques maximize the effectiveness of accounting estimates.
Keywords: Machine Learning, Securities Class Action Lawsuits, Neural Network
JEL Classification: G15, G18, M41
Suggested Citation: Suggested Citation