Abstract

http://ssrn.com/abstract=2474438
 


 



Ensembles of Overfit and Overconfident Forecasts


Yael Grushka-Cockayne


University of Virginia - Darden School of Business

Victor Richmond R. Jose


Georgetown University - McDonough School of Business

Kenneth C. Lichtendahl Jr.


University of Virginia - Darden School of Business

August 18, 2015

Darden Business School Working Paper No. 2474438

Abstract:     
Firms today average forecasts collected from multiple experts and models. Because of cognitive biases, strategic incentives, or the structure of machine-learning algorithms, these forecasts are often overfit to sample data and are overconfident. Little is known about the challenges associated with aggregating such forecasts. We introduce a theoretical model to examine the combined effect of overfitting and overconfidence on the average forecast. Their combined effect is that the mean and median probability forecasts are poorly calibrated with hit rates of their prediction intervals too high and too low, respectively. Consequently, we prescribe the use of a trimmed average, or trimmed opinion pool, to achieve better calibration. We identify the random forest, a leading machine-learning algorithm that pools hundreds of overfit and overconfident regression trees, as an ideal environment for trimming probabilities. Using several known datasets, we demonstrate that trimmed ensembles can significantly improve the random forest's predictive accuracy.

Number of Pages in PDF File: 37

Keywords: wisdom of crowds; base-rate neglect; linear opinion pool; trimmed opinion pool; hit rate; calibration; random forest.

JEL Classification: C10, C53, E17


Open PDF in Browser Download This Paper

Date posted: August 3, 2014 ; Last revised: August 19, 2015

Suggested Citation

Grushka-Cockayne, Yael and Jose, Victor Richmond R. and Lichtendahl, Kenneth C., Ensembles of Overfit and Overconfident Forecasts (August 18, 2015). Darden Business School Working Paper No. 2474438. Available at SSRN: http://ssrn.com/abstract=2474438 or http://dx.doi.org/10.2139/ssrn.2474438

Contact Information

Yael Grushka-Cockayne (Contact Author)
University of Virginia (UVA) - Darden School of Business ( email )
P.O. Box 6550
Charlottesville, VA 22906-6550
United States
Victor Richmond R. Jose
Georgetown University - McDonough School of Business ( email )
544 Hariri Bldg
37th and O Sts NW
Washington, DC 20057
United States
Kenneth C. Lichtendahl Jr.
University of Virginia - Darden School of Business ( email )
P.O. Box 6550
Charlottesville, VA 22906-6550
United States
Feedback to SSRN


Paper statistics
Abstract Views: 590
Downloads: 125
Download Rank: 151,157

© 2015 Social Science Electronic Publishing, Inc. All Rights Reserved.  FAQ   Terms of Use   Privacy Policy   Copyright   Contact Us
This page was processed by apollo2 in 1.329 seconds