Improving the Art, Craft and Science of Economic Credit Risk Scorecards Using Random Forests: Why Credit Scorers and Economists Should Use Random Forests
33 Pages Posted: 13 Jun 2011 Last revised: 6 Sep 2011
Date Written: June 9, 2011
This paper outlines an approach to improving credit score modeling using random forests and compares random forests with logistic regression. It is shown that on data sets where variables have multicollinearity and complex interrelationships random forests provide a more scientific approach to analyzing variable importance and achieving optimal predictive accuracy. In addition it is shown that random forests should be used in econometric and credit risk models as they provide a powerful too to assess meaning of variables not available in standard regression models and thus allow for more robust findings.
Keywords: logistic regression, credit scoring, random forest, econometrics, statistics, predictive models, variable selection
Suggested Citation: Suggested Citation