Explainable Machine Learning Models of Consumer Credit Risk
48 Pages Posted: 14 Jan 2022 Last revised: 25 Apr 2022
Date Written: January 12, 2022
Abstract
In this paper, we create machine learning (ML) models to forecast home equity credit risk for individuals using a real-world dataset and demonstrate methods to explain the output of these ML models to make them more accessible to the end-user. We analyze the explainability of these models for various stakeholders: loan companies, regulators, loan applicants, and data scientists, incorporating their different requirements with respect to explanations. For loan companies, we generate explanations for every model prediction of creditworthiness. For regulators, we perform a stress test for extreme scenarios. For loan applicants, we generate diverse counterfactuals to guide them with steps to reverse the model's classification. Finally, for data scientists, we generate simple rules that accurately explain 70-72% of the dataset. Our work is intended to accelerate the adoption of ML techniques in domains that would benefit from explanations of their predictions.
Keywords: Machine Learning; Interpretability; Explainable AI; Credit Lending; Inductive Logic Programming; Optimal Trees; LIME; SHAP; Counterfactual.
JEL Classification: C55, C45, G0, G2, G51, C69
Suggested Citation: Suggested Citation