1935. Measuring the Driving Forces of Predictive Performance: Application to Credit Scoring
Invited abstract in session TA-49: Fair and Interpretable Machine Learning, stream Analytics.
Tuesday, 8:30-10:00Room: Parkinson B10
Authors (first author is the speaker)
| 1. | Sébastien Saurin
|
| University of Orléans | |
| 2. | Sullivan Hué
|
| Aix-Marseille University | |
| 3. | Christophe Hurlin
|
| University of Orléans | |
| 4. | Christophe Pérignon
|
| HEC Paris |
Abstract
As they play an increasingly important role in determining access to credit, credit scoring models are under growing scrutiny from banking supervisors and internal model validators. These authorities need to monitor the model performance and identify its key drivers. To facilitate this, we introduce the XPER methodology to decompose a performance metric (e.g., AUC, R2) into specific contributions associated with the various features of a forecasting model. XPER is theoretically grounded on Shapley values and is both model-agnostic and performance metric-agnostic. Furthermore, it can be implemented either at the model level or at the individual level. Using a novel dataset of car loans, we decompose the AUC of a machine-learning model trained to forecast the default probability of loan applicants. We show that a small number of features can explain a surprisingly large part of the model performance. Notably, the features that contribute the most to the predictive performance of the model may not be the ones that contribute the most to individual forecasts (SHAP). Finally, we show how XPER can be used to deal with heterogeneity issues and improve performance.
Keywords
- Machine Learning
- Finance and Banking
Status: accepted
Back to the list of papers