CMStatistics 2023: Start Registration
View Submission - CFE
A1729
Title: Machine learning for credit risk: Multi-period prediction, frailty correlation, loan portfolios, and tail probabilities Authors:  Fabio Sigrist - ETH Zurich (Switzerland) [presenting]
Nicola Leuenberger - Lucerne Universtity of Applied Sciences (Switzerland)
Abstract: Multi-period cumulative and forward corporate default probabilities are modelled using machine learning methods and a novel hybrid econometric-machine learning model is introduced which combines tree-boosting with a latent frailty model. The latter allows for modelling correlation that is not accounted for by observable predictor variables. It is found that machine learning methods have higher prediction accuracy compared to linear models with the differences being larger for longer prediction horizons. The likely reason for this is the presence of stronger interaction effects for longer prediction horizons compared to short horizons. Among all methods, tree-boosting has the highest prediction accuracy. Further, the frailty component of the newly proposed "LaGaBoost frailty model" is overall large and exhibits strong variation over time. In contrast to prior research, it is found that upper tail predictions of loan portfolio losses of frailty models are not consistently higher throughout time compared to models ignoring frailty correlation, but they show more temporal variation.