We consider the estimation of credit scores by means of semiparametric logit models. In credit scoring, the fitted rating score shall not only provide an optimal classification result but serves also as a modular component of a (typically quite complex) rating system. This means in particular that a rating score should be given by a linearly weighted sum of rating factors. That way the rating procedure can be easily interpreted and understood also by non-statisticians.
For that reason the logit model or the logistic regression approach is one of the most popular models for estimating credit rating scores. The first step in fitting the rating model is usually a nonlinear transformation of the raw variables in order to obtain a linear predictor (rating score) in the final estimation. As an alternative to this two-step approach, generalized additive models (GAM) would allow for a simultaneous estimation of both the initial transformation and final logit fit. In this study we compare GAM estimating approaches with a focus on the specific structure of credit data: small default rates, mixed discrete and continuous explanatory variables, possibly nonlinear dependencies between the regressors.
Keywords: credit scoring; generalized additive models; nonparametric function estimation
Biography: Marlene Müller received a PhD in Mathematical Statistics (1993) and a habilitation in Statistics and Econometrics (2000) from Humboldt University Berlin (Germany). In 2003 she joined the Fraunhofer Institute for Industrial Mathematics (ITWM) Kaiserslautern as a researcher and consultant in the fields of credit risk and statistics. From 2006 till 2010 she was the head of the Financial Mathematics department at the Fraunhofer ITWM. Since 2010 she is professor for applied statistics at Beuth University of Applied Sciences in Berlin (Germany).