The effect of training set selection when predicting defaulting small and medium-sized enterprises with unbalanced data

TORELLI, Nicola
2013

Abstract

We focus on classification methods for separating defaulting small and medium-sized enterprises from nondefaulting ones. A typical problem in this setting is that the proportion of defaulting firms is very close to zero, which leads to class imbalance. Moreover, the classification may be biased because models are often estimated on samples of large corporations that are not randomly selected. We investigate how different criteria for sample selection may affect the accuracy of the classification and how this problem is strongly related to class imbalance.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: http://hdl.handle.net/11368/2735498