Combining classifiers for credit risk prediction
Bhekisipho Twala
Journal of Systems Science and Systems Engineering ›› 2009, Vol. 18 ›› Issue (3) : 292 -311.
Combining classifiers for credit risk prediction
Credit risk prediction models seek to predict quality factors such as whether an individual will default (bad applicant) on a loan or not (good applicant). This can be treated as a kind of machine learning (ML) problem. Recently, the use of ML algorithms has proven to be of great practical value in solving a variety of risk problems including credit risk prediction. One of the most active areas of recent research in ML has been the use of ensemble (combining) classifiers. Research indicates that ensemble individual classifiers lead to a significant improvement in classification performance by having them vote for the most popular class. This paper explores the predicted behaviour of five classifiers for different types of noise in terms of credit risk prediction accuracy, and how could such accuracy be improved by using pairs of classifier ensembles. Benchmarking results on five credit datasets and comparison with the performance of each individual classifier on predictive accuracy at various attribute noise levels are presented. The experimental evaluation shows that the ensemble of classifiers technique has the potential to improve prediction accuracy.
Supervised learning / statistical pattern recognition / ensemble / credit risk / prediction
| [1] |
|
| [2] |
|
| [3] |
Albright, H.T. (1994). Construction of polynomial classifier for consumer loan applications using genetic algorithms. Department of Systems and Engineering, University of Virginia, Working Paper |
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
BIS. (2004). International convergence of capital measurement and capital standards: a revised framework. Basel Committee of Banking Supervision, Bank for International Settlements |
| [8] |
Blake, C., Keogh, E. & Merz, C.J. (1999). UCI respiratory of machine learning databases. University of California, Irvine, Department of Information and Computer Sciences |
| [9] |
|
| [10] |
|
| [11] |
Breiman, L., Friedman, J., Olshen, R. & Stone, C. (1984). Classification and Regression Trees. Wadsworth |
| [12] |
Carter, C. & Catlett, J. (1987). Assessing credit card applications using machine learning. IEEE Expert, fall: 71–79 |
| [13] |
|
| [14] |
Coffman, J.Y. (1986). The proper role of tree analysis in forecasting the risk behaviour of borrowers. MDS reports 3, 4, 7 and 9. Management Decision Systems, Atlanta |
| [15] |
|
| [16] |
|
| [17] |
Desai, V.S., Convay, J.N. & Overstreet, G.A. Credit scoring models in credit-union environment using neural networks and genetic algorithms. IMA Journal of Mathematics Applied in Business and Industry, 8: 323–346 |
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
Hand, D.J. (2008). Private communication |
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
Ho, T. K. (1995). Random decision forests. In: Holmes, C.C., Adams, N.M. (eds.), Proceedings of the 3rd International Conference on Document Analysis and Recognition, 278–282 |
| [30] |
|
| [31] |
Islam, M.J., Wu, Q.M.J., Ahmadi, M. & Sid-Ahmed, M.A. (2007). Investigating the performance of naïve Bayes classifiers and k-nearest neighbor classifiers. In: International Conference on Convergence Information Technology, 1541–1546, November, 21-23, 2007 |
| [32] |
|
| [33] |
Khalik, A. & El-Sheshai, K.M. (1980). Information choice and utilization in an experiment of default prediction. Journal of Accounting Research, autumn: 325–342 |
| [34] |
|
| [35] |
|
| [36] |
Kononenko, I. (1991). Semi-naïve Bayesian classifier. In: Proceedings of European conference on Artificial Intelligence, 206–219 |
| [37] |
|
| [38] |
|
| [39] |
|
| [40] |
|
| [41] |
MINITAB. Statistical software for windows 9.0, 2002, PA, USA: MINITAB, Inc. |
| [42] |
Neumann, D.E. (2002). An enhanced neural network technique for software risk analysis. IEEE Transactions on Software Engineering, 904–912 |
| [43] |
|
| [44] |
|
| [45] |
|
| [46] |
|
| [47] |
|
| [48] |
|
| [49] |
|
| [50] |
|
| [51] |
|
| [52] |
|
| [53] |
|
| [54] |
|
| [55] |
|
| [56] |
Yobas, M.B., Crook, J.N. & Ross, P. (1997). Credit scoring using neural and evolutionary techniques. Credit Research Centre, University of Edinburgh, Working Paper 7/2 |
/
| 〈 |
|
〉 |