A comparative study of data mining methods in consumer loans credit scoring management

Wenbing Xiao , Qian Zhao , Qi Fei

Journal of Systems Science and Systems Engineering ›› 2006, Vol. 15 ›› Issue (4) : 419 -435.

PDF
Journal of Systems Science and Systems Engineering ›› 2006, Vol. 15 ›› Issue (4) : 419 -435. DOI: 10.1007/s11518-006-5023-5
Article

A comparative study of data mining methods in consumer loans credit scoring management

Author information +
History +
PDF

Abstract

Credit scoring has become a critical and challenging management science issue as the credit industry has been facing stiffer competition in recent years. Many classification methods have been suggested to tackle this problem in the literature. In this paper, we investigate the performance of various credit scoring models and the corresponding credit risk cost for three real-life credit scoring data sets. Besides the well-known classification algorithms (e.g. linear discriminant analysis, logistic regression, neural networks and k-nearest neighbor), we also investigate the suitability and performance of some recently proposed, advanced data mining techniques such as support vector machines (SVMs), classification and regression tree (CART), and multivariate adaptive regression splines (MARS). The performance is assessed by using the classification accuracy and cost of credit scoring errors. The experiment results show that SVM, MARS, logistic regression and neural networks yield a very good performance. However, CART and MARS’s explanatory capability outperforms the other methods.

Keywords

Data mining / credit scoring / classification and regression tree / support vector machines / multivariate adaptive regression splines / credit-risk evaluation

Cite this article

Download citation ▾
Wenbing Xiao, Qian Zhao, Qi Fei. A comparative study of data mining methods in consumer loans credit scoring management. Journal of Systems Science and Systems Engineering, 2006, 15(4): 419-435 DOI:10.1007/s11518-006-5023-5

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

Altman E.I.. Financial ratios, discriminant analysis and prediction of corporate bankruptcy. Finance, 1968, 23: 589-609.

[2]

Bishop C.M.. Neural Networks for Pattern Recognition, 1995, New York: Oxford University, Press

[3]

Breiman L., Friedman J.H., Olshen R.A., Stone C.J.. Classification and Regression Trees, 1984, Pacific Grove, CA: Wadsworth.

[4]

Chen M.S., Han J., Yu P.S.. Data mining: an overview from a database perspective. IEEE Transactions on Knowledge and Data Engineering, 1996, 8(6): 866-883.

[5]

Chung, C-C. & Lin, C-J. (2001). LIBSVM: a Library for Support Vector Machines, Software. available at http://www.csie.ntu.edu.tw/:_cjlin/libsvm

[6]

Curt H.. The devil’s in the detail: techniques, tools, and applications for database mining and knowledge discovery — Part 1. Intell, Software Strategies, 1995, 6: 1-15.

[7]

Cristianini N., Shawe-Taylor J.. An Introduction to Support Vector Machines, 2000, NewYork: Cambridge Univ, Cambridge

[8]

Desai V.S., Crook J.N., Overstreet G.A.. A comparison of neural networks and linear scoring models in the credit union environment. European Journal of Operational Research, 1996, 95(1): 24-37.

[9]

Dietterich T.G.. Approximate statistical tests for comparing supervised classification learning algorithms. Neural Computation, 1998, 10: 1895-1923.

[10]

Firedman J.H.. Multivariate adaptive regression splines (with discussion). Annals of Statistics, 1991, 19: 1-141.

[11]

Firedman J.H., Roosen C.B.. An introduction to multivariate adaptive regression splines. Statistical Methods in Medical Research, 1995, 14: 197-217.

[12]

Frydman H.E., Altman E.I., Kao D.. Introducing recursive partitioning for financial classification: the case of financial distress. Journal of Finance, 1985, 40(1): 53-65.

[13]

Gunn, S.R. (ed.) (1998). Support Vector Machines for Classification and Regression. Technical Report, University of Southampton

[14]

Henley W.E.. Statistical aspects of credit scoring, 1995, Milton Keynes, UK: The Open University

[15]

Henley W.E., Hand D.J.. K-nearest neighbor classifier for assessing consumer credit risk. Statistician, 1996, 44: 77-95.

[16]

Hosmer D.W., Lemeshow S.. Applied Logistic Regression, 2000, New York: John Wiley & Sons Inc.

[17]

Jo H., Han I., Lee H.. Bankruptcy prediction using case-based reasoning, neural networks, and discriminant analysis. Expert Systems Application, 1997, 13: 97-108.

[18]

Lee T.S., Chen I.F.. A two-stage hybrid credit scoring model using artificial neural networks and multivariate adaptive regression splines. Expert Systems with Applications, 2005, 28: 743-752.

[19]

Mester, L.J. (1997). What’s the point of credit scoring? Business Review — Federal Reserve Bank of Philadelphia. Sept/Oct: 3–16

[20]

Moody J., Darken C.J.. Fast learning in networks of locally tuned processing units. Neural Computation, 1989, 3: 213-25.

[21]

Steinburg D., Colla P.. Classification and Regression Trees, 1997, San Didgo, CA: Salford Systems.

[22]

Thomas L.C.. A survey of credit and behavioral scoring: Forecasting financial risks of lending to customers. International Journal of Forecasting, 2000, 16: 149-172.

[23]

Tam K.Y., Kiang M.Y.. Managerial applications of neural networks: the case of bank failure predictions. Management Science, 1992, 38(7): 926-47.

[24]

Vapnik N.. Statistical Learning Theory, 1999, New York: Springer & Verlag

[25]

West D.. Neural network credit scoring models. Computers & Operations Research, 2000, 27: 1131-1152.

AI Summary AI Mindmap
PDF

95

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/