Predicting status of Chinese listed companies based on features selected by penalized regression

Rui Ma , Honghao Zhao , Ligang Zhou

Journal of Systems Science and Systems Engineering ›› 2017, Vol. 26 ›› Issue (4) : 475 -486.

PDF
Journal of Systems Science and Systems Engineering ›› 2017, Vol. 26 ›› Issue (4) : 475 -486. DOI: 10.1007/s11518-017-5349-1
Article

Predicting status of Chinese listed companies based on features selected by penalized regression

Author information +
History +
PDF

Abstract

China’s companies have attracted much attention due to the development of stock market in China. The listing status of listed Chinese companies becomes an important indicator which implies the potential risk of a stock. Thus predicting the status of listed Chinese companies is obviously crucial for stockholders and investors when they make further decisions. According to the four possible listing statuses for Chinese companies, researchers formulate the above issue as a classification problem which is typical in data mining area. Plenty of classification techniques have been implemented to predict the status of the listing Chinese companies based on their financial factors. Usually, there are more than 150 financial factors for each of the listed companies, and feature selection is needed before the implementation of classification methods. In the literature, researcher used t-test with variance inflation factor (VIF) analysis to select relevant factors. However, such method can not be applied in the high dimensional case. In this paper, we apply the idea of penalized regression to select the interested factors based on a logistic regression model, and then apply popular classification methods to predict the companies’ statuses. Our results show that the proposed method can find more representative factors and improves the prediction accuracy of the classification methods.

Keywords

Classification / data mining / feature selection / penalized regression

Cite this article

Download citation ▾
Rui Ma, Honghao Zhao, Ligang Zhou. Predicting status of Chinese listed companies based on features selected by penalized regression. Journal of Systems Science and Systems Engineering, 2017, 26(4): 475-486 DOI:10.1007/s11518-017-5349-1

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

Altman E.I.. Financial ratios, discriminant analysis and the prediction of corporate bankruptcy. The Journal of Finance, 1968, 23(4): 589-609.

[2]

Ding Y., Song X., Zen Y.. Forecasting financial condition of Chinese listed companies basedonsupport vector machine. Expert Systems with Applications, 2008, 34(4): 3081-3089.

[3]

Fiedman J., Hastie T., Hofling H., Tibshirani R.. Path wise coordinate optimization. Annals of Applied Statistics, 2007, 1(2): 302-332.

[4]

Fiedman J., Hastie T., Hofling H., Tibshirani R.. Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software, 2010, 33: 1-22.

[5]

Gavin C.C., Talbot L.C.. Gene selection in cancer classification using sparse logistic regression with Bayesian regularization. Bioinformatics, 2006, 22(19): 2348-2355.

[6]

Geng R.B., Bose I., Chen X.. Prediction offinancial distress: anempirical study of listed chinese companies using data mining. European Journal of Operations Research, 2014, 241(1): 236-247.

[7]

Huang C., Dai C., Guo M.. A hybrid approach using two-level DEA for financial failure prediction and integrated SE-DEA and GCA for indicators selection. Applied Mathematics and Computation, 2015, 251: 431-441.

[8]

Li S.J., Wang S.. A financial early warning logit model and its efficiency verification approach. Knowledge based Systems, 2014, 70: 78-87.

[9]

Li Z.Y., Crook J., Andreeva G.. Chinese companies distress prediction: an application of data envelopment analysis. Journal of the Operational Research Society, 2014, 65(3): 466-479.

[10]

Liang Y., Liu C., Luan X.Z., Leung K.S., Chan T.M., Xu Z.B., Zhang H.. Sparse logistic regression with a L1=2 penalty for gene selection in cancer classification. BMC Bioinformatics, 2013, 14: 198.

[11]

Shevade S.K., Keerthi S.S.. A simple and efficient algorithm for gene selection using sparse logistic regression. Bioinformatics, 2003, 19(17): 2246-2253.

[12]

Shumway T.. Forecasting bankruptcy more accurately: a simple hazard model. Journal of Business, 2001, 74(1): 101-124.

[13]

Sun J., Shang Z.M., Li H.. Im balanceriented SVM methods for financial distress prediction: a comparative study among the new SBSVM-ensemble method and traditional methods. Journal of the Operational Research Society, 2014, 65(12): 1905-1919.

[14]

Tibshirani R.. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society, Series B, 1996, 58(1): 267-288.

[15]

Xiao Z., Yang X.L., Pang Y., Dang X.. The prediction for listed companies financial distress by using multiple prediction methods with rough set and dempster-shafer evidence theory. Knowledge based Systems, 2012, 6: 196-206.

[16]

Zhang L., Altman E.I., Yen J.. Corporate financial distress diagnosis model and application in credit rating for listing firms in China. Frontiers of Computer Science in China, 2010, 4(2): 220-236.

[17]

Zhou L.G., Tam K.P., Fujita. Predictiing the listing status of Chinese Listed companies with multi-class classification models. Information Sciences, 2016, 328: 222-236.

[18]

Zou H., Hastie T.. Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B, 2005, 67(2): 301-320.

AI Summary AI Mindmap
PDF

131

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/