A New Classifier for Imbalanced Data Based on a Generalized Density Ratio Model

Junjun Li, Wenquan Cui

Communications in Mathematics and Statistics, 2023, Vol. 11, Issue 2: 369-401. DOI: 10.1007/s40304-021-00254-7

Abstract

Achieving a higher true positive rate while reducing the false positive rate is a persistent challenge in imbalanced learning. This work combines the penalized empirical likelihood method, a lower bound algorithm, and the Nyström method, and applies them together with the kernel method to the density ratio model. The resulting classifier, the density ratio classifier (DRC), combines kernelization, regularization, efficient implementation, and threshold moving, all of which are critical to making DRC effective on difficult imbalance problems. Compared with other methods, DRC is widely applicable and simple to use, requiring no additional imbalance-handling techniques. The convergence rate of the estimator of the log density ratio is also discussed. Numerical results show that DRC outperforms the compared methods in AUC and G-mean score.
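
To make the threshold moving and G-mean terms in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes a classifier that outputs a real-valued score per sample (for example, an estimated log density ratio) and binary labels coded as 0 and 1. The function names `g_mean` and `best_threshold` and the toy data are hypothetical and chosen only for illustration.

```python
import math

def g_mean(y_true, y_pred):
    """Geometric mean of the true positive rate and the true negative rate."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    n_pos = sum(1 for t in y_true if t == 1)
    n_neg = len(y_true) - n_pos  # assumes labels are only 0 or 1
    if n_pos == 0 or n_neg == 0:
        return 0.0
    return math.sqrt((tp / n_pos) * (tn / n_neg))

def best_threshold(y_true, scores):
    """Threshold moving: try each observed score as the cutoff and
    keep the first one that gives the highest G-mean."""
    best_t, best_gm = None, -1.0
    for t in sorted(set(scores)):
        pred = [1 if s >= t else 0 for s in scores]
        gm = g_mean(y_true, pred)
        if gm > best_gm:
            best_t, best_gm = t, gm
    return best_t, best_gm

# Made-up scores standing in for estimated log density ratios:
# 8 negatives and 2 positives.
y = [0, 0, 0, 0, 0, 0, 0, 0, 1, 1]
s = [-3.0, -2.5, -2.0, -1.8, -1.5, -1.2, -1.0, -0.2, -0.5, 0.4]

default_gm = g_mean(y, [1 if v >= 0 else 0 for v in s])  # cutoff fixed at 0
moved_t, moved_gm = best_threshold(y, s)                 # cutoff chosen by G-mean
```

On this toy data, a fixed cutoff of 0 detects only one of the two positives, while the moved cutoff recovers both at the cost of one false positive, which raises the G-mean. In practice the cutoff would be tuned on held-out data rather than on the evaluation set.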

Keywords

Classifier / Density ratio model / Imbalance problems / Kernel method / ROC curve

Cite this article

Junjun Li, Wenquan Cui. A New Classifier for Imbalanced Data Based on a Generalized Density Ratio Model. Communications in Mathematics and Statistics, 2023, 11(2): 369-401. DOI: 10.1007/s40304-021-00254-7

Funding

Innovative Research Group Project of the National Natural Science Foundation of China (71873128)
