One-against-all-based Hellinger distance decision tree for multiclass imbalanced learning

Minggang DONG , Ming LIU , Chao JING

Front. Inform. Technol. Electron. Eng ›› 2022, Vol. 23 ›› Issue (2) : 278 -290.

PDF (415KB)
Front. Inform. Technol. Electron. Eng ›› 2022, Vol. 23 ›› Issue (2) : 278 -290. DOI: 10.1631/FITEE.2000417
Orginal Article
Orginal Article

One-against-all-based Hellinger distance decision tree for multiclass imbalanced learning

Author information +
History +
PDF (415KB)

Abstract

Since traditional machine learning methods are sensitive to skewed distribution and do not consider the characteristics in multiclass imbalance problems, the skewed distribution of multiclass data poses a major challenge to machine learning algorithms. To tackle such issues, we propose a new splitting criterion of the decision tree based on the one-against-all-based Hellinger distance (OAHD). Two crucial elements are included in OAHD. First, the one-against-all scheme is integrated into the process of computing the Hellinger distance in OAHD, thereby extending the Hellinger distance decision tree to cope with the multiclass imbalance problem. Second, for the multiclass imbalance problem, the distribution and the number of distinct classes are taken into account, and a modified Gini index is designed. Moreover, we give theoretical proofs for the properties of OAHD, including skew insensitivity and the ability to seek a purer node in the decision tree. Finally, we collect 20 public real-world imbalanced data sets from the Knowledge Extraction based on Evolutionary Learning (KEEL) repository and the University of California, Irvine (UCI) repository. Experimental and statistical results show that OAHD significantly improves the performance compared with the five other well-known decision trees in terms of Precision, F-measure, and multiclass area under the receiver operating characteristic curve (MAUC). Moreover, through statistical analysis, the Friedman and Nemenyi tests are used to prove the advantage of OAHD over the five other decision trees.

Keywords

Decision trees / Multiclass imbalanced learning / Node splitting criterion / Hellinger distance / One-against-all scheme

Cite this article

Download citation ▾
Minggang DONG, Ming LIU, Chao JING. One-against-all-based Hellinger distance decision tree for multiclass imbalanced learning. Front. Inform. Technol. Electron. Eng, 2022, 23(2): 278-290 DOI:10.1631/FITEE.2000417

登录浏览全文

4963

注册一个新账户 忘记密码

References

RIGHTS & PERMISSIONS

Zhejiang University Press

AI Summary AI Mindmap
PDF (415KB)

Supplementary files

FITEE-0278-22008-MGD_suppl_1

FITEE-0278-22008-MGD_suppl_2

594

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/