Robust Simplex-Based Multinomial Logistic Regression
Shunqin Zhang , Sanguo Zhang , Hang Li , Sheng Fu
Communications in Mathematics and Statistics ›› : 1 -37.
The multicategory logistic regression (MLR) is one of the most popular large-margin classifiers in machine learning. Although it has been successfully applied in various fields, some intrinsic problems still remain. In particular, the existing MLR models suffer from the over-specification of decision functions or the choice of reference category and are very sensitive to the potential outliers due to the unbounded loss function. In this article, utilizing the prevalent simplex-based framework, we propose three robust MLR models based on truncated loss function, weighted learning and label-adjusted learning, respectively. Moreover, the first two approaches achieve robustness by removing potential outliers and the latter obtains robustness by adaptively relabeling outliers. Theoretical properties including Fisher consistency, probability estimation and breakdown point are well established. Intensive numerical studies demonstrate that the proposed methods are very competitive for problems with potential outliers.
Fisher consistency / Multicategory classification / Probability estimation / Robustness / Simplex-based framework / 62H30 / 62J12 / 68Q32
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
Boser, B.E., Guyon, I.M., Vapnik, V.N.: A training algorithm for optimal margin classifiers. In: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 144–152 (1992) |
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
|
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
|
| [30] |
|
| [31] |
Mroueh, Y., Poggio, T., Rosasco, L., Slotine, J.-J.: Multiclass learning with simplex coding. Adv. Neural Inf. Process. Syst. 25 (2012) |
| [32] |
|
| [33] |
|
| [34] |
|
| [35] |
|
| [36] |
R Core Team: R: a language and environment for statistical computing (2022) |
| [37] |
|
| [38] |
|
| [39] |
|
| [40] |
|
| [41] |
|
| [42] |
|
| [43] |
Wahba, G.: Support vector machines, reproducing kernel Hilbert spaces and the randomized GACV. In: Advances in Kernel Methods-Support Vector Learning, pp. 69–87 (1999) |
| [44] |
|
| [45] |
|
| [46] |
|
| [47] |
|
| [48] |
|
| [49] |
|
| [50] |
|
| [51] |
|
| [52] |
|
| [53] |
|
| [54] |
|
| [55] |
|
| [56] |
|
| [57] |
|
| [58] |
|
| [59] |
|
| [60] |
|
School of Mathematical Sciences, University of Science and Technology of China and Springer-Verlag GmbH Germany, part of Springer Nature
/
| 〈 |
|
〉 |