Model-Free Feature Screening for Ultrahigh-Dimensional Multiclass Classification via a Likelihood Ratio-Based Measure of Dependence
Fei Ye , Weidong Ma , Jingsong Xiao , Ying Yang
Communications in Mathematics and Statistics ›› : 1 -42.
This article proposes a new likelihood ratio-based index (LR index for short), to measure the dependence between a categorical response variable and a continuous predictor variable. The LR index is nonnegative and is zero if and only if the variables are independent. We propose an estimate of the index, develop a novel independence test and derive the asymptotic null distribution. Next, based on the LR index, a feature screening procedure (LR-SIS for short) is developed for multiclass classification with ultrahigh-dimensional predictors. LR-SIS is model-free and robust to the heavy-tailed distribution of predictors and outliers. The sure screening property of LR-SIS is established allowing the number of response classes to be diverging. The finite sample performance of the proposed LR index in both independence testing and feature screening is demonstrated by comprehensive simulation studies. Application of the LR-SIS is also illustrated on a real data set.
Feature screening / Test of independence / Multiclass classification / Ultrahigh dimensionality / 62H20 / 62H30 / 62F07
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
Ni, L., Fang, F., Shao, J.: Feature screening for ultrahigh dimensional categorical data with covariates missing at random. Comput. Stat. Data Anal. 142, 106824-15 (2020). https://doi.org/10.1016/j.csda.2019.106824 |
| [24] |
Ni, L., Fang, F., Shao, J.: Feature screening for ultrahigh dimensional categorical data with covariates missing at random. Comput. Statist. Data Anal. 142, 106824-15 (2020). https://doi.org/10.1016/j.csda.2019.106824 |
| [25] |
|
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
|
| [30] |
|
| [31] |
|
| [32] |
|
| [33] |
|
| [34] |
|
| [35] |
|
| [36] |
|
| [37] |
|
| [38] |
|
| [39] |
|
School of Mathematical Sciences, University of Science and Technology of China and Springer-Verlag GmbH Germany, part of Springer Nature
/
| 〈 |
|
〉 |