Significance Test for Multinomial Naive Bayes Classifier with Ultra-high-dimensional Binary Features
Baiguo An , Juan Zhang , Beibei Zhang , Wenliang Pan
Communications in Mathematics and Statistics ›› : 1 -26.
We developed a significance test method for multinomial naive Bayes classifier with ultra-high-dimensional binary features. A novel test statistic with asymptotic standard Gaussian null distribution is proposed. Under very mild assumptions, the proposed test statistic has powers that tend to 1 as the sample size tends to infinity. Then, a sequential test process is developed to perform variable screening. We applied the proposed methods to lots of numerical studies including simulated examples and two real text data classification examples. The results show that our methods have good finite sample performances.
Binary feature / Multinomial naive bayes / Significance test / Text classification / Ultra-high dimensional / 62H15 / 62H30
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press, http://www.deeplearningbook.org (2016) |
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
School of Mathematical Sciences, University of Science and Technology of China and Springer-Verlag GmbH Germany, part of Springer Nature
/
| 〈 |
|
〉 |