PDF
Abstract
In one-sample mean testing for high-dimensional data, existing tests, e.g., Chen and Qin (Ann Stat 38(2):808–835, 2010) and Wang et al. (J Am Stat Assoc 110(512):1658–1669, 2015), assume that the data are either normally distributed or from a latent factor model. In this paper, we remove these restrictive assumptions and develop a new asymptotic theory, showing that the asymptotic null distribution is a mixture of $\chi ^2$ mixture and normal distributions. With more conditions on the eigenvalues of the covariance matrices, a normal or $\chi ^2$ mixture approximation for the limiting null distribution is derived. The power functions of two test statistic under high-dimensional version local and fixed alternative are also analyzed. A wild bootstrap procedure is proposed to determine the critical values of the mixture of $\chi ^2$ mixture and normal distributions, which is easy to implement and fast to run. Numerical simulations show that our proposed methods control the test’s size more precisely than existing methods using the normal approximation. The merit of the proposed methods is further demonstrated on a real data example.
Cite this article
Download citation ▾
Gongming Shi, Nan Lin, Baoxue Zhang.
Testing the Mean Vector for High-Dimensional Data.
Communications in Mathematics and Statistics 1-24 DOI:10.1007/s40304-024-00398-2
| [1] |
Alon U, Barkai N, Notterman DA, Gish K, Ybarra S, Mack D, Levine AJ. Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proc. Natl. Acad. Sci., 1999, 96(12): 6745-6750
|
| [2] |
Bai, Z., Saranadasa, H.: Effect of high dimension: by an example of a two sample problem. Stat. Sin. 311–329 (1996)
|
| [3] |
Chen SX, Qin Y-L. A two-sample test for high-dimensional data with applications to gene-set testing. Ann. Stat., 2010, 38(2): 808-835
|
| [4] |
Chen B, Wang H-M. High-dimensional tests for mean vector: approaches without estimating the mean vector directly. Acta Math. Appl. Sin. English Ser., 2022, 38(1): 78-86
|
| [5] |
Cui X, Li R, Yang G, Zhou W. Empirical likelihood test for a large-dimensional mean vector. Biometrika, 2020, 107(3): 591-607
|
| [6] |
Dehling H, Mikosch T. Random quadratic forms and the bootstrap for u-statistics. J. Multivar. Anal., 1994, 51(2): 392-413
|
| [7] |
Hu Z, Tong T, Genton MG. Diagonal likelihood ratio test for equality of mean vectors in high-dimensional data. Biometrics, 2019, 75(1): 256-267
|
| [8] |
Mammen E. Bootstrap and wild bootstrap for high dimensional linear models. Ann. Stat., 1993, 21(1): 255-285
|
| [9] |
Park J, Ayyala DN. A test for the mean vector in large dimension and small samples. J. Stat. Plan. Inference, 2013, 143(5): 929-943
|
| [10] |
Pauly M, Ellenberger D, Brunner E. Analysis of high-dimensional one group repeated measures designs. Statistics, 2015, 49(6): 1243-1261
|
| [11] |
Sattler P, Pauly M. Inference for high-dimensional split-plot-designs: a unified approach for small to large numbers of factor levels. Electron. J. Stat., 2018, 12(2): 2743-2805
|
| [12] |
Serfling RJ. Approximation Theorems of Mathematical Statistics, 2009 New York Wiley
|
| [13] |
Srivastava MS, Du M. A test for the mean vector with fewer observations than the dimension. J. Multivar. Anal., 2008, 99(3): 386-402
|
| [14] |
Vaart AW. Asymptotic Statistics, 2000 Cambridge Cambridge University Press 3
|
| [15] |
Wang R, Xu X. On two-sample mean tests under spiked covariances. J. Multivar. Anal., 2018, 167: 225-249
|
| [16] |
Wang L, Peng B, Li R. A high-dimensional nonparametric multivariate test for mean vector. J. Am. Stat. Assoc., 2015, 110(512): 1658-1669
|
| [17] |
Zhang, J.-T., Zhou, B., Guo, J.: Testing high-dimensional mean vector with applications. Stat. Pap. 1–33 (2021)
|
| [18] |
Zhang J-T, Guo J, Zhou B, Cheng M-Y. A simple two-sample test in high dimensions based on l 2-norm. J. Am. Stat. Assoc., 2020, 115(530): 1011-1027
|
| [19] |
Zhao J. A new test for the mean vector in large dimension and small samples. Commun. Stat. Simul. Comput., 2017, 46(8): 6115-6128
|
Funding
National Natural Science Foundation of China(12271370)
RIGHTS & PERMISSIONS
School of Mathematical Sciences, University of Science and Technology of China and Springer-Verlag GmbH Germany, part of Springer Nature
Just Accepted
This article has successfully passed peer review and final editorial review, and will soon enter typesetting, proofreading and other publishing processes. The currently displayed version is the accepted final manuscript. The officially published version will be updated with format, DOI and citation information upon launch. We recommend that you pay attention to subsequent journal notifications and preferentially cite the officially published version. Thank you for your support and cooperation.