Testing the Mean Vector for High-Dimensional Data

Gongming Shi , Nan Lin , Baoxue Zhang

Communications in Mathematics and Statistics ›› : 1 -24.

PDF
Communications in Mathematics and Statistics ›› : 1 -24. DOI: 10.1007/s40304-024-00398-2
Article

Testing the Mean Vector for High-Dimensional Data

Author information +
History +
PDF

Abstract

In one-sample mean testing for high-dimensional data, existing tests, e.g., Chen and Qin (Ann Stat 38(2):808–835, 2010) and Wang et al. (J Am Stat Assoc 110(512):1658–1669, 2015), assume that the data are either normally distributed or from a latent factor model. In this paper, we remove these restrictive assumptions and develop a new asymptotic theory, showing that the asymptotic null distribution is a mixture of $\chi ^2$ mixture and normal distributions. With more conditions on the eigenvalues of the covariance matrices, a normal or $\chi ^2$ mixture approximation for the limiting null distribution is derived. The power functions of two test statistic under high-dimensional version local and fixed alternative are also analyzed. A wild bootstrap procedure is proposed to determine the critical values of the mixture of $\chi ^2$ mixture and normal distributions, which is easy to implement and fast to run. Numerical simulations show that our proposed methods control the test’s size more precisely than existing methods using the normal approximation. The merit of the proposed methods is further demonstrated on a real data example.

Cite this article

Download citation ▾
Gongming Shi, Nan Lin, Baoxue Zhang. Testing the Mean Vector for High-Dimensional Data. Communications in Mathematics and Statistics 1-24 DOI:10.1007/s40304-024-00398-2

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

Alon U, Barkai N, Notterman DA, Gish K, Ybarra S, Mack D, Levine AJ. Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proc. Natl. Acad. Sci., 1999, 96(12): 6745-6750

[2]

Bai, Z., Saranadasa, H.: Effect of high dimension: by an example of a two sample problem. Stat. Sin. 311–329 (1996)

[3]

Chen SX, Qin Y-L. A two-sample test for high-dimensional data with applications to gene-set testing. Ann. Stat., 2010, 38(2): 808-835

[4]

Chen B, Wang H-M. High-dimensional tests for mean vector: approaches without estimating the mean vector directly. Acta Math. Appl. Sin. English Ser., 2022, 38(1): 78-86

[5]

Cui X, Li R, Yang G, Zhou W. Empirical likelihood test for a large-dimensional mean vector. Biometrika, 2020, 107(3): 591-607

[6]

Dehling H, Mikosch T. Random quadratic forms and the bootstrap for u-statistics. J. Multivar. Anal., 1994, 51(2): 392-413

[7]

Hu Z, Tong T, Genton MG. Diagonal likelihood ratio test for equality of mean vectors in high-dimensional data. Biometrics, 2019, 75(1): 256-267

[8]

Mammen E. Bootstrap and wild bootstrap for high dimensional linear models. Ann. Stat., 1993, 21(1): 255-285

[9]

Park J, Ayyala DN. A test for the mean vector in large dimension and small samples. J. Stat. Plan. Inference, 2013, 143(5): 929-943

[10]

Pauly M, Ellenberger D, Brunner E. Analysis of high-dimensional one group repeated measures designs. Statistics, 2015, 49(6): 1243-1261

[11]

Sattler P, Pauly M. Inference for high-dimensional split-plot-designs: a unified approach for small to large numbers of factor levels. Electron. J. Stat., 2018, 12(2): 2743-2805

[12]

Serfling RJ. Approximation Theorems of Mathematical Statistics, 2009 New York Wiley

[13]

Srivastava MS, Du M. A test for the mean vector with fewer observations than the dimension. J. Multivar. Anal., 2008, 99(3): 386-402

[14]

Vaart AW. Asymptotic Statistics, 2000 Cambridge Cambridge University Press 3

[15]

Wang R, Xu X. On two-sample mean tests under spiked covariances. J. Multivar. Anal., 2018, 167: 225-249

[16]

Wang L, Peng B, Li R. A high-dimensional nonparametric multivariate test for mean vector. J. Am. Stat. Assoc., 2015, 110(512): 1658-1669

[17]

Zhang, J.-T., Zhou, B., Guo, J.: Testing high-dimensional mean vector with applications. Stat. Pap. 1–33 (2021)

[18]

Zhang J-T, Guo J, Zhou B, Cheng M-Y. A simple two-sample test in high dimensions based on l 2-norm. J. Am. Stat. Assoc., 2020, 115(530): 1011-1027

[19]

Zhao J. A new test for the mean vector in large dimension and small samples. Commun. Stat. Simul. Comput., 2017, 46(8): 6115-6128

Funding

National Natural Science Foundation of China(12271370)

RIGHTS & PERMISSIONS

School of Mathematical Sciences, University of Science and Technology of China and Springer-Verlag GmbH Germany, part of Springer Nature

AI Summary AI Mindmap
PDF

194

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/