Testing the Order of a Normal Mixture in Mean

Jiahua Chen , Pengfei Li

Communications in Mathematics and Statistics ›› 2016, Vol. 4 ›› Issue (1) : 21 -38.

PDF
Communications in Mathematics and Statistics ›› 2016, Vol. 4 ›› Issue (1) : 21 -38. DOI: 10.1007/s40304-015-0079-5
Article

Testing the Order of a Normal Mixture in Mean

Author information +
History +
PDF

Abstract

There has been a rapid progress in designing valid and effective statistical hypothesis tests for the order of a finite mixture model. In particular, EM-test for the order of the mixture model has been developed and found effective when the component distribution contains a single parameter. EM-test is found to be particularly effective and elegant for the order of normal mixture in both mean and variance. The idea behind EM-test has been found widely applicable. In this paper, we investigate the use of EM-test for the order of a finite normal mixture in the mean parameter with equal but unknown component variances. We show that for any positive integer $m_0 \ge 2,$ the limiting distribution of the EM-test for the order of $m_0$ against the higher order alternative is $\chi ^2_{m_0-1}.$ A genetic example is used to illustrate the application of the EM-test.

Keywords

EM-test / Homogeneity / Mixture model / Modified likelihood ratio test / Structural parameter

Cite this article

Download citation ▾
Jiahua Chen, Pengfei Li. Testing the Order of a Normal Mixture in Mean. Communications in Mathematics and Statistics, 2016, 4(1): 21-38 DOI:10.1007/s40304-015-0079-5

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

Bickel P, Chernoff H. Ghosh JK. Asymptotic distribution of the likelihood ratio statistic in a prototypical non regular problem. Statistics and Probability. 1993 New Delhi: Wiley Eastern Limited. 83-96

[2]

Cai T, Jin J, Low M. Estimation and confidence sets for sparse normal mixtures. Ann. Stat.. 2007, 35 2421-2449

[3]

Chen H, Chen J. Tests for homogeneity in normal mixtures with presence of a structural parameter. Stat. Sin.. 2003, 13 351-365

[4]

Chen H, Chen J, Kalbfleisch JD. A modified likelihood ratio test for homogeneity in finite mixture models. J. R. Stat. Soc. B. 2001, 63 19-29

[5]

Chen H, Chen J, Kalbfleisch JD. Testing for a finite mixture model with two components. J. R. Stat. Soc. B. 2004, 66 95-115

[6]

Chen J. Optimal rate of convergence in finite mixture models. Ann. Stat.. 1995, 23 221-234

[7]

Chen J, Li P. Hypothesis test for normal mixture models: the EM approach. Ann. Stat.. 2009, 37 2523-2542

[8]

Chen J, Li P, Fu Y. Inference on the order of a normal mixture. J. Am. Stat. Assoc.. 2012, 107 1096-1105

[9]

Dacunha-Castelle D, Gassiat E. Testing the order of a model using locally conic parametrization: population mixtures and stationary ARMA processes. Ann. Stat.. 1999, 27 1178-1209

[10]

Dempster AP, Laird NM, Rubin DB. Maximum likelihood from incomplete data via EM algorithm (with discussion). J. R. Stat. Soc. B. 1977, 39 1-38

[11]

Efron B. Large-scale simulation hypothesis testing: the choice of a null hypothesis. J. Am. Stat. Assoc.. 2004, 99 96-104

[12]

Everitt BS. An introduction to finite mixture distributions. Stat. Methods Med. Res.. 1996, 5 107-127

[13]

Ghosh, J.K., Sen, P.K.: On the asymptotic performance of the log-likelihood ratio statistic for the mixture model and related results. In: LeCam, L., Olshen, R.A. (eds.) Proceedings of the Berkeley Conference in Honor of J. Neyman and Kiefer, vol. 2, pp. 789–806 (1985)

[14]

Hartigan, J.A.: A failure of likelihood asymptotics for normal mixtures. In: LeCam, L., Olshen, R.A. (eds.) Proceedings of the Berkeley Conference in Honor of J. Neyman and Kiefer, vol. 2, pp. 807–810 (1985)

[15]

Kon S. Models of stock returns—a comparison. J. Finance. 1984, 39 147-165

[16]

Li P, Chen J, Marriott P. Non-finite Fisher information and homogeneity: the EM approach. Biometrika. 2009, 96 411-442

[17]

Liu X, Pasarica C, Shao Y. Testing homogeneity in gamma mixture models. Scand. J. Stat.. 2003, 30 227-239

[18]

Liu X, Shao YZ. Asymptotics for likelihood ratio tests under loss of identifiability. Ann. Stat.. 2003, 31 807-832

[19]

Liu X, Shao YZ. Asymptotics for the likelihood ratio test in a two-component normal mixture model. J. Stat. Plan. Inference. 2004, 123 61-81

[20]

Loisel P, Goffinet B, Monod H, Montes De Oca G. Detecting a major gene in an F2 population. Biometrics. 1994, 50 512-516

[21]

MacKenzie SA, Bassett MJ. Genetics of fertility restoration in cytoplasmic sterile Phaseolus vulgaris L. I. Cytoplasmic alteration by a nuclear restorer gene. Theor. Appl. Genet.. 1987, 74 642-645

[22]

McLachlan GJ, Bean RW, Ben-Tovim Jones L. A simple implementation of a normal mixture approach to differential gene expression in multiclass microarrays. Bioinformatics. 2006, 22 1608-1615

[23]

Raftery AE, Dean N. Variable selection for model-based clustering. J. Am. Stat. Assoc.. 2006, 101 168-178

[24]

Roeder K. A graphical technique for determining the number of components in a mixture of normals. J. Am. Stat. Assoc.. 1994, 89 487-500

[25]

Schork NJ, Allison DB, Thiel B. Mixture distributions in human genetics. Stat. Methods Med. Res.. 1996, 5 155-178

[26]

Serfling, R.J.: Approximation theorems of mathematical statistics. Wiley, New York (1980)

[27]

Sun W, Cai T. Oracle and adaptive compound decision rules for false discovery rate control. J. Am. Stat. Assoc.. 2007, 102 901-912

[28]

Tadesse M, Sha N, Vannucci M. Bayesian variable selection in clustering high-dimensional data. J. Am. Stat. Assoc.. 2005, 100 602-617

[29]

Wu CFJ. On the convergence properties of the EM algorithm. Ann. Stat.. 1983, 11 95-103

Funding

Natural Sciences and Engineering Research Council of Canada(RGPIN-2014-03743)

Natural Sciences and Engineering Research Council of Canada(RGPIN-2015-06592)

AI Summary AI Mindmap
PDF

263

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/