A nonparametric regression method for multiple longitudinal phenotypes using multivariate adaptive splines

Wensheng Zhu , Heping Zhang

Front. Math. China ›› 2012, Vol. 8 ›› Issue (3) : 731 -743.

PDF (124KB)
Front. Math. China ›› 2012, Vol. 8 ›› Issue (3) : 731 -743. DOI: 10.1007/s11464-012-0256-8
Research Article
RESEARCH ARTICLE

A nonparametric regression method for multiple longitudinal phenotypes using multivariate adaptive splines

Author information +
History +
PDF (124KB)

Abstract

In genetic studies of complex diseases, particularly mental illnesses, and behavior disorders, two distinct characteristics have emerged in some data sets. First, genetic data sets are collected with a large number of phenotypes that are potentially related to the complex disease under study. Second, each phenotype is collected from the same subject repeatedly over time. In this study, we present a nonparametric regression approach to study multivariate and time-repeated phenotypes together by using the technique of the multivariate adaptive regression splines for analysis of longitudinal data (MASAL), which makes it possible to identify genes, gene-gene and gene-environment, including time, interactions associated with the phenotypes of interest. Furthermore, we propose a permutation test to assess the associations between the phenotypes and selected markers. Through simulation, we demonstrate that our proposed approach has advantages over the existing methods that examine each longitudinal phenotype separately or analyze the summarized values of phenotypes by compressing them into one-time-point phenotypes. Application of the proposed method to the Framingham Heart Study illustrates that the use of multivariate longitudinal phenotypes enhanced the significance of the association test.

Keywords

Multivariate phenotypes / longitudinal data analysis / genetic association test / multivariate adaptive regression splines

Cite this article

Download citation ▾
Wensheng Zhu, Heping Zhang. A nonparametric regression method for multiple longitudinal phenotypes using multivariate adaptive splines. Front. Math. China, 2012, 8(3): 731-743 DOI:10.1007/s11464-012-0256-8

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

Carlborg O, Haley C S. Epistasis: too often neglected in complex trait studies?. Nat Rev Genet, 2004, 5: 618-625

[2]

Friedman J H. Multivariate adaptive regression splines. Ann Stat, 1991, 19: 1-141

[3]

Kallberg H, Padyukov L, Plenge R M Gene-gene and gene-environment interactions involving HLA-DRB1, PTPN22, and smoking in two subsets of rheumatoid arthritis. Am J Hum Genet, 2007, 80: 867-875

[4]

Kannel W B, Dawber T R, Kagan A Factors of risk in the development of coronary heart disease-six year follow-up experience. The Framingham Study. Ann Intern Med, 1961, 55: 33-50

[5]

Kathiresan S, Manning A K, Demissie S A genome-wide association study for blood lipid phenotypes in the Framingham Heart Study. BMC Med Genet, 2007, 8(Suppl1): S17

[6]

Kathiresan S, Melander O, Guiducci C Six new loci associated with blood lowdensity lipoprotein cholesterol, high-density lipoprotein cholesterol or triglycerides in humans. Nat Genet, 2008, 40: 189-197

[7]

Kooner J S, Chambers J C, Aguilar-Salinas C A Genome-wide scan identifies variation in MLXIPL associated with plasma triglycerides. Nat Genet, 2008, 40: 149-151

[8]

Lange C, Silverman E, Xu X A multivariate family-based association test using generalized estimating equations: FBAT-GEE. Biostatistics, 2003, 4: 195-206

[9]

Liang K Y, Zeger S L. Longitudinal data analysis using generalized linear models. Biometrika, 1986, 3: 13-22

[10]

Miller N E, Miller G J. Letter: high-density lipoprotein and atherosclerosis. Lancet, 1975, 1: 10-33

[11]

Namboodiri K K, Kaplan E B, Heuch I The Collaborative Lipid Research Clinics Family Study: biological and cultural determinants of familial resemblance for plasma lipids and lipoproteins. Genet Epidemiol, 1985, 2: 227-254

[12]

Pollin T I, Damcott C M, Shen H A null mutation in human APOC3 confers a favorable plasma lipid profile and apparent cardioprotection. Science, 2008, 322: 1702-1705

[13]

Xu X, Tian L, Wei L J. Combining dependent tests for linkage or association across multiple phenotypic traits. Biostatistics, 2003, 4: 223-229

[14]

Yeager M, Orr N, Hayes R B Genome-wide association study of prostate cancer identifies a second risk locus at 8q24. Nat Genet, 2007, 39: 645-649

[15]

Zhang H P. Multivariate adaptive splines for analysis of longitudinal data. J Comput Graph Stat, 1997, 6: 74-91

[16]

Zhang H P. Analysis of infant growth curves using multivariate adaptive splines. Biometrics, 1999, 55: 452-459

[17]

Zhang H P. Mixed effects multivariate adaptive splines model for the analysis of longitudinal and growth curve data. Stat Methods Med Res, 2004, 13: 63-82

[18]

Zhang H P, Liu C -T, Wang X Q. An association test for multiple traits based on the generalized Kendall’s tau. J Amer Stat Assoc, 2010, 105: 473-481

[19]

Zhang H P, Zhong X. Linkage analysis of longitudinal data and design consideration. BMC Genet, 2006, 7: 37

[20]

Zhu W S, Jiang Y, Zhang H P. Nonparametric covariate-adjusted association tests based on the generalized Kendall’s tau. J Amer Stat Assoc, 2012, 107: 1-11

[21]

Zhu W S, Zhang H P. Why do we test multiple traits in genetic association studies? (with discussion). J Korean Stat Soc, 2009, 38: 1-10

AI Summary AI Mindmap
PDF (124KB)

876

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/