A nonparametric regression method for multiple longitudinal phenotypes using multivariate adaptive splines

Wensheng ZHU, Heping ZHANG

Front. Math. China ›› 2013, Vol. 8 ›› Issue (3) : 731-743.

PDF(124 KB)
PDF(124 KB)
Front. Math. China ›› 2013, Vol. 8 ›› Issue (3) : 731-743. DOI: 10.1007/s11464-012-0256-8
RESEARCH ARTICLE
RESEARCH ARTICLE

A nonparametric regression method for multiple longitudinal phenotypes using multivariate adaptive splines

Author information +
History +

Abstract

In genetic studies of complex diseases, particularly mental illnesses, and behavior disorders, two distinct characteristics have emerged in some data sets. First, genetic data sets are collected with a large number of phenotypes that are potentially related to the complex disease under study. Second, each phenotype is collected from the same subject repeatedly over time. In this study, we present a nonparametric regression approach to study multivariate and time-repeated phenotypes together by using the technique of the multivariate adaptive regression splines for analysis of longitudinal data (MASAL), which makes it possible to identify genes, gene-gene and gene-environment, including time, interactions associated with the phenotypes of interest. Furthermore, we propose a permutation test to assess the associations between the phenotypes and selected markers. Through simulation, we demonstrate that our proposed approach has advantages over the existing methods that examine each longitudinal phenotype separately or analyze the summarized values of phenotypes by compressing them into one-time-point phenotypes. Application of the proposed method to the Framingham Heart Study illustrates that the use of multivariate longitudinal phenotypes enhanced the significance of the association test.

Keywords

Multivariate phenotypes / longitudinal data analysis / genetic association test / multivariate adaptive regression splines

Cite this article

Download citation ▾
Wensheng ZHU, Heping ZHANG. A nonparametric regression method for multiple longitudinal phenotypes using multivariate adaptive splines. Front Math Chin, 2013, 8(3): 731‒743 https://doi.org/10.1007/s11464-012-0256-8

References

[1]
Carlborg O, Haley C S. Epistasis: too often neglected in complex trait studies? Nat Rev Genet, 2004, 5: 618-625
CrossRef Google scholar
[2]
Friedman J H. Multivariate adaptive regression splines. Ann Stat, 1991, 191-141
CrossRef Google scholar
[3]
Kallberg H, Padyukov L, Plenge R M, . Gene-gene and gene-environment interactions involving HLA-DRB1, PTPN22, and smoking in two subsets of rheumatoid arthritis. Am J Hum Genet, 2007, 80: 867-875
CrossRef Google scholar
[4]
Kannel W B, Dawber T R, Kagan A, . Factors of risk in the development of coronary heart disease-six year follow-up experience. The Framingham Study. Ann Intern Med, 1961, 55: 33-50
CrossRef Google scholar
[5]
Kathiresan S, Manning A K, Demissie S, . A genome-wide association study for blood lipid phenotypes in the Framingham Heart Study. BMC Med Genet, 2007, 8(Suppl 1): S17
CrossRef Google scholar
[6]
Kathiresan S, Melander O, Guiducci C, . Six new loci associated with blood lowdensity lipoprotein cholesterol, high-density lipoprotein cholesterol or triglycerides in humans. Nat Genet, 2008, 40: 189-197
CrossRef Google scholar
[7]
Kooner J S, Chambers J C, Aguilar-Salinas C A, . Genome-wide scan identifies variation in MLXIPL associated with plasma triglycerides. Nat Genet, 2008, 40: 149-151
CrossRef Google scholar
[8]
Lange C, Silverman E, Xu X, . A multivariate family-based association test using generalized estimating equations: FBAT-GEE. Biostatistics, 2003, 4: 195-206
CrossRef Google scholar
[9]
Liang K Y, Zeger S L. Longitudinal data analysis using generalized linear models. Biometrika, 1986, 3: 13-22
CrossRef Google scholar
[10]
Miller N E, Miller G J. Letter: high-density lipoprotein and atherosclerosis. Lancet, 1975, 1: 10-33
[11]
Namboodiri K K, Kaplan E B, Heuch I, . The Collaborative Lipid Research Clinics Family Study: biological and cultural determinants of familial resemblance for plasma lipids and lipoproteins. Genet Epidemiol, 1985, 2: 227-254
CrossRef Google scholar
[12]
Pollin T I, Damcott C M, Shen H, . A null mutation in human APOC3 confers a favorable plasma lipid profile and apparent cardioprotection. Science, 2008, 322: 1702-1705
CrossRef Google scholar
[13]
Xu X, Tian L, Wei L J. Combining dependent tests for linkage or association across multiple phenotypic traits. Biostatistics, 2003, 4: 223-229
CrossRef Google scholar
[14]
Yeager M, Orr N, Hayes R B, . Genome-wide association study of prostate cancer identifies a second risk locus at 8q24. Nat Genet, 2007, 39: 645-649
CrossRef Google scholar
[15]
Zhang H P. Multivariate adaptive splines for analysis of longitudinal data. J Comput Graph Stat, 1997, 6: 74-91
[16]
Zhang H P. Analysis of infant growth curves using multivariate adaptive splines. Biometrics, 1999, 55: 452-459
CrossRef Google scholar
[17]
Zhang H P. Mixed effects multivariate adaptive splines model for the analysis of longitudinal and growth curve data. Stat Methods Med Res, 2004, 13: 63-82
CrossRef Google scholar
[18]
Zhang H P, Liu C-T, Wang X Q. An association test for multiple traits based on the generalized Kendall’s tau. J Amer Stat Assoc, 2010, 105: 473-481
CrossRef Google scholar
[19]
Zhang H P, Zhong X. Linkage analysis of longitudinal data and design consideration. BMC Genet, 2006, 7: 37
CrossRef Google scholar
[20]
Zhu W S, Jiang Y, Zhang H P. Nonparametric covariate-adjusted association tests based on the generalized Kendall’s tau. J Amer Stat Assoc, 2012, 107: 1-11
CrossRef Google scholar
[21]
Zhu W S, Zhang H P. Why do we test multiple traits in genetic association studies? (with discussion), J Korean Stat Soc, 2009, 38: 1-10
CrossRef Google scholar

RIGHTS & PERMISSIONS

2014 Higher Education Press and Springer-Verlag Berlin Heidelberg
AI Summary AI Mindmap
PDF(124 KB)

Accesses

Citations

Detail

Sections
Recommended

/