Pathway-based analysis of genome-wide association study of circadian phenotypes

Didi Zhu , Jiamin Yuan , Rui Zhu , Yao Wang , Zhiyong Qian , Jiangang Zou

Journal of Biomedical Research ›› 2018, Vol. 32 ›› Issue (5) : 361 -370.

PDF (482KB)
Journal of Biomedical Research ›› 2018, Vol. 32 ›› Issue (5) : 361 -370. DOI: 10.7555/JBR.32.20170102
Original Article
Original Article

Pathway-based analysis of genome-wide association study of circadian phenotypes

Author information +
History +
PDF (482KB)

Abstract

Sleepiness affects normal social life, which attracts more and more attention. Circadian phenotypes contribute to obvious individual differences in susceptibility to sleepiness. We aimed to identify candidate single nucleotide polymorphisms (SNPs) which may cause circadian phenotypes, elucidate the potential mechanisms, and generate corresponding SNP-gene-pathways. A genome-wide association studies (GWAS) dataset of circadian phenotypes was utilized in the study. Then, the Identify Candidate Causal SNPs and Pathways analysis was employed to the GWAS dataset after quality control filters. Furthermore, genotype-phenotype association analysis was performed with HapMap database. Four SNPs in three different genes were determined to correlate with usual weekday bedtime, totally providing seven hypothetical mechanisms. Eleven SNPs in six genes were identified to correlate with usual weekday sleep duration, which provided six hypothetical pathways. Our results demonstrated that fifteen candidate SNPs in eight genes played vital roles in six hypothetical pathways implicated in usual weekday bedtime and six potential pathways involved in usual weekday sleep duration.

Keywords

circadian phenotypes / genome-wide association studies / pathway-based analysis

Cite this article

Download citation ▾
Didi Zhu, Jiamin Yuan, Rui Zhu, Yao Wang, Zhiyong Qian, Jiangang Zou. Pathway-based analysis of genome-wide association study of circadian phenotypes. Journal of Biomedical Research, 2018, 32(5): 361-370 DOI:10.7555/JBR.32.20170102

登录浏览全文

4963

注册一个新账户 忘记密码

Introduction

Sleepiness impairs social function, reduces quality of life and causes occupational and motor vehicle accidents[1]. While behavioral factors, circadian factors (time of day), duration of wakefulness and sleep disorders are closely linked to daytime sleepiness[2], there are great interindividual differences in susceptibility to sleepiness[3]. Accumulating evidence shows that excessive sleepiness is heritable[45]. In modern society, nearly one-fifth of employees are involved in long-term night shift[6]. As a result, work performance and scheduling have a significant impact on individual variability in diurnal preference. Studies also indicate that diurnal preference (namely usual weekday bedtime) is heritable[79]. In addition, usual weekday sleep duration plays a critical role in daytime sleepiness. It has been investigated whether short or long sleep duration has been related to coronary heart disease[10], diabetes mellitus[1112], hypertension[13], and mortality[14]. Likewise, usual day sleep duration is heritable[15].

To date, several single nucleotide polymorphisms (SNPs) associated with circadian phenotypes in some genes were detected from three genome-wide association studies (GWASs)[1618], but the functions of these SNPs remain undefined, which is a challenge in interpreting GWAS results[19]. Thus, pathway-based approaches were optimized gradually, and the Identify Candidate Causal SNPs and Pathways (ICSNPathway) was created to determine potential SNPs and hypothetical mechanisms through GWAS data, using linkage disequilibrium (LD) analysis, functional SNP annotation and pathway-based analysis (PBA)[20]. Herein, we used bioinformatics methods combining ICSNPathway analysis and HapMap database to identify candidate SNPs and relevant pathways, aiming to develop SNP-gene-pathway hypotheses regarding circadian phenotypes.

Materials and methods

Study population and data extraction

We applied publicly available databases to identify eligible GWASs on circadian phenotypes, which are the National Human Genome Research Institute GWAS catalog (http://www.genome.gov/26525384), the National Center for Biotechnology Information (NCBI) dbGap (http://www.ncbi.nlm.nih.gov/gap/), and the GWAS central (http://www.gwascentral.org/). In addition, both EMBASE and PUBMED databases were searched with the following key words: “GWAS” or “genome-wide association study” and “circadian”. All searches were completed up to April 20th, 2016 without language limitation. In order to reduce the effect of genotyping errors, two independent authors (DZ and JYuan) filtered the primary GWAS data set and removed individuals with a call rate<95%, minor allele frequency<0.01, and deviating from the Hardy-Weinberg equilibrium (HWE) test (P<0.001). During data extraction, discussion with a third author (YW) helped resolve the discrepancies, with consensus on each item reached in the end. After extracting data from the original papers and contacting the corresponding authors, we ruled out the studies without details as needed.

Identification of candidate causal SNPs and pathways

ICSNPathway analysis was conducted in two consecutive stages. In the first stage, the candidate SNPs were pre-selected by LD analysis and functional SNP annotation with P values of <0.05[20]. During the LD analysis, we queried GWAS to capture the SNPs in LD (with r2>0.8) and positioned in the flanking region (with up to 500 kb upstream and downstream) . The extended dataset including HapMap data (http://hapmap.ncbi.nlm.nih.gov) was utilized to obtain more possible candidate SNPs[21]. Additionally, to gain LD structures, we used SNAP dataset (http://www.broadinstitute.org/mpg/snap/)[22]. The other method involves the functional annotation on the SNPs by searching the international SNP function annotation databases, including PolyPhen-2 (http://genetics.bwh.harvard.edu/pph2/)[23], Ensembl database (http://www.ensembl.org)[24], SNPs3D (http://www.snps3d.org)[25], and SIFT (http://sift.jcvi.org)[26].

Genotypic frequencies of candidate SNPs was extracted from the International HapMap Project (phase II, release 23), consisting of 3.96 million SNP genotypes from 270 subjects[27]. Besides, the data of corresponding mRNA expression was acquired from lymphoblastic cell lines of the 270 individuals mentioned above[28], which was extracted from SNPexp (http://app3.titan.uio.no/biotools/help.php?app=snpexp/)[29].

During the second stage, PBA algorithm was employed to annotate biological pathways of selected SNPs by integrating data from four databases, including BioCarta (http://www.biocarta.com), MsiDB (http://www.broadinstitute.org/gsea/msigdb), Kyoto Encyclopedia of Genes and Genomes (KEGG, http://www.genome.jp/kegg) and gene ontology (GO, http://www.geneontology.org). Furthermore, SNP label normalization and permutation were adopted to correct gene variations and generate the distribution of significant proportion based enrichment score (SPES).According to the distributions of SPESs, a nominal P-value and a FDR (false discovery rate; cutoff value: 0.05) were calculated.

Statistical analysis

The expression levels were shown as mean±SEM, and the difference between two genotypes was evaluated by two-side Student's t test. Furthermore, one way ANOVA was utilized to assess the difference of transcript expression levels in more than two genotypes. The statistical analysis was performed with SPSS version 21.0. P values<0.05 were considered statistically significant.

Results

Characteristics of the study population

One GWAS drawn from NCBI dbGap (study accession: phs000007) was finally adopted in our study[16] with publicly available summary data after a thorough search. In the GWAS on circadian phenotypes(including usual weekday bedtime and usual weekday sleep duration), totally 749 subjects were collected from the Framingham Offspring Study containing 2848 participants who accomplished sleep habit questionnaires between 1995 and 1998 (Offspring Examination Cycle 6) for the Sleep Heart Health Study[30]. For usual weekday bedtime, 65,514 candidate causative SNPs were originally generated with an Affymetrix 100K SNP Gene Chip, and afterwards 47,285 SNPs passed the quality control filters which were employed for ultimate bioinformatics analysis. Besides, for usual weekday sleep duration, 65,514 SNPs were generated with the gene chip, while 47,301 SNPs met the quality control criterions and were then applied for subsequent analysis.

Candidate SNPs and pathways

As presented in Table 1, totally four SNPs in three genes were determined to correlate with usual weekday bedtime, namely, MT-ND5 rs10517616, GRSF1 rs3775728, and ENAM rs7671281, rs3796704 polymorphisms. Moreover, eleven SNPs in six genes were identified to correlate with usual weekday sleep duration, namely, HSPD1 rs8539, APOBEC2 rs2076472, GRSF1 rs3775728, TTN rs9808377, rs1001238, rs2042995, rs3829746, rs2042996, CENPE rs2243682, rs2615542 and SLC17A1 rs13213957. Of note, GRSF1 rs3775728 was linked with both usual weekday bedtime and usual weekday sleep duration. SNP rs3775728 was in LD with rs2278134 (r2=1.0) ; rs7671281 and rs3796704 were in LD with rs2553319 (r2=1.0, and 1.0, respectively); rs9808377, rs1001238 and rs2042995 were in LD with rs3829746 (r2=0.945, 0.946, and 0.945, respectively); rs2243682 and rs2615542 were in LD with rs2290943 (r2=1.0, and 1.0, respectively); SNP rs13213957 was in LD with rs3734523 (r2=0.828). Except for a repeated SNP, fourteen regional LD plots are shown in Fig. 1.

Then, we examined the roles of different genotypes in mRNA expression levels via HapMap c-DNA expression database which was publicly available. No significant association between all SNPs with the mRNA expressions of corresponding genes was found in Caucasians as presented in Table 2. However, the SLC17A1 rs13213957 polymorphisms might tend to affect the mRNA expression levels of SLC17A1 (with marginal P value=0.0785), which is consistent with the functional class indicated in Table 1. In addition, the functions of the corresponding proteins were examined, which demonstrated that all SNPs could cause residue change except for HSPD1 rs8539, summarized in Table 3. In addition, MT-ND5 rs10517616 was not estimated here because no data was available publicly.

During the ICSNPathway analysis, six pathways about usual weekday bedtime were detected and are summarized in Table 4. The first mechanism involved MT-ND5 rs10517616 polymorphism (nonsynonymous coding) in pathways such as NADH dehydrogenase activity (nominal P<0.001, FDR=0.011), respiratory electron transport chain (nominal P=0.001, FDR=0.011), oxidoreductase activity (nominal P=0.002, FDR=0.017), and oxidative phosphorylation (nominal P=0.004, FDR=0.047). The second was GRSF1 rs3775728 polymorphism (nonsynonymous coding) in mRNA binding pathway (nominal P<0.001, FDR=0.014). The third one included ENAM rs7671281, rs3796704 polymorphisms (nonsynonymous coding) in pathway of biomineral formation (nominal P<0.001, FDR=0.021).

In the ICSNPathway analysis of usual weekday sleep duration, six pathways were found and are presented in Table 4 similarly. The first was HSPD1 rs8539 polymorphism (nonsynonymous coding) in the unfolded protein binding pathway (nominal P=0.001 FDR=0.03). The second one was APOBEC2 rs2076472 polymorphism (nonsynonymous coding) in pathway of mRNA processing (nominal P<0.001, FDR=0.031). The third mechanism involved GRSF1 rs3775728 polymorphism (nonsynonymous coding) in pathways containing mRNA processing (nominal P<0.001, FDR=0.031), RNA processing (nominal P=0.002, FDR=0.039), and mRNA binding (nominal P<0.001, FDR=0.042). The fourth pathway consisted of TTN rs9808377, rs1001238, rs2042995, rs3829746, rs2042996, and CENPE rs2243682, rs2615542 polymorphisms (nonsynonymous coding) in cell cycle (nominal P<0.001, FDR=0.036). The last one was SLC17A1 rs13213957 polymorphism (regulatory region) in the anion transport pathway (nominal P<0.001, FDR=0.042).

Discussion

A compound molecular network may make a significant contribution to the development of circadian phenotypes, containing several cellular pathways[31]. GWASs are limited to detect single SNP associations and identify new loci, so we applied a pathway-based pattern to take the biological interplay between multiple genes into consideration, and propose novel views into how genes might help the development of circadian phenotypes[32].

In this study, we applied ICSNPathway analysis to identify six potential regulating mechanisms, respectively, in usual weekday bedtime and sleep duration. The most significant SNP-to-gene-to-effect hypothesis was that rs10517616 changes the feature of MT-ND5 in NADH dehydrogenase activity[33]. It was reported that NADH promoted the transcription of the lactate dehydrogenase (LDH) gene under redox state. This is based on the activation of E-box by binding heterodimer Bmal1/NPAS2, the master brain clock to regulate circadian rhythmicity[]. The second candidate gene GRSF1 found in this study and previous studies has been implied in the pathway of mRNA binding through SNP rs3775728[3435]. The third biological mechanism involves the modulation of ENAM by rs7671281 and rs3796704 to affect its role in mineral formation[3637]. The forth one involves the influence of rs8539 on HSPD1 in unfolded protein binding[38]. The fifth involves the modulation of APOBEC2 by rs2076472 to affect mRNA processing. The sixth involves the modulation of TTN by rs9808377, rs1001238, rs2042995, rs3829746, and rs2042996 as well as CENPE by rs2243682 and rs2615542 to influence its role in cell cycle[39]. The seventh involves the modulation of SLC17A1 by rs13213957 to affect anion transport[4041], which could influence the mRNA expression of SLC17A1.

As far as we know, these mechanisms of circadian phenotypes, including MT-ND5, GRSF1, ENAM, HSPD1, APOBEC2, TTN, CENPE and SLC17A1, have been firstly identified in our study. The ICSNPathway analysis has been conducted to identify candidate causal genes relevant to disease-related phenotypes such as rheumatoid arthritis[20]. Thus, the results received in our study might help the development of novel hypotheses for the further investigations.

Even though the abovementioned biological mechanisms may affect circadian phenotypes, several limitations should be acknowledged. Firstly, the data was obtained from only 749 subjects[16], which may limit the application to the whole populations and weaken the authority to identify the candidate SNPs. Secondly, with no study supplying strong supports for these results, the candidate SNP-gene-pathways should be verified in more studies.

In short, our results demonstrated fifteen candidate SNPs in eight genes (MT-ND5 rs10517616, GRSF1 rs3775728, ENAM rs7671281, rs3796704, HSPD1 rs8539, APOBEC2 rs2076472, GRSF1 rs3775728, TTN rs9808377, rs1001238, rs2042995, rs3829746, rs2042996, CENPE rs2243682, rs2615542 and SLC17A1 rs13213957 polymorphisms), which participate in six hypothetical pathways involved in usual weekday bedtime and six potential pathways implicated usual weekday sleep duration. However, further investigations are warranted to validate the identified genetic variations in the biological pathways related to circadian phenotypes.

References

[1]

Roth T, Rosenberg RP. Managing excessive daytime sleepiness[J]. J Clin Psychiatry, 2015, 76(11): 1518–1521, 1521.

[2]

Dijk DJ, Duffy JF, Czeisler CA. Circadian and sleep/wake dependent aspects of subjective alertness and cognitive performance[J]. J Sleep Res, 1992, 1(2): 112–117

[3]

Van Dongen HP, Baynard MD, Maislin G, Systematic interindividual differences in neurobehavioral impairment from sleep loss: evidence of trait-like differential vulnerability[J]. Sleep, 2004, 27(3): 423–433

[4]

Carmelli D, Bliwise DL, Swan GE, A genetic analysis of the Epworth Sleepiness Scale in 1560 World War II male veteran twins in the NAS-NRC Twin Registry[J]. J Sleep Res, 2001, 10(1): 53–58

[5]

Watson NF, Goldberg J, Arguelles L, Genetic and environmental influences on insomnia, daytime sleepiness, and obesity in twins[J]. Sleep, 2006, 29(5): 645–649

[6]

Drake CL, Roehrs T, Richardson G, Shift work sleep disorder: prevalence and consequences beyond that of symptomatic day workers[J]. Sleep, 2004, 27(8): 1453–1462

[7]

Heath AC, Kendler KS, Eaves LJ, Evidence for genetic influences on sleep disturbance and sleep pattern in twins[J]. Sleep, 1990, 13(4): 318–335

[8]

Vink JM, Groot AS, Kerkhof GA, Genetic analysis of morningness and eveningness[J]. Chronobiol Int, 2001, 18(5): 809–822

[9]

Klei L, Reitz P, Miller M, Heritability of morningness-eveningness and self-report sleep measures in a family-based sample of 521 hutterites[J]. Chronobiol Int, 2005, 22(6): 1041–1054

[10]

Ayas NT, White DP, Manson JE, A prospective study of sleep duration and coronary heart disease in women[J]. Arch Intern Med, 2003, 163(2): 205–209

[11]

Ayas NT, White DP, Al-Delaimy WK, A prospective study of self-reported sleep duration and incident diabetes in women[J]. Diabetes Care, 2003, 26(2): 380–384

[12]

Gottlieb DJ, Punjabi NM, Newman AB, Association of sleep time with diabetes mellitus and impaired glucose tolerance[J]. Arch Intern Med, 2005, 165(8): 863–867

[13]

Gottlieb DJ, Redline S, Nieto FJ, Association of usual sleep duration with hypertension: the Sleep Heart Health Study[J]. Sleep, 2006, 29(8): 1009–1014

[14]

Cappuccio FP, Cooper D, D’Elia L, Sleep duration predicts cardiovascular outcomes: a systematic review and meta-analysis of prospective studies[J]. Eur Heart J, 2011, 32(12): 1484–1492

[15]

Partinen M, Kaprio J, Koskenvuo M, Genetic and environmental determination of human sleep[J]. Sleep, 1983, 6(3): 179–185

[16]

Gottlieb DJ, O'Connor GT, Wilk JB. Genome-wide association of sleep and circadian phenotypes[J]. BMC Med Genet, 2007, 8 Suppl 1(Suppl 1): S9.

[17]

Lane JM, Vlasac I, Anderson SG, Genome-wide association analysis identifies novel loci for chronotype in 100,420 individuals from the UK Biobank[J]. Nat Commun, 2016, 7: 10889

[18]

Hu Y, Shmygelska A, Tran D, GWAS of 89,283 individuals identifies genetic variants associated with self-reporting of being a morning person[J]. Nat Commun, 2016, 7: 10448

[19]

Wang K, Li M, Hakonarson H. Analysing biological pathways in genome-wide association studies[J]. Nat Rev Genet, 2010, 11(12): 843–854

[20]

Zhang K, Chang S, Cui S, Guo L, Zhang L, Wang J.ICSNPathway: identify candidate causal SNPs and pathways from genome-wide association study by one analytical framework[J]. Nucleic Acids Res, 2011, 39(Web Server issue): W437–W443.

[21]

Altshuler DM, Gibbs RA, Peltonen L, Integrating common and rare genetic variation in diverse human populations[J]. Nature, 2010, 467(7311): 52–58

[22]

Johnson AD, Handsaker RE, Pulit SL, SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap[J]. Bioinformatics, 2008, 24(24): 2938–2939

[23]

Adzhubei I, Jordan DM, Sunyaev SR. Predicting functional effect of human missense mutations using PolyPhen-2[J]. Curr Protoc Hum Genet, 2013, Chapter 7: t7–t20.

[24]

Flicek P, Aken BL, Ballester B, Ensembl’s 10th year[J]. Nucleic Acids Res, 2010, 38(Database issue): D557–D562

[25]

Yue P, Melamud E, Moult J. SNPs3D: candidate gene and SNP selection for association studies[J]. BMC Bioinformatics, 2006, 7: 166

[26]

Kumar P, Henikoff S, Ng PC. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm[J]. Nat Protoc, 2009, 4(7): 1073–1081

[27]

He J, Shi TY, Zhu ML, Associations of Lys939Gln and Ala499Val polymorphisms of the XPC gene with cancer susceptibility: a meta-analysis[J]. Int J Cancer, 2013, 133(8): 1765–1775

[28]

Stranger BE, Forrest MS, Dunning M, Relative impact of nucleotide and copy number variation on gene expression phenotypes[J]. Science, 2007, 315(5813): 848–853

[29]

Holm K, Melum E, Franke A, SNPexp- A web tool for calculating and visualizing correlation between HapMap genotypes and gene expression levels[J]. BMC Bioinformatics, 2010, 11: 600

[30]

Quan SF, Howard BV, Iber C, The Sleep Heart Health Study: design, rationale, and methods[J]. Sleep, 1997, 20(12): 1077–1085

[31]

Bei B, Wiley JF, Trinder J, Beyond the mean: A systematic review on the correlates of daily intraindividual variability of sleep/wake patterns[J]. Sleep Med Rev, 2016, 28: 108–124

[32]

Pedroso I, Breen G. Gene set analysis and network analysis for genome-wide association studies[J]. Cold Spring Harb Protoc, 2011, 2011(9): pdb.top065581

[33]

Houštek J, Hejzlarová K, Vrbacký M, Nonsynonymous variants in mt-Nd2, mt-Nd4, and mt-Nd5 are linked to effects on oxidative phosphorylation and insulin sensitivity in rat conplastic strains[J]. Physiol Genomics, 2012, 44(9): 487–494

[34]

DeBruyne JP1, Weaver DR, Reppert SM. CLOCK and NPAS2 have overlapping roles in the suprachiasmatic circadian clock[J]. Nat Neurosci, 2007, 10(5): 543–545

[35]

Jourdain AA, Koppen M, Wydro M, GRSF1 regulates RNA processing in mitochondrial RNA granules[J]. Cell Metab, 2013, 17(3): 399–410

[36]

Antonicka H, Sasarman F, Nishimura T, The mitochondrial RNA-binding protein GRSF1 localizes to RNA granules and is required for posttranscriptional mitochondrial gene expression[J]. Cell Metab, 2013, 17(3): 386–398

[37]

Smith CE, Wazen R, Hu Y, Consequences for enamel development and mineralization resulting from loss of function of ameloblastin or enamelin[J]. Eur J Oral Sci, 2009, 117(5): 485–497

[38]

Hu JC, Hu Y, Smith CE, Enamel defects and ameloblast-specific expression in Enam knock-out/lacz knock-in mice[J]. J Biol Chem, 2008, 283(16): 10858–10871

[39]

Magnoni R, Palmfeldt J, Hansen J, The Hsp60 folding machinery is crucial for manganese superoxide dismutase folding and function[J]. Free Radic Res, 2014, 48(2): 168–179

[40]

Iemura K, Tanaka K. Chromokinesin Kid and kinetochore kinesin CENP-E differentially support chromosome congression without end-on attachment to microtubules[J]. Nat Commun, 2015, 6: 6447

[41]

Reimer RJ. SLC17: a functionally diverse family of organic anion transporters[J]. Mol Aspects Med, 2013, 34(2-3): 350–359

[42]

Togawa N, Miyaji T, Izawa S, A Na+-phosphate cotransporter homologue (SLC17A4 protein) is an intestinal organic anion exporter[J]. Am J Physiol Cell Physiol, 2012, 302(11): C1652–C1660

RIGHTS & PERMISSIONS

2018 by the Journal of Biomedical Research.

AI Summary AI Mindmap
PDF (482KB)

596

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/