Introduction
Cellular differentiation is a precisely regulated process by which hundreds of morphologically and functionally distinct cell lineages are derived from the same origin, embryonic stem cells (ESCs). Given that ESCs and somatic cell types share an identical genome, a fundamental question is how lineage-specific gene expression patterns are established and maintained. Accumulating evidence supports that two epigenetic mechanisms, DNA methylation and histone modifications, are crucial in this process (
Reik, 2007).
DNA methylation at the fifth carbon position of cytosine is an abundant covalent modification that primarily occurs in CpG dinucleotides (mCpGs), with 80% of CpGs being methylated in mammals. DNA methylation affects DNA–protein interactions and therefore influences transcription in various physiological processes, such as imprinting, X-inactivation, tumorigenesis, and embryogenesis (
Jaenisch and Bird, 2003). In cellular differentiation, DNA methylation plays a critical role as the loss of enzymes responsible for establishing or removing methylation in embryos or ESCs results in skewed lineage specification and transdifferentiation (
Jackson et al., 2004;
Tsumura et al., 2006;
Koh et al., 2011). Similar to DNA methylation, covalent modifications on histone tails play an imperative role in cellular differentiation and modulate transcription by altering DNA accessibility. Altering the activity of histone modifying enzymes, such as methyltransferases and acetyltransferases, diminishes the differentiation potential of ESCs (
Margueron and Reinberg, 2011). Therefore, although histone modifications and DNA methylation are required for successful differentiation, the precise function of each in determining cell fate decisions is unknown.
Investigating the role of these epigenetic mechanisms has recently become possible with the construction of single base-pair resolution profiles of DNA methylation (methylomes) and genome-wide histone modification profiles in multiple cell types, tissues, and organisms (
Lister and Ecker, 2009;
Rivera and Ren, 2013). Comparison of methylomes across cellular differentiation has revealed both
de novo methylation (
Reik, 2007;
Smith and Meissner, 2013) and
de novo demethylation (
Lister et al., 2009;
Hon et al., 2012;
Xie et al., 2013;
Ziller et al., 2013). These findings raise the possibility that mCpG is more dynamic than previously thought. Indeed, literature supports that mCpG undergoes a demethylation process mediated by the TET family of proteins that oxidize mCpG to hydroxymethyl-CpG (
Tahiliani et al., 2009). The acquisition of methylomes and histone modification profiles from multiple cell types also provides an opportunity to examine the interplay between these two epigenetic mechanisms. Increasing evidence supports that the loss of DNA methylation is coupled to the gain of histone modifications, such as H3K27me3 and H3K9me3 in tumorigenesis (
Hon et al., 2012); H3K27me3, H3K9me3, and H3K4me1 in differentiation (
Hawkins et al., 2010;
Gifford et al., 2013); and H3K27ac and H3K4me1 in development (
Lister et al., 2013). However, the interplay between these two epigenetic mechanisms has only been observed for a limited number of histone modifications, leaving the large scope of DNA-histone modification interactions to be explored.
In this study, we systematically integrated DNA CpG methylation, various histone modification, and transcriptome profiles in ESCs and two somatic cell types in both human and mouse. With the use of our novel unbiased, enrichment based statistical approach, we found that both DNA methylation and histone modifications are considerably altered during cellular differentiation. Approximately 70% of the reconfigurations in DNA methylation and histone modifications are cell type-dependent and locus-specific. Additionally, integrative analyses of these epigenomic reconfigurations revealed a widespread locus-specific switching from DNA methylation to histone modifications during cellular differentiation in both species. Furthermore, when these epigenetic changes at promoter regions were correlated with gene transcription, we found that the switching at promoter regions implicates a transition from stable gene silencing mediated by DNA methylation to flexible gene repression facilitated by histone modifications. Our study provides a comprehensive insight into the role of epigenomic reconfiguration in cellular differentiation.
Materials and methods
Bioinformatics analyses
All analyses in this study were processed in the MAC terminal window. Perl was used to do in-house programming. R (http://www.r-project.org/) and Bioconductor libraries (http://www.bioconductor.org) were used to do all statistical analyses. Integrative Genomics Viewer (IGV, version 2.2.4) (
Robinson et al., 2011;
Thorvaldsdóttir et al., 2013) was used as the local genome browser in this study. The first replicate of each histone modification (ChIP-seq) and gene expression (mRNA-seq) data set was used in the browser representation for all the figures (Table S1). Samtools version 0.1.18 (
Li et al., 2009) was used to convert sam files to bam files and to sort the bam files by name. Sorted bam files were converted to tdf files by igvtools version 2.2.1 using parameters “-z 5-w 25-e 250” for ChIP-seq files and “-z 5-w 25-e 250 –strands read” for strand-specific mRNA-seq files. The track scales for ChIP-seq and mRNA-seq were normalized to the total number of uniquely mapped reads.
Obtaining the raw data and FASTQ files
The SRA raw data files for whole-genome bisulfite sequencing (WGBS), ChIP-seq, and mRNA-seq were downloaded from NCBI Sequence Read Archive database (http://www.ncbi.nlm.nih.gov/Traces/sra) under the accession numbers listed in Table S1. The SRA Toolkit version 2.1.7 (http://www.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?view= software) was used to obtain the FASTQ files from the SRA files. The command used was “fastq-dump.”
Genomic sequences and annotations
The reference genomic sequences used in this study were hg19 for the human and mm9 for the mouse. The genomic annotations, human GRCh37 and mouse NCBIM37 that were based on Ensembl release 66, were downloaded from Illumina iGenome database (http://cufflinks.cbcb.umd.edu/igenomes.html). The distal 5′ end of a gene was defined as the transcription start site (TSS). The exonic region was defined as all exons of a gene. The region from the 500 bp downstream of TSS to the distal 3′ end of a gene was defined as the gene body. Intergenic regions were downloaded from UCSC Genome Browser using Table Browser function (http://genome.ucsc.edu/cgi-bin/hgTables?command=start). Intergenic regions with at least 1 kb in length were included in subsequent analyses.
WGBS data mapping and methylation calling
The FASTQ raw data files were first processed by Trim Galore (http://www.bioinformatics.babraham.ac.uk/projects/trim_galore/) to remove adaptor contamination, to trim lower-quality sequences, and to discard sequences shorter than 36 nucleotides. The Trim Galore parameters used were “-s 2-length 36.” Bismark v0.7.4 (
Krueger and Andrews, 2011) was used to map the WGBS data to the hg19 and the mm9 genomes. We used “-n 2-l 40” and default parameters in Bismark. We excluded reads that had three or more methylated cytosines in non-CpG contexts or had cytosine in the reads but thymine in the corresponding location of the reference genomes. For mapped reads from a library of the same PCR reaction with identical 5′ sites, the one with highest average phred quality score was kept and the others were discarded to get rid of possible PCR amplification artifacts. Libraries from multiple PCR reactions and from different replicates for each cell line were merged together. SNPs between C57BL/6 and 129 were downloaded from the Mouse Genome Project of the Sanger Institute (ftp://ftp-mouse.sanger.ac.uk/REL-1003/SNPs/20100310-all-snps.tab.gz). For the mouse methylomes, CpGs that overlapped with known SNPs were excluded from subsequent analyses. The methylation status of both strands of a CpG pair were merged together. CpGs with 10 or more read coverage were included in subsequent analyses. The methylation level for each CpG was calculated as follows:
where
is the number of reads with C mapped to the sense strand of a CpG, is the number of reads with T mapped to the sense strand of a CpG, is the number of reads with C mapped to the antisense strand of a CpG, is the number of reads with T mapped to the antisense strand of a CpG.
Genome-wide and locus-specific comparisons of DNA methylation levels
For genome-wide analysis, the two-tailed Fisher’s exact test was used to compare the methylomes between two cell lines at single-CpG resolution. The p-value for each CpG was calculated as follows:
where
is the binomial coefficient,
! is the factorial operator,
is the number of methylated reads that covered this CpG in ESCs,
is the number of non-methylated reads that covered this CpG in ESCs,
is the number of methylated reads that covered this CpG in somatic cells,
is the number of non-methylated reads that covered this CpG in somatic cells.
Raw
p-values were adjusted using the Benjamini and Hochberg method (
Benjamini and Hochberg, 1995) to control the False Discovery Rate (FDR). CpGs with an adjusted p-value (FDR) less than 0.05 were defined as differentially methylated CpGs.
For locus-specific analysis, we developed an enrichment-based statistical approach to compare and identify differentially methylated genomic regions (Fig. 2A). We defined promoter regions using stringent criteria (Figure S2A) and analyzed them in parallel.
To determine the distribution of CpGs and mCpGs at promoter regions (Fig. 5A), the region 5 kb upstream and 1 kb downstream of TSS for each gene was dissected into 600 bins with 10 bp in length. For each bin, we calculated the total number of genomic CpGs, the ratio between the total number of mCpGs versus the total number of genomic CpGs, the ratio between the total number of diff-mCpGs and the total number of mCpGs, respectively.
ChIP-seq data mapping, peak calling, and sliding-window analysis
Bowtie version 0.12.7 (http://bowtie-bio.sourceforge.net/index.shtml) (
Langmead et al., 2009) was used to map the FASTQ files of ChIP-seq data to the genomes. “-v 2-m 1-p 10” and default parameters were used. The mapping results for each replicate were merged together.
FindPeaks in Homer version 4.2 (http://biowhat.ucsd.edu/homer/ngs/) (
Heinz et al., 2010) was used for peak calling using “-style histone” and default parameters. If the region 1 kb upstream and 1 kb downstream of the TSS for a gene overlapped with H3K27me3 or H3K9me3 peaks, we defined this gene as marked by that histone modification.
All replicates for each histone modification in each cell type were merged together. The total number of uniquely mapped reads was normalized to 40 million. For sliding window analysis, the human and mouse genomes were partitioned into 1 kb sized windows, and the number of normalized reads for each window was calculated. The fold enrichment score for each histone modification at each window between ESCs and somatic cells was calculated as follows:
where is the number of normalized reads in this window for somatic cells and is the number of normalized reads in this window for ESCs.
Based on the fold enrichment score for each window, we classified all windows into three categories: those in which histone modifications are gained, lost, or remained. The gain windows were defined as those that have higher read numbers in somatic cells than in ESCs (fold enrichment score>1), whereas the lose windows were defined as those that have lower read numbers in somatic cells than in ESCs (fold enrichment score<1). We then obtained the top 2000 gain windows and the top 2000 lose windows based on their fold enrichment scores, and termed them as the highly dynamic regions. Only those dynamic regions with greater than or equal to 2 fold enrichment or depletion were included in subsequent analyses. The DNA methylation change associated with each highly dynamic region was calculated as follows:
where n is the total number of CpGs with 10 or more read coverage in both cells in this region, is the methylation level of the number i CpG in somatic cells, and is the methylation level of the number i CpG in ESCs.
mRNA-seq data mapping and differential expression analysis
The mRNA-seq data for mouse frontal cortex were mapped to mouse NCBIM37/mm9 genomes by TopHat version 2.0.6 (
Trapnell et al., 2009) using parameters “–bowtie1 –color –quals-p 5-g 1 –no-coverage-search –no-novel-juncs –library-type fr-secondstrand-G UCSC_mm9.gtf mm9_c.” The other mRNA-seq data in FASTQ files were mapped to human GRCh37/hg19 or mouse NCBIM37/mm9 genomes by STAR for MAC version 2.2.1d (
Dobin et al., 2013). We used STAR parameters “–outFilterMultimapNmax 1 –outFilterMismatchNmax 3” to only report uniquely mapped reads with no more than three mismatches.
The number of mapped reads in the exonic region of each gene was calculated using an in-house Perl program. All replicates for each cell type were calculated separately. Reads that overlapped with the exonic regions of multiple genes were excluded. The edgeR Bioconductor package (
Robinson et al., 2010) was used to perform the differential expression analysis. The classic edgeR method, which is negative binomial model based exact test, was used. For differential expression analysis, we only included genes with a count per million (CPM) larger than 1 in at least two of the data sets. Genes with adjusted p-values less than 0.05 were defined as differentially expressed. Gene Ontology enrichment analysis was performed using DAVID v6.7 (http://david.abcc.ncifcrf.gov/) (Huang da et al., 2009a, 2009b).
Results
Global DNA hypomethylation in somatic cells
Given the essential role of DNA methylation in cellular differentiation, we first examined global DNA methylation in terminally differentiated somatic cells compared to ESCs using raw data from the NCBI GEO database (Table S1) (
Lister et al., 2009;
Stadler et al., 2011;
Hon et al., 2012;
Xie et al., 2012 ; Mann et al., 2013). Thus, we compared the methylomes from human H1 ESCs (hESC) and two distinct types of human somatic cells, IMR90 fetal lung fibroblasts (hFB) and mammary epithelial cells (hEP) (Figure S1A). To evaluate DNA methylation changes in a different species, we also compared the methylomes of mouse ESCs (mESC), mouse primary dermal fibroblasts (mFB), and mouse frontal cortex tissue (mFC) (Figure S1A).
We found that global mCpG levels are significantly lower in somatic cells compared to ESCs (Fig. 1). In human, 93% of the differentially methylated CpG sites (diff-mCpGs) exhibit a decrease in methylation in hFB compared to hESC and 89% of diff-mCpGs show a decrease in hEP (Fig. 1A-1B, two-tailed Fisher’s exact test, Benjamini and Hochberg (B-H) correction). Global DNA hypomethylation was also found in mouse somatic cells (Fig. 1A-1B). Moreover, we found that hypomethylated diff-mCpGs are evenly distributed across all autosomes in both species, indicating widespread genome-wide DNA hypomethylation in somatic cells compared to ESCs (Figure S1C). Of note, global mCpG levels are not significantly different across somatic cell types (Figure S1B), suggesting that hypomethylation is found upon lineage commitment, rather than across cell types.
DNA hypomethylation is locus- and cell type-specific
Next, we investigated whether changes in DNA methylation occur at particular genomic features: intergenic regions, gene body, and promoter regions. To do so, we developed an enrichment based statistical approach for identifying genomic regions in which the ratio of diff-mCpGs is statistically higher than the ratio expected by chance (Fig. 2A). Thus, we were able to identify genomic regions that are differentially methylated between different cell types.
Consistent with our finding of global hypomethylation, a significantly larger portion of the differentially methylated regions is hypomethylated in somatic cells compared to ESCs in both human and mouse (Figure S2A-S2C, FDR<0.0001, Hypergeometric test, B-H correction). For example,>90% of differentially methylated promoter, gene body, and intergenic regions are hypomethylated in hFB compared to hESC (Figure S2A-S2C). Notably, approximately 60%-80% of the identified differentially methylated genomic regions are unique to that specific somatic cell type regardless of being in the promoter, gene body, or intergenic region (Fig. 2B, Figure S2D). Genomic feature-specific alterations of DNA methylation are illustrated in the promoter regions of cell type-specific genes, including LHX8, SH3TC1, COL6A3, and FOXI2 (Fig. 2C). Only 30% of the genomic regions identified in hFB and hEP share the same changes in DNA methylation when compared to hESC (Fig. 2B), such as the promoter regions of TMEM173 and TRIM4 (Fig. 2C). Furthermore, we found that the changes in DNA methylation are strictly confined in a locus-specific manner, as surrounding regions do not show similar alterations (Fig. 2C, Figure S2E- S2F). Together, these results demonstrate that the decrease in DNA methylation across cellular differentiation is lineage-specific and restricted to particular loci.
Locus- and cell type-specific alterations of histone modifications in human
Modification of histone tails, such as methylation, acetylation, ubiquitination or phosphorylation, is an evolutionarily conserved epigenetic mechanism crucial for gene regulation in many biological processes (
Jenuwein and Allis, 2001;
Berger, 2007). Characteristic histone modification features include H3K9me3 and H3K27me3 marks in transcriptionally silent loci and H3K4me3 and H3K27ac in transcriptionally active promoters (
Turner, 2007). Moreover, genes that show transient transcriptional changes in response to environmental stimuli versus genes that are persistently transcribed are marked by distinct types of histone modifications (
Métivier et al., 2003). Given the importance of histone modifications in regulating cellular processes, we next analyzed the extent to which histone modifications are altered in somatic cells compared to ESCs. We acquired all publicly available ChIP-seq data sets for histone modifications in hESC (26 data sets in total), hFB (26 data sets), and hEP (11 data sets) and compared the enrichment profiles for each modification across cell types (Table S1).
We found that 70%-80% of the enriched sites for each of the 26 histone modifications are altered in hFB compared to hESC and 60%-90% of the 11 histone modifications are altered between hEP and hESC, indicating a widespread epigenetic reorganization in human somatic cells (Fig. 3A). Comparison of histone modification profiles between the two somatic cell types revealed that alterations of histone modifications are predominately cell type-specific (Fig. 3B). For example, 70% of the identified H2A.Z sites that change during cellular differentiation are unique to each somatic cell type (Fig. 3B). Consistent with previous findings (
Hawkins et al., 2010), we also found that repressive histone modifications, such as H3K27me3 and H3K9me3, are markedly enriched, including an increase in the total number of sites (Fig. 3A) as well as a significant expansion of occupancy for most of the enriched sites across cellular differentiation (Fig. 3C-3D,
p<2.2 × 10
-16, Student’s
t-test). These results show that regions of histone modification occupancy change dramatically in a cell type-specific manner across cellular differentiation.
Epigenetic switching from DNA methylation to histone modifications
We next examined whether changes in histone modifications are associated with changes in DNA methylation across differentiation. To this end, we focused on genomic regions in somatic cells that exhibited the most dynamic changes in histone modifications relative to hESC. For each histone modification, these regions were defined as those with an enrichment score≥2, and we limited our analyses to the top 4000 dynamic regions (2000 gain and 2000 loss).
We found that the majority of regions with increased histone modifications in somatic cells also exhibited decreased DNA methylation levels compared to hESC (Green boxes in Fig. 4A-4B). This correlation occurred for 85% of the histone modifications in hFB and 73% of modifications in hEP (Green boxes in Fig. 4A-4B). Moreover, we observed that many of the identified epigenetic switching from DNA methylation to histone modifications during cellular differentiation are specific to each cell type (Gray regions in Fig. 4D; Gray regions in Figure S3C). For example, two sites in the RIN2 gene locus with specific loss of DNA methylation in hEP, but not in hFB, are accompanied by an increase in histone modifications in hEP, but not in hFB (Gray regions in Fig. 4D). Similarly, when we analyzed the relationship between DNA methylation and histone modifications in mouse mESC and mFC, we found that gains of H3K4me3 and H3K27ac are associated with a decrease in DNA methylation, while the loss of H3K4me3 and H3K27ac are coupled with an increase in DNA methylation (Fig. 4C). This epigenetic switching is not found between regions exhibiting decreased histone modifications and DNA methylation for the majority of regions (Figure S3A-S3B). Together, these data demonstrate a cell type-dependent and locus-specific switching from DNA methylation to histone modifications over cellular differentiation in both species.
Epigenetic switching at promoters is associated with distinct gene regulation
We next set out to determine how switching from DNA methylation to histone modifications during cellular differentiation affects gene expression. We focused on promoter regions since changes in DNA methylation and histone modifications at promoter regions are correlated with changes in transcription relative to other genomic features (
Ball et al., 2009). We found that proximal promoters, regions 0.5kb upstream and downstream of transcriptional start sites (TSSs), have a higher CpG density and are more dynamically methylated than distal promoter regions, defined as regions from 0.5 to 5kb upstream and 0.5-1kb downstream of TSSs (Fig. 5A, Figure S4A). Given this finding, we identified differentially methylated proximal promoters over differentiation with the use of our enrichment-based statistical approach (Fig. 2A). Compared to ESCs, we found that the majority of differentially methylated promoters were hypomethylated: 88% of the promoters in hFB, 82% of the promoters in hEP, 77% of the promoters in mFB, and 65% of the promoters in mFC (Table S2-S5). Therefore, we focused on hypomethylated promoters for subsequent analyses. Interestingly, gene ontology (GO) analysis of genes with hypomethylated proximal promoters revealed significant enrichment in processes characteristic of each somatic cell type as well as pathways responsive to environmental signals (Figure S4B, Fisher′s exact test, B-H correction). For example, many of the significant GO terms for hFB were related to cell adhesion, which is an essential characteristic of hFB, and injury response, a pathway highly dependent on external cues (Figure S4B). These findings suggest that hypomethylation at promoter regions in somatic cells is important for defining cellular identity and rendering them responsive to environmental cues.
Examination of transcriptional changes of genes with hypomethylated promoters revealed that 25% of genes showed an increase in expression in hFB compared to hESC (Red regions in Fig. 5B). This is consistent with the prevailing model that a decrease in DNA methylation at promoters correlates with an increase in gene expression (
Ball et al., 2009). However, 65% of genes identified in hFB did not exhibit changes in transcription despite a significant decrease in DNA methylation at their promoters (Black regions in Fig. 5B, FDR<0.0001, Hypergeometric test, B-H correction), with 78% of these genes are transcriptionally silent in both hESC and hFB (RPKM<1). Alternative promoter usage (
Maniatis and Reed, 2002) may account for some of the genes in which promoter hypomethylation is associated with a decrease in gene expression in hFB (Green regions in Fig. 5B, Figure S5A). Similar percentages of genes with hypomethylated promoters show an increase (34%), no change (50%), or decrease (16%) in gene expression in hEP compared to hESC (Fig. 5B). In mice, promoter hypomethylation had similar effects on gene expression with the majority of genes with hypomethylated promoters either showing an increase in gene expression, 47% in mFB and 67% in mFC, or no change in gene expression, 46% in mFB and 26% in mFC, while only a small percentage exhibited a decrease in gene expression, 7% in mFB and 7% in mFC) (Fig. 5B). These data suggest that promoter hypomethylation during cellular differentiation is not only important for increasing lineage specific transcripts, but also for flexible gene repression facilitated by repressive histone modifications in somatic cells.
To gain an insight into the mechanism by which genes with hypomethylated promoters remain transcriptionally silent in human somatic cells, we examined the profiles of two repressive histone modifications, H3K27me3 and H3K9me3, at the promoter regions of these silent genes in hFB and hEP. We found that, 77% of these genes in hFB and 45% in hEP are marked by one or both of H3K27me3 and H3K9me3 at their promoter regions (Fig. 5C-5D, Figure S5B). Furthermore, these repressive histone modifications at the promoter regions are either established
de novo during cellular differentiation, such as at the promoter regions of
KBTBD5,
ZPBP2, and
VMO1 (Fig. 5D), or inherited from hESC, such as at the promoter region of
LHX8 (Fig. 5D). Given that DNA methylation is known to be a stable repressive mechanism (
Bird, 2002;
Jones, 2012), the epigenetic switching at the promoter regions of these genes during cellular differentiation may allow them to be dynamically expressed in response to external cues in somatic cells (Fig. 5E). Together, these results uncover a potential role for epigenomic reconfiguration in facilitating gene regulation, by which genes stably silenced by DNA methylation in ESCs switch to flexible repression mediated by repressive histone marks in somatic cells.
Discussion
Two longstanding questions in cellular differentiation are how ESCs maintain pluripotency and how they give rise to a multitude of diverse types of somatic cells. Given that ESCs and somatic cells share identical genomic sequences, the establishment and maintenance of each somatic cell identity has been attributed to epigenetic modifications. This has led to an appealing model in which a flexible epigenome in ESCs allows for a multitude of genes to be transiently activated or repressed to support pluripotency.
In support of this model, studies have found an increase in DNA methylation at the critically important pluripotency gene
Oct4 (
Ben-Shushan et al., 1993;
Deb-Rinker et al., 2005). Additionally, the promoters of key developmental regulators are enriched with histone modifications, such as H3K27me3 or H3K4me3, rather than more stable DNA methylation in ESCs compared to somatic cells (
Azuara et al., 2006;
Bernstein et al., 2006;
Pan et al., 2007). A limitation of these studies is the biased focus on loci involved in pluripotency and development. Therefore, to fully understand the role of the epigenome in differentiation, we took advantage of publicly available data sets from ESCs and distinct types of somatic cells in both human and mouse and performed an unbiased, genome-wide analysis of DNA methylation, histone modifications, and transcription across differentiation. Our data provides evidence against the prevailing model in which ESCs have a more flexible epigenome than somatic cells and instead supports that somatic cells have a more adaptable epigenetic landscape.
By analyzing two components of the epigenome, DNA methylation and histone modifications, across cellular differentiation in two somatic cell types in human and mouse, we found extensive, cell type-specific changes. Global analysis of DNA methylation revealed that millions of CpGs are differentially methylated across differentiation, with 90% of these differentially methylated CpGs being hypomethylated. Additionally, we found that 70% of these differentially methylated CpGs across differentiation are unique to each somatic cell type, which is in agreement with a recent finding of tissue-specific methylation patterns in mouse (
Hon et al., 2013). Strikingly, we found that genomic regions that are hypomethylated in somatic cells are accompanied by a gain of 22 different histone modifications in a cell type-specific manner. This suggests that the switching from DNA methylation to histone modifications is an important epigenetic event for differentiation and lineage specification.
The association between DNA hypomethylation and the gain of histone modifications has been suggested for only a few histone modifications, such as H3K27me3, H3K9me3, and H3K4me1 (
Lister et al., 2009 ; Hawkins et al., 2010;
Gifford et al., 2013). Our results have identified 18 new histone modifications that correlate with hypomethylation. We found that genes repressed in both ESCs and somatic cells are held silent in ESCs by DNA methylation, a more permanent epigenetic modification, whereas in somatic cells genes are repressed by histone modifications, a more transient modification. This switching from DNA methylation to histone modifications at repressed genes across differentiation supports that the epigenome is more flexible in terminally differentiated cells than in ESCs.
Our finding that epigenomic reconfiguration occurs in a cell type-dependent and locus-specific manner raises the importance of profiling epigenomic information in homogeneous cell populations. Although cellular heterogeneity has been difficult to address, especially for complex tissues such as the brain, advanced technology in combination with newly designed approaches should make this feasible in the near future. The data processing pipelines developed in this study can allow us to address the extent of reconfiguration, and interplay with histone modifications, including recently discovered hmCpG and non-mCpG, once genomic data for these modifications in different cell types become available.
Higher Education Press and Springer-Verlag Berlin Heidelberg