Introduction
Acute myeloid leukemia (AML) is a result of a series of genetic mutations and epigenetic dysregulations [
1]. In the past three decades, numerous works investigated the crosstalk among the mutated genes of transcription factors and signal transduction molecules related to AML based on the classical two-hit model. However, with the rapid development of sequencing technologies, additional newly reported mutated genes in AML have highlighted their pivotal roles. Many of these mutations are closely linked to epigenetic dysregulations, such as DNA methylation and histone modifications [
2−
5]. These novel classes of mutant disease alleles directly expanded the canonical gene mutational classification of myeloid malignancies from class I and II to an additional class III [
6,
7]. Class I mutations are widely accepted to confer proliferation advantages, whereas class II mutations are known to block cell differentiation. However, class III mutations are hardly endowed with unique features. Considering that perturbations of epigenetic modifiers may result in various alterations, including aberrant cell proliferation and differentiation [
8], we believe that strictly separating each kind of mutation from one another is difficult to attain. Interestingly, class III mutations are more often involved in the preleukemic transformation [
9]. According to the cancer-stem-cell evolution model, preleukemic stem cells serve as the root of AML pathogenesis [
10]. The related driver mutations not only play a key role in AML initiation but also survive chemotherapy and consequently trigger AML relapse [
11,
12]. Therefore, identifying the driver mutations from subsequent mutations may enable the targeting of perturbations in these AML initiating events and potentially minimize the risk of relapse. Herein, we briefly summarize the roles of preleukemic mutations in the initiation and relapse of AML and focus on the notable preleukemic mutation of DNMT3A.
Preleukemic stem cells in AML initiation and relapse
Normal hematopoiesis is hierarchically organized, beginning with the multipotent hematopoietic stem and progenitor cells (HSPCs) [
13−
16]. Several xenograft mouse models have demonstrated that in acute leukemias, hematopoietic stem cells (HSCs), common myeloid progenitors (CMPs), or granulocyte monocyte progenitors (GMPs) are believed to be the initial malignant cells-of-origin, which subsequently give rise to the pre-LSCs or the leukemic stem cells (LSCs) [
17−
22]. Pre-LSCs refer to a small subsection of cells directly derived from HSCs that bear a sole mutation and share many properties with normal HSCs, such as self-renewal and differentiation [
17]. Therefore, pre-LSCs are capable of maintaining and replenishing all blood cell types but unable to induce full-blown leukemia. On the contrary, LSCs are functionally defined by their capacity to produce fully penetrant, short-onset leukemia in murine transplantation models [
17,
20,
23,
24]. The existence of LSCs has been demonstrated for over more than 20 years following the reports that only a small sub-fraction of leukemic cells from AML models are capable of propagating the disease in mouse xenografts [
23−
26]. However, the identification of pre-LSCs has not been reported until recent years by studying human blood samples [
12,
27,
28]. For many years, only a logical postulation that a more naïve form of malignant stem cells than the LSCs might exist [
29]. The progression from a normal HSC to a fully malignant LSC demands the HSC to overcome the homeostatic control of their numbers and survival. In this regard, an evolutionary selection of the advantageous cells maintaining the first genetic lesion must have occurred, which are thought to be the pre-LSCs. Distinguishing pre-LSCs from normal HSCs is difficult because pre-LSCs could retain most HSC properties. This finding may explain why the first proof of principle for the identification of pre-LSCs has not been reported until recently after the hypothesis on pre-LSCs had been proposed for many years [
27]. However, pre-LSCs also holds a genetic lesion; the single mutation confers the pre-LSCs with selective advantages over the nonleukemic stem cells, allowing a more powerful clonal expansion and producing more progenies (Fig. 1).
In addition to that in the pre-LSCs, the first mutation might also occur in more differentiated myeloid progenitor cells, giving rise to the “preleukemic myeloid progenitors” (pre-LMP) [
30,
31]. Notably, this conclusion was only confirmed in knock-in mouse models, and no direct evidence has proven that human AML can be driven by pre-LMP. These preleukemic cells are not only pivotal in AML initiation, but recent findings have also implicated their core role in AML relapse [
12,
32−
34]. Models of AML relapse are closely related to the pathways of AML initiation and evolution [
29]. Typically, three possible scenarios for AML evolution might provide clues to the relapse of diseases (Fig. 2). First, the primitive mutation directly hits the HSCs, thereby producing the pre-LSCs, followed by the second hit in the pre-LSCs and thus generating the LSCs. In this scenario, both the initial and secondary mutations are at the stem-cell level; hence, the LSCs share many characteristics with the normal HSCs and consequently become resistant to drug therapy [
29,
35−
37]. Second, the HSCs are hit by the initial mutation, but afterward differentiate into pre-LMPs, in which the secondary mutations occur. In this category, the LSCs originate from more mature pre-LMPs, and thus are more vulnerable to chemotherapy. Unlike the pre-LMPs, such LSCs cease to serve as reservoirs for relapse [
30,
31,
38]. In the third scenario, initial mutation hits the downstream progenitor cells but not the HSCs. This process also produces pre-LMPs and LSCs, but all the mutations miss the stem-cell level, thus rendering the blasts, as well as the pre-LMPs and CMPs, susceptible to the therapeutic agents [
29−
31,
38]. In this case, AML derived from a myeloid progenitor origin might be completely treated.
In summary, we discussed the predominant roles of preleukemic condition in the initiation and relapse of AML. As already mentioned, inspection of DNMT3A-mutated HSCs was the first proof-of-principle identification of pre-LSCs in AML patients. Our group also conducted several works investigating this epigenetic modifier. As such, we further discuss the DNMT3A mutations in AML as typical paradigms of preleukemic mutations.
DNA methylation in leukemia
Epigenetic modifications are employed in various manners, such as DNA methylation, histone methylation, and chromatin remodeling. DNA methylation is a highly important epigenetic regulatory activity of gene expression occurring merely at the CpG residues in eukaryotic cells [
39]. CpG residues usually cluster together and form the so-called “CpG island,” rather than sporadically scatter throughout the genome [
7]. A landscape analysis of a DNA methylome suggested that in addition to CpG islands, CpG residues also occur on island “shores,” defined as those situated less than 2 kb from an island, and “shelves,” which are located 2−4 kb from an island [
39,
40]. Approximately 60% of human genes contain CpG islands in their gene promoter regions. DNA methylation of the promoter region is more commonly believed to downregulate gene expression [
41−
43].
DNA methylation is dynamically altered in both solid tumors and leukemia; it can affect cells in three main ways, namely, hypomethylation, hypermethylation, and loss of imprinting [
44]. Recent studies implied that DNA demethylation mechanisms can potentially alter the DNA methylation pattern in leukemias [
45]. Ten-eleven translocation (TET) proteins, especially TET2, play a vital role in lowering DNA methylation levels by sequentially converting 5-methyl-cytosine (5-mC) to 5-hydroxymethyl-cytosine (5-hmC), 5-formylcytosine (5fC), and 5-carboxycytosine (5caC) [
46]. On the other hand, isocitrate dehydrogenase (IDH) indirectly influences DNA demethylation by negatively regulating the function of TET proteins after IDH mutations [
47]. Global hypomethylation has been most frequently reported in different cancers; in these cases, the inhibitory effects of DNA methylation on promoters are postulated to be impaired, thus promoting the transcription of oncogenes and genes of cell replication [
48−
50]. However, several newly reported works indicated that global methylation of DNA promoter regions remained almost intact despite the mutation or ablation of DNMT3A [
51,
52]. Notably, gene bodies are markedly hypomethylated or hypermethylated in response to the aberrantly functioning DNA methyltransferase [
51,
53]. These counterintuitive observations remind us that gene bodies are associated with transcribed regions that possess higher methylation levels than promoter or intergenic sequences [
54]. In particular, the genes expressed mid-level exhibit the highest amount of gene-body methylation; this finding may explain why the methylation levels of gene bodies were more strikingly varied than that of the promoters in AML. Even so, what determines the hypo- or hypermethylation of gene bodies remains to be defined, and the consequent alterations of cell fate based must be explored.
DNMT3A mutations in AML
DNMT3A mutations were first reported in AML by Ley and colleagues and our group [
55,
56]. These mutations are widely reported in 17%−34% of AML cases with normal karyotype (CN-AML) [
57]. Almost 96.3% of the DNMT3A mutations are heterozygous. Moreover, 89% of the CN-AML patients harbored a single site mutation of DNMT3A, whereas the occurrence of two mutations in one allele was most frequently observed in T-cell acute lymphoblastic leukemia samples [
57,
58]. The patients with DNMT3A mutations are usually older than average, possess lower hemoglobin levels, and exhibit higher white blood cell counts. Typically, these patients are also more likely to suffer from monocytic or myelomonocytic leukemia (FAB M4/5) than other patients [
59].
DNMT3A mutations usually contain missense, nonsense, or frame-shift alterations. Among these mutations, the missenses at codon R882 are the most frequently detected, which account for nearly 60% of the DNMT3A mutations [
60]. To study the function of R882 mutations, several studies conducted both
in vitro and
in vivo experiments. Yan and colleagues found that R882H mutations reduced methylation activity twofold [
56]. Russler-Germain and coworkers co-expressed the wild-type and R882H-mutated DNMT3A
in vitro and found that the enzymatic function of the wild-type DNMT3A was profoundly impaired by its mutated partner, probably via inhibition of tetramerization [
61]. However,
in vivo, DNMT3A mutations do not markedly alter global methylation levels [
7,
26]. Similarly, Challen and colleagues established a DNMT3A-knock-out (KO) mouse model and neither found a whole-genome altered methylation pattern nor observed leukemia in serially transplanted mice [
52]. Recently, by detecting various hematologic malignancies after one year, Mayle and colleagues demonstrated that in the absence of serial transplantation and with longer
in vivo maintenance, the DNMT3A-KO mouse model could recapitulate the DNMT3A mutations [
62]. Notably, our group established a retrovirally transduced DNMT3A-R882H mutant mouse model. At 12 months after bone marrow transplantation, all mice developed chronic myelomonocytic leukemia with thrombocytosis [
51]. This result indicates that the DNMT3A-ablated mouse model might not thoroughly mimic the DNMT3A mutant model, and a DNMT3A knock-in mouse model may be fairly helpful in better investigating the biological effects of the DNMT3A mutation. Overall, the discrepancy between the
in vitro and
in vivo experiments indicates that DNMT3A mutations may trigger AML initiation through many unknown mechanisms rather than solely by the previously hypothesized loss-of-function. Intriguingly, our group discovered that DNMT3A mutations could specifically result in the upregulation of HOX genes (homeobox genes) and Meis1 (Meis homeobox 1), which are master regulators of HSC self-renewal [
51,
56]. Although this finding might provide a reason why pre-LSCs bearing DNMT3A mutations exhibited enhanced self-renewal as mentioned earlier, why DNMT3A would particularly target a subset of genes rather than induce global alterations consistent with its role as a global DNA methylation modifier remains a mystery. Recently, the newly reported structural insight into the DNMT3A interaction with histone H3 may aid in tackling the conundrum of DNMT3A mutations by extending the investigations to the histone epigenetic apparatus [
63]. Furthermore, mutations of TET and IDH were frequently reported by various groups, re-emphasizing the connection between aberrant DNA methylation levels and leukemogenesis [
64,
65].
DNMT3A and other mutations in pre-LSCs
Recently, Shlush and colleagues have identified DNMT3A mutations as preleukemic mutations in acute leukemias [
27]. To identify the original preleukemic mutations in HSCs, the authors compared the distribution of DNMT3A mutations with that of another supposed “founder” mutation, nucleophosmin-1 (NPM1), from the HSCs to the blast cells. The results indicated that although the NPM1 mutation can initiate AML in mouse models [
66], it seems to be a subsequent alteration in most cases of AML that carry a driving DNMT3A mutation [
67]. Interestingly, DNMT3A mutations are not the only verified preleukemic mutations; the TET methylcytosine dioxygenase 2 (TET2), and IDH 1 and 2 [
9,
12,
68]. Notably, DNMT3A, TET2, and IDH1/2 are closely linked epigenetic modifiers in terms of DNA methylation and demethylation [
64] (Fig. 3).
Particularly, IDH1/2 mutations can result in the abnormal accumulation of 2-hydroxyglutarate, which in turn impedes the DNA demethylation processes mediated by TET proteins [
65,
69,
70]. The mutations of TET2 and IDH1/2 may affect the same pathway and hence exist mutually exclusively [
71−
73]. The reason why epigenetic mutations are preferentially chosen as preleukemic mutations remains unknown, but this phenomenon might support the postulated model that primitive epigenetic abnormalities must accumulate in self-renewing HSCs unless a mutation confers self-renewal ability to a downstream cell, given that the mutations occurring in non-self-renewing cells are lost [
74]. Another explanation might be that early mutations in epigenetic genes that affect hematopoietic self-renewal and/or differentiation are not sufficient to induce leukemogenesis on their own, thus making them the most primitive mutations in the preleukemic phase [
75].
Notably, preleukemic mutations not only initiate AML; they also persist and survive chemotherapy in the evolution of the disease [
28]. Shlush and colleagues found that DNMT3A-mutant HSCs, together with their progeny cells, persisted in peripheral blood despite AML remission following chemotherapy, suggesting that at least some of these preleukemic ancestral cells were resistant to treatment [
27,
67]. Recent studies also found DNMT3A and TET2 as the most frequently involved somatic mutations in clonal expansions [
33,
34]. The authors tested that during “clonal hematopoiesis,” which refers to HSC population dynamics preceding visible hematologic cancers, stem cells carrying a subset of driver mutations are capable of becoming resistant to chemotherapy. Subsequently, these cells acquire novel mutations, triggering a relapse. This hypothesis suggests that a span of many years is necessary for a clonal expansion to occur from a single hematopietic stem cell carrying driver mutations and additional numerous years before further evolution of the disease from preleukemic HSCs (Fig. 4). Interestingly, for clonal hematopoiesis, the DNMT3A R882 mutations most preferentially occur in patients not younger than 90 years, although it could also be found in individuals as young as 25 years [
76]. This finding implies that DNMT3A mutations per se cause minimal effect on life span.
Summary and perspective
Destroying all LSCs, as well as pre-LSCs, is an ultimate goal in leukemia treatment. Therefore, understanding the genetic and epigenetic transformations during leukemic reprogramming should be given the highest priority to reach this end. Experimental evidence revealed that epigenetic processes may control leukemic reprogramming. Preleukemic mutations like DNMT3A, TET2, and IDH1/2 appear to tightly persist in AML patients from disease initiation to relapse. Therefore, targeting these preleukemic mutations in AML may be a promising strategy for treating leukemia and reducing relapse risk.
However, many issues remain. The target genes and pathways that DNMT3A mutations influence still need to be defined. Why TET2 functions as a DNA demethylation modifier opposite to DNMT3A but when mutated along with DNMT3A could equivalently induce pre-LSCs in AML is yet to be investigated. If DNMT3A and TET2 mutations target the same clusters of genes or pathways, they are expected to be equal in level in the primed stage. However, they do not exist mutually exclusively with each other, unlike TET2 and IDH1/2 mutations, hence entailing further study. Why epigenetic mutations are preferred to occur in the preleukemic stage must also be explored. We have hypothesized that pre-LSCs might sustain their clonal expansion advantages over normal HSCs through a greater number of rounds of self-renewal and a much longer survival that would allow subsequent mutations to occur [
77,
78]. Therefore, the foremost mutations may function to protect the mutated HSCs from apoptosis, autophagy, and ultimate death. These various anti-death effects are likely difficult to be achieved by mutated genes specifically involved in proliferation or differentiation. Hence, the upper epigenetic mutations more probably hold a more extensive control of these variations, which causes class III mutations to frequently occur at the preleukemic stage. However, this theory still hardly illustrates why pre-LSCs are able to survive chemotherapy, which is a dramatically significant issue in AML treatment. Why mutations of histone modifiers were seldom found in pre-LSCs, considering that the anomaly of DNA methylation or histone modification could analogously promote epigenetic aberration in leukemogenesis remains to be studied [
79].
Moreover, only a small proportion of leukemias are discussed in this review, which excluded those with cytogenetic abnormalities. Leukemias triggered by mixed-lineage leukemia translocations, t(9;22), and inv16 other than preleukemic mutations such as DNMT3A may not be effectively treated by targeting preleukemic lesions but by directly relieving the leukemic burden exerted by those cytogenetic abnormalities. The cell-of-origin in AML relapse may present another issue. The production of relapsed clones from the further evolution of a dominant clone or subclone outgrowth rather than the further evolution of a driving clone from a preleukemic HSC is a possibility worth investigating. In this regard, the clues and targets indicating cancer heterogeneity and evolution must be determined. Further studies are necessary to determine all the genetic and epigenetic crosstalks in leukemic reprogramming. Combining chemotherapy with promising regenerative medicine could facilitate better treatment for leukemias.
Higher Education Press and Springer-Verlag Berlin Heidelberg