INTRODUCTION
Juvenile myelomonocytic leukemia (JMML) is a rare but aggressive hematopoietic malignancy of early childhood. It is characterized by the overproduction of monocytic and granulocytic cells that circulate in the blood and infiltrate internal organs including the spleen, liver and other tissues. It is estimated to occur in 1.2 cases per million persons worldwide. The median age of children diagnosed with JMML is 2 years. Clinically, patients usually present with fever, leucocytosis, thrombocytopenia, and splenomegaly. The World Health Organization (WHO) has categorized JMML as an overlap myelodysplastic syndrome/myeloproliferative neoplasm (MDS/MPN) and the diagnostic criteria have been defined based on clinical features, hematologic parameters, and genetic lesions. The prognosis of JMML is usually poor, and the only known effective treatment option is hematopoietic stem cell transplantation (HSCT). Treatment failure occurs in half of the patients due to disease relapse or transformation to acute myeloid leukemia (AML) [
1].
Significant progress in understanding the molecular pathogenesis of JMML has been achieved by identifying the association of specific clinical syndromes with the genetic alterations in the RAS/MAPK signalling pathway, and by establishing the animal models with specific genetic lesions that mimic the disease [
2]. The majority of such mutations have been found to occur in genes encoding proteins of the RAS/MAPK signalling pathway [
3]. Mutations in
NRAS,
KRAS [
4] and
NF1 [
5] in the RAS/MAPK signalling pathway have been known to be initiating events for JMML. Germline mutations in
PTPN11 have also been reported in earlier studies in around 50% of the patients [
6]. Some of the patients with nonsyndromic JMML have been reported to have somatic mutations in those genes. Homozygous CBL mutations have also been previously reported in around 15% of the cases [
3]. Together, nearly 90% of the JMML patients have been found to carry somatic or germline mutations involving RAS/MAPK signalling pathway. Other mutations that are assumed to be secondary mutations and have only been reported in a few cases of JMML include
ASXL1 [
7],
FLT3 [
8],
SRSF2 [
9],
SETBP1 and
JAK3 [
10]. Like every other malignancy, JMML also shows aberrant methylation patterns, with methylation being particularly enriched in CpG islands [
11]. The microRNA profiles of JMML remain unexplored. Although it is now clear that JMML is fundamentally a disease of hyperactive RAS/MAPK signalling, the targeted therapy towards the signalling relay proteins in this pathway has been largely unsuccessful [
12].
In this study we utilized next-generation whole-exome sequencing strategy, which is an important technique with high coding region coverage and a high rate of phenotype association. This approach has been successfully used to identify important genomic variations in a number of diseases including driver mutations in cancer and germline mutations in inherited diseases [
10,
13]. Here, we report several novel mutations that are involved in the P13K/AKT signalling pathway, which have not been reported previously in JMML patients. We hypothesize that the mutation in the P13K/AKT pathway along with differential regulation of microRNAs could be an alternative mechanism for JMML pathogenesis. Interestingly, several kinases in this pathway can be targeted with existing drugs that provide an alternative therapeutic potential for this deadly disease.
RESULTS
A bone marrow sample from a 2-year old juvenile myelomonocytic leukemia patient was used in this study for whole-exome, cancer-panel, and small RNA sequencing analysis. Exome analysis revealed some mutations that were damaging in nature and were confirmed by cancer panel, while smRNA-seq indicates roles of oncomiRs in JMML.
Exome and cancer-panel analysis
The whole-exome sequencing generated from the JMML patient bone marrow sample had around 39 million reads, of which 99% were mapped to the human genome using default settings in BWA [
14]. Of those, 98% were proper pairs and around 90% were uniquely mapped as shown in Table 1. The mapping criterion used to identify whether a read was uniquely mapped or not is the mapping quality (QS>30). We were able to identify 126,835 variant calls in the JMML patient bone marrow leukemia cell DNA using the Mercury pipeline [
15]. Of these, only 6,411 variants were able to pass the variant filtration step in Atlas tools (as a part of the Mercury pipeline). In order to identify the novel mutations from the variants, we first removed the SNPs recorded in dbSNP and 1,000 genomes project databases. Then we excluded synonymous SNPs and SNPs from intronic and non-coding regions. We were able to identify nonsynonymous mutations in
NTKR1,
ITPR3,
PTPN11,
MLH3,
AKT1,
HMGA2,
MYH9, and
AR genes with average 30× coverage. The mutations we identified in whole-exome sequencing were further verified by a cancer panel targeted sequencing with 100× coverage of 573 genes (derived from the Sanger cancer gene list). The Mercury pipeline was again used to call variants. A total of 25,633 variants were called, only 6,335 of which passed the variant filtration in the Mercury pipeline. The following mutations were confirmed by both exome and cancer-panel sequencing, namely
ITPR3 (chr6:33651070, G>A, Exon35, A1562>T),
PTPN11(chr12:112888165, G>T, Exon3, D61>Y),
AKT1(chr14:105239869, G>A, Exon9, R251>C),
MLH3(chr14:75514537, A>T, Exon2, F608>I), and
MYH9(chr22:36685257, C>A, Exon32, K1477>N). Other mutations including
AR (chrX: 66765152, T>A, Exon1, L55>Q),
NTRK1 (chr1:156843653, C>T, Exon8, T360>M) and mutations in
HMGA2 gene (chr12:66260663, 66260665, C>G, G>T, Exon2, Q7>H, Q113>K) were only seen in the whole-exome sequencing as these genes are not part of the cancer panel.
PTPN11 gene mutation had already been found in the JMML patient tested in this research in a diagnosis test, and we were able to see the same mutation in both exome and cancer-panel sequencing. There is no discordant event between the mutations identified in the same genes in both sequencing methods. Table 2 shows the genes, which had nonsynonymous mutations along with their locations in the genomic regions and gene body, and their resulting amino acid change. SIFT and Provean [
16] predictions suggest that some mutations have damaging effects and are deleterious in nature for the functions of the coded proteins as shown in Table 2. Notably, the mutation of
AKT1 results in significant structural disruption and the mutation is shown encircled in Figure 1[
17], while other mutations shown in this figure did not pass the variant filtration step.
AKT1 (chr14:105239869, G>A, Exon9, R251>C) mutation results from SusPect [
18] and HOPE [
19] indicated a structural disruption due to this point mutations. Results from HOPE indicate that the mutation lies in an Interpro [
20] domain, i.e., the protein kinase domain (IPR000719). The domain is enriched with GO terms like ATP binding (GO: 0005524) and protein kinase activity (GO: 0004672). A homology model of the
AKT1 mutation based on the template [PDB: 3O96] with 97% identity is shown in Figure 2A superimposed with the native structure.
Ligplot results of the native structure of AKT1 (3O96) around ARG251 are shown and compared with the mutated residue CYS189 of AKT1 in Figure 2B and 2C. Ligplot results indicate a change in hydrogen bonding and hydrophobic contacts of CYS189 when compared with ARG251. Some of these contacts tend to lie in the active site regions of protein as predicted by Motif Scan results, e.g., PHE407, which shares a hydrogen bond with neighbours of ARG251 in the native structure but did not do so in the JMML mutated AKT1 protein. Also there was an overall change in the net charges due to the mutation of ARG251 to CYS189, which may have also affected active sites of the mutated protein. The importance of the AKT1 mutation as well as other mutations like in PTPN11, MYH9 and MLH3 is also shown in Supplementary Figure S1, where the position is highly conserved in other species. This result strongly suggests the ARG is an important amino acid residue for AKT1 enzyme activity.
Small RNA analysis
From our small RNA-seq analysis we identified differentially expressed microRNAs in comparison to JMML leukemia cells with monocytes from normal adult pooled peripheral blood samples (Supplementary Tables S1–S3). A total of 230 microRNAs showed significant
p-value (≤0.01) and fdr (≤0.01), of which 76 microRNAs showed>2-fold upregulation (Supplementary Table S1) and 66 microRNAs showed >2-fold downregulation (Supplementary Table S2) with respect to adult monocytes. Many of the microRNAs that were found to be differentially expressed were associated with some types of cancer, and some of them are categorized under the class of oncomiRs. These oncomiRs have been known to play a vital role in tumorigenesis, malignant transformation, and metastasis [
22]. We found hsa-miR-19a-3p (log2-fold-change=2.33) and hsa-miR-19b-3p(log2-fold-change=4.34) were upregulated in JMML compared with normal monocytes, and they were also upregulated in several other types of cancers such as gastric cancer, lung cancer, osteosarcoma and squamous cell carcinoma [
23,
24]. Likewise, hsa-miR-15a-5p (log2-fold-change=2.26) and hsa-miR-15b-3p(log2-fold-change=3.78) another well classified oncomiR family were found specifically upregulated in JMML. While the miR-15 family has been known to be down regulated in chronic lymphocytic leukemia [
25,
26], we found it to be significantly upregulated in JMML. Unlike many other cancers in which hsa-miR-145 [
27,
28] and hsa-miR-126 [
29,
30] are mostly down-regulated, JMML showed an upregulation for these two miRNAs. Many of the down-regulated miRNAs in JMML show similar properties in other types of cancers, e.g., hsa-miR-193a [
31] and hsa-miR-125a [
32], which have been found to be down-regulated in most cancers. Lastly, microRNAs like hsa-let7b/c/d/e/f were also downregulated in JMML, although the
Let-7 family of microRNAs is related to development, but in cancer their major function is related to tumour-suppression.
Let-7 family is known to regulate RAS expression in humans and is also known to target the
HMGA2 gene. The observed downregulation of
Let-7 could be associated with over-expression of RAS genes, which has been observed in some cancers [
33]. Many of these microRNAs show enrichment in important cell signalling pathways that show extensive “cross-talking” with each other, e.g., mTOR pathway, RAS pathway and PI3K-AKT pathway. The enrichment profile of the most significant miRNA’s based on
p-value, fold change, and number of predicted gene targets is shown in Figure 3. The enrichment heat map for all upregulated and all downregulated microRNAs is shown in Supplementary Figures S2 and S3, respectively, both of which show enrichment in some cancer related pathways.
DISCUSSION
Most disease-causing mutations occur in the coding regions of the genome. Whole-exome sequencing helps identify such variants that are correlated with phenotypes with high accuracy. In this study we were able to identify novel mutations in JMML bone marrow leukemia cells. In addition, we also performed targeted cancer panel deep sequencing, which confirmed our observations from whole-exome sequencing. We were able to confirm widely reported and previously tested mutations in the
PTPN11 gene (chr12:112888165, G>T, Exon3, D61>Y) in this study’s JMML patient.
PTPN11 played an important role in regulating the RAS/MAPK signalling pathway [
6]. Another mutation of considerable importance is the
AKT1 (chr14:105239869, G>A, Exon9, R251>C) mutation.
AKT1 gene codes for a serine/threonine specific kinase, which plays a key role in multiple cellular processes, such as glucose metabolism, apoptosis, cell proliferation and cell migration [
36]. The roles of
AKT1 in many human cancers (e.g., gastric, ovarian, squamous and lung) have also been widely reported [
37]. To our knowledge,
AKT1 mutation has not been reported in JMML.
MLH3 (chr14:75514537, A>T, Exon2, F608>I) mutation was also identified in both whole-exome and cancer-panel sequencing. This gene belongs to the family of DNA mismatch repair genes and is involved in maintaining genomic integrity during replication.
MLH3 mutation may be related to a high somatic mutation load in this particular patient [
38]. Somatic mutations in these genes have also been widely reported but their evidence for a role in cancers remains unclear [
39–
41].
MYH9 (chr22:36685257, C>A, Exon32, K1477>N) gene plays an important role in cytokinesis, cell shape and specialized functions such as secretion and capping. Genetic mutations in
MYH9 have been reported in lung cancer [
42,
43]. Other mutations that have a damaging effect and were only seen in whole-exome sequencing are in
HMGA2 gene (Q7>H, Q113>K) and
NTRK1 gene (T360>M). HMGA2, is an architectural protein and an essential component of enhanceosome [
44] while NTRK1 is a neurotrophic tyrosine kinase receptor and phosphorylates members of the MAPK pathway. Mutations in both
HMGA2 and
NTRK1 have also been implicated in cancers [
44–
46]. By comparing the microRNA expression profile of this JMML sample with the mature monocyte preparation, we were able to identify a group of microRNAs that are involved in different types of cancers. Of considerable importance is the microRNA-19, which is an oncomiR and is also known to target PTEN [
47] in the PI3K/AKT signalling pathway. It has been observed that activation of PI3K/AKT signalling plays an important role in stem cell cancers while the JMML is considered to be of hematopoietic stem cell origin [
48]. Lastly, JMML microRNA profiles bear some similarities with other types of cancers in terms of microRNAs that are up/down regulated but also have features specific to JMML.
Signalling networks usually display a PI3K/AKT pathway and a RAS/MAPK pathway as independent parallel pathways, but these two pathways share multiple cross-talk points between each other as shown in Figure 4[
49]. Roles of both RAS and PI3K pathway have been found in some cancers and drugs or combinations of drugs targeting both pathways have been suggested. Our findings reveal that mutations in the
AKT1 gene could cause dysregulation in the RAS pathway due to the apparent cross-talk between the two pathways. From the results above, we are able to hypothesize that a mutation in
AKT1 affects the PI3K/AKT/mTOR signalling pathway activity that may also cause dysregulation in the RAS/MAPK pathway [
50,
51]. While the RAS pathway has been known to be the most common route for JMML cancer causation or proliferation, we emphasize here that PI3K/AKT may serve as another alternative mechanism for JMML causation, which opens up the possibility of designing new inhibitors targeting both pathways as shown in Figure 4. Also, our identification of a number of microRNAs that are differentially expressed in JMML suggests a possible role of microRNAs as mediators or effectors in JMML initiation and progression. Some of these oncomiRs that were found to be upregulated in our analysis are known to interact with PTEN, whose mutations and/or silencing has been implicated in various cancers and is known to lead to hyperactive PI3K signalling.
AKT1 plays an important role in cell growth, apoptosis, proliferation, transcription and cell migration.
AKT1 exhibits these functions by mediating multiple signalling pathways including the PI3K/AKT/mTOR pathway.
AKT1 activation is associated with many malignancies including solid tumours and acute myeloid leukaemia. Current clinical trials demonstrate that the
AKT1 inhibitors can be used to treat hematopoietic malignancies as well as solid tumours. The most promising inhibitors include perifosine (KRX-0401, Aeterna Zentaris/Keryx), MK-2206 (Merck), and GSK-2141795 (Glaxo-SmithKline). In addition, “pan-PI3K” inhibitors and dual PI3K/mTOR inhibitors are in active clinical trials. In fact, two mTOR inhibitors, everolimus and temsirolimus, have been approved by Food and Drug Administration (FDA) to treat multiple types of solid tumours. From our studies, these drugs may potentially be used in treatment of JMML as an “off-label” mechanism [
52,
53]. Rare disorders like JMML provide a realistic future setting for personalized medicine using a single patient data. Our study highlights some of the ways in which multiple types of data can be used together to arrive at a possible treatment management plan. Use of personalized medicine or precision medicine in oncology based on patient specific genetics is now starting to be recognized [
54]. Due to the burst of next-generation sequencing data, there has been a trend among clinicians to use genomic data to more precisely diagnose cancer, which can lead to a treatment of the cancer patient that is both effective and efficient. In rare diseases such as JMML, most of the personalized approaches rely on a very small subset of patients, and in most cases they tend to follow an N-of-1 trial approach in which the single patient is the entire trial [
55]. Many times this N-of-1 study is also executed in an AB quasi experimental way where the causality cannot be definitely demonstrated. When it comes to precision medicine of rare disorders, such experimental setups can be useful. Thus rare disorders like JMML in this report provide a model for personalized medicine using a single patient data. Our preliminary discovery may help identify new genetic markers for diagnosis and new therapeutic targets using specific kinase inhibitors or RNA interference (RNAi) for JMML treatment.
CONCLUSIONS
By using whole-exome sequencing, we have confirmed the existing mutation in the PTPN11 gene (chr12:112888165, G>T, Exon3) and discovered several novel mutations including AKT1 (chr14:105239869, G>A, Exon9) in our JMML patient. The microRNA profiling revealed the dysregulation in multiple microRNAs that were enriched in PI3K/AKT and other pathways. Extensive cross-talking between the newly identified PI3K/AKT pathway and the classical RAS/MAPK pathway in JMML suggests a synergetic effect in JMML leukemogenesis. Our approach and sequencing data not only identified novel mutations that can potentially be used as a diagnostic biomarker and therapeutic target in JMML, but also provide a model to characterize the rare disease in the personalized precision medicine era.
MATERIALS AND METHODS
Whole exome sequencing
One human subject was involved in this research project. The DNA was extracted from patient blood and bone marrow samples. The authors (Dr. MICHAEL X. WANG and JASON DENNEY) have obtained an approval of Institutional review board of University of Missouri Health Science left (IRB# 1208122, HS IRB USE ONLY Approval Date: Aug 6, 2013) and signed a consent form from the parent. Genomic DNA was extracted from the bone marrow aspirate specimen of the 2-year old male patient with JMML after obtaining the IRB approval and a consent form signed by the parent. The libraries were prepared from fragmented DNA using the TruSeq DNA Library Preparation Kit (Illumina) and the enriched exome libraries were clonal amplified and sequenced using an Illumina HiSeq 2000 sequencer. The paired-end sequences were subject to quality control and trimmed using Fastqc [
56]. The quality-controlled reads were then processed using a locally installed version of the Mercury pipeline [
15]. The pipeline goes through several steps starting from mapping of the reads to a reference genome using BWA [
14], which produces a BAM file. The reference assembly used in this process was GRCh37 obtained from the Human Genome Sequencing left (https://www.hgsc.bcm.edu/). The aligned BAM file was further processed by tools like Picard (http://picard.sourceforge.net), SAMtools [
57] and GATK [
58] which further refined the BAM file by sorting, duplicate-marking, indel-realignment, base quality recalibration and indexing. The variants were called on the finished BAM file using Atlas-SNP and Atlas-Indel as part of the Atlas2 suite [
59]. The called variants were finally annotated using Cassandra which combines the Annovar [
60] output along with other public data sources like COSMIC (Catalogue of Somatic Mutations in Cancer) [
61] to produce annotated vcf files.
Cancer-panel sequencing
In order to validate the mutations that we identified from whole-exome sequencing, we further conducted deep sequencing with 100× coverage of a cancer gene panel derived from the Sanger Institutes Cancer gene list [
62]. Targeted sequencing of 533 cancer genes was performed using the Illumina HiSeq 2000 sequencer on the same DNA sample of this JMML patient. The reads obtained were subjected to quality control using Fastqc and analysed using the Mercury pipeline described in the exome analysis method.
Identifying nonsynonymous mutations
In order to identify disease-associated mutations from both whole-exome sequencing and cancer-panel sequencing, we removed the variants that were common to Dbsnp (hg19 assembly), Dbsnp (grch37 assembly) and 1,000 genomes SNPs. This was achieved using the bedtools intersect program [
63]. The vcf files for each of the above-mentioned data were obtained from the Human Genome Sequencing left(https://www.hgsc.bcm.edu/). Once the list of mutations was obtained, it was run through the SIFT [
16] algorithm to identify functionally important variants. Protein sequence of the mutated genes and conservation comparison was obtained using MutationTaster [
64] and structural analysis was performed using HOPE [
19] and SusPect [
18]. Further homology modelling of AKT1 protein was carried out using 3O96 PDB structure with SWISS-MODEL [
65] and the protein structure was visualized using UCSF Chimera [
66]. The polar contacts around the mutation and native residue were constructed using Ligplot+ [
21].
MicroRNA analysis
To identify microRNAs that might play regulatory roles in gene expression of JMML, we extracted total RNA from the same patient’s bone marrow aspirate sample. Illumina TruSeq small RNA preparation kits were used for generating small RNA libraries from total RNA. Strand specific paired-end RNA-seq data was obtained using TruSeq small RNA sequencing. The TruSeq adapters were removed using the bbduk program of the bbmap package (sourceforge.net/projects/bbmap/). The mate2 of the paired-end RNA-seq fastq file was reversely complemented using the FASTX-Toolkit (http://hannonlab.cshl.edu/fastx_toolkit/). The reversely complemented mate2 was then combined with the mate1 fastq file. sRNAbench [
67] a publicly available small RNA pipeline, was used for small RNA analysis. The mapping of all the reads was done against the GRCh37 human genome using sRNAbench after the genome had been indexed. After profiling expression of miRNA for JMML patient data we used publicly available peripheral blood monocyte data from 10 normal adults (GSE47189) of small RNA-seq to compare it with the JMML miRNA profile. The data was downloaded from NCBI GEO. Once the data was analysed using sRNAbench, counts were calculated using sRNAbenchDE for JMML sample and normal monocytes. Differential expression analysis was performed using both edgeR [
68] and DESeq [
69]. The microRNAs differentially expressed in both edgeR and DESeq were used for further analysis. Pathway enrichment for the microRNAs was done using miRPath [
34] from the DIANA-tools webserver. A network of genes involved in cross-talk was constructed using Cytoscape 3.2 [
70]. MicroRNA gene target information from miRTarBase Release 4.5 [
71] was used along with the
p-values and logFC values obtained from differential expression analysis for network visualization using Cytoscape for genes involved in cross-talk between RAS and PI3K/AKT pathway and their first degree neighbouring genes. AKT and RAS pathways were downloaded from KEGG as KGML/XML files and uploaded in Cytoscape. The genes shown in Figure 4 were selected along with first-degree neighbours and visualized. The network was merged with information of microRNA targets and microRNA expression. Log2-fold-change was used to do a continuous mapping of colour from green to red with green being upregulated and red being downregulated using vizmapper in Cytoscape. Two networks shown in Cytoscape are presented in Supplementary Figures S4 and S5.
Higher Education Press and Springer-Verlag GmbH Germany, part of Springer Nature