Transcriptomics and proteomics in stem cell research

Hai Wang; Qian Zhang; Xiangdong Fang

doi:10.1007/s11684-014-0336-0

Front. Med. ›› 2014, Vol. 8 ›› Issue (4) :433 -444. DOI: 10.1007/s11684-014-0336-0

REVIEW

Transcriptomics and proteomics in stem cell research

Author information +

History +

PDF (690KB)

Abstract

Stem cells are capable of self-renewal and differentiation, and the processes regulating these events are among the most comprehensively investigated topics in life sciences. In particular, the molecular mechanisms of the self-renewal, proliferation, and differentiation of stem cells have been extensively examined. Multi-omics integrative analysis, such as transcriptomics combined with proteomics, is one of the most promising approaches to the systemic investigation of stem cell biology. We reviewed the available information on stem cells by examining published results using transcriptomic and proteomic characterization of the different stem cell processes. Comprehensive understanding of these important processes can only be achieved using a systemic methodology, and employing such method will strengthen the study on stem cell biology and promote the clinical applications of stem cells.

Keywords

embryonic stem cells / transcriptomics / proteomics

Cite this article

Download citation ▾

Hai Wang, Qian Zhang, Xiangdong Fang. Transcriptomics and proteomics in stem cell research. Front. Med., 2014, 8(4): 433-444 DOI:10.1007/s11684-014-0336-0

登录浏览全文

4963

注册一个新账户忘记密码

1 Introduction

Stem cells are often related to as primitive, undifferentiated cells that have the ability to reproduce themselves indefinitely (self-renewal) and can generate various types of cells on reception of appropriate external or internal cues (pluripotency or multipotency) [1]. Stem cells are classified into two main groups, embryonic stem cells (ESCs) and adult stem cells. ESCs are mostly derived from the inner cell mass of the blastocyst and give rise to the fetus. In particular, ESCs have a nearly infinite self-renewal ability and the potential of differentiating into almost any cell type [2,3]. Adult stem cells, including somatic and germline stem cells, can maintain, replenish, and regenerate the tissue from which they originate in mature organisms [2]. Both ESCs and adult stem cells have been intensively and widely studied in several fields of science and medicine; the results promise to be very useful in many clinical applications, bringing new treatments and perhaps even curing some currently incurable diseases.

It is of biological and clinical importance to explore the molecular mechanisms involved in stem cell self-renewal, proliferation, and differentiation. Recent advances in so-called “omics” technologies have provided researchers with new opportunities for an overall understanding of biological features of stem cells. Omics covers an increasingly wide range of biology branches to perform precise analyses of biological processes and structures in an increasingly large number of life science fields. These fields range from genomics (examining protein-coding genes, noncoding regions, and regulatory elements), transcriptomics (analyzing transcription, gene expression, and alternative splicing), proteomics (protein identification, quantification, and posttranslational modifications) to epigenomics (DNA methylation, histone modification, and chromatin remodeling). The “omics” technologies are largely responsible for dramatic advances in the postgenomic biology and medicine [4,5]. Therefore, comprehensive, genome-wide analysis combining the techniques of genomics, proteomics, transcriptomics, and epigenomics will provide new insights in the field of stem cell biology and potential clinical applications. In this study, we focus on transcriptomics and proteomics of stem cells.

The development of the transcriptomic and proteomic technologies has enabled the investigation of stem cells using systems biology tools. In particular, high-throughput screening techniques have generated large amounts of data, facilitating systemic understanding of relationships between molecular components [6].

In this article, various aspects of transcriptomic and proteomic studies of stem cells are reviewed, and important findings regarding stem cell self-renewal, proliferation, and differentiation have been highlighted and discussed.

2 The characteristics of stem cell transcriptomics

Transcriptome is the complete set of RNAs in a cell at a particular developmental stage or under specific physiological conditions [7]. Research on the transcriptome is a crucial step in discovering the functional components of the genome, revealing the molecular features of particular cells and tissue, and understanding developmental processes and mechanisms underlying diseases [7]. Pluripotent stem cells are characterized by high levels of global transcriptional activity, leading to their plasticity, while loss of pluripotency and lineage specification cause considerable reduction in the transcription of few portions of the genome [8].

2.1 mRNA

Comparative analyses, with data mining, of transcriptional profiles of ESCs can quantify the alterations in the expression of each transcript and identify the key factors involved in stem cell self-renewal, proliferation, and differentiation. Examining undifferentiated human embryonic stem cells (hESCs), we can identify the differentially and specifically expressed genes in these cells [9]. Ginis et al. compared gene expression profiles of mouse and human ESCs and revealed differences in molecular signatures associated with maintaining pluripotency. These differences are species-specific rather than arising from differences in cell culture conditions [10]. Few genes that are expressed exclusively or predominantly in ESCs have been identified and verified in many studies. These are genes such as OCT4, SOX2, NANOG, REX-1, UTF1, TERT, ABCG2, NODAL, TDGF1, LEFTB, BEX1, and GATA4, and some components of signaling pathways, such as FGF, WNT, and BMP. These genes are important for maintaining the pluripotency [9,11–13]. In addition, there are different factors involved in stem cell differentiation into different lineages. Djouad et al. compared the transcriptomes of multipotent human mesenchymal stem cells (MSCs) and MSC-derived chondrocytes cultured in micropellets. They observed that the expression of 676 genes was upregulated in MSC-derived chondrocytes in comparison with the original MSCs. In particular, Foxo3A was highly expressed at day 21 of the culture and in mature chondrocytes. Furthermore, they suggested that upregulation of Foxo3A expression during chondrogenic differentiation plays a dual role; it inhibits the differentiation toward hypertrophy and promotes cell apoptosis [14].

However, in different expression profile studies, there is little overlap between the lists of the genes overexpressed in ESCs [13]. For example, Ivanova et al. [15] and Ramalho-Santos et al. [16] have independently identified>200 genes upregulated in ESCs, but there are only six genes in common among these two studies, although they used the same cell types and identical microarray chips [17]. We also downloaded different data sets about stem cells from GEO and used hierarchical cluster analysis to sort the expressed genes in ESC.1 (H1 cell line, GEO accession number: GSM817221), ESC.2 (H1 cell line, GEO accession number: GSM1006724), ESC.3 (H7 cell line, GEO accession number: GSM1273672), ESC.4 (human embryonic stem cell line, GEO accession number: GSM922224), HESC (human hematopoietic stem cells, GEO accession number: GSM1185603), 48hrESC (ESC.2 differentiated for 48 h, GEO accession number: GSE41009), MSC (mesenchymal stem cells, GEO accession number: GSE37521), CSC cells (cancer stem cells, GEO accession number: GSE33912) (Fig. 1) [18–23]. Interestingly ESC.1, ESC.2, ESC.3 and ESC.4 were not clustered together, while differentiated cells (48hrESC) and pluripotent cells were clustered together. Genetic diversity among different cell lines, differences in cell culture conditions, and sampling issues may account for such discrepancies in various lines of gene expression data. However, we cannot exclude the possibility of flawed methods employed in such comparative analyses [13]. Therefore, combined analyses of multiple experiments using appropriate statistical methods are essential in reaching coherent conclusions.

It has been established that alternative splicing plays a critical role in regulating ESC pluripotency and differentiation [24].Using Solexa sequencing system, Wu et al. demonstrated that greater splice junction diversity is observed in hESCs than in the cells undergoing neural differentiation [25]. This suggests that this high diversity of isoforms may contribute to the pluripotency of hESCs. The presence of large numbers of specialized transcripts, highest in undifferentiated hESCs and decreasing upon differentiation, is a part of the phenomenon termed isoform specialization [25].

Global transcriptional profiling also enables the detection of unknown RNAs. For example, Brandenberger et al. [26] identified 16 000 (approximately 50% of total tags) potentially novel expressed sequence tags (ESTs) in hESC, and Anisimov et al. [27] identified 16 000 (approximately 35% of total tags) potentially novel tags by serial analysis of gene expression (SAGE) in mouse ESCs. These transcripts may have functions or regulate the key factors which help stem cells maintain their particular characteristics.

2.2 MicroRNAs

Although the functions of many protein-coding genes have been extensively studied, little is known about the regulatory effects of microRNAs (miRNAs) on transcription. miRNAs are a family of small, noncoding RNAs that can bind to the 3′ nontranslational region (3′UTR) of target mRNAs to regulate their expressions [28]. miRNA studies can give us a new insight into the molecular mechanism of stem cell functions.

Various cloning studies have demonstrated that miR-368, miR-200c, miR-154*, miR-371, miR-372, miR-373, and miR-373* are expressed specifically in hESCs to maintain stem cell self-renewal [2,29]. miR-301, miR-374, miR-21, miR-29, and miR-29b play crucial roles in stem cell differentiation and their expression increases after the induction of differentiation [2,29]. However, not all miRNAs in the genome have been characterized. A thorough exploration of the global expression profiles of miRNAs (miRNome) during stem cell self-renewal, proliferation, and differentiation would generate profound influence in stem cell research. miRNome analysis of ESCs and epiblast stem cells (EpiSCs) derived from mouse embryos has shown that miR17-92, miR290-295, and a large repetitive cluster on chromosome 2 are highly expressed in ESCs, whereas miR-302d, miR-34c, miR-367, and let-7e are highly expressed in EpiSCs [30]. These data may indicate that miRNAs play dual roles of redundant and specific factors in the fine-tuning of pluripotency during stem cell development [30]. During T cell development, miRNA expression is an extremely highly-regulated and dynamic process. Using next-generation sequencing technology, 645 miRNAs were obtained from these cells [31]. In addition, Marson et al. generated an accurate positioning genome map for pluripotency factors Oct4, Nanog, Sox2, and Tcf3, and the histone modification H3K4 me3 of mESC occupancy (ChIP-seq). Their studies demonstrate that Oct4, Nanog, Sox2, and Tcf3 promote the ESC miRNA expression program; thus, integrate miRNAs into the regulatory network that controls ES cell identity [32].

2.3 lncRNAs

Non-protein-coding RNAs (or noncoding RNAs) participate in many processes, such as cellular regulation, development and disease [33]. Except for miRNAs, there is another noteworthy class of potential regulatory RNAs in the transcriptome. We refer to this class as long noncoding RNAs (lncRNAs), which are non-protein coding transcripts longer than 200 nucleotides. The limited number of functional studies of lncRNAs suggest that they play important roles in stem cell pluripotency and differentiation. lncRNAs display their function in many ways, such as expression, regulation, and mutation. Some lncRNAs have been identified to play important roles in pluripotency of stem cells by regulating the expression of some key factors. Mohamed et al. found that two of these lncRNAs, AK028326 (Oct-activated) and AK141205 (Nanog-repressed), were direct targets of Oct4 and Nanog in mESCs, in addition to alterations in cellular lineage-specific gene expression and in the pluripotency of mESCs [34]. In human ESC, the transcription of most lncRNA genes is coordinated with transcription of protein-coding genes, which implies that these lncRNAs have positive transcriptional regulation functions [19]. While some lncRNAs may have functions other than transcriptional regulation. Dinger et al. identified 945 ncRNAs expressed during embryoid body differentiation, of which 174 were differentially expressed, many correlating with pluripotency or specific differentiation events, in some cases through engagement of the epigenetic machinery [35]. Two such lncRNAs, Six3os and Dlx1as, are also found to play important roles in the glial-neuronal lineage specification of multipotent adult stem cells [36].

3 The characteristics of stem cell proteomics

Transcriptome approaches provide genome-wide coverage of the mRNAs and miRNAs. However, because of posttranscriptional events, these methods do not always reflect protein dynamics in stem cells. Proteomic analysis supplies the relative quantitation of proteins and peptides, identification of proteins, their subcellular localization, and identifies protein-protein interactions and posttranslational modifications (PTMs) [37]. The application of proteomics to study the processes controlling stem cell self-renewal, proliferation, and differentiation will provide valuable insight into the molecular mechanisms of the factors involved in the differentiation of these cells to specific lineages [38].

3.1 Proteins

Most of the stem cell proteomics studies aimed to examine the changes in the cytoplasmic protein content to identify markers, novel key proteins, and protein interaction maps during different stages of stem cell development [39,40]. Nagano et al. identified markers of ESC such as transcription factors Oct-3/4 and UTF-1; alkaline phosphatase; and others including nidogen 2, hepatoma-derived growth factor (HDGF), cadherin 1, catenin α1, transgelin, and disabled homolog 2 [20]. Comparing monkey ESCs during proliferation and at different stages of spontaneous differentiation (days 3, 6, 12, and 30), Nasrabadi et al. observed changes in the expression of novel key proteins involved in transcription regulation, cell proliferation (CDV3, RCN1, PCNP and homolog), Ras signaling (G3BP and TTC1), and chromatin remodeling (RUVBL1 and HDGF) [40]. SILAC proteomics of planarians identifies Ncoa5 as a conserved component of pluripotent stem cells [41]. Besides, the key proteins and biomarkers of cancer stem cell are also widely studied. DAC2 and CTNNB1 are detected as prognostic markers in the malignant transformation of hESCs in a recent study [42]. With proteome strategy, p63 is found to play an important role in cancer development by regulating the key steps of glycolysis in colon cancer stem cells [43].

Approximately 3400 genes have been predicted to encode single-pass transmembrane or secreted proteins in mammalian cells [44]. It is thus necessary to explore the physiological activities of the extracellular proteome during stem cell self-renewal, proliferation, and differentiation [45]. Gonzalez et al. analyzed the complete extracellular proteome of hESCs and suggested that activation of the pigment epithelium-derived factor (PEDF) receptor-Erk1/2 signaling pathway by the PEDF is sufficient to maintain the self-renewal of undifferentiated hESCs [45]. Moreover, ERK1/2 is also identified as a potential pathway correlated with processes that characterize tumorigenic potential and stemness of cancer stem cells in osteosarcoma, which exhibit a surface protein signature different from differentiated cells [46].

3.2 Phosphorylation

Cell-fate determination is also regulated by protein phosphorylation, a critical determinant of cell signaling [47,48]. Phosphorylation status exhibits dynamic changes during the differentiation period. Four recent phosphoproteomic analyses of hESCs, using different cell culture conditions and different technologies, have identified 3067, 2546, 11 995, and 23 522 protein phosphorylation sites [47–50]. Approximately 50% of these sites presented dynamic changes in the phosphorylation status during 24 h of differentiation [47,50]. Among the dynamically phosphorylated proteins, CDK1/2 was identified as a central factor in controlling stem cell self-renewal and lineage specification [47]. Brill et al. also discovered that 389 proteins contained more phosphorylation site identifications in undifferentiated hESCs, whereas 540 proteins contained more such identifications in differentiated derivatives [48]. Moreover, numerous phosphoproteins in receptor tyrosine kinase (RTK) signaling pathways were present in hESCs [48]. Understanding the phosphorylation landscape that controls stem cell pluripotency, self-renewal and differentiation will also improve our ability to develop stem cell-based therapies.

4 Systemic analysis of transcriptomic and proteomic data

With the progress of high-throughput approaches, such as RNA sequencing and high-throughput protein studies, the omics data sets are increasing rapidly, demanding new developments in bioinformatics approaches for further analysis of these data. Databases for stem cell omics data encourage researchers to share their experimental stem cell data globally (Table 1). Increasing numbers of computational methods and open source or commercial software packages are being developed. Comparisons of omics data obtained for stem cells and other kinds of cell lines at different regulatory levels give us an increasingly comprehensive view of the molecular mechanisms underlying self-renewal, proliferation, and differentiation of stem cells.

4.1 mRNA-seq data analysis

Numerous technologies have been applied to detect and quantify the transcriptome of stem cells at different differentiation and developmental stages. These include EST, SAGE, massively parallel signature sequencing, microarray analysis, and high-throughput sequencing [also known as “next-generation sequencing (NGS)”] [63–66]. Each of these technologies has its own advantages and limitations.

Microarrays have been widely used for obtaining genome-wide expression profiles of stem cells at different stages [67]. However, microarray technology suffers from insufficient sensitivity, narrow dynamic range, and nonspecific hybridizations [68]. In addition, this technology can only provide information regarding the transcripts hybridizing with the probes included on the array. Unlike microarrays, SAGE is a de novo sequencing method, which can identify novel genes; this method needs very little knowledge of sequences for probe construction [64]. However, the cloning and sequencing steps in this technique are laborious, which significantly limits its use [64]. The NGS technology, using SOLiD sequencing system, Solexa genome analyzer, and 454 GS FLX sequencer, overcomes the limitations of the traditional sequencing technologies and provides a high-speed, high-throughput, yet low-cost method for both mapping and quantifying transcriptomes [7,66]. Researchers often combine several technologies for transcriptome study of stem cells.

With the developments of NGS technology, several tools for NGS data analysis have been rapidly emerging. One of the critical steps for RNA-seq experiment is mapping the short reads to reference genomes. So mapping tools with different strategies have appeared to overcome this difficulty. TopHat is a fast mapping tool to align RNA-seq short reads into the reference genome using high-throughput sequence aligner Bowtie; the splice junctions between exons can be determined by analyzing the results of the mapping [69]. PALMapper combines powerful mapping tool GenomeMapper with splice alignment tool QPALMA, so it can exploit quality information of RNA-seq reads and predict splicing sites, which improves the accuracy of alignment [70]. SeqMap can detect multiple substitutions and insertions/deletions of the nucleotide bases in the sequences [71]. Different from the above tools, MapSplice is characterized by the sensitivity and specificity of splice detection, and the effective use of CPU and memory [72]. The algorithm used by this tool is independent of splice site features or intron length, so the novel canonical and noncanonical splices can be detected [72]. Other programs such as Scripture, SpliceMap, SOAP and BWA are also often used for RNA-seq mapping [73–75].

Other methods and software packages have been developed for further analysis, such as transcript assembly, FPKM/RPKM estimation, finding significant changes in transcript expression, identifying gene fusions, and alternative splicing. Cufflinks is a widely used tool for RNA-seq data analysis. It estimates transcript abundances, assembles transcripts and identifies differential expression, and regulation in RNA-seq samples [76]. Bioconductor package (www.bioconductor.org) is also widely used. It is an open source program for the analysis of genomic data, and it includes packages for RNA-seq analysis. The combination of several such tools will facilitate a rigorous RNA-seq data analysis.

4.2 miRNA-seq data analysis

Deep-sequencing technologies, such as miRNA-seq, provide a powerful strategy to explore miRNA populations with high specificity and sensitivity. For miRNA-seq data analysis, multiple computational approaches have been established to analyze miRNA-seq data, allowing differential expression analysis, identification of known and novel miRNAs, and prediction of miRNAs targets. miRDeep is a software package for miRNA-seq data analysis to determine known and novel miRNAs [77]. It scores compatibility of the position and frequency of sequenced RNA with the secondary structure of the miRNA precursor by constructing a probabilistic model that simulates miRNA biogenesis process [77]. miRNAkey is special in achieving the basic functions of miRNA-seq data analysis, and adding some unique characteristics such as multiple read determination and data statistics. The tool provides an innovative platform for the data mining of deep-sequencing of miRNAs [78]. miRanalyser and DSAP are commonly used web server tools for dealing with deep-sequencing data of miRNA [79,80]. Few online databases such as TargetScan [81], PicTar [82], Miranda [83], and DIANA-microT [84] are often used for the prediction of miRNA targets.

4.3 Proteome and phosphoproteome data analysis

Various issues associated with the proteome, such as abundance of proteins and peptides, stability, subcellular localization, PTMs, and their interactions, can be elucidated using different technologies [38]. Two-dimensional polyacrylamide gel electrophoresis (2D-PAGE), mass spectrometry (MS), and liquid chromatography(LC)-MS/MS techniques are widely applied to proteomic analyses. 2D-PAGE is a common tool for isolating proteins from a complex mixture on the basis of two independent parameters in two distinct steps. High-resolution 2D-PAGE of proteins is the fundamental technique of proteomics and can simultaneously analyze thousands of proteins [38]. Although 2D-PAGE has been broadly used for proteome analysis, it has several limitations such as low resolution and low sensitivity [85]. Application of MS has been a significant breakthrough in proteomics. This technique can identify proteins in the femtomole to picomole range and has replaced the classic Edman N-terminal sequencing, which is less sensitive, less automated, and requires an unblocked N terminus [86,87]. Liquid chromatography-mass spectrometry is now routinely used for the identification of peptides [6]. The approaches of this technique are preferred over 2D PAGE-based approaches to detect proteins. New chemical methods such as isotope-coded affinity tag (ICAT), stable isotope labeling with amino acids in cell culture (SILAC), and isobaric tag for relative and absolute quantification (iTRAQ) can further enhance the sensitivity [6]. We can also generate a global view of stem cell proteome dynamics using protein microarrays, a high-throughput technique for obtaining protein abundance and functional data [6,88].

The analysis of the dynamics of protein expression during stem cells self-renewal and differentiation can provide important clues regarding progression of the stem cell differentiation processes [89]. Different bioinformatics tools are developed for analyzing proteomic data from different resources. MSQuant, which is capable of handling multiple labeling strategies and supports several vendor data formats, is widely used for SILAC proteomics data analysis [90]. Other tools such as ASAPRatio [91], XPRESS [92], MaxQuant [93], and PVIEW [94] can also be used for SILAC proteomic data analysis. For data analysis using isobaric labels, Multi-Q [95], iTracker [96], IsobariQ [97], and Libra [98] are freely available software programs. These programs can import preprocessed MS/MS data from Sequest or Mascot. Another proteomic technique, label-free quantification, is a widely used alternative to label-based approaches. Software tools for label-free quantification, such as Corra [99], IDEAL-Q [100], MSQuant [101], and MaxQuant [93] also allow the analysis of low-resolution data. However, because of large dynamic range covered by most of the complex protein extracts, the biophysical properties of protein, and post translational modification, the coverage of a proteome is still not comprehensive [102].

4.4 Combined omics data analysis

The future of genomic and proteomic technologies holds great expectations. Nonetheless, for transcriptomic and proteomic data to achieve their potential, computational integration must be performed to link together all the information generated. A few computational algorithms and software packages have been recently developed, which can utilize multiple-dimension experimental data sets for stem cells to construct their models and regulation networks. A general strategy to integrate mRNA and microRNA expression profiles is to perform correlation analysis. First, we can use software to predict mRNA targets for each miRNA, which is differentially expressed. For each differentially expressed miRNA, we should perform a statistical test to identify whether the number of predicted target mRNAs that are differentially expressed is higher than that expected by chance (P<0.01/0.05). Furthermore, we can perform gene ontology (GO) analysis and network analysis using a variety of bioinformatics databases and software [103].

Despite data quality of proteome is not as satisfactory as transcriptome, comprehensive analysis of the two data sets is becoming more and more widely used. Above all, mapping for short reads of transcriptome raw data and amino acid sequences from proteome data to the reference genome is necessary. Comparison and integration of transcriptomic and proteomic data show that, except for a small number of examples, the two data sets are complementary rather than comparable [102,104,105]. For instance, Liu et al. identified around 40%–60% of the proteins detected in S. japonicum were consistent with the transcripts [104]. The reason why the two data sets are much different from each other is not only the imperfect technologies for omics analysis. There is another important cause that the presence and qualities of transcripts and their corresponding protein products depend on a series of post-transcriptional regulation and metabolic processes [102]. So the overlaps of the two data sets are not expected too much. Where the differences are we may find the post-transcriptional regulation and metabolic processes occur and the actions between transcription and translation will be figured out in the near future. Moreover, Unwin et al. found the proteome and transcriptome change in generally the same direction by a comparison of data on large numbers of mRNA transcripts and the levels of expression of their associated proteins in dynamic systems of primary hematopoietic stem cells [106].

Several laboratories have used other omics data sets to perform a comprehensive analysis of stem cell data. To analyze transcriptome and epigenomic data altogether, Xu et al. developed a classifier to predict self-renewal and pluripotency of mESCs stemness membership genes, using support vector machines [107]. The Stem Cell Discovery Engine (SCDE), a new platform for analysis of multiomics data, has allowed the users to consistently describe, share, and compare multiomics data at the gene and pathway level [108].

5 Perspectives

With the continuous development and improvement of experimental techniques and computational methods, omics research of stem cells has substantially progressed. This has been prompted primarily by major breakthroughs in stem cell biology, the potential of stem cells for biomedical application, and the awareness that transcriptomics and proteomics may be able to accelerate this progress further and possibly open yet unexplored areas of research. At the same time, these achievements bring us new and greater challenges. One of the major problems is how to utilize the existing experimental data more efficiently in the high-level analysis. To achieve this, we must eliminate the discrepancies caused by differences between various platforms and technologies. Only then we can make useful parallel comparisons of data from different sources. To date, we are no more than halfway to achieving this goal. However, the integration of the multilayered omics data are not the end. Our final goal should be the formulation of new hypotheses based on the results of transcriptomic and proteomic data analysis, and testing them in a low-throughput setup to obtain functional verification. The field will be able to move ahead more quickly to uncover the characteristics of stem cells, benefiting clinical applications such as transplants of stem cells and alternative therapies (Fig. 2).

References

Publishing order | Descend order by publishing year | Descend order by cited within

[1]	Ahn SM, Simpson R, Lee B. Genomics and proteomics in stem cell research: the road ahead. Anat Cell Biol 2010; 43(1): 1–14

[2]	Gangaraju VK, Lin H. MicroRNAs: key regulators of stem cells. Nat Rev Mol Cell Biol 2009; 10(2): 116–125

[3]	Wobus AM, Boheler KR. Embryonic stem cells: prospects for developmental biology and cell therapy. Physiol Rev 2005; 85(2): 635–678

[4]	Callinan PA, Feinberg AP. The emerging science of epigenomics. Hum Mol Genet 2006; 15(Spec No 1): R95–R101

[5]	Schneider MV, Orchard S. Omics technologies, data and bioinformatics principles. Methods Mol Biol 2011; 719: 3–30

[6]	Stanton LW, Bakre MM. Genomic and proteomic characterization of embryonic stem cells. Curr Opin Chem Biol 2007; 11(4): 399–404

[7]	Wang Z, Gerstein M, Snyder M. RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet 2009; 10(1): 57–63

[8]	Efroni S, Duttagupta R, Cheng J, Dehghani H, Hoeppner DJ, Dash C, Bazett-Jones DP, Le Grice S, McKay RDG, Buetow KH, Gingeras TR, Misteli T, Meshorer E. Global transcription in pluripotent embryonic stem cells. Cell Stem Cell 2008; 2(5): 437–447

[9]

Chin MH, Mason MJ, Xie W, Volinia S, Singer M, Peterson C, Ambartsumyan G, Aimiuwu O, Richter L, Zhang J, Khvorostov I, Ott V, Grunstein M, Lavon N, Benvenisty N, Croce CM, Clark AT, Baxter T, Pyle AD, Teitell MA, Pelegrini M, Plath K, Lowry WE. Induced pluripotent stem cells and embryonic stem cells are distinguished by gene expression signatures. Cell Stem Cell 2009; 5(1): 111–123

[10]	Ginis I, Luo Y, Miura T, Thies S, Brandenberger R, Gerecht-Nir S, Amit M, Hoke A, Carpenter MK, Itskovitz-Eldor J, Rao MS. Differences between human and mouse embryonic stem cells. Dev Biol 2004; 269(2): 360–380

[11]	Bhattacharya B, Miura T, Brandenberger R, Mejido J, Luo Y, Yang AX, Joshi BH, Ginis I, Thies RS, Amit M, Lyons I, Condie BG, Itskovitz-Eldor J, Rao MS, Puri RK. Gene expression in human embryonic stem cell lines: unique molecular signature. Blood 2004; 103(8): 2956–2964

[12]	Brandenberger R, Khrebtukova I, Thies RS, Miura T, Jingli C, Puri R, Vasicek T, Lebkowski J, Rao M. MPSS profiling of human embryonic stem cells. BMC Dev Biol 2004; 4(1): 10

[13]	Zhan M. Genomic studies to explore self-renewal and differentiation properties of embryonic stem cells. Front Biosci 2008; 13(13): 276–283

[14]	Djouad F, Bony C, Canovas F, Fromigué O, Rème T, Jorgensen C, Noël D. Transcriptomic analysis identifies Foxo3A as a novel transcription factor regulating mesenchymal stem cell chrondrogenic differentiation. Cloning Stem Cells 2009; 11(3): 407–416

[15]	Ivanova NB, Dimos JT, Schaniel C, Hackney JA, Moore KA, Lemischka IR. A stem cell molecular signature. Science 2002; 298(5593): 601–604

[16]	Ramalho-Santos M, Yoon S, Matsuzaki Y, Mulligan RC, Melton DA. “Stemness”: transcriptional profiling of embryonic and adult stem cells. Science 2002; 298(5593): 597–600

[17]	Suárez-Fariñas M, Noggle S, Heke M, Hemmati-Brivanlou A, Magnasco MO. Comparing independent microarray studies: the case of human embryonic stem cells. BMC Genomics 2005; 6(1): 99

[18]	Yang Y, Wang H, Chang KH, Qu H, Zhang Z, Xiong Q, Qi H, Cui P, Lin Q, Ruan X, Yang Y, Li Y, Shu C, Li Q, Wakeland EK, Yan J, Hu S, Fang X. Transcriptome dynamics during human erythroid differentiation and development. Genomics 2013; 102(5-6): 431–441

[19]	Sigova AA, Mullen AC, Molinie B, Gupta S, Orlando DA, Guenther MG, Almada AE, Lin C, Sharp PA, Giallourakis CC, Young RA. Divergent transcription of long noncoding RNA/mRNA gene pairs in embryonic stem cells. Proc Natl Acad Sci USA 2013; 110(8): 2876–2881

[20]	Yan L, Yang M, Guo H, Yang L, Wu J, Li R, Liu P, Lian Y, Zheng X, Yan J, Huang J, Li M, Wu X, Wen L, Lao K, Li R, Qiao J, Tang F. Single-cell RNA-Seq profiling of human preimplantation embryos and embryonic stem cells. Nat Struct Mol Biol 2013; 20(9): 1131–1139

[21]	MacRae T, Sargeant T, Lemieux S, Hébert J, Deneault E, Sauvageau G. RNA-Seq reveals spliceosome and proteasome genes as most consistent transcripts in human cancer cells. PLoS ONE 2013; 8(9): e72884

[22]	Jääger K, Islam S, Zajac P, Linnarsson S, Neuman T. RNA-seq analysis reveals different dynamics of differentiation of human dermis- and adipose-derived stromal stem cells. PLoS ONE 2012; 7(6): e38833

[23]	Gargiulo G, Cesaroni M, Serresi M, de Vries N, Hulsman D, Bruggeman SW, Lancini C, van Lohuizen M.In vivo RNAi screen for BMI1 targets identifies TGF-β/BMP-ER stress pathways as key regulators of neural- and malignant glioma-stem cell homeostasis. Cancer Cell 2013; 23(5): 660–676

[24]

Salomonis N, Schlieve CR, Pereira L, Wahlquist C, Colas A, Zambon AC, Vranizan K, Spindler MJ, Pico AR, Cline MS, Clark TA, Williams A, Blume JE, Samal E, Mercola M, Merrill BJ, Conklin BR. Alternative splicing regulates mouse embryonic stem cell pluripotency and differentiation. Proc Natl Acad Sci USA 2010; 107(23): 10514–10519

[25]

Wu JQ, Habegger L, Noisa P, Szekely A, Qiu C, Hutchison S, Raha D, Egholm M, Lin H, Weissman S, Cui W, Gerstein M, Snyder M. Dynamic transcriptomes during neural differentiation of human embryonic stem cells revealed by short, long, and paired-end sequencing. Proc Natl Acad Sci USA 2010; 107(11): 5254–5259

[26]	Brandenberger R, Wei H, Zhang S, Lei S, Murage J, Fisk GJ, Li Y, Xu C, Fang R, Guegler K, Rao MS, Mandalam R, Lebkowski J, Stanton LW. Transcriptome characterization elucidates signaling networks that control human ES cell growth and differentiation. Nat Biotechnol 2004; 22(6): 707–716

[27]	Anisimov SV, Tarasov KV, Tweedie D, Stern MD, Wobus AM, Boheler KR. SAGE identification of gene transcripts with profiles unique to pluripotent mouse R1 embryonic stem cells. Genomics 2002; 79(2): 169–176

[28]	He L, Hannon GJ. MicroRNAs: small RNAs with a big role in gene regulation. Nat Rev Genet 2004; 5(7): 522–531

[29]	Suh MR, Lee Y, Kim JY, Kim SK, Moon SH, Lee JY, Cha KY, Chung HM, Yoon HS, Moon SY, Kim VN, Kim KS. Human embryonic stem cells express a unique set of microRNAs. Dev Biol 2004; 270(2): 488–498

[30]	Jouneau A, Ciaudo C, Sismeiro O, Brochard V, Jouneau L, Vandormael-Pournin S, Coppée JY, Zhou Q, Heard E, Antoniewski C, Cohen-Tannoudji M. Naive and primed murine pluripotent stem cells have distinct miRNA expression profiles. RNA 2012; 18(2): 253–264

[31]	Kirigin FF, Lindstedt K, Sellars M, Ciofani M, Low SL, Jones L, Bell F, Pauli F, Bonneau R, Myers RM, Littman DR, Chong MMW. Dynamic microRNA gene transcription and processing during T cell development. J Immunol 2012; 188(7): 3257–3267

[32]

Marson A, Levine SS, Cole MF, Frampton GM, Brambrink T, Johnstone S, Guenther MG, Johnston WK, Wernig M, Newman J, Calabrese JM, Dennis LM, Volkert TL, Gupta S, Love J, Hannett N, Sharp PA, Bartel DP, Jaenisch R, Young RA. Connecting microRNA genes to the core transcriptional regulatory circuitry of embryonic stem cells. Cell 2008; 134(3): 521–533

[33]	Mattick JS. A new paradigm for developmental biology. J Exp Biol 2007; 210(Pt 9): 1526–1547

[34]	Sheik Mohamed J, Gaughwin PM, Lim B, Robson P, Lipovich L. Conserved long noncoding RNAs transcriptionally regulated by Oct4 and Nanog modulate pluripotency in mouse embryonic stem cells. RNA 2010; 16(2): 324–337

[35]	Dinger ME, Amaral PP, Mercer TR, Pang KC, Bruce SJ, Gardiner BB, Askarian-Amiri ME, Ru K, Soldà G, Simons C, Sunkin SM, Crowe ML, Grimmond SM, Perkins AC, Mattick JS. Long noncoding RNAs in mouse embryonic stem cell pluripotency and differentiation. Genome Res 2008; 18(9): 1433–1445

[36]	Ramos AD, Diaz A, Nellore A, Delgado RN, Park KY, Gonzales-Roybal G, Oldham MC, Song JS, Lim DA. Integration of genome-wide approaches identifies lncRNAs of adult neural stem cells and their progeny in vivo. Cell Stem Cell 2013; 12(5): 616–628

[37]	Unwin RD, Gaskell SJ, Evans CA, Whetton AD. The potential for proteomic definition of stem cell populations. Exp Hematol 2003; 31(12): 1147–1159

[38]	Baharvand H, Fathi A, van Hoof D, Salekdeh GH. Concise review: trends in stem cell proteomics. Stem Cells 2007; 25(8): 1888–1903

[39]	Nagano K, Taoka M, Yamauchi Y, Itagaki C, Shinkawa T, Nunomura K, Okamura N, Takahashi N, Izumi T, Isobe T. Large-scale identification of proteins expressed in mouse embryonic stem cells. Proteomics 2005; 5(5): 1346–1361

[40]	Nasrabadi D, Rezaei Larijani M, Pirhaji L, Gourabi H, Shahverdi A, Baharvand H, Salekdeh GH. Proteomic analysis of monkey embryonic stem cell during differentiation. J Proteome Res 2009; 8(3): 1527–1539

[41]	Böser A, Drexler HCA, Reuter H, Schmitz H, Wu G, Schöler HR, Gentile L, Bartscherer K. SILAC proteomics of planarians identifies Ncoa5 as a conserved component of pluripotent stem cells. Cell Reports 2013; 5(4): 1142–1155

[42]	Sun Y, Yang Y, Zeng S, Tan Y, Lu G, Lin G. Identification of proteins related to epigenetic regulation in the malignant transformation of aberrant karyotypic human embryonic stem cells by quantitative proteomics. PLoS ONE 2014; 9(1): e85823

[43]	D’Aguanno S, Barcaroli D, Rossi C, Zucchelli M, Ciavardelli D, Cortese C, De Cola A, Volpe S, D’Agostino D, Todaro M, Stassi G, Di Ilio C, Urbani A, De Laurenzi V. p63 Isoforms Regulate Metabolism of Cancer Stem Cells. J Proteome Res 2014; 13(4): 2120–2136

[44]	Lin H, Lee E, Hestir K, Leo C, Huang M, Bosch E, Halenbeck R, Wu G, Zhou A, Behrens D, Hollenbaugh D, Linnemann T, Qin M, Wong J, Chu K, Doberstein SK, Williams LT. Discovery of a cytokine and its receptor by functional screening of the extracellular proteome. Science 2008; 320(5877): 807–811

[45]

Gonzalez R, Jennings LL, Knuth M, Orth AP, Klock HE, Ou W, Feuerhelm J, Hull MV, Koesema E, Wang Y, Zhang J, Wu C, Cho CY, Su AI, Batalov S, Chen H, Johnson K, Laffitte B, Nguyen DG, Snyder EY, Schultz PG, Harris JL, Lesley SA. Screening the mammalian extracellular proteome for regulators of embryonic human stem cell pluripotency. Proc Natl Acad Sci USA 2010; 107(8): 3552–3557

[46]	Gemei M, Corbo C, D’Alessio F, Di Noto R, Vento R, Del Vecchio L. Surface proteomic analysis of differentiated versus stem-like osteosarcoma human cells. Proteomics 2013; 13(22): 3293–3297

[47]	Van Hoof D, Muñoz J, Braam SR, Pinkse MWH, Linding R, Heck AJR, Mummery CL, Krijgsveld J. Phosphorylation dynamics during early differentiation of human embryonic stem cells. Cell Stem Cell 2009; 5(2): 214–226

[48]	Brill LM, Xiong W, Lee KB, Ficarro SB, Crain A, Xu Y, Terskikh A, Snyder EY, Ding S. Phosphoproteomic analysis of human embryonic stem cells. Cell Stem Cell 2009; 5(2): 204–213

[49]	Swaney DL, Wenger CD, Thomson JA, Coon JJ. Human embryonic stem cell phosphoproteome revealed by electron transfer dissociation tandem mass spectrometry. Proc Natl Acad Sci USA 2009; 106(4): 995–1000

[50]	Rigbolt KT, Prokhorova TA, Akimov V, Henningsen J, Johansen PT, Kratchmarova I, Kassem M, Mann M, Olsen JV, Blagoev B. System-wide temporal characterization of the proteome and phosphoproteome of human embryonic stem cell differentiation. Sci Signal 2011; 4(164): rs3

[51]	Xu H, Baroukh C, Dannenfelser R, Chen EY, Tan CM, Kou Y, Kim YE, Lemischka IR, Ma’ayan A. ESCAPE: database for integrating high-content published data collected from human and mouse embryonic stem cells. Database (Oxford) 2013; 2013: bat045

[52]

[53]	Yu J, Hu K, Smuga-Otto K, Tian S, Stewart R, Slukvin II, Thomson JA. Human induced pluripotent stem cells free of vector and transgene sequences. Science 2009; 324(5928): 797–801

[54]	Van Hoof D, Muñoz J, Braam SR, Pinkse MWH, Linding R, Heck AJR, Mummery CL, Krijgsveld J. Phosphorylation dynamics during early differentiation of human embryonic stem cells. Cell Stem Cell 2009; 5(2): 214–226

[55]	Munoz J, Low TY, Kok YJ, Chin A, Frese CK, Ding V, Choo A, Heck AJR. The quantitative proteomes of human-induced pluripotent stem cells and embryonic stem cells. Mol Syst Biol 2011; 7: 550

[56]

Sridharan R, Gonzales-Cope M, Chronis C, Bonora G, McKee R, Huang C, Patel S, Lopez D, Mishra N, Pellegrini M, Carey M, Garcia BA, Plath K. Proteomic and genomic approaches reveal critical functions of H3K9 methylation and heterochromatin protein-1γ in reprogramming to pluripotency. Nat Cell Biol 2013; 15(7): 872–882

[57]	Phanstiel DH, Brumbaugh J, Wenger CD, Tian S, Probasco MD, Bailey DJ, Swaney DL, Tervo MA, Bolin JM, Ruotti V, Stewart R, Thomson JA, Coon JJ. Proteomic and phosphoproteomic comparison of human ES and iPS cells. Nat Methods 2011; 8(10): 821–827

[58]	Perez-Iratxeta C, Palidwor G, Porter CJ, Sanche NA, Huska MR, Suomela BP, Muro EM, Krzyzanowski PM, Hughes E, Campbell PA, Rudnicki MA, Andrade MA. Study of stem cell function using microarray experiments. FEBS Lett 2005; 579(8): 1795–1801

[59]

Sansone SA, Rocca-Serra P, Field D, Maguire E, Taylor C, Hofmann O, Fang H, Neumann S, Tong W, Amaral-Zettler L, Begley K, Booth T, Bougueleret L, Burns G, Chapman B, Clark T, Coleman LA, Copeland J, Das S, de Daruvar A, de Matos P, Dix I, Edmunds S, Evelo CT, Forster MJ, Gaudet P, Gilbert J, Goble C, Griffin JL, Jacob D, Kleinjans J, Harland L, Haug K, Hermjakob H, Ho Sui SJ, Laederach A, Liang S, Marshall S, McGrath A, Merrill E, Reilly D, Roux M, Shamu CE, Shang CA, Steinbeck C, Trefethen A, Williams-Jones B, Wolstencroft K, Xenarios I, Hide W. Toward interoperable bioscience data. Nat Genet 2012; 44(2): 121–126

[60]

Ho Sui SJ, Begley K, Reilly D, Chapman B, McGovern R, Rocca-Sera P, Maguire E, Altschuler GM, Hansen TAA, Sompallae R, Krivtsov A, Shivdasani RA, Armstrong SA, Culhane AC, Correll M, Sansone SA, Hofmann O, Hide W. The Stem Cell Discovery Engine: an integrated repository and analysis system for cancer stem cell comparisons. Nucleic Acids Res 2012; 40(Database issue): D984–D991

[61]	Jung M, Peterson H, Chavez L, Kahlem P, Lehrach H, Vilo J, Adjaye J. A data integration approach to mapping OCT4 gene regulatory networks operative in embryonic stem cells and embryonal carcinoma cells. PLoS ONE 2010; 5(5): e10709

[62]	Mallon BS, Chenoweth JG, Johnson KR, Hamilton RS, Tesar PJ, Yavatkar AS, Tyson LJ, Park K, Chen KG, Fann YC, McKay RDG. StemCellDB: the human pluripotent stem cell database at the National Institutes of Health. Stem Cell Res (Amst) 2013; 10(1): 57–66

[63]	Costa V, Angelini C, De Feis I, Ciccodicola A. Uncovering the complexity of transcriptomes with RNA-Seq. J Biomed Biotechnol 2010; 2010: 853916.

[64]	Velculescu VE, Zhang L, Vogelstein B, Kinzler KW. Serial analysis of gene expression. Science 1995; 270(5235): 484–487

[65]	Nagaraj SH, Gasser RB, Ranganathan S. A hitchhiker’s guide to expressed sequence tag (EST) analysis. Brief Bioinform 2007; 8(1): 6–21

[66]	Mardis ER. The impact of next-generation sequencing technology on genetics. Trends Genet 2008; 24(3): 133–141

[67]	Uchida S, Gellert P, Braun T. Deeply dissecting stemness: making sense to non-coding RNAs in stem cells. Stem Cell Rev 2012; 8(1): 78–86

[68]	Asmann YW, Wallace MB, Thompson EA. Transcriptome profiling using next-generation sequencing. Gastroenterology 2008; 135(5): 1466–1468

[69]	Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 2009; 10(3): R25

[70]	Jean G, Kahles A, Sreedharan VT, De Bona F, Ratsch G. RNA-Seq read alignments with PALMapper. Curr Protoc Bioinformatics 2010; Chapter 11: Unit 11 6

[71]	Jiang H, Wong WH. SeqMap: mapping massive amount of oligonucleotides to the genome. Bioinformatics 2008; 24(20): 2395–2396

[72]	Wang K, Singh D, Zeng Z, Coleman SJ, Huang Y, Savich GL, He X, Mieczkowski P, Grimm SA, Perou CM, MacLeod JN, Chiang DY, Prins JF, Liu J. MapSplice: accurate mapping of RNA-seq reads for splice junction discovery. Nucleic Acids Res 2010; 38(18): e178

[73]	Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009; 25(14): 1754–1760

[74]	Guttman M, Garber M, Levin JZ, Donaghey J, Robinson J, Adiconis X, Fan L, Koziol MJ, Gnirke A, Nusbaum C, Rinn JL, Lander ES, Regev A. Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs. Nat Biotechnol 2010; 28(5): 503–510

[75]	Au KF, Jiang H, Lin L, Xing Y, Wong WH. Detection of splice junctions from paired-end RNA-seq data by SpliceMap. Nucleic Acids Res 2010; 38(14): 4570–4578

[76]	Roberts A, Trapnell C, Donaghey J, Rinn JL, Pachter L. Improving RNA-Seq expression estimates by correcting for fragment bias. Genome Biol 2011; 12(3): R22

[77]	Friedländer MR, Chen W, Adamidi C, Maaskola J, Einspanier R, Knespel S, Rajewsky N. Discovering microRNAs from deep sequencing data using miRDeep. Nat Biotechnol 2008; 26(4): 407–415

[78]	Ronen R, Gan I, Modai S, Sukacheov A, Dror G, Halperin E, Shomron N. miRNAkey: a software for microRNA deep sequencing analysis. Bioinformatics 2010; 26(20): 2615–2616

[79]	Hackenberg M, Rodríguez-Ezpeleta N, Aransay AM. miRanalyzer: an update on the detection and analysis of microRNAs in high-throughput sequencing experiments. Nucleic Acids Res 2011; 39(Web Server issue): W132–138

[80]	Huang PJ, Liu YC, Lee CC, Lin WC, Gan RRC, Lyu PC, Tang P. DSAP: deep-sequencing small RNA analysis pipeline. Nucleic Acids Res 2010; 38(Web Server issue): W385–391

[81]	Lewis BP, Burge CB, Bartel DP. Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell 2005; 120(1): 15–20

[82]	Krek A, Grün D, Poy MN, Wolf R, Rosenberg L, Epstein EJ, MacMenamin P, da Piedade I, Gunsalus KC, Stoffel M, Rajewsky N. Combinatorial microRNA target predictions. Nat Genet 2005; 37(5): 495–500

[83]	Betel D, Wilson M, Gabow A, Marks DS, Sander C. The microRNA.org resource: targets and expression. Nucleic Acids Res 2008; 36(Database issue): D149–D153

[84]

Maragkakis M, Reczko M, Simossis VA, Alexiou P, Papadopoulos GL, Dalamagas T, Giannopoulos G, Goumas G, Koukis E, Kourtis K, Vergoulis T, Koziris N, Sellis T, Tsanakas P, Hatzigeorgiou AG. DIANA-microT web server: elucidating microRNA functions through target prediction. Nucleic Acids Res 2009; 37(Web Server issue): W273-276

[85]	Rabilloud T, Chevallet M, Luche S, Lelong C. Two-dimensional gel electrophoresis in proteomics: Past, present and future. J Proteomics 2010; 73(11): 2064–2077

[86]	Aebersold R, Mann M. Mass spectrometry-based proteomics. Nature 2003; 422(6928): 198–207

[87]	Domon B, Aebersold R. Mass spectrometry and protein analysis. Science 2006; 312(5771): 212–217

[88]	Stoevesandt O, Taussig MJ, He M. Protein microarrays: high-throughput tools for proteomics. Expert Rev Proteomics 2009; 6(2): 145–157

[89]	Novak A, Amit M, Ziv T, Segev H, Fishman B, Admon A, Itskovitz-Eldor J. Proteomics profiling of human embryonic stem cells in the early differentiation stage. Stem Cell Rev 2012; 8(1): 137–149

[90]	Gouw JW, Krijgsveld J. MSQuant: a platform for stable isotope-based quantitative proteomics. Methods Mol Biol 2012; 893: 511–522

[91]

Pedrioli PG, Eng JK, Hubley R, Vogelzang M, Deutsch EW, Raught B, Pratt B, Nilsson E, Angeletti RH, Apweiler R, Cheung K, Costello CE, Hermjakob H, Huang S, Julian RK, Kapp E, McComb ME, Oliver SG, Omenn G, Paton NW, Simpson R, Smith R, Taylor CF, Zhu W, Aebersold R. A common open representation of mass spectrometry data and its application to proteomics research. Nat Biotechnol 2004; 22(11): 1459–1466

[92]	Mueller LN, Brusniak MY, Mani DR, Aebersold R. An assessment of software solutions for the analysis of mass spectrometry based quantitative proteomics data. J Proteome Res 2008; 7(1): 51–61

[93]	Cox J, Mann M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat Biotechnol 2008; 26(12): 1367–1372

[94]	Khan Z, Bloom JS, Garcia BA, Singh M, Kruglyak L. Protein quantification across hundreds of experimental conditions. Proc Natl Acad Sci USA 2009; 106(37): 15544–15548

[95]	Lin WT, Hung WN, Yian YH, Wu KP, Han CL, Chen YR, Chen YJ, Sung TY, Hsu WL. Multi-Q: a fully automated tool for multiplexed protein quantitation. J Proteome Res 2006; 5(9): 2328–2338

[96]	Shadforth IP, Dunkley TPJ, Lilley KS, Bessant C. i-Tracker: for quantitative proteomics using iTRAQ. BMC Genomics 2005; 6(1): 145

[97]	Arntzen MO, Koehler CJ, Barsnes H, Berven FS, Treumann A, Thiede B. IsobariQ: software for isobaric quantitative proteomics using IPTL, iTRAQ, and TMT. J Proteome Res 2011; 10(2): 913–920

[98]	Keller A, Eng J, Zhang N, Li XJ, Aebersold R. A uniform proteomics MS/MS analysis platform utilizing open XML file formats. Mol Syst Biol 2005; 1: 2005.0017

[99]	Brusniak MY, Bodenmiller B, Campbell D, Cooke K, Eddes J, Garbutt A, Lau H, Letarte S, Mueller LN, Sharma V, Vitek O, Zhang N, Aebersold R, Watts JD. Corra: Computational framework and tools for LC-MS discovery and targeted mass spectrometry-based proteomics. BMC Bioinformatics 2008; 9(1): 542

[100]

Tsou CC, Tsai CF, Tsui YH, Sudhir PR, Wang YT, Chen YJ, Chen JY, Sung TY, Hsu WL. IDEAL-Q, an automated tool for label-free quantitation analysis using an efficient peptide alignment approach and spectral data validation. Mol Cell Proteomics 2010; 9(1): 131–144

[101]

Mortensen P, Gouw JW, Olsen JV, Ong SE, Rigbolt KTG, Bunkenborg J, Cox J, Foster LJ, Heck AJR, Blagoev B, Andersen JS, Mann M. MSQuant, an open source platform for mass spectrometry-based quantitative proteomics. J Proteome Res 2010; 9(1): 393–403

[102]

Hokke CH, Fitzpatrick JM, Hoffmann KF. Integrating transcriptome, proteome and glycome analyses of Schistosoma biology. Trends Parasitol 2007; 23(4): 165–174

[103]

Nielsen JA, Lau P, Maric D, Barker JL, Hudson LD. Integrating microRNA and mRNA expression profiles of neuronal progenitors to identify regulatory networks underlying the onset of cortical neurogenesis. BMC Neurosci 2009; 10(1): 98

[104]

Liu F, Lu J, Hu W, Wang SY, Cui SJ, Chi M, Yan Q, Wang XR, Song HD, Xu XN, Wang JJ, Zhang XL, Zhang X, Wang ZQ, Xue CL, Brindley PJ, McManus DP, Yang PY, Feng Z, Chen Z, Han ZG. New perspectives on host-parasite interplay by comparative transcriptomic and proteomic analyses of Schistosoma japonicum. PLoS Pathog 2006; 2(4): e29

[105]

Tarun AS, Peng X, Dumpit RF, Ogata Y, Silva-Rivera H, Camargo N, Daly TM, Bergman LW, Kappe SHI. A combined transcriptome and proteome survey of malaria parasite liver stages. Proc Natl Acad Sci USA 2008; 105(1): 305–310

[106]

Unwin RD, Whetton AD. Systematic proteome and transcriptome analysis of stem cell populations. Cell Cycle 2006; 5(15): 1587–1591

[107]

Xu H, Lemischka IR, Ma’ayan A. SVM classifier to predict genes important for self-renewal and pluripotency of mouse embryonic stem cells. BMC Syst Biol 2010; 4(1): 173

[108]

Ho Sui SJ, Begley K, Reilly D, Chapman B, McGovern R, Rocca-Sera P, Maguire E, Altschuler GM, Hansen TA, Sompallae R, Krivtsov A, Shivdasani RA, Armstrong SA, Culhane AC, Correll M, Sansone SA, Hofmann O, Hide W. The Stem Cell Discovery Engine: an integrated repository and analysis system for cancer stem cell comparisons. Nucleic Acids Res 2012; 40(Database issue): D984–D991