Introduction
In the multi-cellular organisms of higher plants, different cells are specialized into different identities and so compose the organism. The identities of individual plant cells are specified by response to numerous different components, such as DNA, RNA, proteins and small molecules, which participate in the regulatory networks to coordinate multiple biological functions of cells. A global overview of gene expression will promote the elucidation of the transcriptional networks linked to cellular identity and function. The availability of high-throughput experimental methods has enabled researchers to determine the expression levels for thousands of genes, and promises to provide such cellular information on a genome- and proteome-wide scale. Thus the transcriptome profiles in a particular type of cells can ultimately be analyzed and recorded, and such a fingerprint of gene-expression patterns should provide useful markers for specific cells and ultimately contribute to understanding the molecular mechanism of cell-fate decisions. However, the efficiency of information derived from RNA and protein expression profiling is dependent on the specificity of the biological starting materials. It has been challenging to obtain one kind of cell at a specific developmental stage and/or from a unique location in plants, and thus most transcriptome profiling is carried out with whole-tissue resolution.
Tissue transcriptional profiling analysis has provided important spatial information of gene expression, even though plant tissues almost invariably contain varying proportions of multiple cell types. To investigate the transcriptional profiles of specific cell types, different techniques have been adopted. These techniques are applied respectively at the two different steps: the isolation methods of specific types of cells, and the obtaining and analyzing of cell transcriptomes. Techniques for isolating specific cells mainly include manual and enzymatic micromanipulation, laser microdissection (LMD) and fluorescence activated cell sorting (FACS) (
Hu and Yang, 2002;
Birnbaum et al., 2003;
Nelson et al., 2006). cDNA library construction and DNA microarray are widely used to analyze cell transcriptomes, and the RNA-Seq (high-throughput cDNA sequencing) technique by next-generation (NG) sequencing is another emerging powerful tool (
Dresselhaus et al., 1994;
Honys and Twell, 2004;
Tang et al., 2009). In this review, we summarize the current techniques for analyzing the transcriptomes of specific types of cells, especially in male and female gametes, zygotes and early proembryos of higher plants.
Sexual reproduction is an important biological event not only for evolution but also for breeding in higher plants. Plant reproduction occurs through the production of gametes from the gametophyte as a haploid generation. In flowering plants, there are highly reduced male and female gametophytes, the pollen grain and embryo sac, respectively, which consist of only a few cells. During double fertilization, one sperm fuses with an egg cell and another sperm fuses with a central cell to produce embryo and endosperm, respectively (
Russell, 1992). Embryo development in higher plants is an important phase of the life cycle, in which the main body plan of a plantlet is established. In this article, we summarize the applications of techniques for investigating cell type-specific transcriptomes in sexual plant reproduction, and review recent progress in understanding the mechanisms underlying gamete and gametophytic generation, egg cell fertilization, zygote activation and asymmetrical division, orientated differentiation of apical and basal cells from two-celled proembryos, and polarity formation in proembryo development in higher plants.
Techniques for isolating specific types of cells
Micromanipulation
Micromanipulation, a powerful skill, is used to isolate single cells from complex tissues and organs, especially for egg cells and zygotes. There are three major isolation methods: direct manual microdissection, enzymatic maceration assisted by shaking or grinding, and the combination of enzymatic maceration and microdissection (
Fu et al., 1996;
Hu and Yang, 2002). Enzyme digestion combined with shaking requires extended incubation in enzymatic solution, and therefore can affect the vitality of living cells and so has been infrequently adopted. The other two methods, direct manual microdissection and enzyme digestion combined with microdissection, are used more universally in many plants. As early as 1985, viable embryo sacs and their protoplasts in
Nicotiana tabacum were isolated from ovaries using enzymatic degradation and grinding methods (
Hu et al., 1985). The isolation methods for embryo sac cells and zygotes are well developed in a wide range of plant species: e.g.,
Plumbago zeylanica (
Huang and Russell, 1989),
Zea mays (
Kranz et al., 1991;
Kumlehn et al., 1998),
Triticum aestivum (
Holm et al., 1994),
Brassica napus (
Katoh et al., 1997),
Oryza sativa (
Zhao et al., 2000),
Torenia fournieri (
Chen et al., 2008),
Antirrhinum majus and
Arabidopsis thaliana (
Wang et al., 2010).
Compared with female gametes, male gamete cells are usually exposed and easily isolated. Grinding and osmotic shock methods are widely used to release viable sperm cells from mature pollen grains, and the sperm cells can then be purified using Percoll density gradient centrifugation or manual collection with a micropipette under a microscope (
Hu and Yang, 2002). By using these methods, viable sperm cells have been successfully isolated from species such as tobacco (
Cao et al., 1996),
P. zeylanica (
Russell 1986;
Zhang et al., 1998),
Lilium longiflorum (
Tanaka, 1988) and maize (
Kranz et al., 1991). The isolated specific types of cells were used as samples for cDNA library construction and microarray analysis for investigating the cell specific transcriptomes, which greatly broadened knowledge of gene expression in sperm, egg and zygote cells (
Dresselhaus et al., 1994;
Lê et al., 2005;
Ning et al., 2006;
Sprunck et al., 2005;
Gou et al., 2009). In addition to the above, other methods, such as microcapillaries or micropipettes have been used to directly obtain the contents of leaf epidermal, mesophyll, companion and guard cells (
Karrer et al., 1995;
Brandt et al., 1999,
2002). Although micromanipulation is a powerful method for isolating specific types of cells, its application is relatively limited since it is time consuming and requires extensive skills.
Laser microdissection
Laser microdissection (LMD) is a technique based on the application of laser technology. It circumvents some limitations of micromanipulation, and provides a rapid and reliable method to procure pure populations of cells from specific regions of tissue sections. Plant tissues containing cells of interest are first fixed with appropriate fixatives, and then embedded to prepare sections for LMD. The dissection process is accomplished by adapting a laser beam device with the microscopy and software system. The laser dissection device is used to incise the specific types of cells from tissue sections, which are immobilized on a slide under direct view of a microscope (
Emmert-Buck et al., 1996;
Simone et al., 1998). The isolated cell samples can then be collected by various methods, for example, with an adhesive film in conjunction with a collector tube under the slide (
Kerk et al., 2003), or by catapulting the cells with laser pulses and harvesting them in a collector above the sample (
Nakazono et al., 2003).
The first report of LMD used in plant cells was for isolating phloem cells of rice (
Asano et al., 2002). After optimization of the isolation procedures, many different kinds of cells from a variety of plant tissues and species were successfully isolated (
Kerk et al., 2003;
Nakazono et al., 2003). The most important application of LMD in researching sexual plant reproduction is to isolate specific cells of female gametes and gametophytes, as well as embryos, and helps to overcome the difficulty of accessing specific cells from the complex maternal tissues. With the LMD-harvested pure cells, the transcriptomes of specific cells were investigated and provided sufficient information for gene expression in female gametes, zygotes and embryos (
Casson et al., 2005;
Le et al., 2007;
Wuest et al., 2010). The LMD technique has the advantages of ease of use and automating the tedious procedures of cell collection. However, it is still very labor intensive in the sample preparation steps and it is easy to contaminate samples with neighboring cells, especially for small or few layers of cells and tissues such as in meristem.
Fluorescence activated cell sorting
The fluorescence activated cell sorting (FACS) was developed for quick and accurate isolation of specific cells. FACS is the technique of choice to purify specific cell populations, and allows the purification of individual cells based on size, granularity and fluorescence. The purification process requires a flow cytometer with sorting capacity and appropriate software. In the process of the sorting, suspension cells are passed as a stream in droplets with each containing a single cell in front of a laser. The fluorescence detection system detects the cells of interest, based on predetermined fluorescent parameters. The instrument first applies a charge to the droplet containing a cell of interest, and then an electrostatic deflection system facilitates collection of charged droplets into appropriate collection tubes (
Givan, 2001). There are two common approaches to label specific cells with fluorescence, one is cell pre-staining with fluorescently-tagged monoclonal antibodies that recognize specific surface-markers on the desired cell population (
Herzenberg and De Rosa, 2000), and the other is genetic modification of the cells of interest with fluorescent protein driven by cell-specific promoters (
Brandt et al., 1999).
The first application of FACS in plant cell-specific transcriptome analyses was reported in the sperm cells of maize (
Engel et al., 2003). Thereafter, the method was used to isolate sperm cells of
Arabidopsis for transcriptome analysis at a genome-wide level (
Borges et al., 2008). Apart from sperm cells, FACS was adopted to isolate different types of protoplasts from
Arabidopsis root, which provided materials for obtaining detailed spatiotemporal gene expression maps of the root (
Birnbaum et al., 2003;
Brady et al., 2007). Although FACS requires specialized equipment and training, it is the preferred method for isolating highly purified cell populations. Up to now, the application of FACS in plant cells has been relatively limited since it relies on the prior identification of cell-specific markers; however, the recent availability of increasingly cell type-specific fluorescent reporter lines will greatly advance the progress of FACS application.
Techniques of cell type-specific transcriptome analysis
cDNA library construction from limited cells
In higher eukaryotes, biological processes such as cell growth and differentiation are regulated by cell-specific gene expression. To understand the mechanism of molecular regulation, the transcriptomes for special types of cells should be studied to provide an overview of their expression profiles. The construction of cDNA libraries for specific cells serves as a useful method for decades. However, normal cDNA library construction is usually insufficient for isolating rare RNA at the ng-level from limited cell samples. To overcome the limitation of cell population, PCR-based cDNA library construction was developed to generate cDNA libraries at the level of a few cells, and the pre-amplification of the first-strand cDNA by long-distance PCR provided the necessary amount of cDNA for library construction (
Belyavsky et al., 1989;
Dresselhaus et al., 1994). The wide existence of abundant species of cDNA hampers the identification of genes with low abundance. Subtractive cDNA hybridization, a kinetic method to deplete high abundance cDNA between two cell samples, is a powerful approach to identify and isolate differentially expressed genes. The subtracted cDNA libraries greatly enrich the differentially expressed sequences, and enable identification of cDNA corresponding to species of mRNA present at very low levels or only in a small fraction of cells from complex tissues (
Diatchenko et al., 1996;
Lê et al., 2005;
Ning et al., 2006;
Hu et al., 2010). These specialized cDNA libraries from a variety of cell types are a necessary prerequisite to more detailed analysis of the distribution of mRNA populations (
Sprunck et al., 2005;
Okada et al., 2006;
Yang et al., 2006). Further comparison of library transcripts derived from different cell types leads to ease of isolation of the expressed sequences unique to specific cell-types (
Sprunck et al., 2005;
Gou et al., 2009). cDNA library construction provides an approach for investigating the transcriptome profiles of different cells, but the information obtained greatly depends on the sequencing of library transcripts, which is often time consuming and expensive.
DNA microarray
Apart from the cDNA library, the DNA microarray technique is another important tool in the research of functional genomics, and can help researchers bypass cDNA library construction and avoid bias generation in the sub-cloning process. The advantage of this method lies in miniaturization, automation and parallelism, and it permits large-scale and genome-wide acquisition of quantitative transcriptome information from different cell samples. At present, there are two main approaches in DNA microarray fabrication, involving
in situ synthesis of oligonucleotides (oligonucleotide microarrays) and deposition of pre-synthesized DNA fragments (cDNA microarrays) on solid surfaces (
Aharoni and Vorst, 2002). DNA microarray technology has been widely used in functional genomics, disease genomics, systems biology and other fields, with the main applications in comprehensive, simultaneous gene expression monitoring and in DNA variation analyses for the identification of mutations and genotyping of polymorphisms. For plant cells, the DNA microarray technique potentially allows a global analysis of gene expression at whole-genome level, and has been used to compare transcriptional changes in different types of cells. These cell-specific transcriptome analyses provide important insights into the regulation networks that are linked to cellular identities and functions (
Borges et al., 2008;
Haerizadeh et al., 2009;
Le et al., 2010;
Wei et al., 2010). Exciting new information will be gained in the years to come, not only from genome-wide expression analyses on a few model plant species, but also from extensive application to less thoroughly studied species on a relatively limited scale (
Okada et al., 2007;
Gou et al., 2009). Since the DNA microarray has already become a mature systemic platform, there are standardized protocols for sample preparation, experiment execution and data analysis, along with large public databases available to all scientists. However, the shortcoming of DNA microarray is that it is a ‘closed system’, which can only detect the characteristics of known sequences (or with limited variation) in a large number of biological samples, and rare RNA expressed at low levels is usually beyond its detection capability.
RNA-Seq by NG sequencing
Since its invention, DNA sequencing has played an essential role in the advancement of molecular biology. The first-generation sequencing technique was based on the fluorescently labeled Sanger’s chain-termination method developed in the mid-1970s (
Sanger, 1988). The NG sequencing technology, which depends on massive parallel DNA sequencing platforms, greatly increased analysis speed and reduced sequencing costs (
Shendure and Ji, 2008). Three widely used NG commercial platforms (Illumina Genome Analyzer, Roche 454 Genome Sequencer and Life Technologies SOLiD System) were all invented and developed toward the end of 1990s, and commercialized in the middle of the first decade of the century. All utilize the sequencing-by-synthesis strategy, despite their variation in DNA array formation, fragment amplification, enzyme-based sequencing biochemistry and subsequent sequential analysis (
Ansorge, 2009). Next-generation DNA sequencing, which dramatically accelerated biologic and biomedical research by making the comprehensive analysis of genomes inexpensive, and enabling it become routine and widespread, is also starting to be used for transcriptome analysis of plant cells. Comparative research between traditional capillary sequencing and NG ultra high-throughput techniques was carried out in
Arabidopsis,
Eschscholzia californica and
Persea americana, and showed that NG sequencing was a dramatic advance over capillary-based sequencing in terms of sequence coverage (
Wall et al., 2009). In recent reports, NG sequencing has been used for comprehensive studies of transcriptomes at single-base resolution in rice (
Zhang et al., 2010), and investigation of DNA methylation, activating and repressive histone modifications, small RNAs and mRNA in the maize genome (
Elling and Deng, 2009). Moreover, cell-specific digital gene expression profiling with only a single mouse blastomere was successfully carried out using mRNA-Seq assay ~75% more expressed genes were detected with this technique compared to microarray, and this assay identified 1753 previously unknown splice junctions (
Tang et al., 2009). Although microarray is still the most widely used technique in current reports of plant cell transcriptome studies, RNA-Seq by NG sequencing has already displayed extraordinary charisma in all areas of genome research, and it is increasingly replacing DNA microarrays in cell-specific transcriptome analysis.
Application of the techniques in plant sexual reproduction
Cell type-specific transcriptome analysis has been carried out on many different kinds of cells in higher plants. Here we focus on the application in gene expression analysis of male and female gametes, zygotes and early proembryos, and summarize some recent advances in sexual plant reproduction.
Male gametes
A pollen grain is the highly reduced haploid male gametophyte in flowering plants, and consists of just two or three cells when it matures and is released from the anther. Mature pollen grains are carried by wind or insects onto the stigma of the pistil, where the pollen germinates and produces pollen tubes. Then the pollen tubes grow through the style into the female gametophyte in the ovule. In the tips of pollen tubes, two sperm cells are delivered into the embryo sac and fuse with an egg cell and a central cell, respectively. This double fertilization event along with the functional specialization of the male gametophyte is considered to be a key innovation in the evolutionary success of flowering plants. During recent decades, there have been great advances in understanding the molecular regulation network of male gamete and gametophyte development. The first cDNA libraries of male gametes were constructed in the mature pollen of maize and
Tradescantia. paludosa, and further northern hybridizations revealed the existence of pollen-specific cDNA clones (
Stinson et al., 1987). In recent years, several cDNA libraries from mature anthers of tomato, pollen of
Brassica campestris, and generative cells of lily were constructed, and library screenings with northern blot analysis led to the isolation of male-gamete-specific genes, such as
LAT52 and
LGC1 (
Twell et al., 1989;
Theerakulpisut et al., 1991;
Xu et al., 1998,
1999). Since large scale EST (expressed sequence tag) sequencing was not applied in these studies, the information obtained about gene expression in the male gametes was still relatively limited.
The wide application of microarray and large-scale cDNA library sequencing in the first decade of this century greatly promoted the characterization of transcriptomes in male gametes, such as pollen development at different stages in
Arabidopsis (
Honys and Twell, 2004), rapeseed (
Whittle et al. 2010) and rice (
Hobo et al., 2008;
Wei et al., 2010). These provide an overview of the dynamic gene expression in male gametes, revealing a large-scale repression of early program genes and the activation of a unique late gene-expression in the progress of pollen maturation. More researches, based on microarray analysis, compared the transcriptional profiles of mature/hydrated pollens with sporophytic tissues in the tri-cellular pollen model plant
Arabidopsis (
Becker et al., 2003;
Honys and Twell, 2003;
Pina et al., 2005) and the bi-cellular pollen plant soybean (
Haerizadeh et al., 2009). All these studies indicated that the pollen possesses a restricted and unique repertoire of genes, with a significantly greater proportion of specifically expressed genes compared to sporophyte tissues. Moreover, comparison of the upregulated transcription factors between soybean and
Arabidopsis reveals some divergence in the numbers and kinds of regulatory proteins in both species.
Considering the relative complexity of the content in pollen, the transcriptomes of generative cells in lily were studied by EST analysis of cDNA library and microarrays (
Okada et al., 2006,
2007). The results showed that a subset of genes was specifically expressed in generative cells, and that some genes showed significant similarities to the male gametophyte-specific genes from maize and
Arabidopsis, indicating conservation of such genes across plant species. Using the technique of directly isolating sperm cells, the first cDNA library from the sperm cells of rice was constructed (
Gou et al., 2001), and the sperm cell cDNA libraries of tobacco (
Xu et al., 2002;
Xin et al., 2010) and maize (
Engel et al., 2003) were also obtained. Screening and sequencing of the sperm-cell-specific libraries enabled the identification of the special sperm-cell-expressed transcripts, and revealed the complex and special composition of mRNAs in the sperm. In the isolated sperm cells of
Arabidopsis using FACS, the genome-scale transcriptome analysis with the GeneChip technique revealed that sperm cells possessed distinct transcriptional information compared with those of seedlings and also pollen (
Borges et al., 2008). To study the dimorphism of sperm cells, two populations of the dimorphic sperm cells in
P. zeylanica were collected to construct cDNA libraries and carry out microarray analysis – this revealed the distinct complements of expressed transcripts in the two kinds of sperm cells (
Gou et al., 2009). All these analyses indicate that male gametes possess a unique composition of gene expression compared with sporophytic tissues, and that a significant proportion of transcripts displayed a male gamete cell-specific expression. The identification and functional characterization of the genes promises the elucidation of their regulatory networks operating in male gamete development and double fertilization of higher plants.
Female gametes
In flowering plants, the haploid female gametophyte is a typical seven-celled structure with four types of cells: an egg cell, a central cell, two synergid cells, and three antipodal cells. These cells derive from a functional megaspore and perform their respective essential functions required for double fertilization and seed development. Differentiation of these distinct cells likely involves the coordinated changes of gene expression regulated by transcription factors. Therefore, understanding of female gametophyte cell differentiation and function is required to study the gene regulatory networks operating in each cell type. For this purpose, many researches have been conducted to elucidate the molecular mechanisms of cell specification and differentiation in the female gametes and gametophytes. The first representative cDNA library of egg cells in higher plants was generated from 128 isolated maize egg-cells using a reverse transcriptase/polymerase chain reaction (RT-PCR) method (
Dresselhaus et al., 1994).
Egg Apparatus 1 of maize from this library was identified and proved to function in guiding pollen tubes into the micropyle (
Márton et al., 2005). A more detailed transcriptome assay was carried out with cDNA libraries from female gametophytes and egg cells of maize (
Yang et al., 2006), wheat (
Kumlehn et al., 2001;
Sprunck et al., 2005) and tobacco (
Ning et al., 2006), and more ESTs were sequenced and analyzed for investigating transcription profiles in female gametes and for isolating the egg cell-specific genes.
For identifying specific and differentially expressed transcripts in different gamete cells, the subtractive cDNA libraries between maize egg cells and central cells were constructed, and 340 differentially expressed clones were identified through microarray screening (
Lê et al., 2005). Taking advantage of the protruding embryo sacs and easily obtained member cells of
T. fournieri, the cDNA library from manually-isolated synergid cells was constructed, and then several cysteine-rich polypeptides, which function as pollen tube attractants secreted from synergid cells, were successfully identified (
Okuda et al., 2009). To determine cell-type-specific expression profiles in the female gametophyte of
Arabidopsis, the LMD technique was used to isolate individual cells including the synergids and the two female gametes (egg and central cells) as samples for microarray analysis, and revealed that these distinct types of cells were characterized by their differentially expressed post-transcriptional regulatory modules and metabolic pathways (
Wuest et al., 2010).
Besides direct analysis of the specific gamete cells, another widely applied approach to characterize genes involved in female gamete development is based on analysis in the ovules. Comparative transcriptome analyses between the normal ovules and those lacking the complete embryo sacs such as the
determinate infertile1 (
dif1),
sporocyteless (
spl) and
coatlique (
coa) mutant ovules were performed by several different groups with the GeneChip hybridization (
Yu et al., 2005;
Johnston et al., 2007;
Steffen et al., 2007). Similar research was carried out by large-scale real-time RT-PCR screening or high-throughput RNA-Seq and whole-genome tiling arrays in the
dif1 ovules and
myb98 ovules that were impaired in pollen tube attraction (
Jones-Rhoades et al., 2007;
Wang et al., 2010). Apart from comparative analysis with mutants, the cDNA libraries of rapeseed ovules were also constructed and 10468 ESTs obtained, providing new insights into the transcriptional profile of rarely studied ovules (
Whittle et al., 2010). The results of the analyses defined a large number of embryo sac-expressed transcripts, and some candidate transcripts were found to be expressed in specific cells of the female gametophytes. Since these specific transcripts of gamete cells may act during embryo sac development, fertilization and even embryogenesis, further functional characterization of these genes will promote elucidating the molecular mechanisms and regulatory networks in the processes of development.
Zygotes and embryos
Embryogenesis is a crucial developmental event in the life cycle of flowering plants. During embryogenesis, a single-celled zygote follows a defined pattern of cell division and differentiation to form a mature embryo. To investigate the developmental expression of mRNAs in embryos, the isolated embryos from many species (e.g., soybean, rapeseed, barley and rice) were first used for the cDNA library construction and the isolation of embryo specific transcripts (
Goldberg et al., 1981;
Loader et al., 1993;
Ranford et al., 2002;
Ito et al., 2004). Hand-dissected embryo proper and suspensor cells from the globular embryos of the scarlet runner bean were used for differential screening to identify suspensor-specific mRNAs (
Weterings et al., 2001). In maize and rice, the expressed genes during the embryo development were investigated by DNA microarray and EST sequencing (
Lee et al., 2002;
Ge et al., 2008), providing more detailed molecular insights into embryonic development. For more specific analysis of specific cells of embryos, LMD was applied to isolate the apical and basal domains of the globular-, heart- and torpedo-stage embryos from tissue sections of
Arabidopsis (
Casson et al., 2005;
Spencer et al., 2007). In the development of embryos, the transition from globular embryo to heart embryo is a key phase of polarity establishment and organ differentiation. Global transcriptional profiling with specific domains of embryos not only enabled the isolation of embryo cell type-specific genes, but also revealed the dynamics and complexity of gene expression of the embryos in both spatial and temporal dimensions, which provided much useful information for understanding of embryo polarity establishment and organ differentiation.
For investigating gene expression in zygotes and early embryos, a subtractive cDNA library was constructed to compare the gene expression profiles between isolated egg cells and zygotes of tobacco. The identification of many zygote-expressed genes revealed that zygotic gene activation occurs shortly after fertilization in higher plants (
Ning et al., 2006). The first division of zygotes is a critical event during early embryogenesis, and thereafter, the apical and basal daughter cells become specified to follow their different developmental pathways. To gain understanding of the processes, the transcriptional profiles of the two-celled proembryos in wheat (
Sprunck et al., 2005), the apical and basal cells of
in vitro two-celled proembryos in maize (
Okamoto et al., 2005) and the
in vivo apical and basal cells in tobacco (
Hu et al., 2010) were obtained and investigated. Using an
in vitro zygote culture system, the two-celled proembryos with the same size of cells from symmetric division of zygotes were obtained as a comparison sample. The differential gene expression in the two kinds of different zygotic division processes, the asymmetric division
in vivo and symmetric division
in vitro, was well studied (Hu et al. unpublished data). The analyses showed a global view of the transcript profiles in the different two-celled proembryos, and the apical and basal cells, respectively. The identification of transcripts with specific expressions, which were up- or downregulated only in the apical or basal cells and in the asymmetric/symmetric two-celled proembryos, will provide important clues for understanding of zygote activation and division, and proembryo polarity formation and differentiation.
To identify gene expressions in the specific stages of seed development, LMD was also applied to dissect different compartments of the entire seeds (e.g., endosperm, suspensor, embryo proper, endothelium, inner integument, outer integument, epidermis and hilum from a globular-stage soybean seed). The global gene-expression profiles in these compartments were all investigated using microarray-based analysis, and approximately 20000 diverse transcripts were found in the whole-mount globular-stage soybean seeds, with small sets of seed region-specific mRNAs (
Le et al., 2007). In
Arabidopsis, the transcription profiles of ovules or seeds from fertilization to maturation and germination were obtained and studied, including ovules with an unfertilized egg cell, zygotes, globular embryos, cotyledon embryos, mature green embryos, postmature green embryos and postgermination seedlings. The amount of expressed unique mRNAs varied from about 9000 to 14000 at different developmental stages with about 7000 mRNAs detected at all stages of tested seeds (
Le et al., 2010). This research provided a detailed map of gene expression in various seed compartments at different developmental stages and enabled the identification of embryo, endosperm and seed coat-specific transcripts. These ovules or seed-specific genes should facilitate the elucidation of regulatory networks for programming seed development.
Future perspectives
Although the cell-specific transcriptome analysis reviewed here provides much information about gene expression in the gamete cells, zygotes and embryos, and has greatly enhanced understanding of the molecular mechanisms involved in regulating the sexual reproduction process, many challenges still remain. The first challenge is that it is currently very difficult to elucidate the regulation network controlling gametogenesis and embryogenesis. Although transcriptome information of angiosperm gametes and embryos has greatly increased in recent years, there is still a need for larger-scale, more specific and extensive genome-wide studies, especially for processes of zygote activation and directional division, and embryogenesis and early polar axis establishment. The application of FACS and RNA-Seq by NG sequencing will accelerate genome-wide detection of mRNA expression levels in the sexual reproduction process. There is still much work remaining to elucidate the detailed functions of cell-specific transcripts. Another challenge is whether the expressed mRNAs indeed translate to functional proteins. Proteomic approaches are useful tools for investigating transcriptional networks at the cellular level. Improved high-throughput proteomics techniques have shifted attention to protein profiling, which attempts to identify all proteins present in particular tissues or cells (
Baginsky and Gruissem, 2006). A maize proteome map has been established in isolated egg cells (
Okamoto et al., 2004), and more sensitive methods for cell-specific proteomic analysis will hopefully be established in the near future. Despite the challenges, the recent development in cell type-specific transcriptome and proteome analysis techniques holds great promise for the future. Analysis of transcriptomes and proteomes, and further identification of gene functions and signaling pathways in the gametes and embryos, will ultimately lead to the identification of the molecular networks that regulate the processes of sexual plant reproduction.
Higher Education Press and Springer-Verlag Berlin Heidelberg