Establishment of core collection for Chinese tea germplasm based on cultivated region grouping and phenotypic data

Xinchao WANG , Liang CHEN , Yajun YANG

Front. Agric. China ›› 2011, Vol. 5 ›› Issue (3) : 344 -350.

PDF (130KB)
Front. Agric. China ›› 2011, Vol. 5 ›› Issue (3) : 344 -350. DOI: 10.1007/s11703-011-1097-z
RESEARCH ARTICLE
RESEARCH ARTICLE

Establishment of core collection for Chinese tea germplasm based on cultivated region grouping and phenotypic data

Author information +
History +
PDF (130KB)

Abstract

Tea is an important economic crop in China. About 2665 accessions of tea germplasm (Camellia spp.) are conserved in the China National Germplasm Tea Repository. However, only a small fraction of the collections have been exhaustively used in tea plant improvement programs. The objective of the present study was to develop a core collection of tea plant to enhance the utilization of genetic resources in improvement programs and simplify their management. For this purpose, based on the cultivated region grouping and phenotypic trait data, a core collection of tea plant was established using logarithm proportion and Ward’s cluster method. Approximately 20% of the accessions were then randomly selected from these distinct groups to form a core collection of 532 accessions. Different statistical methods including comparison of mean using Newman-Keuls test, variance using Levene’s test, and frequency distribution using Chi-square test for the traits validated that the variation present in the initial collection was retained in the core collection. The Shannon-Weaver diversity index for different traits was also similar in the core and initial collection. The phenotypic correlations among different quantitative traits that may be under the control of coadapted gene complexes were also preserved in the core collection. This core collection will play an important role in the utilization of tea germplasms.

Keywords

tea plant (Camellia sinensis) / germplasm / core collection / cultivated region / phenotypic trait

Cite this article

Download citation ▾
Xinchao WANG, Liang CHEN, Yajun YANG. Establishment of core collection for Chinese tea germplasm based on cultivated region grouping and phenotypic data. Front. Agric. China, 2011, 5(3): 344-350 DOI:10.1007/s11703-011-1097-z

登录浏览全文

4963

注册一个新账户 忘记密码

Introduction

Tea plant originated from the Yunnan Province of south-west China, where there are rich and diverse tea genetic resources (Yu, 1986). In the past two decades, tea genetic resources have been broadly collected, conserved, evaluated, and utilized in China. Two permanent exsitu conservation facilities, the China National Germplasm Tea Repository (CNGTR), including the China National Germplasm Hangzhou Tea Repository (CNGHTR) in the Tea Research Institute of the Chinese Academy of Agricultural Sciences (TRICAAS) and the Menghai Tea Repository Branch (CNGMTRB) in the Tea Research Institute Yunnan Academy of Agricultural Sciences (TRIYAAS), were established in 1990. Up to 2004, 2665 accessions including wild tea plants, landraces, improved cultivars, introduced varieties, genetic materials, and related species were preserved in the CNGTR (Chen et al., 2004). Agronomic traits, made tea quality, chemical components, tolerance, and resistance to biotic and abiotic stresses, of more than 1600 tea genetic resource accessions have been evaluated and appraised using multidisciplinary approaches in the past 20 years, and several elite and/or special germplasms have been identified and utilized for tea breeding (Chen et al., 2004).

Recently, following the tendency of aggravated competition on tea industry among international and domestic trades, tea industry tends to diversify in market demand. However, the shortage of elite cultivars satisfactory to the market has become an obstacle to tea industry sustainable development. Tea genetic resources are one of the most valuable and fundamental materials for breeding. Numerous tea plant germplasms have been conserved in tea repositories, and the absence of knowledge on some important traits and/or complex traits of in the genetic resources has hindered their utilization. However, the large number of accessions results in complications as their evaluation, management, and utilization are difficult, particularly in the case of initiation of new breeding programs for the improvement of agronomic traits of resistance to diseases and pests as well as quality. This is because the screening of the initial collection for such traits might be quite expensive and time consuming.

Core collections, with a subset of accessions selected to represent the genetic diversity in the collection and provide a minimum of repetitiveness, have been proposed as a good way of maintaining and managing the germplasm collections (Frankel, 1984; Brown, 1989). The core collection as a convenient working collection can be evaluated and utilized preferentially, which could solve the problem in large-scale collection hindering the preservation and utilization of germplasm resources, which has received extensive attention all over the world. A great number of core collections have been established, such as rice (Li et al., 2004), maize (Li et al., 2004), wheat (Balfourier et al., 2007), etc. In tea, Li and Jiang (2004) established a core collection of 615 tea accessions based on 10 traits data. However, the development of a more representative and abundant genetic diversity core collection of tea plant in China is necessary. The objectives of our research were therefore to characterize the tea plant accessions available in CNGTR’s genebank using passport data and botanical descriptors and analyze this diversity to define a core collection, which may be a valuable entry point to the initial collection for further research or use by tea plant breeders.

Materials and methods

Materials and data standardization

The CNGTR holds 2665 accessions tea germplasms, including five species and two varieties of Sect. Thea (L.) Dyer (Chen et al., 2000), originating from eight countries, such as China, Japan, Kenya, Georgia, India, Vietnam, Burma, Sri Lanka, etc. About 1669 accessions were systematically appraised and comprehensively evaluated using multidisciplinary approaches (Chen et al., 2004; Wang et al., 2008). Data of 33 passport data and phenotypic traits, including agronomical, cup tea quality, and main chemical components, were used to establish core collection (Table 1). The data were arranged according to procedures outlined in “Descriptors and Data Standard for Tea (Camellia spp.)” (Chen et al., 2005), and the qualitative traits were assigned as 1, 2, 3, etc. (Table 2). Because the number of style (1 to 5) splitting was a continuous quantitative trait, this trait was considered as a qualitative trait, and the quantitative traits were divided to 10 classes based on means±0.5σ as 1, 2, 3…10 (Qiu et al., 2003).

Establishment of core collection

A method of stratification in cultivated ecological areas and then cluster sampling was adopted by recording the number of accessions from each cultivated ecological area, calculating the logarithm (log) of the number of accessions from each cultivated ecological area and analyzing hierarchical clusters on the standardized data using Ward’s minimum variance method (Ward, 1963) with 0.75 R2 (squared multiple correlation) for grouping the accessions. The Ward’s method was used with PROC-CLUSTER program in SAS 8.0 for Windows (SAS Institute, 1989); approximately 20% accessions were randomly selected from each cluster; some germplasms with special traits, such as white young shoot and “Zigzag” branch, were directly selected into the core collection, and the germplasms were selected and not evaluated based on the logarithm (log) of the number of accessions from each cultivated ecological area (Wang et al., 2009).

Validating the core collection

The means of the core collection and the initial collection were compared using the Newman-Keuls procedure (Newman, 1939; Keuls, 1952) for all the nine quantitative traits. The homogeneity of variances of the core and initial collections was tested using Levene’s test (Levene, 1960). The percentage of significant differences between the core collection and initial collection was calculated for the mean difference percentage (MD%) or the variance difference percentage (VD%) (Hu et al., 2000). The coincidence rate (CR%) and variable rate (VR%) were calculated to evaluate properties of the core collection (Hu et al., 2000). The homogeneity of distribution of the initial and core collection for each trait was measured by comparing the frequency distribution of the trait using the chi-square test. Phenotypic correlations between the nine quantitative traits in both the core and initial collections were estimated independently to assess whether these associations under same genetic control, which were conserved in the core collection. The diversity index (H′) of Shannon–Weaver (1949) was used as a measure of the phenotypic diversity of each trait. The index was calculated independently in both the core collection and initial collection to determine whether the diversity was retained in the core collection. The coincidence rate of index numbers (CRIN%) was used to test whether the core collection retained enough variability (Zhao et al., 2005). Data analysis was done using SAS software (SAS Institute, 1989).

H′ = -∑Pj lnPj, where Pj is the traits appearing frequency of the code j.

Coincidence rate of index numbers (CRIN%)=(Nj/Nj)×100, where Nj is the traits indicator of core collection of number j, and Nj is the initial germplasm traits indicator of the number j.

Results

Core collection

In accordance with the national tea production zoning, the tea cultivated regions in China are divided into four: North of the Yangtze River (NYR), South of the Yangtze River (SYR), South China (SC), and South-west China (SWC) (Tea Research Institute Chinese Academy of Agricultural Sciences, 1986). The introduced genetic resources are dealt with as a separate group; hence, there are five groups. The grouping result of each tea production region is shown in Table 3. Using the logarithmic sampling strategy, as per the proportion of 20% of the overall sample, the primary core collection of tea plant contained 532 accessions, among which there were 79 accessions from NYR, 125 accessions from SYR, 128 accessions from SC, 124 accessions from SWC, and 76 accessions from the introduced resources. Compared with the initial collection, in the core collection, the proportion of NYR (from 4.7% to 14.8%) and the introduced resources (from 4.0% to 14.3%) increased, the proportion of SYR (from 29.8% to 23.5%) and SC (from 34.4% to 24.1%) reduced, and the ratio of SWC had a slight change (Table 3).

Validating the core collection sampling

Differences between the means of the initial and core collections were found to be nonsignificant for all traits except for CAF (Table 4). The recorded MD% (11.1%) indicated that the core collection represented the initial collection adequately (Hu et al., 2000). The variances of the initial and the core collections were homogeneous for all traits (Table 4). The CR% of WE, CAF, TP, AA, and TP/AA were over 89%, and the CR% of LB, WB, FD, and SL were between 75.5% and 83.3%. The high CR% retained in the core collection (89.9%) indicated that it was representative of the initial collection. The coefficient of variations for all nine traits of core collection was higher than that of the initial collection, which showed the variation of core collection was greater, and most of the traits of extreme materials were retained in the core collection (VR% = 119.0%).

The analysis of frequency distribution, except germplasm types (P = 0.028), confirmed the homogeneity of the distribution between the initial and collections (Table 5). Analysis of the frequency distribution of germplasm types, in the core collection, improved the share of wild germplasms, improved cultivars and related species (other), and reduced the proportion of landraces and strains. Wild germplasms and related species were likely to hold the specific traits that were not possessed in cultivation type. Moreover, the improved cultivars were vastly spread in the production, since they contained many traits suitable for cultivation. Therefore, the rising proportion of these germplasms in the ratio of core collection will help to keep some specific traits needed for production and can be used in the future for cultivar improvement.

The Shannon-Weaver diversity index (H') was calculated and compared with the phenotypic diversity among traits in the initial and core collections. The index provided a measurement of both allelic richness and allelic evenness. A low index showed that frequency classes were unbalanced for a trait lacking genetic diversity. The average indexes for the 31 traits in the core collection (H' = 1.224) and the initial collection (H' = 1.184) were similar (Table 6), indicating that the diversity of initial collection was well represented in the core collection. In addition, it can be seen that the average genetic diversity index of core collection was greater than that of initial collection, indicating that some genetic redundancy was eliminated from the core collections, and the level of genetic diversity was improved.

Adequate and proper sampling that was essential in developing a representative core collection should consider the conservation of phenotypic associations arising out of coadapted gene complexes (Ortiz et al., 1998). Phenotypic correlations were calculated between the nine quantitative traits in the core collection as well as the initial collection, separately. In the initial collection, 22 correlations were significant at P = 0.05 whereas in the core collection, and 20 were significant at P = 0.05 level (Table 7). The difference between initial and core collections mainly existed in the TP / AA to LB and CAF, TP to FD, and WB to FD, while the other phenotypic correlations were consistent. This indicated a fairly good representation of coadapted gene complexes from the initial collection to the core collection.

The coincidence rate of index numbers (CRIN%) refers the proportion of the retained phenotypic index in the core collection to the total phenotypic index in the initial collection. The higher the proportion of the index number reserved, the stronger the representation of core collection indicating no loss of traits (Zhao et al., 2005). The analysis of the coincidence rate of index numbers except WB (90%) indicated that the core collection retained sufficient variations of initial collection (CRIN% = 99.7%) (Table 8).

Discussion

One of the major challenges today for genebank curators is to ensure effective and economical ways of conserving diversity. Duplication redundancy in collections has always been a major challenge faced by the curators (Mahalakshmi et al., 2007). Core collections represent the entire diversity in crops with limited use to those users who are not interested in entire diversity but rather in accessions meeting a specific diversity of traits or domain. The most important criterion for a good core collection is its representation of the entire collection, which can be evaluated by two parameters, namely, to what degree does the core collection maintain the full range of genetic variation of the entire collection and how close does its pattern of variation resemble that of the entire collection (Ortiz et al., 1998). Brown (1989) believed that less than 30% significant difference ratio in average and range of all traits between core and overall samples and 70% average rate of all the traits of the amplitude of core samples higher than the amplitude of the whole resource groups traits could come to a conclusion that the core collection basically represent the genetic diversity of the original resources. In this study, in comparison with the means, variances, frequency distribution, phenotypic correlation, Shanon-Weaver diversity index, and coincidence rate of index numbers between the core and the initial collections, we found that there were no significant differences in the mean and range for the traits studied, and frequency distributions were not affected due to proportional sampling. The core collection also retained the overall phenotypic diversity present in the initial collection. The representative of the core collections reached more than 90%, which was found to be representative of the initial collection of tea plant. Core collection is a good candidate for the evaluation of new traits because the core collection can improve the efficiency of identifying valuable genes in a germplasm collection (Holbrook et al., 2000). This core collection reduces the number of entries to be evaluated and provides a working collection of tea plant germplasm that can be used in extensive examination for all economically important traits. The multidisciplinary evaluation of this core collection will contribute to identifying useful parents for improvement programs so as to enhance the use of genetic resources for improving quantitative traits. In addition, this core collection can be used for molecular characterization research, since the extent of diversity can be inferred for the initial collection.

Core collections are not static but dynamic to accommodate the various needs (Jaradat, 1995). As the origin center of the tea plant, China has the most abundant tea genetic resources in the world (Yu, 1986). Therefore, with the enriching of germplasms and the changing of genetic research and goals of breeding needs in the future, there exists dynamic exchange and adjustment between the core and the retained collection, rather than unchangeable. With the continual enrichment and abundance of identification and evaluation data of tea germplasms as well as the emergence of new breeding materials and innovation of germplasms, coupled with the continuous adjustment of breeding objectives, the core collection should be timely adjusted and improved in order to remain dynamic and practical in meeting the needs of breeders as well as farmers, such as the pest and disease resistant germplasms, cold resistance germplasms, more efficient use of fertilizer germplasms, etc. At the same time, as the established core collection also contains some of the germplasms not yet identified systematically, therefore, this part of the genetic resources should be identified and evaluated in order to improve the core collection database. Currently, we are using molecular markers to analyze the genetic diversity and genetic structure of the established core collection. This would remove the redundancy of resources and further reduce the group size to establish mini core collection as well as make the core collection more practical (Zhao et al., 2008).

References

[1]

Balfourier F, Roussel V, Strelchenko P, Exbrayat-Vinson F, Sourdille P, Boutet G, Koenig J, Ravel C, Mitrofanova O, Beckert M, Charmet G (2007). A worldwide bread wheat core collection arrayed in a 384-well plate. Theor Appl Genet, 114(7): 1265-1275

[2]

Brown A H D (1989). Core collections: a practical approach to genetic resources management. Genome, 31(2): 818-824

[3]

Chen L, Yang Y J, Yu F L (2004). Tea germplasm research in China: recent progresses and prospects. Journal of Plant Genetic Resources, 5(4): 389-392 (in Chinese)

[4]

Chen L, Yang Y J, Yu F L (2005). Descriptors and Data Standard for Tea (Camellia spp.). Beijing: Chinese Agricultural Press (in Chinese)

[5]

Chen L, Yu F L, Tong Q Q (2000). Discussions on phylogenetic classification and evolution of Sect. Thea. Journal of Tea Science, 20(2): 89-94 (in Chinese)

[6]

Frankel O H (1984). Genetic perspective of germplasm conservation. In: Arber W, Llimensee K, Peacock W J, Starlinger P, eds. Genetic Manipulations: Impact on Man and Society. Cambridge: Cambridge University Press, 161-170

[7]

Holbrook C C, Timper P, Xue H Q (2000). Evaluation of the core collection approach for identifying resistance to Meloidogyne arenaria in peanut. Crop Sci, 40(4): 1172-1175

[8]

Hu J, Zhu J, Xu H M (2000). Methods of constructing core collections by stepwise clustering with three sampling strategies based on the genotypic values of crops. Theor Appl Genet, 101(1-2): 264-268

[9]

Jaradat A A (1995). The dynamics of a core collection. In: Hodgkin T, Brown A H D, van Hintum T J L, Morales E A V, eds. Core Collection of Plant Genetic Resources. IPGRI, A Wiley-Sayce Publication, 179-186

[10]

Keuls M (1952). The use of the ‘Studentized range’ in connection with an analysis of variance. Euphytica, 1(2): 112-122

[11]

Levene H (1960). Robust tests for equality of variances. In: Olkin I, ed. Contributions to Probability and Statistics: Essays in Honour of Harold Hotelling, Stanford: Stanford University Press, 278-292.

[12]

Li C T, Shi C H, Wu J G, Xu H M, Zhang H Z, Ren Y L (2004) . Methods of developing core collections based on the predicted genotypic value of rice (Oryza sativa L.). Theor Appl Genet, 108(6): 1172-1176

[13]

Li J, Jiang C J (2004). Preliminary construction of core germplasm of Camellia sinensis in China. Journal of Anhui Agricultural University, 31(3): 282-287 (in Chinese)

[14]

Li Y, Shi Y, Cao Y, Wang T (2004). Establishment of a core collection for maize germplasm preserved in Chinese National Genebank using geographic distribution and characterization data. Genet Resour Crop Evol, 51(8): 845-852

[15]

Mahalakshmi V, Ng Q, Lawson M, Ortiz R (2007). Cowpea (Vigna unguiculata L. Walp.) core collection defined by geographical, agronomical and botanical descriptors. Plant Genet Resour: Characterization Util, 5(3): 113-119

[16]

Newman D (1939). The distribution of range in samples from a normal population expressed in terms an independent estimate of standard deviation. Biometrika, 31(1-2): 20-30

[17]

Ortiz R, Ruiz-Tapia E N, Mujica-Sanchez A (1998). Sampling strategy for a core collection of Peruvian quinoa germplasm. Theor Appl Genet, 96(3-4): 475-483

[18]

Qiu L J, Cao Y S, Chang R Z, Zhou X A, Wang G X, Sun J Y, Xie H, Zhang B, Li X H, Xu Z Y, Liu L H (2003). Establishment of Chinese soybean (G. max) core collection I. sampling strategy. Scientia Agricultura Sinica, 36(12): 1442-1449 (in Chinese)

[19]

SAS Institute (1989). SAS User’s Guide: Statistics. State University Press, Cary, NC, USA

[20]

Shannon C E, Weaver W (1949). The Mathematical Theory of Communication. Urbana: University of Illinois Press

[21]

Tea Research Institute Chinese Academy of Agricultural Sciences (1986). Planting Science of Chinese Tea Plant. Shanghai: Shanghai Scientific and Technical Publishers, 36-49 (in Chinese)

[22]

Wang X C, Liu Z, Yao M Z, Ma C L, Chen L, Yang Y J (2009). Sampling strategy to establish a primary core collection of Chinese tea germplasms. Journal of Tea Science, 29(2): 157-165 (in Chinese)

[23]

Wang X C, Yao M Z, Ma C L, Chen L (2008). Analysis and evaluation of biochemical components in bitter tea plant germplasms. Chinese Agricultural Science Bulletin, 24(6): 65-69 (in Chinese)

[24]

Ward Jr J H (1963). Hierarchical grouping to optimize an objective function. J Am Stat Assoc, 58(301): 236-244

[25]

Yu F L (1986). Discussion on the originating place and the originating center of tea plants. Journal of Tea Science, 6(1): 1-8 (in Chinese)

[26]

Zhao L, Dong Y, Liu B, Hao S, Wang K, Li X (2005). Establishment of a core collection for the Chinese annual wild soybean (Glycine soja). Chin Sci Bull, 50(10): 989-996

[27]

Zhao L P, Liu Z, Chen L, Yao M Z, Wang X C (2008). Generation and characterization of 24 novel EST derived microsatellites from tea plant (Camellia sinensis) and cross-species amplification in its closely related species and varieties. Conserv Genet, 9(5): 1327-1331

RIGHTS & PERMISSIONS

Higher Education Press and Springer-Verlag Berlin Heidelberg

AI Summary AI Mindmap
PDF (130KB)

1003

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/