Decoding hybrid origins and genetic architecture of leaf traits variation in camellia via high-density 21K SNP array for genomic prediction

Jiayu Li, Yixuan Luo, Rui Zhang, Xinchun Li, Hongwei Pan, Hengfu Yin

Horticulture Research, 2025, Vol. 12, Issue 11: 221. DOI: 10.1093/hr/uhaf221


Abstract

The domestication of ornamental plants is driven primarily by aesthetic value and usually involves frequent hybridization. Camellia spp., globally renowned woody ornamentals, exemplify such complex origins and extensive phenotypic variation. Here, based on whole-genome resequencing of 220 germplasm accessions, we developed Camellia21K, a high-density SNP array that enables cost-effective genome-wide genotyping. We demonstrated that Camellia21K accurately resolves 69 cultivars with complex hybridization histories. To identify closely related varieties at the molecular level, we developed a set of fingerprinting SNPs for variety discrimination. To dissect the genomic basis of ornamental traits, we performed a genome-wide association study (GWAS) of five leaf shape traits using the Camellia21K array and identified 31 SNP loci significantly associated with these traits. By analyzing the genotypes at these loci and the haplotypes of the surrounding segments, we further identified candidate genes regulating leaf tip length, demonstrating the versatility of the array. To enhance breeding efficiency, we evaluated and optimized four genomic selection (GS) models for leaf trait prediction. Both the number of SNPs and the choice of model significantly affected prediction performance; optimal predictive accuracy (PC) ranged from 0.362 to 0.542 and was positively correlated with heritability. Finally, integrating GWAS-derived SNPs as fixed effects significantly improved PC (by 24.7%-64.7%), indicating that combining GWAS with GS is indispensable for precision breeding. Together, these results show that Camellia21K is effective for discriminating the origin of varieties, for genetic analysis of traits, and for genomic prediction, and is thus informative for crop breeding.
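The idea of adding GWAS-derived SNPs to a genomic selection model as fixed effects can be illustrated with a minimal sketch. The abstract does not name the four GS models, so the code below assumes a plain ridge-regression (rrBLUP-style) model as a stand-in: the intercept and the chosen SNPs are left unpenalized, all remaining markers are shrunk, and predictive accuracy is taken as the Pearson correlation between observed and predicted phenotypes in k-fold cross-validation. All function names, the penalty `lam`, and the simulated data are hypothetical and are not taken from the paper.

```python
import numpy as np


def fit_ridge_with_fixed(X_fixed, Z, y, lam):
    """Estimate unpenalized fixed effects (b) and ridge-penalized marker effects (u)."""
    n = len(y)
    X = np.column_stack([np.ones(n), X_fixed])  # intercept + fixed-effect SNPs
    p, q = X.shape[1], Z.shape[1]
    W = np.hstack([X, Z])
    # Penalize only the q "random" marker columns, not the p fixed columns.
    penalty = np.diag(np.r_[np.zeros(p), np.full(q, lam)])
    coef = np.linalg.solve(W.T @ W + penalty, W.T @ y)
    return coef[:p], coef[p:]


def predict(X_fixed, Z, b, u):
    """Predict phenotypes from fitted fixed and marker effects."""
    X = np.column_stack([np.ones(Z.shape[0]), X_fixed])
    return X @ b + Z @ u


def predictive_accuracy(y_obs, y_pred):
    """Pearson correlation between observed and predicted values."""
    return float(np.corrcoef(y_obs, y_pred)[0, 1])


def cross_validate(G, y, fixed_idx, lam=1.0, k=5, seed=0):
    """Mean k-fold predictive accuracy with columns `fixed_idx` as fixed effects."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), k)
    fixed_idx = np.asarray(fixed_idx, dtype=int)
    random_idx = np.setdiff1d(np.arange(G.shape[1]), fixed_idx)
    scores = []
    for test in folds:
        train = np.setdiff1d(np.arange(len(y)), test)
        b, u = fit_ridge_with_fixed(
            G[np.ix_(train, fixed_idx)], G[np.ix_(train, random_idx)], y[train], lam
        )
        y_hat = predict(G[np.ix_(test, fixed_idx)], G[np.ix_(test, random_idx)], b, u)
        scores.append(predictive_accuracy(y[test], y_hat))
    return float(np.mean(scores))


if __name__ == "__main__":
    # Simulated (not real) data: 200 samples, 300 SNPs coded 0/1/2,
    # one large-effect SNP (column 0) plus many small effects and noise.
    rng = np.random.default_rng(1)
    G = rng.integers(0, 3, size=(200, 300)).astype(float)
    effects = rng.normal(0.0, 0.05, size=300)
    effects[0] = 1.0
    y = G @ effects + rng.normal(0.0, 1.0, size=200)

    no_fixed = np.array([], dtype=int)
    print("GS only:        ", cross_validate(G, y, no_fixed, lam=50.0))
    print("GS + fixed SNP: ", cross_validate(G, y, np.array([0]), lam=50.0))
```

In this sketch the fixed-effect SNPs are supplied by the caller. In a real analysis the GWAS hits should be selected using only the training fold of each split; otherwise information from the test samples leaks into the model and the reported accuracy is inflated.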

Cite this article

Jiayu Li, Yixuan Luo, Rui Zhang, Xinchun Li, Hongwei Pan, Hengfu Yin. Decoding hybrid origins and genetic architecture of leaf traits variation in camellia via high-density 21K SNP array for genomic prediction. Horticulture Research, 2025, 12(11): 221. DOI: 10.1093/hr/uhaf221


Acknowledgements

This work was supported by the National Science Foundation of China, grant 32271839 and the Zhejiang Science and Technology Major Program on Agricultural New Variety Breeding (2021C02070-1).

Author contributions

Writing - review & editing: H.Y. and J.L.; data curation: J.L., X.L., and Y.L.; formal analysis: J.L., R.Z., and H.P.; conceptualization: H.P. and H.Y.; writing - original draft: J.L. and H.Y.; sample collection: Y.L., H.P., and R.Z.

Data availability

All raw sequencing data are publicly accessible through the National Genomics Data Center at https://www.cncb.ac.cn/ under BioProject ID PRJCA039435. All associated data are provided in the supplementary materials of the manuscript, and all code used in the analysis is available upon request.

Conflict of interest statement

The authors declare that they have no known competing interests associated with the work reported in this paper.

Supplementary data

Supplementary data are available at Horticulture Research online.

