Integrating whole-genome resequencing and machine learning to refine QTL analysis for fruit quality traits in peach

Jiaqi Fan , Jinlong Wu , Pere Arús3 , Yong Li , Ke Cao , Lirong Wang

Horticulture Research ›› 2025, Vol. 12 ›› Issue (7) : 87

PDF (1885KB)
Horticulture Research ›› 2025, Vol. 12 ›› Issue (7) :87 DOI: 10.1093/hr/uhaf087
Articles
research-article
Integrating whole-genome resequencing and machine learning to refine QTL analysis for fruit quality traits in peach
Author information +
History +
PDF (1885KB)

Abstract

Increasing marker density results in better map coverage and efficiency of genetic analysis. Here, we resequenced a large (N = 235) F1 progeny from two distant peach cultivars, ‘Zhongyou Pan #9’ and ‘September Free’, and constructed two parental maps (1:1 segregations) and one combined map (1:2:1 segregations) with 134 277 SNPs. Markers with the same genotype for all individuals studied were grouped in bins and a unique genotype for each bin was inferred to avoid mapping problems derived from erroneous data. The total genetic distance of the two parental maps was 431.9 and 594.2 cM with a short mean distance, 0.9 cM, between contiguous bins (groups of markers with the same genotype) and high collinearity with the peach genome. The genetics of eight fruit-related traits was analyzed for 2 years, allowing the positions of two major genes, fruit shape (S) and flesh adhesion to the stone (F), to be established, along with nine quantitative trait loci (QTLs) for quantitative traits including fruit soluble solids concentration, titratable acidity, weight, maturity date, and flesh color (yellow to orange). We developed a machine learning-based linear model to assess flesh color, which proved more efficient than physical colorimetric parameters (L, a*, b*), detecting consistent QTLs. Based on map position, gene expression patterns, and function, candidate genes were identified. Overall, our results provide two new elements: ultra-high-density maps with resequencing data to enhance mapping resolution and phenotyping strategies based on machine learning models that improve the quality of quantitative measurements to help understand the genetic control of key fruit quality traits.

Cite this article

Download citation ▾
Jiaqi Fan, Jinlong Wu, Pere Arús3, Yong Li, Ke Cao, Lirong Wang. Integrating whole-genome resequencing and machine learning to refine QTL analysis for fruit quality traits in peach. Horticulture Research, 2025, 12(7): 87 DOI:10.1093/hr/uhaf087

登录浏览全文

4963

注册一个新账户 忘记密码

Acknowledgements

The authors would like to express their gratitude to Guizhi Li and Hongyang Xing for their contributions to fruit sampling. This work was financially supported by the National Key Research and Development Program (grant nos. 2023YFE0105400, 2022YFD1200503), the National Natural Science Foundation of China (32102328, 32472701), China Agriculture Research System (grant no. CARS-30-1-04), and the China Scholarship Council.

Author contributions

L.W. and J.W. conceived the project and its components. J.F. and P.A. wrote the paper. J.F. collected samples. Y.L., J.W., and K.C. supervised the study. J.F. conducted the experimental work. J.F. and P.A. analyzed the data. J.W. and P.A. reviewed the original manuscript. All authors read and approved the final manuscript.

Data availability

The data that support the results are partly available in the paper and the supplementary materials published online. Resequencing data for the parents and the 235 progenies, markers used for mapping ordered in bins (the ‘bin’ set), and markers used for mapping with only one marker per bin (the ‘one bin’ set) for the ‘September Free’, ‘Zhongyou Pan #9’, and the SF × ZP (1:2:1) combined map are available in https://doi.org/10.6084/m9.figshare.27309288.v1.Sequencing data are in the Sequence Read Archive of the National Center for Biotechnology Information (NCBI) under BioProjects PRJNA1167467.

Conflict of interest statement

The authors declare no competing interests.

Supplementary Data

Supplementary data is available at Horticulture Research online.

References

[1]

Shulaev V, Korban SS, Sosinski B. et al. Multiple models for Rosaceae genomics. Plant Physiol. 2008; 147:985-1003

[2]

Verde I, Abbott AG, Scalabrin S. et al. The high-quality draft genome of peach (Prunus persica) identifies unique patterns of genetic diversity, domestication and genome evolution. Nat Genet. 2013; 45:487-94

[3]

Aranzana MJ, Decroocq V, Dirlewanger E. et al. Prunus genetics and applications after de novo genome sequencing: achieve-ments and prospects. Hortic Res. 2019; 6:6

[4]

Li Y, Wang L. Genetic resources, breeding programs in China, and gene mining of peach: a review. Hortic Plant J. 2020; 6:205-15

[5]

Chaparro JX, Werner DJ, O’Malley D. et al. Targeted mapping and linkage analysis of morphological, isozyme, and RAPD markers in peach. Theor Appl Genet. 1994; 87:805-15

[6]

Joobeur T, Viruel MA, de Vicente MC. et al. Construction of a saturated linkage map for Prunus using an almond × peach F2 progeny. Theor Appl Genet. 1998; 97:1034-41

[7]

Dirlewanger E, Graziano E, Joobeur T. et al. Comparative mapping and marker-assisted selection in Rosaceae fruit crops. Proc Natl Acad Sci USA. 2004; 101:9891-6

[8]

Verde I, Bassil NV, Scalabrin S. et al. Development and evaluation of a 9K SNP array for peach by internationally coordinated SNP detection and validation in breeding germplasm. PLoS One. 2012; 7:e35668

[9]

Feng Y, Yang C, Zhang J. et al. Construction of a high-density Paulownia genetic map and QTL mapping of important phe-notypic traits based on genome assembly and whole-genome resequencing. Int J Mol Sci. 2023; 24:15647

[10]

Wang C, Qin K, Shang X. et al. Mapping quantitative trait loci associated with self-(in)compatibility in goji berries (Lycium bar-barum). BMC Plant Biol. 2024; 24:441

[11]

Cao K, Zhou Z, Wang Q. et al. Genome-wide association study of 12 agronomic traits in peach. Nat Commun. 2016; 7:13246

[12]

Guo J, Cao K, Deng C. et al. An integrated peach genome struc-tural variation map uncovers genes associated with fruit traits. Genome Biol. 2020; 21:258

[13]

Monet R, Guye A, Roy M. et al. Peach Mendelian genetics: a short review and new results. Agronomie. 1996; 16:321-9

[14]

da Silva LC, Hardner C, Debnar M. et al. Multi-locus genome-wide association studies reveal fruit quality hotspots in peach genome. Front Plant Sci. 2021; 12:644799

[15]

Hernández Mora JR, Micheletti D, Bink MCAM. et al. Integrated QTL detection for key breeding traits in multiple peach proge-nies. BMC Genomics. 2017; 18:404

[16]

Adami M, De Franceschi P, Brandi F. et al. Identifying a carotenoid cleavage dioxygenase (ccd4) gene controlling yellow/white fruit flesh color of peach. Plant Mol Biol Report. 2013; 31:1166-75

[17]

Cobb JN, DeClerck G, Greenberg A. et al. Next-generation phe-notyping: requirements and strategies for enhancing our under-standing of genotype-phenotype relationships and its relevance to crop improvement. Theor Appl Genet. 2013; 126:867-87

[18]

Van Dijk ADJ, Kootstra G, Kruijer W. et al. Machine learning in plant science and plant breeding. iScience. 2021; 24:101890

[19]

Gasic K, Martinez-Garcia PJ, van der Knaap E. et al. Development and evaluation of a 9K SNP addition to the peach IPSC 9K SNP array v1. HortScience. 2019; 54:S188

[20]

da Silva LC, Antanaviciute L, Abdelghafar A. et al. High-density multi-population consensus genetic linkage map for peach. PLoS One. 2018; 13:e0207724

[21]

Qin MF, Li LT, Singh J. et al. Construction of a high-density bin-map and identification of fruit quality-related quantitative trait loci and functional genes in pear. Hortic Res. 2022; 9:9

[22]

Vision TJ, Brown DG, Shmoys DB. et al. Selective mapping: a strategy for optimizing the construction of high-density linkage maps. Genetics. 2000; 155:407-20

[23]

Martínez-García PJ, Parfitt DE, Ogundiwin EA. et al. High-density SNP mapping and QTL analysis for fruit quality characteristics in peach (Prunus persica L.). Tree Genet Genomes. 2013; 9:19-36

[24]

Serra O, Giné-Bordonaba J, Eduardo I. et al. Genetic analysis of the slow melting flesh character in peach. Tree Genet Genomes. 2017; 13:77

[25]

Kalluri N, Eduardo I, Arús P. Comparative QTL analysis in peach ’Earlygold’ F2 and backcross progenies. Sci Hortic. 2022; 293:110726

[26]

López-Girona E, Zhang Y, Eduardo I. et al. A deletion affecting an LRR-RLK gene co-segregates with the fruit flat shape trait in peach. Sci Rep. 2017; 7:6714

[27]

Cirilli M, Rossini L. Many candidates for a single chair: a critical review of the genetic determinant of flat fruit shape trait in peach (Prunus persica L. Batsch). Tree Genet Genomes. 2021; 17:34

[28]

Zhou H, Ma R, Gao L. et al. A 1.7-Mb chromosomal inversion downstream of a PpOFP1 gene is responsible for flat fruit shape in peach. Plant Biotechnol J. 2021; 19:192-205

[29]

Gu C, Wang L, Wang W. et al. Copy number variation of a gene cluster encoding endopolygalacturonase mediates flesh texture and stone adhesion in peach. JExp Bot. 2016; 67:1993-2005

[30]

Wang Q, Cao K, Cheng L. et al. Multi-omics approaches identify a key gene, PpTST1, for organic acid accumulation in peach. Hortic Res. 2022; 9:9

[31]

Boudehri K, Bendahmane A, Cardinet G. et al. Phenotypic and fine genetic characterization of the D locus controlling fruit acidity in peach. BMC Plant Biol. 2009; 9:59

[32]

Wang J, Cao K, Li Y. et al. Genome variation and LTR-RT analyses of an ancient peach landrace reveal mechanism of blood-flesh fruit color formation and fruit maturity date advancement. Hortic Res. 2024; 11:11

[33]

Pirona R, Eduardo I, Pacheco I. et al. Fine mapping and identifica-tion of a candidate gene for a major locus controlling maturity date in peach. BMC Plant Biol. 2013; 13:68

[34]

Shen ZJ, Confolent C, Lambert P. et al. Characterization and genetic mapping of a new blood-flesh trait controlled by the single dominant locus DBF in peach. Tree Genet Genomes. 2013; 9: 1435-46

[35]

Donoso JM, Eduardo I, Picañol R. et al. Exploring almond genetic variability useful for peach improvement: mapping major genes and QTLs in two interspecific almond × peach populations. Mol Breed. 2016; 36:16

[36]

Kalluri N, Serra O, Donoso JM. et al. Construction of a collection of introgression lines of ’Texas’ almond DNA fragments in the ’Earlygold’ peach genetic background. Hortic Res. 2022; 9:9

[37]

Dirlewanger E, Quero-García J, Le Dantec L. et al. Comparison of the genetic determinism of two key phenological traits, flower-ing and maturity dates, in three Prunus species: peach, apricot and sweet cherry. Heredity. 2012; 109:280-92

[38]

Singh A, Jones S, Ganapathysubramanian B. et al. Challenges and opportunities in machine-augmented plant stress phenotyping. Trends Plant Sci. 2021; 26:53-69

[39]

Bock CH, Poole GH, Parker PE. et al. Plant disease severity esti-mated visually, by digital photography and image analysis, and by hyperspectral imaging. Crit Rev Plant Sci. 2010; 29:59-107

[40]

Bai XD, Cao ZG, Wang Y. et al. Crop segmentation from images by morphology modeling in the CIE L∗ a∗ b∗ color space. Comput Electron Agric. 2013; 99:21-34

[41]

Kuehni RG. Color space and its divisions. Color Res Appl. 2001; 26: 209-22

[42]

Bora DJ, Gupta AK,Khan FA. Comparing the performance of L∗ a∗ b∗ and HSV color spaces with respect to color image segmentationarXiv preprint. 2015.

[43]

Schloss KB, Lessard L, Racey C. et al. Modeling color preference using color space metrics. Vis Res. 2018; 151:99-116

[44]

Mitchell TM. Machine Learning. New York: McGraw-Hill; 1997:

[45]

Cunningham P, Cord M, Delany SJ. Supervised learning. In: Cord M, Cunningham P (eds.), Machine Learning Techniques for Multi-media:Case Studies on Organization and Retrieval. Berlin: Springer; 2008:21-49

[46]

Lichtenberg JM, ¸Sim¸sek Ö. Simple regression models. In: Guy TV, Kárný M, Rios-Insua D, Wolpert DH (eds.), Imperfect Decision Makers: Admitting Real-World Rationality. Proceedings of Machine Learning Research; 2017:13-25

[47]

Maulud D, Abdulazeez AM. A review on linear regression com-prehensive in machine learning. J Appl Sci Technol Trends. 2020; 1: 140-7

[48]

Nordey T, Davrieux F, Léchaudel M. Predictions of fruit shelf life and quality after ripening: are quality traits measured at harvest reliable indicators? Postharvest Biol Technol. 2019; 153:52-60

[49]

Matias RG, da Silva DF, Miranda PMD. et al. Relationship between fruit traits and contents of ascorbic acid and carotenoids in peach. Crop Breed Appl Biotechnol. 2016; 16:348-54

[50]

Eduardo I, Pacheco I, Chietera G. et al. QTL analysis of fruit qual-ity traits in two peach intraspecific populations and importance of maturity date pleiotropic effect. Tree Genet Genomes. 2011; 7: 323-35

[51]

Zaracho N, Reig G, Kalluri N. et al. Inheritance of fruit red-flesh patterns in peach. Plan Theory. 2023; 12:394

[52]

Wang LR, Zhu GR. Descriptors and Data Standard for Peach (Prunus persica L.). Beijing: China Agriculture Press; 2005:

[53]

Okamoto M, Kuwahara A, Seo M. et al. CYP707A1 and CYP707A2, which encode abscisic acid 8’-hydroxylases, are indispensable for proper control of seed dormancy and germination in Ara-bidopsis. Plant Physiol. 2006; 141:97-107

[54]

Cao K, Wang B, Fang W. et al. Combined nature and human selec-tions reshaped peach fruit metabolome. Genome Biol. 2022; 23:146

[55]

Murray MG, Thompson WF. Rapid isolation of high molecular weight plant DNA. Nucleic Acids Res. 1980; 8:4321-6

[56]

Verde I, Jenkins J, Dondini L. et al. The peach v2.0 release: high-resolution linkage mapping and deep resequencing improve chromosome-scale assembly and contiguity. BMC Genomics. 2017; 18:225

[57]

Li H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter esti-mation from sequencing data. Bioinformatics. 2011; 27:2987-93

[58]

Rastas P. Lep-MAP3: robust linkage mapping even for low-coverage whole genome sequencing data. Bioinformatics. 2017; 33: 3726-32

[59]

Grattapaglia D, Sederoff R. Genetic linkage maps of Eucalyptus grandis and Eucalyptus urophylla using a pseudo-testcross: map-ping strategy and RAPD markers. Genetics. 1994; 137:1121-37

[60]

Van Ooijen J. JoinMap 5.0, Software for the Calculation of Genetic Linkage Maps. Wageningen: Kyazma BV; 2019:

[61]

Tang H, Zhang X, Miao C. et al. ALLMAPS: robust scaffold ordering based on multiple maps. Genome Biol. 2015; 16:3

[62]

Van Ooijen J. MapQTL 6, Software for the Mapping of Quantitative Trait Loci in Experimental Populations of Diploid Species.Kyazma BV: Wageningen; 2009:

[63]

Tanksley SD. Mapping polygenes. Annu Rev Genet. 1993; 27:205-33

[64]

Cao K, Li Y, Deng CH. et al. Comparative population genomics identified genomic regions and candidate genes associated with fruit domestication traits in peach. Plant Biotechnol J. 2019; 17: 1954-70

[65]

Pedregosa F, Varoquaux G, Gramfort A. et al. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011; 12:2825-30

[66]

Fan J, Guo J, Li Y. et al. A novel mutation of LTR insertion in PpCCD4 exon identified in Prunus ferganensis leads to yellow-fleshed fruits. Sci Hortic. 2023; 313:111914

[67]

Wang Q, Cao K, Li Y. et al. Identification of co-expressed net-works and key genes associated with organic acid in peach fruit. Sci Hortic. 2023; 307:111496

PDF (1885KB)

326

Accesses

0

Citation

Detail

Sections
Recommended

/