A comprehensive analysis of two Chinese cucumber genomes and a mutant population as resources for precision breeding

Jiaxi Han , Jingwei Wei , Weiliang Kong , Weili Miao , Lidong Zhang , Yuhe Li , Jiawang Li , Xin Li , Tao Lin , Hongyu Huang

Horticulture Research ›› 2026, Vol. 13 ›› Issue (2) : 284

PDF (2466KB)
Horticulture Research ›› 2026, Vol. 13 ›› Issue (2) :284 DOI: 10.1093/hr/uhaf284
Articles
research-article
A comprehensive analysis of two Chinese cucumber genomes and a mutant population as resources for precision breeding
Author information +
History +
PDF (2466KB)

Abstract

Cucumis sativus L., commonly known as cucumber, is an important vegetable crop worldwide, with China as the largest producer, particularly of the North and South China types. While extensive genomic research has focused on the North China type, especially the Chinese Long 9930, studies on the South China type remain limited. In this study, we assembled high-quality genomes of two widely cultivated and representative parent varieties: S36 (North China type) and H19 (South China type), and conducted mutagenesis analyses. Comparative genome analysis revealed a large number of structural variants between two North China types and two South China types, with many of the affected genes showing strong homology to known functional loci, potentially contributing to phenotypic divergence. We also constructed an EMS mutant library through the mutagenesis of S36 and identified a gene that encodes chlorophyll oxidase, demonstrating the method’s effectiveness for rapid gene discovery. In conclusion, this study provides valuable insights into the classification and evolution of cucumber, highlighting the promising potential of forward genetic approaches in cucumber breeding.

Cite this article

Download citation ▾
Jiaxi Han, Jingwei Wei, Weiliang Kong, Weili Miao, Lidong Zhang, Yuhe Li, Jiawang Li, Xin Li, Tao Lin, Hongyu Huang. A comprehensive analysis of two Chinese cucumber genomes and a mutant population as resources for precision breeding. Horticulture Research, 2026, 13(2): 284 DOI:10.1093/hr/uhaf284

登录浏览全文

4963

注册一个新账户 忘记密码

Acknowledgments

The research was supported by the State Key Laboratory of Vegetable Biobreeding (SKLVB202506 to X.L.), the National Key Research and Development Program of China (2023YFF1000100 to T.L.), the Tianjin Major Project for Seed Industry (23ZXZYSN00010), the 111 Project (B17043 to T.L.), and the Construction of Beijing Science and Technology Innovation and Service Capacity in Top Subjects (CEFF-PXM2019_014207_000032 to T.L.).

Author Contributions

H.H. and T.L. conceived and designed the study. H.H., J.W., W.K., W.M., L.Z., Y.L., and J.L. planted and prepared the materials. J.H. performed the bioinformatics analysis. J.W. and X.L. designed and performed molecular experiments. J.H and J.W. wrote the manuscript. T.L., H.H., and X.L. edited and improved the manuscript. All authors approved the final manuscript.

Data availability

The original sequencing data for genome assembly and annotation as well as the sequencing data from mutants and bulked pools for BSA analysis have been deposited in the BIG Submission Portal (https://ngdc.cncb.ac.cn/) under BioProject accession number PRJCA038364. The assembled genomes S36 and H19 were deposited in the Genome Warehouse (GWH) database of the Big Data Center (https://bigd.big.ac.cn/gwh/) under the accession number GWHGDGZ00000000.1 and GWHGDGY00000000.1, respectively.

Conflicts of interest statement

The authors declare no competing interest.

Supplementary material

Supplementary material is available at Horticulture Research online.

References

[1]

Guan J, Miao H, Zhang Z. et al. A near-complete cucumber ref-erence genome assembly and Cucumber-DB, a multi-omics database. Mol Plant. 2024; 17:1178-82

[2]

Lin T, Zhu G, Zhang J. et al. Genomic analyses provide insights into the history of tomato breeding. Nat Genet. 2014; 46:1220-6

[3]

Zhao G, Lian Q, Zhang Z. et al. A comprehensive genome variation map of melon identifies multiple domestication events and loci influencing agronomic traits. Nat Genet. 2019; 51:1607-15

[4]

Xie D, Xu Y, Wang J. et al. The wax gourd genomes offer insights into the genetic diversity and ancestral cucurbit karyotype. Nat Commun. 2019; 10:5158

[5]

Lyu X, Xia Y, Wang C. et al. Pan-genome analysis sheds light on structural variation-based dissection of agronomic traits in melon crops. Plant Physiol. 2023; 193:1330-48

[6]

Liu Y, du H, Li P. et al. Pan-genome of wild and cultivated soybeans. Cell. 2020; 182:162-176.e13

[7]

Cai X, Chang L, Zhang T. et al. Impacts of allopolyploidiza-tion and structural variation on intraspecific diversification in Brassica rapa. Genome Biol. 2021; 22:166

[8]

Li H, Wang S, Chai S. et al. Graph-based pan-genome reveals structural and sequence variations related to agronomic traits and domestication in cucumber. Nat Commun. 2022; 13: 682

[9]

Della Coletta R, Qiu Y, Ou S. et al. How the pan-genome is changing crop genomics and improvement. Genome Biol. 2021; 22:3

[10]

Jayakodi M, Schreiber M, Stein N. et al. Building pan-genome infrastructures for crop plants and their use in association genetics. DNA Res. 2021;28:dsaa030

[11]

Bayer PE, Golicz AA, Scheben A. et al. Plant pan-genomes are the new reference. Nat Plants. 2020; 6:914-20

[12]

Hübner S. Are we there yet? Driving the road to evolutionary graph-pangenomics. Curr Opin Plant Biol. 2022; 66:102195

[13]

Shang L, Li X, He H. et al. A super pan-genomic landscape of rice. Cell Res. 2022; 32:878-96

[14]

Tran QH, Shang L, Kappel C. et al. Mapping-by-sequencing via MutMap identifies a mutation in ZmCLE7 underlying fascia-tion in a newly developed EMS mutant population in an elite tropical maize inbred. Genes (Basel). 2020; 11:281

[15]

Sun X, Li X, Lu Y. et al. Construction of a high-density mutant population of Chinese cabbage facilitates the genetic dissec-tion of agronomic traits. Mol Plant. 2022; 15:913-24

[16]

Deng Y, Liu S, Zhang Y. et al. A telomere-to-telomere gap-free reference genome of watermelon and its mutation library provide important resources for gene discovery and breeding. Mol Plant. 2022; 15:1268-84

[17]

Fu A, Zheng Y, Guo J. et al. Telomere-to-telomere genome assembly of bitter melon (Momordica charantia L. var. abbreviata Ser.) reveals fruit development, composition and ripening genetic characteristics Hortic Res. 2023;10: uhac228

[18]

Jiang F, Guo M, Yang F. et al. Mutations in an AP2 transcrip-tion factor-like gene affect internode length and leaf shape in maize. PLoS One. 2012; 7:e37040

[19]

Zhao J, Jiang L, Che G. et al. A functional allele of CsFUL1 reg-ulates fruit length through repressing CsSUP and inhibiting auxin transport in cucumber. Plant Cell. 2019; 31:1289-307

[20]

Niu H, Liu X, Tong C. et al. The WUSCHEL-related home-obox1 gene of cucumber regulates reproductive organ devel-opment. J Exp Bot. 2018; 69:5373-87

[21]

Cheng F, Song M, Zhang M. et al. A SNP mutation in the CsCLAVATA1 leads to pleiotropic variation in plant architec-ture and fruit morphogenesis in cucumber (Cucumis sativus L.). Plant Sci. 2022; 323:111397

[22]

Lai W, Zhou Y, Pan R. et al. Identification and expression anal-ysis of stress-associated proteins (SAPs) containing A20/AN1 zinc finger in cucumber. Plants (Basel). 2020; 9:400

[23]

Renner SS, Schaefer H, Kocyan A. Phylogenetics of Cucumis (Cucurbitaceae): cucumber (C. sativus) belongs in an Asian/Australian clade far from melon (C. melo). BMC Evol Biol. 2007; 7:58

[24]

Lv J, Qi J, Shi Q. et al. Genetic diversity and population structure of cucumber (Cucumis sativus L.). PLoS One. 2012; 7: e46919

[25]

Yu B, Ming F, Liang Y. et al. Heat stress resistance mechanisms of two cucumber varieties from different regions. Int J Mol Sci. 2022; 23:1817

[26]

Huang H, Yang Q, Zhang L. et al. Genome-wide association analysis reveals a novel QTL CsPC1 for pericarp color in cucumber. BMC Genomics. 2022; 23:383

[27]

Li Q, Li H, Huang W. et al. A chromosome-scale genome assembly of cucumber (Cucumis sativus L.). GigaScience. 2019;8:giz072

[28]

Dong S, Liu X, Han J. et al. CsMLO8/11 are required for full susceptibility of cucumber stem to powdery mildew and interact with CsCRK2 and CsRbohD. Hortic Res. 2023;11: uhad295

[29]

Tian H, Wang X, Guo H. et al. NTL8 regulates trichome forma-tion in Arabidopsis by directly activating R3 MYB genes TRY and TCL11. Plant Physiol. 2017; 174:2363-75

[30]

Downes BP, Stupar RM, Gingerich DJ. et al. The HECT ubiquitin-protein ligase (UPL) family in Arabidopsis: UPL3 has a specific role in trichome development. Plant J. 2003; 35: 729-42

[31]

Wang W, Zhang Y, Xu C. et al. Cucumber ECERIFERUM1 (CsCER1), which influences the cuticle properties and drought tolerance of cucumber, plays a key role in VLC alkanes biosyn-thesis. Plant Mol Biol. 2015; 87:219-33

[32]

Manabe Y, Nafisi M, Verhertbruggen Y. et al. Loss-of-function mutation of REDUCED WALL ACETYLATION2 in Arabidopsis leads to reduced cell wall acetylation and increased resistance to Botrytis cinerea. Plant Physiol. 2011; 155:1068-78

[33]

Chen L, Xiao J, Li Y. et al. The Raf-like MAPKKKs STY8, STY17, and STY46 negatively regulate Botrytis cinerea resistance by limiting MKK7 protein accumulation in Arabidopsis. Plant J. 2024; 117:1503-16

[34]

Chovelon V, Feriche-Linares R, Barreau G. et al. Building a cluster of NLR genes conferring resistance to pests and pathogens: the story of the Vat gene cluster in cucurbits. Hor-tic Res. 2021; 8:72

[35]

McHale LK, Haun WJ, Xu WW. et al. Structural variants in the soybean genome localize to clusters of biotic stress-response genes. Plant Physiol. 2012; 159:1295-308

[36]

Hirochika H, Sugimoto K, Otsuki Y. et al. Retrotransposons of rice involved in mutations induced by tissue culture. Proc Natl AcadSciUSA. 1996; 93:7783-8

[37]

Komatsu M, Shimamoto K, Kyozuka J. Two-step regulation and continuous retrotransposition of the rice LINE-type retro-transposon karma. Plant Cell. 2003; 15:1934-44

[38]

Cingolani P, Platts A, Wang LL. et al. A program for annotat-ing and predicting the effects of single nucleotide polymor-phisms, SnpEff. Fly (Austin). 2012; 6:80-92

[39]

Chen N, Wang P, Li C. et al. A single nucleotide mutation of the IspE gene participating in the MEP pathway for isoprenoid biosynthesis causes a green-revertible yellow leaf phenotype in rice. Plant Cell Physiol. 2018; 59:1905-17

[40]

Zhang J, Yang J, Zhang L. et al. A new SNP genotyping tech-nology Target SNP-seq and its application in genetic analysis of cucumber varieties. Sci Rep. 2020; 10:5623

[41]

Qi J, Liu X, Shen D. et al. A genomic variation map provides insights into the genetic basis of cucumber domestication and diversity. Nat Genet. 2013; 45:1510-5

[42]

Huang H, du Y, Long Z. et al. Fine mapping of a novel QTL CsFSG1 for fruit skin gloss in cucumber (Cucumis sativus L.). Mol Breed. 2022; 42:25

[43]

LeVan TD, Guerra S, Klimecki W. et al. The impact of CD 14 polymorphisms on the development of soluble CD14 levels during infancy. Genes Immun. 2006; 7:77-80

[44]

Nie S, Wang B, Ding H. et al. Genome assembly of the Chinese maize elite inbred line RP125 and its EMS mutant collection provide new resources for maize genetics research and crop improvement. Plant J. 2021; 108:40-54

[45]

Tang S, Liu DX, Lu S. et al. Development and screening of EMS mutants with altered seed oil content or fatty acid composi-tion in Brassica napus. Plant J. 2020; 104:1410-22

[46]

Zhang T, Dong X, Yuan X. et al. Identification and characteri-zation of CsSRP43, a major gene controlling leaf yellowing in cucumber. Hortic Res. 2022;9:uhac212

[47]

Pendleton M, Sebra R, Pang AWC. et al. Assembly and diploid architecture of an individual human genome via single-molecule technologies. Nat Methods. 2015; 12:780-6

[48]

Grob S, Schmid MW, Grossniklaus U. Hi-C Analysis in Arabidop-sis identifies the KNOT, a structure with similarities to the flamenco locus of Drosophila. Mol Cell. 2014; 55:678-93

[49]

Cheng H, Concepcion GT, Feng X. et al. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods. 2021; 18:170-5

[50]

Durand NC, Shamim MS, Machol I. et al. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 2016; 3:95-8

[51]

Dudchenko O, Batra SS, Omer AD. et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science. 2017; 356:92-5

[52]

Robinson JT, Turner D, Durand NC. et al. Juicebox.js provides a cloud-based visualization system for Hi-C data. Cell Syst. 2018; 6:256-258.e1

[53]

Manni M, Berkeley MR, Seppey M. et al. BUSCO update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol Biol Evol. 2021; 38:4647-54

[54]

Manni M, Berkeley MR, Seppey M. et al. BUSCO: assessing genomic data quality and beyond. Curr Protoc. 2021; 1:e323

[55]

Delcher AL, Salzberg SL, Phillippy AM. Using MUMmer to iden-tify similar regions in large sequence sets. Curr Protoc Bioin-formatics. 2003; 00:10.3.1-10.3.18

[56]

Marçais G, Delcher AL, Phillippy AM. et al. MUMmer4: a fast and versatile genome alignment system. PLoS Comput Biol. 2018; 14:e1005944

[57]

Goel M, Sun H, Jiao W-B. et al. SyRI: finding genomic rear-rangements and local sequence differences from whole-genome assemblies. Genome Biol. 2019; 20:277

[58]

Flynn JM, Hubley R, Goubert C. et al. RepeatModeler2 for auto-mated genomic discovery of transposable element families. Proc Natl Acad SciUSA. 2020; 117:9451-7

[59]

Tarailo-Graovac M, Chen N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr Protoc Bioin-formatics. 2009; 25:4.10.1-4.10.14

[60]

Li Z, Zhang Z, Yan P. et al. RNA-Seq improves annota-tion of protein-coding genes in the cucumber genome. BMC Genomics. 2011; 12:540

[61]

Grabherr MG, Haas BJ, Yassour M. et al. Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data. Nat Biotechnol. 2011; 29:644-52

[62]

Haas BJ, Delcher AL, Mount SM. et al. Improving the Arabidop-sis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 2003; 31:5654-66

[63]

Brůna T, Lomsadze A, Borodovsky M. GeneMark-EP+: eukary-otic gene prediction with self-training in the space of genes and proteins. NAR Genom Bioinform. 2020;2:lqaa026

[64]

Keilwagen J, Hartung F, Grau J. GeMoMa: homology-based gene prediction utilizing intron position conservation and RNA-seq data. Methods Mol Biol. 2019; 1962:161-77

[65]

Holt C, Yandell M. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinformatics. 2011; 12:491

[66]

Buchfink B, Reuter K, Drost H-G. Sensitive protein align-ments at tree-of-life scale using DIAMOND. Nat Methods. 2021; 18:366-8

[67]

Jones P, Binns D, Chang HY. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics. 2014; 30: 1236-40

[68]

Kanehisa M, Sato Y, Morishima K. BlastKOALA and GhostKOALA: KEGG tools for functional characterization of genome and metagenome sequences. J Mol Biol. 2016; 428: 726-31

[69]

Calle García J, Guadagno A, Paytuvi-Gallart A. et al. PRGdb 4.0:an updated database dedicated to genes involved in plant disease resistance process. Nucleic Acids Res. 2021;50: D1483-90

[70]

Li WH, Wu CI, Luo CC. A new method for estimating synony-mous and nonsynonymous rates of nucleotide substitution considering the relative likelihood of nucleotide and codon changes. Mol Biol Evol. 1985; 2:150-74

[71]

Retief JD. Phylogenetic analysis using PHYLIP. In: Misener S, Krawetz SA,eds. Bioinformatics Methods and Protocols. Humana Press: New Jersey, 1999,243-58

[72]

Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018; 34:3094-100

[73]

Hao Z, Lv D, Ge Y. et al. RIdeogram: drawing SVG graphics to visualize and map genome-wide data on the idiograms. PeerJ Comput Sci. 2020; 6:e251

[74]

Xu Z, Wang H. LTR_FINDER: an efficient tool for the predic-tion of full-length LTR retrotransposons. Nucleic Acids Res. 2007;35:W265-8

[75]

Ellinghaus D, Kurtz S, Willhoeft U. LTRharvest, an efficient and flexible software for de novo detection of LTR retrotrans-posons. BMC Bioinformatics. 2008; 9:18

[76]

Ou S, Jiang N. LTR_retriever: a highly accurate and sensitive program for identification of long terminal repeat retrotrans-posons. Plant Physiol. 2018; 176:1410-22

[77]

Benson G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 1999; 27:573-80

[78]

Lin Y, Ye C, Li X. et al. quarTeT: a telomere-to-telomere toolkit for gap-free genome assembly and centromeric repeat iden-tification. Hortic Res. 2023;10:uhad127

[79]

Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009; 25: 1754-60

[80]

Li H, Handsaker B, Wysoker A. et al. The sequence alignmen-t/map format and SAMtools. Bioinformatics. 2009; 25:2078-9

[81]

Genovese G, Rockweiler NB, Gorman BR. et al. BCFtool-s/liftover: an accurate and comprehensive tool to convert genetic variants across genome assemblies. Bioinformatics. 2024;40:btae038

[82]

Rausch T, Zichner T, Schlattl A. et al. DELLY: structural variant discovery by integrated paired-end and split-read analysis. Bioinformatics. 2012;28:i333-9

[83]

Lichtenthaler HK. Chlorophylls and carotenoids:pigments of photosynthetic biomembranes. In: Packer L (ed.), Methods in Enzymology, Vol. 148. Plant Cell Membranes. San Diego: Academic Press, 1987,350-82

[84]

Sun W, Li X, Huang H. et al. Mutation of CsARC6 affects fruit color and increases fruit nutrition in cucumber. Theor Appl Genet. 2023; 136:111

[85]

Lichtenthaler HK, Buschmann C. Chlorophylls and carotenoids:measurement and characterization by UV-VIS spectroscopy. In: Wrolstad RE, et al. (ed.), Current Protocols in Food Analytical Chemistry. New York: John Wiley & Sons, 2001,F4.3.1-8

[86]

McKenna A, Hanna M, Banks E. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010; 20:1297-303

[87]

Hill JT, Demarest BL, Bisgrove BW. et al. MMAPPR: mutation mapping analysis pipeline for pooled RNA-seq. Genome Res. 2013; 23:687-97

[88]

Xi X, Wei K, Gao B. et al. BrFLC5: a weak regulator of flow-ering time in Brassica rapa. Theor Appl Genet. 2018; 131: 2107-16

[89]

Fang L, Wei XY, Liu LZ. et al. A tobacco ringspot virus-based vector system for gene and microRNA function studies in cucurbits. Plant Physiol. 2021; 186:853-64

[90]

Xu S, Wang Y, Yang S. et al. The WUSCHEL-related homeobox transcription factor CsWOX3 negatively regulates fruit spine morphogenesis in cucumber ( Cucumis sativus L.). Hortic Res. 2024;11:uhae163

PDF (2466KB)

432

Accesses

0

Citation

Detail

Sections
Recommended

/