A haplotype-resolved chromosome-level genome assembly of autotetraploid Chinese yam (Dioscorea polystachya) elucidates dioscin biosynthesis and regulation

Nan Shan , Yao Xiao , Tianyao Li , Putao Wang , Asjad Ali , Jingyu Sun , Shenglin Wang , Qianglong Zhu , Tianxu Cao , Sha Luo , Jiali Lin , Zihao Li , Qinghong Zhou , Yingjin Huang

Horticulture Research ›› 2026, Vol. 13 ›› Issue (3) : 344

PDF (2985KB)
Horticulture Research ›› 2026, Vol. 13 ›› Issue (3) :344 DOI: 10.1093/hr/uhaf344
Articles
research-article
A haplotype-resolved chromosome-level genome assembly of autotetraploid Chinese yam (Dioscorea polystachya) elucidates dioscin biosynthesis and regulation
Author information +
History +
PDF (2985KB)

Abstract

Chinese yam (Dioscorea polystachya) is extensively cultivated for nutritional and medicinal applications. However, the lack of a high-quality reference genome has hindered molecular genetic analysis and breeding advancements. Here, we present a haplotype-resolved chromosome-level assembly for this autotetraploid species, featuring a 1.56-Gb genome anchored to 80 chromosomes across four haplotypes and comprising 95 668 protein-coding genes. Following divergence from Dioscorea alata about 4.64 million years ago (Mya), D. polystachya underwent a specific whole-genome duplication ∼1.42 Mya, resulting in an autotetraploid species without subgenomic dominance. Notably, the biosynthetic pathway genes of dioscin, an important steroidal saponin primarily accumulating in tubers, were generally over-retained in D. polystachya compared to the diploid species D. alata. Of these genes, 7-dehydrocholesterol reductase (Dp7-DR) promoted the accumulation of dioscin, exhibiting tuber-specific expression and strong inducibility by abscisic acid, based on transcriptome and gene function analyses. We determined that the transcription factor DpbZIP12 activates Dp7-DR transcription, as supported by yeast one-hybrid, dual-luciferase reporter, and electrophoretic mobility shift assays. Notably, overexpressing Dp7-DR or DpbZIP12 resulted in lower cholesterol levels and elevated dioscin levels, while silencing either gene produced opposite metabolic profiles. These findings delineate promising targets for manipulating dioscin content and expand genetic resources for enhancing yam nutritional quality.

Cite this article

Download citation ▾
Nan Shan, Yao Xiao, Tianyao Li, Putao Wang, Asjad Ali, Jingyu Sun, Shenglin Wang, Qianglong Zhu, Tianxu Cao, Sha Luo, Jiali Lin, Zihao Li, Qinghong Zhou, Yingjin Huang. A haplotype-resolved chromosome-level genome assembly of autotetraploid Chinese yam (Dioscorea polystachya) elucidates dioscin biosynthesis and regulation. Horticulture Research, 2026, 13 (3) : 344 DOI:10.1093/hr/uhaf344

登录浏览全文

4963

注册一个新账户 忘记密码

Acknowledgements

This work was supported by grants from the National Natural Science Foundation of China (32460767 to Q.Z. and 32402560 to N.S.), the Jiangxi Province Key Research and Development Program (20232BBF60007 to N.S.), the Jiangxi Province Natural Science Foundation Project (20224BAB205024 to N.S.), and the Jiangxi Agriculture Research System (JXARS-02 to Q.Z.).

Author contributions

Y.H. and Q.Z. conceived and designed the experiments. N.S., Y.X., and T.L. performed most of the experiments. J.S. extracted the genomic DNA. N.S., Y.X., T.L., and S.W. conducted genome assem-blies, genome annotations, and repeat content analysis. N.S., P.W., Q.Z., and T.C. performed the RNA-seq, mapping, and data process-ing. A.A., S.L., J.L., and Z.L. commented on the manuscript. N.S., Y.X., Y.H., and Q.Z. wrote the paper with input from all authors.

Data availability

The assembled genome sequences are available under accession code PRJCA034592 at the Genome Warehouse (GWH) of the China National Center for Bioinformation (CNCB-NGDC, https://ngdc.cncb.ac.cn/gwh/), and all transcriptome raw sequence read data were uploaded to the China National GeneBank DataBase (CNGBdb, https://db.cngb.org/) website with the accession number CNP0008466. For comparative analysis, the genome of D. alata was obtained from YamBase (https://yambase.org/), and the gene, protein, and annotation files for D. rotundata were sourced from http://genome-e.ibrc.or.jp/home/bioinformatics-team/yam.

Conflicts of interest statement

The authors declare that they have no conflicts of interest.

Supplementary material

Supplementary material is available at Horticulture Research online.

References

[1]

Shan N, Wang PT, Zhu QL. et al. Comprehensive characterization of yam tuber nutrition and medicinal quality of Dioscorea opposita and D. alata from different geographic groups in China. J Integr Agric. 2020; 19:2839-48

[2]

Chen H, Hu Z, Liu D. et al. Composition and physicochemical properties of three Chinese yam (Dioscorea opposita Thunb.) starches: a comparison study. Molecules. 2019; 24:2973

[3]

Liu Y, Li H, Fan Y. et al. Antioxidant and antitumor activities of the extracts from Chinese yam (Dioscorea opposita Thunb.) flesh and peel and the effective compounds. J Food Sci. 2016;81:H1553-64

[4]

Wang P, Shan N, Ali A. et al. Comprehensive evaluation of functional components, biological activities, and minerals of yam species (Dioscorea polystachya and D. alata) from China. LWT-Food Sci Technol. 2022; 168:113964

[5]

Cheng J, Chen J, Liu X. et al. The origin and evolution of the diosgenin biosynthetic pathway in yam. Plant Commun. 2021; 2:100197

[6]

Li Y, Tan C, Li Z. et al. The genome of Dioscorea zingiberensis sheds light on the biosynthesis, origin and evolution of the medicinally important diosgenin saponins. Hortic Res. 2022;9:uhac165

[7]

Sonawane, Pollier J, Panda S. et al. Plant cholesterol biosynthetic pathway overlaps with phytosterol metabolism. Nat Plants. 2016; 3:16205

[8]

Christ B, Xu C, Xu M. et al. Repeated evolution of cytochrome P450-mediated spiroketal steroid biosynthesis in plants. Nat Commun. 2019; 10:3206

[9]

Nakayasu M, Kawasaki T, Lee HJ. et al. Identification of furostanol glycoside 26-O-β-glucosidase involved in steroidal saponinbiosynthesisfrom Dioscoreaesculenta. PlantBiotechnol. 2015; 32:299-308

[10]

Ye T, Song W, Zhang JJ. et al. Identification and functional characterization of DzS3GT, a cytoplasmic glycosyltransferase catalyzing biosynthesis of diosgenin 3-O-glucoside in Dioscorea zingiberensis. Plant Cell Tissue Org. 2017; 129:399-410

[11]

Zhou C, Yang Y, Tian J. et al. 22R- but not 22S-hydroxycholesterol is recruited for diosgenin biosynthesis. Plant J. 2022; 109:940-51

[12]

Cheng H, Concepcion GT, Feng X. et al. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods. 2021; 18:170-5

[13]

Yin X, Liu J, Kou C. et al. Deciphering the network of cholesterol biosynthesis in Paris polyphylla laid a base for efficient diosgenin production in plant chassis. Metab Eng. 2023; 76:232-46

[14]

Sugihara Y, Kudoh A, Oli MT. et al. Population genomics of yams:evolution and domestication of Dioscorea species. In: Rajora OP,ed. Population Genomics:Crop Plants. Springer International Publishing: Cham, 2021,837-64

[15]

Sugihara Y, Darkwa K, Yaegashi H. et al. Genome analyses reveal the hybrid origin of the staple crop white Guinea yam (Dioscorea rotundata). Proc Natl Acad Sci U S A. 2020; 117:31987-92

[16]

Bredeson JV, Lyons JB, Oniyinde IO. et al. Chromosome evolution and the genetic basis of agronomically important traits in greater yam. Nat Commun. 2022; 13:2001

[17]

Zhang Y, Wei Z, Yang C. et al. A telomere-to-telomere genome assembly for greater yam (Dioscorea alata). Plant Commun. 2025; 6:101326

[18]

Hu K, Feng Y, Li P. et al. Haplotype-resolved genome and population genomics provide insights into dioscin biosynthesis and evolutionary history of the medicinal species Dioscorea nipponica. Plant J. 2025; 121:e17237

[19]

Arnau G, Némorin A, Maledon E. et al. Revision of ploidy status of Dioscorea alata L. (Dioscoreaceae) by cytogenetic and microsatellite segregation analysis. Theor Appl Genet. 2009; 118:1239-49

[20]

Tamiru M, Natsume S, Takagi H. et al. Genome sequencing of the staple food crop white Guinea yam enables the development of a molecular marker for sex determination. BMC Biol. 2017; 15:1-20

[21]

Cao T, Wang S, Ali A. et al. Transcriptome and metabolome analysis reveals the potential mechanism of tuber dynamic developmentinyam(Dioscoreapolystachya Turcz.). LWT-Food Sci Technol. 2023; 181:114764

[22]

Soltis PS, Soltis DE. The role of hybridization in plant speciation. Annu Rev Plant Biol. 2009; 60:561-88

[23]

Hu N, Gong J, Zhang B. Characterization of the complete chloroplast genome of Dioscorea polystachya Turcz. Mito-chondrial DNA Part B. 2021; 6:1652-3

[24]

Osborn TC, Chris Pires J, Birchler JA. et al. Understanding mechanisms of novel gene expression in polyploids. Trends Genet. 2003; 19:141-7

[25]

Sattler MC, Carvalho CR, Clarindo WR. The polyploidy and its key role in plant breeding. Planta. 2016; 243:281-96

[26]

Van de Peer Y, Mizrachi E, Marchal K. The evolutionary significance of polyploidy. Nat Rev Genet. 2017; 18:411-24

[27]

Jansky SH, Charkowski AO, Douches DS. et al. Reinventing potato as a diploid inbred line-based crop. Crop Sci. 2016; 56:1412-22

[28]

Li Z, Chen ZJ. Nonadditive gene expression and epigenetic changes in polyploid plants and crops. Adv Agron. 2022; 176:179-208

[29]

Batiru G, Lübberstedt T. Polyploidy in maize: from evolution to breeding. Theor Appl Genet. 2024; 137:182

[30]

Chao DY, Dilkes B, Luo H. etal. Polyploids exhibit higher potas-sium uptake and salinity tolerance in Arabidopsis. Science. 2013; 341:658-9

[31]

Morineau C, Bellec Y, Tellier F. et al. Selective gene dosage byCRISPR-Cas9genomeeditinginhexaploid Camelinasativa. Plant Biotechnol J. 2017; 15:729-39

[32]

Zhang X, Tang C, Jiang B. et al. Refining polyploid breeding in sweet potato through allele dosage enhancement. NatPlants. 2025; 11:36-48

[33]

Fan D, Zhong H, Hu B. et al. Agro-ecological suitability assessment of Chinese medicinal yam under future climate change. Environ Geochem Health. 2020; 42:987-1000

[34]

Freeling M. Picking up the ball at the K/Pg boundary: the distribution of ancient polyploidies in the plant phylogenetic tree as a spandrel of asexuality with occasional sex. Plant Cell. 2017; 29:202-6

[35]

Li D, Xu G, Ren G. et al. The application of ultra-high-performance liquid chromatography coupled with a LTQ-Orbitrap mass technique to reveal the dynamic accumulation of secondary metabolites in licorice under ABA stress. Molecules. 2017; 22:2115

[36]

Wu Q, Bai J, Tao X. et al. Synergistic effect of abscisic acid and ethylene on color development in tomato (Solanum lycopersicum L.) fruit. Sci Hortic. 2018; 235:169-80

[37]

Sun Q, He Z, Feng D. et al. The abscisic acid-responsive transcriptional regulatory module CsERF110-CsERF 53 orchestrates citrus fruit coloration. Plant Commun. 2024; 5:101065

[38]

Sun Q, He Z, Wei R. et al. The transcriptional regulatory module CsHB5-CsbZIP44 positively regulates abscisic acid-mediated carotenoid biosynthesis in citrus (Citrus spp.). Plant Biotechnol J. 2024; 22:722-37

[39]

Mansouri H, Asrar Z. Effects of abscisic acid on content and biosynthesis of terpenoids in Cannabis sativa at vegetative stage. Biol Plant. 2012; 56:153-6

[40]

Yuan M, Shu G, Zhou J. et al. AabHLH113 integrates jasmonic acid and abscisic acid signaling to positively regulate artemisinin biosynthesis in Artemisia annua. New Phytol. 2023; 237:885-99

[41]

Jia J, Luo Y, Wu Z. et al. OsJMJ718, a histone demethylase gene, positively regulates seed germination in rice. Plant J. 2024; 118:191-202

[42]

Zhang H, Zhao Y, Zhu JK. Thriving under stress: how plants balance growth and the stress response. Dev Cell. 2020; 55:529-43

[43]

Yamaguchi-Shinozaki K, Shinozaki K. Transcriptional regulatory networks in cellular responses and tolerance to dehydration and cold stresses. Annu Rev Plant Biol. 2006; 57:781-803

[44]

Yoshida T, Mogami J, Yamaguchi-Shinozaki K. ABA-dependent and ABA-independent signaling in response to osmotic stress in plants. Curr Opin Plant Biol. 2014; 21:133-9

[45]

Zhang F, Fu X, Lv Z. et al. A basic leucine zipper transcription factor, AabZIP1, connects abscisic acid signaling with artemisinin biosynthesis in Artemisia annua. Mol Plant. 2015; 8:163-75

[46]

Zhang Y, Xu Z, Ji A. et al. Genomic survey of bZIP transcription factor genes related to tanshinone biosynthesis in Salvia miltiorrhiza. Acta Pharm Sin B. 2018; 8:295-305

[47]

Miyamoto K, Nishizawa Y, Minami E. et al. Overexpression of the bZIP transcription factor OsbZIP79 suppresses the production of diterpenoid phytoalexin in rice cells. J Plant Phys-iol. 2015; 173:19-27

[48]

Upadhyay S, Jeena GS, Kumar S. et al. Asparagus racemosus bZIP transcription factor-regulated squalene epoxidase (ArSQE) promotes germination and abiotic stress tolerance in transgenic tobacco. Plant Sci. 2020; 290:110291

[49]

Sun S, Li Y, Jia L. et al. Identification of genetic variants controlling diosgenin content in Dioscorea zingiberensis tuber by genome-wide association study. BMC Plant Biol. 2024; 24:540

[50]

Chen H, Zeng Y, Yang Y. et al. Allele-aware chromosome-level genome assembly and efficient transgene-free genome editing for the autotetraploid cultivated alfalfa. Nat Commun. 2020; 11:2494

[51]

Guan D, McCarthy SA, Wood J. et al. Identifying and removing haplotypic duplication in primary genome assemblies. Bioin-formatics. 2020; 36:2896-8

[52]

Durand NC, Shamim MS, Machol I. et al. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 2016; 3:95-8

[53]

Durand NC, Robinson JT, Shamim MS. etal. Juiceboxprovides a visualization system for Hi-C contact maps with unlimited zoom. Cell Syst. 2016; 3:99-101

[54]

Kurtz S, Phillippy A, Delcher AL. et al. Versatile and open software for comparing large genomes. Genome Biol. 2004;5:R12

[55]

Lomsadze A, Ter-Hovhannisyan V, Chernoff Y. Gene identification in novel eukaryotic genomes by self-training algorithm. Nucleic Acids Res. 2005; 33:6494-506

[56]

Hoff KJ, Stanke M. Predicting genes in single genomes with AUGUSTUS. Curr Protoc Bioinformatics. 2019; 65:e57

[57]

Marchler-Bauer A, Lu S, Anderson JB. et al. CDD: a conserved domain database for the functional annotation of proteins. Nucleic Acids Res. 2010;39:D225-9

[58]

Koonin EV, Fedorova ND, Jackson JD. et al. A comprehensive evolutionary classification of proteins encoded in complete eukaryotic genomes. Genome Biol. 2004; 5:1-28

[59]

Dimmer EC, Huntley RP, Alam-Faruque Y. et al.The UniProt-GO annotation database in 2011. Nucleic Acids Res. 2012;40:D565-70

[60]

Bairoch A, Apweiler R.The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 2000; 28:45-8

[61]

Boeckmann B, Bairoch A, Apweiler R. et al. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 2003; 31:365-70

[62]

Haas BJ, Papanicolaou A, Yassour M. et al. De novo transcript sequence reconstruction from RNA-seq using the trinity platform for reference generation and analysis. Nat Protoc. 2013; 8:1494-512

[63]

Flynn JM, Hubley R, Goubert C. etal. RepeatModeler2 for auto-mated genomic discovery of transposable element families. Proc Natl Acad Sci U S A. 2020; 117:9451-7

[64]

Tarailo-Graovac M, Chen N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr Protoc Bioin-formatics. 2009; 4:4.10.11-14.10.14

[65]

Zhang K, Yang Y, Zhang X. et al. The genome of Orychophrag-mus violaceus provides genomic insights into the evolution of Brassicaceae polyploidization and its distinct traits. Plant Commun. 2023; 4:100431

[66]

Cheng F, Wu J, Fang L. et al. Syntenic gene analysis between Brassica rapa and other Brassicaceae species. Front Plant Sci. 2012; 3:30895

[67]

Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014; 30:923-30

[68]

Zhang Z, Li J, Zhao XQ. et al. KaKs_Calculator: calculating Ka and Ks through model selection and model averaging. Genom Proteom Bioinform. 2006; 4:259-63

[69]

Emms DM, Kelly S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 2019; 20:238

[70]

Sanderson MJ. r8s: inferring absolute rates of molecular evolution and divergence times in the absence of a molecular clock. Bioinformatics. 2003; 19:301-2

[71]

Yang Z. PAML: a program package for phylogenetic analysis by maximum likelihood. Comput Appl Biosci. 1997; 13:555-6

[72]

Mendes FK, Vanderpool D, Fulton B. et al. CAFE 5 models variation in evolutionary rates among gene families. Bioinfor-matics. 2021; 36:5516-8

[73]

Chernomor O, von Haeseler A, Minh BQ. Terrace aware data structure for phylogenomic inference from supermatrices. Syst Biol. 2016; 65:997-1008

[74]

Kim D, Langmead B, Salzberg SL. HISAT: a fast spliced aligner withlowmemoryrequirements. NatMethods. 2015; 12:357-60

[75]

Anders S, Huber W. Differential expression analysis for sequence count data. Genome Biol. 2010;11:R106

[76]

Zhang S, Yu Z, Sun L. et al. T2T reference genome assembly and genome-wide association study reveal the genetic basis of Chinese bayberry fruit quality. Hortic Res. 2024;11:uhae033

[77]

Kovaka S, Zimin AV, Pertea GM. et al. Transcriptome assembly from long-read RNA-seq alignments with StringTie2. Genome Biol. 2019; 20:1-13

[78]

Langmead B, Salzberg S. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012; 9:357-9

[79]

Mortazavi A, Williams BA, McCue K. et al. Mapping and quantifying mammalian transcriptomes by RNA-seq. Nat Methods. 2008; 5:621-8

[80]

Tarazona S, García-Alcalde F, Dopazo J. et al. Differential expression in RNA-seq: a matter of depth. Genome Res. 2011; 21:2213-23

[81]

Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016; 33:1870-4

[82]

Yuan Y, Ren S, Liu X. et al. SlWRKY35 positively regulates carotenoid biosynthesis by activating the MEP pathway in tomato fruit. New Phytol. 2022; 234:164-78

[83]

Fernandez-Pozo N, Rosli HG, Martin GB. et al. The SGN VIGS tool: user-friendly software to design virus-induced gene silencing(VIGS)constructsforfunctionalgenomics. MolPlant. 2015; 8:486-8

[84]

An JP, Wang XF, Zhang XW. et al. Apple B-box protein BBX37 regulates jasmonic acid mediated cold tolerance through the JAZ-BBX37-ICE1-CBF pathway and undergoes MIEL1-mediated ubiquitination and degradation. New Phytol. 2021; 229:2707-29

PDF (2985KB)

0

Accesses

0

Citation

Detail

Sections
Recommended

/