Telomere-to-telomere and gap-free genome assembly of a susceptible grapevine species (Thompson Seedless) to facilitate grape functional genomics

Xianhang Wang, Mingxing Tu, Ya Wang, Yali Zhang, Wuchen Yin, Jinghao Fang, Min Gao, Zhi Li, Wei Zhan, Yulin Fang, Junyang Song, Zhumei Xi, Xiping Wang

Horticulture Research ›› 2024, Vol. 11 ›› Issue (1) : 260.

DOI: 10.1093/hr/uhad260

Abstract

Grapes are among the world's most economically important fruit crops. Among grape varieties, Thompson Seedless is especially important for fresh consumption and is also widely used in winemaking, drying, and juicing. This variety is also one of the most efficient genotypes for grape genetic modification. However, the lack of a high-quality genome has impeded effective breeding efforts. Here, we present a high-quality reference genome of Thompson Seedless in which all 19 chromosomes are represented as 19 contiguous sequences (N50 = 27.1 Mb) with zero gaps, and in which all telomeres and centromeres are predicted. Compared with the previous assembly (TSv1 version), the new assembly incorporates an additional 31.5 Mb of high-quality sequence, and a total of 30 397 protein-coding genes were annotated. We also identified nucleotide-binding leucine-rich repeat genes (NLRs) in Thompson Seedless and in two wild grape varieties renowned for their disease resistance. This analysis revealed that Thompson Seedless has significantly fewer NLRs of two types, TIR-NB-LRR (TNL) and CC-NB-LRR (CNL), which may have led to its sensitivity to many fungal diseases, such as powdery mildew, and more NLRs of a third type, RPW8 (resistance to powdery mildew 8)-NB-LRR (RNL). Transcriptome analysis further showed significant enrichment of NLRs during powdery mildew infection, emphasizing the pivotal role of these genes in grapevine’s defense against powdery mildew. The high-quality Thompson Seedless reference genome contributes to grape genomics research, provides insight into seedlessness, disease resistance, and color traits, and can be used to facilitate grape molecular breeding efforts.
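For readers unfamiliar with the N50 statistic quoted in the abstract, the following is a minimal sketch of how N50 is conventionally computed from a list of sequence lengths. It is not the authors' pipeline; the function name `n50` and the example lengths are hypothetical and are not the Thompson Seedless chromosome lengths.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L
    together cover at least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)  # longest first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical sequence lengths in Mb (illustrative only).
example_lengths = [30, 27, 25, 20, 10]
print(n50(example_lengths))
```

In a gap-free assembly where each chromosome is a single contiguous sequence, the contig N50 and the chromosome-level N50 coincide, which is why a single value is reported.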

Cite this article

Xianhang Wang, Mingxing Tu, Ya Wang, Yali Zhang, Wuchen Yin, Jinghao Fang, Min Gao, Zhi Li, Wei Zhan, Yulin Fang, Junyang Song, Zhumei Xi, Xiping Wang. Telomere-to-telomere and gap-free genome assembly of a susceptible grapevine species (Thompson Seedless) to facilitate grape functional genomics. Horticulture Research, 2024, 11(1): 260 https://doi.org/10.1093/hr/uhad260
