Deciphering octoploid strawberry evolution with serial LTR similarity matrices for subgenome partition

Haomin Lyu , Shujun Ou , Won Cheol Yim , Qingyi Yu

Horticulture Research ›› 2025, Vol. 12 ›› Issue (8) : 132

PDF (1677KB)
Horticulture Research ›› 2025, Vol. 12 ›› Issue (8) :132 DOI: 10.1093/hr/uhaf132
Articles
research-article
Deciphering octoploid strawberry evolution with serial LTR similarity matrices for subgenome partition
Author information +
History +
PDF (1677KB)

Abstract

Polyploidization has been recognized as a major force in plant evolution. With the continuous progress in sequencing technologies and genome assembly algorithms, high-quality chromosome-level assemblies of polyploid genomes have become increasingly attainable. However, accurately delineating these assemblies into subgenomes remains a challenging task, especially in cases where known diploid ancestors are absent. In this study, we introduce a novel approach that leverages long terminal repeat retrotransposons (LTR-RTs) coupled with the serial similarity matrix (SSM) method to assign genome assemblies to subgenomes, particularly beneficial for those without known diploid progenitor genomes. The SSM method helps identify subgenome-specific LTRs and facilitates the inference of the timing of allopolyploidization events. We validated the efficacy of the SSM approach using well-studied allopolyploid genomes, Eragrostis tef and Gossypium hirsutum, alongside artificially created allotetraploid genomes, GarGra and GmaGso. Our results demonstrated the robustness of the method and its effectiveness in assigning chromosomes to subgenomes. We then applied the SSM method to the octoploid strawberry genome. Our analysis revealed three allopolyploidization events in the evolutionary trajectory of the octoploid strawberry genome, shedding light on the evolutionary process of the origin of the octoploid strawberry genome and enhancing our understanding of allopolyploidization in this complex species.

Cite this article

Download citation ▾
Haomin Lyu, Shujun Ou, Won Cheol Yim, Qingyi Yu. Deciphering octoploid strawberry evolution with serial LTR similarity matrices for subgenome partition. Horticulture Research, 2025, 12(8): 132 DOI:10.1093/hr/uhaf132

登录浏览全文

4963

注册一个新账户 忘记密码

Acknowledgements

This work was supported by the National Institute of Food and Agriculture (NIFA)—Specialty Crop Research Initiative (SCRI) Grant 2022-51181-38241 to Q.Y.

Author contributions

H.L. designed and performed experiments and wrote the manuscript. S.O. and W.C.Y. revised the manuscript. Q.Y. provided supervision, guidance, and support to H.L. and supervised the project and wrote the manuscript.

Data availability

The code used to generate results is available at https://github.com/HaominLyu/SSM.git.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary data

Supplementary data is available at Horticulture Research online.

References

[1]

Ren R, Wang H, Guo C. et al. Widespread whole genome duplica-tions contribute to genome complexity and species diversity in angiosperms. Mol Plant. 2018; 11:414-28

[2]

Soltis DE, Segovia-Salcedo MC, Jordon-Thaden I. et al. Are poly-ploids really evolutionary dead-ends (again)? A critical reap-praisal of Mayrose et al. (2011). New Phytol. 2014; 202:1105-17

[3]

Van de Peer Y, Mizrachi E, Marchal K. The evolutionary signifi-cance of polyploidy. Nat Rev Genet. 2017; 18:411-24

[4]

Tang H, Bowers JE, Wang X. et al. Synteny and collinearity in plant genomes. Science. 2008; 320:486-8

[5]

Jaillon O, Aury JM, Noel B. et al. The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature. 2007; 449:463-7

[6]

Bowers JE, Chapman BA, Rong J. et al. Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature. 2003; 422:433-8

[7]

Vanneste K, Baele G, Maere S. et al. Analysis of 41 plant genomes supports a wave of successful genome duplications in asso-ciation with the Cretaceous-Paleogene boundary. Genome Res. 2014; 24:1334-47

[8]

Spoelhof JP, Soltis PS, Soltis DE. Pure polyploidy: closing the gaps in autopolyploid research. J Syst Evol. 2017; 55:340-52

[9]

Tayalé A, Parisod C. Natural pathways to polyploidy in plants and consequences for genome reorganization. Cytogenet Genome Res. 2013; 140:79-96

[10]

Wood TE, Takebayashi N, Barker MS. et al. The frequency of polyploid speciation in vascular plants. Proc Natl Acad Sci U S A. 2009; 106:13875-9

[11]

Barker MS, Arrigo N, Baniaga AE. et al. On the relative abundance of autopolyploids and allopolyploids. New Phytol. 2016; 210:391-8

[12]

Kellogg EA. Flowering plants. Monocots: Poaceae. In: Kubitzki K,ed. The Families and Genera of Vascular Plants. Springer: Cham, 2015,1-416

[13]

Stebbins GL. Polyploidy, hybridization, and the invasion of new habitats. Ann Mo Bot Gard. 1985; 72:824-32

[14]

Session AM, Uno Y, Kwon T. et al. Genome evolution in the allotetraploid frog Xenopus laevis. Nature. 2016; 538:336-43

[15]

Sehrish T, Symonds VV, Soltis DE. et al. Gene silencing via DNA methylation in naturally occurring Tragopogon miscellus (Aster-aceae) allopolyploids. BMC Genomics. 2014; 15:701

[16]

Yoo MJ, Szadkowski E, Wendel JF. Homoeolog expression bias and expression level dominance in allopolyploid cotton. Heredity. 2013; 110:171-80

[17]

Cox MP, Dong T, Shen GG. et al. An interspecific fungal hybrid reveals cross-kingdom rules for allopolyploid gene expression patterns. PLoS Genet. 2014; 10:e1004180

[18]

Van de Peer Y, Maere S, Meyer A. The evolutionary significance of ancient genome duplications. Nat Rev Genet. 2009; 10:725-32

[19]

Sierro N, Battey JND, Ouadi S. et al. The tobacco genome sequence and its comparison with those of tomato and potato. Nat Commun. 2014; 5:3833

[20]

VanBuren R, Man Wai C, Wang X. et al. Exceptional subgenome stability and functional divergence in the allotetraploid Ethiopian cereal teff. Nat Commun. 2020; 11:884

[21]

Hu Y, Chen J, Fang L. et al. Gossypium barbadense and Gossypium hirsutum genomes provide insights into the origin and evolution of allotetraploid cotton. Nat Genet. 2019; 51:739-48

[22]

Yang J, Liu D, Wang X. et al. The genome sequence of allopoly-ploid Brassica juncea and analysis of differential homoeolog gene expression influencing selection. Nat Genet. 2016; 48:1225-32

[23]

Yim WC, Swain ML, Ma D. et al. The final piece of the Triangle of U: evolution of the tetraploid Brassica carinata genome. Plant Cell. 2022; 34:4143-72

[24]

Appels R, Eversole K, Feuillet C. et al. Shifting the limits in wheat research and breeding using a fully annotated reference genome. Science. 2018;361:eaar7191

[25]

Zhang J, Zhang X, Tang H. et al. Allele-defined genome of the autopolyploid sugarcane Saccharum spontaneum L. Nat Genet. 2018; 50:1565-73

[26]

Edger PP, Poorten TJ, Vanburen R. et al. Origin and evolution of the octoploid strawberry genome. Nat Genet. 2019; 51:541-7

[27]

Scalabrin S, Magris G, Liva M. et al. A chromosome-scale assem-bly reveals chromosomal aberrations and exchanges generat-ing genetic diversity in Coffea arabica germplasm. Nat Commun. 2024; 15:463

[28]

Scalabrin S, Toniutti L, Di Gaspero G. et al. A single polyploidiza-tion event at the origin of the tetraploid genome of Coffea arabica is responsible for the extremely low genetic variation in wild and cultivated germplasm. Sci Rep. 2020; 10:4642

[29]

Darrow GM. The Strawberry; History Breeding and Physiology.1st ed. 1st ed. New York: Holt Rinehart and Winston; 1966:

[30]

Sargent DJ, Yang Y, Šurbanovski N. et al. HaploSNP affinities and linkage map positions illuminate subgenome composition in the octoploid, cultivated strawberry (Fragaria × ananassa). Plant Sci. 2016; 242:140-50

[31]

Senanayake YDA, Bringhurst RS. Origin of Fragaria polyploids. I. Cytological analysis. Am J Bot. 1967; 54:221-8

[32]

Tennessen JA, Govindarajulu R, Ashman T-L. et al. Evolution-ary origins and dynamics of octoploid strawberry subgenomes revealed by dense targeted capture linkage maps. Genome Biol Evol. 2014; 6:3295-313

[33]

Yang Y, Davis TM. A new perspective on polyploid Fragaria (strawberry) genome composition based on large-scale, multi-locus phylogenetic analysis. Genome Biol Evol. 2017; 9:3433-48

[34]

Bringhurst RS. Cytogenetics and evolution in American Fragaria. HortScience. 1990; 25:879-81

[35]

Kamneva OK, Syring J, Liston A. et al. Evaluating allopolyploid origins in strawberries (Fragaria) using haplotypes generated from target capture sequencing. BMC Evol Biol. 2017; 17:180

[36]

DiMeglio LM, Staudt G, Yu H. et al. A phylogenetic analysis of the genus Fragaria (strawberry) using intron-containing sequence from the ADH-1 gene. PLoS One. 2014; 9:e102237

[37]

Fedorova NJ. Crossability and phylogenetic relations in the main European species of Fragaria. Dokl Akad Nauk SSSR. 1946; 52: 545-7

[38]

Rousseau-Gueutin M, Gaston A, Aïnouche A. et al. Tracking the evolutionary history of polyploidy in Fragaria L. (strawberry): new insights from phylogenetic analyses of low-copy nuclear genes. Mol Phylogenet Evol. 2009; 51:515-30

[39]

Liston A, Wei N, Tennessen JA. et al. Revisiting the origin of octoploid strawberry. Nat Genet. 2020; 52:2-4

[40]

Feng C, Wang J, Harris A. et al. Tracing the diploid ancestry of the cultivated octoploid strawberry. Mol Biol Evol. 2021; 38: 478-85

[41]

Martelossi J, Nicolini F, Subacchi S. et al. Multiple and diversi-fied transposon lineages contribute to early and recent bivalve genome evolution. BMC Biol. 2023; 21:145

[42]

Stapley J, Santure AW, Dennis SR. Transposable elements as agents of rapid adaptation may explain the genetic paradox of invasive species. Mol Ecol. 2015; 24:2241-52

[43]

Lanciano S, Mirouze M. Transposable elements: all mobile, all different, some stress responsive, some adaptive? Curr Opin Genet Dev. 2018; 49:106-14

[44]

Lisch D. How important are transposons for plant evolution? Nat Rev Genet. 2013; 14:49-61

[45]

Belyayev A. Bursts of transposable elements as an evolutionary driving force. J Evol Biol. 2014; 27:2573-84

[46]

Jurka J, Bao W, Kojima KK. Families of transposable elements, population structure and the origin of species. Biol Direct. 2011; 6:44

[47]

Wang L, Sun X, Peng Y. et al. Genomic insights into the origin, adaptive evolution, and herbicide resistance of Leptochloa chinen-sis, a devastating tetraploid weedy grass in rice fields. Mol Plant. 2022; 15:1045-58

[48]

Session AM, Rokhsar DS. Transposon signatures of allopolyploid genome evolution. Nat Commun. 2023; 14:3180

[49]

Jia KH, Wang ZX, Wang L. et al. SubPhaser: a robust allopolyploid subgenome phasing method based on subgenome-specific k-mers. New Phytol. 2022; 235:801-9

[50]

Hawkins JS, Proulx SR, Rapp RA. et al. RapidDNA lossasa counterbalance to genome expansion through retrotransposon proliferation in plants. Proc Natl Acad Sci U S A. 2009; 106:17811-6

[51]

Bourque G, Burns KH, Gehring M. et al. Ten things you should know about transposable elements. Genome Biol. 2018; 19: 199

[52]

Lyu H, He Z, Wu CI. et al. Convergent adaptive evolution in marginal environments: unloading transposable elements as a common strategy among mangrove genomes. New Phytol. 2018; 217:428-38

[53]

Pereira V. Insertion bias and purifying selection of retro-transposons in the Arabidopsis thaliana genome. Genome Biol. 2004;5:R79

[54]

Seehausen O, Butlin RK, Keller I. et al. Genomics and the origin of species. Nat Rev Genet. 2014; 15:176-92

[55]

Feder JL, Egan SP, Nosil P. The genomics of speciation-with-gene-flow. Trends Genet. 2012; 28:342-50

[56]

Wu CI, Ting CT. Genes and speciation. Nat Rev Genet. 2004; 5: 114-22

[57]

Steige KA, Slotte T. Genomic legacies of the progenitors and the evolutionary consequences of allopolyploidy. Curr Opin Plant Biol. 2016; 30:88-93

[58]

Parisod C, Alix K, Just J. et al. Impact of transposable elements on the organization and function of allopolyploid genomes. New Phytol. 2010; 186:37-45

[59]

Edger PP, McKain MR, Bird KA. et al. Subgenome assignment in allopolyploids: challenges and future directions. Curr Opin Plant Biol. 2018; 42:76-80

[60]

Ou S, Jiang N. LTR_retriever: a highly accurate and sensitive pro-gram for identification of long terminal repeat retrotransposons. Plant Physiol. 2018; 176:1410-22

[61]

Xie M, Chung CY, Li M. et al. A reference-grade wild soybean genome. Nat Commun. 2019; 10:1216

[62]

Schmutz J, Cannon SB, Schlueter J. et al. Genome sequence of the palaeopolyploid soybean. Nature. 2010; 463:178-83

[63]

Sedivy EJ, Wu F, Hanzawa Y. Soybean domestication: the origin, genetic architecture and molecular bases. New Phytol. 2017; 214: 539-53

[64]

Kim MY, Lee S, Van K. et al. Whole-genome sequencing and intensive analysis of the undomesticated soybean (Glycine soja Sieb. and Zucc.) genome. Proc Natl Acad Sci U S A. 2010; 107: 22032-7

[65]

Dillenberger MS, Wei N, Tennessen JA. et al. Plastid genomes reveal recurrent formation of allopolyploid Fragaria. Am J Bot. 2018; 105:862-74

[66]

Fan Z, Liston A, Soltis D. et al. Homoploid hybridization resolves the origin of octoploid strawberries. bioRxiv. 2024. https://doi.org/10.1101/2024.09.12.612680

[67]

Qiao Q, Edger PP, Xue L. et al. Evolutionary history and pan-genome dynamics of strawberry (Fragaria spp.). Proc Natl Acad Sci. 2021; 118:e2105431118

[68]

Shulaev V, Sargent DJ, Crowhurst RN. et al. The genome of woodland strawberry (Fragaria vesca). Nat Genet. 2011; 43:109-16

[69]

Edger PP, McKain MR, Yocca AE. et al. Reply to: revisiting the origin of octoploid strawberry. Nat Genet. 2020; 52:5-7

[70]

Njuguna W, Liston A, Cronn R. et al. Insights into phylogeny, sex function and age of Fragaria based on whole chloroplast genome sequencing. Mol Phylogenet Evol. 2013; 66:17-29

[71]

Session AM. Allopolyploid subgenome identification and impli-cations for evolutionary analysis. Trends Genet. 2024; 40:621-31

[72]

Liu B, Poulsen EG, Davis TM. Insight into octoploid strawberry (Fragaria) subgenome composition revealed by GISH analysis of pentaploid hybrids. Genome. 2016; 59:79-86

[73]

Wei N, Tennessen JA, Liston A. et al. Present-day sympatry belies the evolutionary origin of a high-order polyploid. New Phytol. 2017; 216:279-90

[74]

Estep MC, McKain MR, Vela Diaz D. et al. Allopolyploidy, diversi-fication, and the Miocene grassland expansion. Proc Natl Acad Sci USA. 2014; 111:15149-54

[75]

Ellinghaus D, Kurtz S, Willhoeft U. LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinform. 2008; 9:18

[76]

Xu Z, Wang H. LTR_FINDER: an efficient tool for the pre-diction of full-length LTR retrotransposons. Nucleic Acids Res. 2007;35:W265-8

[77]

Wang M, Li J, Qi Z. et al. Genomic innovation and regulatory rewiring during evolution of the cotton genus Gossypium. Nat Genet. 2022; 54:1959-71

[78]

Huang G, Bao Z, Feng L. et al. A telomere-to-telomere cotton genome assembly reveals centromere evolution and a Mutator transposon-linked module regulating embryo development. Nat Genet. 2024; 56:1953-63

[79]

Yu J, Jung S, Cheng CH. et al. CottonGen: a genomics, genetics and breeding database for cotton research. Nucleic Acids Res. 2014;42:D1229-36

[80]

Grant D, Nelson RT, Cannon SB. et al.SoyBase, the USDA-ARS soybean genetics and genomics database. Nucleic Acids Res. 2009;38:D843-6

[81]

Suzuki R, Shimodaira H. Pvclust: an R package for assessing the uncertainty in hierarchical clustering. Bioinformatics. 2006; 22: 1540-2

[82]

Venables WN, Ripley BD. Modern Applied Statistics with S. 4th ed. New York: Springer; 2002:

[83]

Jung S, Lee T, Cheng CH. et al.15 years of GDR: new data and functionality in the genome database for Rosaceae. Nucleic Acids Res. 2019;47:D1137-45

[84]

Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009; 25:1754-60

[85]

Li H, Handsaker B, Wysoker A. et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009; 25: 2078-9

[86]

McKenna A, Hanna M, Banks E. et al. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010; 20:1297-303

[87]

Tang H, Krishnakumar V, Zeng X. et al. JCVI: a versatile toolkit for comparative genomics analysis. iMeta. 2024; 3:e211

[88]

Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004; 32: 1792-7

[89]

Wang D, Zhang Y, Zhang Z. et al. KaKs_Calculator 2.0: a toolkit incorporating gamma-series methods and sliding window strategies. Genomics Proteomics Bioinformatics. 2010; 8: 77-80

[90]

Price MN, Dehal PS, Arkin AP. FastTree 2—approximately maximum-likelihood trees for large alignments. PLoS One. 2010; 5:e9490

[91]

Bouckaert RR. DensiTree: making sense of sets of phylogenetic trees. Bioinformatics. 2010; 26:1372-3

[92]

Chen C, Chen H, Zhang Y. et al. TBtools: an integrative toolkit developed for interactive analyses of big biological data. Mol Plant. 2020; 13:1194-202

PDF (1677KB)

236

Accesses

0

Citation

Detail

Sections
Recommended

/