Chromosome-scale and haplotype-resolved genome assembly of Populus trichocarpa

Wentao Gao , Sui Wang , Tao Jiang , Heng Hu , Runtian Gao , Murong Zhou , Guohua Wang

Horticulture Research ›› 2025, Vol. 12 ›› Issue (4) : 12

PDF (1564KB)
Horticulture Research ›› 2025, Vol. 12 ›› Issue (4) :12 DOI: 10.1093/hr/uhaf012
ARTICLES
Chromosome-scale and haplotype-resolved genome assembly of Populus trichocarpa
Author information +
History +
PDF (1564KB)

Abstract

Populus trichocarpa, a pivotal model organism for woody transgenic research, not only garners substantial scientific interest but plays an integral role in forestry economics. Previous genomic assemblies of P. trichocarpa predominantly treated its heterozygous genome as homozygous, thereby neglecting crucial haplotypic diversity. Leveraging the high-fidelity (HiFi) sequencing capabilities of PacBio sequencing and the chromosome conformation capture insights provided by Illumina's Hi-C technique, this study is the first to achieve a near telomere-to-telomere assembly of both paternal and maternal haplotypes in P. trichocarpa. Comparative genomic analysis between these haplotypes has uncovered several allelic variants and pathways critical for trait determination through allele-specific expression. Furthermore, utilizing RNA-seq data from multiple tissues, this investigation has detailed the tissue-specific expression patterns of the leucine-rich repeat gene family, which are essential in mediating plant signal transduction and developmental regulation. Our results not only illuminate the functional genomics landscape of P. trichocarpa but also provide invaluable theoretical underpinnings for the genetic improvement of woody plants and a robust framework for exploring genetic variability and allelic expression disparities in arboreal species.

Cite this article

Download citation ▾
Wentao Gao, Sui Wang, Tao Jiang, Heng Hu, Runtian Gao, Murong Zhou, Guohua Wang. Chromosome-scale and haplotype-resolved genome assembly of Populus trichocarpa. Horticulture Research, 2025, 12(4): 12 DOI:10.1093/hr/uhaf012

登录浏览全文

4963

注册一个新账户 忘记密码

Acknowledgments

This work was supported by National Natural Science Foundation of China (Nos 62225109, 62072095) and the Fundamental Research Funds for the Central Universities (Nos 2572022BD04,HIT.BRET.2022003).

Author Contributions

G.H.W. and S.W. initiated and designed the research. W.T.G., G.H.W, S.W., T.J., H.H., R.T.G. and M.R.Z performed the data processing and experiment. W.T.G. wrote the paper. G.H.W, S.W., T.J. were involved in finalizing the manuscript.

Data availability

The whole genome sequence data reported in this paper have been deposited in the Genome Warehouse in National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences/China National Center for Bioinformation, under accession number GWHERCM00000000 and GWHERCL00000000 that is publicly accessible at https://ngdc.cncb.ac.cn/gwh. The raw sequences are available at https://bigd.big.ac.cn/gsa/browse/CRA015670. The supplemental files can be downloaded at: http://www.wangsui.net.cn/resource/database/public/plant/Populus/trichocarpa/Supplemental_Files.zip

Conflict of interest statement

The authors declare no competing interest.

Supplementary Data

Supplementary data is available at Horticulture Research online.

References

[1]

Tuskan GA, Difazio S, Jansson S. et al. The genome of black cottonwood, Populus trichocarpa (torr. & gray). Science. 2006; 313: 1596-604

[2]

Yates TB, Feng K, Zhang J. et al. The ancient Salicoid genome duplication event: a platform for reconstruction of De novo gene evolution in Populus trichocarpa. Genome Biol Evol. 2021; 13:evab198

[3]

Zhang Z, Chen Y, Zhang J. et al. Improved genome assembly provides new insights into genome evolution in a desert poplar (Populus euphratica). Mol Ecol Resour. 2020; 20:781-94

[4]

Li Y, Wang D, Wang W. et al. A chromosome-level Populus qiongdaoensis genome assembly provides insights into tropical adaptation and a cryptic turnover of sex determination. Mol Ecol. 2023; 32:1366-80

[5]

Bai S, Wu H, Zhang J. et al. Genome assembly of Salicaceae Populus deltoides (eastern cottonwood) I-69 based on Nanopore sequencing and hi-C technologies. J Hered. 2021; 112:303-10

[6]

Chen S, Yu Y, Wang X. et al. Chromosome-level genome assembly of a triploid poplar Populus alba ‘Berolinensis’. Mol Ecol Resour. 2023b; 23:1092-107

[7]

An X, Gao K, Chen Z. et al. High quality haplotype-resolved genome assemblies of Populus tomentosa Carr., a stabilized inter-specific hybrid species widespread in Asia. Mol Ecol Resour. 2022; 22:786-802

[8]

Lin YC, Wang J, Delhomme N. et al. Functional and evolutionary genomic inferences in Populus through genome and population sequencing of American and European aspen. Proc Natl Acad Sci USA. 2018; 115:10970-8

[9]

Shi TL, Jia KH, Bao YT. et al. High-quality genome assembly enables prediction of allele-specific gene expression in hybrid poplar. Plant Physiol. 2024a; 195:652-70

[10]

Shi T, Zhang X, Hou Y. et al. The super-pangenome of Populus unveils genomic facets for its adaptation and diversification in widespread forest trees. Mol Plant. 2024b; 17:725-46

[11]

Zhang B, Zhu W, Diao S. et al. The poplar pangenome provides insights into the evolutionary history of the genus. Commun Biol. 2019; 2:215

[12]

Mansfeld BN, Boyher A, Berry JC. et al. Large structural varia-tions in the haplotype-resolved African cassava genome. Plant J. 2021; 108:1830-48

[13]

Koren S, Rhie A, Walenz BP. et al. De novo assembly of haplotype-resolved genomes with trio binning. Nat Biotechnol. 2018; 36: 1174-82

[14]

Sun H, Jiao WB, Krause K. et al. Chromosome-scale and haplotype-resolved genome assembly of a tetraploid potato cul-tivar. Nat Genet. 2022; 54:342-8

[15]

Campoy JA, Sun H, Goel M. et al. Gamete binning: chromosome-level and haplotype-resolved genome assembly enabled by high-throughput single-cell sequencing of gamete genomes. Genome Biol. 2020; 21:306

[16]

Mao J, Wang Y, Wang B. et al. High-quality haplotype-resolved genome assembly of cultivated octoploid strawberry. Hortic Res. 2023; 10:uhad002

[17]

Zhou Q, Tang D, Huang W. et al. Haplotype-resolved genome analyses of a heterozygous diploid potato. Nat Genet. 2020; 52: 1018-23

[18]

Ranallo-Benavidez TR, Jaron KS, Schatz MC. GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes. Nat Commun. 2020; 11:1432

[19]

Cheng H, Concepcion GT, Feng X. et al. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods. 2021; 18:170-5

[20]

Zhou C, McCarthy SA, Durbin R. YaHS: yet another hi-C scaffold-ing tool. Bioinformatics. 2023; 39:btac808

[21]

Gurevich A, Saveliev V, Vyahhi N. et al. QUAST: quality assessment tool for genome assemblies. Bioinformatics. 2013; 29: 1072-5

[22]

Ou S, Su W, Liao Y. et al. Benchmarking transposable element annotation methods for creation of a streamlined, comprehen-sive pipeline. Genome Biol. 2019; 20:275

[23]

Gabriel L, Br ˚una T, Hoff KJ. et al. BRAKER3: fully automated genome annotation using RNA-seq and protein evidence with GeneMark-ETP, AUGUSTUS, and TSEBRA. Genome Res. 2024; 34: 769-77

[24]

He L, Jia KH, Zhang RG. et al. Chromosome-scale assembly of the genome of Salix dunnii reveals a male-heterogametic sex determination system on chromosome 7. Mol Ecol Resour. 2021; 21:1966-82

[25]

Chen JH, Huang Y, Brachi B. et al. Genome-wide analysis of cushion willow provides insights into alpine plant divergence in a biodiversity hotspot. Nat Commun. 2019; 10:5230

[26]

Dai X, Hu Q, Cai Q. et al. The willow genome and divergent evolution from poplar after the common genome duplication. Cell Res. 2014; 24:1274-7

[27]

Almeida P, Proux-Wera E, Churcher A. et al. Genome assembly of the basket willow, Salix viminalis, reveals earliest stages of sex chromosome expansion. BMC Biol. 2020; 18:78

[28]

Ma T, Wang J, Zhou G. et al. Genomic insights into salt adaptation in a desert poplar. Nat Commun. 2013; 4:2797

[29]

Chen Z, Ai F, Zhang J. et al. Survival in the tropics despite isolation, inbreeding and asexual reproduction: insights from the genome of the world’s southernmost poplar (Populus ilicifolia). Plant J. 2020; 103:430-42

[30]

Liu YJ, Wang XR, Zeng QY. De novo assembly of white poplar genome and genetic diversity of white poplar popula-tion in Irtysh River basin in China. Sci China Life Sci. 2019; 62: 609-18

[31]

Li C, Xing H, Li C. et al. Chromosome-scale genome assembly provides insights into the molecular mechanisms of tissue development of Populus wilsonii. Commun Biol. 2022; 5:1125

[32]

Chen Q, Yang C, Zhang G. et al. GCI: a continuity inspector for complete genome assembly. Bioinformatics. 2024; 40:btae633

[33]

Liu W, Liu C, Chen S. et al. A nearly gapless, highly contigu-ous reference genome for a doubled haploid line of Populus ussuriensis, enabling advanced genomic studies. For Res (Fayettev). 2024; 4:e019

[34]

Shi R, Wang JP, Lin YC. et al. Tissue and cell-type co-expression networks of transcription factors and wood component genes in Populus trichocarpa. Planta. 2017; 245:927-38

[35]

Marçais G, Kingsford C. A fast, lock-free approach for effi-cient parallel counting of occurrences of k-mers. Bioinformatics. 2011; 27:764-70

[36]

Vurture GW, Sedlazeck FJ, Nattestad M. et al. GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics. 2017; 33:2202-4

[37]

Durand NC, Robinson JT, Shamim MS. et al. Juicebox provides a visualization system for hi-C contact maps with unlimited zoom. Cell Syst. 2016; 3:99-101

[38]

Lin Y, Ye C, Li X. et al. quarTeT: a telomere-to-telomere toolkit for gap-free genome assembly and centromeric repeat identifi-cation. Hortic Res. 2023; 10:

[39]

Br ˚una T, Hoff KJ, Lomsadze A. et al. BRAKER2: automatic eukary-otic genome annotation with GeneMark-EP+ and AUGUSTUS supported by a protein database. NAR Genom Bioinform. 2021; 3:lqaa108

[40]

Hoff KJ, Lomsadze A, Borodovsky M. et al. Whole-genome anno-tation with BRAKER. Methods Mol Biol. 2019; 1962:65-95

[41]

Kuznetsov D, Tegenfeldt F, Manni M. et al. OrthoDB v11: annota-tion of orthologs in the widest sampling of organismal diversity. Nucleic Acids Res. 2023; 51:D445-51

[42]

Kim D, Paggi JM, Park C. et al. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat Biotechnol. 2019; 37:907-15

[43]

Flynn JM, Hubley R, Goubert C. et al. RepeatModeler2 for auto-mated genomic discovery of transposable element families. Proc Natl Acad Sci USA. 2020; 117:9451-7

[44]

Tarailo-Graovac M, Chen N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr Protoc Bioinfor-matics. 2009; 25:4.10.1-14

[45]

Storer J, Hubley R, Rosen J. et al. The Dfam community resource of transposable element families, sequence models, and genome annotations. Mob DNA. 2021; 12:2

[46]

Barnett DW, Garrison EK, Quinlan AR. et al. BamTools: a C++ API and toolkit for analyzing and managing BAM files. Bioinformatics. 2011; 27:1691-2

[47]

Br ˚una T, Lomsadze A, Borodovsky M. GeneMark-ETP signifi-cantly improves the accuracy of automatic annotation of large eukaryotic genomes. Genome Res. 2024; 34:757-68

[48]

Stanke M, Diekhans M, Baertsch R. et al. Using native and syn-tenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics. 2008; 24:637-44

[49]

Kovaka S, Zimin AV, Pertea GM. et al. Transcriptome assembly from long-read RNA-seq alignments with StringTie2. Genome Biol. 2019; 20:278

[50]

Pertea G, Pertea M. GFF utilities: GffRead and GffCompare. F1000Res. 2020; 9:304

[51]

Quinlan AR. BEDTools: the Swiss-Army tool for genome fea-ture analysis. Curr Protoc Bioinformatics 2014; 47:11.12.1-11.12.34, 11.1234.

[52]

Deng YY, Li JQ, Wu SF. et al.Integrated nr database in protein annotation system and its localization. Comput Eng. 2006; 32:71-2

[53]

Boeckmann B, Bairoch A, Apweiler R. et al. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 2003; 31:365-70

[54]

Ashburner M, Ball CA, Blake JA. et al. Gene ontology: tool for the unification of biology. The gene ontology consortium. Nat Genet. 2000; 25:25-9

[55]

Kanehisa M, Goto S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000; 28:27-30

[56]

Finn RD, Coggill P, Eberhardt RY. et al.The Pfam protein families database: towards a more sustainable future. Nucleic Acids Res. 2016; 44:D279-85

[57]

Buchfink B, Reuter K, Drost HG. Sensitive protein align-ments at tree-of-life scale using DIAMOND. Nat Methods. 2021; 18:366-8

[58]

Chen C, Wu Y, Li J. et al. TBtools-II: a "one for all, all for one" bioinformatics platform for biological big-data mining. Mol Plant. 2023a; 16:1733-42

[59]

Wang Y, Tang H, Debarry JD. et al. MCScanX: a toolkit for detec-tion and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012; 40:e49

[60]

Wu TD, Watanabe CK. GMAP: a genomic mapping and alignment program for mRNA and EST sequences. Bioinformatics. 2005; 21: 1859-75

[61]

Camacho C, Coulouris G, Avagyan V. et al. BLAST+: architecture and applications. BMC Biol. 2009; 10:421

[62]

Van der Auwera GA, Carneiro MO, Hartl C. et al. From FastQ data to high confidence variant calls: the genome analysis toolkit best practices pipeline. Curr Protoc Bioinformatics. 2013; 43: 11.10. 1-33

[63]

Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014; 15:550

[64]

Alexa A, Rahnenführer J, Lengauer T. Improved scoring of func-tional groups from gene expression data by decorrelating GO graph structure. Bioinformatics. 2006; 22:1600-7

[65]

Wu T, Hu E, Xu S. et al. clusterProfiler 4.0: a universal enrich-ment tool for interpreting omics data. Innovation (Camb). 2021; 2: 100141

[66]

Minh BQ, Schmidt HA, Chernomor O. et al. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol. 2020; 37:2461-1

[67]

Sanderson MJ. r8s: inferring absolute rates of molecular evolu-tion and divergence times in the absence of a molecular clock. Bioinformatics. 2003; 19:301-2

[68]

Mendes FK, Vanderpool D, Fulton B. et al. CAFE 5 models vari-ation in evolutionary rates among gene families. Bioinformatics. 2021; 36:5516-8

[69]

Nguyen LT, Schmidt HA, von Haeseler A. et al. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015; 32:268-74

[70]

Chen H, Zwaenepoel A. Inference of ancient polyploidy from genomic data. Methods Mol Biol. 2023; 2545:3-18

[71]

Tang H, Krishnakumar V, Zeng X. et al. JCVI: a versa-tile toolkit for comparative genomics analysis. iMeta. 2024; 3: e211

[72]

Goel M, Sun H, Jiao WB. et al. SyRI: finding genomic rear-rangements and local sequence differences from whole-genome assemblies. Genome Biol. 2019; 20:277

[73]

Cingolani P, Platts A, Wang le L. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin). 2012; 6:80-92

[74]

Lescot M, Déhais P, Thijs G. et al.PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in sil-ico analysis of promoter sequences. Nucleic Acids Res. 2002; 30: 325-7

[75]

Wang J, Chitsaz F, Derbyshire MK. et al.The conserved domain database in 2023. Nucleic Acids Res. 2023; 51:D384-8

PDF (1564KB)

1180

Accesses

0

Citation

Detail

Sections
Recommended

/