Genome-wide terpene gene clusters analysis in Euphorbiaceae

Yinhang Wang , Yunxiao Zhao , Ming Gao , Yangdong Wang , Wei Li , Yicun Chen

Horticulture Research ›› 2025, Vol. 12 ›› Issue (7) : 97

PDF (1319KB)
Horticulture Research ›› 2025, Vol. 12 ›› Issue (7) :97 DOI: 10.1093/hr/uhaf097
Articles
research-article
Genome-wide terpene gene clusters analysis in Euphorbiaceae
Author information +
History +
PDF (1319KB)

Abstract

Euphorbiaceae species are renowned not only for horticultural significance but for their production of numerous bicyclic diterpenes with antitumor and antiviral activities. However, the gene clusters responsible for the biosynthesis of these terpenes remain largely unidentified. We here initiated the construction of a comprehensive procedure for terpene gene clusters in Euphorbiaceae species. A total of 1824 candidate gene clusters with the range of 30-800 kb were identified across seven representative species including Ricinus communis, Hevea brasiliensis, Euphorbia peplus, Jatropha curcas, Manihot esculenta, Vernicia montana, and Vernicia fordii in Euphorbiaceae. The 16 high-confidence terpene gene clusters were ultimately pinpointed in Euphorbiaceae after satisfied the three stringent screening criteria: TPS/CYP pairwise relationship, copathway and coexpression patterns. Notably, the well-known casbene and casbene-derived diterpenoid gene cluster, involved in the biosynthesis of casbene, neocembrene, ingenanes, and jatrophanes, were identified. It was observed that casbene gene clusters were universally presented in Euphorbiaceae species, except M. esculenta. Among the casbene gene cluster, the alcohol dehydrogenase (ADH) was initially appeared, and neocembrene synthase is exclusively present in R. communis while absent in all the other species. These findings represent a significant step toward understanding the genetic basis of terpene biosynthesis in Euphorbiaceae species. Moreover, this knowledge on gene clusters responsible for the biosynthesis of pharmacologically relevant terpenes can serve as a theoretical foundation for future applications.

Cite this article

Download citation ▾
Yinhang Wang, Yunxiao Zhao, Ming Gao, Yangdong Wang, Wei Li, Yicun Chen. Genome-wide terpene gene clusters analysis in Euphorbiaceae. Horticulture Research, 2025, 12(7): 97 DOI:10.1093/hr/uhaf097

登录浏览全文

4963

注册一个新账户 忘记密码

Acknowledgements

This work was supported by The National Nonprofit Institute Research Grant of CAFINT (No. CAFYBB2023PA005), the National Natural Science Foundation of China (31971685), and the Ten Thousand People Plan of Science and Technology Innovation Leading Talent of Zhejiang, China (No. 2022R52028) awarded to Y.C.

Author contributions

Y.W.: Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Data Curation, Writing—Original Draft, Visualization. M.G., L.W., and Y.Z.: Methodology, Investigation. Y.W. and W.L.: Writing—Review & Editing. Y.C.: Supervision, Project administration, Funding acquisition. Special thanks to PlantClusterFinder development team member bxuecarnegie for his troubleshooting help.

Data availability

All data are presented inside the manuscript and its supplementary data. The V. montana reference genome data is available on NCBI under project number: PRJNA1147434. The V. montana transcriptome data is available on NCBI under project number: PRJNA1146716.

Conflict of interest statement

The authors declare that they have no competing interests.

Supplementary Data

Supplementary data is available at Horticulture Research online.

References

[1]

Wang HB, Wang XY, Liu LP. et al. Tigliane diterpenoids from the Euphorbiaceae and Thymelaeaceae families. Chem Rev. 2015; 115:2975-3011

[2]

Jones CG, Martynowycz MW, Hattne J. et al. The CryoEM method MicroED as a powerful tool for small molecule structure deter-mination. ACS Central Science. 2018; 4:1587-92

[3]

Siller G, Rosen R, Freeman M. et al. PEP 005 (ingenol mebutate) gel for the topical treatment of superficial basal cell carcinoma: results of a randomized phase IIa trial. Australas J Dermatol. 2010; 51:99-105

[4]

Chae L, Kim T, Nilo-Poyanco R. et al. Genomic signatures of specialized metabolism in plants. Science. 2014; 344:510-3

[5]

Medema MH, Kottmann R, Yilmaz P. et al. Minimum information about a biosynthetic gene cluster. Nat Chem Biol. 2015; 11:625-31

[6]

Schläpfer P, Zhang P, Wang C. et al. Genome-wide prediction of metabolic enzymes, pathways, and gene clusters in plants. Plant Physiol. 2017; 173:2041-59

[7]

Kautsar SA, Suarez Duran HG, Blin K. et al. plantiSMASH: automated identification, annotation and expression analysis of plant biosynthetic gene clusters. Nucleic Acids Res. 2017;45: W55-63

[8]

Töpfer N, Fuchs LM, Aharoni A. The PhytoClust tool for metabolic gene clusters discovery in plant genomes. Nucleic Acids Res. 2017; 45:7049-63

[9]

Luo J. Metabolite-based genome-wide association studies in plants. Curr Opin Plant Biol. 2015; 24:31-8

[10]

Frey M, Schullehner K, Dick R. et al. Benzoxazinoid biosynthe-sis, a model for evolution of secondary metabolic pathways in plants. Phytochemistry. 2009; 70:1645-51

[11]

Field B, Osbourn AE. Metabolic diversification—independent assembly of operon-like gene clusters in different plants. Science. 2008; 320:543-7

[12]

Wurdack KJ, Hoffmann P, Chase MW. Molecular phylogenetic analysis of uniovulate Euphorbiaceae (Euphorbiaceae sensu stricto) using plastid rbcL and trnL-F DNA sequences. Am J Bot. 2005; 92:1397-420

[13]

Boycheva S, Daviet L, Wolfender JL. et al. The rise of operon-like gene clusters in plants. Trends Plant Sci. 2014; 19:447-59

[14]

Czechowski T, Forestier E, Swamidatta SH. et al. Gene discovery and virus-induced gene silencing reveal branched pathways to major classes of bioactive diterpenoids in Euphorbia peplus. Proc Natl Acad Sci. 2022; 119:e2203890119

[15]

King AJ, Brown GD, Gilday AD. et al. Production of bioactive diterpenoids in the Euphorbiaceae depends on evolutionarily conserved gene clusters. Plant Cell. 2014; 26:3286-98

[16]

Boutanaev AM, Moses T, Zi J. et al. Investigation of terpene diversification across multiple sequenced plant genomes. Proc Natl Acad Sci. 2015;112:E81-8

[17]

Tokuoka T, Tobe H. Phylogenetic analyses of Malpighiales using plastid and nuclear DNA sequences, with particular reference to the embryology of Euphorbiaceae sens. str. JPlant Res. 2006; 119: 599-616

[18]

Prochnik S, Marri PR, Desany B. et al. The cassava genome: current progress, future directions. Trop Plant Biol. 2012; 5:88-94

[19]

Strommer J. The plant ADH gene family. Plant J. 2011; 66:128-42

[20]

Zhao Y, Chen Y, Gao M. et al. Alcohol dehydrogenases regu-lated by a MYB44 transcription factor underlie Lauraceae citral biosynthesis. Plant Physiol. 2024; 194:1674-91

[21]

Karp PD, Midford PE, Billington R. et al. Pathway tools version 23.0 update: software for pathway/genome informatics and systems biology. Brief Bioinform. 2021; 22:109-26

[22]

Altschul SF, Gish W, Miller W. et al. Basic local alignment search tool. J Mol Biol. 1990; 215:403-10

[23]

Claudel-Renard C, Chevalet C, Faraut T. et al. Enzyme-specific profiles for genome annotation: PRIAM. Nucleic Acids Res. 2003; 31:6633-9

[24]

Caspi R, Altman T, Billington R. et al.The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases. Nucleic Acids Res. 2014;42:D459-71

[25]

Goodstein DM, Shu S, Howson R. et al. Phytozome: a com-parative platform for green plant genomics. Nucleic Acids Res. 2012;40:D1178-86

[26]

Harper L, Gardiner J, Andorf C. et al. MaizeGDB:the maize genetics and genomics database. Plant bioinformatics: Methods and protocols. 2016; 1374:187-202

[27]

Kersey PJ, Allen JE, Armean I. et al. Ensembl genomes 2016: more genomes, more complexity. Nucleic Acids Res. 2016;44:D574-80

[28]

Usadel B, Obayashi T, Mutwil M. et al. Co-expression tools for plant biology: opportunities for hypothesis generation and caveats. Plant Cell Environ. 2009; 32:1633-51

[29]

Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014; 30:2114-20

[30]

Pertea M, Kim D, Pertea GM. et al. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat Protoc. 2016; 11:1650-67

[31]

Liao Y, Smyth GK, Shi W. The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads. Nucleic Acids Res. 2019; 47:e47-7

[32]

Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC bioinformatics. 2008; 9:1-13

[33]

Wisecaver JH, Wisecaver JH, Borowsky AT. et al. A global coex-pression network approach for connecting genes to specialized metabolic pathways in plants. Plant Cell. 2017; 29:944-59

[34]

Zdobnov EM, Apweiler R. InterProScan-an integration platform for the signature-recognition methods in InterPro. Bioinformatics. 2001; 17:847-8

[35]

Chen C, Wu Y, Li J. et al. TBtools-II: a “one for all, all for one” bioinformatics platform for biological big-data mining. Mol Plant. 2023; 16:1733-42

PDF (1319KB)

280

Accesses

0

Citation

Detail

Sections
Recommended

/