Haplotype-resolved genome of Agastache rugosa (Huo Xiang) provides insight into monoterpenoid biosynthesis and gene cluster evolution

Chanchan Liu , DiShuai Li , Jingjie Dang , Juan Shu , Samuel J. Smit , QiNan Wu , Benjamin R. Lichman

Horticulture Research ›› 2025, Vol. 12 ›› Issue (5) : 34

PDF (2437KB)
Horticulture Research ›› 2025, Vol. 12 ›› Issue (5) :34 DOI: 10.1093/hr/uhaf034
Article
research-article
Haplotype-resolved genome of Agastache rugosa (Huo Xiang) provides insight into monoterpenoid biosynthesis and gene cluster evolution
Author information +
History +
PDF (2437KB)

Abstract

Monoterpenoids are small volatile molecules produced by many plants that have applications in consumer products and healthcare. Plants from the mint family (Lamiaceae) are prodigious producers of monoterpenoids, including a chemotype of Agastache rugosa (Huo Xiang), which produces pulegone and isomenthone. We sequenced, assembled and annotated a haplotype-resolved chromosome-scale genome assembly of A. rugosa with a monoterpene chemotype. This genome assembly revealed that pulegone biosynthesis genes are in a biosynthetic gene cluster, which shares a common origin with the pulegone gene cluster in Schizonepeta tenuifolia. Using phylogenetics and synteny analysis, we describe how the clusters in these two species diverged through inversions and duplications. Using Hi-C analysis, we identified tentative evidence of contact between the pulegone gene cluster and an array of pulegone reductases, with both regions also enriched in retrotransposons. This genome and its analysis add valuable and novel insights to the organization and evolution of terpenoid biosynthesis in Lamiaceae.

Cite this article

Download citation ▾
Chanchan Liu, DiShuai Li, Jingjie Dang, Juan Shu, Samuel J. Smit, QiNan Wu, Benjamin R. Lichman. Haplotype-resolved genome of Agastache rugosa (Huo Xiang) provides insight into monoterpenoid biosynthesis and gene cluster evolution. Horticulture Research, 2025, 12(5): 34 DOI:10.1093/hr/uhaf034

登录浏览全文

4963

注册一个新账户 忘记密码

Acknowledgements

We acknowledge Hans-Wilhelm Nützmann and Daniel Doerr for advice on 3D genome analysis. We thank Xiao Han and Dajiang Tian from Nanjing University of Chinese Medicine with Python language and emotional support. This research was supported by the National Natural Science Foundation of China (grant nos. 82373978, 81973435), the National Natural Science Foundation for Young Scientists of China (grant no. 81903756), the Natural Science Foundation of Jiangsu Province (grant no. BK20231307), the Open Project of Chinese Materia Medica First-Class Discipline of Nanjing University of Chinese Medicine (grant no. ZYXYL2024-002) and High-Level University 2024 Key Cultivation Project -Research Leadership Program (grant no. RC202411). B.R.L. and S.J.S were funded by the BBSRC (BB/V006452/1). B.R.L acknowledges UKRI fellowship funding (MR/S01862X/1 and MR/X010260/1) and the Royal Society (IEC\NSFC\233491).

Author contributions

C.L. designed these experiments, was responsible for the genome analysis and wrote the paper. D.S.L. conducted the Hi-C data analysis. J.D. performed the enzyme assays. J.S. performed the cultivation and sampling of the specimens. S.J.S. aided in genome analysis. Q.W. supervised the project. B.R.L. analyzed data and wrote the paper.

Data availability

The genome sequence and raw sequencing reads are available at National Genomics Data Center (BioProject PRJCA027417). The raw reads are available in the Genome Sequence Archive (PacBio and Hi-C: CRA017422; RNA-seq: CRA017425). Gene sequences for A. rugosa ISPD and IPRs can be found in accession GB0004888.

Conflict of interest statement

The authors declare no conflict of interest.

Supplementary Data

Supplementary data are available at Horticulture Research online.

References

[1]

Melo N, Capek M, Arenas OM. et al. The irritant receptor TRPA1 mediates the mosquito repellent effect of catnip. Curr Biol. 2021;31:1988-1994.e5

[2]

Perri F, Coricello A, Adams JD. Monoterpenoids: the next frontier in the treatment of chronic pain. J.. 2020;3:195-214

[3]

Boachon B, Buell CR, Crisovan E. et al. Phylogenomic mining of the mints reveals multiple mechanisms contributing to the evolution of chemical diversity in Lamiaceae. Mol Plant. 2018;11: 1084-96

[4]

Vining KJ, Pandelova I, Lange I. et al. Chromosome-level genome assembly of Mentha longifolia L. reveals gene organization under-lying disease resistance and essential oil traits. G. (Bethesda). 2022;12:jkac112

[5]

Lichman BR, Godden GT, Hamilton JP. et al. The evolutionary origins of the cat attractant nepetalactone in catnip. Sci Adv. 2020;6:eaba0721

[6]

Lichman BR, Kamileen MO, Titchiner GR. et al. Uncoupled activa-tion and cyclization in catmint reductive terpenoid biosynthesis. Nat Chem Biol. 2019;15:71-9

[7]

National Toxicology Program. Toxicology and carcinogenesis studies of pulegone (CAS No. 89-82-7) in F344/N rats and B6C3F1 mice (gavage studies). Natl Toxicol Program Tech Rep Ser. 2011;563: 1-201

[8]

de Sousa DP, Nóbrega FFF, de Lima MRV. et al. Pharmacological activity of (R)-(+)-pulegone, a chemical constituent of essential oils. Z Naturforsch C. 2011;66:353-9

[9]

Ortiz de Urbina AV, Martin ML, Montero MJ. et al. Antihistaminic activity of pulegone on the Guinea-pig ileum. J Pharm Pharmacol. 1990;42:295-6

[10]

Whitehead JN, Leferink NGH, Johannissen LO. et al. Decoding catalysis by terpene synthases. ACS Catal. 2023;13:12774-802

[11]

Lichtenthaler HK, Rohmer M, Schwender J. Two independent bio-chemical pathways for isopentenyl diphosphate and isoprenoid biosynthesis in higher plants. Physiol Plant. 1997;101:643-52

[12]

Zheng X, Li P, Lu X. Research advances in cytochrome P450-catalysed pharmaceutical terpenoid biosynthesis in plants. JExp Bot. 2019;70:4619-30

[13]

Smit SJ, Lichman BR. Plant biosynthetic gene clusters in the context of metabolic evolution. Nat Prod Rep. 2022;39:1465-82

[14]

Polturak G, Liu Z, Osbourn A. New and emerging concepts in the evolution and function of plant biosynthetic gene clusters. Curr Opin Green Sustain Chem. 2022;33:100568

[15]

Guo L, Winzer T, Yang X. et al. The opium poppy genome and morphinan production. Science. 2018;362:343-7

[16]

Li Y, Leveau A, Zhao Q. et al. Subtelomeric assembly of a multi-gene pathway for antimicrobial defense compounds in cereals. Nat Commun. 2021;12:2563

[17]

Zhang J, Peters RJ. Why are momilactones always associated with biosynthetic gene clusters in plants. Proc Natl Acad Sci USA. 2020;117:13867-9

[18]

Cao P, Yang J, Xia L. et al. Two gene clusters and their pos-itive regulator SlMYB13 that have undergone domestication-associated negative selection control phenolamide accumu-lation and drought tolerance in tomato. Mol Plant. 2024;17: 579-97

[19]

Nützmann H-W, Doerr D, Ramírez-Colmenero A. et al. Active and repressed biosynthetic gene clusters have spatially distinct chromosome states. Proc Natl Acad Sci USA. 2020;117:13800-9

[20]

Li C, Wood JC, Vu AH. et al. Single-cell multi-omics in the medic-inal plant Catharanthus roseus. Nat Chem Biol. 2023;19:1031-41

[21]

Sun L, Cao Y, Li Z. et al. Conserved H3K27me3-associated chro-matin looping mediates physical interactions of gene clusters in plants. J Integr Plant Biol. 2023;65:1966-82

[22]

Bryson AE, Lanier ER, Lau KH. et al. Uncovering a miltiradiene biosynthetic gene cluster in the Lamiaceae reveals a dynamic evolutionary trajectory. Nat Commun. 2023;14:343

[23]

Li H, Wu S, Lin R. et al. The genomes of medicinal skullcaps reveal the polyphyletic origins of clerodane diterpene biosynthesis in the family Lamiaceae. Mol Plant. 2023;16:549-70

[24]

Liu C, Smit SJ, Dang J. et al. A chromosome-level genome assem-bly reveals that a bipartite gene cluster formed via an inverted duplication controls monoterpenoid biosynthesis in Schizonepeta tenuifolia. Mol Plant. 2023;16:533-48

[25]

Dang J, Lin G, Liu L. et al. Comparison of pulegone and estragole Chemotypes provides new insight into volatile oil biosynthesis of Agastache rugosa. Front Plant Sci. 2022;13:850130

[26]

Shu J, Jia CL, Dang JJ. et al. Accumulation and biosynthesis mech-anism of volatile oils in glandular scale of Agastache rugosa. Zhongguo Zhong Yao Za Zhi. 2024;49:62-9

[27]

Park H-S, Jo IH, Raveendar S. et al. A chromosome-level genome assembly of Korean mint (Agastache rugosa). Sci Data. 2023;10: 792

[28]

Maruyama T, Saeki D, Ito M. et al. Molecular cloning, functional expression and characterization of d-limonene synthase from Agastache rugosa. Biol Pharm Bull. 2002;25:661-5

[29]

Liao B, Shen X, Xiang L. et al. Allele-aware chromosome-level genome assembly of Artemisia annua reveals the correla-tion between ADS expansion and artemisinin yield. Mol Plant. 2022;15:1310-28

[30]

Shen Y, Li W, Zeng Y. et al. Chromosome-level and haplotype-resolved genome provides insight into the tetraploid hybrid origin of patchouli. Nat Commun. 2022;13:3511

[31]

Zhang Y, Shen Q, Leng L. et al. Incipient diploidization of the medicinal plant Perilla within 10,000 years. Nat Commun. 2021;12:5508

[32]

Gonda I, Faigenboim A, Adler C. et al. The genome sequence of tetraploid sweet basil, Ocimum basilicum L., provides tools for advanced genome editing and molecular breeding. DNA Res. 2020;27:dsaa27

[33]

Emms DM, Kelly S.OrthoFinder: phylogenetic orthology infer-ence for comparative genomics. Genome Biol. 2019;20:238

[34]

Li B, Cantino PD, Olmstead RG. et al. A large-scale chloroplast phylogeny of the Lamiaceae sheds new light on its subfamilial classification. Sci Rep. 2016;6:34343

[35]

Rose JP, Wiese J, Pauley N. et al. East Asian-north Ameri-can disjunctions and phylogenetic relationships within sub-tribe Nepetinae (Lamiaceae). Mol Phylogenet Evol. 2023;187: 107873

[36]

Chen Y-P, Turdimatovich TO, Nuraliev MS. et al. Phylogeny and biogeography of the northern temperate genus Dracocephalum s.l.(Lamiaceae). Cladistics. 2022;38:429-51

[37]

Lewinsohn E, Ziv-Raz I, Dudai N. et al. Biosynthesis of estragole and methyl-eugenol in sweet basil (Ocimum basilicum L). Developmental and chemotypic association of allylphenol O-methyltransferase activities. Plant Sci. 2000;160:27-35

[38]

Liu C, Gao Q, Shang Z. et al. Functional characterization and structural insights into stereoselectivity of pulegone reductase in menthol biosynthesis. Front Plant Sci. 2021;12: 780970

[39]

Méteignier L-V, Nützmann H-W, Papon N. et al. Emerging mech-anistic insights into the regulation of specialized metabolism in plants. Nat Plants. 2022;9:22-30

[40]

Liu Z, Cheema J, Vigouroux M. et al. Formation and diversification of a paradigm biosynthetic gene cluster in plants. Nat Commun. 2020;11:5354

[41]

Villoutreix R, Ayala D, Joron M. et al. Inversion breakpoints and the evolution of supergenes. Mol Ecol. 2021;30:2738-55

[42]

Schmidt C, Fransz P, Rönspies M. et al. Changing local recom-bination patterns in Arabidopsis by CRISPR/Cas mediated chro-mosome engineering. Nat Commun. 2020;11:4418

[43]

Sun L, Jing Y, Liu X. et al. Heat stress-induced transposon activa-tion correlates with 3D chromatin organization rearrangement in Arabidopsis. Nat Commun. 2020;11:1886

[44]

Cheng H, Concepcion GT, Feng X. et al. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat Methods. 2021;18:170-5

[45]

Durand NC, Shamim MS, Machol I. et al. Juicer provides a one-click system for analyzing loop-resolution hi-C experiments. Cell Syst. 2016;3:95-8

[46]

Dudchenko O, Batra SS, Omer AD. et al. De novo assembly of the Aedes aegypti genome using hi-C yields chromosome-length scaffolds. Science. 2017;356:92-5

[47]

Manni M, Berkeley MR, Seppey M. et al. BUSCO update: novel and streamlined workflows along with broader and deeper phyloge-netic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol Biol Evol. 2021;38:4647-54

[48]

Bao W, Kojima KK, Kohany O. Repbase update, a database of repetitive elements in eukaryotic genomes. Mob DNA. 2015;6:11

[49]

Tarailo-Graovac M, Chen N. Repbase update, a database of repetitive elements in eukaryotic genomes. Chapter 4:4.10.1-14

[50]

Ellinghaus D, Kurtz S, Willhoeft U. LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinformatics. 2008;9:18

[51]

Benson G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 1999;27:573-80

[52]

Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841-2

[53]

Wang L, Lee M, Sun F. et al. A chromosome-level genome assem-bly of chia provides insights into high omega-3 content and coat color variation of its seeds. Plant Commun. 2022;3:100326

[54]

Zhao D, Hamilton JP, Bhat WW. et al. A chromosomal-scale genome assembly of Tectona grandis reveals the importance of tandem gene duplication and enables discovery of genes in nat-ural product biosynthetic pathways. Gigascience. 2019;8:giz005

[55]

Keilwagen J, Hartung F, Grau J. GeMoMa: homology-based gene prediction utilizing intron position conservation and RNA-seq data. Methods Mol Biol. 2019;1962:161-77

[56]

Stanke M, Keller O, Gunduz I. et al. AUGUSTUS: ab initio predic-tion of alternative transcripts. Nucleic Acids Res. 2006;34:W435-9

[57]

Johnson AD, Handsaker RE, Pulit SL. et al. SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap. Bioinformatics. 2008;24:2938-9

[58]

Kim D, Langmead B, Salzberg SL. HISAT: a fast spliced aligner with low memory requirements. Nat Methods. 2015;12:357-60

[59]

Pertea M, Pertea GM, Antonescu CM. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat Biotechnol. 2015;33:290-5

[60]

Haas BJ, Salzberg SL, Zhu W. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the program to assemble spliced alignments. Genome Biol. 2008;9:R7

[61]

Buchfink B, Xie C, Huson DH. Fast and sensitive protein align-ment using DIAMOND. Nat Methods. 2015;12:59-60

[62]

Zdobnov EM, Apweiler R. InterProScan—an integration platform for the signature-recognition methods in InterPro. Bioinformatics. 2001;17:847-8

[63]

Boeckmann B, Bairoch A, Apweiler R. et al. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 2003;31:365-70

[64]

Kanehisa M, Goto S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28:27-30

[65]

Ashburner M, Ball CA, Blake JA. et al. Gene ontology: tool for the unification of biology. The gene ontology consortium. Nat Genet. 2000;25:25-9

[66]

Chen Y, Chen Y, Shi C. et al. SOAPnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. Giga-science. 2018;7:1-6

[67]

Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinfor-matics. 2011;12:323

[68]

Wang L, Feng Z, Wang X. et al. DEGseq: an R package for identifying differentially expressed genes from RNA-seq data. Bioinformatics. 2010;26:136-8

[69]

Wolff J, Rabbani L, Gilsbach R. et al. Galaxy HiCExplorer 3: a web server for reproducible hi-C, capture hi-C and single-cell hi-C data analysis, quality control and visualization. Nucleic Acids Res. 2020;48:W177-84

[70]

Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30:772-80

[71]

Minh BQ, Schmidt HA, Chernomor O. et al. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol. 2020;37:1530-4

[72]

Kalyaanamoorthy S, Minh BQ, Wong TKF. et al. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. 2017;14:587-9

[73]

Hoang DT, Chernomor O, von Haeseler A. et al. UFBoot2: improv-ing the ultrafast bootstrap approximation. MolBiolEvol. 2018;35: 518-22

[74]

Guindon S, Dufayard J-F, Lefort V. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010;59: 307-21

[75]

Letunic I, Bork P. Interactive tree of life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021;49:W293-6

[76]

Tang H, Krishnakumar V, Zeng X. et al. JCVI: a versa-tile toolkit for comparative genomics analysis. iMeta. 2024;3: e211

PDF (2437KB)

1039

Accesses

0

Citation

Detail

Sections
Recommended

/