The use of web resources for metabolomics in horticultural crops
Esra Karakas , Mustafa Bulut , Alisdair R. Fernie
Horticulture Advances ›› 2025, Vol. 3 ›› Issue (1) : 18
Metabolomics, a rapidly evolving field, has revolutionized horticultural crop research by enabling comprehensive analysis of metabolites that influence plant yield, growth, quality and nutritional value. The integration of web-based resources, including databases, computational tools and analytical platforms has significantly enhanced metabolomics studies by facilitating data processing, metabolite identification and pathway analysis. Moreover, the application of machine learning algorithms to these web resources has further optimized data interpretation, enabling more accurate prediction of metabolic profiles. Publicly available reference libraries and bioinformatic tools support precision of breeding, postharvest quality assessment and ultimately improving crop yield and sustainability. In this mini-review, we explore the current status of the diverse range of plant metabolomics databases in horticultural crops, highlighting the synergy between machine learning and traditional bioinformatics methods, their applications, challenges and future prospects in advancing plant science and agricultural innovation.
Metabolite databases / Bioinformatic tools / Plant metabolomics and horticultural crops
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
Ara T, Sakurai N, Takahashi S, Waki N, Suganuma H, Aizawa K, et al. TOMATOMET: A metabolome database consists of 7118 accurate mass values detected in mature fruits of 25 tomato cultivars. 2021;5:e00318. https://doi.org/10.1002/pld3.318 . |
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
Bueschl C, Kluger B, Neumann NKN, Doppler M, Maschietto V, Thallinger GG, et al. MetExtract II: a software suite for stable isotope-assisted untargeted metabolomics. Anal Chem. 2017;89:9518–26. https://doi.org/10.1021/acs.analchem.7b02518 . |
| [9] |
Caspi R, Billington R, Keseler IM, Kothari A, Krummenacker M, Midford PE, et al. The MetaCyc database of metabolic pathways and enzymes - a 2019 update. Nucleic Acids Res. 2020;48:D445–53. https://doi.org/10.1093/nar/gkz862 . |
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
Davey MP, Burrell MM, Woodward FI, Quick WP. Population-specific metabolic phenotypes of Arabidopsis lyrata ssp. petraea. New Phytol. 2008;177:380–8. https://doi.org/10.1111/j.1469-8137.2007.02282.x |
| [14] |
|
| [15] |
|
| [16] |
van Dijk ADJ, Kootstra G, Kruijer W, de Ridder D. Machine learning in plant science and plant breeding. iScience, 2020;24:101890.. https://doi.org/10.1016/j.isci.2020.101890 |
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
Fang L, Liu T, Li M, Dong X, Han Y, Xu C, et al. MODMS: a multi-omics database for facilitating biological studies on alfalfa (Medicago sativa L.). Hortic Res. 2023;11:uhad245. https://doi.org/10.1093/hr/uhad245 . |
| [23] |
|
| [24] |
|
| [25] |
|
| [26] |
Fiehn O, Kopka J, Dörmann P, Altmann T, Trethewey RN, Willmitzer L.. Metabolite profiling for plant functional genomics. Nat Biotechnol. 2000;18:1157–61.https://doi.org/10.1038/81137 . |
| [27] |
|
| [28] |
|
| [29] |
|
| [30] |
|
| [31] |
|
| [32] |
|
| [33] |
|
| [34] |
|
| [35] |
|
| [36] |
|
| [37] |
|
| [38] |
|
| [39] |
|
| [40] |
|
| [41] |
Kuhl C, Tautenhahn R, Böttcher C, Larson TR, Neumann S. CAMERA: an integrated strategy for compound spectra extraction and annotation of liquid chromatography/mass spectrometry data sets. 2012;84:283–9. https://doi.org/10.1021/ac202450g . |
| [42] |
|
| [43] |
|
| [44] |
Li D, Ma B, Xu X, Chen G, Li T, He N. MMHub, a database for the mulberry metabolome. Database (Oxford). 2020;2020:baaa011. https://doi.org/10.1093/database/baaa011 . |
| [45] |
Lin WJ, Shen PC, Liu HC, Cho YC, Hsu MK, Lin IC, et al. LipidSig: a web-based tool for lipidomic data analysis.Nucleic Acids Res. 2021;49:W336–45. https://doi.org/10.1093/nar/gkab419 |
| [46] |
|
| [47] |
Liu T, Salguero P, Petek M, Martinez-Mira C, Balzano-Nogueira L, Ramšak Ž, et al. PaintOmics 4: new tools for the integrative analysis of multi-omics datasets supported by multiple pathway databases. 2022;50:W551–9. https://doi.org/10.1093/nar/gkac352 . |
| [48] |
Liu CH, Shen PC, Lin WJ, Liu HC, Tsai MH, Huang TY, et al. LipidSig 2.0: integrating lipid characteristic insights into advanced lipidomics data analysis. Nucleic Acids Res. 2024;52:W390–7. https://doi.org/10.1093/nar/gkae335 . |
| [49] |
|
| [50] |
|
| [51] |
|
| [52] |
Mason AR, Johnson G Jr, Krampen J, Nguyen JNT, Balunas MJ, Schloss PD.mpactR: an R adaptation of the metabolomics peak analysis computational tool (MPACT) for use in reproducible data analysis pipelines. Microbiol Resour Announc. 202511;14:e0099724. https://doi.org/10.1128/mra.00997-24 . |
| [53] |
|
| [54] |
|
| [55] |
Miao BB, Dong W, Gu YX, Han ZF, Luo X, Ke CH, et al. OmicsSuite: a customized and pipelined suite for analysis and visualization of multi-omics big data. Hortic Res. 2023;10:uhad195. https://doi.org/10.1093/hr/uhad195 . |
| [56] |
|
| [57] |
Montenegro-Burke JR, Guijas C, Siuzdak G. METLIN: a tandem mass spectral library of standards. In Li S, editors.Computational Methods and Data Analysis for Metabolomics .New York:Springer;2020.p.149–63. https://doi.org/10.1007/978-1-0716-0239-3_9 |
| [58] |
|
| [59] |
|
| [60] |
Perez de Souza L, Fernie AR. Computational methods for processing and interpreting mass spectrometry-based metabolomics. Essays Biochem. 2024;68:5–13. https://doi.org/10.1042/EBC20230019 |
| [61] |
Perez de Souza L, Naake T, Tohge T, Fernie AR. From chromatogram to analyte to metabolite. How to pick horses for courses from the massive web resources for mass spectral plant metabolomics. Gigascience. 2017;6:1–20. https://doi.org/10.1093/gigascience/gix037 |
| [62] |
|
| [63] |
|
| [64] |
|
| [65] |
|
| [66] |
|
| [67] |
Sauter H, Lauer M, Fritsch H. Metabolic profiling of plants. In: Baker DR, Fenyes JG, Moberg WK, editors. Synthesis and Chemistry of Agrochemicals II. American Chemical Society;1991.p.288–99https://doi.org/10.1021/bk-1991-0443.ch024 |
| [68] |
|
| [69] |
|
| [70] |
|
| [71] |
|
| [72] |
|
| [73] |
|
| [74] |
Smith CA, Want EJ, O'Maille G, Abagyan R, Siuzdak G. XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification.Anal Chem. 2006;78:779–87. https://doi.org/10.1021/ac051437y |
| [75] |
|
| [76] |
|
| [77] |
|
| [78] |
|
| [79] |
|
| [80] |
|
| [81] |
|
| [82] |
Tian Z, Hu X, Xu Y, Liu M, Liu H, Li D, et al. PMhub 1.0: a comprehensive plant metabolome database. Nucleic Acids Res. 2024;52:D1579–87. https://doi.org/10.1093/nar/gkad811 . |
| [83] |
|
| [84] |
|
| [85] |
|
| [86] |
|
| [87] |
|
| [88] |
|
| [89] |
|
| [90] |
|
| [91] |
|
| [92] |
|
| [93] |
|
| [94] |
Wishart DS, Kruger R, Sivakumaran A, Harford K, Sanford S, Doshi R, et al. PathBank 2.0—the pathway database for model organism metabolomics. Nucleic Acids Res. 2024;52:D654–62.. https://doi.org/10.1093/nar/gkad1041 . |
| [95] |
Wishart DS, Tzur D, Knox C, Eisner R, Guo A, Young N, et al. HMDB: the Human Metabolome Database. Nucleic Acids Res. 2007;35:D521-D26. https://doi.org/10.1093/nar/gkl923. |
| [96] |
|
| [97] |
Xue J, Wang B, Ji H, Li W. RT-Transformer: retention time prediction for metabolite annotation to assist in metabolite identification. Bioinformatics. 2024;40:btae084. https://doi.org/10.1093/bioinformatics/btae084 . |
| [98] |
|
| [99] |
|
| [100] |
|
| [101] |
|
| [102] |
|
The Author(s)
/
| 〈 |
|
〉 |