Identification of Soybean E1-E4 Gene Orthologs in the Guar Genome Using Comprehensive Transcriptome Assembly and Annotation
Luisa Criollo Delgado , Aleksei Zamalutdinov , Elena Potokina
Frontiers in Bioscience-Scholar ›› 2025, Vol. 17 ›› Issue (2) : 26548
We publish the first available transcriptome assembly of guar (Cyamopsis tetragonoloba (L.) Taub.), a well-known source of guar gum (food additive E 412). At high latitudes, e.g., in Russia, the main challenge for guar cultivation is the long photoperiod during summer, which delays flowering and maturation of guar plants. Meanwhile, identifying of genes affecting the photoperiod sensitivity of guar would have a major impact on the development of marker-assisted breeding of this valuable food crop.
RNA isolated from leaves of early and late flowering guar plants grown under long-day conditions were used to generate de novo transcriptome assembly. A similarity search was conducted using BLASTN 2.2.31+ with default settings to identify homologous sequences of soybean maturity genes E1-E4 in guar transcriptome and genome assembly. Gene prediction tools such as AUGUSTUS and FGENESH+ were used to predict the exon-intron structure of the candidate genes. Functional annotation of the amino acid sequence was performed using InterProScan v. 5.68-100.
The transcriptome assembly contained sequences of 96,447 clustered transcript isoforms in the leaves of guar plants grown under long-day conditions. The transcriptome assembly was annotated using BLAST against the Glycine max genome, and 42,615 guar transcripts (44.2%) were found to be similar to soybean genes. We used the developed transcriptome assembly to discover orthologs of the E1-E4 soybean loci in the guar genome that have the greatest impact on the flowering and maturation of this closely related, short-day legume crop. A high level of identity was detected between peptide sequences encoding by orthologous genes E1 and CtE1 (80%), E2 and CtE2 (93%), E3 and CtE3 (83%), and E4 and CtE4 (91%). The sequences and the intron-exon structure of the genes in soybean and guar were similar, suggesting that the genetic pathways underlying basic flowering mechanisms are conserved between these two legume crops.
The revealed intron-exon structure of the guar genes CtE1-CtE4 creates possibilities for their targeted mutagenesis, e.g., using CRISPR-Cas and developing new guar germplasm with low sensitivity to photoperiod.
guar / transcriptome assembly / maturation loci / orthologs / exon-intron
| [1] |
Hasan AMA, Abdel-Raouf ME. Applications of guar gum and its derivatives in petroleum industry: A review. Egyptian Journal of Petroleum. 2018; 27: 1043–1050. https://doi.org/10.1016/j.ejpe.2018.03.005. |
| [2] |
Grigoreva E, Tkachenko A, Arkhimandritova S, Beatovic A, Ulianich P, Volkov V, et al. Identification of Key Metabolic Pathways and Biomarkers Underlying Flowering Time of Guar (Cyamopsis tetragonoloba (L.) Taub.) via Integrated Transcriptome-Metabolome Analysis. Genes. 2021; 12: 952. https://doi.org/10.3390/genes12070952. |
| [3] |
Liew LC, Singh MB, Bhalla PL. Unique and conserved features of floral evocation in legumes. Journal of Integrative Plant Biology. 2014; 56: 714–728. https://doi.org/10.1111/jipb.12187. |
| [4] |
Liu L, Song W, Wang L, Sun X, Qi Y, Wu T, et al. Allele combinations of maturity genes E1-E4 affect adaptation of soybean to diverse geographic regions and farming systems in China. PloS One. 2020; 15: e0235397. https://doi.org/10.1371/journal.pone.0235397. |
| [5] |
Bu T, Lu S, Wang K, Dong L, Li S, Xie Q, et al. A critical role of the soybean evening complex in the control of photoperiod sensitivity and adaptation. Proceedings of the National Academy of Sciences of the United States of America. 2021; 118: e2010241118. https://doi.org/10.1073/pnas.2010241118. |
| [6] |
Cao D, Takeshima R, Zhao C, Liu B, Jun A, Kong F. Molecular mechanisms of flowering under long days and stem growth habit in soybean. Journal of Experimental Botany. 2017; 68: 1873–1884. https://doi.org/10.1093/jxb/erw394. |
| [7] |
Han J, Guo B, Guo Y, Zhang B, Wang X, Qiu LJ. Creation of Early Flowering Germplasm of Soybean by CRISPR/Cas9 Technology. Frontiers in Plant Science. 2019; 10: 1446. https://doi.org/10.3389/fpls.2019.01446. |
| [8] |
Cai Y, Wang L, Chen L, Wu T, Liu L, Sun S, et al. Mutagenesis of GmFT2a and GmFT5a mediated by CRISPR/Cas9 contributes for expanding the regional adaptability of soybean. Plant Biotechnology Journal. 2020; 18: 298–309. https://doi.org/10.1111/pbi.13199. |
| [9] |
Xu M, Yamagishi N, Zhao C, Takeshima R, Kasai M, Watanabe S, et al. The Soybean-Specific Maturity Gene E1 Family of Floral Repressors Controls Night-Break Responses through Down-Regulation of FLOWERING LOCUS T Orthologs. Plant Physiology. 2015; 168: 1735–1746. https://doi.org/10.1104/pp.15.00763. |
| [10] |
Watanabe S, Xia Z, Hideshima R, Tsubokura Y, Sato S, Yamanaka N, et al. A map-based cloning strategy employing a residual heterozygous line reveals that the GIGANTEA gene is involved in soybean maturity and flowering. Genetics. 2011; 188: 395–407. https://doi.org/10.1534/genetics.110.125062. |
| [11] |
Gao Y, Zhang Y, Ma C, Chen Y, Liu C, Wang Y, et al. Editing the nuclear localization signals of E1 and E1Lb enables the production of tropical soybean in temperate growing regions. Plant Biotechnology Journal. 2024; 22: 2145–2156. https://doi.org/10.1111/pbi.14335. |
| [12] |
Grigoreva E, Ulianich P, Ben C, Gentzbittel L, Potokina E. First Insights into the Guar (Cyamopsis tetragonoloba (L.) Taub.) Genome of the ‘Vavilovskij 130’ Accession, Using Second and Third-Generation Sequencing Technologies. Russian Journal of Genetics. 2019; 55: 1406–1416. https://doi.org/10.1134/S102279541911005X. |
| [13] |
Li W, Godzik A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics. 2006; 22: 1658–1659. https://doi.org/10.1093/bioinformatics/btl158. |
| [14] |
Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, et al. BLAST+: architecture and applications. BMC Bioinformatics. 2009; 10: 421. https://doi.org/10.1186/1471-2105-10-421. |
| [15] |
Buchfink B, Reuter K, Drost HG. Sensitive protein alignments at tree-of-life scale using DIAMOND. Nature Methods. 2021; 18: 366–368. https://doi.org/10.1038/s41592-021-01101-x. |
| [16] |
Valliyodan B, Cannon SB, Bayer PE, Shu S, Brown AV, Ren L, et al. Construction and comparison of three reference-quality genome assemblies for soybean. The Plant Journal: for Cell and Molecular Biology. 2019; 100: 1066–1082. https://doi.org/10.1111/tpj.14500. |
| [17] |
Brown AV, Conners SI, Huang W, Wilkey AP, Grant D, Weeks NT, et al. A new decade and new data at SoyBase, the USDA-ARS soybean genetics and genomics database. Nucleic Acids Research. 2021; 49: D1496–D1501. https://doi.org/10.1093/nar/gkaa1107. |
| [18] |
Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics. 2011; 12: 323. https://doi.org/10.1186/1471-2105-12-323. |
| [19] |
Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010; 26: 139–140. https://doi.org/10.1093/bioinformatics/btp616. |
| [20] |
Chen Y, McCarthy D, Baldoni P, Robinson M, Smyth G. edgeR: differential analysis of sequence read count data User’s Guide. 2024. Available at: https://bioconductor.org/packages/release/bioc/html/edgeR.html (Accessed: 6 June 2024). |
| [21] |
Stanke M, Diekhans M, Baertsch R, Haussler D. Using native and syntenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics. 2008; 24: 637–644. https://doi.org/10.1093/bioinformatics/btn013. |
| [22] |
Solovyev V. Statistical approaches in Eukaryotic gene prediction. Handbook of Statistical genetics (p. 1616). 3rd edn. Wiley-Interscience: Hoboken, New Jersey. 2007. |
| [23] |
FGENESH+ - HMM plus similar protein-based gene prediction. Available at: http://www.softberry.com/berry.phtml?topic=fgenes_plus&group=programs&subgroup=gfs (Accessed: 2 August 2024). |
| [24] |
Hu B, Jin J, Guo AY, Zhang H, Luo J, Gao G. GSDS 2.0: an upgraded gene feature visualization server. Bioinformatics. 2015; 31: 1296–1297. https://doi.org/10.1093/bioinformatics/btu817. |
| [25] |
Jones P, Binns D, Chang HY, Fraser M, Li W, McAnulla C, et al. InterProScan 5: genome-scale protein function classification. Bioinformatics. 2014; 30: 1236–1240. https://doi.org/10.1093/bioinformatics/btu031. |
| [26] |
Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, Fazo J, et al. Phytozome: a comparative platform for green plant genomics. Nucleic Acids Research. 2012; 40: D1178–86. https://doi.org/10.1093/nar/gkr944. |
| [27] |
Sayers EW, Bolton EE, Brister JR, Canese K, Chan J, Comeau DC, et al. Database resources of the national center for biotechnology information. Nucleic Acids Research. 2022; 50: D20–D26. https://doi.org/10.1093/nar/gkab1112. |
| [28] |
Sakai H, Naito K, Takahashi Y, Sato T, Yamamoto T, Muto I, et al. The Vigna Genome Server, ‘VigGS’: A Genomic Knowledge Base of the Genus Vigna Based on High-Quality, Annotated Genome Sequence of the Azuki Bean, Vigna angularis (Willd.) Ohwi & Ohashi. Plant & Cell Physiology. 2016; 57: e2. https://doi.org/10.1093/pcp/pcv189. |
| [29] |
Zhai H, Lü S, Liang S, Wu H, Zhang X, Liu B, et al. GmFT4, a homolog of FLOWERING LOCUS T, is positively regulated by E1 and functions as a flowering repressor in soybean. PloS One. 2014; 9: e89030. https://doi.org/10.1371/journal.pone.0089030. |
| [30] |
Cai Y, Chen L, Liu X, Guo C, Sun S, Wu C, et al. CRISPR/Cas9-mediated targeted mutagenesis of GmFT2a delays flowering time in soya bean. Plant Biotechnology Journal. 2018; 16: 176–185. https://doi.org/10.1111/pbi.12758. |
| [31] |
Zhang X, Zhai H, Wang Y, Tian X, Zhang Y, Wu H, et al. Functional conservation and diversification of the soybean maturity gene E1 and its homologs in legumes. Scientific Reports. 2016; 6: 29548. https://doi.org/10.1038/srep29548. |
| [32] |
Imoto Y, Yoshikawa S, Horiuchi Y, Iida T, Oka T, Matsuda S, et al. Flowering Date1, a major photoperiod sensitivity gene in adzuki bean, is a soybean floral repressor E1 ortholog. Breeding Science. 2022; 72: 132–140. https://doi.org/10.1270/jsbbs.21051. |
| [33] |
Tanwar UK, Pruthi V, Randhawa GS. RNA-Seq of Guar (Cyamopsis tetragonoloba, L. Taub.) Leaves: De novo Transcriptome Assembly, Functional Annotation and Development of Genomic Resources. Frontiers in Plant Science. 2017; 8: 91. https://doi.org/10.3389/fpls.2017.00091. |
| [34] |
Rawal HC, Kumar S, Mithra S V A, Solanke AU, Nigam D, Saxena S, et al. High Quality Unigenes and Microsatellite Markers from Tissue Specific Transcriptome and Development of a Database in Clusterbean (Cyamopsis tetragonoloba, L. Taub). Genes. 2017; 8: 313. https://doi.org/10.3390/genes8110313. |
| [35] |
Chaudhury A, Kaila T, Gaikwad K. Elucidation of Galactomannan Biosynthesis Pathway Genes through Transcriptome Sequencing of Seeds Collected at Different Developmental Stages of Commercially Important Indian Varieties of Cluster Bean (Cyamopsis tetragonoloba L.). Scientific Reports. 2019; 9: 11539. https://doi.org/10.1038/s41598-019-48072-w. |
| [36] |
Al-Qurainy F, Alshameri A, Gaafar AR, Khan S, Nadeem M, Alameri AA, et al. Comprehensive Stress-Based De Novo Transcriptome Assembly and Annotation of Guar (Cyamopsis tetragonoloba (L.) Taub.): An Important Industrial and Forage Crop. International Journal of Genomics. 2019; 2019: 7295859. https://doi.org/10.1155/2019/7295859. |
| [37] |
Acharya BR, Sandhu D, Dueñas C, Ferreira JFS, Grover KK. Deciphering Molecular Mechanisms Involved in Salinity Tolerance in Guar (Cyamopsis tetragonoloba (L.) Taub.) Using Transcriptome Analyses. Plants. 2022; 11: 291. https://doi.org/10.3390/plants11030291. |
| [38] |
Li JH, Li MJ, Li WL, Li XY, Ma YB, Tan X, et al. Leguminous industrial crop guar (Cyamopsis tetragonoloba): The chromosome-level reference genome de novo assembly. Industrial Crops and Products. 2024; 216: 118748. https://doi.org/10.1016/j.indcrop.2024.118748. |
| [39] |
Hecht V, Foucher F, Ferrándiz C, Macknight R, Navarro C, Morin J, et al. Conservation of Arabidopsis flowering genes in model legumes. Plant Physiology. 2005; 137: 1420–1434. https://doi.org/10.1104/pp.104.057018. |
| [40] |
Zhao F, Lyu X, Ji R, Liu J, Zhao T, Li H, et al. CRISPR/Cas9-engineered mutation to identify the roles of phytochromes in regulating photomorphogenesis and flowering time in soybean. The Crop Journal. 2022; 10: 1654–1664. https://doi.org/10.1016/j.cj.2022.03.008. |
| [41] |
Li JH, Hou XY, Li CX, Li XY, Chen Y, Li WL, et al. Photoperiod effects on guar (Cyamopsis tetragonoloba L.). Industrial Crops and Products. 2023; 203: 117148. https://doi.org/10.1016/j.indcrop.2023.117148. |
| [42] |
Li F, Zhang X, Hu R, Wu F, Ma J, Meng Y, et al. Identification and molecular characterization of FKF1 and GI homologous genes in soybean. PLoS ONE. 2013; 8: e79036. https://doi.org/10.1371/journal.pone.0079036. |
| [43] |
Zheng JT, Lin CX, Fang ZY, Li HD. Intron Retention as a Mode for RNA-Seq Data Analysis. Frontiers in Genetics. 2020; 11: 586. https://doi.org/10.3389/fgene.2020.00586. |
| [44] |
Liu B, Kanazawa A, Matsumura H, Takahashi R, Harada K, Abe J. Genetic redundancy in soybean photoresponses associated with duplication of the phytochrome A gene. Genetics. 2008; 180: 995–1007. https://doi.org/10.1534/genetics.108.092742. |
| [45] |
Watanabe S, Hideshima R, Xia Z, Tsubokura Y, Sato S, Nakamoto Y, et al. Map-based cloning of the gene associated with the soybean maturity locus E3. Genetics. 2009; 182: 1251–1262. https://doi.org/10.1534/genetics.108.098772. |
| [46] |
Rajaprakasam S, Rahman H, Karunagaran S, Bapu K, J R, Kulandivelu G, et al. Comparative transcriptome and metabolome profiling in the maturing seeds of contrasting cluster bean (Cyamopsis tetragonoloba L. Taub) cultivars identified key molecular variations leading to increased gum accumulation. Gene. 2021; 791: 145727. https://doi.org/10.1016/j.gene.2021.145727. |
| [47] |
Mahla HR, Rani R, Choudhary KB, Rajput LS, Sharma R. Genomics-Aided Breeding Strategies for Biotic Stress in Cluster Bean. In Parihar AK, Bohra A, Lamichaney A, Mishra RK, Varshney RK (eds.) Genomics-aided Breeding Strategies for Biotic Stress in Grain Legumes (pp. 495–518). Springer Nature Singapore: Singapore. 2024. |
| [48] |
Priti, Panchta R, Yadav RC, Pahuja SK, Yadav NR. Linkage mapping and identification of quantitative trait loci associated with bacterial leaf blight resistance and gum content in cluster bean using the interspecific population (HG563 × Cyamopsis serrata). Genetic Resources and Crop Evolution. 2023; 70: 1501–1516. https://doi.org/10.1007/s10722-022-01517-0. |
| [49] |
Grigoreva E, Barbitoff Y, Changalidi A, Karzhaev D, Volkov V, Shadrina V, et al. Development of SNP Set for the Marker-Assisted Selection of Guar (Cyamopsis tetragonoloba (L.) Taub.) Based on a Custom Reference Genome Assembly. Plants. 2021; 10: 2063. https://doi.org/10.3390/plants10102063. |
| [50] |
Wang L, Li H, He M, Dong L, Huang Z, Chen L, et al. GIGANTEA orthologs, E2 members, redundantly determine photoperiodic flowering and yield in soybean. Journal of Integrative Plant Biology. 2023; 65: 188–202. https://doi.org/10.1111/jipb.13398. |
Russian Science Foundation(24-26-00073)
/
| 〈 |
|
〉 |