Genome-wide association analysis identifies a candidate gene controlling seed size and yield in Xanthoceras sorbifolium Bunge

Ziquan Zhao , Chongjun Liang , Wei Zhang , Yingying Yang , Quanxin Bi , Haiyan Yu , Libing Wang

Horticulture Research ›› 2024, Vol. 11 ›› Issue (1) : 243

PDF
Horticulture Research ›› 2024, Vol. 11 ›› Issue (1) :243 DOI: 10.1093/hr/uhad243
ARTICLES
Genome-wide association analysis identifies a candidate gene controlling seed size and yield in Xanthoceras sorbifolium Bunge
Author information +
History +
PDF

Abstract

Yellow horn (Xanthoceras sorbifolium Bunge) is a woody oilseed tree species whose seed oil is rich in unsaturated fatty acids and rare neuronic acids, and can be used as a high-grade edible oil or as a feedstock for biodiesel production. However, the genetic mechanisms related to seed yield in yellow horn are not well elucidated. This study identified 2 164 863 SNP loci based on 222 genome-wide resequencing data of yellow horn germplasm. We conducted genome-wide association study (GWAS) analysis on three core traits (hundred-grain weight, single-fruit seed mass, and single-fruit seed number) that influence seed yield for the years 2022 and 2020, and identified 399 significant SNP loci. Among these loci, the Chr10_24013014 and Chr10_24012613 loci caught our attention due to their consistent associations across multiple analyses. Through Sanger sequencing, we validated the genotypes of these two loci across 16 germplasms, confirming their consistency with the GWAS analysis results. Downstream of these two significant loci, we identified a candidate gene encoding an AP2 transcription factor protein, which we named XsAP2. RT-qPCR analysis revealed high expression of the XsAP2 gene in seeds, and a significant negative correlation between its expression levels and seed hundred-grain weight, as well as single-fruit seed mass, suggesting its potential role in the normal seed development process. Transgenic Arabidopsis lines with the overexpressed XsAP2 gene exhibited varying degrees of reduction in seed size, number of seeds per silique, and number of siliques per plant compared with wild-type Arabidopsis. Combining these results, we hypothesize that the XsAP2 gene may have a negative regulatory effect on seed yield of yellow horn. These results provide a reference for the molecular breeding of high-yielding yellow horn.

Cite this article

Download citation ▾
Ziquan Zhao, Chongjun Liang, Wei Zhang, Yingying Yang, Quanxin Bi, Haiyan Yu, Libing Wang. Genome-wide association analysis identifies a candidate gene controlling seed size and yield in Xanthoceras sorbifolium Bunge. Horticulture Research, 2024, 11(1): 243 DOI:10.1093/hr/uhad243

登录浏览全文

4963

注册一个新账户 忘记密码

Acknowledgements

This work was financially supported by the Youth Top Talent Project of the Ten Thousand Talents Program of the State and Natural Science Foundation of China (31870594).

Author contributions

Z.Z., H.Y., and L.W. designed the research. Z.Z. performed data collection and analysis and the field experiments, and wrote the manuscript. C.L. and W.Z. provided experimental materials and supplemented the experimental protocol. Y.Y. and Z.Z. com-pleted the phenotyping. Q.B. provided resequencing data. H.Y. and L.W. helped to solve the experimental problems, reviewed the manuscript, and provided constructive comments.

Data availability

The data underlying this article are available in the NCBI database at https://www.ncbi.nlm.nih.gov/sra, and can be accessed with PRJNA1031336. The detailed run code and parameters for the GWAS analysis are available on GitHub (https://github.com/ziquanzhao/Python_GX/tree/master/Python_apply/EasyGWAS).

Conflict of interest

The authors declare no conflicts of interest.

Supplementary data

Supplementary data is available at Horticulture Research online.

References

[1]

Moles AT, Ackerly DD, Webb CO. et al. A brief history of seed size. Science. 2005;307:576-80

[2]

Alonso-Blanco C, Blankestijn-de Vries H, Hanhart CJ. et al. Nat-ural allelic variation at seed size loci in relation to other life history traits of Arabidopsis thaliana. Proc Natl Acad Sci USA. 1999;96:4710-7

[3]

Xum J, Zhang S. Mitogen-activated protein kinase cascades in signaling plant growth and development. Trends Plant Sci. 2015;20:56-64

[4]

Zhang M, Wu H, Su J. et al. Maternal control of embryogenesis by MPK6 and its upstream MKK4/MKK5 in Arabidopsis. Plant J. 2017;92:1005-19

[5]

Chakravorty D, Trusov Y, Zhang W. et al. An atypical het-erotrimeric G-protein γ -subunit is involved in guard cell K+-channel regulation and morphological development in Arabidop-sis thaliana. Plant J. 2011;67:840-51

[6]

Li S, Liu Y, Zheng L. et al. The plant-specific G protein γ subunit AGG3 influences organ size and shape in Arabidopsis thaliana. New Phytol. 2012;194:690-703

[7]

Li Y, Zheng L, Corke F. et al. Control of final seed and organ size by the DA1 gene family in Arabidopsis thaliana. Genes Dev. 2008;22: 1331-6

[8]

Xia T, Li N, Dumenil J. et al. The ubiquitin receptor DA1 interacts with the E3 ubiquitin ligase DA2 to regulate seed and organ size in Arabidopsis. Plant Cell. 2013;25:3347-59

[9]

Disch S, Anastasiou E, Sharma VK. et al. The E3 ubiquitin lig-ase BIG BROTHER controls Arabidopsis organ size in a dosage-dependent manner. Curr Biol. 2006;16:272-9

[10]

Jiang WB, Huang HY, Hu YW. et al. Brassinosteroid regulates seed size and shape in Arabidopsis. Plant Physiol. 2013;162:1965-77

[11]

Li S, Gao F, Xie K. et al. The OsmiR396c-OsGRF4-OsGIF1 regulatory module determines grain size and yield in rice. Plant Biotechnol J. 2016;14:2134-46

[12]

Ohto MA, Fischer RL, Goldberg RB. et al. Controlofseedmassby APETALA2. Proc Natl Acad Sci USA. 2005;102:3123-8

[13]

Jofuku KD, Omidyar PK, Gee Z. et al. Controlofseed massand seed yield by the floral homeotic gene APETALA2. Proc Natl Acad Sci USA. 2005;102:3117-22

[14]

Garcia D, Saingery V, Chambrier P. et al. Arabidopsis haiku mutants reveal new controls of seed size by endosperm. Plant Physiol. 2003;131:1661-70

[15]

Luo M, Dennis ES, Berger F. et al. MINISEED3 (MINI3), a WRKY family gene, and HAIKU2 (IKU2), a leucine-rich repeat (LRR) KINASE gene, are regulators of seed size in Arabidopsis. Proc Natl Acad Sci USA. 2005;102:17531-6

[16]

Yu HY, Fan SQ, Bi QX. et al. Seed morphology, oil content and fatty acid composition variability assessment in yellow horn (Xanthoceras sorbifolium Bunge) germplasm for optimum biodiesel production. Ind Crop Prod. 2017;97:425-30

[17]

Li J, Fu Y-J, Qu X-J. et al. Biodiesel production from yellow horn (Xanthoceras sorbifolia Bunge.) seed oil using ion exchange resin as heterogeneous catalyst. Bioresour Technol. 2012;108:112-8

[18]

Li J, Zu Y, Luo M. et al. Aqueous enzymatic process assisted by microwave extraction of oil from yellow horn (Xanthoceras sorbifolia Bunge.) seed kernels and its quality evaluation. Food Chem. 2013;138:2152-8

[19]

Venegas-Caleron M, Ruiz-Mendez MV, Martinez-Force E. et al. Characterization of Xanthoceras sorbifolium Bunge seeds: lipids, proteins and saponins content. IndCropProd. 2017;109:192-8

[20]

Ma Y, Bi Q, Li G. et al. Provenance variations in kernel oil content, fatty acid profile and biodiesel properties of Xanthoceras sorb-ifolium Bunge in northern China. Ind Crop. Prod. 2020;151:112487

[21]

Fei Z, Yan A, Hirst PM. et al. Suitable pollen source for the improvement of fruit and seed traits in Xanthoceras sorbifolium. Ind Crop. Prod. 2022;182:114858

[22]

Zhao Y, Bi Q-X, Ju J. et al. Geographic variation of seed and seedling growing traits in Xanthoceras sorbifolium. For Res. 2019;32: 160-8

[23]

Ou L, Zhang Y, Zhang Z. et al. The relationship between canopy microclimate, fruit and seed yield, and quality in Xanthoceras sorbifolium. J Plant Physiol. 2023;284:153975

[24]

Zhou QY, Cai Q. Role of ethylene in the regulatory mechanism underlying the abortion of ovules after fertilization in Xantho-ceras sorbifolium. Plant Mol Biol. 2021;106:67-84

[25]

Zhou Q, Cai Q. The superoxide dismutase genes might be required for appropriate development of the ovule after fer-tilization in Xanthoceras sorbifolium. Plant Cell Rep. 2018;37: 727-39

[26]

Wang L, Ruan CJ, Bao AM. et al. Small RNA profiling for identifi-cation of microRNAs involved in regulation of seed development and lipid biosynthesis in yellowhorn. BMC Plant Biol. 2021;21:464

[27]

Hirschhorn JN, Daly MJ. Genome-wide association studies for common diseases and complex traits. Nat Rev Genet. 2005;6: 95-108

[28]

Zhang K, He M, Fan Y. et al. Resequencing of global Tartary buck-wheat accessions reveals multiple domestication events and key loci associated with agronomic traits. Genome Biol. 2021;22:23

[29]

Wu J, Wang L, Fu J. et al. Resequencing of 683 common bean genotypes identifies yield component trait associations across a north-south cline. Nat Genet. 2020;52:118-25

[30]

Uffelmann E, Huang QQ, Munung NS. et al. Genome-wide asso-ciation studies. Nat Rev Methods Primers. 2021;1:59

[31]

Yoosefzadeh-Najafabadi M, Eskandari M, Belzile F. et al. Genome-wide association study statistical models:a review. In: Torkamaneh D, Belzile F, eds. Genome-Wide Association Studies. Springer: New York, 2022,43-62

[32]

Wen Y-J, Zhang H, Ni Y-L. et al. Methodological implementation of mixed linear models in multi-locus genome-wide association studies. Brief Bioinform. 2018;19:700-12

[33]

Thamilarasan SK, Park JI, Jung HJ. et al. Genome-wide analysis of the distribution of AP2/ERF transcription factors reveals duplica-tion and CBFs genes elucidate their potential function in Brassica oleracea. BMC Genomics. 2014;15:422

[34]

Zhu P, Chen Y, Zhang J. et al. Identification, classification, and characterization of AP2/ERF superfamily genes in Masson pine (Pinus massoniana Lamb.). Sci Rep. 2021;11:5441

[35]

Ohto MA, Floyd SK, Fischer RL. et al. Effects of APETALA2 on embryo, endosperm, and seed coat development determine seed size in Arabidopsis. Sex Plant Reprod. 2009;22:277-89

[36]

Kunst L, Klenz JE, Martinez-Zapater J. et al. AP2 gene determines the identity of perianth organs in flowers of Arabidopsis thaliana. Plant Cell. 1989;1:1195-208

[37]

Coen ES, Meyerowitz EM. The war of the whorls: genetic interac-tions controlling flower development. Nature. 1991;353:31-7

[38]

Chen S. Ultrafast one-pass FASTQ data preprocessing, quality control, and deduplication using fastp. iMeta. 2023;2:e107

[39]

Bi Q, Zhao Y, Du W. et al. Pseudomolecule-level assembly of the Chinese oil tree yellowhorn (Xanthoceras sorbifolium) genome. GigaScience. 2019;8:giz070

[40]

Liu H, Yan X-M, Wang X-r. et al. Centromere-specific retro-transposons and very-long-chain fatty acid biosynthesis in the genome of yellowhorn (Xanthoceras sorbifolium, Sapindaceae), an oil-producing tree with significant drought resistance. Front. Plant Sci. 2021;12:766389

[41]

Liang Q, Li H, Li S. et al. The genome assembly and annota-tion of yellowhorn (Xanthoceras sorbifolium Bunge). GigaScience. 2019;8:giz071

[42]

Liang Q, Liu JN, Fang H. et al. Genomic and transcriptomic analyses provide insights into valuable fatty acid biosynthesis and environmental adaptation of yellowhorn. Front. Plant Sci. 2022;13:991197

[43]

Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114-20

[44]

Li H, Durbin R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 2010;26:589-95

[45]

DePristo MA, Banks E, Poplin R. et al. A framework for variation discovery and genotyping using next-generation DNA sequenc-ing data. Nat Genet. 2011;43:491-8

[46]

Chang CC, Chow CC, Tellier LC. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience. 2015;4:4

[47]

Wang JB, Zhang ZW. GAPIT version 3: boosting power and accu-racy for genomic association and prediction. Genomics Proteomics Bioinformatics. 2021;19:629-40

[48]

Huang M, Liu XL, Zhou Y. et al. BLINK: a package for the next level of genome-wide association studies with both individuals and markers in the millions. GigaScience. 2019;8:giy154

[49]

Liu XL, Huang M, Fan B. et al. Iterative usage of fixed and random effect models for powerful and efficient genome-wide association studies. PLoS Genet. 2016;12:e1005957

[50]

Zhang Z, Ersoz E, Lai CQ. et al. Mixed linear model approach adapted for genome-wide association studies. Nat Genet. 2010;42:355-60

[51]

Yu J, Pressoir G, Briggs WH. et al. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat Genet. 2006;38:203-8

[52]

VanRaden PM. Efficient methods to compute genomic predic-tions. J Dairy Sci. 2008;91:4414-23

[53]

Zhang C, Dong SS, Xu JY. et al. PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics. 2019;35:1786-8

[54]

Tamura K, Stecher G, Kumar S. MEGA11: Molecular Evolutionary Genetics Analysis version 11. MolBiolEvol. 2021;38:3022-7

PDF

1284

Accesses

0

Citation

Detail

Sections
Recommended

/