Optimization and application of genome prediction model in rapeseed: flowering time, yield components, and oil content as examples

Wenkai Yu , Xinao Wang , Hui Wang , Wenxiang Wang , Hongtao Cheng , Desheng Mei , Lixi Jiang , Qiong Hu , Jia Liu

Horticulture Research ›› 2025, Vol. 12 ›› Issue (8) : 115

PDF (5664KB)
Horticulture Research ›› 2025, Vol. 12 ›› Issue (8) :115 DOI: 10.1093/hr/uhaf115
Articles
research-article
Optimization and application of genome prediction model in rapeseed: flowering time, yield components, and oil content as examples
Author information +
History +
PDF (5664KB)

Abstract

Rapeseed is the second largest oilseed crop in the world with short domestication and breeding history. This study developed a batch of genomic prediction models for flowering time (FT), oil content, and yield components in rapeseed. Using worldwide 404 breeding lines, the optimal prediction model for FT and five quality and yield traits was established by comparison with efficient traditional models and machine learning (ML) models. The results indicate that quantitative trait loci (QTLs) and significant variations identified by genome-wide association study (GWAS) can significantly improve the prediction accuracy of complex traits, achieving over 90% accuracy in predicting FT and thousand grain weight. The Genomic Best Linear Unbiased Prediction (GBLUP) and Bayes-Lasso models provided the most accurate prediction overall, while ML models such as GBDT (Gradient-Boosting Decision Trees) exhibited strong predictive performance. Our study provides genome selection solution for the high prediction accuracy and selection of complex traits in rapeseed breeding. The use of a diverse panel of 404 worldwide lines ensures that the findings are broadly applicable across different rapeseed breeding programs.

Cite this article

Download citation ▾
Wenkai Yu, Xinao Wang, Hui Wang, Wenxiang Wang, Hongtao Cheng, Desheng Mei, Lixi Jiang, Qiong Hu, Jia Liu. Optimization and application of genome prediction model in rapeseed: flowering time, yield components, and oil content as examples. Horticulture Research, 2025, 12(8): 115 DOI:10.1093/hr/uhaf115

登录浏览全文

4963

注册一个新账户 忘记密码

Acknowledgements

This research was supported by the National Key Research and Development Program of China (2023YFD120140203), the National Natural Science Foundation of China (U19A2029), the earmarked fund for CARS (CARS-12); the Science and Technology Innovation Project of the Chinese Academy of Agricultural Sciences and Shannan City Municipal Science and Technology Program Projects (SNSBJKJJHXM2023001) are also acknowledged.

Authors contributions

J.L. and Q.H. designed and supervised the study. W.Y., H.W., W.W., and H.C. collected the data. W.Y. and X.W. analyzed the data and finished the first manuscript. D.M., Q.H., and L.J. provided the germplasms. J.L., Q.H., and L.J. modified and reviewed the manuscript. All authors have read and agreed on the final version of the manuscript.

Data availability

The raw data of genome sequence could be publicly downloaded on the GSA website (https://ngdc.cncb.ac.cn/gsa/), project ID CRA013310, and on NCBI (https://www.ncbi.nlm.nih.gov/) ID PRJNA476657.

Conflict of interest statement: The authors declare no conflict of interest.

Supplementary data

Supplementary data is available at Horticulture Research online.

References

[1]

Zandberg JD, Fernandez CT, Danilevicz MF. et al. The global assessment of oilseed Brassica crop species yield, yield stability and the underlying genetics. Plan Theory. 2022; 11:2740

[2]

Nagaharu U, Nagaharu N. Genome analysis in Brassica with special reference to the experimental formation of B. napus and peculiar mode of fertilization. Jpn J Bot. 1935; 7:389-452

[3]

Chalhoub B, Denoeud F, Liu S. et al. Early allopolyploid evolu-tion in the post-Neolithic Brassica napus oilseed genome. Science. 2014; 345:950-3

[4]

Wang Q, Yan T, Long Z. et al. Prediction of heterosis in the recent rapeseed (Brassica napus) polyploid by pairing parental nucleotide sequences. PLoS Genet. 2010; 121:1289-301

[5]

Fan C, Cai G, Qin J. et al. Mapping of quantitative trait loci and development of allele-specific markers for seed weight in Brassica napus. Theor Appl Genet. 2010; 121:1289-301

[6]

Li F, Chen B, Xu K. et al. Genome-wide association study dissects the genetic architecture of seed weight and seed quality in rapeseed (Brassica napus L.). DNA Res. 2014; 21:355-67

[7]

Zhao W, Wang X, Wang H. et al. Genome-wide identification of QTL for seed yield and yield-related traits and construction of a high-density consensus map for QTL comparison in Brassica napus. Front Plant Sci. 2016; 7:17

[8]

Yu W, Yang Y, Liu L. et al. Decryption of superior allele and candidate genes for total lignin contents of rapeseed. Ind Crop Prod. 2024; 214:118483

[9]

Tan Z, Han X, Dai C. et al. Functional genomics of Brassica napus: Progress, challenges, and perspectives. J Integr Plant Biol. 2024; 66: 484-509

[10]

Gu J, Guan Z, Jiao Y. et al. The story of a decade: genomics, functional genomics and molecular breeding in Brassica napus. Plant Commun. 2024; 5:100884

[11]

Shah S, Weinholdt C, Jedrusik N. et al. Whole-transcriptome analysis reveals genetic factors underlying flowering time regu-lation in rapeseed (Brassica napus L.). Plant Cell Environ. 2018; 41: 1935-47

[12]

Chen S, Nelson M, Ghamkhar K. et al. Divergent patterns of allelic diversity from similar origins: the case of oilseed rape (Brassica napus L.) in China and Australia. Genome. 2008; 51: 1-10

[13]

Hu D, Jing J, Snowdon RJ. et al. Exploring the gene pool of Brassica napus by genomics-based approaches. Plant Biotechnol J. 2021; 19: 1693-712

[14]

Raman H, Raman R, Eckermann P. et al. Genetic and physical mapping of flowering time loci in canola (Brassica napus L.). Theor Appl Genet. 2013; 126:119-32

[15]

Zhang Y, Zhang D, Yu H. et al. Floral initiation in response to planting date reveals the key role of floral meristem differenti-ation prior to budding in canola (Brassica napus L.). Front Plant Sci. 2016; 7:1369

[16]

Tommey AM, Evans EJ. The influence of pre-floral growth and development on the pathway of floral development, dry matter distribution and seed yield in oilseed rape (Brassica napus L.). Ann Appl Biol. 1992; 121:687-96

[17]

Luo T, Zhang J, Khan MN. et al. Temperature variation caused by sowing dates significantly affects floral initiation and floral bud differentiation processes in rapeseed (Brassica napus L.). Plant Sci. 2018; 271:40-51

[18]

Kim D-H, Doyle MR, Sung S. et al. Vernalization: winter and the timing of flowering in plants. Annu Rev Cell Dev Biol. 2009; 25: 277-99

[19]

Kobayashi Y, Weigel D. Move on up, it’s time for change-mobile signals controlling photoperiod-dependent flowering. Genes Dev. 2007; 21:2371-84

[20]

Méndez-Vigo B, Picó FX, Ramiro M. et al. Altitudinal and cli-matic adaptation is mediated by flowering traits and FRI, FLC, and PHYC genes in Arabidopsis. Plant Physiol. 2011; 157: 1942-55

[21]

Xuezhen C, Hao X, Xin L. et al. Studies on correlationship of development stages and agronomic traits of summer sowing soybean. Mol Plant Breed. 2004; 2:247-52

[22]

Fornara F, de Montaigu A, Coupland G. SnapShot: control of flowering in Arabidopsis. Cell. 2010; 141:550-550.e2

[23]

Parkin IA, Clarke WE, Sidebottom C. et al. Towards unambiguous transcript mapping in the allotetraploid Brassica napus. Genome. 2010; 53:929-38

[24]

Xu L, Hu K, Zhang Z. et al. Genome-wide association study reveals the genetic architecture of flowering time in rapeseed (Brassica napus L.). DNA Res. 2016; 23:43-52

[25]

Vollrath P, Chawla HS, Schiessl SV. et al. A novel deletion in FLOWERING LOCUS T modulates flowering time in winter oilseed rape. Theor Appl Genet. 2021; 134:1217-31

[26]

Huang L, Min Y, Schiessl S. et al. Integrative analysis of GWAS and transcriptome to reveal novel loci regulation flowering time in semi-winter rapeseed. Plant Sci. 2021; 310:110980

[27]

Raman H, Raman R, Coombes N. et al. Genome-wide associa-tion analyses reveal complex genetic architecture underlying natural variation for flowering time in canola. Plant Cell Environ. 2016; 39:1228-39

[28]

Meuwissen TH, Hayes BJ, Goddard M. Prediction of total genetic value using genome-wide dense marker maps. Genetics. 2001; 157:1819-29

[29]

Burgueño J, de los Campos G, Weigel K. et al. Genomic predic-tion of breeding values when modeling genotype× environment interaction using pedigree and dense molecular markers. Crop Sci. 2012; 52:707-19

[30]

Jarquín D, Kocak K, Posadas L. et al. Genotyping by sequencing for genomic prediction in a soybean breeding population. BMC Genomics. 2014; 15:1-10

[31]

Saint Pierre C, Burgueño J, Crossa J. et al. Genomic prediction models for grain yield of spring bread wheat in diverse agro-ecological zones. Sci Rep. 2016; 6:27312

[32]

Jarquín D, Lemes da Silva C, Gaynor RC. et al. Increasing genomic-enabled prediction accuracy by modeling genotype × environment interactions in Kansas wheat. Plant genome. 2016; 12:0130

[33]

Vats S, Sharma Y, Kumar V. et al. Recent advances and applica-bility of GBS, GWAS, and GS in oilseed crops. In: Sonah H, Goyal V, Shivaraj SM, Deshmukh RK ( Hoboken,eds.), Genotyping by Sequencing for Crop Improvement. NJ: Wiley, 2022,355-70

[34]

Luo Z, Yu Y, Xiang J. et al. Genomic selection using a subset of SNPs identified by genome-wide association analysis for disease resistance traits in aquaculture species. Aquaculture. 2021; 539:736620

[35]

Massman JM, Jung HJG, Bernardo R. Genomewide selection ver-sus marker-assisted recurrent selection to improve grain yield and stover-quality traits for cellulosic ethanol in maize. Crop Sci. 2013; 53:58-66

[36]

Asoro FG, Newell MA, Beavis WD. et al. Genomic, marker-assisted, and pedigree-BLUP selection methods for β-glucan concentration in elite oat. Crop Sci. 2013; 53:1894-906

[37]

Rutkoski J, Singh R, Huerta-Espino J. et al. Genetic gain from phenotypic and genomic selection for quantitative resistance to stem rust of wheat. Plant Genome. 2014; 10:0074

[38]

Tiede T, Smith KP. Evaluation and retrospective optimization of genomic selection for yield and disease resistance in spring barley. Mol Breed. 2018; 38:1-16

[39]

Watson A, Hickey LT, Christopher J. et al. Multivariate genomic selection and potential of rapid indirect selection with speed breeding in spring wheat. Crop Sci. 2019; 59:1945-59

[40]

Habier D, Fernando RL, Dekkers J. The impact of genetic relation-ship information on genome-assisted breeding values. Genetics. 2007; 177:2389-97

[41]

Legarra A, Robert-Granié C, Manfredi E. et al. Performance of genomic selection in mice. Genetics. 2008; 180:611-8

[42]

Lorenzana RE, Bernardo R. Accuracy of genotypic value predic-tions for marker-based selection in biparental plant populations. Theor Appl Genet. 2009; 120:151-61

[43]

Zhong S, Dekkers JC, Fernando RL. et al. Factors affecting accuracy from genomic selection in populations derived from multiple inbred lines: a barley case study. Genetics. 2009; 182: 355-64

[44]

Xu Y, Ma K, Zhao Y. et al. Genomic selection: a breakthrough technology in rice breeding. Crop J. 2021; 9:669-77

[45]

Würschum T, Abel S, Zhao Y. Potential of genomic selection in rapeseed (Brassica napus L.) breeding. Plant Breed. 2014; 133:45-51

[46]

Zou J, Zhao Y, Liu P. et al. Seed quality traits can be predicted with high accuracy in Brassica napus using genomic data. PLoS One. 2016; 11:e0166624

[47]

Roy J, del Río Mendoza LE, Bandillo N. et al. Genetic mapping and genomic prediction of sclerotinia stem rot resistance to rapeseed/canola (Brassica napus L.) at seedling stage. Theor Appl Genet. 2022; 135:2167-84

[48]

Knoch D, Werner CR, Meyer RC. et al. Multi-omics-based predic-tion of hybrid performance in canola. Theor Appl Genet. 2021; 134: 1147-65

[49]

Wu Z, Wang BO, Chen X. et al. Evaluation of linkage disequilib-rium pattern and association study on seed oil content in Bras-sica napus using ddRAD sequencing. PLoS One. 2016; 11:e0146383

[50]

Minamikawa MF, Kunihisa M, Moriya S. et al. Genomic prediction and genome-wide association study using combined genotypic data from different genotyping systems: application to apple fruit quality traits. Hortic Res. 2024;11:uhae131

[51]

Liu HL. Beijing: China Agricultural University Press; 2000:4-5

[52]

Becker H, Engqvist G, Karlsson B. Comparison of rapeseed cul-tivars and resynthesized lines based on allozyme and RFLP markers. Theor Appl Genet. 1995; 91:62-7

[53]

Fu D, Qian W, Zou J. et al. Genetic dissection of intersubgenomic heterosis in Brassica napus carrying genomic components of B. rapa. Euphytica. 2012; 184:51-164

[54]

Zou J, Hu D, Mason AS. et al. Genetic changes in a novel breed-ing population of Brassica napus synthesized from hundreds of crosses between B. rapa and B. carinata. Plant Biotechnol J. 2018; 16: 507-19

[55]

Schiessl S. Regulation and subfunctionalization of flowering time genes in the allotetraploid oil crop Brassica napus. Front Plant Sci. 2020; 11:605155

[56]

Javed N, Geng J, Tahir M. et al. Identification of QTL influencing seed oil content, fatty acid profile and days to flowering in Brassica napus L. Euphytica. 2016; 207:191-211

[57]

Wu WD, Liang LZ, Yan YT. et al. Whole-genome resequencing of a worldwide collection of rapeseed accessions reveals the genetic basis of ecotype divergence. Mol Plant. 2019; 12:14

[58]

Imaizumi T, Schultz TF, Harmon FG. et al. FKF1 F-box protein mediates cyclic degradation of a repressor of CONSTANS in Arabidopsis. Science. 2005; 309:293-7

[59]

Michaels SD, Amasino RM. FLOWERING LOCUS C encodes a novel MADS domain protein that acts as a repressor of flow-ering. Plant Cell. 1999; 11:949-56

[60]

Sheldon CC, Finnegan EJ, Rouse DT. et al. The control of flowering by vernalization. Curr Opin Plant Biol. 2000; 3:418-22

[61]

Henderson IR, Liu F, Drea S. et al. An allelic series reveals essen-tial roles for FY in plant development in addition to flowering-time control. Development. 2005; 132:3597-607

[62]

Dong H, Tan C, Li Y. et al. Genome-wide association study reveals both overlapping and independent genetic loci to control seed weight and silique length in Brassica napus. Front Plant Sci. 2018; 9:921

[63]

Shen W, Qin P, Yan M. et al. Fine mapping of a silique length-and seed weight-related gene in Brassica napus. Theor Appl Genet. 2019; 132:2985-96

[64]

Khaki S, Wang L. Crop yield prediction using deep neural net-works. Front Plant Sci. 2019; 10:621

[65]

Song YY, Ying L. Decision tree methods: applications for classi-fication and prediction. Shanghai Arch Psychiatry. 2015; 27:130

[66]

Xu S, Xu Y, Gong L. et al. Metabolomic prediction of yield in hybrid rice. Plant J. 2016; 88:219-27

[67]

Perumal S, Higgins E, Sra S. et al. Harnessing genome prediction in Brassica napus through a nested association mapping popula-tion. bioRxiv. 2024; 2024.07. 29.604859

[68]

Li G, Dong Y, Zhao Y. et al. Genome-wide prediction in a hybrid maize population adapted to Northwest China. Crop J. 2020; 8: 830-42

[69]

Momen M, Mehrgardi AA, Sheikhi A. et al. Predictive ability of genome-assisted statistical models under various forms of gene action. Sci Rep. 2018; 8:12309

[70]

Werner CR, Voss-Fels KP, Miller CN. et al.Effective genomic selection in a narrow-genepool crop with low-density markers: Asian rapeseed as an example. Plant Genome. 2018; 11:170084

[71]

Crossa J, Pérez-Rodríguez P, Cuevas J. et al. Genomic selection in plant breeding: methods, models, and perspectives. Trends Plant Sci. 2017; 22:961-75

[72]

Montesinos-López A, Rivera C, Pinto F. et al. Multimodal deep learning methods enhance genomic prediction of wheat breed-ing. G3 (Bethesda). 2023;13:jkad045

[73]

Abdollahi-Arpanahi R, Gianola D, Peñagaricano F. Deep learning versus parametric and ensemble methods for genomic predic-tion of complex phenotypes. Genet Sel Evol. 2020; 52:1-15

[74]

Montesinos-López OA, Montesinos-López A, Pérez-Rodríguez P. et al. A review of deep learning applications for genomic selec-tion. BMC Genomics. 2021; 22:1-23

[75]

Wang K, Abid MA, Rasheed A. et al. DNNGP, a deep neural network-based method for genomic prediction using multi-omics data in plants. Mol Plant. 2023; 16:279-93

[76]

Purcell S, Neale B, Todd-Brown K. et al. PLINK: a tool set for whole-genome association and population-based linkage anal-yses. Am J Hum Genet. 2007; 81:559-75

[77]

Zhang C, Dong S-S, Xu J-Y. et al. PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics. 2019; 35:1786-8

[78]

Ayres DL, Darling A, Zwickl DJ. et al. BEAGLE: an application pro-gramming interface and high-performance computing library for statistical phylogenetics. Syst Biol. 2012; 61:170-3

PDF (5664KB)

305

Accesses

0

Citation

Detail

Sections
Recommended

/