The mammalian life cycle initiates with the transition of genetic control from the maternal to the embryonic genome during zygotic genome activation (ZGA), which becomes pivotal for development. Nevertheless, understanding the conservation of genes and transcription factors (TFs) that underlie mammalian ZGA remains limited. Here, we compiled a comprehensive set of ZGA genes from mice, humans, pigs, bovines and goats, including Nr5a2 and TPRX1/2. The identification of 111 homologous genes through comparative analyses was followed by the discovery of a conserved genetic coding region, suggesting potential sequence preferences for ZGA genes. Notably, an interpretable machine learning model based on k-mer core features showed excellent performance in predicting ZGA genes (area under the ROC curve [AUC] > 0.81), revealing abundant and intricate 6-base sequence specific patterns and potential binding TFs, including motifs from NR5A2 and TPRX1/2. Further analysis demonstrated that gene sequence features and epigenetic modification features play equally important roles in regulating ZGA genes. Ultimately, we developed the ZGAExplorer platform to provide an invaluable resource for screening ZGA genes. Our study unravels the sequence determinants of ZGA genes across species through multi-omics data integration and machine learning, yielding insights into ZGA regulatory mechanisms and embryonic developmental arrest.
| [1] |
M. Sakamoto, A. Ito, S. Wakayama, H. Sasaki, T. Wakayama, and T. Ishiuchi, “Detection of Newly Synthesized RNA Reveals Transcriptional Reprogramming During ZGA and a Role of Obox3 in Totipotency Acquisition,” Cell Reports 43 (2024): 114118, https://doi.org/10.1016/j.celrep.2024.114118.
|
| [2] |
M. Vega-Sendino and S. Ruiz, “Transition From Totipotency to Pluripotency in Mice: Insights Into Molecular Mechanisms,” Biochemical Society Transactions 52 (2024): 231–239, https://doi.org/10.1042/bst20230442.
|
| [3] |
D. Jukam, S. A. M. Shariati, and J. M. Skotheim, “Zygotic Genome Activation in Vertebrates,” Developmental Cell 42 (2017): 316–332, https://doi.org/10.1016/j.devcel.2017.07.026.
|
| [4] |
P. Kravchenko and K. Tachibana, “Rise and SINE: Roles of Transcription Factors and Retrotransposons in Zygotic Genome Activation,” Nature Reviews. Molecular Cell Biology 26, no. 1 (2025): 68–79, https://doi.org/10.1038/s41580-024-00772-6.
|
| [5] |
Y. Hu, Y. Wang, Y. He, et al., “Maternal KLF17 Controls Zygotic Genome Activation by Acting as a Messenger for RNA pol II Recruitment in Mouse Embryos,” Developmental Cell 59 (2024): 613–626, https://doi.org/10.1016/j.devcel.2024.01.013.
|
| [6] |
W. Tadros and H. D. Lipshitz, “The Maternal-To-Zygotic Transition: A Play in Two Acts,” Development 136 (2009): 3033–3042, https://doi.org/10.1242/dev.033183.
|
| [7] |
B. Fu, H. Ma, and D. Liu, “Pioneer Transcription Factors: The First Domino in Zygotic Genome Activation,” Biomolecules 14 (2024): 720, https://doi.org/10.3390/biom14060720.
|
| [8] |
N. L. Vastenhouw, W. X. Cao, and H. D. Lipshitz, “The Maternal-to-Zygotic Transition Revisited,” Development 146, no. 11 (2019): dev161471, https://doi.org/10.1242/dev.161471.
|
| [9] |
M. T. Lee, A. R. Bonneau, and A. J. Giraldez, “Zygotic Genome Activation During the Maternal-To-Zygotic Transition,” Annual Review of Cell and Developmental Biology 30 (2014): 581–613, https://doi.org/10.1146/annurev-cellbio-100913-013027.
|
| [10] |
F. Sotomayor-Lugo, N. Iglesias-Barrameda, Y. Castillo-Aleman, et al., “The Dynamics of Histone Modifications During Mammalian Zygotic Genome Activation,” International Journal of Molecular Sciences 25 (2024): 1459, https://doi.org/10.3390/ijms25031459.
|
| [11] |
P. Svoboda, “Mammalian Zygotic Genome Activation,” Seminars in Cell & Developmental Biology 84 (2018): 118–126, https://doi.org/10.1016/j.semcdb.2017.12.006.
|
| [12] |
C. Y. Zhou and R. Heald, “Principles of Genome Activation in the Early Embryo,” Current Opinion in Genetics & Development 81 (2023): 102062, https://doi.org/10.1016/j.gde.2023.102062.
|
| [13] |
J. Gassler, W. Kobayashi, I. Gáspár, et al., “Zygotic Genome Activation by the Totipotency Pioneer Factor Nr5a2,” Science 378, no. 6626 (2022): 1305–1315, https://doi.org/10.1126/science.abn7478.
|
| [14] |
S. Ji, F. Chen, P. Stein, et al., “OBOX Regulates Mouse Zygotic Genome Activation and Early Development,” Nature 620 (2023): 1047–1053, https://doi.org/10.1038/s41586-023-06428-3.
|
| [15] |
A. H. Royall, I. Maeso, T. L. Dunwell, and P. W. H. Holland, “Mouse Obox and Crxos Modulate Preimplantation Transcriptional Profiles Revealing Similarity Between Paralogous Mouse and Human Homeobox Genes,” EvoDevo 9 (2018): 2, https://doi.org/10.1186/s13227-018-0091-4.
|
| [16] |
Z. Zou, C. Zhang, Q. Wang, et al., “Translatome and Transcriptome Co-Profiling Reveals a Role of TPRXs in Human Zygotic Genome Activation,” Science 378, no. 6615 (2022): abo7923, https://doi.org/10.1126/science.abo7923.
|
| [17] |
L. Gawriyski, E.-M. Jouhilahti, M. Yoshihara, et al., “Comprehensive Characterization of the Embryonic Factor LEUTX,” iScience 26, no. 3 (2023): 106172, https://doi.org/10.1016/j.isci.2023.106172.
|
| [18] |
K. Zhou, T. Wang, J. Zhang, et al., “LEUTX Regulates Porcine Embryonic Genome Activation in Somatic Cell Nuclear Transfer Embryos,” Cell Reports 43 (2024): 114372, https://doi.org/10.1016/j.celrep.2024.114372.
|
| [19] |
C. Long, W. Li, P. Liang, S. Liu, and Y. Zuo, “Transcriptome Comparisons of Multi-Species Identify Differential Genome Activation of Mammals Embryogenesis,” IEEE Access 7 (2019): 7794–7802, https://doi.org/10.1109/ACCESS.2018.2889809.
|
| [20] |
G. Bu, W. Zhu, X. Liu, et al., “Coordination of Zygotic Genome Activation Entry and Exit by H3K4me3 and H3K27me3 in Porcine Early Embryos,” Genome Research 32 (2022): 1487–1501, https://doi.org/10.1101/gr.276207.121.
|
| [21] |
J. Wu, J. Xu, B. Liu, et al., “Chromatin Analysis in Human Early Development Reveals Epigenetic Transition During ZGA,” Nature 557 (2018): 256–260, https://doi.org/10.1038/s41586-018-0080-8.
|
| [22] |
J. Li, J. Zhang, W. Hou, et al., “Metabolic Control of Histone Acetylation for Precise and Timely Regulation of Minor ZGA in Early Mammalian Embryos,” Cell Discovery 8 (2022): 96, https://doi.org/10.1038/s41421-022-00440-z.
|
| [23] |
W. Xia, J. Xu, G. Yu, et al., “Resetting Histone Modifications During Human Parental-To-Zygotic Transition,” Science 365, no. 6451 (2019): 353–360, https://doi.org/10.1126/science.aaw5118.
|
| [24] |
K. Wu, D. Fan, H. Zhao, et al., “Dynamics of Histone Acetylation During Human Early Embryogenesis,” Cell Discovery 9 (2023): 29, https://doi.org/10.1038/s41421-022-00514-y.
|
| [25] |
J. Li, W. Hou, Q. Zhao, et al., “Lactate Regulates Major Zygotic Genome Activation by H3K18 Lactylation in Mammals,” National Science Review 11, no. 2 (2024): nwad295, https://doi.org/10.1093/nsr/nwad295.
|
| [26] |
R. Xu, S. Li, Q. Wu, et al., “Stage-Specific H3K9me3 Occupancy Ensures Retrotransposon Silencing in Human Pre-Implantation Embryos,” Cell Stem Cell 29 (2022): 1051–1066, https://doi.org/10.1016/j.stem.2022.06.001.
|
| [27] |
L. C. Lindeman, I. S. Andersen, A. H. Reiner, et al., “Prepatterning of Developmental Gene Expression by Modified Histones Before Zygotic Genome Activation,” Developmental Cell 21 (2011): 993–1004, https://doi.org/10.1016/j.devcel.2011.10.008.
|
| [28] |
D. S. Watson, “Interpretable Machine Learning for Genomics,” Human Genetics 141 (2022): 1499–1513, https://doi.org/10.1007/s00439-021-02387-9.
|
| [29] |
Q. Zhu, S. Gao, B. Xiao, Z. He, and S. Hu, “Plasmer: An Accurate and Sensitive Bacterial Plasmid Prediction Tool Based on Machine Learning of Shared k-Mers and Genomic Features,” Microbiology Spectrum 11 (2023): e0464522, https://doi.org/10.1128/spectrum.04645-22.
|
| [30] |
J. W. Whitaker, Z. Chen, and W. Wang, “Predicting the Human Epigenome From DNA Motifs,” Nature Methods 12 (2015): 265–272, https://doi.org/10.1038/nmeth.3065.
|
| [31] |
Y. Zhang, W. Bao, Y. Cao, H. Cong, B. Chen, and Y. Chen, “A Survey on Protein-DNA-Binding Sites in Computational Biology,” Briefings in Functional Genomics 21 (2022): 357–375, https://doi.org/10.1093/bfgp/elac009.
|
| [32] |
L. L. Colbran, L. Chen, and J. A. Capra, “Short DNA Sequence Patterns Accurately Identify Broadly Active Human Enhancers,” BMC Genomics 18 (2017): 536, https://doi.org/10.1186/s12864-017-3934-9.
|
| [33] |
A. Li, J. Zhang, and Z. Zhou, “PLEK: A Tool for Predicting Long Non-Coding RNAs and Messenger RNAs Based on an Improved k-Mer Scheme,” BMC Bioinformatics 15 (2014): 311, https://doi.org/10.1186/1471-2105-15-311.
|
| [34] |
D. Benveniste, H. J. Sonntag, G. Sanguinetti, and D. Sproul, “Transcription Factor Binding Predicts Histone Modifications in Human Cell Lines,” Proceedings of the National Academy of Sciences of the United States of America 111 (2014): 13367–13372, https://doi.org/10.1073/pnas.1412081111.
|
| [35] |
C. Moeckel, M. Mareboina, M. A. Konnaris, et al., “A Survey of k-Mer Methods and Applications in Bioinformatics,” Computational and Structural Biotechnology Journal 23 (2024): 2289–2303, https://doi.org/10.1016/j.csbj.2024.05.025.
|
| [36] |
V. Ngo, Z. Chen, K. Zhang, J. W. Whitaker, M. Wang, and W. Wang, “Epigenomic Analysis Reveals DNA Motifs Regulating Histone Modifications in Human and Mouse,” Proceedings of the National Academy of Sciences of the United States of America 116 (2019): 3668–3677, https://doi.org/10.1073/pnas.1813565116.
|
| [37] |
B. Hu, L. Zheng, C. Long, et al., “EmExplorer: A Database for Exploring Time Activation of Gene Expression in Mammalian Embryos,” Open Biology 9 (2019): 190054, https://doi.org/10.1098/rsob.190054.
|
| [38] |
L. Zheng, S. Huang, N. Mu, et al., “RAACBook: A Web Server of Reduced Amino Acid Alphabet for Sequence-Dependent Inference by Using Chou's Five-Step Rule,” Database 2019 (2019): baz131, https://doi.org/10.1093/database/baz131.
|
| [39] |
L. Zheng, P. Liang, C. Long, et al., “EmAtlas: A Comprehensive Atlas for Exploring Spatiotemporal Activation in Mammalian Embryogenesis,” Nucleic Acids Research 51 (2023): D924–D932, https://doi.org/10.1093/nar/gkac848.
|
| [40] |
H. Li, C. Long, Y. Hong, L. Luo, and Y. Zuo, “Characterizing Cellular Differentiation Potency and Waddington Landscape via Energy Indicator,” Research 6 (2023): 0118, https://doi.org/10.34133/research.0118.
|
| [41] |
X. Yu, S. Liang, M. Chen, et al., “Recapitulating Early Human Development With 8C-Like Cells,” Cell Reports 39 (2022): 110994, https://doi.org/10.1016/j.celrep.2022.110994.
|
| [42] |
C. Zhang, M. Wang, Y. Li, and Y. Zhang, “Profiling and Functional Characterization of Maternal mRNA Translation During Mouse Maternal-to-Zygotic Transition,” Science Advances 8 (2022): eabj3967, https://doi.org/10.1126/sciadv.abj3967.
|
| [43] |
Q. Tian, Y. Yin, Y. Tian, et al., “Chromatin Modifier EP400 Regulates Oocyte Quality and Zygotic Genome Activation in Mice,” Advanced Science 11, no. 20 (2024): e2308018, https://doi.org/10.1002/advs.202308018.
|
| [44] |
Y. Guo, T. Kitano, K. Inoue, et al., “Obox4 Promotes Zygotic Genome Activation Upon Loss of Dux,” eLife 13 (2024): e95856, https://doi.org/10.7554/eLife.95856.
|
| [45] |
M. Guo, Y. Zhang, J. Zhou, et al., “Precise Temporal Regulation of Dux Is Important for Embryo Development,” Cell Research 29 (2019): 956–959, https://doi.org/10.1038/s41422-019-0238-4.
|
| [46] |
K. Chen, W. Liu, J. Zhu, et al., “Pivotal Role for Long Noncoding RNAs in Zygotic Genome Activation in Mice,” Science China. Life Sciences 67 (2024): 958–969, https://doi.org/10.1007/s11427-023-2502-5.
|
| [47] |
T. He, J. Peng, S. Yang, et al., “SINE-Associated LncRNA SAWPA Regulates Porcine Zygotic Genome Activation,” Advanced Science 11, no. 2 (2024): e2307505, https://doi.org/10.1002/advs.202307505.
|
| [48] |
M. Jin, L. Zhao, H. Yang, et al., “A Long Non-coding RNA Essential for Early Embryonic Development Improves Somatic Cell Nuclear Transfer Somatic Cell Nuclear Transfer Efficiency in Goats,” Reproduction 166 (2023): 285–297, https://doi.org/10.1530/rep-23-0053.
|
| [49] |
M. Deng, Z. Liu, C. Ren, et al., “Long Noncoding RNAs Exchange During Zygotic Genome Activation in Goat,” Biology of Reproduction 99 (2018): 707–717, https://doi.org/10.1093/biolre/ioy118.
|
| [50] |
Y. H. Ling, Q. Zheng, Y. S. Li, et al., “Identification of lncRNAs by RNA Sequencing Analysis During In Vivo Pre-Implantation Developmental Transformation in the Goat,” Frontiers in Genetics 10 (2019): 1040, https://doi.org/10.3389/fgene.2019.01040.
|
| [51] |
Q. Jia, H. T. Wu, X. J. Zhou, et al., “A “GC-Rich” Method for Mammalian Gene Expression: A Dominant Role of Non-coding DNA GC Content in Regulation of Mammalian Gene Expression,” Science China. Life Sciences 53 (2010): 94–100, https://doi.org/10.1007/s11427-010-0003-x.
|
| [52] |
A. Sharma, G. J. Dsilva, G. Deshpande, and S. Galande, “Exploring the Versatility of Zygotic Genome Regulators: A Comparative and Functional Analysis,” Cell Reports 43 (2024): 114680, https://doi.org/10.1016/j.celrep.2024.114680.
|
| [53] |
H. Li, C. Long, J. Xiang, P. Liang, X. Li, and Y. Zuo, “Dppa2/4 as a Trigger of Signaling Pathways to Promote Zygote Genome Activation by Binding to CG-Rich Region,” Briefings in Bioinformatics 22 (2021): bbaa342, https://doi.org/10.1093/bib/bbaa342.
|
| [54] |
K. Hu, W. Li, S. Ma, D. Fang, and J. Xu, “The Identification and Classification of Candidate Genes During the Zygotic Genome Activation in the Mammals,” Zygote 32 (2024): 119–129, https://doi.org/10.1017/s0967199423000631.
|
| [55] |
H. Du, Y. Zhao, J. He, et al., “YTHDF2 Destabilizes m(6)A-Containing RNA Through Direct Recruitment of the CCR4-NOT Deadenylase Complex,” Nature Communications 7 (2016): 12626, https://doi.org/10.1038/ncomms12626.
|
| [56] |
D. Dominissini, S. Moshitch-Moshkovitz, S. Schwartz, et al., “Topology of the Human and Mouse m6A RNA Methylomes Revealed by m6A-Seq,” Nature 485 (2012): 201–206, https://doi.org/10.1038/nature11112.
|
| [57] |
M. Deng, B. B. Chen, Z. Liu, et al., “YTHDF2 Regulates Maternal Transcriptome Degradation and Embryo Development in Goat,” Frontiers in Cell and Developmental Biology 8 (2020): 580367, https://doi.org/10.3389/fcell.2020.580367.
|
| [58] |
M. Iwafuchi-Doi and K. S. Zaret, “Pioneer Transcription Factors in Cell Reprogramming,” Genes & Development 28 (2014): 2679–2692, https://doi.org/10.1101/gad.253443.114.
|
| [59] |
S. Yang, J. Xing, D. Liu, et al., “Review and New Insights Into the Catalytic Structural Domains of the Fe(Ll) and 2-Oxoglutarate Families,” International Journal of Biological Macromolecules 278 (2024): 134798, https://doi.org/10.1016/j.ijbiomac.2024.134798.
|
| [60] |
Z. H. Zhao, T. G. Meng, A. Li, H. Schatten, Z. B. Wang, and Q. Y. Sun, “RNA-Seq Transcriptome Reveals Different Molecular Responses During Human and Mouse Oocyte Maturation and Fertilization,” BMC Genomics 21 (2020): 475, https://doi.org/10.1186/s12864-020-06885-4.
|
| [61] |
W. Hu, H. Zeng, Y. Shi, et al., “Single-Cell Transcriptome and Translatome Dual-Omics Reveals Potential Mechanisms of Human Oocyte Maturation,” Nature Communications 13 (2022): 5114, https://doi.org/10.1038/s41467-022-32791-2.
|
| [62] |
M. N. U. Alam and U. F. Chowdhury, “Short k-Mer Abundance Profiles Yield Robust Machine Learning Features and Accurate Classifiers for RNA Viruses,” PLoS One 15 (2020): e0239381, https://doi.org/10.1371/journal.pone.0239381.
|
| [63] |
M. Ghandi, D. Lee, M. Mohammad-Noori, and M. A. Beer, “Enhanced Regulatory Sequence Prediction Using Gapped k-Mer Features,” PLoS Computational Biology 10 (2014): e1003711, https://doi.org/10.1371/journal.pcbi.1003711.
|
| [64] |
J. M. Kirk, S. O. Kim, K. Inoue, et al., “Functional Classification of Long Non-coding RNAs by k-Mer Content,” Nature Genetics 50 (2018): 1474–1482, https://doi.org/10.1038/s41588-018-0207-8.
|
| [65] |
Z. Li, D. Li, Y. Li, X. Guo, and R. Yang, “Deciphering the Regulatory Code of Histone Modifications in Plants,” Journal of Genetics and Genomics 49 (2022): 1064–1067, https://doi.org/10.1016/j.jgg.2022.07.003.
|
| [66] |
Y. Yang, R. Zhang, S. Singh, and J. Ma, “Exploiting Sequence-Based Features for Predicting Enhancer-Promoter Interactions,” Bioinformatics 33 (2017): i252–i260, https://doi.org/10.1093/bioinformatics/btx257.
|
| [67] |
S. C. Sun, X. G. Wang, X. S. Ma, X. J. Huang, J. Li, and H. L. Liu, “TBP Dynamics During Mouse Oocyte Meiotic Maturation and Early Embryo Development,” PLoS One 8 (2013): e55425, https://doi.org/10.1371/journal.pone.0055425.
|
| [68] |
L. Hai-Xia, G. Xin-Yu, X. Yan, Y. Qi-Long, G. Ming-Xiao, and Z. Jin-Yu, “Study of the Dynamic Expression of Meis1 in Mice,” Iranian Journal of Reproductive Medicine 11 (2013): 139–144.
|
| [69] |
M. A. Eckersley-Maslin, C. Alda-Catalinas, and W. Reik, “Dynamics of the Epigenetic Landscape During the Maternal-To-Zygotic Transition,” Nature Reviews. Molecular Cell Biology 19 (2018): 436–450, https://doi.org/10.1038/s41580-018-0008-z.
|
| [70] |
W. Zhu, G. Bu, R. Hu, et al., “KLF4 Facilitates Chromatin Accessibility Remodeling in Porcine Early Embryos,” Science China. Life Sciences 67 (2024): 96–112, https://doi.org/10.1007/s11427-022-2349-9.
|
| [71] |
L. Shi, Y. Zhai, Y. Zhao, et al., “ELF4 Is Critical to Zygotic Gene Activation and Epigenetic Reprogramming During Early Embryonic Development in Pigs,” Frontiers in Veterinary Science 9 (2022): 954601, https://doi.org/10.3389/fvets.2022.954601.
|
| [72] |
M. Pertea, D. Kim, G. M. Pertea, J. T. Leek, and S. L. Salzberg, “Transcript-Level Expression Analysis of RNA-Seq Experiments With HISAT, StringTie and Ballgown,” Nature Protocols 11 (2016): 1650–1667, https://doi.org/10.1038/nprot.2016.095.
|
| [73] |
Y. Liao, G. K. Smyth, and W. Shi, “featureCounts: An Efficient General Purpose Program for Assigning Sequence Reads to Genomic Features,” Bioinformatics 30 (2014): 923–930, https://doi.org/10.1093/bioinformatics/btt656.
|
| [74] |
M. I. Love, W. Huber, and S. Anders, “Moderated Estimation of Fold Change and Dispersion for RNA-seq Data With DESeq2,” Genome Biology 15 (2014): 550, https://doi.org/10.1186/s13059-014-0550-8.
|
| [75] |
B. Langmead and S. L. Salzberg, “Fast Gapped-Read Alignment With Bowtie 2,” Nature Methods 9 (2012): 357–359, https://doi.org/10.1038/nmeth.1923.
|
| [76] |
T. L. Bailey, J. Johnson, C. E. Grant, and W. S. Noble, “The MEME Suite,” Nucleic Acids Research 43 (2015): W39–W49, https://doi.org/10.1093/nar/gkv416.
|
| [77] |
G. Yu, L. G. Wang, and Q. Y. He, “ChIPseeker: An R/Bioconductor Package for ChIP Peak Annotation, Comparison and Visualization,” Bioinformatics 31 (2015): 2382–2383, https://doi.org/10.1093/bioinformatics/btv145.
|
| [78] |
D. Szklarczyk, R. Kirsch, M. Koutrouli, et al., “The STRING Database in 2023: Protein-Protein Association Networks and Functional Enrichment Analyses for any Sequenced Genome of Interest,” Nucleic Acids Research 51 (2023): D638–D646, https://doi.org/10.1093/nar/gkac1000.
|
| [79] |
H. Thorvaldsdóttir, J. T. Robinson, and J. P. Mesirov, “Integrative Genomics Viewer (IGV): High-Performance Genomics Data Visualization and Exploration,” Briefings in Bioinformatics 14 (2013): 178–192, https://doi.org/10.1093/bib/bbs017.
|
RIGHTS & PERMISSIONS
2025 The Author(s). Cell Proliferation published by Beijing Institute for Stem Cell and Regenerative Medicine and John Wiley & Sons Ltd.