Artificial intelligence (AI) has emerged as a paradigm-shifting force in enzyme engineering, enabling data-driven prediction of catalytic activity, stability, and substrate specificity. By integrating large-scale datasets from structured databases (e.g., PDB, BRENDA) and high-throughput experimentation (e.g., deep mutational scanning, microfluidics), machine learning (ML) approaches—including random forests, support vector machines, and deep neural networks—have accelerated enzyme optimization across key domains: mutational profiling, catalytic condition refinement, and mechanistic elucidation. Notably, AlphaFold has revolutionized structure prediction, while AI-directed evolution enhanced enantioselectivity in nonnatural reactions (e.g., C─Si bond formation). Nevertheless, persistent challenges include data heterogeneity, model overfitting with sparse datasets, and limited interpretability of deep learning frameworks. Future advancements necessitate hybrid strategies merging AI with physics-based simulations (e.g., molecular dynamics), rigorous standardization of databases (aligned with FAIR principles), and synergistic integration of rational design with data-driven optimization. This review critically evaluates AI's transformative potential and methodological gaps in enzyme engineering, highlighting implications for sustainable biomanufacturing and industrial biocatalysis.
| [1] |
Abramson, J., J. Adler, J. Dunger, et al. 2024. “Accurate Structure Prediction of Biomolecular Interactions With Alphafold 3.” Nature 630, no. 8016: 493–500. https://doi.org/10.1038/s41586-024-07487-w.
|
| [2] |
Ali, R., and Y. Zhang. 2024. “Machine Learning Meets Enzyme Engineering: Examples in the Design of Polyethylene Terephthalate Hydrolases.” Frontiers of Chemical Science and Engineering 18, no. 12: 149. https://doi.org/10.1007/s11705-024-2500-7.
|
| [3] |
Baek, M., R. McHugh, I. Anishchenko, H. Jiang, D. Baker, and F. DiMaio. 2024. “Accurate Prediction of Protein-Nucleic Acid Complexes Using RoseTTAFoldNA.” Nature Methods 21, no. 1: 117–121. https://doi.org/10.1038/s41592-023-02086-5.
|
| [4] |
Bateman, A., M. J. Martin, S. Orchard, et al. 2020. “UniProt: The Universal Protein Knowledgebase in 2021.” Nucleic Acids Research 49, no. D1: D480–D489. https://doi.org/10.1093/nar/gkaa1100.
|
| [5] |
Bian, J., P. Tan, T. Nie, L. Hong, and G. Y. Yang. 2024. “Optimizing Enzyme Thermostability by Combining Multiple Mutations Using Protein Language Model.” mLife 3: 492–504. https://doi.org/10.1002/mlf2.12151.
|
| [6] |
Biswas, S., G. Khimulya, E. C. Alley, K. M. Esvelt, and G. M. Church. 2021. “Low-N Protein Engineering With Data-Efficient Deep Learning.” Nature Methods 18, no. 4: 389–396. https://doi.org/10.1038/s41592-021-01100-y.
|
| [7] |
Bornscheuer, U. T. 2017. “The Fourth Wave of Biocatalysis Is Approaching.” Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 376, no. 2100: 20170063. https://doi.org/10.1098/rsta.2017.0063.
|
| [8] |
Bornscheuer, U. T., G. W. Huisman, R. J. Kazlauskas, S. Lutz, J. C. Moore, and K. Robins. 2012. “Engineering the Third Wave of Biocatalysis.” Nature 485, no. 485: 185–194. https://doi.org/10.1038/nature11117.
|
| [9] |
Brotzakis, Z. F., S. Zhang, M. H. Murtada, and M. Vendruscolo. 2025. “AlphaFold Prediction of Structural Ensembles of Disordered Proteins.” Nature Communications 16, no. 1: 1632. https://doi.org/10.1038/s41467-025-56572-9.
|
| [10] |
Buel, G. R., and K. J. Walters. 2022. “Can AlphaFold2 Predict the Impact of Missense Mutations on Structure?” Nature Structural & Molecular Biology 29, no. 1: 1–2. https://doi.org/10.1038/s41594-021-00714-2.
|
| [11] |
Cai, H., Z. Zhang, M. Wang, et al. 2024. “Pretrainable Geometric Graph Neural Network for Antibody Affinity Maturation.” Nature Communications 15, no. 1: 7785. https://doi.org/10.1038/s41467-024-51563-8.
|
| [12] |
Callaway, E. 2024. “Could AI-Designed Proteins Be Weaponized? Scientists Lay out Safety Guidelines.” Nature 627, no. 8004: 478. https://doi.org/10.1038/d41586-024-00699-0.
|
| [13] |
Casadevall, G., C. Duran, and S. Osuna. 2023. “AlphaFold2 and Deep Learning for Elucidating Enzyme Conformational Flexibility and Its Application for Design.” JACS Au 3, no. 6: 1554–1562. https://doi.org/10.1021/jacsau.3c00188.
|
| [14] |
Che, X., X. Tao, J. Chen, et al. 2025. “Mining Highly Active Oleate Hydratases by Structure Clustering, Sequence Clustering, and Ancestral Sequence Reconstruction.” Journal of Agricultural and Food Chemistry 73, no. 12: 7335–7346. https://doi.org/10.1021/acs.jafc.4c10815.
|
| [15] |
Chen, A., S. Ye, Z. Wang, Y. Han, J. Cai, and J. Li. 2023. “Machine-Learning-Assisted Rational Design of 2D Doped Tellurene for Fin Field-Effect Transistor Devices.” Patterns 4, no. 4: 100722-100722. https://doi.org/10.1016/j.patter.2023.100722.
|
| [16] |
Chen, Y., K. Tao, W. Ji, V. B. Kumar, S. Rencus-Lazar, and E. Gazit. 2022. “Histidine as a Key Modulator of Molecular Self-Assembly: Peptide-Based Supramolecular Materials Inspired by Biological Systems.” Materials Today 60: 106–127. https://doi.org/10.1016/j.mattod.2022.08.011.
|
| [17] |
Chen, Z., S. Yu, J. Liu, et al. 2024. “Concentration Recognition-Based Auto-Dynamic Regulation System (CRUISE) Enabling Efficient Production of Higher Alcohols.” Advanced Science 11, no. 23: e2310215. https://doi.org/10.1002/advs.202310215.
|
| [18] |
Cheng, H., M. Zhang, and J. Q. Shi. 2024. “A Survey on Deep Neural Network Pruning: Taxonomy, Comparison, Analysis, and Recommendations.” IEEE Transactions on Pattern Analysis and Machine Intelligence 46, no. 12: 10558–10578. https://doi.org/10.1109/tpami.2024.3447085.
|
| [19] |
Choi, J. M., S. S. Han, and H. S. Kim. 2015. “Industrial Applications of Enzyme Biocatalysis: Current Status and Future Aspects.” Biotechnology Advances 33, no. 7: 1443–1454. https://doi.org/10.1016/j.biotechadv.2015.02.014.
|
| [20] |
Choudhury, S., B. Narayanan, M. Moret, V. Hatzimanikatis, and L. Miskovic. 2024. “Generative Machine Learning Produces Kinetic Models That Accurately Characterize Intracellular Metabolic States.” Nature Catalysis 7, no. 10: 1086–1098. https://doi.org/10.1038/s41929-024-01220-6.
|
| [21] |
Dai, A. D., X. L. Tang, Z. M. Wu, J. T. Tang, R. C. Zheng, and Y. G. Zheng. 2022. “Rational Regulation of Reaction Specificity of Nitrilase for Efficient Biosynthesis of 2-Chloronicotinic Acid Through a Single Site Mutation.” Applied and Environmental Microbiology 88: aem0239721. https://doi.org/10.1128/aem.02397-21.
|
| [22] |
Dauparas, J., I. Anishchenko, N. Bennett, et al. 2022. “Robust Deep Learning–Based Protein Sequence Design Using ProteinMPNN.” Science 378, no. 6615: 49–56. https://doi.org/10.1126/science.add2187.
|
| [23] |
Dauparas, J., G. R. Lee, R. Pecoraro, et al. 2025. “Atomic Context-Conditioned Protein Sequence Design Using LigandMPNN.” Nature Methods 22, no. 4: 717–723. https://doi.org/10.1038/s41592-025-02626-1.
|
| [24] |
Delépine, B., T. Duigou, P. Carbonell, and J. L. Faulon. 2018. “RetroPath2.0: A Retrosynthesis Workflow for Metabolic Engineers.” Metabolic Engineering 45: 158–170. https://doi.org/10.1016/j.ymben.2017.12.002.
|
| [25] |
Diao, H. J., L. M. Lin, L. Y. Xu, J. H. Yao, R. C. Zheng, and Y. G. Zheng. 2025. “Engineering Catalytically Promiscuous Enzymes to Serve New Functions.” Biotechnology Advances 82: 108601. https://doi.org/10.1016/j.biotechadv.2025.108601.
|
| [26] |
Ding, Y., X. Luo, J. Guo, et al. 2025. “Identification of Gut Microbial Bile Acid Metabolic Enzymes Via an AI-Assisted Pipeline.” Cell 188, no. 21: 6012–6027. https://doi.org/10.1016/j.cell.2025.07.017.
|
| [27] |
Ding, X. W., J. Rong, Z. P. Pan, et al. 2024. “De Novo Multienzyme Synthetic Pathways for Lactic Acid Production.” ACS Catalysis 14, no. 7: 4665–4674. https://doi.org/10.1021/acscatal.3c05489.
|
| [28] |
Du, J. H., K. X. Zhou, X. Y. Hong, Z. Z. Xu, J. B. Xu, and X. Huang. 2025. “Retrieval Augmented Zero-Shot Enzyme Generation for Specified Substrate.” International Conference on Machine Learning, 5934.
|
| [29] |
Eisenstein, M. 2023. “AI-Enhanced Protein Design Makes Proteins That Have Never Existed.” Nature Biotechnology 41, no. 3: 303–305. https://doi.org/10.1038/s41587-023-01705-y.
|
| [30] |
Feehan, R., D. Montezano, and J. Slusky. 2021. “Machine Learning for Enzyme Engineering, Selection and Design.” Protein Engineering, Design & Selection: PEDS 34: gzab019. https://doi.org/10.1093/protein/gzab019.
|
| [31] |
Gado, J. E., M. Knotts, A. Y. Shaw, et al. 2025. “Machine Learning Prediction of Enzyme Optimum pH.” Nature Machine Intelligence 7, no. 5: 716–729. https://doi.org/10.1038/s42256-025-01026-6.
|
| [32] |
George, A., and T. Walsh. 2022. “Artificial Intelligence Is Breaking Patent Law.” Nature 605, no. 7911: 616–618. https://doi.org/10.1038/d41586-022-01391-x.
|
| [33] |
Geraldene, M., I.V. Ramiro, and F. Silvia, et al. 2024. “ZymCTRL: A conditional Language Model for the Controllable Generation of Artificial Enzymes.” Preprint, bioRxiv. https://doi.org/10.1101/2024.05.03.592223.
|
| [34] |
Goldman, S., R. Das, K. K. Yang, and C. W. Coley. 2022. “Machine Learning Modeling of Family Wide Enzyme-Substrate Specificity Screens.” PLoS Computational Biology 18, no. 2: e1009853. https://doi.org/10.1371/journal.pcbi.1009853.
|
| [35] |
Gong, J. S., T. T. Dong, B. C. Gu, et al. 2017. “Semirational Engineering Accelerates the Laboratory Evolution of Nitrilase Catalytic Efficiency for Nicotinic Acid Biosynthesis.” ChemCatChem 9, no. 17: 3395–3401.
|
| [36] |
Gong, X. M., Z. Qin, F. L. Li, B. B. Zeng, G. W. Zheng, and J. H. Xu. 2018. “Development of an Engineered Ketoreductase With Simultaneously Improved Thermostability and Activity for Making a Bulky Atorvastatin Precursor.” ACS Catalysis 9, no. 1: 147–153.
|
| [37] |
Hadadi, N., J. Hafner, A. Shajkofci, A. Zisaki, and V. Hatzimanikatis. 2016. “ATLAS of Biochemistry: A Repository of all Possible Biochemical Reactions for Synthetic Biology and Metabolic Engineering Studies.” ACS Synthetic Biology 5, no. 10: 1155–1166. https://doi.org/10.1021/acssynbio.6b00054.
|
| [38] |
He, X., and M. Yan. 2024. “GraphKM: Machine and Deep Learning for KM Prediction of Wildtype and Mutant Enzymes.” BMC Bioinformatics 25, no. 1: 135. https://doi.org/10.1186/s12859-024-05746-1.
|
| [39] |
Hopf, T. A., J. B. Ingraham, F. J. Poelwijk, et al. 2017. “Mutation Effects Predicted From Sequence Co-Variation.” Nature Biotechnology 35, no. 2: 128–135. https://doi.org/10.1038/nbt.3769.
|
| [40] |
Hu, D., Z. Y. Lu, X. Liao, Y. Y. Hu, X. Y. Qi, and Y. C. He. 2024. “Rational Design of the Substrate Binding Pocket in Epoxide Hydrolase for Enhanced Enantioselectivity Towards Chiral Ortho-Methylstyrene Oxide.” Molecular Catalysis 568: 114494. https://doi.org/10.1016/j.mcat.2024.114494.
|
| [41] |
Hu, Y., H. Yang, M. Li, et al. 2024. “Exploring Protein Conformational Changes Using a Large-Scale Biophysical Sampling Augmented Deep Learning Strategy.” Advanced Science 11, no. 44: 2400884. https://doi.org/10.1002/advs.202400884.
|
| [42] |
Hult, K., and P. Berglund. 2007. “Enzyme Promiscuity: Mechanism and Applications.” Trends in Biotechnology 25, no. 5: 231–238. https://doi.org/10.1016/j.tibtech.2007.03.002.
|
| [43] |
Humphreys, I. R., J. Pei, M. Baek, et al. 2021. “Computed Structures of Core Eukaryotic Protein Complexes.” Science 374, no. 6573: 1340. eabm4805. https://doi.org/10.1126/science.abm4805.
|
| [44] |
Jahn, L. J., V. M. Rekdal, and M. O. A. Sommer. 2023. “Microbial Foods for Improving Human and Planetary Health.” Cell 186, no. 3: 469–478. https://doi.org/10.1016/j.cell.2022.12.002.
|
| [45] |
Jiang, R., L. Shang, R. Wang, D. Wang, and N. Wei. 2023. “Machine Learning Based Prediction of Enzymatic Degradation of Plastics Using Encoded Protein Sequence and Effective Feature Representation.” Environmental Science & Technology Letters 10, no. 7: 557–564. https://doi.org/10.1021/acs.estlett.3c00293.
|
| [46] |
Jiang, S., H. Li, L. Zhang, et al. 2024. “Generic Diagramming Platform (GDP): A Comprehensive Database of High-Quality Biomedical Graphics.” Nucleic Acids Research 53, no. D1: D1670–D1676. https://doi.org/10.1093/nar/gkae973.
|
| [47] |
Jiménez-Osés, G., and F. Peccati. 2025. “Structure Prediction of Alternate Frame Folding Systems With AlphaFold3.” Journal of Chemical Information and Modeling 65, no. 15: 8229–8237. https://doi.org/10.1021/acs.jcim.5c00906.
|
| [48] |
Jinich, A., S. Z. Nazia, A. V. Tellez, D. Rappoport, and K. Y. Rhee. 2022. “ Predicting Enzyme Substrate Chemical Structure With Protein Language Models.” Preprint, bioRxiv. https://doi.org/10.1101/2022.09.28.509940.
|
| [49] |
Jumper, J., R. Evans, A. Pritzel, et al. 2021. “Highly Accurate Protein Structure Prediction With Alphafold.” Nature 596, no. 7873: 583–589. https://doi.org/10.1038/s41586-021-03819-2.
|
| [50] |
Jyoti, K.Bhatia, K.Chauhan, CAttri, A. and Seth 2017. “Improving Stability and Reusability of Rhodococcus pyridinivorans NIT-36 Nitrilase by Whole Cell Immobilization Using Chitosan.” International Journal of Biological Macromolecules 103: 8–15. https://doi.org/10.1016/j.ijbiomac.2017.05.012.
|
| [51] |
Khanra, M., V. Ravichandiran, and S. P. Swain. 2025. “Lipase Enzymes for Sustainable Synthesis of Pharmaceuticals and Chiral Organic Building Blocks.” Advanced Sustainable Systems 9, no. 1: e2400495. https://doi.org/10.1002/adsu.202400495.
|
| [52] |
Khersonsky, O., R. Lipsh, Z. Avizemer, et al. 2018. “Automated Design of Efficient and Functionally Diverse Enzyme Repertoires.” Molecular Cell 72, no. 1: 178–186.e5. https://doi.org/10.1016/j.molcel.2018.08.033.
|
| [53] |
Klemanska, A., K. Dwyer, and G. Walsh. 2024. “Truncation of a Novel C-Terminal Domain of a β-glucanase Improves Its Thermal Stability and Specific Activity.” Biotechnology Journal 19, no. 8: e2400245. https://doi.org/10.1002/biot.202400245.
|
| [54] |
Kortemme, T. 2024. “De Novo Protein Design—From New Structures to Programmable Functions.” Cell 187, no. 3: 526–544. https://doi.org/10.1016/j.cell.2023.12.028.
|
| [55] |
Lancaster, L., W. Abdallah, S. Banta, and I. Wheeldon. 2018. “Engineering Enzyme Microenvironments for Enhanced Biocatalysis.” Chemical Society Reviews 47, no. 14: 5177–5186. https://doi.org/10.1039/c8cs00085a.
|
| [56] |
Landwehr, G. M., J. W. Bogart, C. Magalhaes, E. G. Hammarlund, A. S. Karim, and M. C. Jewett. 2025. “Accelerated Enzyme Engineering by Machine-Learning Guided Cell-Free Expression.” Nature Communications 16, no. 1: 865. https://doi.org/10.1038/s41467-024-55399-0.
|
| [57] |
Lannelongue, L., and M. Inouye. 2023. “Environmental Impacts of Machine Learning Applications in Protein Science.” Cold Spring Harbor Perspectives in Biology 15, no. 12: a041473. https://doi.org/10.1101/cshperspect.a041473.
|
| [58] |
Lauko, A., S. J. Pellock, K. H. Sumida, et al. 2025. “Computational Design of Serine Hydrolases.” Science 388, no. 6744: adu2454. https://doi.org/10.1126/science.adu2454.
|
| [59] |
Lee, C., B. H. Su, and Y. J. Tseng. 2022. “Comparative Studies of AlphaFold, RoseTTAFold and Modeller: A Case Study Involving the Use of G-Protein-Coupled Receptors.” Briefings in Bioinformatics 23, no. 5: bbac308. https://doi.org/10.1093/bib/bbac308.
|
| [60] |
Lenz, F., P. Zurek, M. Umlauf, I. E. P. Tozakidis, and J. Jose. 2020. “Tailor-Made β-glucosidase With Increased Activity at Lower Temperature Without Loss of Stability and Glucose Tolerance.” Green Chemistry 22, no. 7: 2234–2243. https://doi.org/10.1039/c9gc04166d.
|
| [61] |
Li, F., Y. Chen, M. Anton, and J. Nielsen. 2023. “Gotenzymes: An Extensive Database of Enzyme Parameter Predictions.” Nucleic Acids Research 51, no. D1: D583–D586. https://doi.org/10.1093/nar/gkac831.
|
| [62] |
Li, F., L. Yuan, H. Lu, et al. 2022. “Deep Learning-Based kcat Prediction Enables Improved Enzyme-Constrained Model Reconstruction.” Nature Catalysis 5, no. 8: 662–672. https://doi.org/10.1038/s41929-022-00798-z.
|
| [63] |
Li, P., J. Yang, M. A. Islam, and S. Ren. 2025. “Making AI Less ‘Thirsty’.” Communications of the ACM 68, no. 7: 54–61. https://doi.org/10.1145/3724499.
|
| [64] |
Li, S. F., J. Y. Xie, S. Qiu, et al. 2021. “Semirational Engineering of an Aldo–Keto Reductase KmAKR for Overcoming Trade-Offs Between Catalytic Activity and Thermostability.” Biotechnology and Bioengineering 118, no. 11: 4441–4452.
|
| [65] |
Li, Y., R. Cao, Q. He, T. Xiao, and J. Zhu. 2023. “Learning Reliable Neural Networks With Distributed Architecture Representations.” ACM Transactions on Asian and Low-Resource Language Information Processing 22, no. 4: 1–20. https://doi.org/10.1145/3578709.
|
| [66] |
Li, Y., and K. Wang. 2025. “A Deep Learning-Based Approach for Predicting Michaelis Constants From Enzymatic Reactions.” Applied Sciences 15, no. 7: 4017. https://doi.org/10.3390/app15074017.
|
| [67] |
Li, Z. L., S. Pei, Z. Chen, et al. 2024. “Machine Learning-Assisted Amidase-Catalytic Enantioselectivity Prediction and Rational Design of Variants for Improving Enantioselectivity.” Nature Communications 15, no. 1: 8778. https://doi.org/10.1038/s41467-024-53048-0.
|
| [68] |
Liang, T., C. Jiang, J. Yuan, Y. Othman, X. Q. Xie, and Z. Feng. 2022. “Differential Performance of RoseTTAFold in Antibody Modeling.” Briefings in Bioinformatics 23, no. 5: bbac152. https://doi.org/10.1093/bib/bbac152.
|
| [69] |
Liao, M., S. Feng, X. Liu, et al. 2024. “Novel Insights into Enzymatic Thermostability: The ‘Short Board’ Theory and Zero-Shot Hamiltonian Model.” Advanced Science 11, no. 45: e2402441. https://doi.org/10.1002/advs.202402441.
|
| [70] |
Lin, Y. R., N. Koga, R. Tatsumi-Koga, et al. 2015. “Control over Overall Shape and Size in De Novo Designed Proteins.” Proceedings of the National Academy of Sciences 112, no. 40: E5478–E5485. https://doi.org/10.1073/pnas.1509508112.
|
| [71] |
Listov, D., R. Lipsh-Sokolik, S. Rosset, C. Yang, B. E. Correia, and S. J. Fleishman. 2022. “Assessing and Enhancing Foldability in Designed Proteins.” Protein Science 31, no. 9: e4400. https://doi.org/10.1002/pro.4400.
|
| [72] |
Listov, D., E. Vos, G. Hoffka, et al. 2025. “Complete Computational Design of High-Efficiency Kemp Elimination Enzymes.” Nature 643: 1421–1427. https://doi.org/10.1038/s41586-025-09136-2.
|
| [73] |
Liu, Q., D. Jiang, H. Zhou, et al. 2023. “Pyrolysis-Catalysis Upcycling of Waste Plastic Using a Multilayer Stainless-Steel Catalyst Toward a Circular Economy.” Proceedings of the National Academy of Sciences 120, no. 39: e2305078120. https://doi.org/10.1073/pnas.2305078120.
|
| [74] |
Liu, S. H., L. Bai, X. D. Wang, et al. 2025. “Machine Learning-Guided Protein Engineering to Improve the Catalytic Activity of Transaminases Under Neutral pH Conditions.” Organic Chemistry Frontiers 12: 4788–4793. https://doi.org/10.1039/d5qo00423c.
|
| [75] |
Liu, W., P. Wang, X. Zhuang, et al. 2025. “RDBSB: A Database for Catalytic Bioparts With Experimental Evidence.” Nucleic Acids Research 53, no. D1: D709–D716. https://doi.org/10.1093/nar/gkae844.
|
| [76] |
Lu, H., D. J. Diaz, N. J. Czarnecki, et al. 2022. “Machine Learning-Aided Engineering of Hydrolases for PET Depolymerization.” Nature 604, no. 7907: 662–667. https://doi.org/10.1038/s41586-022-04599-z.
|
| [77] |
Lu, J., Y. Wu, C. Deng, et al. 2022. “Model-Based Dynamic Engineering of Escherichia coli for N-Acetylglucosamine Overproduction.” Biotechnology Notes 3: 15–24. https://doi.org/10.1016/j.biotno.2022.02.001.
|
| [78] |
Lu, X. F., H. J. Diao, Z. M. Wu, Z. L. Zhang, R. C. Zheng, and Y. G. Zheng. 2022. “Engineering of Reaction Specificity, Enantioselectivity, and Catalytic Activity of Nitrilase for Highly Efficient Synthesis of Pregabalin Precursor.” Biotechnology and Bioengineering 119, no. 9: 2399–2412. https://doi.org/10.1002/bit.28165.
|
| [79] |
Luo, Y., G. Jiang, T. Yu, et al. 2021. “ECNet Is an Evolutionary Context-Integrated Deep Learning Framework for Protein Engineering.” Nature Communications 12, no. 1: 5743. https://doi.org/10.1038/s41467-021-25976-8.
|
| [80] |
Ma, E. J., E. Siirola, C. Moore, et al. 2021. “Machine-Directed Evolution of an Imine Reductase for Activity and Stereoselectivity.” ACS Catalysis 11, no. 20: 12433–12445. https://doi.org/10.1021/acscatal.1c02786.
|
| [81] |
Ma, W., S. Zhang, Z. Li, et al. 2022. “Enhancing Protein Function Prediction Performance by Utilizing AlphaFold-Predicted Protein Structures.” Journal of Chemical Information and Modeling 62, no. 17: 4008–4017. https://doi.org/10.1021/acs.jcim.2c00885.
|
| [82] |
McBride, J. M., K. Polev, A. Abdirasulov, V. Reinharz, B. A. Grzybowski, and T. Tlusty. 2023. “AlphaFold2 Can Predict Single-Mutation Effects.” Physical Review Letters 131, no. 21: 218401. https://doi.org/10.1103/PhysRevLett.131.218401.
|
| [83] |
Meng, Q., N. Capra, C. M. Palacio, et al. 2020. “Robust ω-Transaminases by Computational Stabilization of the Subunit Interface.” ACS Catalysis 10, no. 5: 2915–2928. https://doi.org/10.1021/acscatal.9b05223.
|
| [84] |
Mitchell, D. A., N. Krieger, and D. C. Estumano. 2023. “Estimation of Selectivities in the Lipase-Catalyzed Esterification of Trimethylolpropane With Fatty Acids.” Biochemical Engineering Journal 198: 109024. https://doi.org/10.1016/j.bej.2023.109024.
|
| [85] |
Montanucci, L., E. Capriotti, Y. Frank, N. Ben-Tal, and P. Fariselli. 2019. “DDGun: An Untrained Method for the Prediction of Protein Stability Changes Upon Single and Multiple Point Variations.” BMC Bioinformatics 20, no. 1: 335. https://doi.org/10.1186/s12859-019-2923-1.
|
| [86] |
Mou, Z., J. Eakes, C. J. Cooper, et al. 2021. “Machine Learning-Based Prediction of Enzyme Substrate Scope: Application to Bacterial Nitrilases.” Proteins: Structure, Function, and Bioinformatics 89, no. 3: 336–347. https://doi.org/10.1002/prot.26019.
|
| [87] |
Nestl, B. M., and B. Hauer. 2014. “Engineering of Flexible Loops in Enzymes.” ACS Catalysis 4, no. 9: 3201–3211. https://doi.org/10.1021/cs500325p.
|
| [88] |
Pak, M. A., K. A. Markhieva, M. S. Novikova, et al. 2023. “Using Alphafold to Predict the Impact of Single Mutations on Protein Stability and Function.” PLoS One 18, no. 3: e0282689. https://doi.org/10.1371/journal.pone.0282689.
|
| [89] |
Pandi, A., D. Adam, A. Zare, et al. 2023. “Cell-Free Biosynthesis Combined With Deep Learning Accelerates De Novo-Development of Antimicrobial Peptides.” Nature Communications 14, no. 1: 7197. https://doi.org/10.1038/s41467-023-42434-9.
|
| [90] |
Pearce, R., X. Huang, G. S. Omenn, and Y. Zhang. 2023. “De Novo Protein Fold Design Through Sequence-Independent Fragment Assembly Simulations.” Proceedings of the National Academy of Sciences 120, no. 4: e2208275120. https://doi.org/10.1073/pnas.2208275120.
|
| [91] |
Pertusi, D. A., M. E. Moura, J. G. Jeffryes, S. Prabhu, B. Walters Biggs, and K. E. J. Tyo. 2017. “Predicting Novel Substrates for Enzymes With Minimal Experimental Effort With Active Learning.” Metabolic Engineering 44: 171–181. https://doi.org/10.1016/j.ymben.2017.09.016.
|
| [92] |
Poddar, A., and S. R. Rao. 2024. “Evolving Intellectual Property Landscape for AI-Driven Innovations in the Biomedical Sector: Opportunities in Stable IP Regime for Shared Success.” Frontiers in Artificial Intelligence 7: 1372161. https://doi.org/10.3389/frai.2024.1372161.
|
| [93] |
Qian, H., Y. Wang, X. Zhou, et al. 2025. “ESM-EZY: A Deep Learning Strategy for the Mining of Novel Multicopper Oxidases With Superior Properties.” Nature Communications 16, no. 1: 3274. https://doi.org/10.1038/s41467-025-58521-y.
|
| [94] |
Radivojac, P., W. T. Clark, T. R. Oron, et al. 2013. “A Large-Scale Evaluation of Computational Protein Function Prediction.” Nature Methods 10, no. 3: 221–227. https://doi.org/10.1038/nmeth.2340.
|
| [95] |
Rahamathulla, M. P., S. Alsubai, and M. Sha. 2025. “Unveiling the Genetic Symphony: Deep Learning for Decoding Promoters and Non-Promoters in DNA Sequence.” Gene Reports 40: 102283. https://doi.org/10.1016/j.genrep.2025.102283.
|
| [96] |
Rapp, J. T., B. J. Bremer, and P. A. Romero. 2024. “Self-Driving Laboratories to Autonomously Navigate the Protein Fitness Landscape.” Nature Chemical Engineering 1, no. 1: 97–107. https://doi.org/10.1038/s44286-023-00002-4.
|
| [97] |
Reardon, S. 2024. “Five Protein-Design Questions That Still Challenge AI.” Nature 635, no. 8037: 246–248. https://doi.org/10.1038/d41586-024-03595-9.
|
| [98] |
Robinson, S. L., M. D. Smith, J. E. Richman, K. G. Aukema, and L. P. Wackett. 2020. “Machine Learning-Based Prediction of Activity and Substrate Specificity for OleA Enzymes in the Thiolase Superfamily.” Synthetic Biology 5, no. 1: ysaa004. https://doi.org/10.1093/synbio/ysaa004.
|
| [99] |
Rodríguez-Alonso, M., F. Rodríguez-Vico, F. Las heras-Vázquez, and J. Clemente-Jiménez. 2017. “L-amino Acid productionL-Amino Acid Production by a Immobilized Double-Racemase Hydantoinase Process: Improvement and Comparison With a Free Protein System.” Catalysts 7, no. 6: 192. https://doi.org/10.3390/catal7060192.
|
| [100] |
Romero, P. A., A. Krause, and F. H. Arnold. 2013. “Navigating the Protein Fitness Landscape With Gaussian Processes.” Proceedings of the National Academy of Sciences 110, no. 3: E193–E201. https://doi.org/10.1073/pnas.1215251110.
|
| [101] |
Roy, S., D. A. Vargas, P. Ma, et al. 2024. “Stereoselective Construction of β-, γ- and δ-lactam Rings via Enzymatic C-H Amidation.” Nature Catalysis 7, no. 1: 65–76. https://doi.org/10.1038/s41929-023-01068-2.
|
| [102] |
Scarabelli, G., E. O. Oloo, J. K. X. Maier, and A. Rodriguez-Granillo. 2022. “Accurate Prediction of Protein Thermodynamic Stability Changes Upon Residue Mutation Using Free Energy Perturbation.” Journal of Molecular Biology 434, no. 2: 167375. https://doi.org/10.1016/j.jmb.2021.167375.
|
| [103] |
Shao, Q., A. C. Hollenbeak, Y. Jiang, X. Ran, B. O. Bachmann, and Z. J. Yang. 2025. “SubTuner Leverages Physics-Based Modeling to Complement AI in Enzyme Engineering Toward Non-Native Substrates.” Chem Catalysis 5, no. 6: 101334. https://doi.org/10.1016/j.checat.2025.101334.
|
| [104] |
Sharma, A., G. Gupta, T. Ahmad, S. Mansoor, and B. Kaur. 2021. “Enzyme Engineering: Current Trends and Future Perspectives.” Food Reviews International 37, no. 2: 121–154. https://doi.org/10.1080/87559129.2019.1695835.
|
| [105] |
Shen, S., J. Li, Y. Wang, S. Li, H. E. Xu, and X. He. 2025. “An Update for AlphaFold3 Versus Experimental Structures: Assessing the Precision of Small Molecule Binding in GPCRS.” Acta Pharmacologica Sinica 46: 3355–3364. https://doi.org/10.1038/s41401-025-01617-4.
|
| [106] |
Shi, Y., J. S. Tabet, D. E. Milkie, et al. 2024. “Smart Lattice Light-Sheet Microscopy for Imaging Rare and Complex Cellular Events.” Nature Methods 21, no. 2: 301–310. https://doi.org/10.1038/s41592-023-02126-0.
|
| [107] |
Singh, N., S. Lane, T. Yu, et al. 2025. “A Generalized Platform for Artificial Intelligence-Powered Autonomous Enzyme Engineering.” Nature Communications 16, no. 1: 5648. https://doi.org/10.1038/s41467-025-61209-y.
|
| [108] |
Soleimany, A. P., C. Martin-Alonso, M. Anahtar, C. S. Wang, and S. N. Bhatia. 2022. “Protease Activity Analysis: A Toolkit for Analyzing Enzyme Activity Data.” ACS Omega 7, no. 28: 24292–24301. https://doi.org/10.1021/acsomega.2c01559.
|
| [109] |
Song, F., Z. Qin, K. Qiu, et al. 2024. “Development of a Vitamin B5 Hyperproducer in Escherichia coli by Multiple Metabolic Engineering.” Metabolic Engineering 84: 158–168. https://doi.org/10.1016/j.ymben.2024.06.006.
|
| [110] |
Song, Y., Q. Yuan, S. Chen, Y. Zeng, H. Zhao, and Y. Yang. 2024. “Accurately Predicting Enzyme Functions Through Geometric Graph Learning on ESMFold-Predicted Structures.” Nature Communications 15, no. 1: 8180. https://doi.org/10.1038/s41467-024-52533-w.
|
| [111] |
Stoffel, F., M. Papp, M. Gil-Garcia, et al. 2025. “Enhancement of Enzymatic Activity by Biomolecular Condensates Through pH Buffering.” Nature Communications 16, no. 1: 6368. https://doi.org/10.1038/s41467-025-61013-8.
|
| [112] |
Szymczak, P., W. Zarzecki, J. Wang, et al. 2025. “AI-Driven Antimicrobial Peptide Discovery: Mining and Generation.” Accounts of Chemical Research 58, no. 12: 1831–1846. https://doi.org/10.1021/acs.accounts.0c00594.
|
| [113] |
Tang, X. L., Y. Y. Li, Y. Mao, R. C. Zheng, and Y. G. Zheng. 2022. “Improvement of Multicatalytic Properties of Nitrilase From Paraburkholderia graminis for Efficient Biosynthesis of 2-Chloronicotinic Acid.” Biotechnology and Bioengineering 119, no. 12: 3421–3431. https://doi.org/10.1002/bit.28218.
|
| [114] |
Tang, X. L., X. F. Lu, Z. M. Wu, R. C. Zheng, and Y. G. Zheng. 2018. “Biocatalytic Production of (S)−2-aminobutanamide by a Novel D-Aminopeptidase From Brucella sp With High Activity and Enantioselectivity.” Journal of Biotechnology 266: 20–26. https://doi.org/10.1016/j.jbiotec.2017.12.003.
|
| [115] |
Varadi, M., S. Anyango, M. Deshpande, et al. 2022. “AlphaFold Protein Structure Database: Massively Expanding the Structural Coverage of Protein-Sequence Space With High-Accuracy Models.” Nucleic Acids Research 50, no. D1: D439–D444. https://doi.org/10.1093/nar/gkab1061.
|
| [116] |
Varadi, M., D. Bertoni, P. Magana, et al. 2023. “AlphaFold Protein Structure Database in 2024: Providing Structure Coverage for Over 214 Million Protein Sequences.” Nucleic Acids Research 52, no. D1: D368–D375. https://doi.org/10.1093/nar/gkad1011.
|
| [117] |
Vázquez Torres, S., P. J. Y. Leung, P. Venkatesh, et al. 2024. “De Novo Design of High-Affinity Binders of Bioactive Helical Peptides.” Nature 626, no. 7998: 435–442. https://doi.org/10.1038/s41586-023-06953-1.
|
| [118] |
Victorino da Silva Amatto, I., N. Gonsales da Rosa-Garzon, F. Antônio de Oliveira Simões, et al. 2022. “Enzyme Engineering and Its Industrial Applications.” Biotechnology and Applied Biochemistry 69, no. 2: 389–409. https://doi.org/10.1002/bab.2117.
|
| [119] |
Wang, J., X. Ouyang, S. Meng, et al. 2025. “Rational Multienzyme Architecture Design With iMARS.” Cell 188, no. 5: 1349–1362.e17. https://doi.org/10.1016/j.cell.2024.12.029.
|
| [120] |
Wang, Q., X. Liu, H. Zhang, et al. 2024. “Cytochrome P450 Enzyme Design by Constraining the Catalytic Pocket in a Diffusion Model.” Research 7: 0413. https://doi.org/10.34133/research.0413.
|
| [121] |
Wang, T., B. R. Qin, S. Li, et al. 2025. “Discovery of Diverse and High-Quality mRNA Capping Enzymes Through a Language Model-Enabled Platform.” Science Advances 11, no. 15: eadt0402. https://doi.org/10.1126/sciadv.adt0402.
|
| [122] |
Wang, X., X. Yin, D. Jiang, et al. 2024. “Multi-Modal Deep Learning Enables Efficient and Accurate Annotation of Enzymatic Active Sites.” Nature Communications 15, no. 1: 7348. https://doi.org/10.1038/s41467-024-51511-6.
|
| [123] |
Wang, Y., H. Wang, Y. Li, et al. 2024. “Chemically Conjugated Branched Staples for Super-DNA Origami.” Journal of the American Chemical Society 146, no. 6: 4178–4186. https://doi.org/10.1021/jacs.3c13331.
|
| [124] |
Wang, Z., D. Xie, D. Wu, et al. 2025. “Robust Enzyme Discovery and Engineering With Deep Learning Using CataPro.” Nature Communications 16, no. 1: 2736. https://doi.org/10.1038/s41467-025-58038-4.
|
| [125] |
Watson, J. L., D. Juergens, N. R. Bennett, et al. 2023. “De Novo Design of Protein Structure and Function With RF Diffusion.” Nature 620, no. 7976: 1089–1100. https://doi.org/10.1038/s41586-023-06415-8.
|
| [126] |
Wayment-Steele, H. K., A. Ojoawo, R. Otten, et al. 2024. “Predicting Multiple Conformations via Sequence Clustering and AlphaFold2.” Nature 625, no. 7996: 832–839. https://doi.org/10.1038/s41586-023-06832-9.
|
| [127] |
Wei, H., and J. P. Smith. 2023. “Modernized Machine Learning Approach to Illuminate Enzyme Immobilization for Biocatalysis.” ACS Central Science 9, no. 10: 1913–1926. https://doi.org/10.1021/acscentsci.3c00757.
|
| [128] |
Wu, B., B. Zhong, L. Zheng, et al. 2025. “Harnessing Protein Language Model for Structure-Based Discovery of Highly Efficient and Robust PET Hydrolases.” Nature Communications 16, no. 1: 6211. https://doi.org/10.1038/s41467-025-61599-z.
|
| [129] |
Xiong, W., B. Liu, Y. Shen, K. Jing, and T. R. Savage. 2021. “Protein Engineering Design From Directed Evolution to De Novo Synthesis.” Biochemical Engineering Journal 174: 108096. https://doi.org/10.1016/j.bej.2021.108096.
|
| [130] |
Xu, W., J. Cai, W. Wu, Q. Yuan, Z. Mao, and H. Ma. 2025. “Improving Metabolic Engineering Design With Enzyme-Thermo Optimization.” Metabolic Engineering 91: 356–365. https://doi.org/10.1016/j.ymben.2025.05.009.
|
| [131] |
Yang, B., H. Wang, W. Song, et al. 2017. “Engineering of the Conformational Dynamics of Lipase to Increase Enantioselectivity.” ACS Catalysis 7, no. 11: 7593–7599. https://doi.org/10.1021/acscatal.7b02404.
|
| [132] |
Yang, C., Y. Yang, G. Chu, et al. 2024. “AutoESDCas: A Web-Based Tool for the Whole-Workflow Editing Sequence Design for Microbial Genome Editing Based on the CRISPR/CAS System.” ACS Synthetic Biology 13, no. 6: 1737–1749. https://doi.org/10.1021/acssynbio.4c00063.
|
| [133] |
Yang, L., S. Dong, S. Gai, et al. 2024. “Deep Insight of Design, Mechanism, and Cancer Theranostic Strategy of Nanozymes.” Nano-Micro Letters 16, no. 1: 28. https://doi.org/10.1007/s40820-023-01224-0.
|
| [134] |
Yang, M., C. Fehl, K. V. Lees, et al. 2018. “Functional and Informatics Analysis Enables Glycosyltransferase Activity Prediction.” Nature Chemical Biology 14, no. 12: 1109–1117. https://doi.org/10.1038/s41589-018-0154-9.
|
| [135] |
Yu, H., H. Deng, J. He, J. D. Keasling, and X. Luo. 2023. “UniKP: A Unified Framework for the Prediction of Enzyme Kinetic Parameters.” Nature Communications 14, no. 1: 8211. https://doi.org/10.1038/s41467-023-44113-1.
|
| [136] |
Yu, H., S. Ma, Y. Li, and P. A. Dalby. 2022. “Hot Spots-Making Directed Evolution Easier.” Biotechnology Advances 56: 107926. https://doi.org/10.1016/j.biotechadv.2022.107926.
|
| [137] |
Yu, S., Q. Li, Y. Zhang, and H. Zhou. 2024. “New Possibility for PET Plastic Recycling by a Tailored Hydrolytic Enzyme.” Green Energy & Environment 9, no. 2: 163–165. https://doi.org/10.1016/j.gee.2023.02.007.
|
| [138] |
Yu, T., A. G. Boob, M. J. Volk, X. Liu, H. Cui, and H. Zhao. 2023. “Machine Learning-Enabled Retrobiosynthesis of Molecules.” Nature Catalysis 6, no. 2: 137–151. https://doi.org/10.1038/s41929-022-00909-w.
|
| [139] |
Yu, T., H. Cui, J. C. Li, Y. Luo, G. Jiang, and H. Zhao. 2023. “Enzyme Function Prediction Using Contrastive Learning.” Science 379, no. 6639: 1358–1363. https://doi.org/10.1126/science.adf2465.
|
| [140] |
Yılmaz, M. A., A. A. Ceylan, G. Kaynar, and A. E. Çiçek. 2025. “LYCEUM: Learning to Call Copy Number Variants on Low-Coverage Ancient Genomes.” Supplement. Bioinformatics 41, no. S1: i285–i293. https://doi.org/10.1093/bioinformatics/btaf244.
|
| [141] |
Zeng, Y. C., O. J. Young, C. M. Wintersinger, et al. 2024. “Fine Tuning of CpG Spatial Distribution With DNA Origami for Improved Cancer Vaccination.” Nature Nanotechnology 19, no. 7: 1055–1065. https://doi.org/10.1038/s41565-024-01615-3.
|
| [142] |
Zeng, Z., J. Guo, J. Jin, and X. Luo. 2025. “CLAIRE: A Contrastive Learning-Based Predictor for EC Number of Chemical Reactions.” Journal of Cheminformatics 17, no. 1: 2. https://doi.org/10.1186/s13321-024-00944-8.
|
| [143] |
Zhang, Q., X. F. Lu, Y. Zhang, X. L. Tang, R. C. Zheng, and Y. G. Zheng. 2020. “Development of a Robust Nitrilase by Fragment Swapping and Semi-Rational Design for Efficient Biosynthesis of Pregabalin Precursor.” Biotechnology and Bioengineering 117, no. 2: 318–329. https://doi.org/10.1002/bit.27203.
|
| [144] |
Zhang, Y., R. Li, G. Zou, et al. 2024. “Discovery of Antimicrobial Lysins From the ‘Dark Matter’ of Uncharacterized Phages Using Artificial Intelligence.” Advanced Science 11, no. 32: e2404049. https://doi.org/10.1002/advs.202404049.
|
| [145] |
Zhang, Z., Z. Li, Q. Wang, et al. 2024. “A Protein Fitness Predictive Framework Based on Feature Combination and Intelligent Searching.” Protein Science 33, no. 12: e5211. https://doi.org/10.1002/pro.5211.
|
| [146] |
Zhang, Z., Z. Li, M. Yang, F. Zhao, and S. Han. 2024. “Machine Learning-Guided Multi-Site Combinatorial Mutagenesis Enhances the Thermostability of Pectin Lyase.” International Journal of Biological Macromolecules 277: 134530. https://doi.org/10.1016/j.ijbiomac.2024.134530.
|
| [147] |
Zhao, P. C., X. X. Wei, Q. Wang, et al. 2025. “Single-Step Retrosynthesis Prediction via Multitask Graph Representation Learning.” Nature Communications 16, no. 1: 814. https://doi.org/10.1038/s41467-025-56062-y.
|
RIGHTS & PERMISSIONS
2025 The Author(s). Food Bioengineering published by John Wiley & Sons Australia, Ltd on behalf of State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology.