Accelerating Enzyme Engineering With Artificial Intelligence in Biocatalysis: Challenges and Opportunities

Xiafeng Lu , Min Cao , Muqing Ma , Yan Wu , Maohua Qu , Feng Du , Rongtao Ji , Mimi Duan , Leichi Dong , Kai Liu , Xueqi Zhang , Zhibo Luo

Food Bioengineering ›› 2025, Vol. 4 ›› Issue (4) : 589 -611.

PDF
Food Bioengineering ›› 2025, Vol. 4 ›› Issue (4) :589 -611. DOI: 10.1002/fbe2.70029
REVIEW ARTICLE
Accelerating Enzyme Engineering With Artificial Intelligence in Biocatalysis: Challenges and Opportunities
Author information +
History +
PDF

Abstract

Artificial intelligence (AI) has emerged as a paradigm-shifting force in enzyme engineering, enabling data-driven prediction of catalytic activity, stability, and substrate specificity. By integrating large-scale datasets from structured databases (e.g., PDB, BRENDA) and high-throughput experimentation (e.g., deep mutational scanning, microfluidics), machine learning (ML) approaches—including random forests, support vector machines, and deep neural networks—have accelerated enzyme optimization across key domains: mutational profiling, catalytic condition refinement, and mechanistic elucidation. Notably, AlphaFold has revolutionized structure prediction, while AI-directed evolution enhanced enantioselectivity in nonnatural reactions (e.g., C─Si bond formation). Nevertheless, persistent challenges include data heterogeneity, model overfitting with sparse datasets, and limited interpretability of deep learning frameworks. Future advancements necessitate hybrid strategies merging AI with physics-based simulations (e.g., molecular dynamics), rigorous standardization of databases (aligned with FAIR principles), and synergistic integration of rational design with data-driven optimization. This review critically evaluates AI's transformative potential and methodological gaps in enzyme engineering, highlighting implications for sustainable biomanufacturing and industrial biocatalysis.

Keywords

artificial intelligence / deep learning / enzyme engineering / industrial biocatalysis / machine learning

Cite this article

Download citation ▾
Xiafeng Lu, Min Cao, Muqing Ma, Yan Wu, Maohua Qu, Feng Du, Rongtao Ji, Mimi Duan, Leichi Dong, Kai Liu, Xueqi Zhang, Zhibo Luo. Accelerating Enzyme Engineering With Artificial Intelligence in Biocatalysis: Challenges and Opportunities. Food Bioengineering, 2025, 4(4): 589-611 DOI:10.1002/fbe2.70029

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

Abramson, J., J. Adler, J. Dunger, et al. 2024. “Accurate Structure Prediction of Biomolecular Interactions With Alphafold 3.” Nature 630, no. 8016: 493–500. https://doi.org/10.1038/s41586-024-07487-w.

[2]

Ali, R., and Y. Zhang. 2024. “Machine Learning Meets Enzyme Engineering: Examples in the Design of Polyethylene Terephthalate Hydrolases.” Frontiers of Chemical Science and Engineering 18, no. 12: 149. https://doi.org/10.1007/s11705-024-2500-7.

[3]

Baek, M., R. McHugh, I. Anishchenko, H. Jiang, D. Baker, and F. DiMaio. 2024. “Accurate Prediction of Protein-Nucleic Acid Complexes Using RoseTTAFoldNA.” Nature Methods 21, no. 1: 117–121. https://doi.org/10.1038/s41592-023-02086-5.

[4]

Bateman, A., M. J. Martin, S. Orchard, et al. 2020. “UniProt: The Universal Protein Knowledgebase in 2021.” Nucleic Acids Research 49, no. D1: D480–D489. https://doi.org/10.1093/nar/gkaa1100.

[5]

Bian, J., P. Tan, T. Nie, L. Hong, and G. Y. Yang. 2024. “Optimizing Enzyme Thermostability by Combining Multiple Mutations Using Protein Language Model.” mLife 3: 492–504. https://doi.org/10.1002/mlf2.12151.

[6]

Biswas, S., G. Khimulya, E. C. Alley, K. M. Esvelt, and G. M. Church. 2021. “Low-N Protein Engineering With Data-Efficient Deep Learning.” Nature Methods 18, no. 4: 389–396. https://doi.org/10.1038/s41592-021-01100-y.

[7]

Bornscheuer, U. T. 2017. “The Fourth Wave of Biocatalysis Is Approaching.” Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 376, no. 2100: 20170063. https://doi.org/10.1098/rsta.2017.0063.

[8]

Bornscheuer, U. T., G. W. Huisman, R. J. Kazlauskas, S. Lutz, J. C. Moore, and K. Robins. 2012. “Engineering the Third Wave of Biocatalysis.” Nature 485, no. 485: 185–194. https://doi.org/10.1038/nature11117.

[9]

Brotzakis, Z. F., S. Zhang, M. H. Murtada, and M. Vendruscolo. 2025. “AlphaFold Prediction of Structural Ensembles of Disordered Proteins.” Nature Communications 16, no. 1: 1632. https://doi.org/10.1038/s41467-025-56572-9.

[10]

Buel, G. R., and K. J. Walters. 2022. “Can AlphaFold2 Predict the Impact of Missense Mutations on Structure?” Nature Structural & Molecular Biology 29, no. 1: 1–2. https://doi.org/10.1038/s41594-021-00714-2.

[11]

Cai, H., Z. Zhang, M. Wang, et al. 2024. “Pretrainable Geometric Graph Neural Network for Antibody Affinity Maturation.” Nature Communications 15, no. 1: 7785. https://doi.org/10.1038/s41467-024-51563-8.

[12]

Callaway, E. 2024. “Could AI-Designed Proteins Be Weaponized? Scientists Lay out Safety Guidelines.” Nature 627, no. 8004: 478. https://doi.org/10.1038/d41586-024-00699-0.

[13]

Casadevall, G., C. Duran, and S. Osuna. 2023. “AlphaFold2 and Deep Learning for Elucidating Enzyme Conformational Flexibility and Its Application for Design.” JACS Au 3, no. 6: 1554–1562. https://doi.org/10.1021/jacsau.3c00188.

[14]

Che, X., X. Tao, J. Chen, et al. 2025. “Mining Highly Active Oleate Hydratases by Structure Clustering, Sequence Clustering, and Ancestral Sequence Reconstruction.” Journal of Agricultural and Food Chemistry 73, no. 12: 7335–7346. https://doi.org/10.1021/acs.jafc.4c10815.

[15]

Chen, A., S. Ye, Z. Wang, Y. Han, J. Cai, and J. Li. 2023. “Machine-Learning-Assisted Rational Design of 2D Doped Tellurene for Fin Field-Effect Transistor Devices.” Patterns 4, no. 4: 100722-100722. https://doi.org/10.1016/j.patter.2023.100722.

[16]

Chen, Y., K. Tao, W. Ji, V. B. Kumar, S. Rencus-Lazar, and E. Gazit. 2022. “Histidine as a Key Modulator of Molecular Self-Assembly: Peptide-Based Supramolecular Materials Inspired by Biological Systems.” Materials Today 60: 106–127. https://doi.org/10.1016/j.mattod.2022.08.011.

[17]

Chen, Z., S. Yu, J. Liu, et al. 2024. “Concentration Recognition-Based Auto-Dynamic Regulation System (CRUISE) Enabling Efficient Production of Higher Alcohols.” Advanced Science 11, no. 23: e2310215. https://doi.org/10.1002/advs.202310215.

[18]

Cheng, H., M. Zhang, and J. Q. Shi. 2024. “A Survey on Deep Neural Network Pruning: Taxonomy, Comparison, Analysis, and Recommendations.” IEEE Transactions on Pattern Analysis and Machine Intelligence 46, no. 12: 10558–10578. https://doi.org/10.1109/tpami.2024.3447085.

[19]

Choi, J. M., S. S. Han, and H. S. Kim. 2015. “Industrial Applications of Enzyme Biocatalysis: Current Status and Future Aspects.” Biotechnology Advances 33, no. 7: 1443–1454. https://doi.org/10.1016/j.biotechadv.2015.02.014.

[20]

Choudhury, S., B. Narayanan, M. Moret, V. Hatzimanikatis, and L. Miskovic. 2024. “Generative Machine Learning Produces Kinetic Models That Accurately Characterize Intracellular Metabolic States.” Nature Catalysis 7, no. 10: 1086–1098. https://doi.org/10.1038/s41929-024-01220-6.

[21]

Dai, A. D., X. L. Tang, Z. M. Wu, J. T. Tang, R. C. Zheng, and Y. G. Zheng. 2022. “Rational Regulation of Reaction Specificity of Nitrilase for Efficient Biosynthesis of 2-Chloronicotinic Acid Through a Single Site Mutation.” Applied and Environmental Microbiology 88: aem0239721. https://doi.org/10.1128/aem.02397-21.

[22]

Dauparas, J., I. Anishchenko, N. Bennett, et al. 2022. “Robust Deep Learning–Based Protein Sequence Design Using ProteinMPNN.” Science 378, no. 6615: 49–56. https://doi.org/10.1126/science.add2187.

[23]

Dauparas, J., G. R. Lee, R. Pecoraro, et al. 2025. “Atomic Context-Conditioned Protein Sequence Design Using LigandMPNN.” Nature Methods 22, no. 4: 717–723. https://doi.org/10.1038/s41592-025-02626-1.

[24]

Delépine, B., T. Duigou, P. Carbonell, and J. L. Faulon. 2018. “RetroPath2.0: A Retrosynthesis Workflow for Metabolic Engineers.” Metabolic Engineering 45: 158–170. https://doi.org/10.1016/j.ymben.2017.12.002.

[25]

Diao, H. J., L. M. Lin, L. Y. Xu, J. H. Yao, R. C. Zheng, and Y. G. Zheng. 2025. “Engineering Catalytically Promiscuous Enzymes to Serve New Functions.” Biotechnology Advances 82: 108601. https://doi.org/10.1016/j.biotechadv.2025.108601.

[26]

Ding, Y., X. Luo, J. Guo, et al. 2025. “Identification of Gut Microbial Bile Acid Metabolic Enzymes Via an AI-Assisted Pipeline.” Cell 188, no. 21: 6012–6027. https://doi.org/10.1016/j.cell.2025.07.017.

[27]

Ding, X. W., J. Rong, Z. P. Pan, et al. 2024. “De Novo Multienzyme Synthetic Pathways for Lactic Acid Production.” ACS Catalysis 14, no. 7: 4665–4674. https://doi.org/10.1021/acscatal.3c05489.

[28]

Du, J. H., K. X. Zhou, X. Y. Hong, Z. Z. Xu, J. B. Xu, and X. Huang. 2025. “Retrieval Augmented Zero-Shot Enzyme Generation for Specified Substrate.” International Conference on Machine Learning, 5934.

[29]

Eisenstein, M. 2023. “AI-Enhanced Protein Design Makes Proteins That Have Never Existed.” Nature Biotechnology 41, no. 3: 303–305. https://doi.org/10.1038/s41587-023-01705-y.

[30]

Feehan, R., D. Montezano, and J. Slusky. 2021. “Machine Learning for Enzyme Engineering, Selection and Design.” Protein Engineering, Design & Selection: PEDS 34: gzab019. https://doi.org/10.1093/protein/gzab019.

[31]

Gado, J. E., M. Knotts, A. Y. Shaw, et al. 2025. “Machine Learning Prediction of Enzyme Optimum pH.” Nature Machine Intelligence 7, no. 5: 716–729. https://doi.org/10.1038/s42256-025-01026-6.

[32]

George, A., and T. Walsh. 2022. “Artificial Intelligence Is Breaking Patent Law.” Nature 605, no. 7911: 616–618. https://doi.org/10.1038/d41586-022-01391-x.

[33]

Geraldene, M., I.V. Ramiro, and F. Silvia, et al. 2024. “ZymCTRL: A conditional Language Model for the Controllable Generation of Artificial Enzymes.” Preprint, bioRxiv. https://doi.org/10.1101/2024.05.03.592223.

[34]

Goldman, S., R. Das, K. K. Yang, and C. W. Coley. 2022. “Machine Learning Modeling of Family Wide Enzyme-Substrate Specificity Screens.” PLoS Computational Biology 18, no. 2: e1009853. https://doi.org/10.1371/journal.pcbi.1009853.

[35]

Gong, J. S., T. T. Dong, B. C. Gu, et al. 2017. “Semirational Engineering Accelerates the Laboratory Evolution of Nitrilase Catalytic Efficiency for Nicotinic Acid Biosynthesis.” ChemCatChem 9, no. 17: 3395–3401.

[36]

Gong, X. M., Z. Qin, F. L. Li, B. B. Zeng, G. W. Zheng, and J. H. Xu. 2018. “Development of an Engineered Ketoreductase With Simultaneously Improved Thermostability and Activity for Making a Bulky Atorvastatin Precursor.” ACS Catalysis 9, no. 1: 147–153.

[37]

Hadadi, N., J. Hafner, A. Shajkofci, A. Zisaki, and V. Hatzimanikatis. 2016. “ATLAS of Biochemistry: A Repository of all Possible Biochemical Reactions for Synthetic Biology and Metabolic Engineering Studies.” ACS Synthetic Biology 5, no. 10: 1155–1166. https://doi.org/10.1021/acssynbio.6b00054.

[38]

He, X., and M. Yan. 2024. “GraphKM: Machine and Deep Learning for KM Prediction of Wildtype and Mutant Enzymes.” BMC Bioinformatics 25, no. 1: 135. https://doi.org/10.1186/s12859-024-05746-1.

[39]

Hopf, T. A., J. B. Ingraham, F. J. Poelwijk, et al. 2017. “Mutation Effects Predicted From Sequence Co-Variation.” Nature Biotechnology 35, no. 2: 128–135. https://doi.org/10.1038/nbt.3769.

[40]

Hu, D., Z. Y. Lu, X. Liao, Y. Y. Hu, X. Y. Qi, and Y. C. He. 2024. “Rational Design of the Substrate Binding Pocket in Epoxide Hydrolase for Enhanced Enantioselectivity Towards Chiral Ortho-Methylstyrene Oxide.” Molecular Catalysis 568: 114494. https://doi.org/10.1016/j.mcat.2024.114494.

[41]

Hu, Y., H. Yang, M. Li, et al. 2024. “Exploring Protein Conformational Changes Using a Large-Scale Biophysical Sampling Augmented Deep Learning Strategy.” Advanced Science 11, no. 44: 2400884. https://doi.org/10.1002/advs.202400884.

[42]

Hult, K., and P. Berglund. 2007. “Enzyme Promiscuity: Mechanism and Applications.” Trends in Biotechnology 25, no. 5: 231–238. https://doi.org/10.1016/j.tibtech.2007.03.002.

[43]

Humphreys, I. R., J. Pei, M. Baek, et al. 2021. “Computed Structures of Core Eukaryotic Protein Complexes.” Science 374, no. 6573: 1340. eabm4805. https://doi.org/10.1126/science.abm4805.

[44]

Jahn, L. J., V. M. Rekdal, and M. O. A. Sommer. 2023. “Microbial Foods for Improving Human and Planetary Health.” Cell 186, no. 3: 469–478. https://doi.org/10.1016/j.cell.2022.12.002.

[45]

Jiang, R., L. Shang, R. Wang, D. Wang, and N. Wei. 2023. “Machine Learning Based Prediction of Enzymatic Degradation of Plastics Using Encoded Protein Sequence and Effective Feature Representation.” Environmental Science & Technology Letters 10, no. 7: 557–564. https://doi.org/10.1021/acs.estlett.3c00293.

[46]

Jiang, S., H. Li, L. Zhang, et al. 2024. “Generic Diagramming Platform (GDP): A Comprehensive Database of High-Quality Biomedical Graphics.” Nucleic Acids Research 53, no. D1: D1670–D1676. https://doi.org/10.1093/nar/gkae973.

[47]

Jiménez-Osés, G., and F. Peccati. 2025. “Structure Prediction of Alternate Frame Folding Systems With AlphaFold3.” Journal of Chemical Information and Modeling 65, no. 15: 8229–8237. https://doi.org/10.1021/acs.jcim.5c00906.

[48]

Jinich, A., S. Z. Nazia, A. V. Tellez, D. Rappoport, and K. Y. Rhee. 2022. “ Predicting Enzyme Substrate Chemical Structure With Protein Language Models.” Preprint, bioRxiv. https://doi.org/10.1101/2022.09.28.509940.

[49]

Jumper, J., R. Evans, A. Pritzel, et al. 2021. “Highly Accurate Protein Structure Prediction With Alphafold.” Nature 596, no. 7873: 583–589. https://doi.org/10.1038/s41586-021-03819-2.

[50]

Jyoti, K.Bhatia, K.Chauhan, CAttri, A. and Seth 2017. “Improving Stability and Reusability of Rhodococcus pyridinivorans NIT-36 Nitrilase by Whole Cell Immobilization Using Chitosan.” International Journal of Biological Macromolecules 103: 8–15. https://doi.org/10.1016/j.ijbiomac.2017.05.012.

[51]

Khanra, M., V. Ravichandiran, and S. P. Swain. 2025. “Lipase Enzymes for Sustainable Synthesis of Pharmaceuticals and Chiral Organic Building Blocks.” Advanced Sustainable Systems 9, no. 1: e2400495. https://doi.org/10.1002/adsu.202400495.

[52]

Khersonsky, O., R. Lipsh, Z. Avizemer, et al. 2018. “Automated Design of Efficient and Functionally Diverse Enzyme Repertoires.” Molecular Cell 72, no. 1: 178–186.e5. https://doi.org/10.1016/j.molcel.2018.08.033.

[53]

Klemanska, A., K. Dwyer, and G. Walsh. 2024. “Truncation of a Novel C-Terminal Domain of a β-glucanase Improves Its Thermal Stability and Specific Activity.” Biotechnology Journal 19, no. 8: e2400245. https://doi.org/10.1002/biot.202400245.

[54]

Kortemme, T. 2024. “De Novo Protein Design—From New Structures to Programmable Functions.” Cell 187, no. 3: 526–544. https://doi.org/10.1016/j.cell.2023.12.028.

[55]

Lancaster, L., W. Abdallah, S. Banta, and I. Wheeldon. 2018. “Engineering Enzyme Microenvironments for Enhanced Biocatalysis.” Chemical Society Reviews 47, no. 14: 5177–5186. https://doi.org/10.1039/c8cs00085a.

[56]

Landwehr, G. M., J. W. Bogart, C. Magalhaes, E. G. Hammarlund, A. S. Karim, and M. C. Jewett. 2025. “Accelerated Enzyme Engineering by Machine-Learning Guided Cell-Free Expression.” Nature Communications 16, no. 1: 865. https://doi.org/10.1038/s41467-024-55399-0.

[57]

Lannelongue, L., and M. Inouye. 2023. “Environmental Impacts of Machine Learning Applications in Protein Science.” Cold Spring Harbor Perspectives in Biology 15, no. 12: a041473. https://doi.org/10.1101/cshperspect.a041473.

[58]

Lauko, A., S. J. Pellock, K. H. Sumida, et al. 2025. “Computational Design of Serine Hydrolases.” Science 388, no. 6744: adu2454. https://doi.org/10.1126/science.adu2454.

[59]

Lee, C., B. H. Su, and Y. J. Tseng. 2022. “Comparative Studies of AlphaFold, RoseTTAFold and Modeller: A Case Study Involving the Use of G-Protein-Coupled Receptors.” Briefings in Bioinformatics 23, no. 5: bbac308. https://doi.org/10.1093/bib/bbac308.

[60]

Lenz, F., P. Zurek, M. Umlauf, I. E. P. Tozakidis, and J. Jose. 2020. “Tailor-Made β-glucosidase With Increased Activity at Lower Temperature Without Loss of Stability and Glucose Tolerance.” Green Chemistry 22, no. 7: 2234–2243. https://doi.org/10.1039/c9gc04166d.

[61]

Li, F., Y. Chen, M. Anton, and J. Nielsen. 2023. “Gotenzymes: An Extensive Database of Enzyme Parameter Predictions.” Nucleic Acids Research 51, no. D1: D583–D586. https://doi.org/10.1093/nar/gkac831.

[62]

Li, F., L. Yuan, H. Lu, et al. 2022. “Deep Learning-Based kcat Prediction Enables Improved Enzyme-Constrained Model Reconstruction.” Nature Catalysis 5, no. 8: 662–672. https://doi.org/10.1038/s41929-022-00798-z.

[63]

Li, P., J. Yang, M. A. Islam, and S. Ren. 2025. “Making AI Less ‘Thirsty’.” Communications of the ACM 68, no. 7: 54–61. https://doi.org/10.1145/3724499.

[64]

Li, S. F., J. Y. Xie, S. Qiu, et al. 2021. “Semirational Engineering of an Aldo–Keto Reductase KmAKR for Overcoming Trade-Offs Between Catalytic Activity and Thermostability.” Biotechnology and Bioengineering 118, no. 11: 4441–4452.

[65]

Li, Y., R. Cao, Q. He, T. Xiao, and J. Zhu. 2023. “Learning Reliable Neural Networks With Distributed Architecture Representations.” ACM Transactions on Asian and Low-Resource Language Information Processing 22, no. 4: 1–20. https://doi.org/10.1145/3578709.

[66]

Li, Y., and K. Wang. 2025. “A Deep Learning-Based Approach for Predicting Michaelis Constants From Enzymatic Reactions.” Applied Sciences 15, no. 7: 4017. https://doi.org/10.3390/app15074017.

[67]

Li, Z. L., S. Pei, Z. Chen, et al. 2024. “Machine Learning-Assisted Amidase-Catalytic Enantioselectivity Prediction and Rational Design of Variants for Improving Enantioselectivity.” Nature Communications 15, no. 1: 8778. https://doi.org/10.1038/s41467-024-53048-0.

[68]

Liang, T., C. Jiang, J. Yuan, Y. Othman, X. Q. Xie, and Z. Feng. 2022. “Differential Performance of RoseTTAFold in Antibody Modeling.” Briefings in Bioinformatics 23, no. 5: bbac152. https://doi.org/10.1093/bib/bbac152.

[69]

Liao, M., S. Feng, X. Liu, et al. 2024. “Novel Insights into Enzymatic Thermostability: The ‘Short Board’ Theory and Zero-Shot Hamiltonian Model.” Advanced Science 11, no. 45: e2402441. https://doi.org/10.1002/advs.202402441.

[70]

Lin, Y. R., N. Koga, R. Tatsumi-Koga, et al. 2015. “Control over Overall Shape and Size in De Novo Designed Proteins.” Proceedings of the National Academy of Sciences 112, no. 40: E5478–E5485. https://doi.org/10.1073/pnas.1509508112.

[71]

Listov, D., R. Lipsh-Sokolik, S. Rosset, C. Yang, B. E. Correia, and S. J. Fleishman. 2022. “Assessing and Enhancing Foldability in Designed Proteins.” Protein Science 31, no. 9: e4400. https://doi.org/10.1002/pro.4400.

[72]

Listov, D., E. Vos, G. Hoffka, et al. 2025. “Complete Computational Design of High-Efficiency Kemp Elimination Enzymes.” Nature 643: 1421–1427. https://doi.org/10.1038/s41586-025-09136-2.

[73]

Liu, Q., D. Jiang, H. Zhou, et al. 2023. “Pyrolysis-Catalysis Upcycling of Waste Plastic Using a Multilayer Stainless-Steel Catalyst Toward a Circular Economy.” Proceedings of the National Academy of Sciences 120, no. 39: e2305078120. https://doi.org/10.1073/pnas.2305078120.

[74]

Liu, S. H., L. Bai, X. D. Wang, et al. 2025. “Machine Learning-Guided Protein Engineering to Improve the Catalytic Activity of Transaminases Under Neutral pH Conditions.” Organic Chemistry Frontiers 12: 4788–4793. https://doi.org/10.1039/d5qo00423c.

[75]

Liu, W., P. Wang, X. Zhuang, et al. 2025. “RDBSB: A Database for Catalytic Bioparts With Experimental Evidence.” Nucleic Acids Research 53, no. D1: D709–D716. https://doi.org/10.1093/nar/gkae844.

[76]

Lu, H., D. J. Diaz, N. J. Czarnecki, et al. 2022. “Machine Learning-Aided Engineering of Hydrolases for PET Depolymerization.” Nature 604, no. 7907: 662–667. https://doi.org/10.1038/s41586-022-04599-z.

[77]

Lu, J., Y. Wu, C. Deng, et al. 2022. “Model-Based Dynamic Engineering of Escherichia coli for N-Acetylglucosamine Overproduction.” Biotechnology Notes 3: 15–24. https://doi.org/10.1016/j.biotno.2022.02.001.

[78]

Lu, X. F., H. J. Diao, Z. M. Wu, Z. L. Zhang, R. C. Zheng, and Y. G. Zheng. 2022. “Engineering of Reaction Specificity, Enantioselectivity, and Catalytic Activity of Nitrilase for Highly Efficient Synthesis of Pregabalin Precursor.” Biotechnology and Bioengineering 119, no. 9: 2399–2412. https://doi.org/10.1002/bit.28165.

[79]

Luo, Y., G. Jiang, T. Yu, et al. 2021. “ECNet Is an Evolutionary Context-Integrated Deep Learning Framework for Protein Engineering.” Nature Communications 12, no. 1: 5743. https://doi.org/10.1038/s41467-021-25976-8.

[80]

Ma, E. J., E. Siirola, C. Moore, et al. 2021. “Machine-Directed Evolution of an Imine Reductase for Activity and Stereoselectivity.” ACS Catalysis 11, no. 20: 12433–12445. https://doi.org/10.1021/acscatal.1c02786.

[81]

Ma, W., S. Zhang, Z. Li, et al. 2022. “Enhancing Protein Function Prediction Performance by Utilizing AlphaFold-Predicted Protein Structures.” Journal of Chemical Information and Modeling 62, no. 17: 4008–4017. https://doi.org/10.1021/acs.jcim.2c00885.

[82]

McBride, J. M., K. Polev, A. Abdirasulov, V. Reinharz, B. A. Grzybowski, and T. Tlusty. 2023. “AlphaFold2 Can Predict Single-Mutation Effects.” Physical Review Letters 131, no. 21: 218401. https://doi.org/10.1103/PhysRevLett.131.218401.

[83]

Meng, Q., N. Capra, C. M. Palacio, et al. 2020. “Robust ω-Transaminases by Computational Stabilization of the Subunit Interface.” ACS Catalysis 10, no. 5: 2915–2928. https://doi.org/10.1021/acscatal.9b05223.

[84]

Mitchell, D. A., N. Krieger, and D. C. Estumano. 2023. “Estimation of Selectivities in the Lipase-Catalyzed Esterification of Trimethylolpropane With Fatty Acids.” Biochemical Engineering Journal 198: 109024. https://doi.org/10.1016/j.bej.2023.109024.

[85]

Montanucci, L., E. Capriotti, Y. Frank, N. Ben-Tal, and P. Fariselli. 2019. “DDGun: An Untrained Method for the Prediction of Protein Stability Changes Upon Single and Multiple Point Variations.” BMC Bioinformatics 20, no. 1: 335. https://doi.org/10.1186/s12859-019-2923-1.

[86]

Mou, Z., J. Eakes, C. J. Cooper, et al. 2021. “Machine Learning-Based Prediction of Enzyme Substrate Scope: Application to Bacterial Nitrilases.” Proteins: Structure, Function, and Bioinformatics 89, no. 3: 336–347. https://doi.org/10.1002/prot.26019.

[87]

Nestl, B. M., and B. Hauer. 2014. “Engineering of Flexible Loops in Enzymes.” ACS Catalysis 4, no. 9: 3201–3211. https://doi.org/10.1021/cs500325p.

[88]

Pak, M. A., K. A. Markhieva, M. S. Novikova, et al. 2023. “Using Alphafold to Predict the Impact of Single Mutations on Protein Stability and Function.” PLoS One 18, no. 3: e0282689. https://doi.org/10.1371/journal.pone.0282689.

[89]

Pandi, A., D. Adam, A. Zare, et al. 2023. “Cell-Free Biosynthesis Combined With Deep Learning Accelerates De Novo-Development of Antimicrobial Peptides.” Nature Communications 14, no. 1: 7197. https://doi.org/10.1038/s41467-023-42434-9.

[90]

Pearce, R., X. Huang, G. S. Omenn, and Y. Zhang. 2023. “De Novo Protein Fold Design Through Sequence-Independent Fragment Assembly Simulations.” Proceedings of the National Academy of Sciences 120, no. 4: e2208275120. https://doi.org/10.1073/pnas.2208275120.

[91]

Pertusi, D. A., M. E. Moura, J. G. Jeffryes, S. Prabhu, B. Walters Biggs, and K. E. J. Tyo. 2017. “Predicting Novel Substrates for Enzymes With Minimal Experimental Effort With Active Learning.” Metabolic Engineering 44: 171–181. https://doi.org/10.1016/j.ymben.2017.09.016.

[92]

Poddar, A., and S. R. Rao. 2024. “Evolving Intellectual Property Landscape for AI-Driven Innovations in the Biomedical Sector: Opportunities in Stable IP Regime for Shared Success.” Frontiers in Artificial Intelligence 7: 1372161. https://doi.org/10.3389/frai.2024.1372161.

[93]

Qian, H., Y. Wang, X. Zhou, et al. 2025. “ESM-EZY: A Deep Learning Strategy for the Mining of Novel Multicopper Oxidases With Superior Properties.” Nature Communications 16, no. 1: 3274. https://doi.org/10.1038/s41467-025-58521-y.

[94]

Radivojac, P., W. T. Clark, T. R. Oron, et al. 2013. “A Large-Scale Evaluation of Computational Protein Function Prediction.” Nature Methods 10, no. 3: 221–227. https://doi.org/10.1038/nmeth.2340.

[95]

Rahamathulla, M. P., S. Alsubai, and M. Sha. 2025. “Unveiling the Genetic Symphony: Deep Learning for Decoding Promoters and Non-Promoters in DNA Sequence.” Gene Reports 40: 102283. https://doi.org/10.1016/j.genrep.2025.102283.

[96]

Rapp, J. T., B. J. Bremer, and P. A. Romero. 2024. “Self-Driving Laboratories to Autonomously Navigate the Protein Fitness Landscape.” Nature Chemical Engineering 1, no. 1: 97–107. https://doi.org/10.1038/s44286-023-00002-4.

[97]

Reardon, S. 2024. “Five Protein-Design Questions That Still Challenge AI.” Nature 635, no. 8037: 246–248. https://doi.org/10.1038/d41586-024-03595-9.

[98]

Robinson, S. L., M. D. Smith, J. E. Richman, K. G. Aukema, and L. P. Wackett. 2020. “Machine Learning-Based Prediction of Activity and Substrate Specificity for OleA Enzymes in the Thiolase Superfamily.” Synthetic Biology 5, no. 1: ysaa004. https://doi.org/10.1093/synbio/ysaa004.

[99]

Rodríguez-Alonso, M., F. Rodríguez-Vico, F. Las heras-Vázquez, and J. Clemente-Jiménez. 2017. “L-amino Acid productionL-Amino Acid Production by a Immobilized Double-Racemase Hydantoinase Process: Improvement and Comparison With a Free Protein System.” Catalysts 7, no. 6: 192. https://doi.org/10.3390/catal7060192.

[100]

Romero, P. A., A. Krause, and F. H. Arnold. 2013. “Navigating the Protein Fitness Landscape With Gaussian Processes.” Proceedings of the National Academy of Sciences 110, no. 3: E193–E201. https://doi.org/10.1073/pnas.1215251110.

[101]

Roy, S., D. A. Vargas, P. Ma, et al. 2024. “Stereoselective Construction of β-, γ- and δ-lactam Rings via Enzymatic C-H Amidation.” Nature Catalysis 7, no. 1: 65–76. https://doi.org/10.1038/s41929-023-01068-2.

[102]

Scarabelli, G., E. O. Oloo, J. K. X. Maier, and A. Rodriguez-Granillo. 2022. “Accurate Prediction of Protein Thermodynamic Stability Changes Upon Residue Mutation Using Free Energy Perturbation.” Journal of Molecular Biology 434, no. 2: 167375. https://doi.org/10.1016/j.jmb.2021.167375.

[103]

Shao, Q., A. C. Hollenbeak, Y. Jiang, X. Ran, B. O. Bachmann, and Z. J. Yang. 2025. “SubTuner Leverages Physics-Based Modeling to Complement AI in Enzyme Engineering Toward Non-Native Substrates.” Chem Catalysis 5, no. 6: 101334. https://doi.org/10.1016/j.checat.2025.101334.

[104]

Sharma, A., G. Gupta, T. Ahmad, S. Mansoor, and B. Kaur. 2021. “Enzyme Engineering: Current Trends and Future Perspectives.” Food Reviews International 37, no. 2: 121–154. https://doi.org/10.1080/87559129.2019.1695835.

[105]

Shen, S., J. Li, Y. Wang, S. Li, H. E. Xu, and X. He. 2025. “An Update for AlphaFold3 Versus Experimental Structures: Assessing the Precision of Small Molecule Binding in GPCRS.” Acta Pharmacologica Sinica 46: 3355–3364. https://doi.org/10.1038/s41401-025-01617-4.

[106]

Shi, Y., J. S. Tabet, D. E. Milkie, et al. 2024. “Smart Lattice Light-Sheet Microscopy for Imaging Rare and Complex Cellular Events.” Nature Methods 21, no. 2: 301–310. https://doi.org/10.1038/s41592-023-02126-0.

[107]

Singh, N., S. Lane, T. Yu, et al. 2025. “A Generalized Platform for Artificial Intelligence-Powered Autonomous Enzyme Engineering.” Nature Communications 16, no. 1: 5648. https://doi.org/10.1038/s41467-025-61209-y.

[108]

Soleimany, A. P., C. Martin-Alonso, M. Anahtar, C. S. Wang, and S. N. Bhatia. 2022. “Protease Activity Analysis: A Toolkit for Analyzing Enzyme Activity Data.” ACS Omega 7, no. 28: 24292–24301. https://doi.org/10.1021/acsomega.2c01559.

[109]

Song, F., Z. Qin, K. Qiu, et al. 2024. “Development of a Vitamin B5 Hyperproducer in Escherichia coli by Multiple Metabolic Engineering.” Metabolic Engineering 84: 158–168. https://doi.org/10.1016/j.ymben.2024.06.006.

[110]

Song, Y., Q. Yuan, S. Chen, Y. Zeng, H. Zhao, and Y. Yang. 2024. “Accurately Predicting Enzyme Functions Through Geometric Graph Learning on ESMFold-Predicted Structures.” Nature Communications 15, no. 1: 8180. https://doi.org/10.1038/s41467-024-52533-w.

[111]

Stoffel, F., M. Papp, M. Gil-Garcia, et al. 2025. “Enhancement of Enzymatic Activity by Biomolecular Condensates Through pH Buffering.” Nature Communications 16, no. 1: 6368. https://doi.org/10.1038/s41467-025-61013-8.

[112]

Szymczak, P., W. Zarzecki, J. Wang, et al. 2025. “AI-Driven Antimicrobial Peptide Discovery: Mining and Generation.” Accounts of Chemical Research 58, no. 12: 1831–1846. https://doi.org/10.1021/acs.accounts.0c00594.

[113]

Tang, X. L., Y. Y. Li, Y. Mao, R. C. Zheng, and Y. G. Zheng. 2022. “Improvement of Multicatalytic Properties of Nitrilase From Paraburkholderia graminis for Efficient Biosynthesis of 2-Chloronicotinic Acid.” Biotechnology and Bioengineering 119, no. 12: 3421–3431. https://doi.org/10.1002/bit.28218.

[114]

Tang, X. L., X. F. Lu, Z. M. Wu, R. C. Zheng, and Y. G. Zheng. 2018. “Biocatalytic Production of (S)−2-aminobutanamide by a Novel D-Aminopeptidase From Brucella sp With High Activity and Enantioselectivity.” Journal of Biotechnology 266: 20–26. https://doi.org/10.1016/j.jbiotec.2017.12.003.

[115]

Varadi, M., S. Anyango, M. Deshpande, et al. 2022. “AlphaFold Protein Structure Database: Massively Expanding the Structural Coverage of Protein-Sequence Space With High-Accuracy Models.” Nucleic Acids Research 50, no. D1: D439–D444. https://doi.org/10.1093/nar/gkab1061.

[116]

Varadi, M., D. Bertoni, P. Magana, et al. 2023. “AlphaFold Protein Structure Database in 2024: Providing Structure Coverage for Over 214 Million Protein Sequences.” Nucleic Acids Research 52, no. D1: D368–D375. https://doi.org/10.1093/nar/gkad1011.

[117]

Vázquez Torres, S., P. J. Y. Leung, P. Venkatesh, et al. 2024. “De Novo Design of High-Affinity Binders of Bioactive Helical Peptides.” Nature 626, no. 7998: 435–442. https://doi.org/10.1038/s41586-023-06953-1.

[118]

Victorino da Silva Amatto, I., N. Gonsales da Rosa-Garzon, F. Antônio de Oliveira Simões, et al. 2022. “Enzyme Engineering and Its Industrial Applications.” Biotechnology and Applied Biochemistry 69, no. 2: 389–409. https://doi.org/10.1002/bab.2117.

[119]

Wang, J., X. Ouyang, S. Meng, et al. 2025. “Rational Multienzyme Architecture Design With iMARS.” Cell 188, no. 5: 1349–1362.e17. https://doi.org/10.1016/j.cell.2024.12.029.

[120]

Wang, Q., X. Liu, H. Zhang, et al. 2024. “Cytochrome P450 Enzyme Design by Constraining the Catalytic Pocket in a Diffusion Model.” Research 7: 0413. https://doi.org/10.34133/research.0413.

[121]

Wang, T., B. R. Qin, S. Li, et al. 2025. “Discovery of Diverse and High-Quality mRNA Capping Enzymes Through a Language Model-Enabled Platform.” Science Advances 11, no. 15: eadt0402. https://doi.org/10.1126/sciadv.adt0402.

[122]

Wang, X., X. Yin, D. Jiang, et al. 2024. “Multi-Modal Deep Learning Enables Efficient and Accurate Annotation of Enzymatic Active Sites.” Nature Communications 15, no. 1: 7348. https://doi.org/10.1038/s41467-024-51511-6.

[123]

Wang, Y., H. Wang, Y. Li, et al. 2024. “Chemically Conjugated Branched Staples for Super-DNA Origami.” Journal of the American Chemical Society 146, no. 6: 4178–4186. https://doi.org/10.1021/jacs.3c13331.

[124]

Wang, Z., D. Xie, D. Wu, et al. 2025. “Robust Enzyme Discovery and Engineering With Deep Learning Using CataPro.” Nature Communications 16, no. 1: 2736. https://doi.org/10.1038/s41467-025-58038-4.

[125]

Watson, J. L., D. Juergens, N. R. Bennett, et al. 2023. “De Novo Design of Protein Structure and Function With RF Diffusion.” Nature 620, no. 7976: 1089–1100. https://doi.org/10.1038/s41586-023-06415-8.

[126]

Wayment-Steele, H. K., A. Ojoawo, R. Otten, et al. 2024. “Predicting Multiple Conformations via Sequence Clustering and AlphaFold2.” Nature 625, no. 7996: 832–839. https://doi.org/10.1038/s41586-023-06832-9.

[127]

Wei, H., and J. P. Smith. 2023. “Modernized Machine Learning Approach to Illuminate Enzyme Immobilization for Biocatalysis.” ACS Central Science 9, no. 10: 1913–1926. https://doi.org/10.1021/acscentsci.3c00757.

[128]

Wu, B., B. Zhong, L. Zheng, et al. 2025. “Harnessing Protein Language Model for Structure-Based Discovery of Highly Efficient and Robust PET Hydrolases.” Nature Communications 16, no. 1: 6211. https://doi.org/10.1038/s41467-025-61599-z.

[129]

Xiong, W., B. Liu, Y. Shen, K. Jing, and T. R. Savage. 2021. “Protein Engineering Design From Directed Evolution to De Novo Synthesis.” Biochemical Engineering Journal 174: 108096. https://doi.org/10.1016/j.bej.2021.108096.

[130]

Xu, W., J. Cai, W. Wu, Q. Yuan, Z. Mao, and H. Ma. 2025. “Improving Metabolic Engineering Design With Enzyme-Thermo Optimization.” Metabolic Engineering 91: 356–365. https://doi.org/10.1016/j.ymben.2025.05.009.

[131]

Yang, B., H. Wang, W. Song, et al. 2017. “Engineering of the Conformational Dynamics of Lipase to Increase Enantioselectivity.” ACS Catalysis 7, no. 11: 7593–7599. https://doi.org/10.1021/acscatal.7b02404.

[132]

Yang, C., Y. Yang, G. Chu, et al. 2024. “AutoESDCas: A Web-Based Tool for the Whole-Workflow Editing Sequence Design for Microbial Genome Editing Based on the CRISPR/CAS System.” ACS Synthetic Biology 13, no. 6: 1737–1749. https://doi.org/10.1021/acssynbio.4c00063.

[133]

Yang, L., S. Dong, S. Gai, et al. 2024. “Deep Insight of Design, Mechanism, and Cancer Theranostic Strategy of Nanozymes.” Nano-Micro Letters 16, no. 1: 28. https://doi.org/10.1007/s40820-023-01224-0.

[134]

Yang, M., C. Fehl, K. V. Lees, et al. 2018. “Functional and Informatics Analysis Enables Glycosyltransferase Activity Prediction.” Nature Chemical Biology 14, no. 12: 1109–1117. https://doi.org/10.1038/s41589-018-0154-9.

[135]

Yu, H., H. Deng, J. He, J. D. Keasling, and X. Luo. 2023. “UniKP: A Unified Framework for the Prediction of Enzyme Kinetic Parameters.” Nature Communications 14, no. 1: 8211. https://doi.org/10.1038/s41467-023-44113-1.

[136]

Yu, H., S. Ma, Y. Li, and P. A. Dalby. 2022. “Hot Spots-Making Directed Evolution Easier.” Biotechnology Advances 56: 107926. https://doi.org/10.1016/j.biotechadv.2022.107926.

[137]

Yu, S., Q. Li, Y. Zhang, and H. Zhou. 2024. “New Possibility for PET Plastic Recycling by a Tailored Hydrolytic Enzyme.” Green Energy & Environment 9, no. 2: 163–165. https://doi.org/10.1016/j.gee.2023.02.007.

[138]

Yu, T., A. G. Boob, M. J. Volk, X. Liu, H. Cui, and H. Zhao. 2023. “Machine Learning-Enabled Retrobiosynthesis of Molecules.” Nature Catalysis 6, no. 2: 137–151. https://doi.org/10.1038/s41929-022-00909-w.

[139]

Yu, T., H. Cui, J. C. Li, Y. Luo, G. Jiang, and H. Zhao. 2023. “Enzyme Function Prediction Using Contrastive Learning.” Science 379, no. 6639: 1358–1363. https://doi.org/10.1126/science.adf2465.

[140]

Yılmaz, M. A., A. A. Ceylan, G. Kaynar, and A. E. Çiçek. 2025. “LYCEUM: Learning to Call Copy Number Variants on Low-Coverage Ancient Genomes.” Supplement. Bioinformatics 41, no. S1: i285–i293. https://doi.org/10.1093/bioinformatics/btaf244.

[141]

Zeng, Y. C., O. J. Young, C. M. Wintersinger, et al. 2024. “Fine Tuning of CpG Spatial Distribution With DNA Origami for Improved Cancer Vaccination.” Nature Nanotechnology 19, no. 7: 1055–1065. https://doi.org/10.1038/s41565-024-01615-3.

[142]

Zeng, Z., J. Guo, J. Jin, and X. Luo. 2025. “CLAIRE: A Contrastive Learning-Based Predictor for EC Number of Chemical Reactions.” Journal of Cheminformatics 17, no. 1: 2. https://doi.org/10.1186/s13321-024-00944-8.

[143]

Zhang, Q., X. F. Lu, Y. Zhang, X. L. Tang, R. C. Zheng, and Y. G. Zheng. 2020. “Development of a Robust Nitrilase by Fragment Swapping and Semi-Rational Design for Efficient Biosynthesis of Pregabalin Precursor.” Biotechnology and Bioengineering 117, no. 2: 318–329. https://doi.org/10.1002/bit.27203.

[144]

Zhang, Y., R. Li, G. Zou, et al. 2024. “Discovery of Antimicrobial Lysins From the ‘Dark Matter’ of Uncharacterized Phages Using Artificial Intelligence.” Advanced Science 11, no. 32: e2404049. https://doi.org/10.1002/advs.202404049.

[145]

Zhang, Z., Z. Li, Q. Wang, et al. 2024. “A Protein Fitness Predictive Framework Based on Feature Combination and Intelligent Searching.” Protein Science 33, no. 12: e5211. https://doi.org/10.1002/pro.5211.

[146]

Zhang, Z., Z. Li, M. Yang, F. Zhao, and S. Han. 2024. “Machine Learning-Guided Multi-Site Combinatorial Mutagenesis Enhances the Thermostability of Pectin Lyase.” International Journal of Biological Macromolecules 277: 134530. https://doi.org/10.1016/j.ijbiomac.2024.134530.

[147]

Zhao, P. C., X. X. Wei, Q. Wang, et al. 2025. “Single-Step Retrosynthesis Prediction via Multitask Graph Representation Learning.” Nature Communications 16, no. 1: 814. https://doi.org/10.1038/s41467-025-56062-y.

RIGHTS & PERMISSIONS

2025 The Author(s). Food Bioengineering published by John Wiley & Sons Australia, Ltd on behalf of State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology.

PDF

5

Accesses

0

Citation

Detail

Sections
Recommended

/