Multi-fidelity learning in materials informatics: methodologies, applications, and outlook
Bo Wang , Yangyang Xu , Yumei Zhou , Dezhen Xue
Journal of Materials Informatics ›› 2026, Vol. 6 ›› Issue (1) -16.
Data-driven methods are transforming materials design by accelerating the discovery of new compounds and the optimization of existing systems. However, the progress of such approaches is often constrained by the scarcity of high-fidelity data from experiments and advanced simulations. Multi-fidelity (MF) learning has emerged as a powerful strategy to address this challenge by integrating information from diverse data sources that vary in accuracy and cost. In this review, we provide a systematic overview of the major methodologies for MF learning, including statistical and parametric models, machine learning models with fidelity features, correction-based models such as co-kriging, deep learning frameworks, and active learning frameworks. We discuss the strengths, limitations, and typical applications of each method in materials science, with illustrative examples spanning electronic structure modeling, alloy design, and interatomic potential development. Cross-cutting issues are also examined, including the bias-variance trade-off, data requirements for nested vs. non-nested designs, and computational scalability. Finally, we highlight outstanding challenges and outline emerging opportunities, such as physics-informed and generative MF models, standardized datasets, and integration with autonomous laboratories. Together, these perspectives define a roadmap for advancing MF learning as a core enabler of next-generation materials discovery.
Multi-fidelity learning / materials informatics / surrogate modeling / active learning / data-driven materials design
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
|
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
|
| [30] |
|
| [31] |
|
| [32] |
|
| [33] |
|
| [34] |
|
| [35] |
|
| [36] |
|
| [37] |
|
| [38] |
|
| [39] |
|
| [40] |
|
| [41] |
|
| [42] |
|
| [43] |
|
| [44] |
|
| [45] |
Yi, J.; Cheng, J.; Bessa, M. A. Practical multi-fidelity machine learning: fusion of deterministic and Bayesian models. arXiv 2025, arXiv:2407.15110. Available online: https://arxiv.org/abs/2407.15110 (accessed 26 March 2026). |
| [46] |
|
| [47] |
|
| [48] |
Le Gratiet, L. Multi-fidelity Gaussian process regression for computer experiments. Ph.D. Thesis, Université Paris-Diderot - Paris VII, Paris, France, 2013. https://theses.hal.science/tel-00866770/file/manuscrit.pdf (accessed 2026-3-26). |
| [49] |
|
| [50] |
Moss, H. B.; Leslie, D. S.; Rayson, P. MUMBO: MUlti-task max-value Bayesian optimization. In Machine Learning and Knowledge Discovery in Databases; Hutter, F., Kersting, K., Lijffijt, J., Valera, I., Eds.; Springer International Publishing: Cham, 2021; pp 447-62. |
| [51] |
|
| [52] |
|
| [53] |
|
| [54] |
|
| [55] |
|
| [56] |
|
| [57] |
|
| [58] |
|
| [59] |
|
| [60] |
|
| [61] |
Weisberg, S. Nonlinear regression. In Wiley Series in Probability and Statistics; Wiley, 2005. |
| [62] |
Bishop CM. Pattern recognition and machine learning. Information Science and Statistics. Springer; 2006. https://books.google.com/books/about/Pattern_Recognition_and_Machine_Learning.html?id=kTNoQgAACAAJ (accessed 2026-03-26). |
| [63] |
|
| [64] |
Sella, V.; Pham, J.; Chaudhuri, A.; Willcox, K. E. Projection-based multifidelity linear regression for data-poor applications. In AIAA SCITECH 2023 Forum; National Harbor, MD & Online; Reston, VA: American Institute of Aeronautics and Astronautics, 2023. |
| [65] |
|
| [66] |
|
| [67] |
|
| [68] |
Zanjani Foumani Z, Shishehbor M, Yousefpour A, Bostanabad R. Multi-fidelity cost-aware Bayesian optimization.Comput Methods Appl Mech Eng2023;407:115937 |
| [69] |
|
| [70] |
|
| [71] |
|
| [72] |
|
| [73] |
|
| [74] |
Cutajar, K.; Pullin, M.; Damianou, A.; Lawrence, N.; González, J. Deep Gaussian processes for multi-fidelity modeling. arXiv 2019, arXiv:1903.07320. Available online: https://arxiv.org/abs/1903.07320 (accessed 26 March 2026). |
| [75] |
Raissi, M.; Karniadakis, G. Deep multi-fidelity Gaussian processes. arXiv 2016, arXiv:1604.07484. Available online: https://arxiv.org/abs/1604.07484 (accessed 26 March 2026). |
| [76] |
|
| [77] |
|
| [78] |
|
| [79] |
|
| [80] |
|
| [81] |
|
| [82] |
|
| [83] |
|
| [84] |
|
| [85] |
|
| [86] |
Motamed, M. A multi-fidelity neural network surrogate sampling method for uncertainty quantification. arXiv 2020, arXiv:1909.01859. Available online: https://arxiv.org/abs/1909.01859 (accessed 26 March 2026). |
| [87] |
Song, D. H.; Tartakovsky, D. M. Transfer Learning on Multi-Fidelity Data. arXiv 2021, arXiv:2105.00856. Available online: https://arxiv.org/abs/2105.00856 (accessed 26 March 2026). |
| [88] |
|
| [89] |
|
| [90] |
|
| [91] |
|
| [92] |
|
| [93] |
De Breuck P, Heymans G, Rignanese G. Accurate experimental band gap predictions with multifidelity correction learning.J Mater Inf2022;2:10 |
| [94] |
|
| [95] |
|
| [96] |
Hernández-García, A.; Saxena, N.; Jain, M.; Liu, C. H.; Bengio, Y. Multi-Fidelity Active Learning with GFlowNets. 2024. https://openreview.net/forum?id=dLaazW9zuF (accessed 2026-03-26). |
| [97] |
|
| [98] |
|
| [99] |
|
| [100] |
|
| [101] |
Giannoukou, K.; Marelli, S.; Sudret, B. Uncertainty-aware multi-fidelity surrogate modeling with noisy data. arXiv 2024, arXiv:2401.06447. Available online: https://arxiv.org/abs/2401.06447 (accessed 26 March 2026). |
| [102] |
Konomi, B. A.; Karagiannis, G. Bayesian analysis of multifidelity computer models with local features and non-nested experimental designs: application to the WRF model. arXiv 2019, arXiv:1910.08063. Available online: https://arxiv.org/abs/1910.08063 (accessed 26 March 2026). |
| [103] |
|
| [104] |
Rasmussen, CE.; Williams, CKI. Gaussian processes for machine learning; The MIT Press, 2005. |
| [105] |
|
/
| 〈 |
|
〉 |