Evaluating the use of synthetic data for machine learning prediction of self-healing capacity of concrete

Franciana Sokoloski de Oliveira , Ricardo Stefani

AI in Civil Engineering ›› 2025, Vol. 4 ›› Issue (1) : 25

PDF
AI in Civil Engineering ›› 2025, Vol. 4 ›› Issue (1) :25 DOI: 10.1007/s43503-025-00074-6
Original Article
research-article

Evaluating the use of synthetic data for machine learning prediction of self-healing capacity of concrete

Author information +
History +
PDF

Abstract

The scarcity of experimental data poses a significant challenge in predicting the self-healing capacity of bacteria-driven concrete. To address this issue, we explored the use of synthetic data generation to augment the limited available dataset. By creating a synthetic dataset derived from real-world data, we substantially expanded the original data volume. We then trained and evaluated multiple machine learning (ML) models, encompassing both probabilistic and ensemble methods, for predicting self-healing capacity. Our comparative analysis revealed that ensemble methods, specifically the random forest (RF) algorithm, achieved the highest performance with an accuracy and F1-score of 0.863, surpassing the probabilistic models. Furthermore, when applied to real-world cases, the models maintained high predictive accuracy. This work confirms the value of synthetic data for enhancing the accuracy and reliability of predictive models in civil engineering, especially in data-scarce contexts. Our findings underscore the potential of machine learning and artificial intelligence to transform concrete research and highlight the role of synthetic data in overcoming common data limitations.

Keywords

Self-healing / Concrete / Bacteria / Synthetic data / Machine learning

Cite this article

Download citation ▾
Franciana Sokoloski de Oliveira, Ricardo Stefani. Evaluating the use of synthetic data for machine learning prediction of self-healing capacity of concrete. AI in Civil Engineering, 2025, 4(1): 25 DOI:10.1007/s43503-025-00074-6

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

Alabduljabbar H, Khan K, Awan HH, Alyousef R, Mohamed AM, Eldin SM. Modeling the capacity of engineered cementitious composites for self-healing using AI-based ensemble techniques. Case Studies in Construction Materials, 2023, 18 e01805

[2]

Alghamdi SJ. Prediction of concrete’s compressive strength via artificial neural network trained on synthetic data. Engineering, Technology & Applied Science Research, 2023, 13(6): 12404-12408

[3]

Althoey F, Amin MN, Khan K, Usman MM, Khan MA, Javed MF, Sabri MMS, Alrowais R, Maglad AM. Machine learning based computational approach for crack width detection of self-healing concrete. Case Studies in Construction Materials, 2022, 17 e01610

[4]

Bisong E. Bisong E. Logistic Regression. Building Machine Learning and Deep Learning Models on Google Cloud Platform A Comprehensive Guide for Beginners, 2019Academic Press

[5]

Branco P, Ribeiro RP, Torgo L, Krawczyk B, Moniz N. SMOGN: A pre-processing approach for imbalanced regression. Proceedings of Machine Learning Research, 2017, 74: 36-50

[6]

Burges CJC. A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 1998, 2(2): 121-167

[7]

Cervantes J, Garcia-Lamont F, Rodríguez-Mazahua L, Lopez A. A comprehensive survey on support vector machine classification: Applications, challenges and trends. Neurocomputing, 2020, 408: 189-215

[8]

Chaabene WB, Flah M, Nehdi ML. Machine learning prediction of mechanical properties of concrete: Critical review. Construction and Building Materials, 2020, 260(119889): 1-18

[9]

Chen G, Tang W, Chen S, Wang S, Cui H. Prediction of self-healing of engineered cementitious composite using machine learning approaches. Applied Sciences (Switzerland), 2022, 12(7): 1-27

[10]

Cloutier LM, Sirois S. Bayesian versus frequentist statistical modeling: A debate for hit selection from HTS campaigns. Drug Discovery Today, 2008, 13(11–12): 536-542

[11]

Cook R, Lapeyre J, Ma H, Kumar A, Asce AM. Prediction of compressive strength of concrete: critical comparison of performance of a hybrid machine learning model with standalone models. Journal of Geotechnical and Geoenvironmental Engineering, 2019

[12]

Dehestani A, Kazemi F, Abdi R, Nitka M. Prediction of fracture toughness in fibre-reinforced concrete, mortar, and rocks using various machine learning techniques. Engineering Fracture Mechanics, 2022, 276 108914

[13]

dos Santos Freitas MM, Barbosa JR, dos Santos Martins EM, da Silva Martins LH, de Souza Farias F, de Fátima Henriques Lourenço L, da Silva e Silva N. KNN algorithm and multivariate analysis to select and classify starch films. Food Packaging and Shelf Life, 2022, 34 100976

[14]

Ehrman TM, Barlow DJ, Hylands PJ. Virtual screening of Chinese herbs with random forest. Journal of Chemical Information and Modeling, 2007, 47(2): 264-278

[15]

Fan R-E, Chang K-W, Hsieh C-J, Wang X-R, Lin C-J. LIBLINEAR: A library for large linear classification. Journal of Machine Learning Research, 2008, 9: 1871-1874

[16]

Feng DC, Liu ZT, Wang XD, Chen Y, Chang JQ, Wei DF, Jiang ZM. Machine learning-based compressive strength prediction for concrete: An adaptive boosting approach. Construction and Building Materials, 2020, 230 117000

[17]

Feng J, Su Y, Qian C. Coupled effect of PP fiber, PVA fiber and bacteria on self-healing efficiency of early-age cracks in concrete. Construction and Building Materials, 2019, 228 116810

[18]

Gupta S, Kua HW, Pang SD. Healing cement mortar by immobilization of bacteria in biochar: An integrated approach of self-healing and carbon sequestration. Cement and Concrete Composites, 2018, 86: 238-254

[19]

Hittmeir, M., Ekelhart, A., & Mayer, R. (2019). On the Utility of Synthetic Data: An Empirical Evaluation on Machine Learning Tasks. Proceedings of the 14th International Conference on Availability, Reliability and Security, 1–6. https://doi.org/10.1145/3339252.3339281

[20]

Hong Y, Park S, Kim H, Kim H. Synthetic data generation using building information models. Automation in Construction, 2021, 130 103871

[21]

Hossain MR, Sultana R, Patwary MM, Khunga N, Sharma P, Shaker SJ. Self-healing concrete for sustainable buildings. A review. Environmental Chemistry Letters, 2022, 20(Number 2): 1265-1273

[22]

Huang X, Sresakoolchai J, Qin X, Ho YF, Kaewunruen S. Self-healing performance assessment of bacterial-based concrete using machine learning approaches. Materials, 2022, 15(13): 4436

[23]

Janiesch C, Zschech P, Heinrich K. Machine learning and deep learning. Electronic Markets, 2021, 31: 685-695

[24]

Juan Y, Dai Y, Yang Y, Zhang J. Accelerating materials discovery using machine learning. Journal of Materials Science & Technology, 2021, 79: 178-190

[25]

Karthiga Shenbagam N, Praveena R. Performance of bacteria on self-healing concrete and its effects as carrier. Materials Today: Proceedings, 2022, 65: 1987-1989

[26]

Kaveh A. Applications of artificial neural networks and machine learning in civil engineering. Studies in Computational Intelligence, 2024, 1168: 472

[27]

Krüger M, Vogel-Heuser B, Hujo D, Walch J, Prinz T, Pohl D, Cha S, Kerausch C. Fottner Johannes MD, Nübel. Synthetic Data Generation for the Enrichment of Civil Engineering Machine Data. Construction Logistics Equipment and Robotics, 2024Springer

[28]

Kumar Jogi P, Vara Lakshmi TVS. Self healing concrete based on different bacteria: A review. Materials Today: Proceedings, 2020, 43: 1246-1252

[29]

Kumar Tipu R, Panchal VR, Pandya KS. An ensemble approach to improve BPNN model precision for predicting compressive strength of high-performance concrete. Structures, 2022, 45: 500-508

[30]

Li VC, Yang E-H. van der Zwaag S. Self Healing in Concrete Materials. Self healing materials: an alternative approach to 20 centuries of materials science, 2007Springer

[31]

Liao C-Y, Zhang L, Hu S-Y, Xia S-J, Li DM. Recent advances of self-healing materials for civil engineering: Models and simulations. Buildings, 2024, 14: 961-987

[32]

Luo M, Qian CX, Li RY. Factors affecting crack repairing capacity of bacteria-based self-healing concrete. Construction and Building Materials, 2015, 87: 1-7

[33]

Mammone A, Turchi M, Cristianini N. Support vector machines. Wiley Interdisciplinary Reviews: Computational Statistics, 2009, 1(Number 3283-289

[34]

Marani A, Nehdi ML. Predicting shear strength of FRP-reinforced concrete beams using novel synthetic data driven deep learning. Eng Struct, 2022, 257 114083

[35]

Mumuni A, Mumuni F. Data augmentation: A comprehensive survey of modern approaches. Array, 2022, 16 100258

[36]

Niazi A, Jameh-Bozorghi S, Nori-Shargh D. Prediction of toxicity of nitrobenzenes using ab initio and least squares support vector machines. Journal of Hazardous Materials, 2008, 151(2–3): 603-609

[37]

Nodehi M, Ozbakkaloglu T, Gholampour A. A systematic review of bacteria-based self-healing concrete: Biomineralization, mechanical, and durability properties. Journal of Building Engineering, 2022, 49 104038

[38]

Ochi T, Okubo S, Fukui K. Development of recycled PET fiber and its application as concrete-reinforcing fiber. Cement and Concrete Composites, 2007, 29(6): 448-455

[39]

Onyelowe KC, Adam AFH, Ulloa N, Garcia C, Andrade Valle AI, Zúñiga Rodríguez MG, Zarate Villacres AN, Shakeri J, Anyaogu L, Alimoradijazi M, Ganasen N. Modeling the influence of bacteria concentration on the mechanical properties of self-healing concrete (SHC) for sustainable bio-concrete structures. Scientific Reports, 2024, 14: 8414

[40]

Patki, N., Wedge, R., & Veeramachaneni, K. (2016). The synthetic data vault. Proceedings - 3rd IEEE International Conference on Data Science and Advanced Analytics, DSAA 2016, 399–410. https://doi.org/10.1109/DSAA.2016.49

[41]

Pessoa CLE, Peres Silva VH, Stefani R. Prediction of the self-healing properties of concrete modified with bacteria and fibers using machine learning. Asian Journal of Civil Engineering, 2024, 25(2): 1801-1810

[42]

Pilania G. Machine learning in materials science: From explainable predictions to autonomous design. Computational Materials Science, 2021, 193 110360

[43]

Rodríguez CR, de Mendonça Filho FF, Mercuri L, Gan Y, Rossi E, Anglani G, Antonaci P, Schlangen E, Šavija B. Chemo-physico-mechanical properties of the interface zone between bacterial PLA self-healing capsules and cement paste. Cement and Concrete Research, 2020, 138(May 106228

[44]

Rong H, Wei G, Ma G, Zhang Y, Zheng X, Zhang L, Xu R. Influence of bacterial concentration on crack self-healing of cement-based materials. Construction and Building Materials, 2020, 244 118372

[45]

de Rooij, M., Tittelboom, K. Van, Belie, N. De, & Schlangen, E. (Eds.). (2013). Self-Healing Phenomena in Cement-Based Materials: State-of-the-Art Report of RILEM Technical Committee 221-SHC: Self-Healing Phenomena in Cement-Based Materials (Vol. 11). Springer. http://www.springer.com/series/8780

[46]

Shields BJ, Stevens J, Li J, Parasram M, Damani F, Alvarado JIM, Janey JM, Adams RP, Doyle AG. Bayesian reaction optimization as a tool for chemical synthesis. Nature, 2021, 590(784489-96

[47]

Shorten C, Khoshgoftaar TM. A survey on image data augmentation for deep learning. Journal of Big Data, 2019, 6: 60

[48]

Su Y, Qian C, Rui Y, Feng J. Exploring the coupled mechanism of fibers and bacteria on self-healing concrete from bacterial extracellular polymeric substances (EPS). Cement and Concrete Composites, 2021, 116 103896

[49]

Suleiman AR, Nehdi ML. Modeling self-healing of concrete using hybrid genetic algorithm-artificial neural network. Materials, 2017, 10(2): 135

[50]

Talaiekhozan A, Keyvanfar A, Shafaghat A, Andalib R, Majid MZA, Fulazzaky MA, Zin RM, Lee CT, Hussin MW, Hamzah N, Marwar NF, Haidar HI. A review of self-healing concrete research development. Journal of Environmental Treatment Techniques, 2014, 2(11-11

[51]

Wendland P, Birkenbihl C, Gomez-Freixa M, Sood M, Kschischo M, Fröhlich H. Generation of realistic synthetic data using multimodal neural ordinary differential equations. Npj Digital Medicine, 2022, 5 122

[52]

Wiktor V, Jonkers HM. Quantification of crack-healing in novel bacteria-based self-healing concrete. Cement and Concrete Composites, 2011, 33(7763-770

[53]

Zheng W, Tropsha A. Novel variable selection quantitative structure–property relationship approach based on the k-nearest-neighbor principle. Journal of Chemical Information and Computer Sciences, 2000, 40(1): 185-194

[54]

Zhou Z-H. Machine Learning, 2021Springer

[55]

Zhuang X, Zhou S. The prediction of self-healing capacity of bacteria-based concrete using machine learning approaches. Computers, Materials & Continua, 2019, 59(1): 57-77

[56]

Ziolkowski P, Niedostatkiewicz M. Machine learning techniques in concrete mix design. Materials, 2019, 12: 1256

RIGHTS & PERMISSIONS

The Author(s)

PDF

41

Accesses

0

Citation

Detail

Sections
Recommended

/