PDF
Abstract
This research examines the application of eight different machine learning (ML) algorithms for predicting the compressive strength of high-performance concrete (HPC). Achieving precise predictions is crucial for enhancing structural reliability and optimizing resource usage in construction projects. The analysis utilized the “Concrete Compressive Strength” dataset, sourced from UC Irvine’s publicly available ML repository. The models evaluated include Gradient Boosting Regressor (GBR), Extreme Gradient Boosting Regression (XGBoost), Random Forest (RF), Support Vector Regression (SVR), Artificial Neural Network (ANN), Multilayer Perceptron (MLP), Lasso, and k-Nearest Neighbors (KNN). To enhance performance, critical data preprocessing steps were undertaken, which involved feature scaling, cleaning, and normalization. Hyperparameter tuning via Grid Search (GS) and K-fold cross-validation further optimized the models. Among those analyzed, XGBoost and GBR achieved the highest predictive accuracy, with R2 values of 93.49% and 92.09% respectively, coupled with lower mean squared error (MSE), mean absolute error (MAE), and root mean squared error (RMSE). SHapley Additive exPlanations (SHAP) analysis revealed cement content and curing age as the most significant factors affecting compressive strength. Validation against experimental data confirmed the reliability of XGBoost and GBR through consistent prediction patterns and close alignment with empirical measurements. The results establish ML as an effective approach for HPC strength prediction, offering advantages in computational efficiency and accuracy over conventional analytical methods.
Keywords
Concrete strength
/
Machine learning
/
Prediction
/
Boosting
/
Regression
/
High performance concrete
/
Hyperparameter tuning
/
Grid search
/
Cross validation
/
SHapley Additive exPlanations
Cite this article
Download citation ▾
Samuel Olaoluwa Abioye, Yusuf Olawale Babatunde, Oluwafikejimi Abigail Abikoye, Aisha Nene Shaibu, Bailey Jonathan Bankole.
Optimized machine learning algorithms with SHAP analysis for predicting compressive strength in high-performance concrete.
AI in Civil Engineering, 2025, 4(1): 16 DOI:10.1007/s43503-025-00061-x
| [1] |
AbdollahiA, LiD, DengJ, AminiA. An explainable artificial-intelligence-aided safety factor prediction of road embankments. Engineering Applications of Artificial Intelligence, 2024, 108, ArticleID: 108854
|
| [2] |
AhmadA, AhmadW, AslamF, JoykladP. Compressive strength prediction of fly ash-based geopolymer concrete via advanced machine learning techniques. Case Studies in Construction Materials, 2022
|
| [3] |
AitchPC. Cements of yesterday and today. Cement and Concrete Research, 2000, 30(9): 1349-1359
|
| [4] |
BreimanL. Random forests. Machine Learning, 2001, 45(1): 5-32
|
| [5] |
BarkhordariM, ArmaghaniD, MohammedA, UlrikhD. Data-driven compressive strength prediction of fly ash concrete using ensemble learner algorithms. Buildings, 2022
|
| [6] |
BühlmannP, van de GeerS. Statistics for high-dimensional data: Method, theory and applications. Springer, 2011
|
| [7] |
ComitoC, PizzutiC. Artificial intelligence for forecasting and diagnosing COVID-19 pandemic: A focused review. Journal of Artificial Intelligence Research, 2020
|
| [8] |
CortesC, VapnikV. Support-vector networks. Machine Learning, 1995, 20(3): 273-297
|
| [9] |
ChouJS, PhamAD. Enhanced artificial intelligence for ensemble approach to predicting high performance concrete compressive strength. Construction & Building Materials, 2013
|
| [10] |
ChanKY, Abu-SalihB, QaddouraR, Al-ZoubiAM, PaladeV. Deep neural networks in the cloud: Review, applications, challenges, and research directions. Neurocomputing, 2023, 504, ArticleID: 126327
|
| [11] |
Chen T., Guestrin C. (2016). XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. https://doi.org/10.1145/2939672.2939785
|
| [12] |
Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 785–794). ACM. https://doi.org/10.1145/2939672.2939785
|
| [13] |
EkanayakeI, MeddageD, RathnayakeU. A novel approach to explain the black-box nature of machine learning in compressive strength predictions of concrete using Shapley additive explanations (SHAP). Case Studies in Construction Materials, 2022
|
| [14] |
ErdalHI. Two-level and hybrid ensembles of decision trees for high performance concrete compressive strength prediction. Engineering Applications of Artificial Intelligence, 2013
|
| [15] |
FixE, HodgesJL. Discriminatory analysis nonparametric discrimination: Consistency properties. International Statistical Review Revue Internationale De Statistique, 1989, 57(3): 238-247
|
| [16] |
FriedmanJH. Greedy function approximation: A gradient boosting machine. The Annals of Statistics, 2001, 29(5): 1189-1232
|
| [17] |
HanQ, GuiC, XuJ, LacidognaG. A generalized method to predict the compressive strength of high-performance concrete by improved random forest algorithm. Construction & Building Materials, 2019
|
| [18] |
HuangX, KroeningD, RuanW, SharpJ, SunY, ThamoE, WuM, YiX. A survey of safety and trustworthiness of deep neural networks: Verification, testing, adversarial attack and defence, and interpretability. Computer Science Review, 2020, 37: 100270
|
| [19] |
IslamMM, DasP, RahmanMM, NazF, KashemA, NishatMH, TabassumN. Prediction of compressive strength of high-performance concrete using optimization machine learning approaches with SHAP analysis. Journal of Building Pathology and Rehabilitation, 2024, 9(2): 94
|
| [20] |
IslamN, NoorH, FaridDM Thai-NgheN, DoTN, HaddawyP. A novel ensemble K-nearest neighbours classifier with attribute bagging. Intelligent Systems and Data Science. ISDS 2023. Communications in Computer and Information Science, 2024 Singapore Springer 1950
|
| [21] |
KaloopMR, KumarD, SamuiP, HuJW, KimD. Compressive strength prediction of high-performance concrete using gradient tree boosting machine. Construction & Building Materials, 2020
|
| [22] |
KamolovS. Comprehensive analysis of machine learning models for predicting concrete compressive strength. Annals of Mathematics and Computer Science, 2024, 23: 119-130
|
| [23] |
KazemiF, ShafighfardT, JankowskiR, et al.. Active learning on stacked machine learning techniques for predicting compressive strength of alkali-activated ultra-high-performance concrete. Archives of Civil and Mechanical Engineering, 2025
|
| [24] |
KelekoAT, Kamsu-FoguemB, NgounaRH, TongneA. Health condition monitoring of a complex hydraulic system using deep neural network and DeepSHAP explainable XAI. Advances in Engineering Software, 2022, 170, ArticleID: 103339
|
| [25] |
LeeS, NguyenN, KaramanliA, LeeJ, VoTP. Super learner machine-learning algorithms for compressive strength prediction of high-performance concrete. Structural Concrete, 2022, 24(2): 2208-2228
|
| [26] |
LeeS, VoTP, ThaiHT, LeeJ, PatelV. Strength prediction of concrete-filled steel tubular columns using categorical gradient boosting algorithm. Engineering Structures/engineering Structures (Online), 2021
|
| [27] |
LeiX, SunL, XiaY, HeT. Vibration-based seismic damage states evaluation for regional concrete beam bridges using random forest method. Sustainability, 2020, 12(23): 1329
|
| [28] |
LiD, TangZ, KangQ, ZhangX, LiY. Machine learning-based method for predicting compressive strength of concrete. Processes, 2023, 11: 390
|
| [29] |
Lundberg S, Lee S-I. (2017). A unified approach to interpreting model predictions. Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS 2017). https://doi.org/10.48550/arXiv.1705.07874
|
| [30] |
McCullochWS, PittsW. A logical calculus of the ideas immanent in nervous activity. The Bulletin of Mathematical Biophysics, 1943, 5(4): 115-133
|
| [31] |
MesutB, BaşkorA, AksuNB Role of artificial intelligence in quality profiling and optimization of drug products, 2023 Amsteradam Elsevier
|
| [32] |
MuhammadUJ, AminuII, MahmoudIA, et al.. An improved prediction of high-performance concrete compressive strength using ensemble models and neural networks. AI in Civil Engineering, 2024
|
| [33] |
NguyenDD, TranVL, HaDH, NguyenVQ, LeeTH. A machine learning-based formulation for predicting shear capacity of squat flanged RC walls. Structures, 2021
|
| [34] |
NguyenH, VuT, VoTP, ThaiHT. Efficient machine learning models for prediction of concrete strengths. Construction & Building Materials, 2021
|
| [35] |
OtchereDA, GanatTOA, OjeroJO, Tackie-OtooBN, TakiMY. Application of gradient boosting regression model for the evaluation of feature selection techniques in improving reservoir characterization predictions. Journal of Petroleum Science and Engineering, 2021, 208, ArticleID: 109244
|
| [36] |
PosmaJM Chapter 9 - multivariate statistical methods for metabolic phenotyping, 2019 Amsterdam Elsevier
|
| [37] |
RathakrishnanV, BedduSB, AhmedAN. Predicting compressive strength of high-performance concrete with high volume ground granulated blast-furnace slag replacement using boosting machine learning algorithms. Scientific Reports, 2022, 12: 9539
|
| [38] |
RumelhartDE, McClellandJL Parallel distributed processing: Explorations in the microstructure of cognition, 1986 Cambridge, MA MIT Press 1 & 2
|
| [39] |
SalmanHA, KalakechA, SteitiA. Random forest algorithm overview. Babylonian Journal of Machine Learning, 2024, 2024: 69-79
|
| [40] |
SatterfieldBC, et al.. Unraveling the genetic underpinnings of sleep deprivation-induced impairments in human cognition. Progress in Brain Research, 2019, 246: 193-220
|
| [41] |
SilvaVP, CarvalhoRA, da RêgoJH. Machine learning-based prediction of the compressive strength of Brazilian concretes: A dual-dataset study. Materials, 2023, 16: 4977
|
| [42] |
TerlumunS, OnyiaME, OkaforFO. Predicting the compressive strength of self-compacting concrete using artificial intelligence techniques: A review. AI in Civil Engineering, 2024
|
| [43] |
TibshiraniR. Regression shrinkage and selection via the Lasso. Journal of the Royal Statistical Society: Series B (Methodological), 1996, 58(1): 267-288
|
| [44] |
WalczakS, CerpaN. Artificial neural networks. Encyclopedia of Physical Science and Technology, 2003
|
| [45] |
Xu, D., Chong, H., Main, I., Mineter, M., De Bold, R., Forde, M., Gair, C., Madden, P., Angus, E., & Ho, C. (2019). Using statistical models and machine learning techniques to process big data from the Forth Road Bridge. International Conference on Smart Infrastructure and Construction 2019 (ICSIC) Driving Data-Informed Decision-Making, ICE Publishing, 411–419. https://doi.org/10.1680/icsic.64669.411
|
| [46] |
XuD, XuX, FordeMC, CaballeroA. Concrete and steel bridge structural health monitoring Insight into choices for machine learning applications. Construction and Building Materials, 2023
|
| [47] |
YangL, ShamiA. Hyperparameter optimization of machine learning algorithms: Theory and practice. Neurocomputing, 2020, 415: 295-316
|
RIGHTS & PERMISSIONS
The Author(s)