1 Introduction
The usage of fiber-reinforced polymer (FRP) composites for confining concrete structure has revolutionized structural strengthening and seismic retrofitting. FRP-confined concrete columns display substantial improvements in strength, ductility, and energy dissipation capacity, making FRP confinement particularly effective in improving the performance of deficient or deteriorated structural members [
1–
9]. An essential mechanism behind these improvements is the hoop strain developed in the FRP wrap due to the lateral expansion of concrete. The effective hoop strain is an important parameter in evaluating the degree of confinement and the effectiveness of the FRP system [
10]. Unlike conventional reinforcement, FRP materials such as carbon (CFRP), glass (GFRP), and aramid (AFRP) possess lightweight, corrosion-resistant, and high-strength alternatives with tailorable properties. These merits have placed FRP as a preferred material in both retrofit and new construction. The confinement procedure works by restricting the lateral dilation of concrete, thereby introducing tri-axial compressive stress states in the core that enhance the axial performance of the structural systems [
8,
11–
13]. The hoop strain developed in the FRP wrap plays a crucial role in establishing the confinement pressure and in turn the ultimate strength and strain capacity of FRP-confined concrete. Hence, accurately calculating the hoop strain is vital to both design safety and material efficiency.
Over the past few decades, numerous analytical and semi-empirical models have been proposed to calculate the hoop strain in FRP-confined concrete. These models are normally grounded in mechanics-based relations, which combine force equilibrium, compatibility conditions, and idealized material behavior. Early works, such as Mander et al. [
14], laid the basis for understanding confined concrete behavior that was later adapted to FRP materials. Later studies such as [
10,
15–
19] introduced the concept of an efficiency factor, connecting the effective hoop strain to the ultimate FRP strain. Several models have since evolved to calculate this reduction factor based on parameters such as FRP stiffness and concrete strength [
20–
22]. Such models often treat fiber type explicitly, and allocate separate reduction factors or empirical coefficients for CFRP, GFRP, and AFRP. Meanwhile, from a purely mechanistic viewpoint, the fiber type alone should not govern confinement efficacy—rather, the mechanical interaction among FRP stiffness, thickness, rupture strain, and bond should dictates the performance. Despite the theoretical value, most of the existing analytical models have been calibrated on limited data sets, often derived from idealized or small-scale experiments. Hence, these models are constrained by simplifying assumptions such as perfect bonding, uniform confinement, and axisymmetric condition. Furthermore, many models fail to consider interdependencies between parameters or capture the nonlinear and coupled behavior observed in real-world specimens. Thereby, these models may suffer from limited generalizability, especially when applied to new materials, hybrid composites, or non-standard geometries.
In light of these limitations, machine learning (ML) has emerged as a powerful option for modeling complex engineering phenomena. ML models excel at revealing nonlinear, multivariate relationships from large data sets without needing rigid assumptions [
23–
34]. In the context of FRP-confined concrete, ML has been effectively applied to predict properties such as peak stress, ultimate strain, and ductility ratio. However, limited work has explicitly addressed the prediction of hoop strain, despite its central role in confinement mechanics and design of FRP-confined concrete. Accurately modeling of hoop strain through ML would enable designers to move beyond the use of conservative efficiency factors and instead use data-driven assessments tailored to specific materials, geometries, and loading scenarios. Such models can integrate multiple influencing factors simultaneously, such as diameter, height, concrete strength, FRP modulus, thickness, rupture strain, fiber type, and number of layers, capturing interactions that are infeasible to address analytically. Besides, ML models can be trained and validated on comprehensive experimental databases assembled from various sources, increasing robustness and generalization ability of ML-based models. When optimized using methods such as cross
-validation (CV) and hyperparameter tuning, these models can outperform conventional techniques in accuracy. On the other hand, model explanation techniques like SHapley Additive exPlanations (SHAP), provide interpretability and enable researchers and practitioners to understand which predictors most strongly influence hoop strain.
So, this study aims to bridge the gap between empirical, mechanistic, and ML-based approaches by developing an optimized ML model trained on a curated experimental database of 309 FRP-confined circular columns, covering CFRP, GFRP, and AFRP specimens. Eight ML algorithms, including both ensemble methods and non-ensemble learners, are tuned and evaluated, with the best-performing model deployed into a user-friendly prediction interface. This framework not only improves the reliability of the predictions of hoop strain but also advances the integration of explainable ML into structural design practice.
2 Existing empirical models
The effective hoop strain of FRP-confined concrete (
) is usually lower than the ultimate tensile strain obtained from uniaxial coupon tests or manufacturer provided ultimate tensile strain (
). This reduction is ascribed to various practical factors like premature rupture, stress concentrations, imperfect FRP application, as well as non-uniform strain distribution around the specimen perimeter [
10]. To account for this differences, numerous models, empirical and semi-empirical, have been proposed to relate
to
using either reduction factors or strain efficiency coefficients, as summarized in Table 1. While some models appear to offer separate recommendations for different FRP types (e.g., CFRP, GFRP, AFRP), this categorization naturally stems from empirical calibration rather than fundamental differences in confinement mechanics. Theoretically, the confinement effect is governed by parameters such as FRP stiffness, thickness, and configuration, rather than FRP type alone. Thus, care must be taken to interpret the FRP “type-based” models as proxies for material properties rather than a suggestion that type alone dictates performance. Remarkably, most existing models calculate
as a fixed fraction of
, typically using the manufacturer-reported ultimate tensile strain [
10,
15–
19]. Only a few studies [
20–
22] have proposed more adaptable approaches that estimate
based on mechanical parameters, e.g., compressive strength (
) and elastic modulus (
), rather than depend solely on
. Therefore, to ensure robustness and generalizability of the existing models, this study applies the empirical models listed in Table 1 to the assembled experimental database and subsequently assesses their predictive performance across a diverse set of samples.
3 Data aggregation
The initial stage of this study involved the gathering and fine-tuning of an experimental database focused on
. The preliminary data set consist of 573 experimental specimens, 448 CFRP, 94 GFRP, and 31 AFRP samples, initially assembled by Lobo et al. [
22] from various published sources [
17,
19,
35–
72]. However, many of the sourced studies included incomplete or partially reported experimental data, requiring rigorous data cleaning and validation procedures. After removing incomplete entries, a finalized data set of 309 specimens—228 CFRP, 59 GFRP, and 22 AFRP samples with complete hoop strain data was established for model training and validation. For each specimen, relevant geometric and mechanical parameters were extracted from the literature, including concrete properties, diameter (
D), height (
H), compressive strength (
), and corresponding strain (
, as well as FRP properties, elastic modulus (
), ultimate tensile strain (
), and thickness (
). The selected input parameters are grounded in the mechanics of FRP-confinement behavior in concrete. Typically,
is a direct result of the lateral expansion of concrete under axial load restrained by the externally bonded FRP jacket. The geometric parameters influence the lateral confinement efficiency and stress distribution within the concrete core,
and
characterize the stress–strain response of the unconfined concrete, which in turn governs the commencement and magnitude of radial expansion [
73]. Meanwhile, the mechanical properties of FRP confinement govern the stiffness, deformability, and confining pressure exerted by the FRP wrap. Thus, these input variables collectively govern the interaction between the expanding concrete and the resisting FRP, ultimately affecting
.
Figure 1 shows the numerical distribution of and the predictors while Fig. 2 presents a heatmap of descriptive statistics for the variables, including the mean, standard deviation, 25th percentile, median, and 75th percentile values. The heatmap elucidates a broad range of hoop strains, reflecting varied confinement conditions. The wide variability in geometric properties reflect a spectrum from small- to large-scale columns with varying concrete strengths. Meanwhile the spread in FRP parameters indicates a range of confinement capacities and strengthening levels. This diversity ensures that the data set comprehensively represents the influence of both concrete and FRP properties on hoop strain behavior.
Besides, Fig. 3 shows a pair plot matrix indicating the distribution and interrelationships among the variables. The 9 × 9 correlation matrix, filtered by fiber type, further confirms the heterogeneity of the data set. Collectively, the breadth and balance of the parameters provide a strong foundation for developing a robust and generalizable ML-based predictive model for .
4 Research methodology
The overall research methodology is outlined in the flowchart in Fig. 4, with detailed steps discussed in the following subsections.
4.1 Framework of machine learning model
The proposed ML framework aims to deliver precise, robust, and generalizable predictions of
by leveraging a different single and ensemble learners. The model training procedure initializes with the setup of eight distinct algorithms: four single learners [K
-Nearest Neighbors (KNN), Kernel Ridge Regression (KRR), Support Vector Regression (SVR), and Decision Tree (DT)] as well as four ensemble techniques [AdaBoost, Gradient Boosting Machine (GBM), Extreme Gradient Boosting (XGBoost), and Random Forest (RF)]. To ensure a comprehensive evaluation of regression performance, these ML algorithms were selected based on their diverse learning paradigms and established effectiveness in modeling nonlinear relationships in engineering data sets. KNN is a non-parametric, instance-based method that forecasts target values based on the average of
k nearest samples in the feature space. The algorithm is effective for capturing localized patterns without needing a predefined functional form [
74]. KRR integrates the regularization properties of ridge regression with the nonlinear mapping abilities of kernel functions, providing good generalization, especially in high-dimensional or collinear data scenarios [
75]. Meanwhile, SVR, grounded in the theory of support vector machines, uses kernel transformations and an ε-insensitive loss function, making it appropriate for high-dimensional, nonlinear problems with strong resistance to overfitting [
76]. DT is a tree-based learner that partitions the input space using hierarchical rules, offering high interpretability and the capability to model both linear and nonlinear interactions. However, the learner tendency to overfit motivates its use as a base learner in ensemble methods. AdaBoost enhances the predictive accuracy by adaptively reweighting misclassified instances across sequential weak learners, typically shallow DT [
77]. GBM extends this concept by using gradient descent to reduce a loss function, allowing greater flexibility in controlling the bias-variance trade-off. XGBoost further optimizes GBM by incorporating regularization, parallel computation, and advanced tree pruning strategies, leading to enhanced computational efficiency and predictive performance [
78]. RF, another ensemble of decorrelated DT built via bootstrap aggregation, reduces model variance and improves robustness against noise and overfitting [
79]. The inclusion of both single and ensemble models allows for a balanced comparison across instance-based, kernel-based, tree-based, and ensemble-based approaches, allowing the identification of the most suitable predictive strategy for the targeted output. The configurations of the ML framework (Optuna-tuned learner selection with SHAP-based interpretability) are summarized below.
1) Input predictors X and target variable y. Apply log-transform of .
2) non-missing rows only.
3) retain only rows with .
4) .
5) train_test_split.
6) MinMaxScaler().fit_transform.
7) For each models, KNN, Kernel ridge, SVR, DT, Adaboost, GBM, XGBoost, RF. Define objective function for Optuna using 10-fold CV. Optimize hyperparameters using 50 trials to minimize mean MAE. Identify and record the best hyperparameters and lowest overall MAE.
8) For each tuned model , train on full training set. Predict on both training and tests sets using . Calculate MAE, RMSE, MAPE as well as R2 for training, testing, and all data.
9) Identify model with lowest overall MAE on combined data.
10) Return the trained model , optimal parameters , and performance metrics.
11) Reconstruct predictor names from polynomial transformation: predictor names columns).
12) If is tree-based (e.g., RF, XGBoost), initialize ; else, use .
13) Compute .
14) Generate global explanation (SHAP summary bar plot) and local explanation (force plot).
15) Output: Best-performing model, optimal parameters, and SHAP explanations.
The parameters of each model are optimized using a comprehensive 10-fold CV approach in combination with Optuna-based Bayesian optimization, allowing systematic exploration of hyperparameter spaces. This optimization process simultaneously determines the optimal algorithm type, model configuration, as well as the hyperparameters that minimize prediction errors. In this study, 10-fold CV was applied using random shuffling without stratification. The architecture of the models adopts an adaptive and iterative mechanism, enabling dynamic selection and fine-tuning of model components. This methodology effectively reduces overfitting and captures complex nonlinear interactions between predictors, permitting strong generalization across diverse regression tasks. The model includes interpretability via both global and local SHAP analysis, providing insight into feature contributions and instance-specific predictions. Meanwhile, performance metrics, the coefficient of determination (R2), mean absolute error (MAE), root mean squared error (RMSE), and mean absolute percentage error (MAPE), were calculated for each model. The final model was selected based on the criterion of lowest MAE on the combined data set.
4.2 Deployment of machine learning-based predictive model
To ensure practical user-friendliness and encourage the adoption of the proposed ML model for engineering applications, a graphical user interface (GUI) was developed using PySimpleGUI in Python named Hoop Strain of FRP-confined concrete Predictor. The GUI serves as a lightweight desktop tool allowing users to input design parameters and receive instant predictions of in FRP-confined circular concrete columns. The GUI integrates the complete trained ML model including, , , MinMaxScaler, and . The input interface is categorized into two main sections—concrete Geometric and parameter and FRP parameters. The categorical predictors (e.g., fiber type) are internally mapped to numerical codes before model inference For instance, fiber types are mapped as: CFRP = 1, GFRP = 2, AFRP = 3. Besides, the GUI presents value ranges obtained from the training data to guide users in order to prevent out-of-bound inputs. Accordingly, the trained predicts the using . The final predicted value is displayed on the GUI in percentage format. Also, built-in error handling is embedded to flag missing or invalid entries, and a “Clear” button resets all input fields. A “Cancel” button exits the program.
5 Results and discussion
5.1 Predictive performance of existing empirical models
The scatter plots showing the predicted
values in FRP-confined concrete using widely adopted empirical models (summarized in Table 1) are shown in Fig. 5. A close analysis of the plots indicates a significant disparity between the predicted and experimentally observed values. Across all models, substantial deviations from the ideal 45° line are evident, signifying poor predictive precision and inconsistency. Even those models explicitly developed to address a broad spectrum of fiber types, such as those by Refs. [
18,
20], exhibiting a tendency to either overestimate or underestimate the
, underscoring the limited ability to generalize the models across different confinement circumstances and material properties. Therefore, the inability of the empirical models to capture the complex nonlinear relationship inherent in
in FRP-confined concrete undermines the reliability of the models in both design and assessment contexts.
In addition to the visual analysis, the performance of the models was quantitatively assessed using multiple error metrics. As shown in the accompanying heatmap (Fig. 5), the empirical models consistently produced high error values. The MAE ranged from 0.35% to 0.52%, while RMSE spanned 0.48% to 0.95%, demonstrating the presence of considerable residual variance in the predictions. Most strikingly, the MAPE values were exceedingly high, ranging from 40% to 73% that further reflects the inconsistency and lack of robustness of the empirical formulations when applied to diverse data sets. These findings highlight a critical limitation of conventional empirical equations—the reliance on simplified assumptions and linear regressions limits the ability of the models to capture the multi-dimensional and nonlinear behavior of . The formulation of these empirical models often depends on small, fiber-specific experimental data sets and is highly sensitive to changes in material parameters, such as , , and . Given these limitations, there is a compelling necessity to explore alternative modeling strategies capable of capturing the complicated behavior of FRP-confined concrete systems. In this context, ML offers a promising data-driven approach. Unlike empirical models, ML algorithms can learn complex relationships from large and diverse data sets without presuming a predefined functional form, making the ML-based model better suited for generalizing across a wide range of material properties and confinement configurations. Consequently, the next phase of this study investigates the development and validation of robust ML-based models to overcome the shortcomings of conventional empirical approaches.
5.2 Hyperparameters optimization for the models
To ensure optimal predictive performance, the ML models were fine-tuned using Optuna with 10-fold CV. The best hyperparameters obtained for the models are summarized in Table 2.
These hyperparameters were selected to minimize validation error while enhancing the generalization capability of each model. For nonlinear models such as SVR and KRR, the chosen settings effectively capture intricate patterns in the data while ensuring robustness through appropriate regularization. In ensemble-based methods like AdaBoost and XGBoost, the configured parameters facilitate the iterative correction of residual errors by weak learners, thereby improving accuracy without overfitting. Overall, the selected configurations achieve a balanced trade-off between model complexity, training efficiency, and predictive performance.
5.3 Predictive performance of machine learning-based models
Figure 6 presents scatter plots comparing the observed and predicted values obtained from the eight ML models. Each plot includes a 45° dashed reference line, signifying the ideal agreement between predicted and observed values. A close alignment of data points along this line represents strong model performance. Also, R2 is reported for the training, testing, and entire data sets, presenting a quantitative assessment of each model’s predictive precision and generalization capacity. Among the non-ensemble models, KNN and the DT models demonstrated the most favorable results. Explicitly, KNN achieved R2 values of 0.981 and 0.880 on the training and testing data sets, respectively, while the DT model achieved R2 values of 0.922 and 0.862. The predicted values for these models were compactly clustered around the 45° fit line, reflecting high predictive accuracy and minimal overfitting. Overall, the R2 values for all single models on the testing data set ranged from 0.862 to 0.891—which, while slightly lower than the training performance, still indicated strong generalization to unseen data. In contrast, the ensemble models, such as GBM, XGBoost, and RF, exhibited superior performance, with R2 values consistently above 90% on both training and testing data sets. The exception was AdaBoost that slightly underperformed relative to other ensemble methods. These findings show the strength of ensemble learning in capturing the nonlinear and interactive effects among the predictors influencing .
A more comprehensive evaluation of model performance was performed using error metrics, as visualized in the heatmap in Fig. 7. For the training data set, the majority of models achieved MAE values below 0.08%, with the exception of KRR, SVR, DT, and AdaBoost that recorded slightly higher MAEs above 0.10%. Similarly, RMSE and MAPE values were below 0.14% and 12.32%, respectively, for the top-performing ML models. For the testing data set, error values were slightly elevated, with MAE ranging from 0.13% to 0.18%, RMSE between 0.18% and 0.24%, and MAPE spanning 15.38% to 21.35%. Across the entire data set, the best models maintained consistently low error metrics, confirming the robustness and generalizability of the models.
Collectively, the results confirm that both single and ensemble ML models substantially outperform the conventional empirical models (Fig. 5), mostly in the ability of the models to capture complex, nonlinear relationships inherent in prediction. The superior performance of ML models shows the value as reliable, data-driven tools in structural modeling. Consequently, the model yielding the lowest MAE across the full data set was selected for further interpretation and deployment, comprising feature importance and explainability analysis.
On the other hand, a comparison between observed and model-predicted values for data set used for testing and training the XGBoost model is presented in Fig. 8. The close alignment between the plots indicates a strong predictive capability of a typical developed ML-based model. The residuals of the observed and model-predicted values are color-coded to distinguish under-predictions from over-predictions. Generally, the residuals are concerted moderately around zero with no significant clustering, indicating that the ML-based model does not show systematic bias across the prediction range. Thus, this further validates the efficiency of ML-based models in apprehending the nonlinear relationships between the and its underlying predictors.
5.4 Model interpretation
To enhance the interpretability of the developed ML-based model, SHAP were adopted to analyze predictor contributions at both the global and local levels. SHAP, rooted in cooperative game theory, assigns an importance value, known as a SHAP value, to each predictor, indicative of the contribution of such predictor to a given prediction [
80]. This framework allows for a robust, model-agnostic interpretation and transparency of complex ML algorithms by fairly allocating the prediction difference (between the baseline and actual prediction) among the input predictors. In this study, SHAP values were computed for all samples in the data set, offering both an aggregate understanding and individualized insight into the decision-making process of model. The global interpretability was examined using a SHAP bar chart, as illustrated in Fig. 9. The bar chart ranks predictors based on the mean absolute SHAP value, thus quantifying the overall influence of the predictors on model predictions across the data set. The SHAP summary plot offers a more nuanced view by integrating a color gradient that represents the original predictor value ranging from low (dark blue) to high (bright yellow). The results of the SHAP analysis divulged that FRP-related parameters dominate the prediction of
. Specially,
, fiber type, and
appeared as the top three contributors, showing the central role of the confining material in governing strain behavior. Subsequently, critical concrete properties, including
,
D, and
, were identified as the next most influential predictors. These findings align with existing structural mechanics theory, which proposes that both confinement material properties and the mechanical response of the core concrete substantially influence confinement effectiveness [
6,
14]. Conversely, predictors such as the number of FRP layers,
, and
H were found to have fairly minor impacts on the model’s predictions, suggesting limited sensitivity to these predictors within the data set.
The local interpretability was further examined using the information presented in Table 3 and the result is illustrated in Fig. 10. Scenarios 1 and 2 correspond to GFRP confinement systems with one and two layers, respectively, and vary in . The local SHAP waterfall plots offer a transparent breakdown of the model’s reasoning, clearly differentiating the positive and negative contributions of each predictors to the predicted . In Scenario 1, all predictors, except , , and H, contribute positively to increasing the predicted . In contrast, in Scenario 2, all predictors, except for the number of layers and , positively affect the predicted . The absolute residual errors are 0.042 and 0.108 for Scenarios 1 and Scenario 2, respectively. This level of insight is mainly valuable for structural engineers seeking to understand not only what the model predicts but also why the model predicts a certain value for a given case. Generally, the SHAP analysis confirms that the developed ML-based predictive model not only achieves high precision but also maintains interpretability, thus making it a practical tool for both research and design applications involving FRP-confined concrete systems.
5.5 Deployment of machine learning-based predictive model
To support practical implementation and promote the use of ML in structural engineering applications, a user-accessible software tool, named Hoop Strain of FRP-Confined Concrete Predictor, was established. This GUI-based tool is powered by the developed and optimized best-performing ML model identified through broad model evaluation and error analysis. The tool enables users to efficiently predict the in FRP-confined concrete systems by simply entering relevant concrete geometric and parameters as well as FRP parameters, without the necessity for coding expertise or access to the original data set or model code. A visual representation of the tool interface is presented in Fig. 11. Once the parameters are inserted, the trained ML model embedded within the tool instantly returns the predicted . This streamlined functionality bridges the gap between complex ML methods and the real-world application of models in structural design, particularly in the field of FRP confinement and retrofit systems.
To verify the accuracy of the prediction tool, a test case was performed using experimental data reported by Lobo et al. [
22]. The model predicted
of 0.91%, which closely aligned with the experimentally recorded value of 0.96%, indicating the practical reliability of the prediction tool. This case study serves to validate the predictive abilities of the prediction tool and reinforces its potential as a reliable aid for structural engineers. Typically, the prediction tool is intended to complement, not replace, conventional design checks and expert evaluation. Overall, the inclusion of this GUI-based tool enhances the transparency, usability, and applicability of the developed ML model. The tool empowers practitioners to harness the predictive power of advanced data-driven techniques without requiring in-depth programming knowledge, thereby supporting more informed decision-making in the design and assessment of FRP-strengthened concrete structures.
6 Limitations and future work
The predictive performance of data-driven models is fundamentally constrained by the quality, comprehensiveness, and variety of the training data set. Although the final data set used in this study included 309 experimentally validated specimens across CFRP, GFRP, and AFRP types, and certain parameters, such as failure modes, environmental exposure effects, etc., were unavailable or sparsely reported in the literature. Thus, the exclusion of such parameters may limit the ML-based model’s generalizability in real-world applications. On the other hand, the developed model assumes uniform circular confinement and ignores effects related to load eccentricity, FRP wrapping procedure (e.g., continuous vs. discontinuous wrapping), and non-circular geometries. These features may influence development and failure mechanisms in approaches not apprehended by the current input predictors. While the current study focuses on point predictions, future work should integrate uncertainty quantification (e.g., prediction intervals, conformal prediction) to further improve the model’s utility in risk-sensitive applications. Besides, the GUI tool was established as a static input form, suitable for single-scenario assessments. Meanwhile, batch prediction, model updating and/or retraining, and graphical visualization of confinement effects (e.g., interaction diagrams) are not supported in the current version of the tool.
Thus, future studies should aim to integrate additional experimental data, specifically for underrepresented FRP types and non-circular sections. Synthetic data generation methods such as CTGAN or physics-informed generative models could also be adopted to augment the data set while maintaining physical consistency of in FRP-confined concrete. The ML-based model could be extended to predict in rectangular or eccentrically loaded columns, where stress distributions are non-uniform. Besides, integrating Bayesian ML or quantile regression may permit for confidence interval predictions, providing users with uncertainty bounds around estimates, critical for safety-driven design decisions. A future version of the GUI could be deployed as a web-based application with real-time input validation, batch-processing abilities, and interactive visualization features (e.g., SHAP plots). This would improve user-friendliness for engineers and researchers worldwide.
7 Conclusions
This study proposed an innovative, interpretable ML framework consisting of both ensemble and non-ensemble models for the precise prediction of effective hoop strain in FRP-confined concrete. Using a curated database of experimental test results, eight ML algorithms were developed and optimized using the Optuna hyperparameter tuning framework. The performance of the models were thoroughly assessed using standard regression metrics on both training and testing data sets, and further interpreted using SHAP to enable transparency and explainability. Generally, improved predictions allow more accurate estimation of confinement efficacy that in turn supports optimized FRP layout and material usage. Thus reducing the risk of under- or over-design, improves structural reliability, and contributes to more cost-efficient strengthening strategies in practice. The key findings and contributions of the study are summarized as follows.
1) The conventional empirical models demonstrated poor predictive performance for hoop strain in FRP-confined concrete, even those explicitly established for a wide range of fiber types. These models often either overestimate or underestimate the hoop strain, reflecting limited generalizability and accuracy.
2) The top-performing ML models achieved outstanding results across both training and testing data sets. Specifically, the MAE and RMSE remained below 0.14%, and the MAPE was consistently under 15%, while R2 values exceeded 0.90, demonstrating strong predictive accuracy and good generalization on unseen data.
3) Unlike conservative black-box ML methods, the developed ML framework maintains high interpretability. Accordingly, FRP-related parameters, such as elastic modulus, fiber type, and tensile strain, were identified as the most influential predictors governing hoop strain in FRP-confined concrete, thereby aligning with fundamental structural mechanics principles.
4) To promote practical adoption, the best-performing ML model was incorporated into a standalone, user-friendly GUI tagged Hoop Strain of FRP-Confined Concrete Predictor. This tool allows engineers to enter relevant design parameters and instantly obtain hoop strain predictions, providing valuable support for FRP design, assessment, and quality control tasks.
5) Despite the promising capabilities of the ML framework, the study acknowledges the following limitations: the current observed experimental data set may not completely encompass rare or unconventional design scenarios, possibly constraining the model’s extrapolative ability; the GUI currently operates on a static model, without integrating real-time retraining or updates based on new data inputs.
6) Future enhancements should target: expanding the training data set using advanced synthetic data generation methods such as CTGAN; including uncertainty quantification to provide prediction confidence intervals; upgrading the GUI to support dynamic model updates, real-time analytics, and interactive visualizations for deeper user engagement.
In conclusion, this study presents a robust, explainable, and practical ML-based prediction framework, realized through the Hoop Strain in FRP-Confined Concrete Predictor, that enables accurate assessment of hoop strain in FRP-confined concrete. This tool offers substantial value for both researchers and practicing structural engineers, nurturing more informed and data-driven decision-making in FRP retrofit and confinement design.