Prediction of Concrete Compressive Strength Using Support Vector Machine Regression: Statistical Characterization, Model Performance, and Feature Importance Analysis
DOI:
https://doi.org/10.63278/mme.vi.1919Keywords:
Concrete compressive strength; Support Vector Machine; Machine learning; Statistical analysis; Feature importance; Regression modelingAbstract
This study presents the development and evaluation of a machine learning–based framework for predicting the compressive strength of concrete using a Support Vector Machine (SVM) regression model. A comprehensive dataset comprising 1,133 concrete mix designs was employed, incorporating key material composition parameters, including cement, blast-furnace slag, fly ash, water, super-plasticizer, fine aggregate, coarse aggregate, and curing age. Prior to model development, extensive descriptive and advanced statistical analyses were conducted to examine the distributional characteristics, variability, skewness, and robustness of the input variables, ensuring a sound understanding of the data structure. The analysis revealed substantial variability in cementitious materials and curing age, highlighting the nonlinear and heterogeneous nature of concrete strength development. An ε-SVM regression model with a radial basis function kernel was implemented to capture these complex relationships. Model performance was assessed using an 80:20 train–test split and multiple statistical metrics, including mean squared error, root mean squared error, mean absolute error, mean absolute percentage error, coefficient of determination, and coefficient of variation of RMSE. The results demonstrate that the SVM model achieved strong predictive accuracy, with an R² value of 0.89, RMSE of 5.32 MPa, and MAPE of 11.38%, indicating reliable generalization to unseen data. Error analysis confirmed stable prediction behavior for most samples, with only a limited number of isolated outliers. Feature importance evaluation using univariate regression and a Relief-based algorithm identified cement content and curing age as the most influential parameters, followed by super-plasticizer and water content, in agreement with established concrete technology principles. Overall, the study confirms the suitability of SVM regression for concrete compressive strength prediction while emphasizing the importance of thorough data characterization, multi-metric evaluation, and feature interpretability for robust and physically consistent machine learning applications in civil engineering.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Dr. M. Adil Khan, Aalia Faiz, Abdul Wahab, Hayat Ullah, Shahzad Ahmed, Akram Ullah Khan, Wasim Khan

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their published articles online (e.g., in institutional repositories or on their website, social networks like ResearchGate or Academia), as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).

Except where otherwise noted, the content on this site is licensed under a Creative Commons Attribution 4.0 International License.



According to the