A general integrated machine learning pipeline: Its concept, main steps and application in shear strength prediction of RC beams strengthened with FRCM. (15th April 2023)
- Record Type:
- Journal Article
- Title:
- A general integrated machine learning pipeline: Its concept, main steps and application in shear strength prediction of RC beams strengthened with FRCM. (15th April 2023)
- Main Title:
- A general integrated machine learning pipeline: Its concept, main steps and application in shear strength prediction of RC beams strengthened with FRCM
- Authors:
- Chen, Jin-Xin
Zhao, Xin-Yu - Abstract:
- Highlights: A generic working pipeline is proposed to address concerns when using machine learning in civil engineering. The shear capacity of FRCM strengthened RC beams is predicted as a demonstration. The problem of limited training data is alleviated. The efficacy of machine learning models is boosted through transfer learning. Abstract: Data are the fibers out of which the extraordinary capabilities of machine learning (ML) are woven. But despite having gained widespread attention in civil engineering, ML still faces several overarching issues: small and incomplete training datasets, questionable generalization ability, and lack of physical interpretability. In fact, over-fitting is almost inescapable for models trained with limited data. Furthermore, the "black box" effect sharply restricts the use of ML. To deal with those issues, we here put forward a general integrated ML pipeline that, with the help of transfer learning and synthetic data augmentation, remains not only robust to highly limited data, but also enables engineers to carry transparent and interpretable analysis. To showcase the resulting advantages, the shear capacity of reinforced concrete (RC) beams strengthened with fiber-reinforced cementitious matrix (FRCM) is predicted as a demonstration. An experimental dataset containing only 91 tests is supplemented with data from a synthetic data generator, Synthpop. Six baseline ML algorithms are evaluated to seek out the most suitable model for predicting theHighlights: A generic working pipeline is proposed to address concerns when using machine learning in civil engineering. The shear capacity of FRCM strengthened RC beams is predicted as a demonstration. The problem of limited training data is alleviated. The efficacy of machine learning models is boosted through transfer learning. Abstract: Data are the fibers out of which the extraordinary capabilities of machine learning (ML) are woven. But despite having gained widespread attention in civil engineering, ML still faces several overarching issues: small and incomplete training datasets, questionable generalization ability, and lack of physical interpretability. In fact, over-fitting is almost inescapable for models trained with limited data. Furthermore, the "black box" effect sharply restricts the use of ML. To deal with those issues, we here put forward a general integrated ML pipeline that, with the help of transfer learning and synthetic data augmentation, remains not only robust to highly limited data, but also enables engineers to carry transparent and interpretable analysis. To showcase the resulting advantages, the shear capacity of reinforced concrete (RC) beams strengthened with fiber-reinforced cementitious matrix (FRCM) is predicted as a demonstration. An experimental dataset containing only 91 tests is supplemented with data from a synthetic data generator, Synthpop. Six baseline ML algorithms are evaluated to seek out the most suitable model for predicting the shear capacity. It turns out that the GBDT model performs the best among all the algorithms assessed. To further boost its efficacy, a transfer learning algorithm, two-stage TrAdaboost, is modified to enhance the GBDT model which is taken as a base learner via re-weighting the synthetic data points. The resulting emulator is termed TrAGBDT. Thereafter, the SHapley Additive exPlanation (SHAP) approach is utilized which lends itself ideally to deciphering the mechanism of the TrAGBDT. A graphic user interface is also provided along with the pertinent Python-based source codes, thereby allowing users and developers to access the aforesaid emulator without any barrier. The findings of this study indicate that over-fitting is largely balanced out for the TrAGBDT, which greatly improves the extendibility and interpretability of the proposed ML-centric pipeline. Hence it can be a viable solution to promote ML applications in civil engineering practice. … (more)
- Is Part Of:
- Engineering structures. Volume 281(2023)
- Journal:
- Engineering structures
- Issue:
- Volume 281(2023)
- Issue Display:
- Volume 281, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 281
- Issue:
- 2023
- Issue Sort Value:
- 2023-0281-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-04-15
- Subjects:
- Machine learning -- Synthetic data -- Transfer learning -- Two-stage TrAdaboost -- RC beams -- FRCM -- Shear capacity
Structural engineering -- Periodicals
Structural analysis (Engineering) -- Periodicals
Construction, Technique de la -- Périodiques
Génie parasismique -- Périodiques
Pression du vent -- Périodiques
Earthquake engineering
Structural engineering
Wind-pressure
Periodicals
624.105 - Journal URLs:
- http://www.sciencedirect.com/science/journal/01410296 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.engstruct.2023.115749 ↗
- Languages:
- English
- ISSNs:
- 0141-0296
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3770.032000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 26151.xml