Ensemble data mining modeling in corrosion of concrete sewer: A comparative study of network-based (MLPNN & RBFNN) and tree-based (RF, CHAID, & CART) models. (January 2020)
- Record Type:
- Journal Article
- Title:
- Ensemble data mining modeling in corrosion of concrete sewer: A comparative study of network-based (MLPNN & RBFNN) and tree-based (RF, CHAID, & CART) models. (January 2020)
- Main Title:
- Ensemble data mining modeling in corrosion of concrete sewer: A comparative study of network-based (MLPNN & RBFNN) and tree-based (RF, CHAID, & CART) models
- Authors:
- Zounemat-Kermani, Mohammad
Stephan, Dietmar
Barjenbruch, Matthias
Hinkelmann, Reinhard - Abstract:
- Highlights: Three ensemble learners were evaluated in predicting concrete corrosion in sewers. Five soft computing base learners were employed. Network-based and tree-based machine learning methods were assessed. The random forests (RF) model performed better than the other models. Ensemble learners acted better in alleviating the problem of over/under-simulating. Abstract: This research aims to evaluate ensemble learning (bagging, boosting, and modified bagging) potential in predicting microbially induced concrete corrosion in sewer systems from the data mining (DM) perspective. Particular focus is laid on ensemble techniques for network-based DM methods, including multi-layer perceptron neural network (MLPNN) and radial basis function neural network (RBFNN) as well as tree-based DM methods, such as chi-square automatic interaction detector (CHAID), classification and regression tree (CART), and random forests (RF). Hence, an interdisciplinary approach is presented by combining findings from material sciences and hydrochemistry as well as data mining analyses to predict concrete corrosion. The effective factors on concrete corrosion such as time, gas temperature, gas-phase H2 S concentration, relative humidity, pH, and exposure phase are considered as the models' inputs. All 433 datasets are randomly selected to construct an individual model and twenty component models of boosting, bagging, and modified bagging based on training, validating, and testing for each DM baseHighlights: Three ensemble learners were evaluated in predicting concrete corrosion in sewers. Five soft computing base learners were employed. Network-based and tree-based machine learning methods were assessed. The random forests (RF) model performed better than the other models. Ensemble learners acted better in alleviating the problem of over/under-simulating. Abstract: This research aims to evaluate ensemble learning (bagging, boosting, and modified bagging) potential in predicting microbially induced concrete corrosion in sewer systems from the data mining (DM) perspective. Particular focus is laid on ensemble techniques for network-based DM methods, including multi-layer perceptron neural network (MLPNN) and radial basis function neural network (RBFNN) as well as tree-based DM methods, such as chi-square automatic interaction detector (CHAID), classification and regression tree (CART), and random forests (RF). Hence, an interdisciplinary approach is presented by combining findings from material sciences and hydrochemistry as well as data mining analyses to predict concrete corrosion. The effective factors on concrete corrosion such as time, gas temperature, gas-phase H2 S concentration, relative humidity, pH, and exposure phase are considered as the models' inputs. All 433 datasets are randomly selected to construct an individual model and twenty component models of boosting, bagging, and modified bagging based on training, validating, and testing for each DM base learners. Considering some model performance indices, (e.g., Root mean square error, RMSE ; mean absolute percentage error, MAPE ; correlation coefficient, r ) the best ensemble predictive models are selected. The results obtained indicate that the prediction ability of the random forests DM model is superior to the other ensemble learners, followed by the ensemble Bag-CHAID method. On average, the ensemble tree-based models acted better than the ensemble network-based models; nevertheless, it was also found that taking the advantages of ensemble learning would enhance the general performance of individual DM models by more than 10%. … (more)
- Is Part Of:
- Advanced engineering informatics. Volume 43(2020)
- Journal:
- Advanced engineering informatics
- Issue:
- Volume 43(2020)
- Issue Display:
- Volume 43, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 43
- Issue:
- 2020
- Issue Sort Value:
- 2020-0043-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-01
- Subjects:
- Concrete corrosion -- Machine learning -- Soft computing -- Sewer systems -- Artificial intelligence
Computer-aided engineering -- Periodicals
Engineering -- Data processing -- Periodicals
620.00285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/14740346 ↗
http://books.google.com/books?id=KhFVAAAAMAAJ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.aei.2019.101030 ↗
- Languages:
- English
- ISSNs:
- 1474-0346
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 0696.851100
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 12953.xml