Technical Papers
Mar 31, 2021

Gradient Boosting Coupled with Oversampling Model for Prediction of Concrete Pipe-Joint Infiltration Using Designwise Data Set

Publication: Journal of Pipeline Systems Engineering and Practice
Volume 12, Issue 3

Abstract

Infiltration of groundwater through reinforced concrete pipe (RCP) joints under hydrostatic pressure has been a major costly challenge in municipal sewer network systems. Analysis of an exclusive designwise infiltration test data of RCP joints showed that conventional regression analysis failed to produce reliable predictions. Accordingly, tree-based machine-learning techniques including random forest, extra trees, and gradient boosting classifiers have been deployed in this study to create reliable models. A large designwise data set identifying failure of RCP joints and the effect of key design parameters was collected using a novel experimental program. Due to the resulting unbalanced experimental data set, oversampling techniques including synthetic minority over-sampling technique (SMOTE) and density based synthetic minority over-sampling technique (DBSMOTE) were employed to enhance predictive performance. Gradient boosting coupled with DBSMOTE offered a robust machine-learning model for predicting RCP joint hydrostatic infiltration. The hybrid gradient boosting classification (GBC)-DBSMOTE model achieved superior predictive accuracy in terms of several classification indicators, with promising capability to create RCP joint hydrostatic infiltration performance charts that capture the effects of key design parameters, such as pressure duration and level, pipe size, and gasket sealing. The robust predictive model could produce design charts that aid municipalities in proactively averting sewage system infiltration problems at low cost, instead of the prevailing reactive approach to this problem.

Get full access to this article

View all available purchase options and get full access to this article.

Data Availability Statement

All data, models, and code generated or used during the study appear in the published article.

Acknowledgments

This research was funded by the Natural Science and Engineering Research Council of Canada (NSERC) and Con Cast Pipe through a Collaborative Research and Development Grant No. 533877-2018 granted to Moncef L. Nehdi. The authors also acknowledge Con Cast Pipe for manufacturing the tested full-scale reinforced concrete pipes and for providing their apparatus for TEBT testing.

References

Ahmad, M. W., J. Reynolds, and Y. Rezgui. 2018. “Predictive modelling for solar thermal energy systems: A comparison of support vector regression, random forest, extra trees and regression tree.” J. Cleaner Prod. 203 (Dec): 810–821. https://doi.org/10.1016/j.jclepro.2018.08.207.
Azar, A. T., H. I. Elshazly, A. E. Hassanien, and A. M. Elkorany. 2014. “A random forest classifier for lymph diseases.” Comput. Methods Programs Biomed. 113 (2): 465–473. https://doi.org/10.1016/j.cmpb.2013.11.004.
Breiman, L. 1996. “Bagging predictors.” Mach. Learn. 24 (2): 123–140. https://doi.org/10.1007/BF00058655.
Breiman, L., J. H. Friedman, R. A. Olshen, and C. J. Stone. 1984. Classification and regression trees. New York: Chapman & Hall.
Bunkhumpornpat, C. K., K. Sinapiromsaran, and C. Lursinsap. 2012. “DBSMOTE: Density-based synthetic minority over-sampling technique.” Appl. Intell. 36 (3): 664–684. https://doi.org/10.1007/s10489-011-0287-y.
Chawla, N. V., K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer. 2002. “SMOTE: Synthetic minority over-sampling technique.” J. Artif. Intell. Res. 16: 321–357. https://doi.org/10.1613/jair.953.
Cordon, I., S. Garcia, A. Fernandez, and F. Herrera. 2018. “Imbalance: Oversampling algorithms for imbalanced classification in R.” Knowl.-Based Syst. 161 (Dec): 329–341. https://doi.org/10.1016/j.knosys.2018.07.035.
Friedl, M. A., and C. E. Brodley. 1997. Decision tree classification of land cover from remotely sensed data. New York: Elsevier.
Georgia State University News Hub. 2019. “Researchers study how growth affects water flows in DeKalb County.” Accessed October 10, 2019. https://news.gsu.edu/2019/09/25/geoscience-researchers-study-how-growth-affects-water-flows-in-dekalb-county/.
Geurts, P., D. Ernst, and L. Wehenkel. 2006. “Extremely randomized trees.” Mach. Learn. 63 (1): 3–42. https://doi.org/10.1007/s10994-006-6226-1.
Guerra, J. 2019. “$1.64 million contract helps Clearwater monitor health of sewage system.” Accessed October 10, 2019. https://www.tbnweekly.com/clearwater_beacon/article_8d269712-dae6-11e9-8238-a33146ec8c3d.html.
John, V., Z. Lui, C. Guo, S. Mita, and K. Kidono. 2015. “Real-time lane estimation using deep features and extra trees regression.” In Proc., Pacific-Rim Symp. 2015, 721–733. Cham, Switzerland: Springer.
Ke, G., Q. Meng, T. Finley, T. Wang, W. Chen, W. Ma, Q. Te, and T. Liu. 2017. “LightGBM: A highly efficient gradient boosting decision tree.” In Proc., 31st Conf. on Neural Information Processing Systems. Red Hook, NY: Curran Associates.
Kesik, T. 2015. Best practices guide: Management of inflow and infiltration in new urban developments. Toronto: Institute for Catastrophic Loss Reduction.
Kim, H., E. Ahn, M. Shin, and S. H. Sim. 2019. “Crack and noncrack classification from concrete surface images using machine learning.” Struct. Health Monit. 18 (3): 725–738. https://doi.org/10.1177/1475921718768747.
Koch, C., K. Georgieva, V. Kasireddy, B. Akinci, and P. Fieguth. 2015. “A review on computer vision-based defect detection and condition assessment of concrete and asphalt civil infrastructure.” Adv. Eng. Inf. 29 (2): 196–210. https://doi.org/10.1016/j.aei.2015.01.008.
Kovacs, G. 2019. “Smote-variants: A Python implementation of 85 minority oversampling techniques.” Neurocomputing 366 (Nov): 352–354. https://doi.org/10.1016/j.neucom.2019.06.100.
Liaw, A., and M. Wiener. 2002. “Classification and regression by randomforest.” R News 2/3: 18–22.
Liu, Y., Y. Gu, J. C. Nguyen, H. Li, J. Zhang, Y. Gao, and Y. Huang. 2017. “Symptom severity classification with gradient tree boosting.” Supplement, J. Biomed. Inf. 75 (Nov): 105–111. https://doi.org/10.1016/j.jbi.2017.05.015.
Mangalathu, S., and J. S. Jeon. 2018. “Classification of failure mode and prediction of shear strength for reinforced concrete beam-column joints using machine learning techniques.” Eng. Struct. 160 (Apr): 85–94. https://doi.org/10.1016/j.engstruct.2018.01.008.
Mangalathu, S., and J. S. Jeon. 2019. “Machine learning-based failure mode recognition of circular reinforced concrete bridge columns: Comparative study.” J. Struct. Eng. 145 (10): 04019104. https://doi.org/10.1061/(ASCE)ST.1943-541X.0002402.
Mason, L., J. Baxter, P. Bartlett, and M. Frean. 1999. “Boosting algorithms as gradient descent.” In Proc., 12th Int. Conf. on Neural Information Processing Systems (NIPS’99), 512–518. Cambridge, MA: MIT Press.
McGoff, C. 2019. “Dedham deals with infiltration and inflow issues in sewer system.” Accessed October 10, 2019. https://dedham.wickedlocal.com/news/20190408/dedham-deals-with-infiltration-and-inflow-issues-in-sewer-system.
Melhem, H. G., and Y. Cheng. 2003. “Prediction of remaining service life of bridge decks using machine learning.” J. Comput. Civ. Eng. 17 (1): 1–9. https://doi.org/10.1061/(ASCE)0887-3801(2003)17:1(1).
Melhem, H. G., and Y. Cheng. 2019. “Prediction of remaining service life of bridge decks using machine learning.” J. Comput. Civ. Eng. 17 (1): 1–9. https://doi.org/10.1061/(ASCE)0887-3801(2003)17:1(1).
Melo, F. 2019. “St. Paul wins Met Council grant to keep storm water out of sanitary sewer system.” Accessed October 10, 2019. https://www.twincities.com/2019/09/25/st-paul-wins-500000-three-year-wastewater-grant-for-west-side-flats-area/.
Nehdi, M., H. E. Chabib, and M. H. Naggar. 2001. “Predicting performance of self-compacting concrete mixtures using artificial neural networks.” ACI Mater. J. 98 (5): 394–401.
Pedregosa, F., et al. 2011. “Scikit-learn: Machine learning in Python.” J. Mach. Learn. Res. 12: 2825–2830.
Quinlan, J. R. 2006. Bagging, boosting, and C4.5. Sydney, Australia: Univ. of Sydney.
Rafiei, M. H., and H. Adeli. 2017. “A novel machine learning-based algorithm to detect damage in high-rise building structures.” Struct. Des. Tall Special Build. 26 (18): e1400. https://doi.org/10.1002/tal.1400.
Reich, Y. 1997. “Machine learning techniques for civil engineering problems.” Microcomput. Civ. Eng. 12 (4): 295–310. https://doi.org/10.1111/0885-9507.00065.
Reich, Y., and S. V. Barai. 1999. “Evaluating machine learning models for engineering problems.” Artif. Intell. Eng. 13 (3): 257–272. https://doi.org/10.1016/S0954-1810(98)00021-1.
Robinson, B., and D. Sandink. 2017. “Research shows ‘unacceptable’ high rates of sewer inflow, infiltration in new subdivisions.” Environmental Science and Engineering Magazine, October 2017 Issue, 42–45.
Robinson, B., D. Sandink, and D. Lapp. 2019. Reducing the risk of inflow and infiltration (I/I) in new sewer construction. Toronto: Institute for Catastrophic Loss Reduction.
Rodriguez-Galiano, V. F., B. Ghimire, J. Rogan, M. Chica-Olmo, and J. P. Rigol-Sanchez. 2012. “An assessment of the effectiveness of a random forest classifier for land-cover classification.” ISPRS J. Photogramm. Remote Sens. 67 (Jan): 63–104. https://doi.org/10.1016/j.isprsjprs.2011.11.002.
Sattar, A. M., Ö. F. Ertuğrul, B. Gharabaghi, E. A. McBean, and J. Cao. 2019. “Extreme learning machine model for water network management.” Neural Comput. Appl. 31 (1): 157–169. https://doi.org/10.1007/s00521-017-2987-7.
Sokolova, M., N. Japkowicz, and S. Szpakowicz. 2006. “Beyond accuracy, F-score and ROC: A family of discriminant measures for performance evaluation.” In Vol. 4304 of Proc., Australasian Joint Conf. on Artificial Intelligence, 1015–1021. Berlin: Springer.
Sturgeon, K. 2019. “Fulton County to raise water rates, public hearing Tuesday.” Accessed October 10, 2019. https://patch.com/georgia/alpharetta/amp/28308162/fulton-county-to-raise-water-rates-public-hearing-tuesday.
Tharwat, A. 2018. “Classification assessment methods.” In Applied computing and informatics. Bingley, UK: Emerald Publishing.
Torres-Barran, A., A. Alonso, and J. R. Dorronsoro. 2019. “Regression tree ensembles for wind energy and solar radiation prediction.” Neurocomputing 326–327: 151–160. https://doi.org/10.1016/j.neucom.2017.05.104.
USEPA. 2014. Guide for estimating infiltration and inflow. Washington, DC: USEPA.
Varmuza, K., P. Filzmoser, M. Hilchenbach, H. Kruger, and J. Silen. 2014. “KNN classification—Evaluated by repeated double cross validation: Recognition of minerals relevant for comet dust.” Chemometr. Intell. Lab. Syst. 138 (Nov): 67–71. https://doi.org/10.1016/j.chemolab.2014.07.011.
Watson, S. 2019. “Amherst, Williamsville eye combining sanitary sewer systems” Accessed October 10, 2019. https://buffalonews.com/2019/07/05/amherst-williamsville-eye-combining-sanitary-sewer-systems/.
Wong, L., and M. L. Nehdi. 2018. “New test method for RCP joint hydrostatic infiltration.” In Proc., CSCE Conf. 2018. Fredericton, NB, Canada: Canadian Society for Civil Engineering.
Wong, L., and M. L. Nehdi. 2020. “Quantifying resistance of reinforced concrete pipe joints to water infiltration.” J. Pipeline Syst. Eng. Pract. 11 (3): 04020020. https://doi.org/10.1061/(ASCE)PS.1949-1204.0000462.
Wong, S. C., A. Gatt, V. Stamatescu, and M. D. McDonnell. 2016. “Understanding data augmentation for classification: When to wrap?” In Proc., 2016 Int. Conf. on Digital Image Computing: Techniques and Applications (DICTA). Gold Coast, QLD, Australia: Institute of Electrical and Electronic Engineers.
Yaseen, Z. M., R. C. Deo, A. Hilal, A. M. Abd, L. C. Bueno, S. Salcedo-Sanz, and M. L. Nehdi. 2018. “Predicting compressive strength of lightweight foamed concrete using extreme learning machine model.” Adv. Eng. Software 115 (Jan): 112–125. https://doi.org/10.1016/j.advengsoft.2017.09.004.
Zealand, C. M., D. H. Burn, and S. P. Simonovic. 1999. “Short term streamflow forecasting using artificial neural networks.” J. Hydrol. 214 (1–4): 32–48. https://doi.org/10.1016/S0022-1694(98)00242-X.
Zhang, A., C. P. Wang, B. Li, E. Yang, X. Dai, Y. Peng, Y. Fei, Y. Liu, J. Q. Li, and C. Chen. 2017. “Automated pixel-level pavement crack detection on 3D asphalt surfaces using a deep-learning network.” Comput.-Aided Civ. Infrastruct. Eng. 32 (10): 805–819. https://doi.org/10.1111/mice.12297.

Information & Authors

Information

Published In

Go to Journal of Pipeline Systems Engineering and Practice
Journal of Pipeline Systems Engineering and Practice
Volume 12Issue 3August 2021

History

Received: May 6, 2020
Accepted: Dec 30, 2020
Published online: Mar 31, 2021
Published in print: Aug 1, 2021
Discussion open until: Aug 31, 2021

Permissions

Request permissions for this article.

Authors

Affiliations

Lui S. Wong [email protected]
Ph.D. Candidate, Dept. of Civil and Environmental Engineering, Western Univ., London, ON, Canada N6A 5B9. Email: [email protected]
Afshin Marani [email protected]
Ph.D. Candidate, Dept. of Civil and Environmental Engineering, Western Univ., London, ON, Canada N6A 5B9. Email: [email protected]
Professor, Dept. of Civil and Environmental Engineering, Western Univ., London, ON, Canada N6A 5B9 (corresponding author). ORCID: https://orcid.org/0000-0002-2561-993X. Email: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

Cited by

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share