Development of Subsurface Geological Cross-Section from Limited Site-Specific Boreholes and Prior Geological Knowledge Using Iterative Convolution XGBoost
Publication: Journal of Geotechnical and Geoenvironmental Engineering
Volume 147, Issue 9
Abstract
The delineation of vertical geological cross-sections is an essential task in geotechnical site characterization and has a profound impact on subsequent geotechnical designs and analyses. It is a long-lasting challenge, particularly for complex geological settings, to properly produce a subsurface geological cross-section from limited boreholes that are usually encountered in engineering practice. Emerging machine learning methods, such as the convolutional neural network (CNN), provide a fresh perspective of this challenge and effective alternatives for exploiting the complex stratigraphic relationships between different soil deposits. In this study, a novel iterative convolution eXtreme Gradient Boosting model (IC-XGBoost) is proposed. This model interpolates a subsurface geological cross-section from limited site-specific boreholes and a training geological cross-section obtained from previous projects with similar geological settings. This direct application of previous geological cross-sections for training is based on the assumption of similar local spatial connectivity or stratigraphic relationships between soils in areas with similar geological settings. The proposed method can learn stratigraphic patterns from a training image in an automatic manner. In addition, the proposed method is purely data-driven and does not require the specification of any parametric function form. The model performance is illustrated using both a simulated example and real data from a tunnel project in Australia. The proposed method not only infers the most probable geological cross-section but also quantifies the associated interpolation uncertainty from multiple realizations. The effect of the borehole number on the interpolation performance is also explicitly investigated.
Get full access to this article
View all available purchase options and get full access to this article.
Data Availability Statement
The developed executable file for the IC-XGBoost algorithm is available at https://sites.google.com/site/yuwangcityu/ic-xgboost.
Acknowledgments
The work described in this paper was supported by grants from the Research Grants Council of the Hong Kong Special Administrative Region, China (Project Nos. CityU 11213119 and CityU 11213117). The financial supports are gratefully acknowledged.
References
Boeckmann, A. Z., and J. E. Loehr. 2016. Influence of geotechnical investigation and subsurface conditions on claims, change orders, and overruns. National cooperative highway research program synthesis 484. Washington, DC: Transportation Research Board.
Boisvert, J. B., M. J. Pyrcz, and C. V. Deutsch. 2007. “Multiple-point statistics for training image selection.” Nat. Resour. Res. 16 (4): 313–321. https://doi.org/10.1007/s11053-008-9058-9.
Boureau, Y., F. Bach, Y. LeCun, and J. Ponce. 2010a. “Learning mid-level features for recognition.” In Proc., 2010 IEEE Computer Society Conf. on Computer Vision and Pattern Recognition, 2559–2566. New York: IEEE.
Boureau, Y., J. Ponce, and Y. LeCun. 2010b. “A theoretical analysis of feature pooling in visual recognition.” In Proc., 27th Int. Conf. on Machine Learning (ICML-10), 111–118. Alexandria, VA: National Science Foundation.
Chen, T., and C. Guestrin. 2016. “Xgboost: A scalable tree boosting system.” In Proc., 22nd ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, 785–794. New York: Special Interest Group on Management of Data.
Clayton, C. R. 2001. Managing geotechnical risk: Improving productivity in UK building and construction. London: Thomas Telford Publishing.
Deng, Z. P., S. H. Jiang, J. T. Niu, M. Pan, and L. L. Liu. 2020. “Stratigraphic uncertainty characterization using generalized coupled Markov chain.” Bull. Eng. Geol. Environ. 79 (10): 5061–5078. https://doi.org/10.1007/s10064-020-01883-y.
Deutsch, C., and L. Wang. 1996. “Hierarchical object-based stochastic modeling of fluvial reservoirs.” Math. Geol. 28 (7): 857–880. https://doi.org/10.1007/BF02066005.
Dumoulin, V., and F. Visin. 2016. “A guide to convolution arithmetic for deep learning.” Preprint, submitted March 23, 2016. http://arxiv.org/abs/1603.07285.
Elfeki, A., and M. Dekking. 2001. “A Markov chain model for subsurface characterization: Theory and applications.” Math. Geol. 33 (5): 569–589. https://doi.org/10.1023/A:1011044812133.
Hastie, T., R. Tibshirani, and J. Friedman. 2009. The elements of statistical learning. 2nd ed. New York: Springer.
Juang, C. H., T. Jiang, and R. A. Christopher. 2001. “Three-dimensional site characterisation: Neural network approach.” Géotechnique 51 (9): 799–809. https://doi.org/10.1680/geot.2001.51.9.799.
Juang, C. H., J. Zhang, M. Shen, and J. Hu. 2019. “Probabilistic methods for unified treatment of geotechnical and geological uncertainties in a geotechnical analysis.” Eng. Geol. 249 (Jan): 148–161. https://doi.org/10.1016/j.enggeo.2018.12.010.
Katiyar, S. K., and P. V. Arun. 2014. “Comparative analysis of common edge detection techniques in context of object extraction.” Accessed April 1, 2021. https://arxiv.org/ftp/arxiv/papers/1405/1405.6132.pdf.
Koltermann, C. E., and S. M. Gorelick. 1992. “Paleoclimatic signature in terrestrial flood deposits.” Science 256 (5065): 1775–1782. https://doi.org/10.1126/science.256.5065.1775.
Koltermann, C. E., and S. M. Gorelick. 1996. “Heterogeneity in sedimentary deposits: A review of structure-imitating, process-imitating, and descriptive approaches.” Water Resour. Res. 32 (9): 2617–2658. https://doi.org/10.1029/96WR00025.
Kumar, J. K., M. Konno, and N. Yasuda. 2000. “Subsurface soil-geology interpolation using fuzzy neural network.” J. Geotech. Geoenviron. Eng. 127 (7): 632–639. https://doi.org/10.1061/(ASCE)1090-0241(2000)126:7(632).
LeCun, Y., B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, and L. D. Jackel. 1989. “Backpropagation applied to handwritten zip code recognition.” Neural Comput. 1 (4): 541–551. https://doi.org/10.1162/neco.1989.1.4.541.
LeCun, Y., L. Bottou, and Y. Bengio. 1997. “Reading checks with multilayer graph transformer networks.” In Vol. 1 of Proc., IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, 151–154. New York: IEEE.
Li, D., X. Qi, Z. Cao, X. Tang, K. Phoon, and C. Zhou. 2016a. “Evaluating slope stability uncertainty using coupled Markov chain.” Comput. Geotech. 73 (Mar): 72–82. https://doi.org/10.1016/j.compgeo.2015.11.021.
Li, Z., X. Wang, H. Wang, and R. Y. Liang. 2016b. “Quantifying stratigraphic uncertainties by stochastic simulation techniques based on Markov random field.” Eng. Geol. 201 (Feb): 106–122. https://doi.org/10.1016/j.enggeo.2015.12.017.
Ludwig, J. 2013. “Image convolution, Portland State University.” Accessed April 1, 2020. http://web.pdx.edu/∼jduh/courses/Archive/geog481w07/Students/Ludwig_ImageConvolution.pdf.
Mairal, J., P. Koniusz, Z. Harchaoui, and C. Schmid. 2014. “Convolutional kernel networks.” In Advances in neural information processing systems, 2627–2635. Cambridge, MA: MIT Press.
Mitchell, T. M. 1997. Machine learning. Burr Ridge, IL: McGraw Hill.
Mood, A. M. 1940. “The distribution theory of runs.” Ann. Math. Stat. 11 (4): 367–392. https://doi.org/10.1214/aoms/1177731825.
Pathak, D., P. Krahenbuhl, J. Donahue, T. Darrell, and A. A. Efros. 2016. “Context encoders: Feature learning by inpainting.” In Proc., IEEE Conf. on Computer Vision and Pattern Recognition, 2536–2544. New York: IEEE.
Prezzi, M., B. McCullouch, and V. K. D. Mohan. 2011. Analysis of change orders in geotechnical engineering work at INDOT. West Lafayette, IN: Joint Transportation Research Program, Indiana DOT and Purdue Univ.
Qi, X., D. Li, K. Phoon, Z. Cao, and X. Tang. 2016. “Simulation of geologic uncertainty using coupled Markov chain.” Eng. Geol. 207 (Jun): 129–140. https://doi.org/10.1016/j.enggeo.2016.04.017.
Rodríguez, P., M. A. Bautista, J. Gonzalez, and S. Escalera. 2018. “Beyond one-hot encoding: Lower dimensional target embedding.” Image Vision Comput. 75 (Jul): 21–31. https://doi.org/10.1016/j.imavis.2018.04.004.
Ruder, S. 2016. “An overview of gradient descent optimization algorithms.” Preprints, submitted September 15, 2016. http://arxiv.org/abs/1609.04747.
Shi, C., and Y. Wang. 2021a. “Nonparametric and data-driven interpolation of subsurface soil stratigraphy from limited data using multiple point statistics.” Can. Geotech. J. 58 (2): 261–280. https://doi.org/10.1139/cgj-2019-0843.
Shi, C., and Y. Wang. 2021b. “Smart determination of borehole number and locations for stability analysis of multi-layered slopes using multiple point statistics and information entropy.” Can. Geotech. J. https://doi.org/10.1139/cgj-2020-0327.
Srivastava, N., G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov. 2014. “Dropout: A simple way to prevent neural networks from overfitting.” J. Mach. Learn. Res. 15 (1): 1929–1958.
Steffens, C. R., L. R. Messias, P. J. Drews Jr., and S. S. D. C. Botelho. 2020. “CNN based image restoration.” J. Intell. Rob. Syst. 99: 609–627. https://doi.org/10.1007/s10846-019-01124-9.
Thongsuwan, S., S. Jaiyen, A. Padcharoen, and P. Agarwal. 2021. “ConvXGB: A new deep learning model for classification problems based on CNN and XGBoost.” Nucl. Eng. Technol. 53 (2): 522–531. https://doi.org/10.1016/j.net.2020.04.008.
Van Vliet, L. J., I. T. Young, and G. L. Beckers. 1989. “A nonlinear laplace operator as edge detector in noisy images.” Comput. Vision Graphics Image Process. 45 (2): 167–195. https://doi.org/10.1016/0734-189X(89)90131-X.
Wang, Y., Y. Hu, and T. Zhao. 2020. “CPT-based subsurface soil classification and zonation in a 2D vertical cross-section using Bayesian compressive sampling.” Can. Geotech. J. 57 (7): 947–958. https://doi.org/10.1139/cgj-2019-0131.
Zhang, W., K. Itoh, J. Tanida, and Y. Ichioka. 1990. “Parallel distributed processing model with local space-invariant interconnections and its optical architecture.” Opt. Soc. Am. 29 (32): 4790–4797. https://doi.org/10.1364/AO.29.004790.
Zhang, W., R. Zhang, C. Wu, A. T. Goh, and L. Wang. 2020. “Assessment of basal heave stability for braced excavations in anisotropic clay using extreme gradient boosting and random forest regression.” Underground Space. https://doi.org/10.1016/j.undsp.2020.03.001.
Information & Authors
Information
Published In
Copyright
© 2021 American Society of Civil Engineers.
History
Received: Jul 18, 2020
Accepted: Apr 5, 2021
Published online: Jun 17, 2021
Published in print: Sep 1, 2021
Discussion open until: Nov 17, 2021
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.
Cited by
- Mao-Xin Wang, Andy Yat Fai Leung, Gang Wang, Pin Zhang, Semiempirical Predictive Models for Seismically Induced Slope Displacements Considering Ground Motion Directionality, Journal of Geotechnical and Geoenvironmental Engineering, 10.1061/JGGEFK.GTENG-11930, 150, 9, (2024).
- Chao Shi, Yu Wang, Data-driven spatio-temporal analysis of consolidation for rapid reclamation, Géotechnique, 10.1680/jgeot.22.00016, (1-21), (2023).
- Takayuki Shuku, Kok-Kwang Phoon, Data-driven subsurface modelling using a Markov random field model, Georisk: Assessment and Management of Risk for Engineered Systems and Geohazards, 10.1080/17499518.2023.2181973, (1-23), (2023).
- Chao Shi, Yu Wang, Development of training image database for subsurface stratigraphy, Georisk: Assessment and Management of Risk for Engineered Systems and Geohazards, 10.1080/17499518.2023.2169942, (1-18), (2023).
- Kok-Kwang Phoon, What Geotechnical Engineers Want to Know about Reliability, ASCE-ASME Journal of Risk and Uncertainty in Engineering Systems, Part A: Civil Engineering, 10.1061/AJRUA6.RUENG-1002, 9, 2, (2023).
- Hui Wang, Xingxing Wei, Stochastic Simulation of Soil Stratigraphic Profile Using Image Warping, Geo-Risk 2023, 10.1061/9780784484982.002, (12-24), (2023).
- Mohammad Faraz Athar, Sara Khoshnevisan, Laith Sadik, CPT-Based Soil Classification through Machine Learning Techniques, Geo-Congress 2023, 10.1061/9780784484708.026, (277-292), (2023).
- Yaxu Wang, Jiwen Wang, Ruirui Wang, Bin Liu, Yao Li, TBM penetration rate prediction ensemble model based on full-scale linear cutting test, Tunnelling and Underground Space Technology, 10.1016/j.tust.2022.104794, 131, (104794), (2023).
- Huaming Tian, Yu Wang, Data-driven and physics-informed Bayesian learning of spatiotemporally varying consolidation settlement from sparse site investigation and settlement monitoring data, Computers and Geotechnics, 10.1016/j.compgeo.2023.105328, 157, (105328), (2023).
- Wei Yan, Wan-Huan Zhou, Ping Shen, An Uncertainty-driven Peak-integration (UP) Strategy for 3D Borehole Layout Planning, Computers and Geotechnics, 10.1016/j.compgeo.2023.105280, 156, (105280), (2023).
- See more