Technical Papers
Jun 25, 2024

Dictionary Learning of Spatial Variability at a Specific Site Using Data from Other Sites

Publication: Journal of Geotechnical and Geoenvironmental Engineering
Volume 150, Issue 9

Abstract

Due to time, budget, and/or technical constraints, geotechnical site investigation data from a specific site are often limited and sparse, leading to a long-lasting challenge in characterization of spatially varying geotechnical properties. During preliminary stages of site characterization, geotechnical data from neighboring sites or sites with similar geological conditions are often collected and used as valuable prior knowledge in geotechnical engineering practice. Nevertheless, existing methods for spatial variability characterization often rely solely on site-specific data and cannot effectively incorporate prior knowledge or existing databases. To address this issue, this study proposes a novel machine learning method that systematically combines sparsely measured data at a specific site with existing data from neighboring sites or sites with similar geological settings for characterization of property spatial variability in a data-driven manner. The proposed method starts with the construction of a dictionary that draws the dominant spatially varying patterns from a property measured at sites with similar geology under a dictionary learning framework. Leveraging the developed dictionary, the spatial variability of a property is interpreted from sparse site-specific measurements using Bayesian learning. The effectiveness of the proposed method is demonstrated using real data, and improved performance over existing methods is observed.

Get full access to this article

View all available purchase options and get full access to this article.

Data Availability Statement

Some or all data, models, or codes that support the findings of this study are available from the corresponding author upon reasonable request.

Acknowledgments

The work described in this paper was supported by grants from the Research Grant Council of Hong Kong Special Administrative Region (Project No.: CityU 11203322), the Science and Technology Development Fund, Macao SAR (Project No.: 001/2024/SKL), the National Natural Science Foundation of China (Grant No.: 42307215), and the Shenzhen Science and Technology Innovation Commission (Shenzhen-Hong Kong-Macau Science and Technology Project (Category C) No.: SGDX20210823104002020), China. The financial support is gratefully acknowledged.

References

Baecher, G. B., and J. T. Christian. 2005. Reliability and statistics in geotechnical engineering. Hoboken, NJ: Wiley.
Basaran, M., G. Erpul, A. U. Ozcan, D. S. Saygin, M. Kibar, I. Bayramin, and F. E. Yilman. 2011. “Spatial information of soil hydraulic conductivity and performance of cokriging over kriging in a semi-arid basin scale.” Environ. Earth Sci. 63 (4): 827–838. https://doi.org/10.1007/s12665-010-0753-6.
Basri, R., and D. W. Jacobs. 2003. “Lambertian reflectance and linear subspaces.” IEEE Trans. Pattern Anal. Mach. Intell. 25 (2): 218–233. https://doi.org/10.1109/TPAMI.2003.1177153.
Belhumeur, P. N., J. P. Hespanha, and D. J. Kriegman. 1997. “Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection.” IEEE Trans. Pattern Anal. Mach. Intell. 19 (7): 711–720. https://doi.org/10.1109/34.598228.
Brunton, S. L., and J. N. Kutz. 2022. Data-driven science and engineering: Machine learning, dynamical systems, and control. Cambridge, UK: Cambridge University Press.
Candès, E. J. 2006. “Compressive sampling.” In Vol. 3 of Proc., Int. Congr. Math., 1433–1452. Providence, RI: American Mathematical Society.
Candès, E. J., and T. Tao. 2006. “Near-optimal signal recovery from random projections: Universal encoding strategies?” IEEE Trans. Inf. Theory 52 (12): 5406–5425. https://doi.org/10.1109/TIT.2006.885507.
Candès, E. J., and M. B. Wakin. 2008. “An introduction to compressive sampling.” IEEE Signal Process. Mag. 25 (2): 21–30. https://doi.org/10.1109/MSP.2007.914731.
Cao, Z., Y. Wang, and D. Li. 2016. “Quantification of prior knowledge in geotechnical site characterization.” Eng. Geol. 203 (Mar): 107–116. https://doi.org/10.1016/j.enggeo.2015.08.018.
CEN (European Committee for Standardization). 2004. Eurocode 7: Geotechnical design—Part 1: General rules. EN 1997-1: 2004. Brussels, Belgium: CEN.
Chen, S. S., D. L. Donoho, and M. A. Saunders. 2001. “Atomic decomposition by basis pursuit.” SIAM Rev. 43 (1): 129–159. https://doi.org/10.1137/S003614450037906X.
Cheon, J. Y., and R. B. Gilbert. 2014. “Modeling spatial variability in offshore geotechnical properties for reliability-based foundation design.” Struct. Saf. 49 (Jul): 18–26. https://doi.org/10.1016/j.strusafe.2013.07.008.
Ching, J., W. H. Huang, and K. K. Phoon. 2020. “3D probabilistic site characterization by Sparse Bayesian Learning.” J. Eng. Mech. 146 (12): 04020134. https://doi.org/10.1061/(ASCE)EM.1943-7889.0001859.
Ching, J., and K. K. Phoon. 2017. “Characterizing uncertain site-specific trend function by sparse Bayesian learning.” J. Eng. Mech. 143 (7): 04017028. https://doi.org/10.1061/(ASCE)EM.1943-7889.0001240.
Ching, J., and K. K. Phoon. 2020a. “Constructing a site-specific multivariate probability distribution using sparse, incomplete, and spatially variable (MUSIC-X) data.” J. Eng. Mech. 146 (7): 04020061. https://doi.org/10.1061/(ASCE)EM.1943-7889.0001779.
Ching, J., and K. K. Phoon. 2020b. “Measuring similarity between site-specific data and records from other sites.” ASCE-ASME J. Risk Uncertainty Eng. Syst. Part A: Civ. Eng. 6 (2): 04020011. https://doi.org/10.1061/AJRUA6.0001046.
Ching, J., K. K. Phoon, J. L. Beck, and Y. Huang. 2017. “Identifiability of geotechnical site-specific trend functions.” ASCE-ASME J. Risk Uncertainty Eng. Syst. Part A: Civ. Eng. 3 (4): 04017021. https://doi.org/10.1061/AJRUA6.0000926.
Ching, J., K. K. Phoon, and C. T. Wu. 2022a. “Data-centric quasi-site-specific prediction for compressibility of clays.” Can. Geotech. J. 59 (12): 2033–2049. https://doi.org/10.1139/cgj-2021-0658.
Ching, J., K. K. Phoon, Z. Y. Yang, and A. W. Stuedlein. 2022b. “Quasi-site-specific multivariate probability distribution model for sparse, incomplete, and three-dimensional spatially varying soil data.” Georisk 16 (1): 53–76. https://doi.org/10.1080/17499518.2021.1971256.
Ching, J., S. Wu, and K. K. Phoon. 2021a. “Constructing quasi-site-specific multivariate probability distribution using hierarchical Bayesian model.” J. Eng. Mech. 147 (10): 04021069. https://doi.org/10.1061/(ASCE)EM.1943-7889.0001964.
Ching, J., X. Xu, K. K. Phoon, and M. Uzielli. 2023a. “Characterizing spatially variable cone tip resistance soundings from a global CPT database.” J. Geotech. Geoenviron. Eng. 149 (10): 04023090. https://doi.org/10.1061/JGGEFK.GTENG-11214.
Ching, J., Z. Y. Yang, and K. K. Phoon. 2021b. “Dealing with non-lattice data in three-dimensional probabilistic site characterization.” J. Eng. Mech. 147 (5): 06021003. https://doi.org/10.1061/(ASCE)EM.1943-7889.0001907.
Ching, J., I. Yoshida, and K. K. Phoon. 2023b. “Comparison of trend models for geotechnical spatial variability: Sparse Bayesian learning vs. Gaussian process regression.” Gondwana Res. 123 (Nov): 174–183. https://doi.org/10.1016/j.gr.2022.07.011.
Ching, J. Y., K. K. Phoon, Y. H. Ho, and M. C. Weng. 2021c. “Quasi-site-specific prediction for deformation modulus of rock mass.” Can. Geotech. J. 58 (7): 936–951. https://doi.org/10.1139/cgj-2020-0168.
Christian, J. T. 2004. “Geotechnical engineering reliability: How well do we know what we are doing?” J. Geotech. Geoenviron. Eng. 130 (10): 985–1003. https://doi.org/10.1061/(ASCE)1090-0241(2004)130:10(985).
Donoho, D. L. 2006. “Compressed sensing.” IEEE Trans. Inf. Theory 52 (4): 1289–1306. https://doi.org/10.1109/TIT.2006.871582.
Guan, Z., and Y. Wang. 2020. “Statistical charts for determining sample size at various levels of accuracy and confidence in geotechnical site investigation.” Géotechnique 70 (12): 1145–1159. https://doi.org/10.1680/jgeot.18.P.315.
Guan, Z., and Y. Wang. 2022. “CPT-based probabilistic liquefaction assessment considering soil spatial variability, interpolation uncertainty and model uncertainty.” Comput. Geotech. 141 (Jan): 104504. https://doi.org/10.1016/j.compgeo.2021.104504.
Guan, Z., and Y. Wang. 2023. “Data-driven simulation of two-dimensional cross-correlated random fields from limited measurements using joint sparse representation.” Reliab. Eng. Syst. Saf. 238 (Oct): 109408. https://doi.org/10.1016/j.ress.2023.109408.
Guan, Z., Y. Wang, and K. K. Phoon. 2023. “Fusion of sparse non-co-located measurements from multiple sources for geotechnical site investigation.” Can. Geotech. J. https://doi.org/10.1139/cgj-2023-028.
Guan, Z., Y. Wang, and A. W. Stuedlein. 2022. “Efficient three-dimensional soil liquefaction potential and reconsolidation settlement assessment from limited CPTs considering spatial variability.” Soil Dyn. Earthquake Eng. 163 (Dec): 107518. https://doi.org/10.1016/j.soildyn.2022.107518.
Hibert, C., G. Grandjean, A. Bitri, J. Travelletti, and J. P. Malet. 2012. “Characterizing landslides through geophysical data fusion: Example of the La Valette landslide (France).” Eng. Geol. 128 (Mar): 23–29. https://doi.org/10.1016/j.enggeo.2011.05.001.
Hu, Y., Y. Wang, T. Zhao, and K. K. Phoon. 2020. “Bayesian supervised learning of site-specific geotechnical spatial variability from sparse measurements.” ASCE-ASME J. Risk Uncertainty Eng. Syst. Part A: Civ. Eng. 6 (2): 04020019. https://doi.org/10.1061/AJRUA6.0001059.
Ji, S., Y. Xue, and L. Carin. 2008. “Bayesian compressive sensing.” IEEE Trans. Signal Process. 56 (6): 2346–2356. https://doi.org/10.1109/TSP.2007.914345.
Matheron, G. 1963. “Principles of geostatistics.” Economic Geol. 58 (8): 1246–1266. https://doi.org/10.2113/gsecongeo.58.8.1246.
Mayne, P. W., B. R. Christopher, and J. DeJong. 2001. Manual on subsurface investigations. Nat. Highway Inst. Sp. Pub. FHWA NHI-01–031. Washington, DC: Federal Highway Administration.
NZGD (New Zealand Geotechnical Database). 2012. “New Zealand geotechnical database.” Accessed July 6, 2023. https://www.nzgd.org.nz/.
Ozturk, C. A., and E. Simdi. 2014. “Geostatistical investigation of geotechnical and constructional properties in Kadikoy–Kartal subway, Turkey.” Tunnelling Underground Space Technol. 41 (Mar): 35–45. https://doi.org/10.1016/j.tust.2013.11.002.
Phoon, K. K. 2023. “What geotechnical engineers want to know about reliability.” ASCE-ASME J. Risk Uncertainty Eng. Syst. Part A: Civ. Eng. 9 (2): 03123001. https://doi.org/10.1061/AJRUA6.RUENG-1002.
Phoon, K. K., and J. Ching. 2021. “Project DeepGeo - data-driven 3D subsurface mapping.” J. GeoEng. 16 (2): 61–74. https://doi.org/10.6310/jog.202106_16(2).2.
Phoon, K. K., and J. Ching. 2022. “Additional observations on the site recognition challenge.” J. GeoEng. 17 (4): 231–247. https://doi.org/10.6310/jog.202106_16(2).2.
Phoon, K. K., J. Ching, and Z. Cao. 2022a. “Unpacking data-centric geotechnics.” Underground Space 7 (6): 967–989. https://doi.org/10.1016/j.undsp.2022.04.001.
Phoon, K. K., J. Ching, and T. Shuku. 2022b. “Challenges in data-driven site characterization.” Georisk 16 (1): 114–126. https://doi.org/10.1080/17499518.2021.1896005.
Phoon, K. K., J. Ching, and Y. Tao. 2023. “Soil and rock parametric uncertainties: Chapter 2.” In Uncertainty, modelling, and decision making in geotechnics. Boca Raton, FL: CRC Press.
Phoon, K. K., and F. H. Kulhawy. 1999. “Characterization of geotechnical variability.” Can. Geotech. J. 36 (4): 612–624. https://doi.org/10.1139/t99-038.
Robert, C. P., G. Casella, and G. Casella. 1999. Monte Carlo statistical method. New York: Springer.
Shi, C., Y. Wang, and V. Kamchoom. 2023. “Data-driven multi-stage sampling strategy for a three-dimensional geological domain using weighted centroidal Voronoi tessellation and IC-XGBoost3D.” Eng. Geol. 325 (Nov): 107301. https://doi.org/10.1016/j.enggeo.2023.107301.
Tibshirani, R. 1996. “Regression shrinkage and selection via the lasso.” J. R. Stat. Soc. B 58 (1): 267–288. https://doi.org/10.1111/j.2517-6161.1996.tb02080.x.
Tipping, M. E. 2001. “Sparse Bayesian learning and the relevance vector machine.” J. Mach. Learn. Res. 1 (Jun): 211–244.
Tropp, J. A., and A. C. Gilbert. 2007. “Signal recovery from random measurements via orthogonal matching pursuit.” IEEE Trans. Inf. Theory 53 (12): 4655–4666. https://doi.org/10.1109/TIT.2007.909108.
Wang, Y., Z. Guan, and T. Zhao. 2019. “Sample size determination in geotechnical site investigation considering spatial variation and correlation.” Can. Geotech. J. 56 (7): 992–1002. https://doi.org/10.1139/cgj-2018-0474.
Wang, Y., and T. Zhao. 2017. “Statistical interpretation of soil property profiles from sparse data using Bayesian Compressive Sampling.” Geotechnique 67 (6): 523–536. https://doi.org/10.1680/jgeot.16.P.143.
Webster, R., and M. A. Oliver. 2007. Geostatistics for environmental scientists. Chichester, UK: Wiley.
Wipf, D. 2011. “Sparse estimation with structured dictionaries.” In Advances in neural information processing systems, 24. Cambridge, MA: MIT Press.
Wipf, D. P., and B. D. Rao. 2004. “Sparse Bayesian learning for basis selection.” IEEE Trans. Signal Process. 52 (8): 2153–2164. https://doi.org/10.1109/TSP.2004.831016.
Wright, J., A. Y. Yang, A. Ganesh, S. S. Sastry, and Y. Ma. 2008. “Robust face recognition via sparse representation.” IEEE Trans. Pattern Anal. Mach. Intell. 31 (2): 210–227. https://doi.org/10.1109/TPAMI.2008.79.
Xu, J., Y. Wang, and L. Zhang. 2021. “Interpolation of extremely sparse geo-data by data fusion and collaborative Bayesian compressive sampling.” Comput. Geotech. 134 (Jun): 104098. https://doi.org/10.1016/j.compgeo.2021.104098.
Xu, J., Y. Wang, and L. Zhang. 2022. “Fusion of geotechnical and geophysical data for 2D subsurface site characterization using multi-sources Bayesian compressive sampling.” Can. Geotech. J. 59 (10): 1756–1773. https://doi.org/10.1139/cgj-2021-0323.
Yoshida, I., Y. Tomizawa, and Y. Otake. 2021. “Estimation of trend and random components of conditional random field using Gaussian process regression.” Comput. Geotech. 136 (Aug): 104179. https://doi.org/10.1016/j.compgeo.2021.104179.
Zhao, T., and Y. Wang. 2019. “Determination of efficient sampling locations in geotechnical site characterization using information entropy and Bayesian compressive sampling.” Can. Geotech. J. 56 (11): 1622–1637. https://doi.org/10.1139/cgj-2018-0286.
Zhao, T., and Y. Wang. 2020. “Non-parametric simulation of non-stationary non-Gaussian 3D random field samples directly from sparse measurements using signal decomposition and Markov Chain Monte Carlo.” Reliab. Eng. Syst. Saf. 203 (Nov): 107087. https://doi.org/10.1016/j.ress.2020.107087.

Information & Authors

Information

Published In

Go to Journal of Geotechnical and Geoenvironmental Engineering
Journal of Geotechnical and Geoenvironmental Engineering
Volume 150Issue 9September 2024

History

Received: Oct 23, 2023
Accepted: Mar 11, 2024
Published online: Jun 25, 2024
Published in print: Sep 1, 2024
Discussion open until: Nov 25, 2024

Permissions

Request permissions for this article.

ASCE Technical Topics:

Authors

Affiliations

Research Assistant Professor, State Key Laboratory of Internet of Things for Smart City, Dept. of Civil and Environmental Engineering, Univ. of Macau, Macao, China. Email: [email protected]
Professor, Dept. of Architecture and Civil Engineering, City Univ. of Hong Kong, Tat Chee Ave., Kowloon, Hong Kong (corresponding author). ORCID: https://orcid.org/0000-0003-4635-7059. Email: [email protected]
Professor, Information Systems Technology and Design/Architecture and Sustainable Design, Singapore Univ. of Technology and Design, 8 Somapah Rd., Singapore 487372. ORCID: https://orcid.org/0000-0003-2577-8639. Email: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share