Effect of Normal Transformation Methods on Performance of Multivariate Normal Distribution
Publication: ASCE-ASME Journal of Risk and Uncertainty in Engineering Systems, Part A: Civil Engineering
Volume 8, Issue 1
Abstract
Multivariate normal distribution is used widely to characterize the uncertainties and correlations for correlated geotechnical data. The success of constructing a multivariate normal distribution depends on the reliable estimation of the marginal probability density functions (PDFs) and the correlation matrix. This paper focused on the normal transformation which is related to the fitted marginal PDFs and investigated its effect on the performance of the constructed multivariate normal distributions, i.e., the normality of the multivariate normal distribution, the fitness of the simulated data with the original data, the rationality of the derived point estimate equations, and validation of the equations based on validation data sets. Three normal transformation methods with different types of fitted marginal PDF, namely Johnson transformation, three-parameter lognormal transformation, and Box–Cox transformation, were compared based on their application to a real soil database. It was found that all the three normal transformation methods are applicable in the framework of multivariate normal distribution, although the transformed variables do not follow the multivariate normal distribution. The consistence of normality of the transformed variables with the performance of the constructed multivariate normal distribution in estimating the unknown parameters using Bayesian updating technique was verified. The Johnson transformation method is the recommended method for constructing the multivariate normal distribution for the real databases due to its robustness and superiority in normal transformation.
Get full access to this article
View all available purchase options and get full access to this article.
Data Availability Statement
All data, models, or code that support the findings of this study are available from the corresponding author upon reasonable request.
Acknowledgments
This study is substantially supported by the Natural Science Foundation Committee Program (No. 52022070) and by the Shanghai Municipal Science and Technology Committee Program (20dz1202200). The authors are grateful to these programs.
References
Ang, A. H. S., and W. H. Tang. 1984. Probability concepts in engineering planning and design, decision, risk, and reliability. Vol. II. New York: John Wiley & Sons.
Box, G. E. P., and D. R. Cox. 1964. “An analysis of transformations.” J. R. Stat. Soc. Ser. B 26 (2): 211–252. https://doi.org/10.1111/j.2517-6161.1964.tb00553.x.
Ching, J., G.-H. Lin, K.-K. Phoon, and J. Chen. 2017. “Correlations among some parameters of coarse-grained soils—the multivariate probability distribution model.” Can. Geotech. J. 54 (9): 1203–1220. https://doi.org/10.1139/cgj-2016-0571.
Ching, J., and K.-K. Phoon. 2012. “Modeling parameters of structured clays as a multivariate normal distribution.” Can. Geotech. J. 49 (5): 522–545. https://doi.org/10.1139/t2012-015.
Ching, J., and K.-K. Phoon. 2013. “Multivariate distribution for undrained shear strengths under various test procedures.” Can. Geotech. J. 50 (9): 907–923. https://doi.org/10.1139/cgj-2013-0002.
Ching, J., and K.-K. Phoon. 2014. “Correlations among some clay parameters—the multivariate distribution.” Can. Geotech. J. 51 (6): 686–704. https://doi.org/10.1139/cgj-2013-0353.
Ching, J., and K.-K. Phoon. 2015. “Constructing multivariate distribution for soil parameters.” Chap. 1 in Risk and reliability in geotechnical engineering, 3–76. New York: CRC Press.
Ching, J., and K.-K. Phoon. 2020. “Constructing a site-specific multivariate probability distribution using sparse, incomplete, and spatially variable (MUSIC-X) data.” ASCE J. Eng. Mech. 46 (7): 04020061. https://doi.org/10.1061/(ASCE)EM.1943-7889.0001779.
Ching, J., K.-K. Phoon, and C.-H. Chen. 2014. “Modeling piezocone cone penetration (CPTU) parameters of clays as a multivariate normal distribution.” Can. Geotech. J. 51 (1): 77–91. https://doi.org/10.1139/cgj-2012-0259.
Ching, J., K.-K. Phoon, and D.-Q. Li. 2016. “Robust estimation of correlation coefficients among soil parameters under the multivariate normal framework.” Struct. Saf. 63 (Nov): 21–32. https://doi.org/10.1016/j.strusafe.2016.07.002.
D’Ignazio, M., K.-K. Phoon, S. A. Tan, and T. Lansivaara. 2016. “Correlations for undrained shear strength of Finnish soft clays.” Can. Geotech. J. 53 (10): 1628–1645. https://doi.org/10.1139/cgj-2016-0037.
Feng, S., and P. J. Vardanega. 2019. “Correlation of the hydraulic conductivity of fine-grained soils with water content ratio using a database.” Environ. Geotech. 6 (5): 253–268. https://doi.org/10.1680/jenge.18.00166.
Fleishman, A. I. 1978. “A method for simulating non-normal distributions.” Psychometrika 43 (4): 521–532. https://doi.org/10.1007/BF02293811.
Henze, N., and B. Zirkler. 1990. “A class of invariant consistent tests for multivariate normality.” Commun. Stat. Theory Methods 19 (10): 3595–3617. https://doi.org/10.1080/03610929008830400.
Hill, I. D., R. Hill, and R. L. Holder. 1976. “Algorithm AS 99: Fitting Johnson curves by moments.” J. R. Stat. Soc. Ser. C 25 (2): 180–189. https://doi.org/10.2307/2346692.
Hotelling, H., and M. R. Pabst. 1936. “Rank correlation and tests of significance involving no assumption of normality.” Ann. Math. Stat. 7 (1): 29–43. https://doi.org/10.1214/aoms/1177732543.
Johnson, N. L. 1949. “Systems of frequency curves generated by methods of translation.” Biometrika 36 (1–2): 149–176. https://doi.org/10.1093/biomet/36.1-2.149.
Johnston, J. 1984. Econometric methods. New York: McGraw-Hill.
Li, D.-Q., X.-S. Tang, K.-K. Phoon, Y.-F. Chen, and C.-B. Zhou. 2013. “Bivariate simulation using copula and its application to probabilistic pile settlement analysis.” Int. J. Numer. Anal. Methods Geomech. 37 (6): 597–617. https://doi.org/10.1002/nag.1112.
Li, D.-Q., S.-B. Wu, C.-B. Zhou, and K. K. Phoon. 2012. “Performance of translation approach for modeling correlated non-normal variables.” Struct. Saf. 39 (Nov): 52–61. https://doi.org/10.1016/j.strusafe.2012.08.001.
Liu, S., H. Zou, G. Cai, T. V. Bheemasetti, A. J. Puppala, and J. Lin. 2016. “Multivariate correlation among resilient modulus and cone penetration test parameters of cohesive subgrade soils.” Eng. Geol. 209 (Jul): 128–142. https://doi.org/10.1016/j.enggeo.2016.05.018.
Lu, Z.-H., C.-H. Cai, and Y.-G. Zhao. 2017. “Structural reliability analysis including correlated random variables based on third-moment transformation.” J. Struct. Eng. 143 (8): 04017067. https://doi.org/10.1061/(ASCE)ST.1943-541X.0001801.
Lv, T. J., X. S. Tang, D. Q. Li, and X. H. Qi. 2019. “Modeling multivariate distribution of multiple soil parameters using vine copula model.” Comput. Geotech. 118 (Feb): 103340. https://doi.org/10.1016/j.compgeo.2019.103340.
Mage, D. T. 1980. “An explicit solution for SB parameters using four percentile points.” Technometrics 22 (2): 247–251. https://doi.org/10.2307/1268464.
Mecklin, C. J., and D. J. Mundfrom. 2003. “On using asymptotic critical values in testing for multivariate normality.” Accessed September 26, 2021. https://www.researchgate.net/publication/228830820_On_using_asymptotic_critical_values_in_testing_for_multivariate_normality.
Montgomery, D. C., G. C. Runger, and N. F. Hubele. 2010. Engineering statistics. New York: Wiley.
Phoon, K. K. 2006. “Modeling and simulation of stochastic data.” In Proc., GeoCongress. Reston, VA: ASCE.
Phoon, K.-K., and F. H. Kulhawy. 1999. “Characterization of geotechnical variability.” Can. Geotech. J. 36 (4): 612–624. https://doi.org/10.1139/t99-038.
Slifker, J. F., and S. S. Shapiro. 1980. “The Johnson system: Selection and parameter estimation.” Technometrics 22 (2): 239–246. https://doi.org/10.1080/00401706.1980.10486139.
Tang, X.-S., D.-Q. Li, G. Rong, K.-K. Phoon, and C.-B. Zhou. 2013. “Impact of copula selection on geotechnical reliability under incomplete probability information.” Comput. Geotech. 49 (Apr): 264–278. https://doi.org/10.1016/j.compgeo.2012.12.002.
Tong, Y. L. 1990. “Order statistics of normal variables.” Chap. 6 in The multivariate normal distribution, 123–149. Berlin: Springer.
Xiao, S., J. Zhang, J. Ye, and J. Zheng. 2021. “Establishing region-specific relationships through hierarchical Bayesian modeling.” Eng. Geol. 287 (6): 106105. https://doi.org/10.1016/j.enggeo.2021.106105.
Zhang, D., Y. Zhou, K.-K. Phoon, and H. Huang. 2020. “Multivariate probability distribution of Shanghai clay properties.” Eng. Geol. 273 (Aug): 105675. https://doi.org/10.1016/j.enggeo.2020.105675.
Zhang, J. Z., H. W. Huang, D. M. Zhang, K. K. Phoon, Z. Q. Liu, and C. Tang. 2021a. “Quantitative evaluation of geological uncertainty and its influence on tunnel structural performance using improved coupled Markov chain.” Acta Geotech. https://doi.org/10.1007/s11440-021-01287-6.
Zhang, J. Z., H. W. Huang, D. M. Zhang, M. L. Zhou, C. Tang, and D. J. Liu. 2021b. “Effect of ground surface surcharge on deformational performance of tunnel in spatially variable soil.” Comput. Geotech. 136: 104229. https://doi.org/10.1016/j.compgeo.2021.104229.
Zhao, Y.-G., and T. Ono. 2000. “Third-moment standardization for structural reliability analysis.” J. Struct. Eng. 126 (6): 724–732. https://doi.org/10.1061/(ASCE)0733-9445(2000)126:6(724).
Zhao, Y.-G., Y.-Y. Weng, and Z.-H. Lu. 2021. “An orthogonal normal transformation of correlated non-normal random variables for structural reliability.” Probab. Eng. Mech. 64 (Apr): 103130. https://doi.org/10.1016/j.probengmech.2021.103130.
Information & Authors
Information
Published In
Copyright
© 2021 American Society of Civil Engineers.
History
Received: May 30, 2021
Accepted: Sep 8, 2021
Published online: Oct 21, 2021
Published in print: Mar 1, 2022
Discussion open until: Mar 21, 2022
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.
Cited by
- Yongmin Cai, Kok-Kwang Phoon, Qiujing Pan, Wuzhang Luo, Modifying the Tailored Clustering Enabled Regionalization (TCER) framework for outlier site detection and inference efficiency, Engineering Geology, 10.1016/j.enggeo.2024.107537, (107537), (2024).
- Yuansheng Gao, A new conjecture: the high-order difference of prime sequence approximately obeys the normal distribution, Second International Conference on Applied Statistics, Computational Mathematics, and Software Engineering (ASCMSE 2023), 10.1117/12.2692923, (159), (2023).
- Rustam Eyniyev, Leyla Sadikh-Zada, Gariba Dadashova, Shems Ismayilova, Nigar Hajiyeva, Using Multivariate Normal Distribution and Poisson Distribution for Spawn Items in Battle Royale Games, 2023 IEEE 17th International Conference on Application of Information and Communication Technologies (AICT), 10.1109/AICT59525.2023.10313181, (1-6), (2023).
- Geetopriyo Roy, Subhrajit Dutta, Satyabrata Choudhury, An Integrated Uncertainty Quantification Framework for Probabilistic Seismic Hazard Analysis, ASCE-ASME Journal of Risk and Uncertainty in Engineering Systems, Part A: Civil Engineering, 10.1061/AJRUA6.RUENG-1035, 9, 2, (2023).
- Dongdong Yan, Tengyuan Zhao, Ling Xu, Lu Zuo, Han Wen, Jie Ren, Statistical modeling of multivariate loess properties in Taiyuan using regular vine copula with optimized tree structure, Transportation Geotechnics, 10.1016/j.trgeo.2023.101025, 41, (101025), (2023).
- Pengfei Ma, Yi Zhang, Kefei Li, Quanwang Li, Junjie Wang, Le Li, Ze Yuan, Smart database design for concrete durability analysis - An application in the Hongkong-Zhuhai-Macau bridge, Cement and Concrete Research, 10.1016/j.cemconres.2022.107033, 163, (107033), (2023).
- A. Sønsteby, R. Rivero, T. L. Woznicki, O. M. Heide, Floral initiation in gooseberry ( Ribes uva-crispa L.) and its control by daylength and temperature , The Journal of Horticultural Science and Biotechnology, 10.1080/14620316.2021.2009743, 97, 3, (336-345), (2022).