Rank and Linear Correlation Differences in Monte Carlo Simulation
Publication: ASCE-ASME Journal of Risk and Uncertainty in Engineering Systems, Part A: Civil Engineering
Volume 7, Issue 1
Abstract
Monte Carlo simulation is used to quantify and characterize uncertainty in a variety of applications, such as cost and engineering economic analysis and project management. The dependence or correlation between the random variables modeled can also be simulated to add more accuracy to simulations. However, there exists a difference between how correlation is most often estimated from data (linear correlation) and the correlation that is simulated (rank correlation). In this research an empirical methodology is developed to estimate the difference between the specified linear correlation between two random variables and the resulting linear correlation when rank correlation is simulated. It is shown that in some cases there can be relatively large differences. The methodology is based on the shape of the quantile-quantile plot of two distributions, the maximum possible linear correlation between two distributions, and the estimated level of correlation between the two random variables. This methodology also gives users the ability to estimate the rank correlation that, when simulated, generates a desired linear correlation. This methodology enhances the accuracy of simulations with dependent random variables while utilizing existing simulation software tools.
Get full access to this article
View all available purchase options and get full access to this article.
Data Availability Statement
Some or all data, models, or code that support the findings of this study are available from the corresponding author upon reasonable request.
References
BadriAmr, M. A. 1997. “Effective analysis and planning of R&D stages: A simulation approach.” Int. J. Project Manage. 15 (6): 351–358.
Balakrishnan, N., and C.-D. Lai. 2009. Continuous bivariate distributions. 2nd ed. New York: Springer.
Chambers, J. M., W. S. Cleveland, K. Beat, and P. A. Tukey. 1983. Graphical methods for data analysis. Belmont, CA: Wadsworth International Group.
Cooper, D. F., and C. B. Chapman. 1986. Risk analysis for large projects, models, methods and cases. New York: Wiley.
Devore, J. L. 1999. Probability and statistics for engineering and sciences. 5th ed. Boston: Brooks.
Diekmann, J. E. 1983. “Probabilistic estimating: Mathematics and applications.” J. Constr. Eng. Manage. 109 (3): 297–308. https://doi.org/10.1061/(ASCE)0733-9364(1983)109:3(297).
Elkjaer, M. 2000. “Stochastic budget simulation.” Int. J. Project Manage. 18 (2): 139–147. https://doi.org/10.1016/S0263-7863(98)00078-7.
Fisher, R. A. 1915. “Frequency distribution of the values of the correlation coefficient in samples from an indefinitely large population.” Biometrika 10 (4): 507–521.
Fowlkes, E. B. 1987. Vol. 78 of A folio of distributions: A collection of theoretical quantile-quantile plots. Dallas: Marcel Dekker.
Garvey, P. R. 1999. Probability methods for cost uncertainty analysis: A systems engineering perspective. Dallas: Marcel Dekker.
Hines, W. W., and D. C. Montgomery. 1972. Probability and statistics in engineering and management science. New York: Ronald Press.
Hines, W. W., D. C. Montgomery, D. M. Goldsman, and C. M. Borror. 2003. Probability and statistics in engineering. 4th ed. New York: Wiley.
Iman, R. L., and W. J. Conover. 1982. “A distribution-free approach to inducing rank correlation among input variables.” Commun. Stat. Simul. Comput. 11 (3): 311–334. https://doi.org/10.1080/03610918208812265.
Johnson, M., and J. Ramberg. 1978. “Transformations of the multivariate normal distribution with applications to simulation.” In Proc., Int. Conf. on Systems Sciences. New York: IEEE.
Kendall, M., and J. D. Gibbons. 1990. Rank correlation methods. 5th ed. Oxford: Oxford University Press.
Kruskal, W. H. 1958. “Ordinal measures of association.” J. Am. Stat. Assoc. 53 (284): 814–861. https://doi.org/10.1080/01621459.1958.10501481.
Li, S. T., and J. L. Hammond. 1975. “Generation of pseudo-random numbers with specified univariate distributions and correlation coefficients.” IEEE Trans. Syst. Manage. Cybern. 5 (Sep): 557–561. https://doi.org/10.1109/TSMC.1975.5408380.
Lurie, P. M., and M. S. Goldberg. 1998. “An approximate method for sampling correlated random variables from partially-specified distributions.” Manage. Sci. 44 (2): 203–218. https://doi.org/10.1287/mnsc.44.2.203.
Mildenhall, S. 2006. “The report of the research working party on correlations and dependencies among all risk sources, part1.” In Casual actuarial society forum: CAS research working party on correlations and dependencies among all risk sources. Arlington, TX: Casualty Actuarial Society.
Nelsen, R. B. 2006. An introduction to copulas. 2nd ed. New York: Springer.
Price, J. 2002. “An implementation of the Lurie-Goldberg algorithm in schedule risk analysis.” In Proc., SCEA 2002 National Conf. Scottsdale. Vienna, VA: Society of Cost Estimating and Analysis.
Raftery, J. 1994. Risk analysis in project management. London: E&FN Spon.
Ranasinghe, M. 2000. “Impact of correlation and induced correlation on the estimation of project cost of buildings.” Constr. Manage. Econ. 18 (4): 395–406. https://doi.org/10.1080/01446190050024815.
Robinson, M., and S. Cole. 2004. “Rank correlation in crystal ball simulations.” In Proc., Society of Cost Estimating and Analysis Annual Meeting. Vienna, VA: Society of Cost Estimating and Analysis.
Schmid, J. 1947. “The relationship between the coefficient of correlation and the angle included between regression lines.” J. Educ. Res. 41 (4): 311–313. https://doi.org/10.1080/00220671.1947.10881608.
Seila, A. F. 2001. “Spreadsheet simulation.” In Proc., 2001 Winter Simulation Conf. Athens, GA: The Univ. of Georgia.
Touran, A., and L. Suphot. 1997. “Rank correlation in simulating construction costs.” J. Constr. Eng. Manage. 123 (3): 297–301. https://doi.org/10.1061/(ASCE)0733-9364(1997)123:3(297).
Touran, A., and E. P. Wiser. 1992. “Monte Carlo technique with correlated random variables.” J. Constr. Eng. Manage. 118 (2): 258–272. https://doi.org/10.1061/(ASCE)0733-9364(1992)118:2(258).
Tsai, D.-M., and C.-H. Yang. 2005. “A quantile–quantile plot based pattern matching for defect detection.” Pattern Recognit. Lett. 26 (13): 1948–1962. https://doi.org/10.1016/j.patrec.2005.02.002.
Tummala, V. M., and J. F. Burchett. 1999. “Applying a risk management process (RMP) to manage cost risk for an EHV transmission line project.” Int. J. Project Manage. 17 (4): 223–235. https://doi.org/10.1016/S0263-7863(98)00038-6.
Walpole, R. 2007. Probability and statistics for engineers and scientists. Upper Saddle River, NJ: Pearson Prentice Hall.
Yang, I.-T. 2005. “Simulation-based estimation for correlated cost elements.” Int. J. Project Manage. 23 (4): 275–282. https://doi.org/10.1016/j.ijproman.2004.12.002.
Yule, G., and M. Kendal. 1950. An introduction to the theory of statistics. 14th ed. Granville, OH: Charles Griffin.
Information & Authors
Information
Published In
Copyright
© 2020 American Society of Civil Engineers.
History
Received: Jun 5, 2020
Accepted: Oct 13, 2020
Published online: Dec 17, 2020
Published in print: Mar 1, 2021
Discussion open until: May 17, 2021
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.