Parallel Computing of Discontinuous Deformation Analysis Based on Graphics Processing Unit
Publication: International Journal of Geomechanics
Volume 17, Issue 5
Abstract
The performance of discontinuous deformation analysis (DDA) needs to be improved for large-scale analysis. In this study, the contact detection and open-close iteration, as the bottlenecks of DDA computing, are reimplemented on graphics processing units (GPUs). For contact detection, the proposed parallel method accelerates DDA computing by maintaining a high loading balance and data reuse ratio on the GPU. The open-close iteration is divided into two parts, simultaneous equations solver and interpenetration checker. For the simultaneous equations solver, both the parallel Jacobi method and the block Jacobi preconditioned conjugate gradient (BJPCG) methods are implemented on the GPU to substitute the original successive overrelaxation (SOR) method. The parallel interpenetration checker is improved by optimizing conditional branches on the GPU. Two applications of the new parallel methods are introduced. The results showed that the broad and narrow phases of contact detection showed 18 and 5 times speedup, respectively. The parallel BJPCG simultaneous equations solver showed 16 times speedup, and the parallel interpenetration checker showed 2 times speedup. The total performance of DDA is improved about 2 and 10 times, respectively, using the proposed methods.
Get full access to this article
View all available purchase options and get full access to this article.
Acknowledgments
The authors thank Dr. Genhua Shi for his supervision, guidance, and encouragement throughout this work. This work is supported by the National Natural Science Foundation of China (No. 61471338 and 61303155), the Knowledge Innovation Program of the Chinese Academy of Sciences and President Fund of UCAS, CRSRI Open Research Program (CKWV2015217/KY), and the Beijing Municipal Natural Science Foundation (Grant No. 4131004).
References
Avril, Q., Gouranton, V., and Amaldi, B. (2011). “Dynamic adaptation of broad phase collision detection algorithms.” Proc., 2011 IEEE Int. Symp. on VR Innovation (ISVRI), IEEE, Piscataway, NJ, 41–47.
Baskaran, M. M., and Bordawekar, R. (2008). “Optimizing sparse matrix-vector multiplication on GPUs.” IBM Tech. Rep. RC24704, IBM, Armonk, NY.
Bell, N., and Garland, M. (2008). “Efficient sparse matrix-vector multiplication on CUDA.” NVIDIA Tech. Rep. NVR-2008-004, NVIDIA Corporation, Santa Clara, CA.
Bolz, J., Farmer, I., Grinspun, E., and Schröoder, P. (2003). “Sparse matrix solvers on the GPU: Conjugate gradients and multigrid.” ACM Trans. Graphics, 22(3), 917–924.
Chen, X., Jie, Y. X., and Yu, Y. Z. (2013). “GPU accelerated iterative solutions for finite element analysis of soil-structure interaction problems.” Comput. Geosci., 17(4), 723–738.
Chen, Z. Y., et al. (2015). “Comparisons between centrifuge and numerical modeling results for slope toppling failure.” Sci. China Technol. Sci., 58(9), 1497–1508.
Cheng, X. L., Xiao, J., Miao, Q. H., and Wang, Y. (2015). “Design and implementation of a software architecture for 3D-DDA.” Sci. China Technol. Sci., 58(9), 1604–1608.
Chou, D. Y. (2000). GPGPU programming techniques from GLSL. CUDA to OpenCL, Machinery Industry Press, Beijing.
Cook, S. (2012). CUDA programming: A developer's guide to parallel computing with GPUs, Morgan Kaufmann, San Francisco.
Fu, Z., Lewis, T. J., Kirby, R. M., and Whitaker, R. T. (2014). “Architecting the finite element method pipeline for the GPU.” J. Comput. Appl. Math., 257(2), 195–211.
Geleri, F., Tosun, O., and Topcuoglu, H. (2013) “Parallelizing broad phase collision detection algorithms for sampling based path planners.” Proc., 2013 21st Euromicro Int. Conf. on Parallel, Distributed, and Network-Based Processing, IEEE, Piscataway, NJ, 384–391.
Göddeke, D., et al. (2007). “Exploring weak scalability for FEM calculations on a GPU-enhanced cluster.” Parallel Comput., 33(10–11), 685–699.
Göddeke, D., Strzodka, R., and Turek, S. (2005). “Accelerating double precision FEM simulations with GPUs.” Proc., ASIM 2005 18th Symp. on Simulation Technique, SCS European Publishing House, Erlangen, Germany, 139–144.
Jiao, Y. Y., Huang, G. H., Zhao, Z. Y., Zheng, F., and Wang, L. (2015). “An improved three-dimensional spherical DDA model for simulating rock failure.” Sci. China Technol. Sci., 58(9), 1533–1541.
Li, X. K., and Zheng, H. (2015). “Condensed form of complementarity formulation for discontinuous deformation analysis.” Sci. China Technol. Sci., 58(9), 1509–1519.
Lin, S. Z., Xu, H. W., and Xie, Z. Q. (2013). “Hybrid Programming Implementation of MPI+OpenMP on multicolor SSOR-PCG.” Comput. Aided Eng., 22(6), 79–83.
Lin, S. Z., and Xie, Z. Q. (2015). “Performance of DDA time integration.” Sci. China Technol. Sci., 58(9), 1558–1566.
Liu, H., Wu, B., and Li, Z. (2015). Preconditioned conjugate gradient method for static reanalysis with modifications of supports.” J. Eng. Mech., 04014111.
Luque, R. G., Comba, J. L. D., and Freitas, C. M. D. S. (2005). “Broad-phase collision detection using semi-adjusting BSP-trees.” Proc., Symp. on Interactive 3D Graphics, ACM, New York, 179–186.
Miao, Q. H., Huang, M., Wei, Q. (2009). “Parallel computing of numerical manifold with OpenMP.” Proc., 2009 IEEE Youth Conf. on Information, Computing and Telecommunications, IEEE, Piscataway, NJ, 343–346.
Miao, Q. H., Min, H., Xue, J., and Ben, Y. X. (2014). “Acceleration contact detection using spatial hashing for numerical manifold.” Geomech. Geoeng., 9(2), 153–159.
Nguyen, H. (2007). GPU gems 3, Addison Wesley, Boston.
OpenMP 2010 [Computer software]. 〈http://openmp.org/wp/openmp-compilers/〉.
OpenSees [Computer software]. Berkeley, CA, Pacific Earthquake Engineering Research Center.
Saad, Y. (2003). Iterative methods for sparse linear systems, Society for Industrial and Applied Mathematics, Philadelphia.
Shi, G. H. (1988). “Discontinuous deformation analysis: A new numerical model for the statics and dynamics of block systems.” Eng. Comput., 9(2), 157–168.
Shi, G. H. (1997). Working forum on manifold method of material analysis, the numerical manifold method and simplex integration, Vol. 2, Army Engineer Waterways Experiment Station, Geotechnical Laboratory, Vicksburg, MI.
Shi, G. H. (2013). “Basic equations of two dimensional and three dimensional contacts.” Proc., 47th U.S. Rock Mechanics/Geomechanics Symp., American Rock Mechanics Association, Alexandria, VA, 1504–1511.
Shi, G. H. (2015). “Contact theory.” Sci. China Technol. Sci., 58(9), 1450–1496.
Tian, Y., Xie, L., Xu, Z., and Lu, X. (2015). “GPU-powered high-performance computing for the analysis of large-scale structures based on OpenSees.” Proc, 2015 Int. Workshop on Computing in Civil Engineering, American Society of Civil Engineers, Reston, VA, 411–418.
Wang, L. X., Li, S. H., Zhang, G. X., Ma, Z. S., and Zhang, L. (2013). “A GPU-based parallel procedure for nonlinear analysis of complex structures using a coupled FEM/DEM approach.” Math. Prob. Eng., 618980.
Wilt, N. (2013). The CUDA handbook: A comprehensive guide to GPU programming, Addison Wesley Professional, Boston.
Zhang, G. X., Li, X., and Li, H. F. (2015). “Simulation of hydraulic fracture utilizing numerical manifold method.” Sci. China Technol. Sci., 58(9), 1542–1557.
Zhang, L., Zhang, G. X., Wang, L. X., Ma, Z. S., and Li, S. H. (2013). “A comparative study on different parallel solvers for nonlinear analysis of complex structures.” Math. Prob. Eng., 764237.
Zhang, Z. H., Miao, Q. H., and Wang, Y. (2009). “CUDA-based Jacobi’s iterative method.” Int. Forum on Computer Science Technology and Applications (IFCSTA 2009), IEEE, Piscataway, NJ, 25–27.
Information & Authors
Information
Published In
Copyright
© 2016 American Society of Civil Engineers.
History
Received: Jul 29, 2015
Accepted: Apr 8, 2016
Published online: Jun 17, 2016
Discussion open until: Nov 17, 2016
Published in print: May 1, 2017
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.