Correlated Random Parameter Marginalized Two-Part Model: Application to Refined-Scale Longitudinal Crash Rates Data
Publication: Journal of Transportation Engineering, Part A: Systems
Volume 144, Issue 2
Abstract
Crash rate data are mainly analyzed using the Tobit model. However, there are three major limitations associated with the Tobit model when it is applied to crash data: (1) the assumption that zeros are originated from the data generating process, (2) the presumption of a normal distribution of the latent response variable, and (3) the Tobit proportionality assumption. Moreover, unobserved heterogeneities are usually present, which lead to biased results in crash analyses. To address these limitations, the marginalized two-part model with random parameter specification is proposed as an alternative to the Tobit model. For comparison purposes, four models are developed: (1) Tobit model, (2) fixed parameter marginalized two-part (FPMTP) model, (3) uncorrelated random parameter marginalized two-part (URPMTP) model, and (4) correlated random parameter marginalized two-part (CRPMTP) model. The proposed methodology is demonstrated by investigating daily crash rates on a major freeway in Colorado. Model estimation results show that marginalized two-part models outperform the Tobit model, exhibiting good potential for future adoption when studying crash rates. Among the three two-part models, CRPMTP outperforms the other two, indicating that the correlated random parameter model can better capture the unobserved heterogeneities. Furthermore, the time-varying variables, including traffic and weather variables, are also found to play a significant role in crash occurrence.
Get full access to this article
View all available purchase options and get full access to this article.
Acknowledgments
This study was partially supported by the U.S. Department of Transportation (through the Mountain Plains Consortium). The real-time monitoring and crash data provided by the Colorado Department of Transportation and Colorado State Patrol are greatly appreciated. The content of this paper reflects the views of the authors, who are responsible for the facts and the accuracy of the information presented.
References
Anastasopoulos, P. C., Mannering, F. L., Shankar, V. N., and Haddock, J. E. (2012). “A study of factors affecting highway accident rates using the random-parameters Tobit model.” Accid. Anal. Prev., 45, 628–633.
Anastasopoulos, P. C., Tarko, A. P., and Mannering, F. L. (2008). “Tobit analysis of vehicle accident rates on interstate highways.” Accid. Anal. Prev., 40(2), 768–775.
Arabmazar, A., and Schmidt, P. (1982). “An investigation of the robustness of the Tobit esitmator to non-normality.” Econometrica, 50(4), 1055–1063.
Belasco, E. J., and Ghosh, S. K. (2012). “Modelling semi-continuous data using mixture regression models with an application to cattle production yields.” J. Agric. Sci., 150(1), 109–121.
Bera, A. K., Jarque, C. M., and Lee, L. (1984). “Testing the normality assumption in limited dependent variable models.” Int. Econ. Rev. (Philadelphia), 25(3), 563–578.
Chen, F., Ma, X., and Chen, S. (2014). “Refined-scale panel data crash rate analysis using random-effects Tobit model.” Accid. Anal. Prev., 73, 323–332.
Conway, K. S., and Kniesner, T. J. (1991). “The important econometric features of a linear regression model with cross-correlated random coefficients.” Econ. Lett., 35(2), 143–147.
Cragg, J. G. (1971). “Some statistical models for limited dependent variables with application to the demand for durable goods.” Econometrica, 39(5), 829–844.
Duan, N., Manning, W. G., Morris, C. N., and Newhouse, J. P. (1983). “A comparison of alternative models for the demand for medical care.” J. Bus. Econ. Stat., 1(2), 115–126.
Dulisse, B. (1997). “Methodological issues in testing the hypothesis of risk compensation.” Accid. Anal. Prev., 29(3), 285–292.
El-Basyouny, K., Barua, S., and Islam, T. (2014). “Investigation of time and weather effects on crash types using full Bayesian multivariate Poisson lognormal models.” Accid. Anal. Prev., 73, 91–99.
Liu, L., Strawderman, R. L., Johnson, B. A., and O’Quigley, J. M. (2016). “Analyzing repeated measures semi-continuous data, with application to an alcohol dependence study.” Stat. Methods Med. Res., 25(1), 133–152.
Lord, D., and Mannering, F. (2010). “The statistical analysis of crash-frequency data: A review and assessment of methodological alternatives.” Transp. Res. Part A Policy Pract., 44(5), 291–305.
Ma, J., and Kockelman, K. M. (2006). “Bayesian multivariate Poisson regression for models of injury count, by severity.” Transp. Res. Rec., 1950, 24–34.
Ma, J., Kockelman, K. M., and Damien, P. (2008). “A multivariate Poisson-lognormal regression model for prediction of crash counts by severity, using Bayesian methods.” Accid. Anal. Prev., 40(3), 964–975.
Ma, X., Chen, F., and Chen, S. (2015). “Modeling crash rates for a mountainous highway using refined-scale panel data.” Transp. Res. Rec., 2515, 10–16.
Ma, X., Chen, S., and Chen, F. (2016). “Correlated random-effects bivariate Poisson lognormal model to study single-vehicle and multivehicle crashes.” J. Transp. Eng., 04016049.
Maddala, G. S., and Lahiri, K. (1992). Introduction to econometrics, 2nd Ed., Macmillan, New York.
Mannering, F. L., and Bhat, C. R. (2014). “Analytic methods in accident research: Methodological frontier and future directions.” Anal. Methods Accid. Res., 1, 1–22.
Mannering, F. L., Shankar, V., and Bhat, C. R. (2016). “Unobserved heterogeneity and the statistical analysis of highway accident data.” Anal. Methods Accid. Res., 11, 1–16.
National Highway Traffic Safety Administration. (2010). “Traffic safety facts 2010: A compilation of motor vehicle crash data from the fatality analysis reporting system and the general estimates system.” U.S. Dept. of Transportation, Washington, DC.
Pei, X., Wong, S. C., and Sze, N. N. (2012). “The roles of exposure and speed in road safety analysis.” Accid. Anal. Prev., 48, 464–471.
Pinheiro, J. C., and Bates, D. M. (1995). “Approximations to the log-likelihood function in the nonlinear mixed-effects model.” J. Comput. Graph. Stat., 4(1), 12–35.
Qi, Y., Smith, B. L., and Guo, J. (2007). “Freeway accident likelihood prediction using a panel data analysis approach.” J. Transp. Eng., 149–156.
Sigelman, L., and Zeng, L. (1999). “Analyzing censored and sample-selected data with Tobit and Heckit models.” Polit. Anal., 8(02), 167–182.
Smith, D. A., and Brame, R. (2003). “Tobit models in social science research: Some limitations and a more general alternative.” Sociol. Methods Res., 31(3), 364–388.
Smith, V. A., Preisser, J. S., Neelon, B., and Maciejewski, M. L. (2014). “A marginalized two-part model for semicontinuous data.” Stat. Med., 33(28), 4891–4903.
Su, L., Tom, B. D. M., and Farewell, V. T. (2009). “Bias in 2-part mixed models for longitudinal semicontinuous data.” Biostatistics, 10(2), 374–389.
Tooze, J. A., Grunwald, G. K., and Jones, R. H. (2002). “Analysis of repeated measures data with clumping at zero.” Stat. Methods Med. Res., 11(4), 341–355.
Usman, T., Fu, L., and Miranda-Moreno, L. F. (2010). “Quantifying safety benefit of winter road maintenance: Accident frequency modeling.” Accid. Anal. Prev., 42(6), 1878–1887.
Usman, T., Fu, L., and Miranda-Moreno, L. F. (2011). “Accident prediction models for winter road safety.” Transp. Res. Rec., 2237, 144–151.
Usman, T., Fu, L., and Miranda-Moreno, L. F. (2012). “A disaggregate model for quantifying the safety effects of winter road maintenance activities at an operational level.” Accid. Anal. Prev., 48, 368–378.
Venkataraman, N., Shankar, V., Ulfarsson, G. F., and Deptuch, D. (2014). “A heterogeneity-in-means count model for evaluating the effects of interchange type on heterogeneous influences of interstate geometrics on crash frequencies.” Anal. Methods Accid. Res., 2, 12–20.
Venkataraman, N., Ulfarsson, G. F., and Shankar, V. N. (2013). “Random parameter models of interstate crash frequencies by severity, number of vehicles involved, collision and location type.” Accid. Anal. Prev., 59, 309–318.
Venkataraman, N., Ulfarsson, G. F., Shankar, V. N., Oh, J., and Park, M. (2011). “Model of relationship between interstate crash occurrence and geometrics.” Transp. Res. Rec., 2236, 41–48.
Vuong, Q. H. (1989). “Likelihood ratio tests for model selection and non-nested hypotheses.” Econometrica, 57(2), 307–333.
Washington, S. P., Karlaftis, M. G., and Mannering, F. L. (2011). Statistical and econometric methods for transportation data analysis, 2nd Ed., CRC Press, New York.
Xiong, Y., and Mannering, F. L. (2013). “The heterogeneous effects of guardian supervision on adolescent driver-injury severities: A finite-mixture random-parameters approach.” Transp. Res. Part B Methodol., 49, 39–54.
Yu, R., Xiong, Y., and Abdel-Aty, M. (2015). “A correlated random parameter approach to investigate the effects of weather conditions on crash risk for a mountainous freeway.” Transp. Res. Part C Emerg. Technol., 50, 68–77.
Information & Authors
Information
Published In
Copyright
©2017 American Society of Civil Engineers.
History
Received: Oct 12, 2016
Accepted: Jul 11, 2017
Published online: Nov 17, 2017
Published in print: Feb 1, 2018
Discussion open until: Apr 17, 2018
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.