Analysis of Crash Data Using Quantile Regression for Counts
Publication: Journal of Transportation Engineering
Volume 140, Issue 4
Abstract
Statistical models that relate crash frequency to its influencing factors have been studied widely over the last three decades. Most existing methodologies use count-data models and their variants to study the mean effects of covariates on crash frequency. This study explores quantile regression for counts as a methodological alternative for analyzing crash frequency. Compared with existing models, the proposed model provides a fuller and more robust analysis of crash data for at least two reasons. First, crash data usually follow typical count distributions with a large proportion of zeros and with the remaining values highly skewed to the right. This makes quantile regression appealing: because it allows various quantiles of a population to be estimated, it provides more comprehensive information about the effects of covariates on crash frequency than the mean alone. Second, as a semiparametric technique, quantile regression for counts relaxes restrictions on the form of the distribution function of the response variable, resulting in more robust estimation. In addition, two prediction methods are proposed that use these analysis results to yield better point predictions. To illustrate the application of quantile regression, 2002 crash data for urban interstate highways in Washington State were extracted from the Highway Safety Information System (HSIS) and analyzed with the proposed model. The analysis results and prediction performance were then compared with those from the negative binomial regression model. The numerical case study shows that although the significance and signs of the effects derived from both models are consistent, the proposed quantile regression model reveals in more detail how the marginal effects of covariates change across the conditional distribution of the response variable and provides more robust and accurate predictions of crash counts.
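The contrast the abstract draws between mean effects and quantile effects can be made concrete with a small sketch. It is not the authors' implementation, and it does not use the HSIS data. It assumes the jittering construction commonly used for count quantiles: add uniform noise to each count so the response becomes continuous, estimate the quantile of the jittered variable, average over several noise draws, and map the result back to a count. To stay dependency-free it uses a single binary covariate, in which case the quantile regression reduces to group-wise quantiles. The group names, sample sizes, and negative binomial parameters are all hypothetical.

```python
import math
import random


def poisson(rng, lam):
    """Draw one Poisson(lam) variate with Knuth's multiplication method."""
    limit, k, p = math.exp(-lam), 0, 1.0
    while True:
        p *= rng.random()
        if p <= limit:
            return k
        k += 1


def simulate_counts(rng, n, mean, shape):
    """Simulated Poisson-gamma (negative binomial) counts: many zeros, long right tail."""
    return [poisson(rng, rng.gammavariate(shape, mean / shape)) for _ in range(n)]


def sample_quantile(values, tau):
    """Order-statistic sample quantile (no interpolation)."""
    ordered = sorted(values)
    return ordered[max(0, math.ceil(tau * len(ordered)) - 1)]


def jittered_count_quantile(rng, counts, tau, draws=50, floor=1e-5):
    """Estimate the tau-quantile of a count variable by jittering.

    Each draw forms Z = Y + U with U ~ Uniform[0, 1), takes the sample
    tau-quantile of Z, and stores gamma = log(Q_Z - tau), floored to avoid
    log of a non-positive number. The gammas are averaged over draws, and
    the count quantile is recovered as ceil(Q_Z - 1).
    """
    gammas = []
    for _ in range(draws):
        jittered = [y + rng.random() for y in counts]
        q_z = sample_quantile(jittered, tau)
        gammas.append(math.log(max(q_z - tau, floor)))
    q_z_bar = tau + math.exp(sum(gammas) / draws)
    return math.ceil(q_z_bar - 1)


# Hypothetical data: two groups of road segments that differ in mean crash count.
rng = random.Random(0)
groups = {
    "low_traffic": simulate_counts(rng, 2000, mean=0.8, shape=0.5),
    "high_traffic": simulate_counts(rng, 2000, mean=3.0, shape=0.5),
}
taus = (0.25, 0.50, 0.75, 0.90)
results = {
    name: {tau: jittered_count_quantile(rng, ys, tau) for tau in taus}
    for name, ys in groups.items()
}

for name, ys in groups.items():
    print(name, "mean=%.2f" % (sum(ys) / len(ys)), "quantiles:", results[name])
```

In data of this shape the gap between the two groups is typically small or absent at the lower quantiles, where zeros dominate, and much wider in the upper tail, which is the kind of detail a single mean effect cannot show. With continuous or multiple covariates, the group-wise quantile step would be replaced by a linear quantile regression on the transformed jittered response.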
Information
Copyright
© 2013 American Society of Civil Engineers.
History
Received: Apr 23, 2013
Accepted: Nov 20, 2013
Published online: Dec 31, 2013
Published in print: Apr 1, 2014
Discussion open until: May 31, 2014