Technical Papers
Dec 23, 2011

Application of GLASSO in Variable Selection and Crash Prediction at Unsignalized Intersections

Publication: Journal of Transportation Engineering
Volume 138, Issue 7

Abstract

In this paper, a new promising variable screening technique is proposed to select important covariates and to improve crash prediction; the group least absolute shrinkage and selection operator (GLASSO). The GLASSO’s main power lies in its ability to deal with data sets havinga large number of categorical variables, the case in this study. Identifying the significant factors affecting the safety of unsignalized intersections was also an essential objective. Two applications of GLASSO were investigated: before fitting the negative binomial (NB) model, and before fitting the promising multivariate adaptive regression splines (MARS) technique using extensive data representing 2,475 unsignalized intersections. Regarding the NB models, GLASSO yielded close prediction capability to the backward deletion and random forest techniques. Also, MARS model fitting after using GLASSO relatively outperformed that after using random forest, with similar prediction performance. Because of its outstanding performance with categorical variables and its simplicity, GLASSO is recommended as a promising variable selection technique. Some significant predictors affecting unsignalized intersections’ safety were traffic volume on the major road, upstream and downstream distances to the nearest signalized intersection, and median type on major and minor approaches.

Get full access to this article

View all available purchase options and get full access to this article.

Acknowledgments

The authors would like to thank the Florida DOT for providing sufficient resources for collecting the data used in this study.

References

Abdel-Aty, M., and Haleem, K. (2011). “Analyzing angle crashes at unsignalized intersections using machine learning techniques.” Accid. Anal. Prev., 43(1), 461–470.
Abdel-Aty, M., and Keller, J. (2005). “Exploring the overall and specific crash severity levels at signalized intersections.” Accid. Anal. Prev., 37(3), 417–425.
Abdel-Aty, M., and Radwan, E. (2000). “Modeling traffic accident occurrence and involvement.” Accid. Anal. Prev., 32(5), 633–642.
Anastasopoulos, P., and Mannering, F. (2009). “A note on modeling vehicle accident frequencies with random-parameters count models.” Accid. Anal. Prev., 41(1), 153–159.
Bach, F. R. (2008). “Consistency of the group lasso and multiple kernel learning.” J. Mach. Learn. Res., 9, 1179–1225.
Bauer, K., and Harwood, D. (1996). “Statistical models of at-grade intersection accidents.” Rep. No. FHWA-RD-96-125, Federal Highway Administration, Washington, DC.
Breiman, L. (1995). “Better subset regression using the nonnegative garrote.” TechnometricsTCMTA2, 37(4), 373–384.
Breiman, L. (2001). “Random forests.” Mach. Learn.MALEEZ, 45(1), 5–32.
Chen, S. S., Donoho, D., and Saunders, M. (1998). “Atomic decomposition by basis pursuit.” SIAM J. Sci. Statist. Comput.SIJCD, 20(1), 33–61.
Crash Analysis Reporting System. (2008). 〈http://tlhost01.dot.state.fl.us/bluezone/FDOT_Session/default.htm〉 (Jan. 20, 2008).
Dissanayake, S., and Lu, J. (2003). “Access management techniques to improve traffic operations and safety: A case study of a full vs. directional median opening.” Proc., 2003 Mid-Continent Transportation Research Symp., Iowa State Univ., Ames, IA.
Florida DOT. (2006). “Florida strategic highway safety plan.” Tallahassee, FL.
Friedman, J. (1991). “Multivariate adaptive regression splines.” Ann. Stat.ASTSC7, 19(1), 1–141.
Friedman, J., Hastie, T., and Tibshirani, R. (2010). “Applications of the lasso and grouped lasso to the estimation of sparse graphical models.” Technical Rep., Stanford University.
Google Earth. (2008). 〈http://earth.google.com/〉 (Jan. 20, 2008).
Haleem, K. (2009). “Comprehensive analytical investigation of the safety of unsignalized intersections.” Ph.D., dissertation, Univ. of Central Florida, Orlando, FL.
Haleem, K., and Abdel-Aty, M. (2010). “Examining traffic crash injury severity at unsignalized intersections.” J. Saf. Res.JSFRAV, 41(4), 347–357.
Haleem, K., Abdel-Aty, M., and Mackie, K. (2010a). “Using a reliability process to reduce uncertainty in predicting crashes at unsignalized intersections.” Accid. Anal. Prev., 42(2), 654–666.
Haleem, K., Abdel-Aty, M., and Santos, J. (2010b). “Multiple applications of the multivariate adaptive regression splines in predicting rear-end crashes at unsignalized intersections.” Transp. Res. Rec.TRREDM, 2165, 33–41.
Harb, R., Yan, X., Radwan, E., and Su, X. (2009). “Exploring precrash maneuvers using classification trees and random forests.” Accid. Anal. Prev., 41(1), 98–107.
Harwood, D., Council, F., Hauer, E., Hughes, W., and Vogt, A. (2000). “Prediction of the expected safety performance of rural two-lane highways.” Final Rep., FHWA-RD-99-207, Federal Highway Administration, Washington, DC.
Hastie, T., Tibshirani, R., and Friedman, J. (2001). “The elements of statistical learning: Data mining, inference, and prediction.” Springer series in statistics, New York.
Li, X., Lord, D., Zhang, Y., and Xie, Y. (2008). “Predicting motor vehicle crashes using support vector machine models.” Accid. Anal. Prev., 40(4), 1611–1618.
Liu, H., and Zhang, J. (2009). “Estimation consistency of the group lasso and its applications.” Proc., of the 12th Int. Conf. on Artificial Intelligence and Statistics, Clearwater Beach, FL.
Miaou, S. (1994). “The relationship between truck accidents and geometric design of road section: Poisson versus negative binomial regression.” Accid. Anal. Prev., 26(4), 471–482.
Miaou, S. (1996). “Measuring the goodness-of-fit of accident prediction models.” FHWA-RD-96-040, Federal Highway Administration, Washington, DC.
Obozinski, G., Wainwright, M., and Jordan, M. (2008). “High-dimensional union support recovery in multivariate regression.” Advances in neural information processing systems (NIPS), 22, MIT Press, Vancouver, BC, Canada.
Put, R., Xu, Q., Massart, D., and Heyden, Y. (2004). “Multivariate adaptive regression splines (mars) in chromatographic quantitative structure—retention relationship studies.” J. Chromatogr. AJCRAEY, 1055(1-2), 11–19.
R Software Package. (2010). 〈http://www.r-project.org/〉 (Apr. 28, 2010).
Roadway Characteristic Inventory. (2008) 〈http://webapp01.dot.state.fl.us/Login/default.asp〉 (Feb. 10, 2008).
Roth, V., and Fischer, B. (2008). “The group-lasso for generalized linear models: Uniqueness of solutions and efficient algorithms.” Proc., 25th Int. Conf. on Machine Learning, Association for Computing Machinery, New York.
SAS System 9 [Computer Software] Cary, NC, SAS Institute Inc.
Sawalha, Z., and Sayed, T. (2006). “Transferability of accident prediction models.” J. Saf. Sci.ZASKBO, 44(3), 209–219.
Stewart, J. (1996). “Applications of classification and regression tree methods in roadway safety studies.” Transportation Research Record 1542., Transportation Research Board, Washington, DC, 1–5.
Tibshirani, R. (1996). “Regression shrinkage and selection via the lasso.” J. R. Stat. Soc. Ser. BJSTBAJ, 58(1), 267–288.
Transportation Research Board of the National Academies. (2010). “Highway safety manual.” 1st Ed., 〈http://www.highwaysafetymanual.org/〉 (May 27, 2010)
Video Log Viewer Application. (2008). 〈http://webapp01.dot.state.fl.us/videolog/〉 (Feb. 10, 2008).
Vogt, A., and Bared, J. (1998). “Accident models for two-lane rural roads: Segments and intersections.” FHWA-RD-98-133, No. 3A5A, Final Rep., Federal Highway Administration, Washington, DC.
Wang, X., and Abdel-Aty, M. (2006). “Temporal and spatial analyses of rear-end crashes at signalized intersections.” Accid. Anal. Prev., 38(6), 1137–1150.
Xie, Y., Lord, D., and Zhang, Y. (2007). “Predicting motor vehicle collisions using bayesian neural network models: An empirical analysis.” Accid. Anal. Prev., 39(5), 922–933.
Yuan, M., and Lin, Y. (2004). “Model selection and estimation in regression with grouped variables.” Rep. No.1095, Dept. of Statistics, Univ. of Wisconsin, Madison, WI.
Yuan, M., and Lin, Y. (2006). “Model selection and estimation in regression with grouped variables.” J. R. Stat. Soc., Series B 68(1), 49–67.
Zuur, A., Ieno, E., and Smith, G. (2007). “Analyzing ecological data.” Statistics for Biology and Health, Springer, New York.

Information & Authors

Information

Published In

Go to Journal of Transportation Engineering
Journal of Transportation Engineering
Volume 138Issue 7July 2012
Pages: 949 - 960

History

Received: Nov 15, 2010
Accepted: Dec 20, 2011
Published online: Dec 23, 2011
Published in print: Jul 1, 2012

Permissions

Request permissions for this article.

Authors

Affiliations

Kirolos Haleem [email protected]
Post-doctoral Research Fellow, Dept. of Civil and Environmental Engineering, Florida International Univ., 10555 W Flagler Street, EC 3680, Miami, FL 33174 (corresponding author). E-mail: [email protected]
Mohamed Abdel-Aty [email protected]
Professor and Graduate Program Director, Dept. of Civil, Environmental & Construction Engineering, Univ. of Central Florida, 4000 Central FL Blvd, Orlando, FL 32816-2450. E-mail: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

Cited by

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share