Prediction of Traffic Incident Duration Using Clustering-Based Ensemble Learning Method
Publication: Journal of Transportation Engineering, Part A: Systems
Volume 148, Issue 7
Abstract
Traffic incidents are a primary cause of traffic delays, which can lead to severe economic losses. Effective traffic incident management requires integrating intelligent traffic systems, information dissemination, and accurate prediction of incident duration. This study develops a clustering-based machine learning model to predict incident duration. Unlike similar studies that train separate machine learning models for a fixed number of clusters, this study proposes an ensemble learning method built from multiple individual models trained on clustered data, which together provide accurate and diverse predictions. K-means clustering serves as the bootstrapping technique in the ensemble learning approach, and the individual models are artificial neural network and random forest regression models. The models are tested on incident data from Singapore, and the results show that the ensemble model outperforms both the traditional model with fixed clusters and the classical model without clustering. Additionally, this study assesses the significance of different variables for traffic incident duration using the random forest feature importance function. The prediction of incident duration and the analysis of influencing factors can contribute to several aspects of traffic management, such as improving traffic information dissemination to mitigate congestion caused by incidents.
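The general idea described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not say how the cluster counts are chosen or how the individual predictions are combined, so the set of cluster counts, the plain averaging, the fallback model, the synthetic data, and all names (`ClusteredRegressor`, `fit_ensemble`, `predict_ensemble`) are assumptions. Only random forests are used here; a neural network regressor could be substituted as the individual model.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.ensemble import RandomForestRegressor


class ClusteredRegressor:
    """Hypothetical helper: K-means partition with one random forest per cluster."""

    def __init__(self, n_clusters, seed=0):
        self.kmeans = KMeans(n_clusters=n_clusters, n_init=10, random_state=seed)
        self.seed = seed
        self.models = {}
        self.fallback = None

    def fit(self, X, y):
        labels = self.kmeans.fit_predict(X)
        # Global model, used only if a cluster has no model of its own.
        self.fallback = RandomForestRegressor(
            n_estimators=50, random_state=self.seed
        ).fit(X, y)
        for c in np.unique(labels):
            mask = labels == c
            rf = RandomForestRegressor(n_estimators=50, random_state=self.seed)
            self.models[int(c)] = rf.fit(X[mask], y[mask])
        return self

    def predict(self, X):
        # Route each sample to the model of its nearest cluster centre.
        labels = self.kmeans.predict(X)
        out = np.empty(len(X))
        for c in np.unique(labels):
            mask = labels == c
            model = self.models.get(int(c), self.fallback)
            out[mask] = model.predict(X[mask])
        return out


def fit_ensemble(X, y, cluster_counts=(1, 2, 3, 4)):
    """Train one clustered model per cluster count (assumed choice of counts)."""
    return [ClusteredRegressor(k, seed=k).fit(X, y) for k in cluster_counts]


def predict_ensemble(members, X):
    """Combine members by simple averaging (an assumption, not from the paper)."""
    return np.mean([m.predict(X) for m in members], axis=0)


# Synthetic stand-in for incident records: 3 features, duration in minutes.
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 3))
y = 30 + 20 * (X[:, 0] > 0) + 5 * X[:, 1] + rng.normal(scale=2, size=300)

members = fit_ensemble(X[:240], y[:240])
pred = predict_ensemble(members, X[240:])
mae = np.mean(np.abs(pred - y[240:]))

# Impurity-based feature importance from a single random forest.
rf = RandomForestRegressor(n_estimators=50, random_state=0).fit(X[:240], y[:240])
importances = rf.feature_importances_
print(f"ensemble MAE: {mae:.2f}", "importances:", np.round(importances, 3))
```

Varying the number of clusters gives each ensemble member a different partition of the training data, which is one way to obtain the diversity that the abstract attributes to using K-means as a bootstrapping step.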
Data Availability Statement
Some or all data, models, or code used during the study were provided by a third party. Direct requests for these materials may be made to the provider as indicated in the Acknowledgments. The data used in the paper were provided by the project grantor, whose permission is required before the data can be shared.
Acknowledgments
This study is supported by the research project A-0005277-03-00 funded by ST Engineering IHQ Pte Ltd. The authors would like to thank Dr. Lae Chung Khim and Dr. Li Ruijie for collecting and processing the data.
Copyright
© 2022 American Society of Civil Engineers.
History
Received: Nov 24, 2021
Accepted: Feb 15, 2022
Published online: May 3, 2022
Published in print: Jul 1, 2022
Discussion open until: Oct 3, 2022
Cited by
- Artur Grigorev, Adriana-Simona Mihăiţă, Khaled Saleh, Fang Chen, Automatic Accident Detection, Segmentation and Duration Prediction Using Machine Learning, IEEE Transactions on Intelligent Transportation Systems, 10.1109/TITS.2023.3323636, 25, 2, (1547-1568), (2024).
- Weiwei Qi, Bin Rao, Chuanyun Fu, A Novel Filtering Method of Travel-Time Outliers Extracted from Large-Scale Traffic Checkpoint Data, Journal of Transportation Engineering, Part A: Systems, 10.1061/JTEPBS.TEENG-8082, 150, 2, (2024).
- Huiping Li, Yunxuan Li, A Novel Explanatory Tabular Neural Network to Predicting Traffic Incident Duration Using Traffic Safety Big Data, Mathematics, 10.3390/math11132915, 11, 13, (2915), (2023).
- Dong Pan, Samer Hamdar, From Traffic Analysis to Real-Time Management: A Hazard-Based Modeling for Incident Durations Extracted Through Traffic Detector Data Anomaly Detection, Transportation Research Record: Journal of the Transportation Research Board, 10.1177/03611981231174445, 2678, 2, (389-400), (2023).
- Zehao Wang, Pengpeng Jiao, Jianyu Wang, Wei Luo, Huapu Lu, Improved Two-Layer Stacking Model for Prediction of the Level of Delay Caused by Crashes: An Empirical Analysis of Texas, Journal of Transportation Engineering, Part A: Systems, 10.1061/JTEPBS.TEENG-7577, 149, 2, (2023).
- Qiang Shang, Tian Xie, Yang Yu, Prediction of Duration of Traffic Incidents by Hybrid Deep Learning Based on Multi-Source Incomplete Data, International Journal of Environmental Research and Public Health, 10.3390/ijerph191710903, 19, 17, (10903), (2022).
- Naima Islam, Emmanuel K. Adanu, Alexander M. Hainen, Steve Burdette, Randy Smith, Steven Jones, Evaluating the Impact of Freeway Service Patrol on Incident Clearance Times: A Spatial Transferability Test, Journal of Advanced Transportation, 10.1155/2022/5272747, 2022, (1-10), (2022).