Technical Papers
Jan 29, 2024

Explainable Stacking-Based Learning Model for Traffic Forecasting

Publication: Journal of Transportation Engineering, Part A: Systems
Volume 150, Issue 4

Abstract

This paper implements a two-stage stacking ensemble model for traffic forecasting, with a focus on the interpretability of its predictions. The stacking model combines the strengths of its diverse component learners. Experiments on high-dimensional, sparse data show that it achieves higher predictive accuracy than baseline models, including LightGBM and XGBoost. Beyond predictive performance, the paper proposes an explanation model based on feature contributions. By integrating resampling and consensus clustering, this explanation model addresses the high dimensionality and sparsity prevalent in transportation engineering data, offering a scalable, stable, and computationally efficient solution suited to real-time and large-scale applications. The paper presents theoretical justification, experimental results, and empirical validation of the explanation model. Extensive experiments demonstrate greater stability than traditional Shapley additive explanations (SHAP) implementations such as Kernel SHAP. An investigation of the trade-off between stability and the computational cost of resampling provides guidance on configuration choices. The paper contributes to traffic flow prediction, with broad applicability to real-time and large-scale traffic management, and underscores the role of ensemble learning and interpretable machine learning in data-driven decision-making.

Data Availability Statement

The authors confirm that all data, models, and codes that support the findings of this study are available from the corresponding author upon reasonable request.

Acknowledgments

Chengyong Chen and Jinghan Liu contributed equally to this work and should be regarded as co-first authors.

Information & Authors

Information

Published In

Journal of Transportation Engineering, Part A: Systems
Volume 150, Issue 4, April 2024

History

Received: Aug 3, 2023
Accepted: Nov 15, 2023
Published online: Jan 29, 2024
Published in print: Apr 1, 2024
Discussion open until: Jun 29, 2024

Authors

Affiliations

Chengyong Chen
Shandong High-Speed Infrastructure Construction Co., Ltd., Shandong Expressway Mansion, No. 8 Longao North Rd., Lixia District, Jinan, Shandong 250098, China.
Jinghan Liu
School of Engineering and Applied Sciences, Univ. of Pennsylvania, Philadelphia, PA (corresponding author).
Yuexiang Li
Shandong High-Speed Infrastructure Construction Co., Ltd., Shandong Expressway Mansion, No. 8 Longao North Rd., Lixia District, Jinan, Shandong 250098, China.
Shandong High-Speed Infrastructure Construction Co., Ltd., Shandong Expressway Mansion, No. 8 Longao North Rd., Lixia District, Jinan, Shandong 250098, China.
