Explainable Stacking-Based Learning Model for Traffic Forecasting

Chen, Chengyong; Liu, Jinghan; Li, Yuexiang; Zhang, Yan

doi:10.1061/JTEPBS.TEENG-8208

Technical Papers

Jan 29, 2024

Explainable Stacking-Based Learning Model for Traffic Forecasting

Authors: Chengyong Chen [email protected], Jinghan Liu [email protected], Yuexiang Li [email protected], and Yan Zhang [email protected]Author Affiliations

Publication: Journal of Transportation Engineering, Part A: Systems

Volume 150, Issue 4

https://doi.org/10.1061/JTEPBS.TEENG-8208

Get Access

Abstract

This paper implements a two-staged ensemble learning model for traffic forecasting, focusing on the interpretability of predictions. The stacking model leverages the advantages of its diverse component learning models. Experiments on high-dimensional and sparse data validate the stacking model’s superior predictive accuracy compared to baseline models, including LightGBM and XGBoost. In addition to validating the stacking model’s outstanding predictive performance, this paper emphasizes the interpretability of its predictions by proposing an innovative explanation model based on feature contributions. This explanation model addresses the high dimension and sparsity in data prevalent in transportation engineering with its integration of resampling and consensus clustering, offering a scalable, stable, and computationally efficient solution ideal for real-time and large-scale applications. The paper presents theoretical justification, experimental results, and empirical validation of the interpretation model. Extensive experiments demonstrate the model’s enhanced stability compared to traditional shapley additive explanations (SHAP) implementations such as kernel SHAP. Investigating trade-offs between stability and computational efficiency of resampling provides insights for optimal configuration choices. This paper contributes to traffic flow prediction with broad applicability in real-time and large-scale traffic management scenarios, underscoring the vital role of ensemble learning and interpretable machine learning in contemporary data-driven decision making processes.

Get full access to this article

View all available purchase options and get full access to this article.

Get Access

Data Availability Statement

The authors confirm that all data, models, and codes that support the findings of this study are available from the corresponding author upon reasonable request.

Acknowledgments

Chengyong Chen and Jinghan Liu contributed to the work equally and should be regarded as cofirst authors.

References

Altmann, A., L. Toloşi, O. Sander, and T. Lengauer. 2010. “Permutation importance: A corrected feature importance measure.” Bioinformatics 26 (10): 1340–1347. https://doi.org/10.1093/bioinformatics/btq134.

Abstract

Get full access to this article

Data Availability Statement

Acknowledgments

References

Information

Published In

Copyright

History

Permissions

Authors

Affiliations

Metrics

Citations

Download citation

Get Access

Access content

Purchase

ASCE Library Card (5 downloads)

ASCE Library Card (5 downloads)

ASCE Library Card (20 downloads)

ASCE Library Card (20 downloads)

Buy Single Article

Buy Single Article

Get Access

Access content

Purchase

ASCE Library Card (5 downloads)

ASCE Library Card (5 downloads)

ASCE Library Card (20 downloads)

ASCE Library Card (20 downloads)

Buy Single Article

Buy Single Article

Figures

Other

Share

Copy the content Link

Share with email

Share

Request Username

Create a new account

Change Password

Password Changed Successfully

Verify Phone

Congrats!