Technical Papers
Feb 27, 2024

Novel Hybrid Spatiotemporal Convolution Neural Network Model for Short-Term Passenger Flow Prediction in a Large-Scale Metro System

Publication: Journal of Transportation Engineering, Part A: Systems
Volume 150, Issue 5

Abstract

Accurate and reliable prediction of subway passenger flow is a particularly challenging application of spatiotemporal forecasting, due to the time-varying travel patterns and the complex spatial dependencies on subway networks. To address these challenges, this study proposes a novel spatiotemporal graph convolutional bidirectional long short-term memory neural network model combined with an attention mechanism (At-STGCN-BiLSTM) to better predict short-term passenger flow for all stations in a large-scale metro system. The STGCN-BiLSTM aims to capture the attributes of spatiotemporal characteristics of subway stations, and the attention mechanism helps account for the correlation between historical data and current moment inbound passenger flow. The performance of the short-time passenger flow forecast model is analyzed by different time intervals. Experimental results show that the proposed model outperforms baseline models on Wuhan, China, subway data. The value of root-mean square error (RMSE) and mean absolute error (MAE) decreased by 7.33% and 9.38%, respectively, compared with the baseline models at the 15-min interval. The attention mechanism in the proposed model can effectively improve the prediction capability of peak and nonperiodic passenger flow variations. The research not only is of great help to the passenger flow organization and emergency management of the subway, but also plays a vital role in the work of rail transit regulation, rail transit alarm release, and service efficiency improvement.

Practical Applications

This research introduces a novel approach for predicting subway passenger flow, offering valuable insights for both transport authorities and commuters. By considering the spatiotemporal dynamics of passenger movement and incorporating an attention mechanism, the proposed model enhances short-term flow predictions. In practical terms, this means more accurate estimations of passenger numbers, travel times, and congestion levels for subway stations. The model’s adaptability to varying scenarios, including peak hours and unexpected disruptions, ensures reliable real-time predictions. For transit operators, this model aids in optimizing resource allocation, enhancing service efficiency, and facilitating emergency management. Commuters benefit from improved trip planning and a smoother travel experience. Beyond the subway context, the methodology’s fusion of advanced techniques showcases its potential to inform broader transportation systems and urban planning.

Get full access to this article

View all available purchase options and get full access to this article.

Data Availability Statement

The data used to support the findings are available from the corresponding author upon request.

Acknowledgments

This work was supported by National Social Science Fund Project of China (21FGLB014) and the Project of Construction and Support for High-Level Innovative Teams of Beijing Municipal Institutions (BPHR20220109).
Author contributions: Zhihong Li: study conception and design, and writing–original draft. Hua Cai: study conception and design, analysis and interpretation of results, and writing–original draft. Han Xu: study conception and design, and data collection. Xiaoyu Wang: data collection and analysis and interpretation of results. All authors reviewed the results and approved the final version of the manuscript.

References

Bahdanau, D., K. Cho, and Y. Bengio. 2014. “Neural machine translation by jointly learning to align and translate.” Preprint, submitted September 1, 2014. https://arxiv.org/abs/1409.0473.
Chen, Y., P. Xu, Y. Chu, W. Li, Y. Wu, L. Ni, Y. Bao, and K. Wang. 2017. “Short-term electrical load forecasting using the support vector regression (SVR) model to calculate the demand response baseline for office buildings.” Appl. Energy 195 (Jun): 659–670. https://doi.org/10.1016/j.apenergy.2017.03.034.
Defferrard, M., X. Bresson, and P. Vandergheynst. 2016. “Convolutional neural networks on graphs with fast localized spectral filtering.” In Proc., Advances in Neural Information Processing Systems, 3844–3852. Red Hook, NY: Curran Associates Inc.
Deng, J., L. Cheng, and Z. Wang. 2021. “Attention-based BiLSTM fused CNN with gating mechanism model for Chinese long text classification.” Comput. Speech Lang. 68 (Jul): 101182. https://doi.org/10.1016/j.csl.2020.101182.
Fu, R., Z. Zhang, and L. Li. 2016. “Using LSTM and GRU neural network methods for traffic flow prediction.” In Proc., 2016 31st Youth Academic Annual Conf. of Chinese Association of Automation (YAC), 324–328. New York: IEEE.
Guo, J., Z. Xie, Y. Qin, L. Jia, and Y. Wang. 2019. “Short-term abnormal passenger flow prediction based on the fusion of SVR and LSTM.” IEEE Access 7 (Mar): 42946–42955. https://doi.org/10.1109/ACCESS.2019.2907739.
Hao, S., D.-H. Lee, and D. Zhao. 2019. “Sequence to sequence learning with attention mechanism for short-term passenger flow prediction in large-scale metro system.” [In Chinese.] Transp. Res. Part C Emerging Technol. 107 (Oct): 287–300. https://doi.org/10.1016/j.trc.2019.08.005.
Hao, X., L. Jiayong, and W. Junfeng. 2021. “Network traffic prediction based on neural network and autoregressive model.” [In Chinese.] Supplement, Comput. Appl. 41 (S1): 180–184.
Hinton, G. E., and R. R. Salakhutdinov. 2006. “Reducing the dimensionality of data with neural networks.” Science 313 (5786): 504–507. https://doi.org/10.1126/science.1127647.
Hochreiter, S., and J. Schmidhuber. 1997. “Long short-term memory.” Neural Comput. 9 (8): 1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735.
Kingma, D. P., and J. Ba. 2014. “Adam: A method for stochastic optimization.” Preprint, submitted December 22, 2014. https://arxiv.org/abs/1412.6980.
Kumar, S. V., and L. Vanajakshi. 2015. “Short-term traffic flow prediction using seasonal ARIMA model with limited input data.” Eur. Transport Res. Rev. 7 (3): 21. https://doi.org/10.1007/s12544-015-0170-8.
Kumar Dubey, A., A. Kumar, V. García-Díaz, A. Kumar Sharma, and K. Kanhaiya. 2021. “Study and analysis of SARIMA and LSTM in forecasting time series data.” Sustainable Energy Technol. Assess. 47 (Oct): 101474. https://doi.org/10.1016/j.seta.2021.101474.
Li, Z., X. Wang, and K. Yang. 2023. “An effective self-attention-based hybrid model for short-term traffic flow prediction.” Adv. Civ. Eng. 2023: 1–10. https://doi.org/10.1155/2023/9308576.
Li, Z., H. Xu, X. Gao, Z. Wang, and W. Xu. 2022. “Fusion attention mechanism bidirectional LSTM for short-term traffic flow prediction.” J. Intell. Transp. Syst. 1–14. https://doi.org/10.1080/15472450.2022.2142049.
Liu, Y., Z. Liu, and R. Jia. 2019. “DeepPF: A deep learning based architecture for metro passenger flow prediction.” Transp. Res. Part C Emerging Technol. 101 (Apr): 18–34. https://doi.org/10.1016/j.trc.2019.01.027.
Lu, W., Y. Zhang, P. Li, and T. Wang. 2023. “Mul-DesLSTM: An integrative multi-time granularity deep learning prediction method for urban rail transit short-term passenger flow.” Eng. Appl. Artif. Intell. 125 (Oct): 106741. https://doi.org/10.1016/j.engappai.2023.106741.
Ma, C., G. Dai, and J. Zhou. 2021. “Short-term traffic flow prediction for urban road sections based on time series analysis and LSTM_BILSTM method.” IEEE Trans. Intell. Transp. Syst. 23 (6): 5615–5624. https://doi.org/10.1109/TITS.2021.3055258.
Ma, X., Z. Dai, Z. He, J. Ma, Y. Wang, and Y. Wang. 2017. “Learning traffic as images: A deep convolutional neural network for large-scale transportation network speed prediction.” Sensors 17 (4): 818. https://doi.org/10.3390/s17040818.
Ma, X., Z. Tao, Y. Wang, H. Yu, and Y. Wang. 2015. “Long short-term memory neural network for traffic speed prediction using remote microwave sensor data.” Transp. Res. Part C Emerging Technol. 54 (May): 187–197. https://doi.org/10.1016/j.trc.2015.03.014.
Meng, P., X. Li, H. Jia, and Y. Li. 2018. “Short-time rail transit passenger flow real-time prediction based on moving average.” J. Jilin Univ. 48 (2): 448–453.
Shekhar, S., and B. M. Williams. 2007. “Adaptive seasonal time series models for forecasting short-term traffic flow.” Transp. Res. Rec. 2024 (1): 116–125. https://doi.org/10.3141/2024-14.
Siami-Namini, S., N. Tavakoli, and A. S. Namin. 2019. “A comparative analysis of forecasting financial time series using ARIMA, LSTM, and BiLSTM.” Preprint, submitted November 21, 2019. https://arxiv.org/abs/1911.09512.
Tang, Q., M. Yang, and Y. Yang. 2019. “ST-LSTM: A deep learning approach combined spatio-temporal features for short-term forecast in rail transit.” J. Adv. Transport 2019 (Feb): 1–8. https://doi.org/10.1155/2019/8392592.
Tedjopurnomo, D. A., Z. Bao, B. Zheng, F. Choudhury, and A. K. Qin. 2020. “A survey on modern deep neural network for traffic prediction: Trends, methods and challenges.” IEEE Trans. Knowl. Data Eng. 34 (4): 1544–1561. https://doi.org/10.1109/TKDE.2020.3001195.
Vapnik, V. N. 1999. “An overview of statistical learning theory.” IEEE Trans. Neural Networks 10 (5): 988–999. https://doi.org/10.1109/72.788640.
Wang, J., Y. Zhang, Y. Wei, Y. Hu, X. Piao, and B. Yin. 2021. “Metro passenger flow prediction via dynamic hypergraph convolution networks.” IEEE Trans. Intell. Transp. Syst. 22 (12): 7891–7903. https://doi.org/10.1109/TITS.2021.3072743.
Xue, Q., W. Zhang, M. Ding, X. Yang, J. Wu, and Z. Gao. 2023. “Passenger flow forecasting approaches for urban rail transit: A survey.” Int. J. Gen. Syst. 52 (8): 919–947. https://doi.org/10.1080/03081079.2023.2231133.
Yang, B., S. Sun, J. Li, X. Lin, and Y. Tian. 2019. “Traffic flow prediction using LSTM with feature enhancement.” Neurocomputing 332 (Mar): 320–327. https://doi.org/10.1016/j.neucom.2018.12.016.
Yi, P., F. Huang, J. Wang, and J. Peng. 2023. “Topology augmented dynamic spatial-temporal network for passenger flow forecasting in urban rail transit.” Appl. Intell. 53 (21): 24655–24670. https://doi.org/10.1007/s10489-023-04651-z.
Yu, B., H. Yin, and Z. Zhu. 2017. “Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting.” Preprint, submitted September 14, 2017. https://arxiv.org/abs/1709.04875.
Yu, F., D. Wei, S. Zhang, and Y. Shao. 2019. “3D CNN-based accurate prediction for large-scale traffic flow.” In Proc., 2019 4th Int. Conf. on Intelligent Transportation Engineering (ICITE), 99–103. New York: IEEE.
Zarei, N., M. A. Ghayour, and S. Hashemi. 2013. “Road traffic prediction using context-aware random forest based on volatility nature of traffic flows.” In Proc., Asian Conf. on Intelligent Information and Database Systems, 196–205. Berlin: Springer.
Zhang, J., F. Chen, Z. Cui, Y. Guo, and Y. Zhu. 2020a. “Deep learning architecture for short-term passenger flow forecasting in urban rail transit.” IEEE Trans. Intell. Transp. Syst. 22 (11): 7004–7014. https://doi.org/10.1109/TITS.2020.3000761.
Zhang, J., F. Chen, Y. Guo, and X. Li. 2020b. “Multi-graph convolutional network for short-term passenger flow forecasting in urban rail transit.” IET Intel. Transport Syst. 14 (10): 1210–1217. https://doi.org/10.1049/iet-its.2019.0873.
Zhang, Z., M. Li, X. Lin, Y. Wang, and F. He. 2019. “Multistep speed prediction on traffic networks: A deep learning approach considering spatio-temporal dependencies.” Transp. Res. Part C: Emerging Technol. 105 (Aug): 297–322. https://doi.org/10.1016/j.trc.2019.05.039.
Zhao, L., Y. Song, C. Zhang, Y. Liu, P. Wang, T. Lin, M. Deng, and H. Li. 2019. “T-GCN: A temporal graph convolutional network for traffic prediction.” IEEE Trans. Intell. Transp. Syst. 21 (9): 3848–3858. https://doi.org/10.1109/TITS.2019.2935152.

Information & Authors

Information

Published In

Go to Journal of Transportation Engineering, Part A: Systems
Journal of Transportation Engineering, Part A: Systems
Volume 150Issue 5May 2024

History

Received: Mar 20, 2023
Accepted: Nov 9, 2023
Published online: Feb 27, 2024
Published in print: May 1, 2024
Discussion open until: Jul 27, 2024

Permissions

Request permissions for this article.

Authors

Affiliations

Associate Professor, Dept. of Transportation, Beijing Univ. of Civil Engineering and Architecture, Beijing 102616, China; mailing address: No. 15 YongYuan Rd., Daxing District, Beijing 102616, China (corresponding author). ORCID: https://orcid.org/0000-0001-6802-6731. Email: [email protected]
Master’s Student, Dept. of Transportation, Beijing Univ. of Civil Engineering and Architecture, Beijing 102616, China; mailing address: No. 15 YongYuan Rd., Daxing District, Beijing 102616, China. ORCID: https://orcid.org/0000-0002-0375-4866. Email: [email protected]
Hua Cai, Ph.D., M.ASCE, P.E. [email protected]
Associate Professor, Dept. of Industrial Engineering, Purdue Univ., West Lafayette, IN 47907; Associate Professor, Dept. of Environmental and Ecological Engineering, Purdue Univ., West Lafayette, IN 47907; mailing address: 315 N. Grant St., West Lafayette, IN 47907-2023. Email: [email protected]
Engineer and Assistant Professor, Research and Development Center, Beijing Transport Institute, Beijing 100073, China; mailing address: No. 9 Liuliqiao South Rd., Fengtai District, Beijing 100073, China. Email: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share