Improved Approach for Forecasting Extra-Peak Hourly Subway Ridership at Station-Level Based on LASSO
Publication: Journal of Transportation Engineering, Part A: Systems
Volume 147, Issue 11
Abstract
Prediction of the extra-peak hourly ridership (EPHR) is directly related to the capacity design of subway station service facilities. In the traditional station-level EPHR prediction process, the predicted value is simply the result of the multiplication of the predicted peak hourly ridership (PHR) value by a unified extra-peak hour factor (EPHF). However, the station-level EPHR predicted by this method may be underestimated because the PHR prediction results are extracted from a line-level prediction value, rather than the station-level value. Moreover, while the existing EPHF is always determined by China’s Code for Design of Metro, it is too simple and unrefined to be applicable. The proposed station-level EPHR prediction approach exhibits significantly improved accuracy and applicability via the introduction of a least absolute shrinkage and selection operator (LASSO)-based feature selection method. The historical ridership and related attribute data of the stations are used to construct relationship models for the peak deviation coefficient (PDC) and the EPHF to make the model more explanatory. As a case study, this approach was evaluated on a real-world, large-scale passenger flow dataset from Xi’an, China, and compared with the results of the traditional method. The results indicate that the EPHR prediction accuracies of 10% to 51% of the stations are improved and the corresponding mean absolute percentage error (MAPE) is reduced by 6%–30%, as compared with the traditional method, suggesting wider applicability and higher precision for station-level prediction. A supplementary comparison with two other feature selection methods further verifies that the LASSO-based approach exhibits higher accuracy and applicability.
Get full access to this article
View all available purchase options and get full access to this article.
Data Availability Statement
Station ridership data used during the study were provided by Xi’an Metro Group Co., Ltd. Requests for these materials may be made directly to the provider, as indicated in the Acknowledgements.
Acknowledgments
The authors would like to thank Xi’an Metro Group Co., Ltd. for the station ridership data. This research is supported by the National Natural Science Foundation of China, Grant No. 71871027. The authors confirm contribution to the paper as follows: study conception and design: Wei and Cheng; data collection: Wei, Yu, Zhang, and Chen; analysis and interpretation of results: Wei, Cheng, and Zhang; draft manuscript preparation: Wei; original draft preparation: Wei; and review and editing: Cheng and Chen.
References
Baidu Encyclopedia. 2020. “Xi’an Metro.” Accessed October 17, 2020. https://baike.baidu.com/item/%E8%A5%BF%E5%AE%89%E5%9C%B0%E9%93%81/9670679?fr=aladdin.
Cao, Z., Z. Cao, X. Yang, W. Huang, X. Zhang, and H. Zhao. 2020. “Multi-factor analysis and modeling of net energy of lactation (NEL) prediction in primiparous dairy cows.” Measurement 162 (Oct): 107881. https://doi.org/10.1016/j.measurement.2020.107881.
Cardozo, O. D., J. C. García-Palomares, and J. Gutiérrez. 2012. “Application of geographically weighted regression to the direct forecasting of transit ridership at station-level.” Appl. Geogr. 34 (2): 548–558. https://doi.org/10.1016/j.apgeog.2012.01.005.
Cervero, R. 2006. “Alternative approaches to modeling the travel-demand impacts of smart growth.” J. Am. Plann. Assoc. 72 (3): 285–295. https://doi.org/10.1080/01944360608976751.
Cervero, R., and K. Kockelman. 1997. “Travel demand and the 3Ds: Density, diversity, and design.” Transp. Res. Part D Transp. Environ. 2 (3): 199–219. https://doi.org/10.1016/S1361-9209(97)00009-6.
Chalermpong, S., and S. S. Wibowo. 2008. “Transit station access trips and factors affecting propensity to walk to transit stations in Bangkok, Thailand.” J. East. Asia Soc. Transp. Stud. 2007 (7): 1806–1919. https://doi.org/10.11175/easts.7.1806.
Chen, Y., B. Yi, Y. Jiang, J. Sun, and M. I. M. Wahab. 2018. “Inter-arrival time distribution of passengers at service facilities in underground subway stations: A case study of the metropolitan city of Chengdu in China.” Transp. Res. Part A: Policy Pract. 111 (May): 227–251. https://doi.org/10.1016/j.tra.2018.03.009.
Cheng, X., K. Huang, L. Qu, T. Zhang, and L. Li. 2020. “Effects of vehicle restriction policies on urban travel demand change from a built environment perspective.” J. Adv. Transp. 2020: 1–13. https://doi.org/10.1155/2020/9848095.
Cheng, Y., X. Ye, Z. Wang, and L. Zhou. 2018. “Forecasting model of peak period station to station origin destination matrix in urban rail transit systems.” J. Tongji Univ. 46 (3): 70–77. https://doi.org/10.11908/j.issn.0253-374x.2018.03.010.
China Association of Metros. 2020. “Overview of urban rail transit lines in mainland China in 2019.” Accessed January 1, 2020. https://www.camet.org.cn/xxfb/5802.
Dovey, K., E. Pafka, and M. Ristic. 2017. Mapping urbanities: Morphologies, flows, possibilities. Oxfordshire, UK: Routledge.
Drent, H. M., B. van den Hoofdakker, A. de Bildt, J. K. Buitelaar, P. J. Hoekstra, and A. Dietrich. 2020. “Factors related to parental pre-treatment motivation in outpatient child and adolescent mental health care.” Eur. Child. Adolesc. Psychiatry 29 (7): 947–958. https://doi.org/10.1007/s00787-019-01391-9.
Efron, B., T. Hastie, I. Johnstone, and J. R. Tibshirani. 2004. “Least angle regression.” Ann. Stat. 32 (2): 407–499. https://doi.org/10.1214/009053604000000067.
Fan, C., F. Xiao, and S. Wang. 2014. “Development of prediction models for next-day building energy consumption and peak power demand using data mining techniques.” Appl. Energy 127 (Aug): 1–10. https://doi.org/10.1016/j.apenergy.2014.04.016.
Feng, X., Q. Sun, J. Liu, Y. Yang, and X. Liang. 2010. “Time characteristic of input passenger in urban rail transit stations among high density residential areas.” In Proc., 29th Chinese Control Conf., 5453-5456. New York: IEEE.
García-Palomares, J. C., J. Gutiérrez, and O. D. Cardozo. 2013. “Walking accessibility to public transport: An analysis based on microdata and GIS.” Environ. Plann. B: Plann. Des. 40 (6): 1087–1102. https://doi.org/10.1068/b39008.
Gu, L., and X. Ye. 2014. “Research on the peak time of passenger flow entering and exiting railway stations in Osaka City.” [In Chinese.] Compr. Transp. 2014 (2): 57–61. https://doi.org/CNKI:SUN:YSZH.0.2014-02-012.
Gu, L., and X. Ye. 2019. “Research on the relationship between passenger flow peak time and land use in rail stations.” In Proc., Annual National Planning Conf. 2019. Beijing: China Construction Industry Press. https://doi.org/10.26914/c.cnkihy.2019.003610.
Guo, R., and Z. Huang. 2020. “Mass rapid transit ridership forecast based on direct ridership models: A case study in Wuhan, China.” J. Adv. Transp. 2020: 1–19. https://doi.org/10.1155/2020/7538508.
Gutierrez, J., O. D. Cardozo, and J. C. Garcia-Palomares. 2011. “Transit ridership forecasting at station level: An approach based on distance-decay weighted regression.” J. Transp. Geogr. 19 (6): 1081–1092. https://doi.org/10.1016/j.jtrangeo.2011.05.004.
Guyon, I., and A. Elisseeff, eds. 2003. “An introduction to variable and feature selection.” J. Mach. Learn. Res. 3 (7–8): 1157–1182. https://doi.org/10.1162/153244303322753616.
He, J. 2008. “Study of configuration quantity of auto fare collection machine in urban rail transit.” Railway Signal. Commun. 44 (10): 14–17. https://doi.org/10.13879/j.issn1000-7458.2008.10.026.
He, Y., Y. Zhao, and K. L. Tsui. 2021. “An adapted geographically weighted LASSO (Ada-GWL) model for predicting subway ridership.” Transportation 48 (3): 1185–1216. https://doi.org/10.1007/s11116-020-10091-2.
Hsu, N.-J., H.-L. Hung, and Y.-M. Chang. 2008. “Subset selection for vector autoregressive processes using Lasso.” Comput. Stat. Data Anal. 52 (7): 3645–3657. https://doi.org/10.1016/j.csda.2007.12.004.
Huang, Z., M. Zhang, and X. Liu. 2017. “Estimating light-rail transit peak-hour boarding based on accessibility at station and route levels in Wuhan, China.” Transp. Plann. Technol. 40 (5): 624–639. https://doi.org/10.1080/03081060.2017.1314497.
Iseki, H., C. Liu, and G. Knaap. 2018. “The determinants of travel demand between rail stations: A direct transit demand model using multilevel analysis for the Washington DC Metrorail system.” Transp. Res. Part A: Policy Pract. 116 (Oct): 635–649. https://doi.org/10.1016/j.tra.2018.06.011.
Ji, X., S. An, Y. Yu, and Y. Xie. 2017. “Statistical analysis for macroscopic indexes for urban rail transit passenger flow forecast.” Urban Rapid Rail Transit 30 (6): 39–46. https://doi.org/10.3969/j.issn.1672-6073.2017.06.007.
Jiang, Y., P. Christopher Zegras, and S. Mehndiratta. 2012. “Walk the line: Station context, corridor type and bus rapid transit walk access in Jinan, China.” J. Transp. Geogr. 20 (1): 1–14. https://doi.org/10.1016/j.jtrangeo.2011.09.007.
Kamel, E., S. Sheikh, and X. Huang. 2020. “Data-driven predictive models for residential building energy use based on the segregation of heating and cooling days.” Energy 206 (Sep): 118045. https://doi.org/10.1016/j.energy.2020.118045.
Kohavi, R., and G. H. John. 1997. “Wrappers for feature subset selection.” Artif. Intell. 97 (1–2): 273–324. https://doi.org/10.1016/S0004-3702(97)00043-X.
Kraft, G., and M. Wohl. 1967. “New directions for passenger demand analysis and forecasting.” Transp. Res. 1 (3): 205–230. https://doi.org/10.1016/0041-1647(67)90033-0.
Li, Q., F. Qiao, A. Mao, and C. McCreight. 2019. “Characterizing the importance of criminal factors affecting bus ridership using random forest ensemble algorithm.” Transp. Res. Rec. 2673 (4): 864–876. https://doi.org/10.1177/0361198119837504.
Li, S., D. Lyu, X. Liu, Z. Tan, F. Gao, G. Huang, and Z. Wu. 2020. “The varying patterns of rail transit ridership and their relationships with fine-scale built environment factors: Big data analytics from Guangzhou.” Cities 99 (Apr): 102580. https://doi.org/10.1016/j.cities.2019.102580.
Ma, L., Z. Li, and M. Xie. 2015. “Characteristic analysis of rail passenger flow peak hour.” In Proc., China Urban Transportation Planning Annual Meeting and 28th Academic Seminar. Beijing: China Construction Industry Press.
MOH (Ministry of Housing and Urban-Rural Development of the People’s Republic of China). 2011. Code for classification of urban land use and planning standards of development land. GB 50137. Beijing: China Construction Industry Press. http://www.jianbiaoku.com/webarbs/book/254/53837.shtml.
MOH (Ministry of Housing and Urban-Rural Development of the People’s Republic of China). 2013. Code for design of metro. GB 50157. Beijing: China Construction Industry Press. http://www.jianbiaoku.com/webarbs/book/1027/1073253.shtml.
Ping, S. 2018. “Characteristics of temporal passenger flow distribution at different stations on Shenzhen Metro Line 1.” Urban Mass Transit. 21 (6): 85–87. https://doi.org/10.16037/j.1007-869x.2018.06.023.
Satre-Meloy, A. 2019. “Investigating structural and occupant drivers of annual residential electricity consumption using regularization in regression models.” Energy 174 (May): 148–168. https://doi.org/10.1016/j.energy.2019.01.157.
Sermpinis, G., S. Tsoukas, and P. Zhang. 2018. “Modelling market implied ratings using LASSO variable selection techniques.” J. Empirical Finance 48 (Sep): 19–35. https://doi.org/10.1016/j.jempfin.2018.05.001.
Shen, J. 2008. “Simplified calculation for the width of the boarding and landing zone of a station platform.” Urban Rapid Rail Transit 21 (5): 9–12. https://doi.org/10.3969/j.issn.1672-6073.2008.05.003.
Shi, H., and Y. Sun. 2012. “Passenger flow characteristics and problems of guangzhou rail transit network operation.” Urban Rapid Rail Transit 25 (3): 29–33. https://doi.org/10.3969/j.issn.1672-6073.2012.03.007.
Sohn, K., and H. Shim. 2010. “Factors generating boardings at Metro stations in the Seoul metropolitan area.” Cities 27 (5): 358–368. https://doi.org/10.1016/j.cities.2010.05.001.
Stanciu, A., M. Banciu, A. Sadighi, K. A. Marshall, N. R. Holland, V. Abedi, and R. Zand. 2020. “A predictive analytics model for differentiating between transient ischemic attacks (TIA) and its mimics.” BMC Med. Inf. Decis. Making 20 (1). https://doi.org/10.1186/s12911-020-01154-6.
Tibshirani, R. 1996. “Regression shrinkage and selection via the lasso.” J. R. Stat. Soc. 58 (1): 267–288. https://doi.org/10.1111/j.2517-6161.1996.tb02080.2005.00490.x.
Wang, J., and R. Zuo. 2020. “Assessing geochemical anomalies using geographically weighted lasso.” Appl. Geochem. 119 (Aug): 104668. https://doi.org/10.1016/j.apgeochem.2020.104668.
Yang, X., X. Li, and L. Ma. 2019. “Research on the relationship between passenger flow characteristics and land use in Shenzhen rail station.” In Proc., China Urban Transportation Planning Annual Meeting and 28th Academic Seminar. Beijing: China Construction Industry Press.
Yu, J. 2019. “Characteristics of peak hour passenger flow at rail transit stations in Shanghai.” Urban Transp. China 17 (4): 50–57. https://doi.org/10.13813/j.cn11-5141/u.2019.0408.
Yu, L., Q. Chen, and K. Chen. 2019. “Deviation of peak hours for urban rail transit stations: A case study in Xi’an, China.” Sustainability 11 (10): 2733. https://doi.org/10.3390/su11102733.
Yu, L., Y. Cong, and K. Chen. 2020. “Determination of the peak hour ridership of metro stations in Xi’an, China using geographically-weighted regression.” Sustainability 12 (6): 2255. https://doi.org/10.3390/su12062255.
Zhao, J., and W. Deng. 2013. “Relationship of walk access distance to rapid rail transit stations with personal characteristics and station context.” J. Urban Plann. Dev. 139 (4): 311–321. https://doi.org/10.1061/(ASCE)UP.1943-5444.0000155.
Zhao, J., W. Deng, Y. Song, and Y. Zhu. 2013a. “Analysis of Metro ridership at station level and station-to-station level in Nanjing: An approach based on direct demand models.” Transportation 41: 133–155. https://doi.org/10.1007/s11116-013-9492-3.
Zhao, J., W. Deng, Y. Song, and Y. Zhu. 2013b. “What influences Metro station ridership in China? Insights from Nanjing.” Cities 35: 114–124. https://doi.org/10.1016/j.cities.2013.07.002.
Zhao, X., Y.-P. Wu, G. Ren, K. Ji, and W.-W. Qian. 2019. “Clustering analysis of ridership patterns at subway stations: A case in Nanjing, China.” J. Urban Plann. Dev. 145 (2): 04019005. https://doi.org/10.1061/(asce)up.1943-5444.0000501.
Information & Authors
Information
Published In
Copyright
© 2021 American Society of Civil Engineers.
History
Received: Nov 17, 2020
Accepted: May 14, 2021
Published online: Sep 2, 2021
Published in print: Nov 1, 2021
Discussion open until: Feb 2, 2022
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.
Cited by
- Ying Zhao, Jie Wei, Haijun Li, Yan Huang, Predicting Station-Level Peak Hour Ridership of Metro Considering the Peak Deviation Coefficient, Sustainability, 10.3390/su16031225, 16, 3, (1225), (2024).
- Jie Wei, Yanqiu Cheng, Kuanmin Chen, Meng Wang, Chen Ma, Xianbiao Hu, Nonlinear Model-Based Subway Station-Level Peak-Hour Ridership Estimation Approach in the Context of Peak Deviation, Transportation Research Record: Journal of the Transportation Research Board, 10.1177/03611981221075624, 2676, 6, (549-564), (2022).