Technical Papers
Mar 30, 2016

Decision Tree Based Station-Level Rail Transit Ridership Forecasting

Publication: Journal of Urban Planning and Development
Volume 142, Issue 4

Abstract

This paper presents a decision-tree based model to forecast rail transit ridership at the station level according to the surrounding land-use patterns. The canonical correlation analysis (CCA) method is used to identify key land use variables by evaluating their degrees of contribution to the rail transit station demand, which can effectively reduce dimensionality and complexity of the decision tree. A full month of Smart Card data and detailed regulatory land use plan from Chongqing, China are collected for model development and validation. The proposed model offers the capability of targeting key lane use patterns and associating them with rail transit station boarding and alighting demand at a high level of accuracy. The proposed model can reveal underlying rules between rail transit station demand and land use variables, and can be used to assist in developing the Transit Oriented Development (TOD) plans to improve land use and transit operational efficiency.

Get full access to this article

View all available purchase options and get full access to this article.

References

Arlot, S., and Celisse, A. (2010). “A survey of cross-validation procedures for model selection.” Stat. Surv., 4, 40–79.
Blainey, S. P., and Preston, J. M. (2010). “Modelling local rail demand in South Wales.” Transp. Plann. Technol., 33(1), 55–73.
Bradley, A. (1997). “The use of the area under the ROC curve in the evaluation of machine learning algorithms.” Pattern Recognit., 30(7), 1145–1159.
Cervero, R. (2006). “Alternative approaches to modeling the travel-demand impacts of smart growth.” J. Am. Plann. Assoc., 72(3), 285–295.
Cervero, R., Sarmiento, O. L., Jacoby, E., Gomez, L. F., and Neiman, A. (2009). “Influences of built environments on walking and cycling: Lessons from Bogotá.” Int. J. Sustainable Transp., 3(4), 203–226.
Chang, Y., and Mastrangelo, C. (2011). “Addressing multicollinearity in semiconductor manufacturing.” Qual. Reliab. Eng. Int., 27(6), 843–854.
Chu, X. (2004). “Ridership models at the stop level.” National Center for Transit Research, Center for Urban Transportation Research, Univ. of South Florida, Tampa, FL.
Dargay, J. (2010). “A forecasting model for long distance travel in Great Britain.” European Transport Conf., Association for European Transport, Henley-in-Arden, U.K.
Davis, J., and Goadrich, M. (2006). “The relationship between precision-recall and ROC curves.” Proc., 23rd Int. Conf. on Machine Learning, Association of Computing Machinery (ACM), New York, 233–240.
Do, A. Q., and Grudnitski, G. (1992). “A neural network approach to residential property appraisal.” Real Estate Appraiser, 58(3), 38–45.
Duduta, N. (2013). “Direct ridership model of Mexico City’s BRT and metro systems.” Transp. Res. Rec., 2394(1), 93–99.
Esmeir, S., and Markovitch, S. (2007). “Anytime learning of decision trees.” J. Mach. Learn. Res., 8, 891–933.
Fehr and Peers. (2013). “DRM (direct ridership models).” 〈http://www.fehrandpeers.com/drm-direct-ridership-models/〉 (Oct. 17, 2015).
Fujikoshi, Y., Ulyanov, V. V., and Shimizu, R. (2010). Multivariate statistics: High-dimensional and large-sample approximations, Wiley, Hoboken, NJ.
Guerra, E., Cervero, R., and Tischler, D. (2011). “The half-mile circle: Does it best represent transit station catchments?” Univ. of California, Berkeley, CA.
Gutierrez, J., Cardozo, O. D., and Garcia-Palomares, J. C. (2011). “Transit ridership forecasting at station level: An approach based on distance-decay weighted regression.” J. Transp. Geogr., 19(6), 1081–1092.
Hair, J. F., Anderson, R. E., Tatham, R. L., and Black, W. C. (1998). Multivariate data analysis, Prentice Hall, Upper Saddle River, NJ.
Hong, S., Chen, X., Jin, L., and Xiong, M. (2013). “Canonical correlation analysis for RNA-seq co-expression networks.” Nucleic Acids Res., 41(8), e95.
Hotelling, H. (1936). “Relations between two sets of variates.” Biometrika, 28(3–4), 321–377.
Khalil, B., Ouarda, T. B. M. J., and St-Hilaire, A. (2011). “Estimation of water quality characteristics at ungauged sites using artificial neural networks and canonical correlation analysis.” J. Hydrol., 405(3–4), 277–287.
Kohavi, R. (1995). “Wrappers for performance enhancement and oblivious decision graphs.” Ph.D. dissertation, Dept. of Computer Science, Stanford Univ., Standford, CA.
Kuby, M., Barranda, A., and Upchurch, C. (2004). “Factors influencing light-rail station boardings in the United States.” Transp. Res. Part A, 38(3), 223–247.
Larsen, J. E., and Peterson, M. O. (1988). “Correcting for errors in statistical appraisal equations.” Real Estate Appraiser Anal., 54(3), 45–49.
Lee, S. G., Hickman, M., and Tong, D. Q. (2013). “Development of a temporal and spatial linkage between transit demand and landuse patterns.” J. Transport Land Use, 6(2), 33–46.
Liu, C., Erdogan, S., Ma, T., and Ducca, F. W. (2014). “How to increase rail ridership in Maryland? Direct ridership models (DRM) for policy guidance.” TRB 93rd Annual Meeting Compendium of Papers, Transportation Research Board, Washington, DC, 3414–3417.
Mardia, K. V. (1975). “Assessment of multinormality and the robustness of Hotelling’s test.” Appl. Stat., 24(2), 163.
Mark, J., and Goldberg, M. (1988). “Multiple regression analysis and mass assessment: A review of the issues.” Appraisal J., 56(1), 89–109.
Marshall, N., and Grady, B. (2006). “Sketch transit modeling based on 2000 census data.” Transp. Res. Rec., 1986(1), 182–189.
McNally, M. G. (2000). “The four step model.” Institute for Transportation Studies, Univ. of California, Irvine, CA.
Misaki, M., Wallace, G. L., Dankner, N., Martin, A., and Bandettini, P. A. (2012). “Characteristic cortical thickness patterns in adolescents with autism spectrum disorders: Interactions with age and intellectual ability revealed by canonical correlation analysis.” NeuroImage, 60(3), 1890–1901.
Muata, K., and Bryson, O. (2004). “Evaluation of decision trees: A multi-criteria approach.” Comput. Oper. Res., 31(11), 1933–1945.
Naylor, M. G., Lin, X., Weiss, S. T., Raby, B. A., and Lange, C. (2010). “Using canonical correlation analysis to discover genetic regulatory variants.” PLoS One, 5(5), e10395.
Nejad, S. K., Seifi, F., Ahmadi, H., and Seifi, N. (2009). “Applying data mining in prediction and classification of urban traffic.” Computer Science and Information Engineering, 2009 WRI World Congress, IEEE Computer Society, Los Alamitos, CA, 674–678.
Olutayo, V. A, and Eludire, A. A. (2014). “Traffic accident analysis using decision trees and neural networks.” Int. J. Inf. Technol. Comput. Sci., 6(2), 22–28.
Owen, A. D., and Philips, G. D. A. P. (1987). “An econometric investigation into the characteristics of railway passenger demand.” J. Transp. Econ. Policy, 21, 231–253.
Preston, J. (1991). “Demand forecasting for new local rail stations and services.” J. Transp. Econ. Policy, 25, 183–202.
Quinlan, J. R. (1986). “Induction of decision trees.” Mach. Learn., 1(1), 81–106.
Sherry, A., and Henson, R. K. (2005). “Conducting and interpreting canonical correlation analysis in personality research: A user-friendly primer.” J. Pers. Assess., 84(1), 37–48.
Sohn, K., and Shim, H. (2010). “Factors generating boardings at metro stations in the Seoul metropolitan area.” Cities, 27(5), 358–368.
Sung, H. J., and Oh, J. T. (2011). “Transit-oriented development in a high-density city: Identifying its association with transit ridership in Seoul, Korea.” Cities, 28(1), 70–82.
Tabchnick, B. G., Fidell, L. S. (1996). Using multivariate statistics, Allyn and Bacon, London, 879.
Tang, L., Xiong, C., and Zhang, L. (2014). “Artificial intelligence approach to modeling travel mode switching in a dynamic behavioral process.” Transportation Research Board 93rd Annual Meeting, Transportation Research Board, Washington, DC.
Tsai, C. H., Mulley, C., and Clifton, G. (2013). “Forecasting public transport demand for the Sydney greater metropolitan area: A comparison of univariate and multivariate methods.” Australasian Transport Research Forum, Brisbane, Australia.
Walters, G., and Cervero, R. (2003). Forecasting transit demand in a fast growing corridor: The direct-ridership model approach, Fehr and Peers, Lafayette, CA.
Wang, J. J., Wang, J. F., Lu, F., Cao, Z. D., Liao, Y. L., and Deng, Y. (2009). “Comparison study on classification performance for short-term urban traffic flow condition using decision tree algorithms.” Software Eng., 4, 434–438.
Wardman, M., and Tyler, J. (2000). “Rail network accessibility and the demand for interurban rail travel.” Transp. Rev., 20(1), 3–24.
Xie, C., Lu, J., and Parkany, E. (2003). “Work travel mode choice modeling with data mining: Decision trees and neural networks.” Transp. Res. Rec., 1854, 50–61.
Zhang, X. F., and Fan, L. (2013). “A decision tree approach for traffic accident analysis of Saskatchewan highways.” Electrical and Computer Engineering (CCECE), 26th Annual IEEE Canadian Conf., IEEE, Piscataway, NJ, 1–4.
Zurada, J., Levitan, A., and Guan, J. (2011). “A comparison of regression and artificial intelligence methods in a mass appraisal context.” J. Real Estate Res., 33(3), 349–387.

Information & Authors

Information

Published In

Go to Journal of Urban Planning and Development
Journal of Urban Planning and Development
Volume 142Issue 4December 2016

History

Received: Aug 19, 2015
Accepted: Dec 23, 2015
Published online: Mar 30, 2016
Discussion open until: Aug 30, 2016
Published in print: Dec 1, 2016

Permissions

Request permissions for this article.

Authors

Affiliations

Ph.D. Candidate, Research Assistant, Dept. of Civil and Environmental Engineering, Univ. of Wisconsin at Milwaukee, P.O. Box 784, Milwaukee, WI 53201-0784. E-mail: [email protected]
Yue Liu, A.M.ASCE [email protected]
Associate Professor, Dept. of Civil and Environmental Engineering, Univ. of Wisconsin at Milwaukee, P.O. Box 784, Milwaukee, WI 53201-0784 (corresponding author). E-mail: [email protected]
Zhigang Gao [email protected]
Vice Chief, Chongqing Urban Transport Planning and Research Institute, No. 18, Yanghe’er Rd., Chongqing 400020, China. E-mail: [email protected]
Daizong Liu [email protected]
Senior Engineer, China Sustainable Transportation Center, Energy Foundation US, Room 1903, CITIC Bldg., No. 19 Jianguomenwai Ave., Beijing 100004, China. E-mail: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

Cited by

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share