Spatial Autoregressive Analysis and Modeling of Housing Prices in City of Toronto
Publication: Journal of Urban Planning and Development
Volume 147, Issue 1
Abstract
Previous housing price studies based on hedonic price modeling have mainly focused on applying various factors, including built environment variables in the analysis, without establishing a comprehensive theoretical framework as a basis for the model formulation. To address this gap, this study introduces a more systematic framework for decomposing housing prices into land prices as determined by built form, neighborhood socioeconomic characteristics and individual dwellings' physical conditions. Following this logic, this study experiments with the related variables through regression analysis, including consideration of spatial lags, as well as develops a housing price model using a random forests (RF) algorithm. A comprehensive time-series database of housing transaction data for the City of Toronto is used. Modeling results show that neighborhood socioeconomic factors contribute the most to the explanation of housing prices, while housing characteristics and accessibility measures are also significantly influential. The RF model achieves an overall accuracy of 85%, a relatively good performance in reproducing observed prices. The findings provide insights for planners concerning factors influencing housing prices and, hence, residential location decision-making.
Get full access to this article
View all available purchase options and get full access to this article.
References
Anderson, W. P., P. S. Kanaroglou, and E. J. Miller. 1996. “Urban form, energy and the environment: A review of issues, evidence and policy.” Urban Stud. 33 (1): 7–35. https://doi.org/10.1080/00420989650012095.
Bailey, M. J., R. F. Muth, and H. O. Nourse. 1963. “A regression method for real estate price index construction.” J. Am. Stat. Assoc. 58 (304): 933–942. https://doi.org/10.1080/01621459.1963.10480679.
Balcilar, M., R. Gupta, and S. M. Miller. 2015. “The out-of-sample forecasting performance of nonlinear models of regional housing prices in the US.” Appl. Econ. 47 (22): 2259–2277. https://doi.org/10.1080/00036846.2015.1005814.
Belgiu, M., and L. Drăguţ. 2016. “Random forest in remote sensing: A review of applications and future directions.” ISPRS J. Photogramm. Remote Sens. 114: 24–31. https://doi.org/10.1016/j.isprsjprs.2016.01.011.
Bitter, C., G. F. Mulligan, and S. Dall’erba. 2007. “Incorporating spatial variation in housing attribute prices: A comparison of geographically weighted regression and the spatial expansion method.” J. Geog. Syst. 9 (1): 7–27. https://doi.org/10.1007/s10109-006-0028-7.
Bourassa, S. C., M. Hoesli, and J. Sun. 2006. “A simple alternative house price index method.” J. Housing Econ. 15 (1): 80–97. https://doi.org/10.1016/j.jhe.2006.03.001.
Bowen, W. M., B. A. Mikelbank, and D. M. Prestegaard. 2001. “Theoretical and empirical considerations regarding space in hedonic housing price model applications.” Growth Change 32 (4): 466–490. https://doi.org/10.1111/0017-4815.00171.
Breiman, L. 2001. “Random forests.” Mach. Learn. 45 (1): 5–32. https://doi.org/10.1023/A:1010933404324.
Brunsdon, C., A. S. Fotheringham, and M. Charlton. 2002. “Geographically weighted summary statistics—a framework for localised exploratory data analysis.” Comput. Environ. Urban Syst. 26 (6): 501–524. https://doi.org/10.1016/S0198-9715(01)00009-6.
Brunsdon, C., A. S. Fotheringham, and M. E. Charlton. 1996. “Geographically weighted regression: A method for exploring spatial nonstationarity.” Geog. Anal. 28 (4): 281–298. https://doi.org/10.1111/j.1538-4632.1996.tb00936.x.
Burt, M. R., L. Y. Aron, and E. Lee. 2001. Helping America’s homeless: Emergency shelter or affordable housing? Washington, DC: The Urban Institute.
Can, A. 1992. “Specification and estimation of hedonic housing price models.” Reg. Sci. Urban Econ. 22 (3): 453–474. https://doi.org/10.1016/0166-0462(92)90039-4.
Cao, K., M. Diao, and B. Wu. 2019. “A big data–based geographically weighted regression model for public housing prices: A case study in Singapore.” Ann. Am. Assoc. Geogr. 109 (1): 173–186. https://doi.org/10.1080/24694452.2018.1470925.
Case, B., H. O. Pollakowski, and S. M. Wachter. 1991. “On choosing among house price index methodologies.” Real Estate Econ. 19 (3): 286–307. https://doi.org/10.1111/1540-6229.00554.
Cervero, R., and K. Kockelman. 1997. “Travel demand and the 3Ds: Density, diversity, and design.” Transp. Res. Part D: Transp. Environ. 2 (3): 199–219. https://doi.org/10.1016/S1361-9209(97)00009-6.
Cervero, R., O. L. Sarmiento, E. Jacoby, L. F. Gomez, and A. Neiman. 2009. “Influences of built environments on walking and cycling: Lessons from Bogotá.” Int. J. Sustainable Transp. 3 (4): 203–226. https://doi.org/10.1080/15568310802178314.
Chau, K. W., and T. L. Chin. 2003. “A critical review of literature on the hedonic price model.” Int. J. Hous. Sci. Appl. 27 (2): 145–165.
Chen, Y., X. Liu, X. Li, Y. Liu, and X. Xu. 2016. “Mapping the fine-scale spatial pattern of housing rent in the metropolitan area by using online rental listings and ensemble learning.” Appl. Geogr. 75: 200–212. https://doi.org/10.1016/j.apgeog.2016.08.011.
Clapp, J. M., and C. Giaccotto. 1992. “Estimating price indices for residential property: A comparison of repeat sales and assessed value methods.” J. Am. Stat. Assoc. 87 (418): 300–306. https://doi.org/10.1080/01621459.1992.10475209.
Cohen, J. P., and C. C. Coughlin. 2008. “Spatial hedonic models of airport noise, proximity, and housing prices.” J. Reg. Sci. 48 (5): 859–878. https://doi.org/10.1111/j.1467-9787.2008.00569.x.
De La Barra, T., B. Pérez, and N. Vera. 1984. “TRANUS-J: Putting large models into small computers.” Environ. Plann. B: Plann. Des. 11 (1): 87–101. https://doi.org/10.1068/b110087.
Diamond, R., and T. McQuade. 2019. “Who wants affordable housing in their backyard? An equilibrium analysis of low-income property development.” J. Political Econ. 127 (3): 1063–1117. https://doi.org/10.1086/701354.
Dubin, R., R. K. Pace, and T. G. Thibodeau. 1999. “Spatial autoregression techniques for real estate data.” J. Real Estate Lit. 7 (1): 79–96. https://doi.org/10.1023/A:1008690521599.
Echenique, M. H., A. D. J. Flowerdew, J. D. Hunt, T. R. Mayo, I. J. Skidmore, and D. C. Simmonds. 1990. “The MEPLAN models of Bilbao, Leeds and Dortmund.” Transp. Rev. 10 (4): 309–322. https://doi.org/10.1080/01441649008716764.
Englund, P., J. M. Quigley, and C. L. Redfearn. 1999. “The choice of methodology for computing housing price indexes: Comparisons of temporal aggregation and sample definition.” J. Real Estate Finance Econ. 19 (2): 91–112. https://doi.org/10.1023/A:1007846404582.
Farber, S., and A. Páez. 2007. “A systematic investigation of cross-validation in GWR model estimation: Empirical analysis and Monte Carlo simulations.” J. Geog. Syst. 9 (4): 371–396. https://doi.org/10.1007/s10109-007-0051-3.
Fernández-Delgado, M., E. Cernadas, S. Barro, and D. Amorim. 2014. “Do we need hundreds of classifiers to solve real world classification problems?” J. Mach. Learn. Res. 15 (1): 3133–3181.
Goodman, A. C. 1978. “Hedonic prices, price indices and housing markets.” J. Urban Econ. 5 (4): 471–484. https://doi.org/10.1016/0094-1190(78)90004-9.
Goodman, A. C. 1988. “An econometric model of housing price, permanent income, tenure choice, and housing demand.” J. Urban Econ. 23 (3): 327–353. https://doi.org/10.1016/0094-1190(88)90022-8.
Gu, J., M. Zhu, and L. Jiang. 2011. “Housing price forecasting based on genetic algorithm and support vector machine.” Expert Syst. Appl. 38 (4): 3383–3386. https://doi.org/10.1016/j.eswa.2010.08.123.
Habib, M. A., and E. J. Miller. 2008. “Influence of transportation access and market dynamics on property values: Multilevel spatiotemporal models of housing price.” Transp. Res. Rec. 2076 (1): 183–191. https://doi.org/10.3141/2076-20.
Haider, M., and E. J. Miller. 2000. “Effects of transportation infrastructure and location on residential real estate values: Application of spatial autoregressive techniques.” Transp. Res. Rec. 1722 (1): 1–8. https://doi.org/10.3141/1722-01.
Hu, L., S. He, Z. Han, H. Xiao, S. Su, M. Weng, and Z. Cai. 2019. “Monitoring housing rental prices based on social media: An integrated approach of machine-learning algorithms and hedonic modeling to inform equitable housing policies.” Land Use Policy 82: 657–673. https://doi.org/10.1016/j.landusepol.2018.12.030.
Hung, B. L., and H. A. Hung. 2014. “Using one-dimensional linear interpolation method to check over-fitting in neural network with multi-dimensional inputs.” App. Mech. Mater. 496–500: 2228–2232. https://doi.org/10.4028/www.scientific.net/AMM.496-500.2228.
Huang, B., B. Wu, and M. Barry. 2010. “Geographically and temporally weighted regression for modeling spatio-temporal variation in house prices.” Int. J. Geog. Inf. Sci. 24 (3): 383–401. https://doi.org/10.1080/13658810802672469.
Hunt, J. D. 2003. “Design and application of the pecas land use modelling system.”
Ihlanfeldt, K. R. 2007. “The effect of land use regulation on housing and land prices.” J. Urban Econ. 61 (3): 420–435. https://doi.org/10.1016/j.jue.2006.09.003.
Kamusoko, C., and J. Gamba. 2015. “Simulating urban growth using a random forest-cellular automata (RF-CA) model.” ISPRS Int. J. Geo-Inf. 4 (2): 447–470. https://doi.org/10.3390/ijgi4020447.
Kim, H.-G., K.-C. Hung, and S. Y. Park. 2015. “Determinants of housing prices in Hong Kong: A Box-Cox quantile regression approach.” J. Real Estate Finance Econ. 50 (2): 270–287. https://doi.org/10.1007/s11146-014-9456-1.
Koenker, R., and K. F. Hallock. 2001. “Quantile regression.” J. Econ. Perspect. 15 (4): 143–156. https://doi.org/10.1257/jep.15.4.143.
Kouwenberg, R., and R. Zwinkels. 2014. “Forecasting the US housing market.” Int. J. Forecasting 30 (3): 415–425. https://doi.org/10.1016/j.ijforecast.2013.12.010.
Landis, J. D. 1994. “The California Urban Futures Model: A new generation of metropolitan simulation models.” Environ. Plann. B: Plann. Des. 21 (4): 399–420. https://doi.org/10.1068/b210399.
Levine, J. 1998. “Rethinking accessibility and jobs-housing balance.” J. Am. Plann. Assoc. 64 (2): 133–149. https://doi.org/10.1080/01944369808975972.
Malpezzi, S., G. H. Chun, and R. K. Green. 1998. “New place-to-place housing price indexes for US Metropolitan Areas, and their determinants.” Real Estate Econ. 26 (2): 235–274. https://doi.org/10.1111/1540-6229.00745.
Mark, J. H., and M. A. Goldberg. 1984. “Alternative housing price indices: An evaluation.” Real Estate Econ. 12 (1): 30–49. https://doi.org/10.1111/1540-6229.00309.
Martinez, F. 1996. “MUSSA: Land use model for Santiago city.” Transp. Res. Rec. 1552 (1): 126–134. https://doi.org/10.1177/0361198196155200118.
Mason, C., and J. M. Quigley. 1996. “Non-parametric hedonic housing prices.” Housing Stud. 11 (3): 373–385. https://doi.org/10.1080/02673039608720863.
Massey, D. S., and J. S. Rugh. 2017. “Zoning, affordable housing, and segregation in US metropolitan areas.” In Chap. 14 in The fight for fair housing: Causes, consequences, and future implications of the 1968 federal fair housing act, edited by G. D. Squires, 245–264. London: Routledge.
McMillen, D. 2013. “Local quantile house price indices.” In Univ. of Illinois, mimeo, 1–41. Urbana, IL: University of Illinois.
Mok, H. M., P. P. Chan, and Y.-S. Cho. 1995. “A hedonic price model for private properties in Hong Kong.” J. Real Estate Finance Econ. 10 (1): 37–48. https://doi.org/10.1007/BF01099610.
Morris, A. C., H. R. Neill, and N. E. Coulson. 2020. “Housing supply elasticity, gasoline prices, and residential property values.” J. Housing Econ. 48: 101669. https://doi.org/10.1016/j.jhe.2020.101669.
Nguyen, M. T. 2005. “Does affordable housing detrimentally affect property values? A review of the literature.” J. Plann. Lit. 20 (1): 15–26. https://doi.org/10.1177/0885412205277069.
Oladunni, T., and S. Sharma. 2016. “Hedonic housing theory—A machine learning investigation.” In Proc., 2016 15th IEEE Int. Conf. on Machine Learning and Applications, 522–527. New York: IEEE.
Osland, L., and I. Thorsen. 2008. “Effects on housing prices of urban attraction and labor-market accessibility.” Environ. Plann. A: Econ. Space 40 (10): 2490–2509. https://doi.org/10.1068/a39305.
Palczewska, A., J. Palczewski, R. M. Robinson, and D. Neagu. 2014. “Interpreting random forest classification models using a feature contribution method.” In Integration of reusable systems, T. Bouabana-Tebibel, and S. H. Rubin, 193–218. Cham, Switzerland: Springer.
Park, B., and J. K. Bae. 2015. “Using machine learning algorithms for housing price prediction: The case of Fairfax County, Virginia housing data.” Expert Syst. Appl. 42 (6): 2928–2934. https://doi.org/10.1016/j.eswa.2014.11.040.
Quigley, J. M. 1994. “A simple hybrid model for estimating real estate price indexes.” J. Housing Econ. 4 (1): 1–12. https://doi.org/10.1006/jhec.1995.1001.
Rafiei, M. H., and H. Adeli. 2016. “A novel machine learning model for estimation of sale prices of real estate units.” J. Constr. Eng. Manage. 142 (2): 04015066. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001047.
Rosen, S. 1974. “Hedonic prices and implicit markets: Product differentiation in pure competition.” J. Political Econ. 82 (1): 34–55. https://doi.org/10.1086/260169.
Selim, H. 2009. “Determinants of house prices in Turkey: Hedonic regression versus artificial neural network.” Expert Syst. Appl. 36 (2): 2843–2852. https://doi.org/10.1016/j.eswa.2008.01.044.
Song, Y., and G.-J. Knaap. 2004. “Measuring the effects of mixed land uses on housing values.” Reg. Sci. Urban Econ. 34 (6): 663–680. https://doi.org/10.1016/j.regsciurbeco.2004.02.003.
Statistics Canada. 2012. “Toronto, Ontario (Code 3520005) and Ontario (Code 35) (table). Census Profile.” 2011 Census. Statistics Canada Catalogue no. 98-316-XWE. Ottawa. Released October 24, 2012. http://www12.statcan.gc.ca/census-recensement/2011/dp-pd/prof/index.cfm?Lang=E.
Statistics Canada. 2018. Survey of household spending, 2017. Ottawa: Statistics Canada.
Tong, D., Y. Zhang, I. MacLachlan, and G. Li. 2019. “Migrant housing choices from a social capital perspective: The case of Shenzhen, China.” Habitat Int. 96: 102082. https://doi.org/10.1016/j.habitatint.2019.102082.
Toronto Life. 2018. “The ultimate neighbourhood rankings.”
Treyz, G. I. 1995. “Policy analysis applications of REMI economic forecasting and simulation models.” Int. J. Public Admin. 18 (1): 13–42. https://doi.org/10.1080/01900699508524997.
Waddell, P. 2002. “Urbansim: Modeling urban development for land use, transportation, and environmental planning.” J. Am. Plann. Assoc. 68 (3): 297–314. https://doi.org/10.1080/01944360208976274.
Wallace, N. E., and R. A. Meese. 1997. “The construction of residential housing price indices: A comparison of repeat-sales, hedonic-regression, and hybrid approaches.” J. Real Estate Finance Econ. 14 (1–2): 51–73. https://doi.org/10.1023/A:1007715917198.
Wang, S., S. Zheng, and J. Feng. 2007. “Spatial accessibility of housing to public services and its impact on housing price: A case study of Beijing’s inner city.” Prog. Geogr. 26 (6): 78–85.
Witte, A. D., H. J. Sumka, and H. Erekson. 1979. “An estimate of a structural hedonic price model of the housing market: An application of Rosen’s theory of implicit markets.” Econometrica 47 (5): 1151–1173. https://doi.org/10.2307/1911956.
Wyner, A. J., M. Olson, J. Bleich, and D. Mease. 2017. “Explaining the success of adaboost and random forests as interpolating classifiers.” J. Mach. Learn. Res. 18 (1): 1558–1590.
Xie, X., and G. Hu. 2007. “A comparison of Shanghai housing price index forecasting.” In Proc., 3rd Int. Conf. on Natural Computation, 221–225. New York: IEEE.
Yan, Y., X. Wei, B. Hui, S. Yang, W. Zhang, Y. Hong, and S.-y. Wang. 2007. “Method for housing price forecasting based on TEI@I methodology.” Syst. Eng. Theory Pract. 27 (7): 1–9. https://doi.org/10.1016/S1874-8651(08)60047-2.
Yang, C., Q. Zhan, Y. Lv, and H. Liu. 2019. “Downscaling land surface temperature using multiscale geographically weighted regression over heterogeneous landscapes in Wuhan, China.” IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 12 (12): 5213–5222. https://doi.org/10.1109/JSTARS.2019.2955551.
Yao, Y., X. Liu, X. Li, J. Zhang, Z. Liang, K. Mai, and Y. Zhang. 2017. “Mapping fine-scale population distributions at the building level by integrating multisource geospatial big data.” Int. J. Geog. Inf. Sci. 31 (6): 1220–1244.
Zhang, D., X. Liu, X. Wu, Y. Yao, X. Wu, and Y. Chen. 2019. “Multiple intra-urban land use simulations and driving factors analysis: A case study in Huicheng, China.” GISci. Remote Sens. 56 (2): 282–308. https://doi.org/10.1080/15481603.2018.1507074.
Ziemke, D., K. Nagel, and R. Moeckel. 2016. “Towards an agent-based, integrated land-use transport modeling system.” Procedia Comput. Sci. 83: 958–963. https://doi.org/10.1016/j.procs.2016.04.192.
Zietz, J., E. N. Zietz, and G. S. Sirmans. 2008. “Determinants of house prices: A quantile regression approach.” J. Real Estate Finance Econ. 37 (4): 317–333. https://doi.org/10.1007/s11146-007-9053-7.
Information & Authors
Information
Published In
Copyright
© 2021 American Society of Civil Engineers.
History
Received: May 15, 2020
Accepted: Sep 15, 2020
Published online: Jan 11, 2021
Published in print: Mar 1, 2021
Discussion open until: Jun 11, 2021
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.