Technical Papers
Apr 6, 2022

Building Classification Using Random Forest to Develop a Geodatabase for Probabilistic Hazard Information

Publication: Natural Hazards Review
Volume 23, Issue 3

Abstract

To understand the community risk from severe weather threats, two components, including weather information and community assets, are crucial. Recently, probabilistic hazard information (PHI) from the National Oceanic and Atmospheric Administration (NOAA) Forecasting a Continuum of Environmental Threats (FACETs) program has been developed to provide dynamic weather-related information between the watch and warning systems to weather forecasters, emergency management agencies, and the public. To predict community physical risks on critical infrastructure and building properties using PHI, building type information is required. This study applied a machine learning technique to predict building types using building footprint and city zoning data. We collected Oklahoma county building property data to train and test a random forest model. The result of this study showed that building footprint and city zoning data can be applied to classify multiple building types with an accuracy of 96%. The machine learning–based building classification contributed to the acquisition of building type data in the Oklahoma City, Oklahoma, metropolitan area. This geodatabase will be utilized to predict real-time critical infrastructure and building damage assessment using PHI. In addition to their importance to physical building damage assessment, the results can be utilized to develop postdisaster responses and planning.

Get full access to this article

View all available purchase options and get full access to this article.

Data Availability Statement

The following data sets are available from the author ([email protected]) upon reasonable request:
building footprint,
city zoning data, and
building property data.

Acknowledgments

Funding was provided by the National Oceanic and Atmospheric Administration (NOAA) Office of Oceanic and Atmospheric Research (OAR) and Office of Weather and Air Quality (OWAQ) (Federal Grant No. NA18OAR4590386—Implementing convective storm statistics from a large reanalysis of WSR-88D data for model verification and forecasting probabilistic uncertainty).

References

Ahmad, M. W., M. Mourshed, and Y. Rezgui. 2017. “Trees vs neurons: Comparison between random forest and ANN for high-resolution prediction of building energy consumption.” Energy Build. 147 (Jul): 77–89. https://doi.org/10.1016/j.enbuild.2017.04.038.
Bai, Y., B. Adriano, E. Mas, and S. Koshimura. 2017. “Building damage assessment in the 2015 Gorkha, Nepal, earthquake using only post-event dual polarization synthetic aperture radar imagery.” Earthquake Spectra 33 (1_suppl): 185–195. https://doi.org/10.1193/121516eqs232m.
Baradaranshoraka, M., J. P. Pinelli, K. Gurley, X. Peng, and M. Zhao. 2017. “Hurricane wind versus storm surge damage in the context of a risk prediction model.” J. Struct. Eng. 143 (9): 1–10. https://doi.org/10.1061/(ASCE)ST.1943-541X.0001824.
Beck, A., G. Long, D. S. Boyd, J. F. Rosser, J. Morley, R. Duffield, M. Sanderson, and D. Robinson. 2020. “Automated classification metrics for energy modelling of residential buildings in the UK with open algorithms.” Environ. Plann. B: Urban Anal. City Sci. 47 (1): 45–64. https://doi.org/10.1177/2399808318762436.
Blagus, R., and L. Lusa. 2013. “SMOTE for high-dimensional class-imbalanced data.” BMC Bioinf. 14 (1): 1–16. https://doi.org/10.1186/1471-2105-14-106.
Brackett, R., and J. W. Childs. 2020. “Tornado outbreak tears across the south, killing 33.” Accessed March 2, 2021. https://weather.com/storms/tornado/news/2020-04-13-tornadoes-storms-impacts-south-carolina-georgia-mississippi-arkansas.
Breiman, L. 2001. “Random forests.” Mach. Learn. 45 (1): 5–32. https://doi.org/10.1023/A:1010933404324.
Buda, M., A. Maki, and M. A. Mazurowski. 2018. “A systematic study of the class imbalance problem in convolutional neural networks.” Neural Netw. 106 (Oct): 249–259. https://doi.org/10.1016/j.neunet.2018.07.011.
Burgess, D., et al. 2014. “20 May 2013 Moore, Oklahoma, tornado: Damage survey and analysis.” Weather Forecasting 29 (5): 1229–1237. https://doi.org/10.1175/WAF-D-14-00039.1.
Cao, Q. D., and Y. Choe. 2020. “Building damage annotation on post-hurricane satellite imagery based on convolutional neural networks.” Nat. Hazard. 103 (3): 3357–3376. https://doi.org/10.1007/s11069-020-04133-2.
Cerda, P., G. Varoquaux, and B. Kégl. 2018. “Similarity encoding for learning with dirty categorical variables.” Mach. Learn. 107 (8–10): 1477–1494. https://doi.org/10.1007/s10994-018-5724-2.
Chawla, N. V., K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer. 2002. “SMOTE: Synthetic minority over-sampling technique.” J. Artif. Intell. Res. 16 (Sep): 321–357. https://doi.org/10.1613/jair.953.
Chuang, K. V., and M. J. Keiser. 2018. “Comment on ‘Predicting reaction performance in C–N cross-coupling using machine learning.’” Science 362 (6416): 1–3. https://doi.org/10.1126/science.aat8603.
Dai, K., A. Li, H. Zhang, S. E. Chen, and Y. Pan. 2018. “Surface damage quantification of postearthquake building based on terrestrial laser scan data.” Struct. Control Health Monit. 25 (8): e2210–e2218. https://doi.org/10.1002/stc.2210.
Dong, L., and J. Shan. 2013. “A comprehensive review of earthquake-induced building damage detection with remote sensing techniques.” ISPRS J. Photogramm. Remote Sens. 84 (Oct): 85–99. https://doi.org/10.1016/j.isprsjprs.2013.06.011.
Douzas, G., F. Bacao, and F. Last. 2018. “Improving imbalanced learning through a heuristic oversampling method based on k-means and SMOTE.” Inf. Sci. 465 (Oct): 1–20. https://doi.org/10.1016/j.ins.2018.06.056.
Gee, B., K. Bellware, M. Kornfield, T. Bella, and M. Cappucci. 2020. “Tennessee tornadoes kill at least 24 people, including children, and leave a trail of destruction.” Accessed March 2, 2021. https://www.washingtonpost.com/nation/2020/03/03/nashville-tornado/.
Gettelman, A., D. N. Bresch, C. C. Chen, J. E. Truesdale, and J. T. Bacmeister. 2018. “Projections of future tropical cyclone damage with a high-resolution global climate model.” Clim. Change 146 (3–4): 575–585. https://doi.org/10.1007/s10584-017-1902-7.
Ham, H. J., W. Yun, S. H. Choi, S. Lee, and H. J. Kim. 2018. “Quantitative wind risk assessment for low and mid-rise apartment buildings based on a probabilistic model.” J. Asian Archit. Build. Eng. 17 (2): 377–384. https://doi.org/10.3130/jaabe.17.377.
Hartmann, A., G. Meinel, R. Hecht, and M. Behnisch. 2016. “A workflow for automatic quantification of structure and dynamic of the German building stock using official spatial data.” ISPRS Int. J. Geo-Inf. 5 (8): 142. https://doi.org/10.3390/ijgi5080142.
Hecht, R., M. Kalla, and T. Krüger. 2018. “Crowd-sourced data collection to support automatic classification of building footprint data.” Proc. Int. Cartogr. Assoc. 1 (54): 1–7. https://doi.org/10.5194/ica-proc-1-54-2018.
Hecht, R., G. Meinel, and M. Buchroithner. 2015. “Automatic identification of building types based on topographic databases–A comparison of different data sources.” Int. J. Cartography 1 (1): 18–31. https://doi.org/10.1080/23729333.2015.1055644.
Hussain, E., and J. Shan. 2016. “Urban building extraction through object-based image classification assisted by digital surface model and zoning map.” Int. J. Image Data Fusion 7 (1): 63–82. https://doi.org/10.1080/19479832.2015.1119206.
Insurance Information Institute. 2019. Facts + statistics: Tornadoes and thunderstorms. New York: Insurance Information Institute.
Jamali, B., R. Löwe, P. M. Bach, C. Urich, K. Arnbjerg-Nielsen, and A. Deletic. 2018. “A rapid urban flood inundation and damage assessment model.” J. Hydrol. 564 (Feb): 1085–1098. https://doi.org/10.1016/j.jhydrol.2018.07.064.
James, J. J., C. Ling, C. D. Karstens, J. Correia Jr., K. Calhoun, T. Meyer, and D. LaDue. 2020. “Forecasters’ cognitive task analysis and mental workload analysis of issuing probabilistic hazard information (PHI) during FACETs PHI prototype experiment.” Weather Forecasting 35 (4): 1505–1521. https://doi.org/10.1175/WAF-D-19-0194.1.
Kang, J., M. Körner, Y. Wang, H. Taubenböck, and X. X. Zhu. 2018. “Building instance classification using street view images.” ISPRS J. Photogramm. Remote Sens. 145 (Nov): 44–59. https://doi.org/10.1016/j.isprsjprs.2018.02.006.
Kang, K., and H. Ryu. 2019. “Predicting types of occupational accidents at construction sites in Korea using random forest model.” Saf. Sci. 120 (Jul): 226–236. https://doi.org/10.1016/j.ssci.2019.06.034.
Karstens, C. D., et al. 2015. “Evaluation of a probabilistic forecasting methodology for severe convective weather in the 2014 hazardous weather testbed.” Weather Forecasting 30 (6): 1551–1570. https://doi.org/10.1175/WAF-D-14-00163.1.
Karstens, C. D., et al. 2018. “Development of a human-machine mix for forecasting severe convective events.” Weather Forecasting 33 (3): 715–737. https://doi.org/10.1175/WAF-D-17-0188.1.
Khajwal, A. B., and A. Noshadravan. 2020. “Probabilistic hurricane wind-induced loss model for risk assessment on a regional scale.” ASCE-ASME J. Risk Uncertainty Eng. Syst. Part A: Civ. Eng. 6 (2): 1–9. https://doi.org/10.1061/AJRUA6.0001062.
Klockow-McClain, K. E., R. A. McPherson, and R. P. Thomas. 2019. “Cartographic design for improved decision making: Trade-offs in uncertainty visualization for Tornado threats.” Ann. Am. Assoc. Geogr. 110 (1): 314–333. https://doi.org/10.1080/24694452.2019.1602467.
Korkmaz, S. 2020. “Deep learning-based imbalanced data classification for drug discovery.” J. Chem. Inf. Model. 60 (9): 4180–4190. https://doi.org/10.1021/acs.jcim.9b01162.
Kuligowski, E. D., F. T. Lombardo, L. T. Phan, M. L. Levitan, and D. P. Jorgensen. 2014. Technical investigation of the May 22, 2011 Tornado in Joplin, Missouri. Gaithersburg, MD: NIST.
Lee, J., H. Jang, J. Yang, and K. Yu. 2017. “Machine learning classification of buildings for map generalization.” ISPRS Int. J. Geo-Inf. 6 (10): 309. https://doi.org/10.3390/ijgi6100309.
Lee, S., and J. Kim. 2020. “Prediction of nanofiltration and reverse-osmosis-membrane rejection of organic compounds using random forest model.” J. Environ. Eng. 146 (11): 04020127. https://doi.org/10.1061/(ASCE)EE.1943-7870.0001806.
Lemaitre, G., F. Nogueira, and C. K. Aridas. 2017. “Imbalanced-learn: A python toolbox to tackle the curse of imbalanced datasets in machine learning.” J. Mach. Learn. Res. 18 (17): 1–5.
Lever, J., M. Krzywinski, and N. Altman. 2016. “Classification evaluation.” Nat. Methods 13 (8): 603–604. https://doi.org/10.1038/nmeth.3945.
Marasco, D. E., and C. E. Kontokosta. 2016. “Applications of machine learning methods to identifying and predicting building retrofit opportunities.” Energy Build. 128 (Sep): 431–441. https://doi.org/10.1016/j.enbuild.2016.06.092.
Masoomi, H., J. W. Van De Lindt, M. R. Ameri, T. Q. Do, and B. M. Webb. 2019. “Combined wind-wave-surge hurricane-induced damage prediction for buildings.” J. Struct. Eng. 145 (1): 1–15. https://doi.org/10.1061/(ASCE)ST.1943-541X.0002241.
McDonald, J. R., and K. C. Mehta. 2006. A recommendation for an enhanced Fujita scale (EF-scale). Lubbock, TX: Texas Tech Univ.
Microsoft. 2020. “Building footprints.” Accessed October 1, 2020. https://www.microsoft.com/en-us/maps/building-footprints.
Mietkiewicz, N., J. K. Balch, T. Schoennagel, S. Leyk, L. A. St. Denis, and B. A. Bradley. 2020. “In the line of fire: Consequences of human-ignited wildfires to homes in the U.S. (1992–2015).” Fire 3 (3): 50. https://doi.org/10.3390/fire3030050.
Miran, S. M., C. Ling, A. Gerard, and L. Rothfusz. 2018. “The effect of providing probabilistic information about a tornado threat on people’s protective actions.” Nat. Hazard. 94 (2): 743–758. https://doi.org/10.1007/s11069-018-3418-5.
Mishra, S., O. A. Vanli, B. P. Alduse, and S. Jung. 2017. “Hurricane loss estimation in wood-frame buildings using Bayesian model updating: Assessing uncertainty in fragility and reliability analyses.” Eng. Struct. 135 (Mar): 81–94. https://doi.org/10.1016/j.engstruct.2016.12.063.
Murfin, J., and M. Spiegel. 2020. “Is the risk of sea level rise capitalized in residential real estate?” Rev. Finance Stud. 33 (3): 1217–1255. https://doi.org/10.1093/rfs/hhz134.
Nafari, R. H., T. Ngo, and P. Mendis. 2016. “An assessment of the effectiveness of tree-based models for multi-variate flood damage assessment in Australia.” Water 8 (7): 282.
NOAA (National Oceanic and Atmospheric Administration). 2011. “Tornadoes 101—An essential guide to tornadoes: Stay alert to stay alive.” Accessed April 6, 2021. https://www.noaa.gov/stories/tornadoes-101.
NOAA/NCEI (National Oceanic and Atmospheric Administration/National Centers for Environmental Information). 2018. “U.S. billion-dollar weather and climate disasters.” Accessed March 10, 2018. https://www.ncdc.noaa.gov/billions/.
NOAA/NCEI (National Oceanic and Atmospheric Administration/National Centers for Environmental Information). 2021. “Billion-dollar events to affect the United States from 2000 to 2021.” Accessed March 14, 2022. https://www.ncdc.noaa.gov/billions/.
NRC (National Research Council). 2006. Completing the forecast: Characterizing and communicating uncertainty for better decisions using weather and climate forecasts. Washington, DC: National Academies.
NSSL (National Severe Storms Laboratory). 2015. “FACETs: Forcasting a continuum of environmental threats.” Accessed December 28, 2020. https://www.nssl.noaa.gov/news/factsheets/FACETs_2015.pdf.
Oklahoma City Planning Commission. 2020. “City zoning map.” Accessed October 1, 2020. https://www.okc.gov/departments/planning/planning-commission.
Oklahoma County Assessor. 2019. “Public record.” Accessed July 14, 2020. https://assessor.oklahomacounty.org/203/Search-Property-Records.
Park, Y., and J.-M. Guldmann. 2019. “Creating 3D city models with building footprints and LIDAR point cloud classification: A machine learning approach.” Comput. Environ. Urban Syst. 75 (Jan): 76–89. https://doi.org/10.1016/j.compenvurbsys.2019.01.004.
Pedregosa, F., et al. 2011. “Scikit-learn: Machine learning in python.” J. Mach. Learn. Res. 12 (85): 2825–2830.
Pérez-González, C. J., M. Colebrook, J. L. Roda-García, and C. B. Rosa-Remedios. 2019. “Developing a data analytics platform to support decision making in emergency and security management.” Expert Syst. Appl. 120: 167–184. https://doi.org/10.1016/j.eswa.2018.11.023.
Pinter, N., and J. C. Rees. 2021. “Assessing managed flood retreat and community relocation in the Midwest USA.” Nat. Hazard. 107 (1): 497–518. https://doi.org/10.1007/s11069-021-04592-1.
Pirasteh, S., P. Rashidi, H. Rastiveis, S. Huang, Q. Zhu, G. Liu, Y. Li, J. Li, and E. Seydipour. 2019. “Developing an algorithm for buildings extraction and determining changes from airborne LiDAR, and comparing with R-CNN method from drone images.” Remote Sens. 11 (11): 1272. https://doi.org/10.3390/rs11111272.
Plank, S. 2014. “Rapid damage assessment by means of multi-temporal SAR-A comprehensive review and outlook to Sentinel-1.” Remote Sens. 6 (6): 4870–4906. https://doi.org/10.3390/rs6064870.
Rothfusz, L. P., R. Schneider, D. Novak, K. Klockow-McClain, A. E. Gerard, C. Karstens, G. J. Stumpf, and T. M. Smith. 2018. “FACETs: A proposed next-generation paradigm for high-impact weather forecasting.” Bull. Am. Meteorol. Soc. 99 (10): 2025–2043. https://doi.org/10.1175/BAMS-D-16-0100.1.
Shivers-Williams, C. A., and K. E. Klockow-McClain. 2021. “Geographic scale and probabilistic forecasts: A trade-off for protective decisions?” Nat. Hazard. 105 (2): 2283–2306. https://doi.org/10.1007/s11069-020-04400-2.
Simmons, K. M., D. Sutter, and R. Pielke. 2013. “Normalized tornado damage in the United States: 1950-2011.” Environ. Hazards 12 (2): 132–147. https://doi.org/10.1080/17477891.2012.738642.
Smith, A. B. (2020). U.S. billion-dollar weather and climate disasters, 1980—Present. Asheville, NC: NOAA National Centers for Environmental Information.
Stumpf, G. J., and A. E. Gerard. 2021. “National weather service severe weather warnings as threats-in-motion.” Weather Forecasting 36 (2): 627–643. https://doi.org/10.1175/WAF-D-20-0159.1.
Tanha, J., Y. Abdi, N. Samadi, N. Razzaghi, and M. Asadpour. 2020. “Boosting methods for multi-class imbalanced data classification: An experimental review.” J. Big Data 7 (1): 1–47. https://doi.org/10.1186/s40537-020-00349-y.
Tardioli, G., R. Kerrigan, M. Oates, J. O’Donnell, and D. P. Finn. 2018. “Identification of representative buildings and building groups in urban datasets using a novel pre-processing, classification, clustering and predictive modelling approach.” Build. Environ. 140 (May): 90–106. https://doi.org/10.1016/j.buildenv.2018.05.035.
Thabtah, F., S. Hammoud, F. Kamalov, and A. Gonsalves. 2020. “Data imbalance in classification: Experimental evaluation.” Inf. Sci. 513 (Mar): 429–441. https://doi.org/10.1016/j.ins.2019.11.004.
Vetrivel, A., M. Gerke, N. Kerle, and G. Vosselman. 2016. “Identification of structurally damaged areas in airborne oblique images using a Visual-Bag-of-Words approach.” Remote Sens. 8 (3): 231. https://doi.org/10.3390/rs8030231.
Wu, F., L. Gong, C. Wang, H. Zhang, B. Zhang, and L. Xie. 2016. “Signature analysis of building damage with Terra SAR-X new staring SpotLight mode data.” IEEE Geosci. Remote Sens. Lett. 13 (11): 1696–1700. https://doi.org/10.1109/LGRS.2016.2604841.
Wurm, M., A. Schmitt, and H. Taubenbock. 2016. “Building types’ classification using shape-based features and linear discriminant functions.” IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 9 (5): 1901–1912. https://doi.org/10.1109/JSTARS.2015.2465131.
Yun, S. H., et al. 2015. “Rapid damage mapping for the 2015 Mw 7.8 Gorkha earthquake using synthetic aperture radar data from COSMO-SkyMed and ALOS-2 satellites.” Seismol. Res. Lett. 86 (6): 1549–1556. https://doi.org/10.1785/0220150152.
Zhai, W., and C. Huang. 2016. “Fast building damage mapping using a single post-earthquake PolSAR image: A case study of the 2010 Yushu earthquake 4. Seismology.” Earth Planets Space 68 (1): 1–2. https://doi.org/10.1186/s40623-016-0469-2.
Zheng, A., and A. Casari. 2016. Feature engineering for machine learning: Principles and techniques for data scientists. Beijing: O’Reilly Media.
Zhou, Z., and J. Gong. 2018. “Automated analysis of mobile LiDAR data for component-level damage assessment of building structures during large coastal storm events.” Comput.-Aided Civ. Infrastruct. Eng. 33 (5): 373–392. https://doi.org/10.1111/mice.12345.

Information & Authors

Information

Published In

Go to Natural Hazards Review
Natural Hazards Review
Volume 23Issue 3August 2022

History

Received: Jul 12, 2021
Accepted: Feb 2, 2022
Published online: Apr 6, 2022
Published in print: Aug 1, 2022
Discussion open until: Sep 6, 2022

Permissions

Request permissions for this article.

Authors

Affiliations

Postdoctoral Research Associate, Cooperative Institute for Severe and High-Impact Weather Research and Operations (CIWRO) and NOAA National Severe Storms Laboratory, National Weather Center, 120 David L Boren Blvd., Norman, OK 73072 (corresponding author). ORCID: https://orcid.org/0000-0002-0395-5107. Email: [email protected]
Joshua J. Hatzis [email protected]
Postdoctoral Research Associate, Cooperative Institute for Severe and High-Impact Weather Research and Operations (CIWRO), National Weather Center, 120 David L Boren Blvd., Norman, OK 73072. Email: [email protected]
Kim Klockow [email protected]
Research Scientist, Cooperative Institute for Severe and High-Impact Weather Research and Operations (CIWRO) and NOAA National Severe Storms Laboratory, National Weather Center, 120 David L Boren Blvd., Norman, OK 73072. Email: [email protected]
Patrick A. Campbell [email protected]
Research Scientist, Cooperative Institute for Severe and High-Impact Weather Research and Operations (CIWRO) and NOAA National Severe Storms Laboratory, National Weather Center, 120 David L Boren Blvd., Norman, OK 73072. Email: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

Cited by

  • An Agent-Based Modeling Approach to Protective Action Decision-Related Travel during Tornado Warnings, Natural Hazards Review, 10.1061/NHREFO.NHENG-1783, 25, 1, (2024).

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share