Technical Papers
Jul 26, 2024

Effectiveness of Image Augmentation Techniques on Detection of Building Characteristics from Street View Images Using Deep Learning

Publication: Journal of Construction Engineering and Management
Volume 150, Issue 10

Abstract

Two key building characteristics, namely the number of stories and typology, is vital across various domains such as construction management and architectural design. These aspects are particularly critical for disaster risk assessment and infrastructure planning. Although deep learning models are adept at extracting this information from Street view images (SVIs), their success is contingent upon the availability of large and diverse data sets with high accuracy. Image augmentation presents an alternative method to artificially broaden data set diversity. However, the impact of image augmentation techniques on identifying building stories and typologies from SVIs has not been adequately explored. This study proposes a methodology employing eight distinct image augmentation techniques—brightness, contrast, perspective, rotation, scale, shearing, and translation augmentations—as well as a combined approach using all these methods. The study evaluates the efficacy of models trained with these techniques by comparing the accuracy of different classes and architectures for each task, both with and without the application of augmentation. The findings revealed that while most augmentation methods enhance model accuracy, their effectiveness is task-dependent. Furthermore, it was observed that the most effective augmentation techniques differ among building classes and architectures within each task. This suggests that augmentation strategies need to be custom-designed to align with the unique features of each class and architectures for precise estimation of the number of stories and building typologies. While the focus of this research is on specific tasks, the evaluated augmentation techniques could also extend to related areas, such as ascertaining the age of buildings or identifying window types.

Practical Applications

In this study, the efficacy of augmentation techniques is explored within the framework of identifying the number of stories and building typologies. The models were assessed for average accuracy and class-specific accuracy across various architectures, comparing outcomes with and without the implementation of the proposed augmentation methods. A key finding is that the most effective augmentation method varies between architectures and individual classes. Contrary to common practice in deep learning, where applying multiple augmentation techniques is standard for accuracy enhancement, this study observed that such a strategy did not uniformly improve performance. Specifically, while combining augmentation methods generally resulted in higher average accuracy, this was not the case for some classes within MobileNetV3 when detecting the number of stories. Similarly, for ResNet-152, employing all augmentation techniques together led to the lowest accuracy in certain classes for building typology classification. These results indicate that augmentation strategies may require customization to cater to the distinct characteristics of each class and architecture for accurate estimation of number of stories and building typologies.

Get full access to this article

View all available purchase options and get full access to this article.

Data Availability Statement

All data, models, and code generated or used during the study appear in the submitted article. The data set obtained, trained model, and test results can be downloaded from Figshare data repository (Wang 2024).

References

Alomar, K., H. I. Aysel, and X. Cai. 2023. “Data augmentation in classification and segmentation: A survey and new strategies.” J. Imaging 9 (2): 46. https://doi.org/10.3390/jimaging9020046.
Boiarov, A., and E. Tyantov. 2019. “Large scale landmark recognition via deep metric learning.” In Proc., 28th ACM Int. Conf. on Information and Knowledge Management, 169–178. New York: Association for Computing Machinery.
Cha, Y. J., W. Choi, and O. Büyüköztürk. 2017. “Deep learning-based crack damage detection using convolutional neural networks.” Comput.-Aided Civ. Infrastruct. Eng. 32 (5): 361–378. https://doi.org/10.1111/mice.12263.
Cha, Y. J., W. Choi, G. Suh, S. Mahmoudkhani, and O. Büyüköztürk. 2018. “Autonomous structural visual inspection using region-based deep learning for detecting multiple damage types.” Comput.-Aided Civ. Infrastruct. Eng. 33 (9): 731–747. https://doi.org/10.1111/mice.12334.
Chen, F.-C., A. Subedi, M. R. Jahanshahi, D. R. Johnson, and E. J. Delp. 2022. “Deep learning–Based building attribute estimation from Google street view images for flood risk assessment using feature fusion and task relation encoding.” J. Comput. Civ. Eng. 36 (6): 04022031. https://doi.org/10.1061/(asce)cp.1943-5487.0001025.
Claridades, A. R. C., H. S. Choi, and J. Lee. 2023. “Detecting building entrances on street view images using deep learning for supporting indoor-outdoor seamless services.” J. Korean Soc. Surv. Geod. Photogramm. Cartogr. 41 (5): 351–365. https://doi.org/10.7848/ksgpc.2023.41.5.351.
Frid-Adar, M., E. Klang, M. Amitai, J. Goldberger, and H. Greenspan. 2018. “Synthetic data augmentation using GAN for improved liver lesion classification.” In Proc., 2018 IEEE 15th Int. Symp. on Biomedical Imaging (ISBI 2018). New York: IEEE.
Google. 2023. “Google street view static API.” Accessed June 15, 2023. https://developers.google.com/maps/documentation/streetview/intro.
Ilic, L., M. Sawada, and A. Zarzelli. 2019. “Deep mapping gentrification in a large Canadian city using deep learning and Google Street View.” PLoS One 14 (3): e0212814. https://doi.org/10.1371/journal.pone.0212814.
Jung, A. B., K. Wada, J. Crall, S. Tanaka, J. Graving, C. Reinders, S. Yadav, J. Banerjee, G. Vecsei, and A. Kraft. 2020. imgaug. San Francisco: GitHub.
Kalfarisi, R., M. Hmosze, and Z. Y. Wu. 2022. “Detecting and geolocating city-scale soft-story buildings by deep machine learning for urban seismic resilience.” Nat. Hazards Rev. 23 (1): 04021062. https://doi.org/10.1061/(asce)nh.1527-6996.0000541.
Kandel, I., M. Castelli, and L. Manzoni. 2022. “Brightness as an augmentation technique for image classification.” Emerg. Sci. J. 6 (4): 881–892. https://doi.org/10.28991/ESJ-2022-06-04-015.
Krizhevsky, A., I. Sutskever, and G. E. Hinton. 2012. “ImageNet classification with deep convolutional neural networks.” Adv. Neural Inf. Process. Syst. 25: 1097–1105. https://doi.org/10.1016/j.protcy.2014.09.007.
Meng, C., Y. Song, J. Ji, Z. Jia, Z. Zhou, P. Gao, and S. Liu. 2022. “Automatic classification of rural building characteristics using deep learning methods on oblique photography.” Build. Simul. 15 (Dec): 1161–1174. https://doi.org/10.1007/s12273-021-0872-x.
Mumuni, A., and F. Mumuni. 2022. “Data augmentation: A comprehensive survey of modern approaches.” Array 16 (Dec): 100258.
O’Gara, S., and K. McGuinness. 2019. “Comparing data augmentation strategies for deep image classification.” In Proc., IMVIP 2019: Irish Machine Vision & Image Processing. Dublin, Ireland: Technological Univ. Dublin.
Ordnance Survey. 2023. “OS Data Hub.” Accessed June 15, 2023. https://osdatahub.os.uk/.
Ottoni, A. L. C., R. M. de Amorim, M. S. Novo, and D. B. Costa. 2023. “Tuning of data augmentation hyperparameters in deep learning to building construction image classification with small datasets.” Int. J. Mach. Learn. Cybern. 14 (1): 171–186. https://doi.org/10.1007/s13042-022-01555-1.
Ramalingam, S. P., and V. Kumar. 2023. “Automatizing the generation of building usage maps from geotagged street view images using deep learning.” Build. Environ. 235 (May): 110215. https://doi.org/10.1016/j.buildenv.2023.110215.
Shehadeh, A., O. Alshboul, and G. Almasabha. 2024. “Slope displacement detection in construction: An automated management algorithm for disaster prevention.” Expert Syst. Appl. 237 (PB): 121505. https://doi.org/10.1016/j.eswa.2023.121505.
Shehadeh, A., O. Alshboul, and O. Hamedat. 2022. “Risk assessment model for optimal gain–pain share ratio in target cost contract for construction projects.” J. Constr. Eng. Manage. 148 (2): 04021197. https://doi.org/10.1061/(asce)co.1943-7862.0002222.
Shorten, C., and T. M. Khoshgoftaar. 2019. “A survey on image data augmentation for deep learning.” J. Big Data 6 (1): 1–48. https://doi.org/10.1186/s40537-019-0197-0.
Wang, K., B. Fang, J. Qian, S. Yang, X. Zhou, and J. Zhou. 2020. “Perspective transformation data augmentation for object detection.” IEEE Access 8 (Dec): 4935–4943. https://doi.org/10.1109/ACCESS.2019.2962572.
Wang, N., X. Zhao, P. Zhao, Y. Zhang, Z. Zou, and J. Ou. 2019. “Automatic damage detection of historic masonry buildings based on mobile deep learning.” Autom. Constr. 103 (Jul) 53–66. https://doi.org/10.1016/j.autcon.2019.03.003.
Wang, S. 2024. “Building facade images for classifying building stories and identifying building typologies.” In Figshare as a data repository. London: Figshare. https://doi.org/10.6084/m9.figshare.24979947.v1.
Wang, S., M. Kim, H. Hae, M. Cao, and J. Kim. 2023. “The development of a rebar-counting model for reinforced concrete columns: Using an unmanned aerial vehicle and deep-learning approach.” J. Constr. Eng. Manage. 149 (11): 1–13. https://doi.org/10.1061/JCEMD4.COENG-13686.
Wang, S., I. Korolija, and D. Rovas. 2022. “Impact of traditional augmentation methods on window state detection.” In Proc., CLIMA 2022 Conf., 1–8. Delft, Netherlands: Delft Univ. of Technology. https://doi.org/10.34641/clima.2022.375.
Xiao, B., and S.-C. Kang. 2021. “Development of an image data set of construction machines for deep learning object detection.” J. Comput. Civ. Eng. 35 (2): 05020005. https://doi.org/10.1061/(asce)cp.1943-5487.0000945.
Zhou, Q., Z. Situ, S. Teng, and G. Chen. 2023. “Comparative effectiveness of data augmentation using traditional approaches versus StyleGANs in automated sewer defect detection.” J. Water Resour. Plann. Manage. 149 (9): 04023045. https://doi.org/10.1061/jwrmd5.wreng-5831.
Zou, S., and L. Wang. 2021. “Detecting individual abandoned houses from Google street view: A hierarchical deep learning approach.” ISPRS J. Photogramm. Remote Sens. 175 (May): 298–310. https://doi.org/10.1016/j.isprsjprs.2021.03.020.

Information & Authors

Information

Published In

Go to Journal of Construction Engineering and Management
Journal of Construction Engineering and Management
Volume 150Issue 10October 2024

History

Received: Jan 22, 2024
Accepted: Apr 16, 2024
Published online: Jul 26, 2024
Published in print: Oct 1, 2024
Discussion open until: Dec 26, 2024

Permissions

Request permissions for this article.

Authors

Affiliations

Researcher, Dept. of Architectural Engineering, Hanyang Univ., Seungdong-Gu, Seoul 133791, Korea. ORCID: https://orcid.org/0009-0001-5207-1586
Professor, Dept. of Architectural Engineering, Hanyang Univ., Seungdong-Gu, Seoul 133791, Korea. ORCID: https://orcid.org/0000-0003-3931-6814
Seongkyung Kim
Researcher, Dept. of Architectural Engineering, Hanyang Univ., Seungdong-Gu, Seoul 133791, Korea.
Seunghyeon Wang [email protected]
Professor, Dept. of Architectural Engineering, Hanyang Univ., Seungdong-Gu, Seoul 133791, Korea (corresponding author). Email: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share