Technical Papers
Jul 4, 2022

AdaLN: A Vision Transformer for Multidomain Learning and Predisaster Building Information Extraction from Images

Publication: Journal of Computing in Civil Engineering
Volume 36, Issue 5

Abstract

Satellite and street view images are widely used across disciplines as a source of information for understanding the built environment. In natural hazard engineering, high-quality building inventory data sets are crucial for simulating hazard impacts and supporting decision-making. Screening the building stock to gather the information needed for simulation and to detect structural defects that make buildings vulnerable to natural hazards is a time-consuming and labor-intensive task. This paper presents an automated method for extracting building information from satellite and street view images. The method is built upon a novel transformer-based deep neural network we developed. Specifically, a multidomain learning approach is employed to develop a single compact model for multiple image-based information extraction tasks using multiple data sources (e.g., satellite and street view images). Our multidomain Vision Transformer is designed as a unified architecture that can be deployed effectively for multiple classification tasks. The effectiveness of the proposed approach is demonstrated in a case study in which pretrained models are used to collect regional-scale building information related to natural hazard risks.
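To make the multidomain idea concrete, the sketch below shows one plausible reading of the approach described in the abstract: a transformer encoder block whose attention and MLP weights are shared across all image domains, while each domain keeps its own lightweight layer-normalization parameters (a common interpretation of "AdaLN"). The abstract does not specify the exact design, so the class name, domain labels, and dimensions here are illustrative assumptions, not the paper's implementation.

# Hedged sketch (PyTorch): a shared encoder block with per-domain LayerNorm.
# All names, domains, and hyperparameters are assumptions for illustration.
import torch
import torch.nn as nn

class MultiDomainEncoderBlock(nn.Module):
    def __init__(self, dim, num_heads, domains, mlp_ratio=4):
        super().__init__()
        # Attention and MLP weights are shared across all domains.
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.mlp = nn.Sequential(
            nn.Linear(dim, dim * mlp_ratio),
            nn.GELU(),
            nn.Linear(dim * mlp_ratio, dim),
        )
        # Each domain gets its own LayerNorm scale/shift parameters.
        self.norm1 = nn.ModuleDict({d: nn.LayerNorm(dim) for d in domains})
        self.norm2 = nn.ModuleDict({d: nn.LayerNorm(dim) for d in domains})

    def forward(self, x, domain):
        h = self.norm1[domain](x)
        x = x + self.attn(h, h, h, need_weights=False)[0]
        x = x + self.mlp(self.norm2[domain](x))
        return x

# Usage: tokens from a satellite-image task and a street-view task pass
# through the same shared weights, differing only in the normalization used.
block = MultiDomainEncoderBlock(dim=192, num_heads=3,
                                domains=["satellite", "street_view"])
tokens = torch.randn(2, 197, 192)   # (batch, patches + CLS token, dim)
out_satellite = block(tokens, domain="satellite")
out_street = block(tokens, domain="street_view")

In such a design, per-task classification heads would sit on top of the shared backbone in the same spirit: one small head per task, with the bulk of the parameters reused across domains.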


Data Availability Statement

Testing data, trained models, and the code that support the findings of this study are available from the corresponding author upon reasonable request.

Acknowledgments

This study is based in part on work supported by the National Science Foundation under Grant No. 1612843. Opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.

Information & Authors

Information

Published In

Journal of Computing in Civil Engineering
Volume 36, Issue 5, September 2022

History

Received: Nov 11, 2021
Accepted: Mar 26, 2022
Published online: Jul 4, 2022
Published in print: Sep 1, 2022
Discussion open until: Dec 4, 2022


Authors

Affiliations

Yunhui Guo
Postdoctoral Scholar, International Computer Science Institute, Univ. of California, Berkeley, Berkeley, CA 94704.
Assistant Professor, M.E. Rinker, Sr. School of Construction Management, College of Design, Construction and Planning, Univ. of Florida, Gainesville, FL 32603 (corresponding author). ORCID: https://orcid.org/0000-0001-8534-9276. Email: [email protected]
Director, International Computer Science Institute, Univ. of California, Berkeley, Berkeley, CA 94704. ORCID: https://orcid.org/0000-0002-3507-5761
Frank McKenna
Research Engineer, Dept. of Civil and Environmental Engineering, Univ. of California, Berkeley, Berkeley, CA 94720.
Kincho H. Law, F.ASCE
Professor, Dept. of Civil and Environmental Engineering, Stanford Univ., Stanford, CA 94305.

Metrics & Citations


Cited by

  • A GAN-Augmented CNN Approach for Automated Roadside Safety Assessment of Rural Roadways, Journal of Computing in Civil Engineering, 10.1061/JCCEE5.CPENG-5406, 38, 2, (2024).
  • Building Façade Style Classification from UAV Imagery Using a Pareto-Optimized Deep Learning Network, Electronics, 10.3390/electronics11213450, 11, 21, (3450), (2022).
