Automatically Categorizing Construction Accident Narratives Using the Deep-Learning Model with a Class-Imbalance Treatment Technique

Shuang, Qing; Liu, Xishan; Wang, Zhaojing; Xu, Xinxin

doi:10.1061/JCEMD4.COENG-14515

Technical Papers

Jun 27, 2024

Authors: Qing Shuang, Xishan Liu (https://orcid.org/0009-0001-8830-8995), Zhaojing Wang (https://orcid.org/0000-0001-6705-6082), and Xinxin Xu

Publication: Journal of Construction Engineering and Management

Volume 150, Issue 9

https://doi.org/10.1061/JCEMD4.COENG-14515

Abstract

Learning from prior incidents is crucial for improving safety, particularly in the construction industry, where fatalities and injuries are frequent. High-precision classification of construction accident narratives is a laborious, time-consuming process that requires substantial domain expertise. However, automatic text classification has fallen short of expectations due to a lack of high-quality data sets, inadequate semantic interpretation, and primitive model architectures. To address these issues, this study developed a state-of-the-art text classification (TC) model to extract construction knowledge and classify construction accident narratives into predefined categories. The architecture of the TC deep-learning model was built on the pretrained instruction-based omnifarious representations (INSTRUCTOR) model. A class-imbalance treatment (CIT) technique incorporating focal loss and weighted random sampling was embedded to make the model concentrate on hard samples and minority classes. The retrained and fine-tuned INSTRUCTOR-CIT model achieved an F1 score of 82.22% on a benchmark data set containing 1,000 accident narratives from the Occupational Safety and Health Administration (OSHA). On a larger benchmark data set of 4,770 OSHA accident narratives labeled by another official system, the model achieved an F1 score of 94.84%, highlighting its generality. Furthermore, the experimental results demonstrated that the model outperformed existing methods, requiring less preprocessing while achieving higher accuracy. Finally, the contribution to construction project management was discussed, with a focus on improving unstructured data management in the construction industry. The findings of this study contribute to effective management practices and help construction professionals focus on value-added tasks such as decision making and corrective action planning.
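The two ingredients of the class-imbalance treatment named in the abstract, focal loss and weighted random sampling, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `focal_loss` and `inverse_frequency_weights`, the focusing parameter `gamma=2.0`, the inverse-frequency weighting scheme, and the toy labels are all assumptions made for illustration, since the abstract does not state these details.

```python
import math
import random
from collections import Counter


def focal_loss(probs, target, gamma=2.0):
    """Focal loss for one sample: -(1 - p_t)^gamma * log(p_t).

    probs  : predicted class probabilities for the sample
    target : index of the true class
    gamma  : focusing parameter (assumed value; gamma=0 gives cross-entropy)
    """
    p_t = probs[target]
    return -((1.0 - p_t) ** gamma) * math.log(p_t)


def inverse_frequency_weights(labels):
    """Per-sample weights so that each class is drawn about equally often."""
    counts = Counter(labels)
    return [1.0 / counts[y] for y in labels]


# Toy labels, invented for illustration only (not taken from the OSHA data sets).
labels = ["falls"] * 8 + ["electrocution"] * 2
weights = inverse_frequency_weights(labels)

# Weighted random sampling with replacement: minority items are picked more often.
rng = random.Random(0)
batch = rng.choices(labels, weights=weights, k=1000)

easy = focal_loss([0.9, 0.1], target=0)  # confident, correct prediction
hard = focal_loss([0.3, 0.7], target=0)  # poorly classified sample
```

In this sketch the hard sample contributes a far larger loss than the easy one, and the resampled batch is roughly balanced between the two classes even though the original labels are split 8 to 2. Those are the two effects the abstract attributes to the CIT technique.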

Data Availability Statement

Some or all data, models, or code generated or used during the study are available in a repository online in accordance with funder data retention policies. The code is available at GitHub (n.d.-a). Data sets 1 and 2 can be downloaded from GitHub (n.d.-b, c), respectively.

Acknowledgments

This research was supported by the Fundamental Research Funds for the Central Universities (2020JBW007) and the Beijing Humanities and Social Science Development Foundation (20GLC049).

References

Baek, S., W. Jung, and S. H. Han. 2021. “A critical review of text-based research in construction: Data source, analysis method, and implications.” Autom. Constr. 132 (Oct): 103915. https://doi.org/10.1016/j.autcon.2021.103915.
