Technical Papers
Nov 23, 2022

Extracting Worker Unsafe Behaviors from Construction Images Using Image Captioning with Deep Learning–Based Attention Mechanism

Publication: Journal of Construction Engineering and Management
Volume 149, Issue 2

Abstract

Safety in the construction industry has always been a focus of attention. Existing methods for detecting workers’ unsafe behavior relied primarily on manual inspection, which not only consumed significant time and money but also inevitably produced omissions. Current automated techniques rely only on unsafe factors of the worker’s own body to judge behavior, making it difficult to understand unsafe behaviors in complex scenes. To address these problems, this study proposed a method to automatically extract workers’ unsafe behaviors by combining information from complex scenes: image captioning based on an attention mechanism. First, three different sets of image captioning models capable of extracting key information from complex scenes were constructed using convolutional neural networks (CNNs), which are widely used in AI. Then, two datasets dedicated to the construction domain were created for method validation. Finally, three sets of experiments were conducted by combining the datasets with the three sets of models. The results showed that the method could detect a worker’s job type and output the interaction behavior between the worker and the target (unsafe behavior) based on environmental information in the construction images. This study introduced environmental information into the determination of workers’ unsafe behaviors for the first time, outputting not only the worker’s job type but also the worker’s behavior, which makes the model output better suited to ergonomic analysis.
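The abstract describes captioning models that attend over CNN image features when generating each word. As a minimal, hedged sketch of that soft-attention step (in the style of Xu et al. 2015, cited below), the following NumPy code scores each image region against the decoder's hidden state and returns a weighted context vector; all shapes, parameter names, and values are illustrative and do not reproduce the authors' implementation:

```python
import numpy as np

def soft_attention(features, hidden, W_f, W_h, v):
    """Soft attention over image regions (additive/Bahdanau-style).

    features: (k, d) CNN feature vectors for k image regions
    hidden:   (h,)   decoder hidden state at the current time step
    W_f: (d, a), W_h: (h, a), v: (a,)  learned projection parameters
    Returns (context, alpha): attention-weighted context and weights.
    """
    # Score each region jointly with the decoder state, then softmax.
    scores = np.tanh(features @ W_f + hidden @ W_h) @ v  # (k,)
    alpha = np.exp(scores - scores.max())
    alpha /= alpha.sum()                                 # weights sum to 1
    context = alpha @ features                           # (d,) context vector
    return context, alpha

# Illustrative usage with random features and parameters.
rng = np.random.default_rng(0)
features = rng.standard_normal((5, 8))   # 5 regions, 8-dim features
hidden = rng.standard_normal(16)
W_f = rng.standard_normal((8, 4))
W_h = rng.standard_normal((16, 4))
v = rng.standard_normal(4)
context, alpha = soft_attention(features, hidden, W_f, W_h, v)
```

In a full captioning model, `context` would be fed to the decoder (e.g., an LSTM) at each step, letting the generated sentence ground each word in a different part of the construction scene.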

Practical Applications

This study developed an intelligent solution for determining, against behavioral norms, whether a worker exhibited unsafe behavior in complex scenarios. The operator does not need construction safety knowledge (e.g., whether a helmet or safety belt is required, or whether the work is at height); they simply input the target image, and the model combines predefined behavioral norms, scene information, and other factors to determine what behavior (or unsafe behavior) the image contains and outputs a brief description. Descriptions can also follow fixed templates for easy management, such as “worker A wearing (not wearing) a helmet,” and these descriptions play a key role in daily management and project summaries. Using this method, managers can employ on-site equipment to automatically capture good behaviors or violations by anyone on site. This also enables efficient organization and recording, improving managers’ efficiency.
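The fixed-template descriptions mentioned above can be rendered from the model's structured outputs. A minimal sketch, assuming hypothetical field names (`worker_id`, `job_type`, `wears_helmet`, `at_height`) that are not from the paper:

```python
def describe(worker_id, job_type, wears_helmet, at_height):
    """Render a fixed-template safety description for one worker.

    All arguments are illustrative stand-ins for fields a captioning
    or detection model might output; this is not the authors' template.
    """
    helmet = "wearing" if wears_helmet else "not wearing"
    desc = f"worker {worker_id} ({job_type}) {helmet} a helmet"
    # Flag a violation when a predefined norm is broken.
    if at_height and not wears_helmet:
        desc += "; unsafe: working at height without a helmet"
    return desc

print(describe("A", "welder", False, True))
```

Keeping the output templated, rather than free-form, makes the daily logs uniform and easy to aggregate in project summaries.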


Data Availability Statement

Some or all data, models, or codes that support the findings of this study are available from the corresponding author upon reasonable request. (The dataset and model code are available from the first author upon request.)

Acknowledgments

This study was supported by the key R&D program of Shandong Province. The authors are very grateful to all laboratory staff for their help and to Hongyu Chang for his online guidance.

References

Ali, R., J. H. Chuah, M. S. Abu Talip, N. Mokhtar, and M. A. Shoaib. 2022. “Structural crack detection using deep convolutional neural networks.” Autom. Constr. 133 (Jan): 103989. https://doi.org/10.1016/j.autcon.2021.103989.
An, X. H., L. Zhou, Z. G. Liu, C. Z. Wang, P. F. Li, and Z. W. Li. 2021. “Dataset and benchmark for detecting moving objects in construction sites.” Autom. Constr. 122 (Feb): 103482. https://doi.org/10.1016/j.autcon.2020.103482.
Anderson, P., X. He, C. Buehler, D. Teney, M. Johnson, S. Gould, and L. Zhang. 2017. “Bottom-up and top-down attention for image captioning and visual question answering.” In Proc., IEEE Conf. on Computer Vision and Pattern Recognition, 6077–6086. New York: IEEE.
Assadzadeh, A., M. Arashpour, I. Brilakis, T. Ngo, and E. Konstantinou. 2022. “Vision-based excavator pose estimation using synthetically generated datasets with domain randomization.” Autom. Constr. 134 (Feb): 104089. https://doi.org/10.1016/j.autcon.2021.104089.
Ayhan, B. U., and O. B. Tokdemir. 2019. “Predicting the outcome of construction incidents.” Saf. Sci. 113 (11): 91–104. https://doi.org/10.1016/j.ssci.2018.11.001.
Cheng, J. C. P., and M. Z. Wang. 2018. “Automated detection of sewer pipe defects in closed-circuit television images using deep learning techniques.” Autom. Constr. 95 (Nov): 155–171. https://doi.org/10.1016/j.autcon.2018.08.006.
Czerniawski, T., and F. Leite. 2020. “Automated digital modeling of existing buildings: A review of visual object recognition methods.” Autom. Constr. 113 (Jun): 103131. https://doi.org/10.1016/j.autcon.2020.103131.
Ding, L. Y., W. L. Fang, H. B. Luo, P. E. D. Love, B. T. Zhong, and X. Ouyang. 2018. “A deep hybrid learning model to detect unsafe behavior: Integrating convolution neural networks and long short-term memory.” Autom. Constr. 86 (Jun): 118–124. https://doi.org/10.1016/j.autcon.2017.11.002.
Dutta, S., A. Shi, R. Choudhary, Z. Zhang, A. Jain, and S. Misailovic. 2020. “Detecting flaky tests in probabilistic and machine learning applications.” In Proc., 29th ACM SIGSOFT Int. Symp. on Software Testing and Analysis, 211–224. New York: Association for Computing Machinery. https://doi.org/10.1145/3395363.3397366.
Fang, Q., H. Li, X. C. Luo, L. Y. Ding, H. B. Luo, and C. Q. Li. 2018a. “Computer vision aided inspection on falling prevention measures for steeplejacks in an aerial environment.” Autom. Constr. 93 (Sep): 148–164. https://doi.org/10.1016/j.autcon.2018.05.022.
Fang, Q., H. Li, X. C. Luo, L. Y. Ding, H. B. Luo, T. M. Rose, and W. P. An. 2018b. “Detecting non-hardhat-use by a deep learning method from far-field surveillance videos.” Autom. Constr. 85 (Jan): 1–9. https://doi.org/10.1016/j.autcon.2017.09.018.
Gong, J., C. H. Caldas, and C. Gordon. 2011. “Learning and classifying actions of construction workers and equipment using Bag-of-Video-Feature-Words and Bayesian network models.” Adv. Eng. Inf. 25 (4): 771–782. https://doi.org/10.1016/j.aei.2011.06.002.
Han, S., S. Lee, and F. Pena-Mora. 2013. “Vision-based detection of unsafe actions of a construction worker: Case study of ladder climbing.” J. Comput. Civ. Eng. 27 (6): 635–644. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000279.
He, K., X. Zhang, S. Ren, and J. Sun. 2016. “Deep residual learning for image recognition.” In Proc., 2016 IEEE Conf. on Computer Vision and Pattern Recognition (CVPR). New York: IEEE.
Hochreiter, S., and J. Schmidhuber. 1997. “Long short-term memory.” Neural Comput. 9 (8): 1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735.
Hou, X., Y. Zeng, and J. Xue. 2020. “Detecting structural components of building engineering based on deep-learning method.” J. Constr. Eng. Manage. 146 (2): 04019097. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001751.
Huang, G., Z. Liu, V. Laurens, and K. Q. Weinberger. 2016. “Densely connected convolutional networks.” In Proc., IEEE Conf. on Computer Vision and Pattern Recognition, 4700–4708. New York: IEEE.
Khan, M., R. Khalid, S. Anjum, N. Khan, S. Cho, and C. Park. 2022. “Tag and IoT based safety hook monitoring for prevention of falls from height.” Autom. Constr. 136 (Apr): 104153. https://doi.org/10.1016/j.autcon.2022.104153.
Kim, H., K. Kim, and H. Kim. 2016. “Data-driven scene parsing method for recognizing construction site objects in the whole image.” Autom. Constr. 71 (2): 271–282. https://doi.org/10.1016/j.autcon.2016.08.018.
Kong, T., W. L. Fang, P. E. D. Love, H. B. Luo, S. J. Xu, and H. Li. 2021. “Computer vision and long short-term memory: Learning to predict unsafe behaviour in construction.” Adv. Eng. Inf. 50 (Apr): 101400. https://doi.org/10.1016/j.aei.2021.101400.
Konstantinou, E., and I. Brilakis. 2018. “Matching construction workers across views for automated 3D vision tracking on-site.” J. Constr. Eng. Manage. 144 (7): 04018061. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001508.
Lee, K., and S. Han. 2021. “Convolutional neural network modeling strategy for fall-related motion recognition using acceleration features of a scaffolding structure.” Autom. Constr. 130 (Jun): 103857. https://doi.org/10.1016/j.autcon.2021.103857.
Li, T. S., M. Alipour, and D. K. Harris. 2021. “Mapping textual descriptions to condition ratings to assist bridge inspection and condition assessment using hierarchical attention.” Autom. Constr. 129 (Apr): 103801. https://doi.org/10.1016/j.autcon.2021.103801.
Lin, T.-Y., M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollár, and C. L. Zitnick. 2014. “Microsoft COCO: Common objects in context.” In Proc., European Conf. on Computer Vision (ECCV), 740–755. Cham, Switzerland: Springer.
Liu, H., G. B. Wang, T. Huang, P. He, M. Skitmore, and X. C. Luo. 2020. “Manifesting construction activity scenes via image captioning.” Autom. Constr. 119 (5): 103334. https://doi.org/10.1016/j.autcon.2020.103334.
Lu, J., C. Xiong, D. Parikh, and R. Socher. 2017. “Knowing when to look: Adaptive attention via a visual sentinel for image captioning.” In Proc., 30th IEEE Conf. on Computer Vision and Pattern Recognition (CVPR 2017), 3242–3250. New York: IEEE.
Lu, J., J. Yang, D. Batra, and D. Parikh. 2018. “Neural baby talk.” In Proc., IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 7219–7228. New York: IEEE.
Luo, H., M. Wang, P. K.-Y. Wong, and J. C. P. Cheng. 2020. “Full body pose estimation of construction equipment using computer vision and deep learning techniques.” Autom. Constr. 110 (Feb): 103016. https://doi.org/10.1016/j.autcon.2019.103016.
Luo, H., M. Wang, P. K.-Y. Wong, J. Tang, and J. C. P. Cheng. 2021. “Construction machine pose prediction considering historical motions and activity attributes using gated recurrent unit (GRU).” Autom. Constr. 121 (Jan): 103444. https://doi.org/10.1016/j.autcon.2020.103444.
Luo, X., H. Li, D. Cao, F. Dai, J. Seo, and S. Lee. 2018a. “Recognizing diverse construction activities in site images via relevance networks of construction-related objects detected by convolutional neural networks.” J. Comput. Civ. Eng. 32 (3): 04018012. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000756.
Luo, X. C., H. Li, D. P. Cao, Y. T. Yu, X. C. Yang, and T. Huang. 2018b. “Towards efficient and objective work sampling: Recognizing workers’ activities in site surveillance videos with two-stream convolutional networks.” Autom. Constr. 94 (Oct): 360–370. https://doi.org/10.1016/j.autcon.2018.07.011.
Ministry of Housing and Urban-Rural Development. 2022. General specifications for construction scaffolding. Beijing: Ministry of Housing and Urban-Rural Development.
Mohajeri, M., A. Ardeshir, H. Malekitabar, and S. Rowlinson. 2021. “Structural model of internal factors influencing the safety behavior of construction workers.” J. Constr. Eng. Manage. 147 (11): 04021156. https://doi.org/10.1061/(ASCE)CO.1943-7862.0002182.
Nath, N. D., A. H. Behzadan, and S. G. Paal. 2020. “Deep learning for site safety: Real-time detection of personal protective equipment.” Autom. Constr. 112 (12): 103085. https://doi.org/10.1016/j.autcon.2020.103085.
Paneru, S., and I. Jeelani. 2021. “Computer vision applications in construction: Current state, opportunities & challenges.” Autom. Constr. 132 (4): 103940. https://doi.org/10.1016/j.autcon.2021.103940.
Papineni, K., S. Roukos, T. Ward, and W.-J. Zhu. 2002. “BLEU: A method for automatic evaluation of machine translation.” In Proc., 40th Annual Meeting of the Association for Computational Linguistics (ACL 2002), 311–318. Stroudsburg, PA: Association for Computational Linguistics.
Pi, Y. L., N. D. Nath, and A. H. Behzadan. 2021. “Detection and semantic segmentation of disaster damage in UAV footage.” J. Comput. Civ. Eng. 35 (2): 04020063. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000947.
Poh, C. Q. X., C. U. Ubeynarayana, and Y. M. Goh. 2018. “Safety leading indicators for construction sites: A machine learning approach.” Autom. Constr. 93 (Mar): 375–386. https://doi.org/10.1016/j.autcon.2018.03.022.
Pour Rahimian, F., S. Seyedzadeh, S. Oliver, S. Rodriguez, and N. Dawood. 2020. “On-demand monitoring of construction projects through a game-like hybrid application of BIM and machine learning.” Autom. Constr. 110 (Feb): 14. https://doi.org/10.1016/j.autcon.2019.103012.
Rahman, A., Z. Y. Wu, and R. Kalfarisi. 2021. “Semantic deep learning integrated with RGB feature-Based rule optimization for facility surface corrosion detection and evaluation.” J. Comput. Civ. Eng. 35 (6): 04021018. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000982.
Redmon, J., S. Divvala, R. Girshick, and A. Farhadi. 2016. “You only look once: Unified, real-time object detection.” In Proc., IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 779–788. New York: IEEE.
Rennie, S. J., E. Marcheret, Y. Mroueh, J. Ross, and V. Goel. 2016. “Self-critical sequence training for image captioning.” In Proc., IEEE Conf. on Computer Vision and Pattern Recognition, 7008–7024. New York: IEEE.
Ryu, J., A. Alwasel, C. T. Haas, and E. Abdel-Rahman. 2020. “Analysis of relationships between body load and training, work methods, and work rate: Overcoming the Novice Mason’s risk hump.” J. Constr. Eng. Manage. 146 (8): 04020097. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001889.
Simonyan, K., and A. Zisserman. 2014. “Very deep convolutional networks for large-scale image recognition.” Preprint, submitted September 4, 2014. https://arxiv.org/abs/1409.1556.
Sun, S., C. Luo, and J. Chen. 2017. “A review of natural language processing techniques for opinion mining systems.” Inf. Fusion 36 (Apr): 10–25. https://doi.org/10.1016/j.inffus.2016.10.004.
Szegedy, C., W. Liu, Y. Jia, P. Sermanet, and A. Rabinovich. 2014. “Going deeper with convolutions.” In Proc., IEEE Conf. on Computer Vision and Pattern Recognition, 1–9. New York: IEEE.
Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin. 2017. “Attention is all you need.” Adv. Neur. Inf. Process Syst. 2017 (1): 30. https://doi.org/10.48550/arXiv.1706.03762.
Wang, C. F., S. E. Antos, and L. M. Triveno. 2021. “Automatic detection of unreinforced masonry buildings from street view images using deep learning-based image segmentation.” Autom. Constr. 132 (Apr): 103968. https://doi.org/10.1016/j.autcon.2021.103968.
Xiao, B., H. R. Xiao, J. W. Wang, and Y. Chen. 2022. “Vision-based method for tracking workers by integrating deep learning instance segmentation in off-site construction.” Autom. Constr. 136 (Feb): 104148. https://doi.org/10.1016/j.autcon.2022.104148.
Xiao, B., and Z. H. Zhu. 2018. “Two-dimensional visual tracking in construction scenarios: A comparative study.” J. Comput. Civ. Eng. 32 (3): 4018006. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000738.
Xu, J., and L. Ding. 2017. “A review of metro construction in China: Organization, market, cost, safety, and schedule.” Front. Eng. Manage. 4 (1): 4–19. https://doi.org/10.15302/j-fem-2017015.
Xu, J. Q., and H. S. Yoon. 2019. “Vision-based estimation of excavator manipulator pose for automated grading control.” Autom. Constr. 98 (4): 122–131. https://doi.org/10.1016/j.autcon.2018.11.022.
Xu, K., J. L. Ba, R. Kiros, K. Cho, A. Courville, R. Salakhutdinov, R. S. Zemel, and Y. Bengio. 2015. “Show, attend and tell: Neural image caption generation with visual attention.” Proc. Mach. Learn Res. 37 (4): 2048–2057. https://doi.org/10.48550/arXiv.1502.03044.
Yan, X. Z., H. Li, C. Wang, J. Seo, H. Zhang, and H. W. Wang. 2017. “Development of ergonomic posture recognition technique based on 2D ordinary camera for construction hazard prevention through view-invariant features in 2D skeleton motion.” Adv. Eng. Inf. 34 (11): 152–163. https://doi.org/10.1016/j.aei.2017.11.001.
Yang, J., Z. K. Shi, and Z. Y. Wu. 2016. “Vision-based action recognition of construction workers using dense trajectories.” Adv. Eng. Inf. 30 (3): 327–336. https://doi.org/10.1016/j.aei.2016.04.009.
Young, P., A. Lai, M. Hodosh, and J. Hockenmaier. 2014. “From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions.” Trans. Assoc. Comput. Ling. 2 (Dec): 67–78. https://doi.org/10.1162/tacl_a_00166.
Yu, Y., W. Umer, X. Yang, and M. F. Antwi-Afari. 2021. “Posture-related data collection methods for construction workers: A review.” Autom. Constr. 124 (Apr): 103538. https://doi.org/10.1016/j.autcon.2020.103538.
Yu, Y. T., H. Li, W. Umer, C. Dong, X. C. Yang, M. Skitmore, and A. Y. L. Wong. 2019. “Automatic biomechanical workload estimation for construction workers by computer vision and smart insoles.” J. Comput. Civ. Eng. 33 (3): 04019010. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000827.
Zhang, H., X. Z. Yan, and H. Li. 2018. “Ergonomic posture recognition using 3D view-invariant features from single ordinary camera.” Autom. Constr. 94 (4): 1–10. https://doi.org/10.1016/j.autcon.2018.05.033.
Zhang, M., and S. Ge. 2022. “Vision- and trajectory-based dynamic collision prewarning mechanism for tower cranes.” J. Constr. Eng. Manage. 148 (7): 04022057. https://doi.org/10.1061/(ASCE)CO.1943-7862.0002309.

Information & Authors

Published In

Journal of Construction Engineering and Management
Volume 149, Issue 2, February 2023

History

Received: Nov 28, 2021
Accepted: Sep 15, 2022
Published online: Nov 23, 2022
Published in print: Feb 1, 2023
Discussion open until: Apr 23, 2023


Authors

Affiliations

Graduate Student, Dept. of Civil Engineering, Ocean Univ. of China, Qingdao 266100, China. ORCID: https://orcid.org/0000-0003-4247-0951. Email: [email protected]
Lecturer, Dept. of Civil Engineering, Ocean Univ. of China, Qingdao 266100, China (corresponding author). ORCID: https://orcid.org/0000-0002-1122-3336. Email: [email protected]
Graduate Student, Dept. of Civil Engineering, Ocean Univ. of China, Qingdao 266100, China. ORCID: https://orcid.org/0000-0002-3937-0290. Email: [email protected]

Cited by

  • Explainable Image Captioning to Identify Ergonomic Problems and Solutions for Construction Workers, Journal of Computing in Civil Engineering, 10.1061/JCCEE5.CPENG-5744, 38, 4, (2024).
  • Bi-Directional Image-to-Text Mapping for NLP-Based Schedule Generation and Computer Vision Progress Monitoring, Construction Research Congress 2024, 10.1061/9780784485262.084, (826-835), (2024).

