Technical Papers
Apr 29, 2021

Multitask Learning Method for Detecting the Visual Focus of Attention of Construction Workers

Publication: Journal of Construction Engineering and Management
Volume 147, Issue 7

Abstract

The visual focus of attention (VFOA) of construction workers is a critical cue for recognizing entity interactions, which in turn facilitates the interpretation of workers’ intentions, the prediction of their movements, and the comprehension of the jobsite context. The increasing use of construction surveillance cameras provides a cost-efficient way to estimate workers’ VFOA from information-rich images. However, the low resolution of these images poses a great challenge to detecting facial features and gaze directions. Recognizing that body and head orientations provide strong hints for inferring workers’ VFOA, this study proposes to represent the VFOA as a collection of body orientations, body poses, head yaws, and head pitches, and designs a convolutional neural network (CNN)-based multitask learning (MTL) framework to automatically estimate workers’ VFOA from low-resolution construction images. The framework is composed of two modules. In the first module, a Faster region-based CNN (Faster R-CNN) object detector detects and extracts workers’ full-body images, which serve as the single input to the CNN-MTL model in the second module. In the second module, VFOA estimation is formulated as a multitask image classification problem in which four classification tasks (body orientation, body pose, head yaw, and head pitch) are jointly learned by the newly designed CNN-MTL model. Construction videos were used to train and test the proposed framework. The results show that the proposed CNN-MTL model achieves accuracies of 0.91, 0.95, 0.86, and 0.83 in body orientation, body pose, head yaw, and head pitch classification, respectively. Compared with conventional single-task learning, the MTL method reduces training time by almost 50% without compromising accuracy.
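The core idea of the second module (a shared representation feeding four jointly trained classification heads, with per-task losses combined into one training objective) can be sketched in miniature. The snippet below is an illustrative sketch, not the authors' implementation: it uses NumPy, a random feature vector standing in for the output of the shared CNN backbone, linear heads in place of the paper's CNN classifiers, and assumed class counts per task. It shows only the hard-parameter-sharing MTL pattern of summing per-task cross-entropy losses over one shared representation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumed class counts for the four tasks (illustrative only):
# body orientation, body pose, head yaw, head pitch.
TASKS = {"body_orientation": 8, "body_pose": 3, "head_yaw": 5, "head_pitch": 3}
FEATURE_DIM = 64  # stands in for the shared CNN backbone's output size

# One linear classification head per task, all reading the SAME shared features.
heads = {name: rng.normal(0.0, 0.1, size=(FEATURE_DIM, k)) for name, k in TASKS.items()}

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)  # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def multitask_loss(shared_features, labels):
    """Sum of per-task cross-entropy losses over one shared representation."""
    total = 0.0
    for name, W in heads.items():
        probs = softmax(shared_features @ W)                      # (batch, n_classes)
        picked = probs[np.arange(len(labels[name])), labels[name]]
        total += -np.log(picked).mean()                           # cross-entropy, this task
    return total

# Toy batch: pretend these features came from the shared CNN trunk.
feats = rng.normal(size=(4, FEATURE_DIM))
labels = {name: rng.integers(0, k, size=4) for name, k in TASKS.items()}
loss = multitask_loss(feats, labels)
print(f"summed multitask loss over {len(heads)} heads: {loss:.3f}")
```

Because the backbone parameters are shared across all four tasks, one forward and backward pass updates them for every task at once, which is why the paper can report a training-time reduction relative to training four single-task models.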


Data Availability Statement

Some or all data, models, or code generated or used during the study are available from the corresponding author by request, including Python codes for data processing and multitask image classification.

Acknowledgments

This research was partially funded by the US National Science Foundation (NSF) via Grant Nos. 1850008 and 2038967. The authors gratefully acknowledge NSF’s support. Any opinions, findings, recommendations, and conclusions in this paper are those of the authors and do not necessarily reflect the views of NSF, the University of Texas at San Antonio, the University of Tennessee, Knoxville, or Purdue University.


Information & Authors

History

Received: Aug 23, 2020
Accepted: Jan 11, 2021
Published online: Apr 29, 2021
Published in print: Jul 1, 2021
Discussion open until: Sep 29, 2021


Authors

Affiliations

Assistant Professor, Dept. of Construction Science, Univ. of Texas at San Antonio, 501 W César E Chávez Blvd., San Antonio, TX 78207 (corresponding author). ORCID: https://orcid.org/0000-0001-6110-5293. Email: [email protected]
Ph.D. Student, Lyles School of Civil Engineering, Purdue Univ., 550 Stadium Mall Dr., West Lafayette, IN 47907. Email: [email protected]
Ph.D. Student, Lyles School of Civil Engineering, Purdue Univ., 550 Stadium Mall Dr., West Lafayette, IN 47907. Email: [email protected]
Shuai Li, Ph.D., A.M.ASCE [email protected]
Assistant Professor, Dept. of Civil and Environmental Engineering, Univ. of Tennessee, 851 Neyland Dr., Knoxville, TN 37996. Email: [email protected]
Hubo Cai, Ph.D., M.ASCE [email protected]
Professor, Lyles School of Civil Engineering, Purdue Univ., 550 Stadium Mall Dr., West Lafayette, IN 47907. Email: [email protected]
