Technical Papers
Dec 1, 2022

A Deep-Learning Classification Framework for Reducing Communication Errors in Dynamic Hand Signaling for Crane Operation

Publication: Journal of Construction Engineering and Management
Volume 149, Issue 2

Abstract

Crane operators and signalmen play an integral role in the safe and efficient operation of cranes on a construction site. Operating a crane is a complex and demanding task that requires careful coordination between operator and signalmen in order to avoid errors that could have dire consequences, including serious injury or loss of life. Therefore, special considerations should be taken to mitigate communication errors that could occur between the two parties. Technology can play an important role in enhancing communication, and, with recent advancements in technology, human–computer interaction has emerged as an active area of research within the field of computer vision. This paper presents a framework that integrates the YOLOv4 model (for object detection) and the long short-term memory (LSTM) model (a recurrent neural network) for dynamic hand signal classification in real time. The first step is the creation of a crane signalman dynamic hand signal data set with 18 classes. The YOLOv4 model is then customized for this application by modifying the activation function. Three modified YOLOv4 models are then integrated with the LSTM model. The modified YOLOv4 integrated with LSTM is found to achieve a maximum overall accuracy of 94.8% with an inference time of 55.1 frames per second. The model is further validated with real-time dynamic hand signal classification, achieving an accuracy of 93.5% and an inference time of 44 frames per second. The proposed models show improved quality in classification accuracy as well as in processing speed in comparison to some of the most widely used models currently in use. The proposed novel framework can be used as another layer of communication to supplement current practice and reduce communication errors between crane signalmen and crane operators.

Get full access to this article

View all available purchase options and get full access to this article.

Data Availability Statement

Some or all data, models, or code that support the findings of this study are available from the corresponding author upon reasonable request.

References

Agarap, A. F. M. 2018. “Deep learning using rectified linear units (ReLU).” Preprint, submitted March 22, 2018. https://arxiv.org/abs/1803.08375.
Bae, J. H., N. T. Le, and J. T. Kim. 2017. “Smartphone image receiver architecture for optical camera communication.” Wireless Pers. Commun. 93 (4): 1043–1066. https://doi.org/10.1007/s11277-017-3971-3.
Beavers, J. E., F. Asce, J. R. Moore, R. Rinehart, and W. R. Schriver. 2006. “Crane-related fatalities in the construction industry.” J. Constr. Eng. Manage. 132 (9): 901–910. https://doi.org/10.1061/(ASCE)0733-9364(2006)132:9(901).
Bochkovskiy, A., C. Wang, and H. M. Liao. 2020. “YOLOv4: Optimal speed and accuracy of object detection.” Preprint, submitted April 23, 2020. https://arxiv.org/abs/2004.10934.
Boncelet, C. 2009. Image noise models: The essential guide to image processing. 1st ed. Amsterdam, Netherlands: Elsevier.
Bull, D. 2014. “Digital picture formats and representation.” In Communicating pictures, 99–132. Amsterdam, Netherlands: Elsevier.
Bust, P. D., A. G. F. Gibb, and S. Pink. 2008. “Managing construction health and safety: Migrant workers and communicating safety messages.” Saf. Sci. 46 (4): 585–602. https://doi.org/10.1016/j.ssci.2007.06.026.
Chen, Y., H. Chi, S. Kangm, and S. Hsieh. 2011. “A smart crane operations assistance system using augmented reality technology.” In Proc., 28th Int. Symp. on Automation and Robotics in Construction, ISARC 2011, 643–649. Seoul: International Association for Automation and Robotics in Construction.
Everett, B. J. G., and A. H. Slocum. 1993. “Device for improving crane.” J. Constr. Eng. Manage. 119 (1): 23–39. https://doi.org/10.1061/(ASCE)0733-9364(1993)119:1(23).
Fang, Q., H. Li, X. Luo, L. Ding, H. Luo, and C. Li. 2018a. “Computer vision aided inspection on falling prevention measures for steeplejacks in an aerial environment.” Autom. Constr. 93 (Sep): 148–164. https://doi.org/10.1016/j.autcon.2018.05.022.
Fang, Y., and Y. K. Cho. 2016. “A framework of lift virtual prototyping (LVP) approach for crane safety planning.” In Proc., 33rd Int. Symp. on Automation and Robotics in Construction, ISARC 2016, 291–297. Auburn, AL: International Association for Automation and Robotics in Construction.
Fang, Y., Y. K. Cho, F. Druso, and J. Seo. 2018b. “Assessment of operator’s situation awareness for smart operation of mobile cranes.” Autom. Constr. 85 (Jan): 65–75. https://doi.org/10.1016/j.autcon.2017.10.007.
Ghiasi, G., T. Lin, and Q. V. Le. 2018. “DropBlock: A regularization method for convolutional networks.” In Advances in neural information processing systems, 1–11. Pasadena, CA: Neural Information Processing Systems.
Greff, K., R. K. Srivastava, J. Koutník, B. R. Steunebrink, and J. Schmidhuber. 2016. “LSTM: A search space odyssey.” IEEE Trans. Neural Networks Learn. Syst. 28 (10): 2222–2232. https://doi.org/10.1109/TNNLS.2016.2582924.
Hernandez-Garcia, A., and P. Konig. 2018. “Further advantages of data augmentation on convolutional neural networks.” In Proc., Int. Conf. on Artificial Neural Networks, 95–103. New York: Springer.
Hou, X., Y. Zhang, and J. Hou. 2020. “Application of YOLO V2 in construction vehicle detection.” In Proc., Int. Conf. on Natural Computation, Fuzzy Systems and Knowledge Discovery, 1249–1256. New York: Springer.
Hu, J., X. Geo, H. Wu, and S. Gao. 2019. “Detection of workers without the helments in videos based on YOLO V3.” In Proc., 12th Int. Congress on Image and Signal Processing, BioMedical Engineering and Informatics, 1553–1560. New York: IEEE.
Huang, J., W. Zhou, Q. Zhang, H. Li, and W. Li. 2018. “Video-based sign language recognition without temporal segmentation.” In Vol. 32 of Proc., AAAI Conf. on Artificial Intelligence. Vancouver, BC, Canada: Public Knowledge Project.
Huang, Z., J. Wang, X. Fu, T. Yu, Y. Guo, and R. Wang. 2020. “DC-SPP-YOLO: Dense connection and spatial pyramid pooling based YOLO for object detection.” Inf. Sci. 522 (Jun): 241–258. https://doi.org/10.1016/j.ins.2020.02.067.
Kim, J. A., J. Y. Sung, and S. H. Park. 2020. “Comparison of Faster-RCNN, YOLO, and SSD for real-time vehicle type recognition.” In Proc., 2020 IEEE Int. Conf. on Consumer Electronics-Asia (ICCE-Asia), 1–4. New York: IEEE.
Kines, P., L. P. S. Andersen, S. Spangenberg, K. L. Mikkelsen, J. Dyreborg, and D. Zohar. 2010. “Improving construction site safety through leader-based verbal safety communication.” J. Saf. Res. 41 (5): 399–406. https://doi.org/10.1016/j.jsr.2010.06.005.
King, R. A. 2012. “Analysis of crane and lifting accidents in North America from 2004 to 2010.” Ph.D. dissertation, Dept. of Civil and Environmental Engineering, Massachusetts Institute of Technology.
Kukkala, V. K., J. Tunnell, S. Pasricha, and T. Bradley. 2018. “Advanced driver-assistance systems: A path toward autonomous vehicles.” IEEE Consum. Electron. Mag. 7 (5): 18–25. https://doi.org/10.1109/MCE.2018.2828440.
Liu, S., L. Qi, H. Qin, J. Shi, and J. Jia. 2018. “Path aggregation network for instance segmentation.” In Proc., IEEE Computer Society Conf. on Computer Vision and Pattern Recognition, 8759–8768. New York: IEEE.
Mansoor, A., S. Liu, G. M. Ali, A. Bouferguene, and M. Al-Hussein. 2020. “Conceptual framework for safety improvement in mobile cranes.” In Proc., Construction Research Congress 2020: Computer Applications, 964–971. Reston, VA: ASCE.
Mikołajczyk, A., and M. Grochowski. 2018. “Data augmentation for improving deep learning in image classification problem.” In Proc., Int. Interdisciplinary PhD Workshop (IIPhDW), 117–122. New York: IEEE.
Misra, D. 2019. “MISH: A self regularized non-monotonic activation function.” Preprint, submitted August 23, 2019. https://arxiv.org/abs/1908.08681.
Misra, S., and Y. Wu. 2019. “Machine learning assisted segmentation of scanning electron microscopy images of organic-rich shales with feature extraction and feature ranking.” Mach. Learn. Subsurface Charact. 289 (Jan): 289–313. https://doi.org/10.1016/b978-0-12-817736-5.00010-7.
Molchanov, P., X. Yang, S. Gupta, K. Kim, S. Tyree, and J. Kautz. 2016. “Online detection and classification of dynamic hand gestures with recurrent 3d convolutional neural networks.” In Proc., IEEE Conf. on Computer Vision and Pattern Recognition, 4207–4215. New York: IEEE.
Neitzel, R. L., N. S. Seixas, and K. K. Ren. 2001. “A review of crane safety in the construction industry.” Appl. Occup. Environ. Hyg. 16 (12): 1106–1117. https://doi.org/10.1080/10473220127411.
Nepal, U., and H. Eslamiat. 2022. “Comparing YOLOv3, YOLOv4 and YOLOv5 for autonomous landing spot detection in faulty UAVs.” Sensors 22 (2): 464. https://doi.org/10.3390/s22020464.
Okan, K., G. Ahmet, K. Neslihan, and R. Gerhard. 2019. “Real-time hand gesture detection and classification using convolutional neural networks.” In Proc., 14th IEEE Int. Conf. on Automatic Face & Gesture Recognition, 1–8. New York: IEEE.
OSHA (Occupational Safety and Health Administration). 2021. “Cranes and derricks in construction.” Accessed April 19, 2020. https://open.alberta.ca/dataset/757fed78-8793-40bb-a920-6f000853172b/resource/9296e033-fd12-40dc-ac86-21e5873d4161/download/4403880-part-6-cranes-hoists-and-lifting-devices.pdf.
Oyedotun, O. K., and A. Khashman. 2017. “Deep learning in vision-based static hand gesture recognition.” Neural Comput. Appl. 28 (12): 3941–3951. https://doi.org/10.1007/s00521-016-2294-8.
Park, K., H. Lee, H. Kim, J. I. Kim, H. Lee, and M. W. Pyeon. 2011. “AR-HUD system for tower crane on construction field.” In Proc., 2011 IEEE Int. Symp. on Virtual Reality Innovation, 261–266. New York: IEEE.
Pickering, C. A., K. J. Burnham, and M. J. Richardson. 2007. “A research study of hand gesture recognition technologies and applications for human vehicle interaction.” In Proc., 3rd Institution of Engineering and Technology Conf. on Automotive Electronics, 1–15. New York: IEEE.
Potter, M. C., B. Wyble, C. E. Hagmann, and E. S. McCourt. 2014. “Detecting meaning in RSVP at 13 ms per picture.” Attention Percept. Psychophysics 76 (2): 270–279. https://doi.org/10.3758/s13414-013-0605-z.
Qasim, A. B., and A. Pettirsch. 2020. “Recurrent neural networks for video object detection.” Preprint, submitted October 29, 2020. https://arxiv.org/abs/2010.15740.
Rahman, E. U., Y. Zhang, S. Ahmad, H. I. Ahmad, and S. Jobaer. 2021. “Autonomous vision-based primary distribution systems porcelain insulators inspection using UAVs.” Sensors 21 (3): 974. https://doi.org/10.3390/s21030974.
Ramachandran, P., B. Zoph, and Q. V. Le. 2017. “Searching for activation functions.” Preprint, submitted October 16, 2017. https://arxiv.org/abs/1710.05941.
Raviv, G., and A. Shapira. 2018. “Systematic approach to crane-related near-miss analysis in the construction industry.” Int. J. Construct. Manage. 18 (4): 310–320. https://doi.org/10.1080/15623599.2017.1382067.
Shapira, A., Y. Rosenfeld, and I. Mizrahi. 2008. “Vision system for tower cranes.” J. Constr. Eng. Manage. 134 (5): 320–332. https://doi.org/10.1061/(ASCE)0733-9364(2008)134:5(320).
Su, H., W. Qi, C. Yang, J. Sandoval, G. Ferrigno, and E. De Momi. 2020. “Deep neural network approach in robot tool dynamics identification for bilateral teleoperation.” IEEE Rob. Autom. Lett. 5 (2): 2943–2949. https://doi.org/10.1109/LRA.2020.2974445.
US Bureau of Labor Statistics. 2017. “Fatal occupational injuries involving cranes.” Accessed September 13, 2021. https://www.bls.gov/iif/oshwc/cfoi/cranes-2017.htm.
US Dept. of Labor. 2003. “Census of fatal occupational injuries summary, 2003.” Accessed January 19, 2022. https://stats.bls.gov/news.release/cfoi.nr0.htm.
Xu, J., Z. Li, B. Du, M. Zhang, and J. Liu. 2020. “Reluplex made more practical: Leaky ReLU.” In Proc., 2020 IEEE Symp. on Computers and Communications, 1–7. New York: IEEE.
Yin, X., Y. Chen, A. Bouferguene, H. Zaman, M. Al-Hussein, and L. Kurach. 2020. “A deep learning-based framework for an automated defect detection system for sewer pipes.” Autom. Constr. 109 (Jan): 102967. https://doi.org/10.1016/j.autcon.2019.102967.
Zavichi, A., and A. H. Behzadan. 2011. “A real time decision support system for enhanced crane operations in construction and manufacturing.” In Computing in civil engineering, 586–593. Reston, VA: ASCE.
Zekavat, P. R., and L. Bernold. 2014. “Embedded wireless communication platform addresses crane safety and efficiency.” In Proc., Construction Research Congress 2014, 309–318. Reston, VA: ASCE.
Zekavat, P. R., S. Moon, and L. E. Bernold. 2015. “Holonic construction management: Unified framework for ICT-supported process control.” J. Manage. Eng. 31 (1): A4014008 . https://doi.org/10.1061/(ASCE)ME.1943-5479.0000316.
Zhao, Q. 2011. “Cause analysis of U.S. crane-related accidents.” Ph.D. dissertation, Dept. of Building Construction, Univ. of Florida.
Zheng, Z., P. Wang, W. Liu, J. Li, R. Ye, and D. Ren. 2019. “Distance-IoU loss: Faster and better learning for bounding box regression.” In Proc., AAAI Conf. on Artificial Intelligence, 12993–13000. Vancouver, BC, Canada: Public Knowledge Project.

Information & Authors

Information

Published In

Go to Journal of Construction Engineering and Management
Journal of Construction Engineering and Management
Volume 149Issue 2February 2023

History

Received: Jun 18, 2022
Accepted: Sep 28, 2022
Published online: Dec 1, 2022
Published in print: Feb 1, 2023
Discussion open until: May 1, 2023

Permissions

Request permissions for this article.

Authors

Affiliations

Ph.D. Candidate, Dept. of Civil and Environmental Engineering, Univ. of Alberta, 116 St. & 85 Ave., Edmonton, AB, Canada T6G 2R3. ORCID: https://orcid.org/0000-0001-8520-3437. Email: [email protected]
Ph.D. Candidate, Dept. of Civil and Environmental Engineering, Univ. of Alberta, 116 St. & 85 Ave., Edmonton, AB, Canada T6G 2R3 (corresponding author). Email: [email protected]
Ph.D. Candidate, Dept. of Civil and Environmental Engineering, Univ. of Alberta, 116 St. & 85 Ave., Edmonton, AB, Canada T6G 2R3. ORCID: https://orcid.org/0000-0002-1335-6191. Email: [email protected]
Ahmed Bouferguene [email protected]
Professor, Campus Saint-Jean, Univ. of Alberta, 8406 Rue Marie-Anne Gaboury, Edmonton, AB, Canada T6G 2R3. Email: [email protected]
Professor, Dept. of Civil and Environmental Engineering, Univ. of Alberta, 116 St. & 85 Ave., Edmonton, AB, Canada T6G 2R3. ORCID: https://orcid.org/0000-0002-1774-9718. Email: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

Cited by

  • Crane Signalman Hand-Signal Classification Framework Using Sensor-Based Smart Construction Glove and Machine-Learning Algorithms, Journal of Construction Engineering and Management, 10.1061/JCEMD4.COENG-14458, 150, 8, (2024).
  • The Development of a Rebar-Counting Model for Reinforced Concrete Columns: Using an Unmanned Aerial Vehicle and Deep-Learning Approach, Journal of Construction Engineering and Management, 10.1061/JCEMD4.COENG-13686, 149, 11, (2023).
  • Scientometric analysis and critical review on the application of deep learning in the construction industry, Canadian Journal of Civil Engineering, 10.1139/cjce-2022-0379, (2022).

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share