Technical Papers
Aug 1, 2022

A Novel and Intelligent Safety-Hazard Classification Method with Syntactic and Semantic Features for Large-Scale Construction Projects

Publication: Journal of Construction Engineering and Management
Volume 148, Issue 10

Abstract

To improve the efficiency of safety management, it is important to classify massive and complex construction site safety hazard texts in large-scale projects. High-precision safety hazard text classification is a lengthy and challenging process. Most existing safety hazard text classification methods capture semantic information using machine learning or deep learning, ignoring the syntactic dependency between words. However, syntactic dependency contains rich structural information that is useful to alleviate information loss and enrich text features. To address these issues, this study proposes a graph structure–based hybrid deep learning method to achieve the automatic classification of large-scale project safety hazard texts. The method uses syntactic dependency and Bidirectional Encoder Representation from Transformers to express the syntactic structure and semantic information of text, and a graph structure fusing the syntactic structure and semantic information is constructed to quantify text information. Further, an encoding-decoding mechanism is built using a graph convolutional neural network and bidirectional long short-term memory to address graph structure data and classify safety hazard texts. Our proposed method is used to classify hydraulic engineering construction safety hazard texts, and the classification accuracy reaches 86.56%. Meanwhile, the experimental results demonstrate that our model achieves superior performance compared to existing methods. This proves the ability of our model to capture and analyze text information and verifies the reliability and effectiveness of this method in large-scale project safety hazard management.

Get full access to this article

View all available purchase options and get full access to this article.

Data Availability Statement

Data generated and analyzed during this study are available from the corresponding author by request.

Acknowledgments

This research was supported by the National Natural Science Foundation of China (Grant 52179139) and the Open Fund of Hubei Key Laboratory of Construction and Management in Hydropower Engineering (Grant 2020KSD05).

References

Alam, K. M., N. Siddique, and H. Adeli. 2020. “Dynamic ensemble learning algorithm for neural networks.” Neural Comput. Appl. 32 (12): 8675–8690. https://doi.org/10.1007/s00521-019-04359-7.
Bai, Y., C. Li, Z. Lin, Y. Wu, Y. Miao, Y. Liu, and Y. Xu. 2021. “Efficient data loader for fast sampling-based GNN training on large graphs.” IEEE Trans. Parallel Distrib. Syst. 32 (10): 2541–2556. https://doi.org/10.1109/TPDS.2021.3065737.
Baker, H., M. R. Hallowell, and A. J.-P. Tixier. 2020. “Automatically learning construction injury precursors from text.” Autom. Constr. 118 (Oct): 103145. https://doi.org/10.1016/j.autcon.2020.103145.
Barnes, J., R. Kurtz, S. Oepen, L. Ovrelid, and E. Velldal. 2021. “Structured sentiment analysis as dependency graph parsing.” In Proc., Joint Conf. of 59th Annual Meeting of the Association-for-Computational-Linguistics (ACL)/11th Int. Joint Conf. on Natural Language Processing (IJCNLP)/6th Workshop on Representation Learning for NLP (RepL4NLP), 3387–3402. Stroudsburg, PA: Association for Computational Linguistics.
Chen, S., J. Xi, Y. Chen, and J. Zhao. 2022. “Association mining of near misses in hydropower engineering construction based on convolutional neural network text classification.” Comput. Intell. Neurosci. 2022 (Jan): 1–16. https://doi.org/10.1155/2022/4851615.
Cheng, M.-Y., D. Kusoemo, and R. A. Gosno. 2020. “Text mining-based construction site accident classification using hybrid supervised machine learning.” Autom. Constr. 118 (Oct): 103265. https://doi.org/10.1016/j.autcon.2020.103265.
Chi, N.-W., K.-Y. Lin, N. El-Gohary, and S.-H. Hsieh. 2016. “Evaluating the strength of text classification categories for supporting construction field inspection.” Autom. Constr. 64 (Apr): 78–88. https://doi.org/10.1016/j.autcon.2016.01.001.
Chokor, A., H. Naganathan, W. K. Chong, and M. El Asmar. 2016. “Analyzing Arizona OSHA injury reports using unsupervised machine learning.” Procedia Eng. 145 (Jan): 1588–1593. https://doi.org/10.1016/j.proeng.2016.04.200.
Devlin, J., M. W. Chang, K. Lee, and K. Toutanova. 2018. “Bert: Pre-training of deep bidirectional transformers for language understanding.” Preprint, submitted October 11, 2018. https://arxiv.org/abs/1810.04805.
Ding, L. Y., and H. Li. 2013. “Information technologies in safety management of large-scale infrastructure projects.” Autom. Constr. 34 (Sep): 1–2. https://doi.org/10.1016/j.autcon.2012.10.016.
Fang, W., H. Luo, S. Xu, P. E. D. Love, Z. Lu, and C. Ye. 2020. “Automated text classification of near-misses from safety reports: An improved deep learning approach.” Adv. Eng. Inf. 44 (Apr): 101060. https://doi.org/10.1016/j.aei.2020.101060.
Feng, D., and H. Chen. 2021. “A small samples training framework for deep Learning-based automatic information extraction: Case study of construction accident news reports analysis.” Adv. Eng. Inf. 47 (Jan): 101256. https://doi.org/10.1016/j.aei.2021.101256.
Gao, W., and H. Huang. 2021. “A gating context-aware text classification model with BERT and graph convolutional networks.” J. Intell. Fuzzy Syst. 40 (3): 4331–4343. https://doi.org/10.3233/JIFS-201051.
Goh, Y. M., and C. U. Ubeynarayana. 2017. “Construction accident narrative classification: An evaluation of text mining techniques.” Accid. Anal. Prev. 108 (Nov): 122–130. https://doi.org/10.1016/j.aap.2017.08.026.
Guo, Q., X. Qiu, X. Xue, and Z. Zhang. 2021. “Syntax-guided text generation via graph neural network.” Sci. China Inf. Sci. 64 (5): 1–10. https://doi.org/10.1007/s11432-019-2740-1.
Han, Y., Y. Diao, Z. Yin, R. Jin, J. Kangwa, and O. J. Ebohon. 2021. “Immersive technology-driven investigations on influence factors of cognitive load incurred in construction site hazard recognition, analysis and decision making.” Adv. Eng. Inf. 48 (Apr): 101298. https://doi.org/10.1016/j.aei.2021.101298.
Hong, Y., H. Xie, G. Bhumbra, and I. Brilakis. 2021. “Comparing natural language processing methods to cluster construction schedules.” J. Constr. Eng. Manage. 147 (10): 04021136. https://doi.org/10.1061/(ASCE)CO.1943-7862.0002165.
Hu, G., G. Lu, and Y. Zhao. 2021. “FSS-GCN: A graph convolutional networks with fusion of semantic and structure for emotion cause analysis.” Knowl.-Based Syst. 212 (Jan): 106584. https://doi.org/10.1016/j.knosys.2020.106584.
Kim, T., and S. Chi. 2019. “Accident case retrieval and analyses: Using natural language processing in the construction industry.” J. Constr. Eng. Manage. 145 (3): 04019004. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001625.
Ko, T., and H. D. Jeong. 2020. “Syntactic approach to extracting key elements of work modification cause in change-order documents.” In Proc., Construction Research Congress (CRC) on Construction Research and Innovation to Transform Society, 134–142. Tucson: Construct Res Council.
Lecun, Y., Y. Bengio, and G. Hinton. 2015. “Deep learning.” Nature 521 (7553): 436–444. https://doi.org/10.1038/nature14539.
Li, R., L. Wang, Z. Jiang, D. Liu, M. Zhao, and X. Lu. 2021a. “Incremental BERT with commonsense representations for multi-choice reading comprehension.” Multimedia Tools Appl. 80 (21–23): 32311–32333. https://doi.org/10.1007/s11042-021-11197-0.
Li, X., M. Cui, J. Li, R. Bai, Z. Lu, and U. Aickelin. 2021b. “A hybrid medical text classification framework: Integrating attentive rule construction and neural network.” Neurocomputing 443 (Jul): 345–355. https://doi.org/10.1016/j.neucom.2021.02.069.
Lin, J.-R., Z.-Z. Hu, J.-L. Li, and L.-M. Chen. 2020. “Understanding on-site inspection of construction projects based on keyword extraction and topic modeling.” IEEE Access 8 (Nov): 198503–198517. https://doi.org/10.1109/ACCESS.2020.3035214.
Lin, P., P. Wei, Q. Fan, and W. Chen. 2019. “CNN model for mining safety hazard data from a construction site.” [In Chinese.] J. Tsinghua Univ. 59 (8): 628–634. https://doi.org/10.16511/j.cnki.qhdxxb.2019.26.008.
Liu, J., Z. S. Y. Wong, H.-Y. So, and K. L. Tsui. 2021. “Evaluating resampling methods and structured features to improve fall incident report identification by the severity level.” J. Am. Med. Inf. Assoc. 28 (8): 1756–1764. https://doi.org/10.1093/jamia/ocab048.
Lu, J., J. Xuan, G. Zhang, and X. Luo. 2018. “Structural property-aware multilayer network embedding for latent factor analysis.” Pattern Recognit. 76 (Apr): 228–241. https://doi.org/10.1016/j.patcog.2017.11.004.
Park, C., J. Park, and S. Park. 2020. “AGCN: Attention-based graph convolutional networks for drug-drug interaction extraction.” Expert Syst. Appl. 159 (Nov): 113538. https://doi.org/10.1016/j.eswa.2020.113538.
Qiu, Z., Q. Liu, X. Li, J. Zhang, and Y. Zhang. 2021. “Construction and analysis of a coal mine accident causation network based on text mining.” Process Saf. Environ. Prot. 153 (Sep): 320–328. https://doi.org/10.1016/j.psep.2021.07.032.
Ren, Q., M. Li, H. Li, and Y. Shen. 2021. “A novel deep learning prediction model for concrete dam displacements using interpretable mixed attention mechanism.” Adv. Eng. Inf. 50 (Oct): 101407. https://doi.org/10.1016/j.aei.2021.101407.
Salama, D. M., and N. M. El-Gohary. 2016. “Semantic text classification for supporting automated compliance checking in construction.” J. Comput. Civ. Eng. 30 (1): 04014106. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000301.
Tian, D., M. Li, J. Shi, Y. Shen, and S. Han. 2021. “On-site text classification and knowledge mining for large-scale projects construction by integrated intelligent approach.” Adv. Eng. Inf. 49 (Aug): 101355. https://doi.org/10.1016/j.aei.2021.101355.
Tixier, A. J. P., M. R. Hallowell, B. Rajagopalan, and D. Bowman. 2016. “Automated content analysis for construction safety: A natural language processing system to extract precursors and outcomes from unstructured injury reports.” Autom. Constr. 62 (Feb): 45–56. https://doi.org/10.1016/j.autcon.2015.11.001.
Wang, Z., Z. Huang, and J. Gao. 2020a. “Chinese text classification method based on BERT word embedding.” In Proc., 5th Int. Conf. on Mathematics and Artificial Intelligence (ICMAI), 66–71. New York: Association for Computing Machinery. https://doi.org/10.1145/3395260.3395273.
Wang, Z., C.-H. Wu, Q.-B. Li, B. Yan, and K.-F. Zheng. 2020b. “Encoding text information with graph convolutional networks for personality recognition.” Appl. Sci. 10 (12): 4081. https://doi.org/10.3390/app10124081.
Xu, N., L. Ma, Q. Liu, L. Wang, and Y. Deng. 2021a. “An improved text mining approach to extract safety risk factors from construction accident reports.” Saf. Sci. 138 (Jun): 105216. https://doi.org/10.1016/j.ssci.2021.105216.
Xu, N., L. Ma, L. Wang, Y. Deng, and G. Ni. 2021b. “Extracting domain knowledge elements of construction safety management: Rule-based approach using Chinese natural language processing.” J. Manage. Eng. 37 (2): 04021001. https://doi.org/10.1061/(ASCE)ME.1943-5479.0000870.
Zhang, F., H. Fleyeh, X. Wang, and M. Lu. 2019. “Construction site accident analysis using text mining and natural language processing techniques.” Autom. Constr. 99 (Mar): 238–248. https://doi.org/10.1016/j.autcon.2018.12.016.
Zhang, J., L. Zi, Y. Hou, D. Deng, W. Jiang, and M. Wang. 2020. “A C-BiLSTM approach to classify construction accident reports.” Appl. Sci. 10 (17): 5754. https://doi.org/10.3390/app10175754.
Zhong, B., X. Pan, P. E. D. Love, L. Ding, and W. Fang. 2020a. “Deep learning and network analysis: Classifying and visualizing accident narratives in construction.” Autom. Constr. 113 (May): 103089. https://doi.org/10.1016/j.autcon.2020.103089.
Zhong, B., X. Pan, P. E. D. Love, J. Sun, and C. Tao. 2020b. “Hazard analysis: A deep learning and text mining framework for accident prevention.” Adv. Eng. Inf. 46 (Oct): 101152. https://doi.org/10.1016/j.aei.2020.101152.
Zhong, B., X. Xing, H. Luo, Q. Zhou, H. Li, T. Rose, and W. Fang. 2020c. “Deep learning-based extraction of construction procedural constraints from construction regulations.” Adv. Eng. Inf. 43 (Jan): 101003. https://doi.org/10.1016/j.aei.2019.101003.
Zhou, J., J. X. Huang, Q. V. Hu, and L. He. 2020. “SK-GCN: Modeling Syntax and Knowledge via Graph Convolutional Network for aspect-level sentiment classification.” Knowl.-Based Syst. 205 (Oct): 106292. https://doi.org/10.1016/j.knosys.2020.106292.

Information & Authors

Information

Published In

Go to Journal of Construction Engineering and Management
Journal of Construction Engineering and Management
Volume 148Issue 10October 2022

History

Received: Feb 22, 2022
Accepted: May 31, 2022
Published online: Aug 1, 2022
Published in print: Oct 1, 2022
Discussion open until: Jan 1, 2023

Permissions

Request permissions for this article.

ASCE Technical Topics:

Authors

Affiliations

Ph.D. Candidate, State Key Laboratory of Hydraulic Engineering Simulation and Safety, Tianjin Univ., Tianjin 300350, China. Email: [email protected]
Professor, State Key Laboratory of Hydraulic Engineering Simulation and Safety, Tianjin Univ., Tianjin 300350, China (corresponding author). ORCID: https://orcid.org/0000-0002-3010-0892. Email: [email protected]
Postdoctoral Fellow, Dept. of Building and Real Estate, Hong Kong Polytechnic Univ., Hong Kong. Email: [email protected]
Senior Engineer, China Three Gorges Corporation, No. 1 Yuyuantan South Rd., Haidian District, Beijing 100038, China. Email: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

Cited by

  • Identification of Key Safety Risk Factors and Coupling Paths in Mega Construction Projects, Journal of Management in Engineering, 10.1061/JMENEA.MEENG-5926, 40, 4, (2024).
  • Predicting the Cost of Rework in High-Rise Buildings Using Graph Convolutional Networks, Journal of Construction Engineering and Management, 10.1061/JCEMD4.COENG-14739, 150, 8, (2024).
  • Automatically Categorizing Construction Accident Narratives Using the Deep-Learning Model with a Class-Imbalance Treatment Technique, Journal of Construction Engineering and Management, 10.1061/JCEMD4.COENG-14515, 150, 9, (2024).
  • Dispute Classification and Analysis: Deep Learning–Based Text Mining for Construction Contract Management, Journal of Construction Engineering and Management, 10.1061/JCEMD4.COENG-14080, 150, 1, (2024).
  • A Data-Driven Recommendation System for Construction Safety Risk Assessment, Journal of Construction Engineering and Management, 10.1061/JCEMD4.COENG-13437, 149, 12, (2023).
  • Developing a National Data-Driven Construction Safety Management Framework with Interpretable Fatal Accident Prediction, Journal of Construction Engineering and Management, 10.1061/JCEMD4.COENG-12848, 149, 4, (2023).
  • A new hazard event classification model via deep learning and multifractal, Computers in Industry, 10.1016/j.compind.2023.103875, 147, (103875), (2023).
  • Construction Safety Risk Model with Construction Accident Network: A Graph Convolutional Network Approach, Sustainability, 10.3390/su142315906, 14, 23, (15906), (2022).

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share