A Novel and Intelligent Safety-Hazard Classification Method with Syntactic and Semantic Features for Large-Scale Construction Projects
Publication: Journal of Construction Engineering and Management
Volume 148, Issue 10
Abstract
To improve the efficiency of safety management, it is important to classify massive and complex construction site safety hazard texts in large-scale projects. High-precision safety hazard text classification is a lengthy and challenging process. Most existing safety hazard text classification methods capture semantic information using machine learning or deep learning, ignoring the syntactic dependency between words. However, syntactic dependency contains rich structural information that is useful to alleviate information loss and enrich text features. To address these issues, this study proposes a graph structure–based hybrid deep learning method to achieve the automatic classification of large-scale project safety hazard texts. The method uses syntactic dependency and Bidirectional Encoder Representation from Transformers to express the syntactic structure and semantic information of text, and a graph structure fusing the syntactic structure and semantic information is constructed to quantify text information. Further, an encoding-decoding mechanism is built using a graph convolutional neural network and bidirectional long short-term memory to address graph structure data and classify safety hazard texts. Our proposed method is used to classify hydraulic engineering construction safety hazard texts, and the classification accuracy reaches 86.56%. Meanwhile, the experimental results demonstrate that our model achieves superior performance compared to existing methods. This proves the ability of our model to capture and analyze text information and verifies the reliability and effectiveness of this method in large-scale project safety hazard management.
Get full access to this article
View all available purchase options and get full access to this article.
Data Availability Statement
Data generated and analyzed during this study are available from the corresponding author by request.
Acknowledgments
This research was supported by the National Natural Science Foundation of China (Grant 52179139) and the Open Fund of Hubei Key Laboratory of Construction and Management in Hydropower Engineering (Grant 2020KSD05).
References
Alam, K. M., N. Siddique, and H. Adeli. 2020. “Dynamic ensemble learning algorithm for neural networks.” Neural Comput. Appl. 32 (12): 8675–8690. https://doi.org/10.1007/s00521-019-04359-7.
Bai, Y., C. Li, Z. Lin, Y. Wu, Y. Miao, Y. Liu, and Y. Xu. 2021. “Efficient data loader for fast sampling-based GNN training on large graphs.” IEEE Trans. Parallel Distrib. Syst. 32 (10): 2541–2556. https://doi.org/10.1109/TPDS.2021.3065737.
Baker, H., M. R. Hallowell, and A. J.-P. Tixier. 2020. “Automatically learning construction injury precursors from text.” Autom. Constr. 118 (Oct): 103145. https://doi.org/10.1016/j.autcon.2020.103145.
Barnes, J., R. Kurtz, S. Oepen, L. Ovrelid, and E. Velldal. 2021. “Structured sentiment analysis as dependency graph parsing.” In Proc., Joint Conf. of 59th Annual Meeting of the Association-for-Computational-Linguistics (ACL)/11th Int. Joint Conf. on Natural Language Processing (IJCNLP)/6th Workshop on Representation Learning for NLP (RepL4NLP), 3387–3402. Stroudsburg, PA: Association for Computational Linguistics.
Chen, S., J. Xi, Y. Chen, and J. Zhao. 2022. “Association mining of near misses in hydropower engineering construction based on convolutional neural network text classification.” Comput. Intell. Neurosci. 2022 (Jan): 1–16. https://doi.org/10.1155/2022/4851615.
Cheng, M.-Y., D. Kusoemo, and R. A. Gosno. 2020. “Text mining-based construction site accident classification using hybrid supervised machine learning.” Autom. Constr. 118 (Oct): 103265. https://doi.org/10.1016/j.autcon.2020.103265.
Chi, N.-W., K.-Y. Lin, N. El-Gohary, and S.-H. Hsieh. 2016. “Evaluating the strength of text classification categories for supporting construction field inspection.” Autom. Constr. 64 (Apr): 78–88. https://doi.org/10.1016/j.autcon.2016.01.001.
Chokor, A., H. Naganathan, W. K. Chong, and M. El Asmar. 2016. “Analyzing Arizona OSHA injury reports using unsupervised machine learning.” Procedia Eng. 145 (Jan): 1588–1593. https://doi.org/10.1016/j.proeng.2016.04.200.
Devlin, J., M. W. Chang, K. Lee, and K. Toutanova. 2018. “Bert: Pre-training of deep bidirectional transformers for language understanding.” Preprint, submitted October 11, 2018. https://arxiv.org/abs/1810.04805.
Ding, L. Y., and H. Li. 2013. “Information technologies in safety management of large-scale infrastructure projects.” Autom. Constr. 34 (Sep): 1–2. https://doi.org/10.1016/j.autcon.2012.10.016.
Fang, W., H. Luo, S. Xu, P. E. D. Love, Z. Lu, and C. Ye. 2020. “Automated text classification of near-misses from safety reports: An improved deep learning approach.” Adv. Eng. Inf. 44 (Apr): 101060. https://doi.org/10.1016/j.aei.2020.101060.
Feng, D., and H. Chen. 2021. “A small samples training framework for deep Learning-based automatic information extraction: Case study of construction accident news reports analysis.” Adv. Eng. Inf. 47 (Jan): 101256. https://doi.org/10.1016/j.aei.2021.101256.
Gao, W., and H. Huang. 2021. “A gating context-aware text classification model with BERT and graph convolutional networks.” J. Intell. Fuzzy Syst. 40 (3): 4331–4343. https://doi.org/10.3233/JIFS-201051.
Goh, Y. M., and C. U. Ubeynarayana. 2017. “Construction accident narrative classification: An evaluation of text mining techniques.” Accid. Anal. Prev. 108 (Nov): 122–130. https://doi.org/10.1016/j.aap.2017.08.026.
Guo, Q., X. Qiu, X. Xue, and Z. Zhang. 2021. “Syntax-guided text generation via graph neural network.” Sci. China Inf. Sci. 64 (5): 1–10. https://doi.org/10.1007/s11432-019-2740-1.
Han, Y., Y. Diao, Z. Yin, R. Jin, J. Kangwa, and O. J. Ebohon. 2021. “Immersive technology-driven investigations on influence factors of cognitive load incurred in construction site hazard recognition, analysis and decision making.” Adv. Eng. Inf. 48 (Apr): 101298. https://doi.org/10.1016/j.aei.2021.101298.
Hong, Y., H. Xie, G. Bhumbra, and I. Brilakis. 2021. “Comparing natural language processing methods to cluster construction schedules.” J. Constr. Eng. Manage. 147 (10): 04021136. https://doi.org/10.1061/(ASCE)CO.1943-7862.0002165.
Hu, G., G. Lu, and Y. Zhao. 2021. “FSS-GCN: A graph convolutional networks with fusion of semantic and structure for emotion cause analysis.” Knowl.-Based Syst. 212 (Jan): 106584. https://doi.org/10.1016/j.knosys.2020.106584.
Kim, T., and S. Chi. 2019. “Accident case retrieval and analyses: Using natural language processing in the construction industry.” J. Constr. Eng. Manage. 145 (3): 04019004. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001625.
Ko, T., and H. D. Jeong. 2020. “Syntactic approach to extracting key elements of work modification cause in change-order documents.” In Proc., Construction Research Congress (CRC) on Construction Research and Innovation to Transform Society, 134–142. Tucson: Construct Res Council.
Lecun, Y., Y. Bengio, and G. Hinton. 2015. “Deep learning.” Nature 521 (7553): 436–444. https://doi.org/10.1038/nature14539.
Li, R., L. Wang, Z. Jiang, D. Liu, M. Zhao, and X. Lu. 2021a. “Incremental BERT with commonsense representations for multi-choice reading comprehension.” Multimedia Tools Appl. 80 (21–23): 32311–32333. https://doi.org/10.1007/s11042-021-11197-0.
Li, X., M. Cui, J. Li, R. Bai, Z. Lu, and U. Aickelin. 2021b. “A hybrid medical text classification framework: Integrating attentive rule construction and neural network.” Neurocomputing 443 (Jul): 345–355. https://doi.org/10.1016/j.neucom.2021.02.069.
Lin, J.-R., Z.-Z. Hu, J.-L. Li, and L.-M. Chen. 2020. “Understanding on-site inspection of construction projects based on keyword extraction and topic modeling.” IEEE Access 8 (Nov): 198503–198517. https://doi.org/10.1109/ACCESS.2020.3035214.
Lin, P., P. Wei, Q. Fan, and W. Chen. 2019. “CNN model for mining safety hazard data from a construction site.” [In Chinese.] J. Tsinghua Univ. 59 (8): 628–634. https://doi.org/10.16511/j.cnki.qhdxxb.2019.26.008.
Liu, J., Z. S. Y. Wong, H.-Y. So, and K. L. Tsui. 2021. “Evaluating resampling methods and structured features to improve fall incident report identification by the severity level.” J. Am. Med. Inf. Assoc. 28 (8): 1756–1764. https://doi.org/10.1093/jamia/ocab048.
Lu, J., J. Xuan, G. Zhang, and X. Luo. 2018. “Structural property-aware multilayer network embedding for latent factor analysis.” Pattern Recognit. 76 (Apr): 228–241. https://doi.org/10.1016/j.patcog.2017.11.004.
Park, C., J. Park, and S. Park. 2020. “AGCN: Attention-based graph convolutional networks for drug-drug interaction extraction.” Expert Syst. Appl. 159 (Nov): 113538. https://doi.org/10.1016/j.eswa.2020.113538.
Qiu, Z., Q. Liu, X. Li, J. Zhang, and Y. Zhang. 2021. “Construction and analysis of a coal mine accident causation network based on text mining.” Process Saf. Environ. Prot. 153 (Sep): 320–328. https://doi.org/10.1016/j.psep.2021.07.032.
Ren, Q., M. Li, H. Li, and Y. Shen. 2021. “A novel deep learning prediction model for concrete dam displacements using interpretable mixed attention mechanism.” Adv. Eng. Inf. 50 (Oct): 101407. https://doi.org/10.1016/j.aei.2021.101407.
Salama, D. M., and N. M. El-Gohary. 2016. “Semantic text classification for supporting automated compliance checking in construction.” J. Comput. Civ. Eng. 30 (1): 04014106. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000301.
Tian, D., M. Li, J. Shi, Y. Shen, and S. Han. 2021. “On-site text classification and knowledge mining for large-scale projects construction by integrated intelligent approach.” Adv. Eng. Inf. 49 (Aug): 101355. https://doi.org/10.1016/j.aei.2021.101355.
Tixier, A. J. P., M. R. Hallowell, B. Rajagopalan, and D. Bowman. 2016. “Automated content analysis for construction safety: A natural language processing system to extract precursors and outcomes from unstructured injury reports.” Autom. Constr. 62 (Feb): 45–56. https://doi.org/10.1016/j.autcon.2015.11.001.
Wang, Z., Z. Huang, and J. Gao. 2020a. “Chinese text classification method based on BERT word embedding.” In Proc., 5th Int. Conf. on Mathematics and Artificial Intelligence (ICMAI), 66–71. New York: Association for Computing Machinery. https://doi.org/10.1145/3395260.3395273.
Wang, Z., C.-H. Wu, Q.-B. Li, B. Yan, and K.-F. Zheng. 2020b. “Encoding text information with graph convolutional networks for personality recognition.” Appl. Sci. 10 (12): 4081. https://doi.org/10.3390/app10124081.
Xu, N., L. Ma, Q. Liu, L. Wang, and Y. Deng. 2021a. “An improved text mining approach to extract safety risk factors from construction accident reports.” Saf. Sci. 138 (Jun): 105216. https://doi.org/10.1016/j.ssci.2021.105216.
Xu, N., L. Ma, L. Wang, Y. Deng, and G. Ni. 2021b. “Extracting domain knowledge elements of construction safety management: Rule-based approach using Chinese natural language processing.” J. Manage. Eng. 37 (2): 04021001. https://doi.org/10.1061/(ASCE)ME.1943-5479.0000870.
Zhang, F., H. Fleyeh, X. Wang, and M. Lu. 2019. “Construction site accident analysis using text mining and natural language processing techniques.” Autom. Constr. 99 (Mar): 238–248. https://doi.org/10.1016/j.autcon.2018.12.016.
Zhang, J., L. Zi, Y. Hou, D. Deng, W. Jiang, and M. Wang. 2020. “A C-BiLSTM approach to classify construction accident reports.” Appl. Sci. 10 (17): 5754. https://doi.org/10.3390/app10175754.
Zhong, B., X. Pan, P. E. D. Love, L. Ding, and W. Fang. 2020a. “Deep learning and network analysis: Classifying and visualizing accident narratives in construction.” Autom. Constr. 113 (May): 103089. https://doi.org/10.1016/j.autcon.2020.103089.
Zhong, B., X. Pan, P. E. D. Love, J. Sun, and C. Tao. 2020b. “Hazard analysis: A deep learning and text mining framework for accident prevention.” Adv. Eng. Inf. 46 (Oct): 101152. https://doi.org/10.1016/j.aei.2020.101152.
Zhong, B., X. Xing, H. Luo, Q. Zhou, H. Li, T. Rose, and W. Fang. 2020c. “Deep learning-based extraction of construction procedural constraints from construction regulations.” Adv. Eng. Inf. 43 (Jan): 101003. https://doi.org/10.1016/j.aei.2019.101003.
Zhou, J., J. X. Huang, Q. V. Hu, and L. He. 2020. “SK-GCN: Modeling Syntax and Knowledge via Graph Convolutional Network for aspect-level sentiment classification.” Knowl.-Based Syst. 205 (Oct): 106292. https://doi.org/10.1016/j.knosys.2020.106292.
Information & Authors
Information
Published In
Copyright
© 2022 American Society of Civil Engineers.
History
Received: Feb 22, 2022
Accepted: May 31, 2022
Published online: Aug 1, 2022
Published in print: Oct 1, 2022
Discussion open until: Jan 1, 2023
ASCE Technical Topics:
- Business management
- Construction engineering
- Construction management
- Construction methods
- Construction sites
- Design (by type)
- Disaster risk management
- Engineering fundamentals
- Management methods
- Occupational safety
- Practice and Profession
- Public administration
- Public health and safety
- Risk management
- Safety
- Structural design
- Structural safety
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.
Cited by
- Pengcheng Xiang, Yingliu Yang, Kesheng Yan, Lianghai Jin, Identification of Key Safety Risk Factors and Coupling Paths in Mega Construction Projects, Journal of Management in Engineering, 10.1061/JMENEA.MEENG-5926, 40, 4, (2024).
- Fatemeh Mostofi, Onur Behzat Tokdemir, Vedat Toğan, David Arditi, Predicting the Cost of Rework in High-Rise Buildings Using Graph Convolutional Networks, Journal of Construction Engineering and Management, 10.1061/JCEMD4.COENG-14739, 150, 8, (2024).
- Qing Shuang, Xishan Liu, Zhaojing Wang, Xinxin Xu, Automatically Categorizing Construction Accident Narratives Using the Deep-Learning Model with a Class-Imbalance Treatment Technique, Journal of Construction Engineering and Management, 10.1061/JCEMD4.COENG-14515, 150, 9, (2024).
- Botao Zhong, Luoxin Shen, Xing Pan, Xueyan Zhong, Wanlei He, Dispute Classification and Analysis: Deep Learning–Based Text Mining for Construction Contract Management, Journal of Construction Engineering and Management, 10.1061/JCEMD4.COENG-14080, 150, 1, (2024).
- Fatemeh Mostofi, Vedat Toğan, A Data-Driven Recommendation System for Construction Safety Risk Assessment, Journal of Construction Engineering and Management, 10.1061/JCEMD4.COENG-13437, 149, 12, (2023).
- Kerim Koc, Ömer Ekmekcioğlu, Asli Pelin Gurgun, Developing a National Data-Driven Construction Safety Management Framework with Interpretable Fatal Accident Prediction, Journal of Construction Engineering and Management, 10.1061/JCEMD4.COENG-12848, 149, 4, (2023).
- Zhenhua Wang, Bin Wang, Ming Ren, Dong Gao, A new hazard event classification model via deep learning and multifractal, Computers in Industry, 10.1016/j.compind.2023.103875, 147, (103875), (2023).
- Fatemeh Mostofi, Vedat Toğan, Yunus Emre Ayözen, Onur Behzat Tokdemir, Construction Safety Risk Model with Construction Accident Network: A Graph Convolutional Network Approach, Sustainability, 10.3390/su142315906, 14, 23, (15906), (2022).