Named Entity Recognition Algorithm for iBISDS Using Neural Network
Publication: Construction Research Congress 2022
ABSTRACT
Conversational Artificial Intelligence (AI) systems have become more and more popular to provide information support for human daily life. However, the construction industry still lags other industries in developing a conversational AI system to support construction activities. The developed intelligent Building Information Spoken Dialogue System (iBISDS) is a conversational AI system that provides a speech-based virtual assistant for construction personnel with considerable building information to support construction activities. The iBISDS enables construction personnel to use flexible spoken natural language queries instead of detecting exact keywords. To build an iBISDS, it is necessary to understand the intents of natural language queries for building information. This research aims to develop a named entity recognition (NER) algorithm for iBISDS to recognize and classify keywords within natural language queries. A dataset with 2,008 building information-related natural language queries was developed and manually annotated for training and testing. A Neural Network (NN) deep learning method was trained to recognize named entities within natural language queries. After training, the developed NER algorithm was applied to the testing dataset which achieved a precision of 99.74, a recall of 99.87, and an F1-score of 99.81. The preliminary result indicated that the developed NER algorithm can recognize named entities within the natural language queries accurately. This research will facilitate the further development of conversational AI systems in the construction industry.
Get full access to this article
View all available purchase options and get full access to this chapter.
REFERENCES
Akkasi, A., and Varoglu, E. (2017). “Improving Biochemical Named Entity Recognition Using PSO Classifier Selection and Bayesian Combination Methods.” IEEE/ACM Transactions on Computational Biology and Bioinformatics, Institute of Electrical and Electronics Engineers Inc., 14(6), 1327–1338.
Baker, H., Hallowell, M. R., and Tixier, A. J. P. (2020). “Automatically learning construction injury precursors from text.” Automation in Construction, Elsevier B.V., 118, 103145.
Bishop, C. M. (2006). Pattern Recognition and Machine Learning (Information Science and Statistics). Springer-Verlag, Berlin, Heidelberg.
Du, Y., and Zhao, W. (2020). “Named Entity Recognition Method with Word Position.” Proceedings - 2020 International Workshop on Electronic Communication and Artificial Intelligence, IWECAI 2020, 154–159.
Lin, J., Hu, Z., Zhang, J., and Yu, F. (2016). “A Natural-Language-Based Approach to Intelligent Data Retrieval and Representation for Cloud BIM.” Computer-Aided Civil & Infrastructure Engineering, 31(1), 18–33.
Liu, K., and El-Gohary, N. (2017). “Ontology-based semi-supervised conditional random fields for automated information extraction from bridge inspection reports.” Automation in Construction, Elsevier B.V., 81, 313–327.
Moon, S., Chung, S., and Chi, S. (2020). “Bridge Damage Recognition from Inspection Reports Using NER Based on Recurrent Neural Network with Active Learning.” Journal of Performance of Constructed Facilities, 34(6), 04020119.
Moon, S., Lee, G., Chi, S., and Oh, H. (2021). “Automated Construction Specification Review with Named Entity Recognition Using Natural Language Processing.” Journal of Construction Engineering and Management, 147(1), 04020147.
Murphy, K. P. (2012). Machine Learning: A Probabilistic Perspective. The MIT Press.
Nakayama, H., Kubo, T., Kamura, J., Taniguchi, Y., and Liang, X. (2018). “{doccano}: Text Annotation Tool for Human.” <https://github.com/doccano/doccano>.
Park, Y., and Kang, S. (2019). “Natural Language Generation Using Dependency Tree Decoding for Spoken Dialog Systems.” IEEE Access, Institute of Electrical and Electronics Engineers Inc., 7, 7250–7258.
Ramshaw, L. A., and Marcus, M. P. (1995). Text Chunking using Transformation-Based Learning.
Riaz, F., Anwar, M. W., and Muqades, H. (2020). “Maximum Entropy based Urdu Named Entity Recognition.” 2020 International Conference on Engineering and Emerging Technologies, ICEET 2020, Institute of Electrical and Electronics Engineers Inc.
spaCy. (2021). “Industrial-Strength Natural Language Processing.” <https://spacy.io/>(Apr. 6, 2021).
Trivedi, I., and Majhi, S. (2020). “Span level model for the construction of scientific knowledge graph.” Proceedings of the 2020 International Conference on Computing, Communication and Security, ICCCS 2020, Institute of Electrical and Electronics Engineers Inc.
Wang, Z., and Guan, H. (2020). “Research on Named Entity Recognition of Doctor-Patient Question Answering Community Based on BiLSTM-CRF Model.” Proceedings - 2020 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2020, Institute of Electrical and Electronics Engineers Inc., 1641–1644.
Wu, S., Shen, Q., Deng, Y., and Cheng, J. (2019). “Natural-language-based intelligent retrieval engine for BIM object database.” Computers in Industry, Elsevier, 108, 73–88.
Zhong, B., Xing, X., Luo, H., Zhou, Q., Li, H., Rose, T., and Fang, W. (2020). “Deep learning-based extraction of construction procedural constraints from construction regulations.” Advanced Engineering Informatics, Elsevier Ltd, 43.
Zhou, P., and El-Gohary, N. (2017). “Ontology-based automated information extraction from building energy conservation codes.” Automation in Construction, Elsevier B.V., 74, 103–117.
Information & Authors
Information
Published In
History
Published online: Mar 7, 2022
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.