Technical Papers
Jun 30, 2022

Hierarchical Representation and Deep Learning–Based Method for Automatically Transforming Textual Building Codes into Semantic Computable Requirements

Publication: Journal of Computing in Civil Engineering
Volume 36, Issue 5

Abstract

Most of the existing automated compliance checking (ACC) systems are unable to fully automatically convert building-code requirements, especially requirements that have hierarchically complex semantic and syntactic structures, into computer-processable forms. The state-of-the-art rule-based ACC methods that are able to deal with complex requirements are based on information extraction and transformation rules, which are inflexible when applied to different types of regulatory documents. More research is thus needed to develop a flexible method to automatically process and understand requirements to support the downstream tasks in ACC systems, such as information matching and compliance reasoning. To address this need, this paper proposes (1) a new representation of requirements, the requirement hierarchy, and (2) a deep learning-based method to automatically extract semantic relations between words from building-code sentences, which are used to transform the sentences into such hierarchies. The proposed method was evaluated using a corpus of sentences from multiple regulatory documents. It achieved high semantic relation and requirement hierarchy extraction performance.

Get full access to this article

View all available purchase options and get full access to this article.

Data Availability Statement

Some or all data, models, or code generated or used during the study are available from the corresponding author by request (trained semantic relation extraction model and labeled gold standard data).

Acknowledgments

The authors would like to thank the National Science Foundation (NSF). This material is based on work supported by the NSF under Grant No. 1827733. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the NSF.

References

Akanbi, T., and J. Zhang. 2021. “Design information extraction from construction specifications to support cost estimation.” Autom. Constr. 131 (Nov): 103835. https://doi.org/10.1016/j.autcon.2021.103835.
Al Qady, M., and A. Kandil. 2010. “Concept relation extraction from construction documents using natural language processing.” J. Constr. Eng. Manage. 136 (3): 294–302. https://doi.org/10.1061/(ASCE)CO.1943-7862.0000131.
Beach, T. H., J.-L. Hippolyte, and Y. Rezgui. 2020. “Towards the adoption of automated regulatory compliance checking in the built environment.” Autom. Constr. 118 (Oct): 103285. https://doi.org/10.1016/j.autcon.2020.103285.
Berthelot, D., N. Carlini, I. Goodfellow, N. Papernot, A. Oliver, and C. Raffel. 2019. “Mixmatch: A holistic approach to semi-supervised learning.” Preprint, submitted May 6, 2019. https://arxiv.org/abs/1905.02249.
Bloch, T., and R. Sacks. 2020. “Clustering information types for semantic enrichment of building information models to support automated code compliance checking.” J. Comput. Civ. Eng. 34 (6): 04020040. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000922.
Buchholz, S., and E. Marsi. 2006. “CoNLL-X shared task on multilingual dependency parsing.” In Proc., 10th Conf. on Computational Natural Language Learning (CoNLL-X), 149–164. Stroudsburg, PA: Association for Computational Linguistics.
Chelba, C., T. Mikolov, M. Schuster, Q. Ge, T. Brants, P. Koehn, and T. Robinson. 2013. “One billion word benchmark for measuring progress in statistical language modeling.” Preprint, submitted December 11, 2013. https://arxiv.org/abs/1312.3005.
Chen, D., and C. D. Manning. 2014. “A fast and accurate dependency parser using neural networks.” In Proc., 2014 Conf. on Empirical Methods in Natural Language Processing (EMNLP), 740–750. Stroudsburg, PA: Association for Computational Linguistics.
Chen, T., S. Kornblith, K. Swersky, M. Norouzi, and G. Hinton. 2020. “Big self-supervised models are strong semi-supervised learners.” Preprint, submitted June 17, 2020. https://arxiv.org/abs/2006.10029.
Cho, K., B. van Merriënboer, C. Gulcehre, D. Bahdanau, F. Bougares, H. Schwenk, and Y. Bengio. 2014. “Learning phrase representations using RNN encoder-decoder for statistical machine translation.” Preprint, submitted June 3, 2014. https://arxiv.org/abs/1406.1078.
Clark, K., M.-T. Luong, C. D. Manning, and Q. V. Le. 2018. “Semi-supervised sequence modeling with cross-view training.” Preprint, submitted September 22, 2018. https://arxiv.org/abs/1809.08370.
Clark, V., and J. Creswell. 2008. The mixed methods readers. Thousand Oaks, CA: SAGE.
Dimyadi, J., P. Pauwels, and R. Amor. 2016. “Modelling and accessing regulatory knowledge for computer-assisted compliance audit.” J. Inf. Technol. Construct. 21: 317–336.
Dozat, T., and C. D. Manning. 2016. “Deep biaffine attention for neural dependency parsing.” Preprint, submitted November 6, 2016. https://arxiv.org/abs/1611.01734.
Farahmand, S., T. Riley, and K. Zarringhalam. 2020. “ModEx: A text mining system for extracting mode of regulation of transcription factor-gene regulatory interaction.” J. Biomed. Inf. 102 (Feb): 103353. https://doi.org/10.1016/j.jbi.2019.103353.
Greff, K., R. K. Srivastava, J. Koutník, B. R. Steunebrink, and J. Schmidhuber. 2016. “LSTM: A search space odyssey.” IEEE Trans. Neural Networks Learn. Syst. 28 (10): 2222–2232. https://doi.org/10.1109/TNNLS.2016.2582924.
Häußler, M., S. Esser, and A. Borrmann. 2021. “Code compliance checking of railway designs by integrating BIM, BPMN and DMN.” Autom. Constr. 121 (Jan): 103427. https://doi.org/10.1016/j.autcon.2020.103427.
Hjelseth, E., and N. Nisbet. 2011. “Exploring semantic based model checking.” Accessed January 1, 2021. https://www.irbnet.de/daten/iconda/CIB21890.pdf.
Hu, X., C. Zhang, F. Ma, C. Liu, L. Wen, and P. S. Yu. 2020. “Semi-supervised relation extraction via incremental meta self-training.” Preprint, submitted October 6, 2020. https://arxiv.org/abs/2010.16410.
ICC (International Code Council). 2018. 2018 International Building Code. Washington, DC: ICC.
Kim, H., J.-K. Lee, J. Shin, and J. Choi. 2019. “Visual language approach to representing KBimCode-based Korea building code sentences for automated rule checking.” J. Comput. Des. Eng. 6 (2): 143–148. https://doi.org/10.1016/j.jcde.2018.08.002.
Lample, G., M. Ballesteros, S. Subramanian, K. Kawakami, and C. Dyer. 2016. “Neural architectures for named entity recognition.” Preprint, submitted March 4, 2016. https://arxiv.org/abs/1603.01360.
LeCun, Y., Y. Bengio, and G. Hinton. 2015. “Deep learning.” Nature 521 (7553): 436–444. https://doi.org/10.1038/nature14539.
Lee, J.-K., C. M. Eastman, and Y. C. Lee. 2015. “Implementation of a BIM domain-specific language for the building environment rule and analysis.” J. Intell. Rob. Syst. 79 (3): 507–522. https://doi.org/10.1007/s10846-014-0117-7.
Li, S., H. Cai, and V. R. Kamat. 2016. “Integrating natural language processing and spatial reasoning for utility compliance checking.” J. Constr. Eng. Manage. 142 (12): 04016074. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001199.
Liu, K., and N. El-Gohary. 2017. “Ontology-based semi-supervised conditional random fields for automated information extraction from bridge inspection reports.” Autom. Constr. 81 (Sep): 313–327. https://doi.org/10.1016/j.autcon.2017.02.003.
Liu, K., and N. El-Gohary. 2021. “Semantic neural network ensemble for automated dependency relation extraction from bridge inspection reports.” J. Comput. Civ. Eng. 35 (4): 04021007. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000961.
Marzouk, M., and M. Enaba. 2019. “Text analytics to analyze and monitor construction project contract and correspondence.” Autom. Constr. 98 (Feb): 265–274. https://doi.org/10.1016/j.autcon.2018.11.018.
Miyato, T., S.-I. Maeda, M. Koyama, and S. Ishii. 2018. “Virtual adversarial training: A regularization method for supervised and semi-supervised learning.” IEEE Trans. Pattern Anal. Mach. Intell. 41 (8): 1979–1993. https://doi.org/10.1109/TPAMI.2018.2858821.
Naganathan, H., W. O. Chong, and X. Chen. 2016. “Building energy modeling (BEM) using clustering algorithms and semi-supervised machine learning approaches.” Autom. Constr. 72 (Dec): 187–194. https://doi.org/10.1016/j.autcon.2016.08.002.
Nawari, N. O. 2019. “A generalized adaptive framework (GAF) for automating code compliance checking.” Buildings 9 (4): 86. https://doi.org/10.3390/buildings9040086.
Pan, Y., and L. Zhang. 2020. “BIM log mining: Learning and predicting design commands.” Autom. Constr. 112 (Apr): 103107. https://doi.org/10.1016/j.autcon.2020.103107.
Pennington, J., R. Socher, and C. D. Manning. 2014. “GloVe: Global vectors for word representation.” In Proc., 2014 Conf. on Empirical Methods in Natural Language Processing (EMNLP), 1532–1543. Stroudsburg, PA: Association for Computational Linguistics. https://doi.org/10.3115/v1/D14-1162.
Pestian, J. P., L. Deleger, G. K. Savova, J. W. Dexheimer, and I. Solti. 2012. “Natural language processing—The basics.” In Pediatric Biomedical Informatics, 149–172. Dordrecht, Netherlands: Springer.
Preidel, C., and A. Borrmann. 2016. “Towards code compliance checking on the basis of a visual programming language.” J. Inf. Technol. Construct. 25: 402–421.
Ratner, A., S. H. Bach, H. Ehrenberg, J. Fries, S. Wu, and C. Ré. 2020. “Snorkel: Rapid training data creation with weak supervision.” VLDB J. 29 (2): 709–730. https://doi.org/10.1007/s00778-019-00552-1.
Ravikumar, K. E., M. Rastegar-Mojarad, and H. Liu. 2017. “BELMiner: Adapting a rule-based relation extraction system to extract biological expression language statements from bio-medical literature evidence sentences.” Database 2017 (1): baw156. https://doi.org/10.1093/database/baw156.
Sacks, R., M. Girolami, and I. Brilakis. 2020. “Building information modelling, artificial intelligence and construction tech.” Dev. Built Environ. 4 (Nov): 100011. https://doi.org/10.1016/j.dibe.2020.100011.
Sohn, K., D. Berthelot, C.-L. Li, Z. Zhang, N. Carlini, E. D. Cubuk, A. Kurakin, H. Zhang, and C. Raffel. 2020. “FixMatch: Simplifying semi-supervised learning with consistency and confidence.” Preprint, submitted January 21, 2020. https://arxiv.org/abs/2001.07685.
Solihin, W., J. Dimyadi, Y. C. Lee, C. Eastman, and R. Amor. 2020. “Simplified schema queries for supporting BIM-based rule-checking applications.” Autom. Constr. 117 (Sep): 103248. https://doi.org/10.1016/j.autcon.2020.103248.
Solihin, W., and C. Eastman. 2015. “Classification of rules for automated BIM rule checking development.” Autom. Constr. 53 (May): 69–82. https://doi.org/10.1016/j.autcon.2015.03.003.
Solihin, W., and C. M. Eastman. 2016. “A knowledge representation approach in BIM rule requirement analysis using the conceptual graph.” J. Inf. Technol. Construct. 21: 370–401.
Sutskever, I., O. Vinyals, and Q. V. Le. 2014. “Sequence to sequence learning with neural networks.” In Advances in neural information processing systems, 3104–3112. Cambridge, MA: MIT Press.
Sydora, C., and E. Stroulia. 2020. “Rule-based compliance checking and generative design for building interiors using BIM.” Autom. Constr. 120 (Dec): 103368. https://doi.org/10.1016/j.autcon.2020.103368.
Wang, H., M. Chen, H. Zhang, and D. Roth. 2020. “Joint constrained learning for event-event relation extraction.” Preprint, submitted October 13, 2020. https://arxiv.org/abs/2010.06727.
Xu, X., and H. Cai. 2020. “Semantic approach to compliance checking of underground utilities.” Autom. Constr. 109 (Jan): 103006. https://doi.org/10.1016/j.autcon.2019.103006.
Xue, X., and J. Zhang. 2020. “Building codes part-of-speech tagging performance improvement by error-driven transformational rules.” J. Comput. Civ. Eng. 34 (5): 04020035. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000917.
Yang, K., C. R. Ahn, M. C. Vuran, and S. S. Aria. 2016. “Semi-supervised near-miss fall detection for ironworkers with a wearable inertial measurement unit.” Autom. Constr. 68 (Aug): 194–202. https://doi.org/10.1016/j.autcon.2016.04.007.
Ye, Z.-X., and Z.-H. Ling. 2019. “Distant supervision relation extraction with intra-bag and inter-bag attentions.” Preprint, submitted March 30, 2019. https://arxiv.org/abs/1904.00143.
Yurchyshyna, A., and A. Zarli. 2009. “An ontology-based approach for formalisation and semantic organisation of conformance requirements in construction.” Autom. Constr. 18 (8): 1084–1098. https://doi.org/10.1016/j.autcon.2009.07.008.
Zhai, C. X., and S. Massung. 2016. Text data management and analysis: A practical introduction to information retrieval and text mining. New York: ACM.
Zhang, J., and N. M. El-Gohary. 2013. “Semantic NLP-based information extraction from construction regulatory documents for automated compliance checking.” J. Comput. Civ. Eng. 30 (2): 04015014. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000346.
Zhang, J., and N. M. El-Gohary. 2015. “Automated information transformation for automated regulatory compliance checking in construction.” J. Comput. Civ. Eng. 29 (4): B4015001. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000427.
Zhang, R., and N. El-Gohary. 2020. “A deep-learning method for evaluating semantically-rich building code annotations.” In Proc., EG-ICE 2020 Workshop. Berlin: Universitatsverlag der TU Berlin.
Zhang, R., and N. El-Gohary. 2021a. “Clustering-based approach for building code computability analysis.” J. Comput. Civ. Eng. 35 (6): 04021021. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000967.
Zhang, R., and N. El-Gohary. 2021b. “A deep neural network-based method for deep information extraction using transfer learning strategies to support automated compliance checking.” Autom. Constr. 132 (Dec): 103834. https://doi.org/10.1016/j.autcon.2021.103834.
Zhou, P., and N. El-Gohary. 2017. “Ontology-based automated information extraction from building energy conservation codes.” Autom. Constr. 74 (Feb): 103–117. https://doi.org/10.1016/j.autcon.2016.09.004.

Information & Authors

Information

Published In

Go to Journal of Computing in Civil Engineering
Journal of Computing in Civil Engineering
Volume 36Issue 5September 2022

History

Received: Jun 24, 2021
Accepted: Nov 23, 2021
Published online: Jun 30, 2022
Published in print: Sep 1, 2022
Discussion open until: Nov 30, 2022

Permissions

Request permissions for this article.

Authors

Affiliations

Ruichuan Zhang, S.M.ASCE [email protected]
Graduate Student, Dept. of Civil and Environmental Engineering, Univ. of Illinois at Urbana–Champaign, 205 N. Mathews Ave., Urbana, IL 61801. Email: [email protected]
Nora El-Gohary, A.M.ASCE [email protected]
Associate Professor, Dept. of Civil and Environmental Engineering, Univ. of Illinois at Urbana–Champaign, 205 N. Mathews Ave., Urbana, IL 61801 (corresponding author). Email: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

Cited by

  • Information Integration of Regulation Texts and Tables for Automated Construction Safety Knowledge Mapping, Journal of Construction Engineering and Management, 10.1061/JCEMD4.COENG-14436, 150, 5, (2024).
  • Natural Language Processing with Multitask Classification for Semantic Prediction of Risk-Handling Actions in Construction Contracts, Journal of Computing in Civil Engineering, 10.1061/JCCEE5.CPENG-5218, 37, 6, (2023).
  • Deep Learning–Based Named Entity Recognition and Resolution of Referential Ambiguities for Enhanced Information Extraction from Construction Safety Regulations, Journal of Computing in Civil Engineering, 10.1061/(ASCE)CP.1943-5487.0001064, 37, 5, (2023).
  • Deep learning-based relation extraction and knowledge graph-based representation of construction safety requirements, Automation in Construction, 10.1016/j.autcon.2022.104696, 147, (104696), (2023).
  • Transformer-based approach for automated context-aware IFC-regulation semantic information alignment, Automation in Construction, 10.1016/j.autcon.2022.104540, 145, (104540), (2023).

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share