Technical Papers
Jul 6, 2022

Semantic Information Extraction of Energy Requirements from Contract Specifications: Dealing with Complex Extraction Tasks

Publication: Journal of Computing in Civil Engineering
Volume 36, Issue 5

Abstract

Automated specification energy compliance checking aims to check the compliance of building designs captured in building information models (BIMs) with energy requirements from contract specifications. However, automated extraction of requirements from specifications to support automated compliance checking is challenging because of the text complexities of the specifications, including hierarchically complex text structures, incomplete sentence structures, and a variety of levels of development (LODs). To address this challenge, this paper proposes a semantic information extraction method to extract building energy requirements automatically from specifications. Specifically, three new submethods are proposed to deal with these complexities: a domain-specific text splitting and stitching method, an incompleteness-aware sequential dependency extraction method, and a detail-aware LOD extraction method. The proposed information extraction method was tested in extracting thermal-insulation and lighting-power requirements from MasterFormat specifications. A performance of 96.8% recall and 97.6% precision was achieved on the testing data, which indicates the effectiveness of the proposed method.

Get full access to this article

View all available purchase options and get full access to this article.

Data Availability Statement

Some or all data, models, or code generated or used during the study are available from the corresponding author by request (extraction rules, tagging rules for domain-specific tags and for recognizing the action verbs, domain-specific POS tagger, and testing data set).

Acknowledgments

The authors thank the National Science Foundation (NSF). This material is based upon work supported by the NSF under Grant No. 1201170. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of NSF.

References

Abuzir, Y., and M. O. Abuzir. 2002. “Constructing the civil engineering thesaurus (CET) using ThesWB.” In Proc., 2002 Int. Workshop on Information Technology in Civil Engineering, 400–412. Reston, VA: ASCE.
Akanbi, T., and J. Zhang. 2020. “Automated design information extraction from construction specifications to support wood construction cost estimation.” In Proc., 2020 Construction Research Congress (CRC), 658–666. Reston, VA: ASCE.
Al Qady, M. A., and A. Kandil. 2010. “Concept relation extraction from construction documents using natural language processing.” J. Constr. Eng. Manage. 136 (3): 294–302. https://doi.org/10.1061/(ASCE)CO.1943-7862.0000131.
Amor, R., and J. Dimyadi. 2021. “The promise of automated compliance checking.” Dev. Built Environ. 5 (Mar): 100039. https://doi.org/10.1016/j.dibe.2020.100039.
Beach, T. H., Y. Rezgui, H. Li, and T. Kasim. 2015. “A rule-based semantic approach for automated regulatory compliance in the construction sector.” Expert Syst. Appl. 42 (12): 5219–5231. https://doi.org/10.1016/j.eswa.2015.02.029.
BIMForum. 2017. “Level of development specification 2017.” Accessed August 19, 2017. https://bim-forum.com/resources/Documents/BIMForum_LOD_2017_reprint.pdf.
Cheng, J. C. P., and M. Das. 2014. “A BIM-based web service framework for green building energy simulation and code checking.” J. Inf. Technol. Construct. 19 (8): 150–168.
Choi, J., J. Choi, and I. Kim. 2014. “Development of BIM-based evacuation regulation checking system for high-rise and complex buildings.” Autom. Constr. 46 (Oct): 38–49. https://doi.org/10.1016/j.autcon.2013.12.005.
CSI and CSC (Construction Specifications Institute and Construction Specifications Canada). 2009. “SectionFormat/PageFormat.” Accessed June 19, 2017. http://icee.usm.edu/dkemp/download/ACT380/Presentations/SupportingDocs-Samples/PDPGDocs/SectionFormat_PageFormat_2009.pdf.
CSI and CSC (Construction Specifications Institute and Construction Specifications Canada). 2014. “MasterFormat—Master list of numbers and titles for the construction industry.” Accessed June 19, 2017. http://lor.rrc.mb.ca/file/9c8519dc-c193-4edc-a4b7-9084929d902a/1/MasterFormat_2014.pdf.
CSI and CSC (Construction Specifications Institute and Construction Specifications Canada). 2017. “SectionFormat/PageFormat.” Accessed June 16, 2021. https://www.csiresources.org/standards/sectionformat-pageformat.
CSI (Construction Specifications Institute). 2004. The project resource manual: CSI manual of practice. 5th ed. New York: McGraw-Hill.
CSI (Construction Specifications Institute). 2021. “Uniformat.” Accessed August 16, 2021. https://www.csiresources.org/standards/uniformat.
Cunningham, H., D. Maynard, and K. Bontcheva. 2011. Text processing with GATE (version 6). Sheffield, UK: Univ. of Sheffield, Dept. of Computer Science.
Dimyadi, J., C. Clifton, M. Spearpoint, and R. Amor. 2016. “Computerizing regulatory knowledge for building engineering design.” J. Comput. Civ. Eng. 30 (5): C4016001. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000572.
Goutte, C., and E. Gaussier. 2005. “A probabilistic interpretation of precision, recall and f-score, with implication for evaluation.” In Proc., European Conf. on Information Retrieval (ECIR’05), 345–359. Berlin: Springer.
Hassan, F., and T. Le. 2020. “Automated requirements identification from construction contract documents using natural language processing.” J. Leg. Aff. Dispute Resolut. Eng. Constr. 12 (2): 04520009. https://doi.org/10.1061/(ASCE)LA.1943-4170.0000379.
Häußler, M., S. Esser, and A. Borrmann. 2021. “Code compliance checking of railway designs by integrating BIM, BPMN and DMN.” Autom. Constr. 121 (Jan): 103427. https://doi.org/10.1016/j.autcon.2020.103427.
İlal, S. M., and H. M. Günaydın. 2017. “Computer representation of building codes for automated compliance checking.” Autom. Constr. 82 (Oct): 43–58. https://doi.org/10.1016/j.autcon.2017.06.018.
Jeon, J., X. Xu, Y. Zhang, L. Yang, and H. Cai. 2021. “Extraction of construction quality requirements from textual specifications via natural language processing.” Transp. Res. Rec. 2675 (9): 222–237. https://doi.org/10.1177/03611981211001385.
Jurafsky, D., and J. H. Martin. 2020. Speech and language processing: An introduction to natural language processing, computational linguistics, and speech recognition. 3rd ed. Upper Saddle River, NJ: Prentice Hall.
Kalin, M., R. S. Weygant, H. J. Rosen, and J. R. Regener. 2010. Construction specifications writing: Principles and procedures. Hoboken, NJ: Wiley.
Khemlani, L. 2005. “CORENET e-PlanCheck: Singapore’s automated code checking system.” AECBytes. Accessed May 10, 2018. https://www.aecbytes.com/feature/2005/CORENETePlanCheck.html.
Kim, H., J.-K. Lee, J. Shin, and J. Choi. 2019. “Visual language approach to representing KBimCode-based Korea building code sentences for automated rule checking.” J. Comput. Des. Eng. 6 (2): 143–148. https://doi.org/10.1016/j.jcde.2018.08.002.
Klebanov, B. B., and E. Beigman. 2009. “From annotator agreement to noise models.” Comput. Ling. 35 (4): 495–503. https://doi.org/10.1162/coli.2009.35.4.35402.
Lee, J., J.-S. Yi, and J. Son. 2019. “Development of automatic-extraction model of poisonous clauses in international construction contracts using rule-based NLP.” J. Comput. Civ. Eng. 33 (3): 04019003. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000807.
Liu, K., and N. El-Gohary. 2017. “Ontology-based semi-supervised conditional random fields for automated information extraction from bridge inspection reports.” Autom. Constr. 81 (Sep): 313–327. https://doi.org/10.1016/j.autcon.2017.02.003.
Manning, C. D., and H. Schutze. 1999. Foundations of statistical natural language processing. Cambridge, MA: MIT Press.
Maynard, D., W. Peters, and Y. Li. 2006. “Metrics for evaluation of ontology-based information extraction.” In Proc., WWW 2006 Workshop on Evaluation of Ontologies for the Web (EON). New York: Association for Computing Machinery.
Moens, M. F. 2006. Information extraction: Algorithms and prospects in a retrieval context. New York: Springer.
Moon, S., G. Lee, S. Chi, and H. Oh. 2021. “Automated construction specification review with named entity recognition using natural language processing.” J. Constr. Eng. Manage. 147 (1): 04020147. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001953.
Moreno, A., D. Isern, and A. C. López Fuentes. 2013. “Ontology-based information extraction of regulatory networks from scientific articles with case studies for Escherichia coli.” Expert Syst. Appl. 40 (8): 3266–3281. https://doi.org/10.1016/j.eswa.2012.12.090.
Nawari, N. O. 2012. “Automating codes conformance.” J. Archit. Eng. 18 (4): 315–323. https://doi.org/10.1061/(ASCE)AE.1943-5568.0000049.
Nawari, N. O. 2020. “Generalized adaptive framework for computerizing the building design review process.” J. Archit. Eng. 26 (1): 04019026. https://doi.org/10.1061/(asce)ae.1943-5568.0000382.
Newcombe, R. G. 1998. “Two-sided confidence intervals for the single proportion: Comparison of seven methods.” Stat. Med. 17 (8): 857–872. https://doi.org/10.1002/(SICI)1097-0258(19980430)17:8%3C857::AID-SIM777%3E3.0.CO;2-E.
Pauwels, P., D. Van Deursen, R. Verstraeten, J. De Roo, R. De Meyer, R. Van de Walle, and J. Van Campenhout. 2011. “A semantic rule checking environment for building performance checking.” Autom. Constr. 20 (5): 506–518. https://doi.org/10.1016/j.autcon.2010.11.017.
Pestian, J. P., L. Deleger, G. K. Savova, J. W. Dexheimer, and I. Solti. 2012. “Natural language processing—The basics.” In Pediatric biomedical informatics, 149–172. Dordrecht, Netherlands: Springer.
Preidel, C., and A. Borrmann. 2015. “Automated code compliance checking based on a visual language and building information modeling.” In Proc., 32nd Int. Symp. of Automation and Robotics in Construction and Mining, International Association for Automation & Robotics in Construction (IAARC), 256–263. London: International Association for Automation and Robotics in Construction.
Qi, J., R. R. A. Issa, S. Olbina, and J. Hinze. 2014. “Use of building information modeling in design to prevent construction worker falls.” J. Comput. Civ. Eng. 28 (5): A4014008. https://doi.org/10.1061/%28ASCE%29CP.1943-5487.0000365.
Robbins, A., L. Lamb, and E. Hannah. 2008. Learning the Vi and Vim editors. 7th ed. Sebastopol, CA: O’Reilly.
SMARTreview. 2021. “SMARTreview.” Accessed June 16, 2021. https://smartreview.biz/home.
SMC (Solibri Model Checker). 2009. “Development of Solibri Model Checker (SMC) accessibility rules set for checking building information models.” Accessed May 17, 2017. http://www.universell-utforming.miljo.no/file_upload/statsbygg_smc_accessibility.pdf.
Solihin, W., and C. Eastman. 2016. “A knowledge representation approach in BIM rule requirement analysis using the conceptual graph.” J. Inf. Technol. Construct. 21 (21): 370–401.
Sydora, C., and E. Stroulia. 2020. “Rule-based compliance checking and generative design for building interiors using BIM.” Autom. Constr. 120 (Dec): 103368. https://doi.org/10.1016/j.autcon.2020.103368.
Tan, X., A. Hammad, and P. Fazio. 2010. “Automated code compliance checking for building envelope design.” J. Comput. Civ. Eng. 24 (2): 203–211. https://doi.org/10.1061/(ASCE)0887-3801(2010)24:2(203).
USDOE. 2018. “Software and web tools.” Accessed May 29, 2018. https://www.energycodes.gov/software-tools.
Vieira, R., and M. Poesio. 2000. “An empirically based system for processing definite descriptions.” Comput. Ling. 26 (4): 538–593. https://doi.org/10.1162/089120100750105948.
Wilson, E. B. 1927. “Probable inference, the law of succession, and statistical inference.” J. Am. Stat. Assoc. 22 (158): 209–212. https://doi.org/10.2307/2276774.
Wimalasuriya, D. C., and D. J. Dou. 2010. “Ontology-based information extraction: An introduction and a survey of current approaches.” J. Inf. Sci. 36 (3): 306–323. https://doi.org/10.1177/0165551509360123.
Xu, N., L. Ma, L. Wang, Y. Deng, and G. Ni. 2021. “Extracting domain knowledge elements of construction safety management: Rule-based approach using Chinese natural language processing.” J. Manage. Eng. 37 (2): 04021001. https://doi.org/10.1061/(ASCE)ME.1943-5479.0000870.
Xue, X. R., and J. Zhang. 2021. “Part-of-speech tagging of building codes empowered by deep learning and transformational rules.” Adv. Eng. Inf. 47 (Jan): 101235. https://doi.org/10.1016/j.aei.2020.101235.
Zhang, J., and N. M. El-Gohary. 2013. “Semantic NLP-based information extraction from construction regulatory documents for automated compliance checking.” J. Comput. Civ. Eng. 30 (2): 04015014. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000346.
Zhang, J., and N. M. El-Gohary. 2017. “Integrating semantic NLP and logic reasoning into a unified system for fully-automated code checking.” Autom. Constr. 73 (Jan): 45–57. https://doi.org/10.1016/j.autcon.2016.08.027.
Zhang, R., and N. El-Gohary. 2021. “A deep neural network-based method for deep information extraction using transfer learning strategies to support automated compliance checking.” Autom. Constr. 132 (Dec): 103834. https://doi.org/10.1016/j.autcon.2021.103834.
Zhong, B. T., L. Y. Ding, H. B. Luo, Y. Zhou, Y. Z. Hu, and H. M. Hu. 2012. “Ontology-based semantic modeling of regulation constraint for automated construction quality compliance checking.” Autom. Constr. 28 (Dec): 58–70. https://doi.org/10.1016/j.autcon.2012.06.006.
Zhong, B. T., X. J. Xing, H. B. Luo, Q. R. Zhou, H. Li, T. Rose, and W. L. Fang. 2020. “Deep learning-based extraction of construction procedural constraints from construction regulations.” Adv. Eng. Inf. 43 (Jan): 101003. https://doi.org/10.1016/j.aei.2019.101003.
Zhou, H., S. Lee, and H. Ying. 2018. “VPL-based code translation for automated compliance checking of building envelope energy efficiency.” In Proc., 2018 ASCE Construction Research Congress (CRC), 1–12. Reston, VA: ASCE.
Zhou, P., and N. El-Gohary. 2016. “Ontology-based multilabel text classification of construction regulatory documents.” J. Comput. Civ. Eng. 30 (4): 04015058. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000530.
Zhou, P., and N. El-Gohary. 2017. “Ontology-based automated information extraction from building energy conservation codes.” Autom. Constr. 74 (Feb): 103–117. https://doi.org/10.1016/j.autcon.2016.09.004.

Information & Authors

Information

Published In

Go to Journal of Computing in Civil Engineering
Journal of Computing in Civil Engineering
Volume 36Issue 5September 2022

History

Received: Jun 24, 2021
Accepted: Oct 27, 2021
Published online: Jul 6, 2022
Published in print: Sep 1, 2022
Discussion open until: Dec 6, 2022

Permissions

Request permissions for this article.

Authors

Affiliations

Assistant Professor, School of Management Science and Engineering, Central Univ. of Finance and Economics, 39 Xue Yuan South Rd., Beijing 100081, China. ORCID: https://orcid.org/0000-0002-4913-6254
Nora El-Gohary [email protected]
Associate Professor, Dept. of Civil and Environmental Engineering, Univ. of Illinois at Urbana-Champaign, 205 N. Mathews Ave., Urbana, IL 61801 (corresponding author). Email: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share