Automating Coordination Efforts for Reviewing Construction Contracts with Multilabel Text Classification
Publication: Journal of Construction Engineering and Management
Volume 148, Issue 6
Abstract
Construction projects involve multiple company departments and disciplines. The departments follow certain rules in implementing a project, also referred to as requirements in a construction contract. Current administration practices do not show which discipline or department is related to any requirement in the contracts. Thus, all departments need to review contract requirements but typically only from their perspective and with minimal communication with one another. In addition to the tendency of this manual process to error, time and money are lost in evaluating irrelevant departmental requirements. This study concentrates on one aspect of contract interpretation, coordination of the contract requirement review. Automating a classification of the contract requirements by relevant departments can increase the efficiency of contract reviews. This study proposes a robust approach to automating contract sentence classification by relevance to the company department. The approach comprises both natural language processing (NLP) and supervised machine learning techniques to train an algorithm. Training data are selected from an internationally and widely used standard form of construction contract. Precision metric results as high as 0.952 and recall metric results as high as 0.786 are acquired by support vector classifiers (SVCs). These are considered sufficient within the context of multilabel classification of construction contract sentences for construction professionals to operate without further training. The developed methodology reduces time spent on contract review, reliably and accurately predicts classification of contract sentences for departmental relevance, and also removes the dependence on expert participation in coordination efforts contract review.
Get full access to this article
View all available purchase options and get full access to this article.
Data Availability Statement
Some or all data, models, or codes that support the findings of this study are available from the corresponding author upon reasonable request.
References
Aggarwal, C., and C. Zhai. 2012. “A survey of text classification algorithms.” In Mining text data, 163–222. New York: Springer.
Al Qady, M., and A. Kandil. 2010. “Concept relation extraction from construction documents using natural language processing.” J. Constr. Eng. Manage. 136 (3): 294–302. https://doi.org/10.1061/(ASCE)CO.1943-7862.0000131.
Bird, S., E. Klein, and E. Loper. 2009. Natural language processing with Python. Cambridge, MA: O’Reilly.
Caldas, C. H., L. Soibelman, and J. Han. 2002. “Automated classification of construction project documents.” J. Comput. Civ. Eng. 16 (4): 234–243. https://doi.org/10.1061/(ASCE)0887-3801(2002)16:4(234).
Catterwell, R. 2020. “Automation in contract interpretation.” Law Innovation Technol. 12 (1): 81–112. https://doi.org/10.1080/17579961.2020.1727068.
FIDIC (International Federation of Consulting Engineers). 1999. Conditions of contract for plant and design-build projects. 1st ed. Geneva: FIDIC.
Gunduz, M., and H. A. Elsherbeny. 2019. “Operational framework for managing construction contract administration practitioners’ perspective through modified Delphi method.” J. Constr. Eng. Manage. 146 (3): 04019110. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001768.
Guo, Z., Y. Hu, and J. Liu. 2014. “The analysis of contractor’s risk clause based on the FIDIC construction contract.” Appl. Mech. Mater. 687–691 (Nov): 4815–4818. https://doi.org/10.4028/www.scientific.net/AMM.687-691.4815.
Joulin, A., E. Grave, P. Bojanowski, and T. Mikolov. 2017. “Bag of tricks for efficient text classification.” In Vol. 2 of Proc., 15th Conf. of the European Chapter of the Association for Computational Linguistics, 427–431. Valencia, Spain: Association for Computational Linguistics.
Kim, T., and S. Chi. 2019. “Accident case retrieval and analyses: Using natural language processing in the construction industry.” J. Constr. Eng. Manage. 145 (3): 04019004. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001625.
Lee, J. H., J. S. Yi, and J. W. Son. 2019. “Development of automatic-extraction model of poisonous clauses in international construction contracts using rule-based NLP.” J. Comput. Civ. Eng. 33 (3): 04019003. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000807.
Madjarov, G., D. Kocev, D. Gjorgjevikj, and S. Džeroski. 2012. “An extensive experimental comparison of methods for multilabel learning.” Pattern Recognit. 45 (9): 3084–3104. https://doi.org/10.1016/j.patcog.2012.03.004.
Moon, S., G. Lee, S. Chi, and H. Oh. 2019. “Automatic review of construction specifications using natural language processing.” In Proc., ASCE Int. Conf. on Computing in Civil Engineering 2019, 401–407. Reston, VA: ASCE. https://doi.org/10.1061/9780784482438.051.
Pawar, Y. P., and S. H. Gawande. 2012. “A comparative study on different types of approaches to text categorization.” Int. J. Mach. Learn. Comput. 2 (4): 423–426. https://doi.org/10.7763/IJMLC.2012.V2.158.
Pedregosa, F., et al. 2011. “Scikit-learn: Machine learning in Python.” J. Mach. Learn. Res. 12: 2825–2830.
Rameezdeen, R., and C. Rajapakse. 2007. “Contract interpretation: The impact of readability.” Construct. Manage. Econ. 25 (7): 729–737. https://doi.org/10.1080/01446190601099228.
Sebastiani, F. 2002. “Machine learning in automated text categorization.” ACM Comput. Surv. 34 (1): 1–47. https://doi.org/10.1145/505282.505283.
Sorower, M. S. 2010. A literature survey on algorithms for multilabel learning. Corvallis, OR: Oregon State Univ.
Spyromitros, E., G. Tsoumakas, and I. Vlahavas. 2008. “An empirical study of lazy multilabel classification algorithms.” In Lecture Notes in Computer Science (LNCS), 401–406. Berlin: Springer.
Thomas, H. R., G. R. Smith, and R. E. Mellott. 1994. “Interpretation of construction contracts.” J. Constr. Eng. Manage. 120 (2): 321–336. https://doi.org/10.1061/(ASCE)0733-9364(1994)120:2(321).
Tsoumakas, G., I. Katakis, and I. Vlahavas. 2010. “Mining multilabel data.” In Data mining and knowledge discovery handbook, 667–685. Berlin: Springer.
Wang, S., and C. D. Manning. 2012. “Baselines and bigrams: Simple, good sentiment and text classification.” In Vol. 2 of Proc., 50th Annual Meeting of the Association for Computational Linguistics, 90–94. Jeju Island, Korea: Association for Computational Linguistics.
Zhang, J., and N. El-Gohary. 2015. “Automated information transformation for automated regulatory compliance checking in construction.” J. Comput. Civ. Eng. 29 (4): B4015001. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000427.
Zhang, J., and N. El-Gohary. 2016. “Semantic NLP-based information extraction from construction regulatory documents for automated compliance checking.” J. Comput. Civ. Eng. 30 (2): 04015014. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000346.
Zhou, P., and N. El-Gohary. 2015. “Domain-specific hierarchical text classification for supporting automated environmental compliance checking.” J. Comput. Civ. Eng. 30 (4): 04015057. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000513.
Information & Authors
Information
Published In
Copyright
© 2022 American Society of Civil Engineers.
History
Received: Mar 30, 2021
Accepted: Jan 14, 2022
Published online: Mar 22, 2022
Published in print: Jun 1, 2022
Discussion open until: Aug 22, 2022
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.
Cited by
- Mariam Elazhary, Ossama Hosny, Automated Management of Time Extension Claims, Journal of Legal Affairs and Dispute Resolution in Engineering and Construction, 10.1061/JLADAH.LADR-1104, 16, 2, (2024).
- Hieu T. T. L. Pham, SangUk Han, Natural Language Processing with Multitask Classification for Semantic Prediction of Risk-Handling Actions in Construction Contracts, Journal of Computing in Civil Engineering, 10.1061/JCCEE5.CPENG-5218, 37, 6, (2023).