Technical Papers
Aug 27, 2021

Natural Language Processing–Driven Model to Extract Contract Change Reasons and Altered Work Items for Advanced Retrieval of Change Orders

Publication: Journal of Construction Engineering and Management
Volume 147, Issue 11

Abstract

Change orders are documents that describe a specific contract amendment to the original scope of work. Historical change orders are invaluable information sources that can provide practical and proven solutions for developing new change orders from similar cases. However, current change order management systems are not efficient in searching for and finding the most related and similar change orders due to inherent weaknesses in current archiving and search processes, such as keyword-based or reason code–based search. This study proposes and develops a natural language processing (NLP)–driven model that can significantly improve the accuracy and reliability of searching cases by restructuring how each change order’s information is stored and retrieved in change order management systems. The NLP-driven model proposed in this study can automatically detect change reasons and altered work items through text representation pattern analysis and training. The proposed model applies semantic frames to define essential semantic components and determines syntactic features for text representation pattern analysis. The model also utilizes a conditional random field (CRF) classifier, which can consider contexts in sequential texts at the model training stage. The proposed model can significantly improve the accuracy and relevancy of the search process to find the most similar cases by allowing context-driven classification, archiving, and retrieval of change orders.

Get full access to this article

View all available purchase options and get full access to this article.

Data Availability Statement

The data used during the study were provided by a third party. Direct requests for these materials may be made to the provider as indicated in the acknowledgments. Some models or codes that support the findings of this study are available from the corresponding author upon reasonable request.

Acknowledgments

The authors would like to acknowledge that the Oklahoma DOT provided the historical change order data for this study. This work was financially supported by an Institute for Information & Communications Technology Promotion (IITP) grant funded by the Korean Government (MSIT) (No. 2019-0-01559-001), Digitalizing Construction Project Requirements Using Artificial Intelligence and Natural Language Processing.

References

Al Qady, M., and A. Kandil. 2013. “Document management in construction: Practices and opinions.” J. Constr. Eng. Manage. 139 (10): 06013002. https://doi.org/10.1061/(ASCE)CO.1943-7862.0000741.
Arif, M., C. Egbu, O. Alom, and M. M. Khalfan. 2009. “Measuring knowledge retention: A case study of a construction consultancy in the UAE.” Eng. Constr. Archit. Manage. 16 (1): 92–108. https://doi.org/10.1108/09699980910927912/.
Baker, C. F., C. J. Fillmore, and B. Cronin. 2003. “The structure of the FrameNet database.” Int. J. Lexicogr. 16 (3): 281–296. https://doi.org/10.1093/ijl/16.3.281.
Baker, C. F., C. J. Fillmore, and J. B. Lowe. 1998. “The Berkeley Framenet project.” In Vol. 1 of Proc., 36th Annual Meeting of the Association for Computational Linguistics and 17th Int. Conf. on Computational Linguistics, 86–90. Stroudsburg, PA: Association for Computational Linguistics.
Bunescu, R., and R. Mooney. 2005. “A shortest path dependency kernel for relation extraction.” In Proc., Human Language Technology Conf. and Conf. on Empirical Methods in Natural Language Processing, 724–731. Stroudsburg, PA: Association for Computational Linguistics.
Caltrans. 2019. Construction manual. Sacramento, CA: California DOT.
Camacho-Collados, J., and R. Navigli. 2017. “BabelDomains: Large-scale domain labeling of lexical resources.” In Proc., 15th Conf. of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, 223–228. Stroudsburg, PA: Association for Computational Linguistics.
Chen, J.-H. 2008. “KNN based knowledge-sharing model for severe change order disputes in construction.” Autom. Constr. 17 (6): 773–779. https://doi.org/10.1016/j.autcon.2008.02.005.
Desai, N., J. Pitroda, and J. Bhavsar. 2015. “A review of change order and assessing causes affecting change order in construction.” Int. Acad. Res. Multidiscip. 2 (12): 152–162.
El-adaway, I., S. Fawzy, T. Allard, and A. Runnels. 2016. “Change order provisions under national and international standard forms of contract.” J. Leg. Aff. Dispute Resolut. Eng. Constr. 8 (3): 03716001. https://doi.org/10.1061/(ASCE)LA.1943-4170.0000187.
Elsayed, A., A. S. Eldin, and D. S. El Zanfaly. 2013. “Enhancing keyword search over relational databases using ontologies.” In Proc., 3rd Int. Conf. on Advances in Computing & Information Technology (ACITY 2013, 147–154. Berlin: Springer.
Fillmore, C. J., C. R. Johnson, and M. R. Petruck. 2003. “Background to framenet.” Int. J. Lexicogr. 16 (3): 235–250.
Goyal, A., V. Gupta, and M. Kumar. 2018. “Recent named entity recognition and classification techniques: A systematic review.” Comput. Sci. Rev. 29 (Aug): 21–43. https://doi.org/10.1016/j.cosrev.2018.06.001.
Haque, M. M., S. Pervin, and Z. Begum. 2013. “Literature review of automatic single document text summarization using NLP.” Int. J. Innovation Appl. Stud. 3 (3): 857–865.
Hassell, J., B. Aleman-Meza, and I. B. Arpinar. 2006. “Ontology-driven automatic entity disambiguation in unstructured text.” In Proc., Int. Semantic Web Conf., 44–57. Berlin: Springer.
Hebert, D., S. Nicolas, and T. Paquet. 2013. “Discrete CRF based combination framework for document image binarization.” In Proc., 12th Int. Conf. on Document Analysis and Recognition, 1165–1169. New York: IEEE.
Hua, L., and C. Quan. 2016. “A shortest dependency path based convolutional neural network for protein-protein relation extraction.” Biomed. Res. Int. 2016.
Hwang, B.-G., and L. K. Low. 2012. “Construction project change management in Singapore: Status, importance and impact.” Int. J. Project Manage. 30 (7): 817–826. https://doi.org/10.1016/j.ijproman.2011.11.001.
Jenset, G. B., and B. McGillivray. 2019. “Enhancing domain-specific supervised natural language intent classification with a top-down selective ensemble model.” Mach. Learn. Knowl. Extr. 1 (2): 630–640. https://doi.org/10.3390/make1020037.
Khoso, A. R., J. S. Khan, R. U. Faiz, and M. A. Akhund. 2019. “Assessment of change orders attributes in preconstruction and construction phase.” Civ. Eng. J. 5 (3): 616–623. https://doi.org/10.28991/cej-2019-03091273.
Kim, T., and S. Chi. 2019. “Accident case retrieval and analyses: Using natural language processing in the construction industry.” J. Constr. Eng. Manage. 145 (3): 04019004. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001625.
Kudo, T., and Y. Matsumoto. 2001. “Chunking with support vector machines.” In Proc., 2nd Meeting of the North American Chapter of the Association for Computational Linguistics. Stroudsburg, PA: Association for Computational Linguistics.
Lampert, T. A., A. Stumpf, and P. Gançarski. 2016. “An empirical study into annotator agreement, ground truth estimation, and algorithm evaluation.” IEEE Trans. Image Process. 25 (6): 2557–2572. https://doi.org/10.1109/TIP.2016.2544703.
Le, T., C. Le, H. D. Jeong, S. B. Gilbert, and E. Chukharev-Hudilainen. 2019. “Requirement text detection from contract packages to support project definition determination.” In Advances in informatics and computing in civil and construction engineering, 569–576. Cham, Switzerland: Springer.
Lee, J., and J.-S. Yi. 2017. “Predicting project’s uncertainty risk in the bidding process by integrating unstructured text data and structured numerical data using text mining.” Appl. Sci. 7 (11): 1141. https://doi.org/10.3390/app7111141.
Lee, J., J.-S. Yi, and J. Son. 2019. “Development of automatic-extraction model of poisonous clauses in international construction contracts using rule-based NLP.” J. Comput. Civ. Eng. 33 (3): 04019003. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000807.
Li, G., J. Feng, B. C. Ooi, J. Wang, and L. Zhou. 2011. “An effective 3-in-1 keyword search method over heterogeneous data sources.” Inf. Syst. 36 (2): 248–266. https://doi.org/10.1016/j.is.2008.08.001.
Li, S., H. Cai, and V. R. Kamat. 2016. “Integrating natural language processing and spatial reasoning for utility compliance checking.” J. Constr. Eng. Manage. 142 (12): 04016074. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001199.
Liao, W., and S. Veeramachaneni. 2009. “A simple semi-supervised algorithm for named entity recognition.” In Proc., NAACL HLT 2009 Workshop on Semi-Supervised Learning for Natural Language Processing, 58–65. Stroudsburg, PA: Association for Computational Linguistics.
Liu, H., Y.-S. Liu, P. Pauwels, H. Guo, and M. Gu. 2017. “Enhanced explicit semantic analysis for product model retrieval in construction industry.” IEEE Trans. Ind. Inf. 13 (6): 3361–3369. https://doi.org/10.1109/TII.2017.2708727.
Marrero, M., J. Urbano, S. Sánchez-Cuadrado, J. Morato, and J. M. Gómez-Berbís. 2013. “Named entity recognition: Fallacies, challenges and opportunities.” Comput. Stand. Interfaces 35 (5): 482–489. https://doi.org/10.1016/j.csi.2012.09.004.
Matthies, B. 2017. “A text mining approach for extracting lessons learned from project documentation: An illustrative case study.” Int. J. Manage. Knowl. Learn. 2: 153–174.
Mavridis, N., and D. Roy. 2006. “Grounded situation models for robots: Where words and percepts meet.” In Proc., 2006 IEEE/RSJ Int. Conf. on Intelligent Robots and Systems, 4690–4697. New York: IEEE.
Misra, J. 2020. “AutoNLP: NLP feature recommendations for text analytics applications.” Preprint, submitted February 8, 2020. http://arxiv.org/abs/03056.
Moreda, P., B. Navarro, and M. Palomar. 2007. “Corpus-based semantic role approach in information retrieval.” Data Knowl. Eng. 61 (3): 467–483. https://doi.org/10.1016/j.datak.2006.06.010.
Myers, B., S. E. Hudson, and R. Pausch. 2000. “Past, present, and future of user interface software tools.” ACM Trans. Comput.-Hum. Interact. 7 (1): 3–28. https://doi.org/10.1145/344949.344959.
NDDOT (North Dakota Department of Transportation). 2007. Construction records manual. Bismarck, ND: NDDOT.
Pham, M., S. Alse, C. A. Knoblock, and P. Szekely. 2016. “Semantic labeling: A domain-independent approach.” In Proc., Int. Semantic Web Conf., 446–462. Berlin: Springer.
Serag, E., A. Oloufa, L. Malone, and E. Radwan. 2010. “Model for quantifying the impact of change orders on project cost for US roadwork construction.” J. Constr. Eng. Manage. 136 (9): 1015–1027. https://doi.org/10.1061/(ASCE)CO.1943-7862.0000206.
Shen, L., H. Yan, H. Fan, Y. Wu, and Y. Zhang. 2017. “An integrated system of text mining technique and case-based reasoning (TM-CBR) for supporting green building design.” Build. Environ. 124 (Nov): 388–401. https://doi.org/10.1016/j.buildenv.2017.08.026.
Sutton, C., and A. McCallum. 2012. “An introduction to conditional random fields.” Found. Trends Mach. Learn. 4 (4): 267–373. https://doi.org/10.1561/2200000013.
Syal, M., and M. Bora. 2016. “Change order clauses in standard contract documents.” Pract. Period. Struct. Des. Constr. 21 (2): 04015021. https://doi.org/10.1061/(ASCE)SC.1943-5576.0000281.
Systematics, C. 2013. Improving management of transportation information. Washington, DC: Transportation Research Board.
Taylor, T. R., M. Uddin, P. M. Goodrum, A. McCoy, and Y. Shan. 2012. “Change orders and lessons learned: Knowledge from statistical analyses of engineering change orders on Kentucky highway projects.” J. Constr. Eng. Manage. 138 (12): 1360–1369. https://doi.org/10.1061/(ASCE)CO.1943-7862.0000550.
Tekli, J. 2016. “An overview on xml semantic disambiguation from unstructured text to semi-structured data: Background, applications, and ongoing challenges.” IEEE Trans. Knowl. Data Eng. 28 (6): 1383–1407. https://doi.org/10.1109/TKDE.2016.2525768.
TxDOT (Texas Department of Transportation). 2020. Construction contract administration manual. Austin, TX: TxDOT.
Whiteside, M. 2016. Project delivery document management systems. Sacramento, CA: Caltrans Division of Research, Innovation and System Information.
WSDOT (Washington Dept. of Transportation). 2017. WSDOT construction change order process guide. Olympia, WA: WSDOT.
Yadav, S., A. Ekbal, S. Saha, A. Kumar, and P. Bhattacharyya. 2019. “Feature assisted stacked attentive shortest dependency path based Bi-LSTM model for protein–protein interaction.” Knowl.-Based Syst. 166: 18–29.
Zamin, N., and A. Oxley. 2011. “Building a corpus-derived gazetteer for named entity recognition.” In Proc., Int. Conf. on Software Engineering and Computer Systems, 73–80. Berlin: Springer.
Zhang, D., J. Wang, and X. Zhao. 2015. “Estimating the uncertainty of average F1 scores.” In Proc., 2015 Int. Conf. on the Theory of Information Retrieval, 317–320. New York: Association for Computing Machinery.
Zhang, F., H. Fleyeh, X. Wang, and M. Lu. 2019. “Construction site accident analysis using text mining and natural language processing techniques.” Autom. Constr. 99 (Mar): 238–248. https://doi.org/10.1016/j.autcon.2018.12.016.
Zhang, J., and N. M. El-Gohary. 2017. “Semantic-based logic representation and reasoning for automated regulatory compliance checking.” J. Comput. Civ. Eng. 31 (1): 04016037. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000583.
Zhou, P., and N. El-Gohary. 2015. “Ontology-based information extraction from environmental regulations for supporting environmental compliance checking.” Comput. Civ. Eng. 2015: 190–198. https://doi.org/10.1061/9780784479247.024.
Zhou, P., and N. El-Gohary. 2017. “Ontology-based automated information extraction from building energy conservation codes.” Autom. Constr. 74 (Feb): 103–117. https://doi.org/10.1016/j.autcon.2016.09.004.
Zou, Y., A. Kiviniemi, and S. W. Jones. 2017. “Retrieving similar cases for construction project risk management using Natural Language Processing techniques.” Autom. Constr. 80 (Aug): 66–76. https://doi.org/10.1016/j.autcon.2017.04.003.

Information & Authors

Information

Published In

Go to Journal of Construction Engineering and Management
Journal of Construction Engineering and Management
Volume 147Issue 11November 2021

History

Received: Dec 16, 2020
Accepted: Jun 24, 2021
Published online: Aug 27, 2021
Published in print: Nov 1, 2021
Discussion open until: Jan 27, 2022

Permissions

Request permissions for this article.

Authors

Affiliations

Ph.D. Student, Dept. of Architecture, Texas A&M Univ., College Station, TX 77843. Email: [email protected]
Professor, Dept. of Construction Science, Texas A&M Univ., College Station, TX 77843 (corresponding author). ORCID: https://orcid.org/0000-0003-4074-1869. Email: [email protected]
Professor, Dept. of Architecture and Architectural Engineering, Yonsei Univ., 50 Yonsei-ro, Seodaemun-Gu, Seoul 120-749, Republic of Korea. Email: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

Cited by

  • Industry 4.0 and Construction Contract Management: A Bibliometric Survey, Journal of Legal Affairs and Dispute Resolution in Engineering and Construction, 10.1061/JLADAH.LADR-988, 16, 3, (2024).
  • Dispute Classification and Analysis: Deep Learning–Based Text Mining for Construction Contract Management, Journal of Construction Engineering and Management, 10.1061/JCEMD4.COENG-14080, 150, 1, (2024).
  • Digital Technologies for Preventing, Mitigating, and Resolving Contractual Disagreements in the AEC Industry: A Systematic Literature Review, Journal of Construction Engineering and Management, 10.1061/JCEMD4.COENG-14032, 150, 6, (2024).
  • Project Requirements Prioritization through NLP-Driven Classification and Adjusted Work Items Analysis, Journal of Construction Engineering and Management, 10.1061/JCEMD4.COENG-13655, 150, 3, (2024).
  • Automated Identification of Active Players for International Construction Market Entry Using Natural Language Processing, Journal of Management in Engineering, 10.1061/JMENEA.MEENG-5298, 39, 5, (2023).
  • Natural Language Processing–Driven Similar Project Determination Using Project Scope Statements, Journal of Management in Engineering, 10.1061/JMENEA.MEENG-5229, 39, 3, (2023).
  • Convolutional Neural Network Algorithm–Based Novel Automatic Text Classification Framework for Construction Accident Reports, Journal of Construction Engineering and Management, 10.1061/JCEMD4.COENG-13523, 149, 12, (2023).
  • The Effect of Change-Order Management Factors on Construction Project Success: A Structural Equation Modeling Approach, Journal of Construction Engineering and Management, 10.1061/(ASCE)CO.1943-7862.0002350, 148, 9, (2022).
  • Named entity recognition of building construction defect information from text with linguistic noise, Automation in Construction, 10.1016/j.autcon.2022.104543, 143, (104543), (2022).
  • Improving the accuracy of schedule information communication between humans and data, Advanced Engineering Informatics, 10.1016/j.aei.2022.101645, 53, (101645), (2022).

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share