Technical Papers
Mar 6, 2023

Developing a Construction Domain–Specific Artificial Intelligence Language Model for NCDOT’s CLEAR Program to Promote Organizational Innovation and Institutional Knowledge

Publication: Journal of Computing in Civil Engineering
Volume 37, Issue 3

Abstract

Transportation agency personnel gain valuable knowledge through their work, but such knowledge is lost if it is not documented properly after the worker leaves the organization. The risk of losing institutional knowledge is a current problem at state departments of transportation, including the North Carolina Department of Transportation (NCDOT), due to high personnel turnover. State transportation agencies have implemented knowledge repositories in the form of lessons learned/best practices databases to address this problem. However, motivating end-users to use such databases is challenging. This paper addresses this challenge through novel artificial intelligence technology whereby a neural network–based language model is implemented as part of the NCDOT’s new knowledge management program: Communicate Lessons, Exchange Advice, Record (CLEAR). The CLEAR program encompasses a database of lessons learned/best practices and a website to access and search the database. The developed methodology involves training a language model on transportation construction texts and using that trained model in a novel algorithm enabling users to search the CLEAR database easily. The developed language-processing model provides an easily accessible interface to suggest the most relevant CLEAR data based on the end-user’s searched keywords. The model learns an inference model of construction domain–specific vocabulary extracted from various sources, such as contract documents, textbooks, and specifications, to make meaningful connections between lessons learned/best practices in the CLEAR database and project-specific knowledge. The developed model has been validated by project managers for projects at various life cycle stages. The automation of information retrieval is intended to encourage NCDOT personnel to use and embrace the CLEAR program as part of their routine work to improve project workflow. In the long run, the NCDOT will benefit from consistent usage of the CLEAR program and its high quality content, thereby leading to enhanced institutional knowledge and organizational innovation.

Practical Applications

The construction industry, with a particular emphasis on transportation construction, currently faces tremendous challenges in retaining and retraining existing personnel to ensure business continuity on projects. Knowledge gained on projects by project personnel can be lost forever if not properly documented. While knowledge repositories are effective toward ensuring the storing and retrieving of past knowledge, extant literature underlines the need to ensure continued participation by the end-users for the success of such repositories. This research effort uses natural language processing, a subfield artificial intelligence that deals specifically with text sources, as a means to quickly and accurately enhance the quality of search results being displayed to the end-users within the North Carolina Department of Transportation’s recently commissioned knowledge management program called CLEAR. As a result, end-users can stay motivated and embrace the CLEAR program, thereby ensuring its long-term success. In the long run, the consistent usage of the CLEAR program and the high quality content that is input to the CLEAR database by the NCDOT end-users will lead to enhanced institutional knowledge and internal organizational innovation.

Get full access to this article

View all available purchase options and get full access to this article.

Data Availability Statement

Some or all data, models, or code that support the findings of this study are available from the corresponding author upon reasonable request. The CLEAR Lessons Learned/Best Practices data and other information such as contract documents, and feasibility study reports used in this paper are proprietary to the NCDOT and such data can be obtained directly from the NCDOT Value Management Office. In addition, the training model and code used to develop the CD-SAIL model are currently copyrighted to the NCDOT and the North Carolina State University (NCSU) research team through a tech-transfer agreement, but can be provided to anyone interested upon submitting a reasonable request to the corresponding author.

Acknowledgments

The authors would like to thank Clare E. Fullerton and Alyson W. Tamer with NCDOT’s Value Management Unit for providing proprietary text data sources such as contract documents that helped us in preparing the transportation construction corpus of words. The authors would also like to thank the NCDOT project managers who helped us validate and iteratively fine-tune the CD-SAIL model. Finally, the authors thank the anonymous reviewers of this manuscript for providing constructive feedback to improve its quality.

References

AASHTO. 2021. “State DOT panel examines workforce recruitment, retention issues.” Accessed December 18, 2021. https://aashtojournal.org/2021/05/14/state-dot-panel-examines-workforce-recruitment-retention-issues/.
Addis, M. 2016. “Tacit and explicit knowledge in construction management.” Construct. Manage. Econ. 34 (7–8): 439–445. https://doi.org/10.1080/01446193.2016.1180416.
Alsharef, A., S. Banerjee, S. M. J. Uddin, A. Albert, and E. Jaselskis. 2021. “Early impacts of the COVID-19 pandemic on the united states construction industry.” Int. J. Environ. Res. Public Health 18 (4): 1559. https://doi.org/10.3390/ijerph18041559.
Amir, R., and J. Parvar. 2014. “Harnessing knowledge management to improve organizational performance.” Int. J. Trade Econ. Finance 5 (1): 31–38. https://doi.org/10.7763/IJTEF.2014.V5.336.
Anumba, C., C. Egbu, and P. Carrillo. 2005. Knowledge management in construction. Oxford, UK: Blackwell.
Assaad, R., and I. H. El-adaway. 2021. “Guidelines for responding to COVID-19 pandemic: Best practices, impacts, and future research directions.” J. Manage. Eng. 37 (3): 06021001. https://doi.org/10.1061/(ASCE)ME.1943-5479.0000906.
Banerjee, S., E. J. Jaselskis, and A. Alsharef. 2020. “Design for six sigma (DFSS) approach for creating clear lessons learned database.” Periodica Polytechnica Archit. 51 (1): 75–82. https://doi.org/10.3311/PPar.15442.
Banerjee, S., C. M. Potts, A. H. Jhala, and E. J. Jaselskis. 2021. “Neural language model based intelligent semantic information retrieval on NCDOT projects for knowledge management.” In Proc., 2021 ASCE Int. Conf. on Computing in Civil Engineering (i3CE2021), 779–786. Orlando, FL: American Society of Civil Engineers.
Bengio, Y., R. Ducharme, P. Vincent, and C. Jauvin. 2003. “A neural probabilistic language model.” J. Mach. Learn. Res. 3: 1137–1155.
Berka, P. 2011. “NEST: A compositional approach to rule-based and case-based reasoning.” Adv. Artif. Intell. 2011: 1–15. https://doi.org/10.1155/2011/374250.
Bojanowski, P., E. Grave, A. Joulin, and T. Mikolov. 2017. “Enriching word vectors with subword information.” Trans. Assoc. Comput. Ling. 5: 135–146. https://doi.org/10.1162/tacl_a_00051.
Brown, T., et al. 2020. “Language models are few-shot learners.” In Proc., 34th Int. Conf. on Neural Information Processing Systems, 1877–1901. Vancouver, Canada: Curran Associates Inc.
Chalkidis, I., M. Fergadiotis, P. Malakasiotis, N. Aletras, and I. Androutsopoulos. 2020. “LEGAL-BERT: The muppets straight out of law school.” Preprint, submitted October 6, 2020. http://arxiv.org/abs/2010.02559.
Dekker, R., and R. de Hoog. 2000. “The monetary value of knowledge assets: A micro approach.” Expert Syst. Appl. 18 (2): 111–124. https://doi.org/10.1016/S0957-4174(99)00057-3.
Devlin, J., M. W. Chang, K. Lee, and K. Toutanova. 2019. “BERT: Pre-training of deep bidirectional transformers for language understanding.” In Proc., NAACL-HLT 2019, 4171–4186. Minneapolis: Association for Computational Linguistics.
du Plessis, M. 2007. “The role of knowledge management in innovation.” J. Knowledge Manage. 11 (4): 20–29. https://doi.org/10.1108/13673270710762684.
Egbu, C. 2004. “Managing knowledge and intellectual capital for improved organizational innovations in the construction industry: An examination of critical success factors.” Eng. Constr. Archit. Manage. 11 (5): 301–315. https://doi.org/10.1108/09699980410558494.
Everett, J. G., and S. Farghal. 1994. “Learning curve predictors for construction field operations.” J. Constr. Eng. Manage. 120 (3): 603–616. https://doi.org/10.1061/(ASCE)0733-9364(1994)120:3(603).
Ferrada, X., D. Nunez, A. Neyem, A. Serpell, and M. Sepulveda. 2016. “A lessons-learned system for construction project management: A preliminary application.” Proc. Soc. Behav. Sci. 226 (7): 302–309. https://doi.org/10.1016/j.sbspro.2016.06.192.
Fullerton, C. E., A. W. Tamer, S. Banerjee, A. Alsharef, and E. J. Jaselskis. 2021. “Development of north Carolina department of transportation’s clear program for enhanced project management.” Transp. Res. Rec. 2675 (7): 222–234. https://doi.org/10.1177/0361198121995195.
Gino, F., and B. Staats. 2015. “Why organizations don’t learn.” Accessed June 23, 2022. https://hbr.org/2015/11/why-organizations-dont-learn.
Goh, S. 2002. “Managing effective knowledge transfer: An integrative framework and some practice implications.” J. Knowledge Manage. 6 (1): 23–30. https://doi.org/10.1108/13673270210417664.
Hassan, F., and T. Le. 2020. “Automated requirements identification from construction contract documents using natural language processing.” J. Leg. Aff. Dispute Resolut. Eng. Constr. 12 (2): 04520009. https://doi.org/10.1061/(ASCE)LA.1943-4170.0000379.
Honnibal, M., I. Montani, S. Van Landeghem, and A. Boyd. 2020. spaCy: Industrial-strength natural language processing in python. Geneva: European Organization for Nuclear Research.
Issa, R. R. A., and J. Haddad. 2008. “Perceptions of the impacts of organizational culture and information technology on knowledge sharing in construction.” Constr. Innovation 8 (3): 182–201. https://doi.org/10.1108/14714170810888958.
Johannessen, J.-A. 2008. “Organisational innovation as part of knowledge management.” Int. J. Inf. Manage. 28 (5): 403–412. https://doi.org/10.1016/j.ijinfomgt.2008.04.007.
Johari, S., and K. N. Jha. 2021. “Learning curve models for construction workers.” J. Manage. Eng. 37 (5): 04021042. https://doi.org/10.1061/(ASCE)ME.1943-5479.0000941.
Kim, T., and S. Chi. 2019. “Accident case retrieval and analyses: Using natural language processing in the construction industry.” J. Constr. Eng. Manage. 145 (3): 04019004. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001625.
Kim, Y., S. Bang, J. Sohn, and H. Kim. 2022. “Question answering method for infrastructure damage information retrieval from textual data using bidirectional encoder representations from transformers.” Autom. Constr. 134 (Feb): 104061. https://doi.org/10.1016/j.autcon.2021.104061.
Le, Q., and T. Mikolov. 2014. “Distributed representations of sentences and documents.” In Proc., 31st Int. Conf. on Machine Learning, 1188–1196. Beijing, China: JMLR: W & CP.
Lee, J. H., J. S. Yi, and J. W. Son. 2019. “Development of automatic-extraction model of poisonous clauses in international construction contracts using rule-based nlp.” J. Comput. Civ. Eng. 33 (3): 04019003. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000807.
Lee, J.-S., and J. Hsiang. 2019. “PatentBERT: Patent classification with fine-tuning a pre-trained BERT Model.” Preprint, submitted May 14, 2019. http://arxiv.org/abs/1906.02124.
Liu, W., P. Zhou, Z. Zhao, Z. Wang, Q. Ju, H. Deng, and P. Wang. 2020. “K-BERT: Enabling language representation with knowledge graph.” In Proc., AAAI Conf. on Artificial Intelligence, 2901–2908. New York: AAAI Press.
Liu, Y., M. Ott, N. Goyal, J. Du, M. Joshi, D. Chen, O. Levy, M. Lewis, L. Zettlemoyer, and V. Stoyanov. 2019. “RoBERTa: A robustly optimized bert pretraining approach.” Preprint, submitted July 26, 2019. http://arxiv.org/abs/1907.11692v1.
McRae, G., C. Vallet, and J. Jewiss. 2018. Vermont agency of transportation employee retention and knowledge management study. Burlington, Canada: Univ. of Vermont Transportation Research Center.
Merchant, A., E. Rahimtoroghi, E. Pavlick, and I. Tenney. 2020. “What happens to {BERT} embeddings during fine-tuning?” In Proc., 3rd BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 33–44. Association for Computational Linguistics. https://doi.org/10.18653/v1/2020.blackboxnlp-1.4.
Mikolov, T., K. Chen, G. Corrado, and J. Dean. 2013. “Efficient estimation of word representations in vector space.” Preprint, submitted January 16, 2013. https://arxiv.org/abs/1301.3781v3.
National Skills Coalition. 2021. “Building a people-centered infrastructure plan for the 21st century.” Accessed January 10, 2022. https://nationalskillscoalition.org/wp-content/uploads/2021/01/11102020-Building-a-People-Centered-Infrastructure-Plan-Memo-Public.pdf.
NCDOT (North Carolina DOT). 2004. North Carolina generalassembly NCDOT project delivery study. Raleigh, NC: Dye Management Group.
Nonaka, I. 1994. “A dynamic theory of organizational knowledge creation.” Organ. Sci. 5 (1): 14–37. https://doi.org/10.1287/orsc.5.1.14.
Pennington, J., R. Socher, and C. D. Manning. 2014. “GloVe: Global vectors for word representation.” In Proc., 2014 Conf. on Empirical Methods in Natural Language Processing (EMNLP), 1532–1543. Doha, Qatar: Association for Computational Linguistics.
Petter, S., W. DeLone, and E. R. McLean. 2013. “Information systems success: The quest for the independent variables.” J. Manage. Inf. Syst. 29 (4): 7–62. https://doi.org/10.2753/MIS0742-1222290401.
PMI (Project Management Institute). 2017. A guide to the project management body of knowledge (PMBOK guide). 6th ed. Newton Square, PA: PMI.
Potts, C. M., and A. H. Jhala. 2021. “Narraport: Narrative-based interactions and report generation with large datasets.” In Proc., Int. Conf. on Interactive Digital Storytelling, 118–127. Cham, Switzerland: Springer. https://doi.org/10.1007/978-3-030-92300-6_11.
Rezgui, Y. 2006. “Ontology-centered knowledge management using information retrieval techniques.” J. Comput. Civ. Eng. 20 (4): 261–270. https://doi.org/10.1061/(ASCE)0887-3801(2006)20:4(261).
Robertson, J., B. Harrison, and A. H. Jhala. 2020. “Interactive summarization for data filtering and triage.” In Proc., 33rd Int. FLAIRS Conf. (FLAIRS-33), 252–257. North Miami Beach, FL: Association for the Advancement of Artificial Intelligence.
Rowe, S. F., and S. Sikes. 2006. “Lessons learned: sharing the knowledge.” In Paper presented at PMI® Global Congress 2006—EMEA. Newtown Square, PA: Project Management Institute.
Sheehan, T., D. Poole, I. Lyttle, and C. Egbu. 2005. “Strategies and business case for knowledge management.” In Knowledge management in construction, edited by C. Anumba, C. Egbu, and P. Carrillo, 50–64. Oxford, UK: Blackwell.
Smith, M. K. 2003. “Michael Polanyi and tacit knowledge.” Accessed April 1, 2021. https://infed.org/mobi/michael-polanyi-and-tacit-knowledge/.
Tserng, P. H., S. Y. Yen-Liang, and M. H. Lee. 2010. “The use of knowledge map model in construction industry.” J. Civ. Eng. Manage. 16 (3): 332–344. https://doi.org/10.3846/jcem.2010.38.
Tummalapudi, M., J. Killingsworth, C. Harper, and M. Mehaney. 2021. “US construction industry managerial strategies for economic recession and recovery: A Delphi Study.” J. Constr. Eng. Manage. 147 (11): 04021146. https://doi.org/10.1061/(ASCE)CO.1943-7862.0002175.
Youndt, M. A., and S. A. Snell. 2004. “Human resource configurations, intellectual capital, and organizational performance.” J. Managerial Issues 16 (3): 337–360.
Zahra, S. A., and G. George. 2002. “Absorptive capacity: A review, reconceptualization, and extension.” Acad. Manage. Rev. 27 (2): 185–203. https://doi.org/10.2307/4134351.
Zhai, C. X. 2008. Statistical language models for information retrieval. San Rafael, CA: Morgan & Claypool Publishers.
Zhang, J., and N. M. El-Gohary. 2016. “Semantic nlp-based information extraction from construction regulatory documents for automated compliance checking.” J. Comput. Civ. Eng. 30 (2): 04015014. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000346.
Zhang, R., and N. M. El-Gohary. 2021. “A deep neural network-based method for deep information extraction using transfer learning strategies to support automated compliance checking.” Autom. Constr. 132 (Dec): 103834. https://doi.org/10.1016/j.autcon.2021.103834.

Information & Authors

Information

Published In

Go to Journal of Computing in Civil Engineering
Journal of Computing in Civil Engineering
Volume 37Issue 3May 2023

History

Received: Feb 19, 2022
Accepted: Dec 21, 2022
Published online: Mar 6, 2023
Published in print: May 1, 2023
Discussion open until: Aug 6, 2023

Permissions

Request permissions for this article.

Authors

Affiliations

Siddharth Banerjee, Ph.D., A.M.ASCE https://orcid.org/0000-0003-3666-4987 [email protected]
Assistant Professor in Residence, Dept. of Civil Engineering and Construction, Bradley Univ., Peoria, IL 61625 (corresponding author). ORCID: https://orcid.org/0000-0003-3666-4987. Email: [email protected]
Postdoctoral Associate, Dept. of Computer Science, North Carolina State Univ., Raleigh, NC 27606. ORCID: https://orcid.org/0000-0001-9406-3499. Email: [email protected]
Arnav H. Jhala, Ph.D. [email protected]
Associate Professor, Dept. of Computer Science, North Carolina State Univ., Raleigh, NC 27606. Email: [email protected]
Edward J. Jaselskis, Ph.D., A.M.ASCE [email protected]
P.E.
E.I. Clancy Distinguished Professor, Dept. of Civil, Construction and Environmental Engineering, North Carolina State Univ., Raleigh, NC 27695. Email: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share