Concept Relation Extraction from Construction Documents Using Natural Language Processing
Publication: Journal of Construction Engineering and Management
Volume 136, Issue 3
Abstract
The objective of this research is to present an innovative technique for managing the knowledge contained in construction contract documents, so that this knowledge can be accessed quickly and used efficiently in project management and contract administration tasks. Knowledge management became the focus of considerable scientific research during the second half of the 20th century, as researchers recognized the importance of the knowledge resource to business organizations. Despite early expectations of improved document management techniques, the document management systems used in the construction industry have failed to deliver the anticipated performance. Recent research attempts to analyze the contents of documents to improve document categorization and retrieval functions. It is hypothesized that natural language processing can be used effectively to perform document text analysis. The proposed system, a technique for concept relation identification using shallow parsing (CRISP), uses a shallow parser to extract semantic knowledge from construction contract documents; this knowledge can be used to improve electronic document management functions such as document categorization and retrieval. When compared with human evaluators, CRISP achieved almost 80% of the average kappa score attained by the evaluators and approximately 90% of their F-measure score.
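The two evaluation measures named in the abstract can be illustrated with a small sketch. The abstract does not say which kappa variant was used or how the evaluators' judgments were encoded, so the code below assumes Cohen's kappa for two raters and a binary label per candidate concept relation (1 = relation present, 0 = absent). The function names and the sample labels are hypothetical and are not taken from the study.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two raters who labelled the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label lists must be non-empty and of equal length")
    n = len(labels_a)
    # Observed agreement: fraction of items given the same label by both raters.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: derived from each rater's own label frequencies.
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used one identical label for every item
    return (observed - expected) / (1.0 - expected)


def f_measure(gold, predicted, positive=1):
    """Balanced F-measure (harmonic mean of precision and recall)."""
    pairs = list(zip(gold, predicted))
    tp = sum(g == positive and p == positive for g, p in pairs)
    fp = sum(g != positive and p == positive for g, p in pairs)
    fn = sum(g == positive and p != positive for g, p in pairs)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical judgments on eight candidate relations (illustrative data only).
human = [1, 1, 0, 1, 0, 0, 1, 0]
system = [1, 0, 0, 1, 0, 1, 1, 0]

print(cohen_kappa(human, system))  # 0.5
print(f_measure(human, system))    # 0.75
```

Under these assumptions, a statement such as "80% of the average kappa score attained by the evaluators" would compare the system-versus-human kappa with the average kappa obtained between pairs of human evaluators.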
Acknowledgments
The research team would like to thank Ellen Riloff and Siddharth Patwardhan from the University of Utah for providing us with a copy of Sundance. Special thanks to Nick Pendar from Iowa State University for his valuable assistance. This study is supported by the National Science Foundation (Award No. NSF-CMMI-0700363). Any opinions, findings, conclusions, or recommendations expressed in this publication are those of the writers and do not necessarily reflect the views of the National Science Foundation.
Copyright
© 2010 ASCE.
History
Received: Jan 23, 2009
Accepted: Jul 18, 2009
Published online: Aug 10, 2009
Published in print: Mar 2010