TECHNICAL PAPERS
Aug 10, 2009

Concept Relation Extraction from Construction Documents Using Natural Language Processing

Publication: Journal of Construction Engineering and Management
Volume 136, Issue 3

Abstract

The objective of this research is to present an innovative technique for managing the knowledge contained in construction contract documents to facilitate quick access and efficient use of such knowledge for project management and contract administration tasks. Knowledge management became the focus of much scientific research during the second half of the 20th century, as researchers discovered the importance of the knowledge resource to business organizations. Despite early expectations of improved document management techniques, document management systems used in the construction industry have failed to deliver the anticipated performance. Recent research attempts to use analysis of document contents to improve document categorization and retrieval functions. It is hypothesized that natural language processing can be effectively used to perform document text analysis. The proposed system, a technique for concept relation identification using shallow parsing (CRISP), uses a shallow parser to extract semantic knowledge from construction contract documents, which can be used to improve electronic document management functions such as document categorization and retrieval. When compared with human evaluators, CRISP achieved almost 80% of the average kappa score attained by the evaluators and approximately 90% of their F-measure score.
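The abstract reports results as a kappa score and an F-measure but this page does not describe how either was computed. As a minimal sketch only, the block below shows the textbook forms of both metrics: Cohen's kappa for two raters labelling the same items, and the balanced F-measure (F1) of a predicted set against a reference set. The function names and the sample data are hypothetical, and the paper may have used a different variant or averaging scheme.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two raters labelling the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label lists must be non-empty and equal in length")
    n = len(labels_a)
    # Observed agreement: fraction of items given the same label by both raters.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of each rater's label frequencies, summed over labels.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used one identical label throughout
    return (observed - expected) / (1 - expected)


def f_measure(predicted, reference):
    """Balanced F-measure (F1) of a predicted set against a reference set."""
    predicted, reference = set(predicted), set(reference)
    true_pos = len(predicted & reference)
    if true_pos == 0:
        return 0.0
    precision = true_pos / len(predicted)
    recall = true_pos / len(reference)
    return 2 * precision * recall / (precision + recall)


# Hypothetical data: 1 = "relation present", 0 = "relation absent" for 8 sentences.
system_labels = [1, 1, 0, 0, 1, 0, 1, 0]
human_labels = [1, 0, 0, 0, 1, 0, 1, 1]
kappa = cohen_kappa(system_labels, human_labels)

# Hypothetical relation identifiers extracted by a system and by a human evaluator.
f1 = f_measure({"a", "b", "c"}, {"b", "c", "d", "e"})
```

Under this reading, a statement such as "almost 80% of the average kappa score attained by the evaluators" would be the ratio of the system-versus-human kappa to the average human-versus-human kappa, though that interpretation is an assumption here.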

Acknowledgments

The research team would like to thank Ellen Riloff and Siddharth Patwardhan from the University of Utah for providing us with a copy of Sundance. Special thanks to Nick Pendar from Iowa State University for his valuable assistance. This study is supported by the National Science Foundation (Award No. NSF-CMMI-0700363). Any opinions, findings, conclusions, or recommendations expressed in this publication are those of the writers and do not necessarily reflect the views of the National Science Foundation.

Information & Authors

Information

Published In

Journal of Construction Engineering and Management
Volume 136, Issue 3, March 2010
Pages: 294 - 302

History

Received: Jan 23, 2009
Accepted: Jul 18, 2009
Published online: Aug 10, 2009
Published in print: Mar 2010

Authors

Affiliations

Mohammed Al Qady
Research Assistant, Division of Construction Engineering and Management, School of Civil Engineering, Purdue Univ., West Lafayette, IN 47907.
Amr Kandil, M.ASCE
Assistant Professor, Division of Construction Engineering and Management, School of Civil Engineering, Purdue Univ., West Lafayette, IN 47907 (corresponding author).
