Chapter
May 24, 2022

Deep Learning-Based Sequence Labeling for Information Extraction from Multiple Types of Textual Bridge Reports

Publication: Computing in Civil Engineering 2021

ABSTRACT

A massive amount of data/information that is buried in unstructured data sources such as bridge rehabilitation reports offer opportunities to complement other data sources such as structural health monitoring data for improved prediction of future bridge conditions. Extracting these data/information from their sources is, however, challenging for two reasons: (1) utilizing textual data for analytics remains to be a challenge due to the inherently unstructured nature of these data; and (2) existing information extraction methods, which aim to extract structured data/information from text, are limited in their capabilities and generalizability (e.g., a method developed to extract certain information from one type of document may not perform well on another type). To address these gaps, this paper proposes a deep learning-based information extraction (IE) model that automatically recognizes and extracts bridge-related data/information (e.g., bridge deficiencies) from multiple types of unstructured textual sources and represents the extracted data/information in a structured format to support further data analytics. The proposed IE model utilizes deep learning-based BiLSTM; begin, inside, and outside (BIO) encoding for the phrase-level segmentation; and conditional random fields (CRF) for both word-level and phrase-level labeling for automatic sequence labeling. The paper discusses the proposed model and its performance results.

Get full access to this article

View all available purchase options and get full access to this chapter.

REFERENCES

Assaad, R., and El-Adaway, I. (2020). “Bridge infrastructure asset management system: Comparative computational machine learning approach for evaluating and predicting deck deterioration conditions.” Journal of Infrastructure Systems, 26 (3), 04020032.
Yang, D. Y., and Frangopol, D. M. (2020). Life-cycle management of deteriorating bridge networks with network-level risk bounds and system reliability analysis. Structural Safety, 83, 101–117.
Zhou, X., Di, J., and Tu, X. (2019). “Investigation of collapse of Florida International University (FIU) pedestrian bridge.” Engineering Structures, 200, 1–16.
Saeed, T. U., Moomen, M., Ahmed, A., Murillo-Hoyos, J., Volovski, M., and Labi, S. (2017). “Performance evaluation and life prediction of highway concrete bridge superstructure across design types.” Journal of Performance of Constructed Facilities, 31(5), 04017052.
Fang, Y., and Sun, L. (2019). “Developing A semi-markov process model for bridge deterioration prediction in Shanghai.” Sustainability, 11(19), 1–15.
Huang, Y.-H. (2010). “Artificial neural network model of bridge deterioration.” Journal of Performance of Constructed Facilities, 24(6), 597–602.
Dashtipour, K., Gogate, M., Adeel, A., Algarafi, A., Howard, N., and Hussain, A. (2017). “Persian named entity recognition.” Proc. In 2017 IEEE 16th International Conference on Cognitive Informatics & Cognitive Computing (ICCI* CC), IEEE CS Press, Washington, D.C., United States.
Ravikumar, J., and Ramakanth, K. P. (2021). Machine learning model for clinical named entity recognition. International Journal of Electrical and Computer Engineering, 11(2), 1689–1677.
Liu, K., and El-Gohary, N. (2016). “Ontology-based sequence labelling for automated information extraction for supporting bridge data analytics.” Procedia Engineering, 145, 504–510.
Indurkhya, N., and Damerau, F. J. (Eds.). (2010). Handbook of natural language processing (Vol. 2), CRC Press, Boca Raton, Florida, 200–210.
Xu, X., and Cai, H. (2019). Semantic frame-based information extraction from utility regulatory documents to support compliance checking. Advances in Informatics and Computing in Civil and Construction Engineering, Springer, Basel, Switzerland 223–230.
Zhang, J., and El-Gohary, N. (2013). “Semantic NLP-based information extraction from construction regulatory documents for automated compliance checking.” Journal of Computing in Civil Engineering, 30(2), 04015014.
Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., and Dyer, C. (2016). “Neural architectures for named entity recognition.”.
Ling, W., Luis, T., Marujo, L., Astudillo, R. F., Amir, S., Dyer, C., Black, A. W., and Trancoso, I. (2015). “Finding function in form: Compositional character models for open vocabulary word representation.”.
Yu, Y., Si, X., Hu, C., and Zhang, J. (2019). “A review of recurrent neural networks: LSTM cells and network architectures.” Neural Computation, 31(7), 1235–1270.
Ammar, W., Dyer, C., and Smith, N. A. (2014). Conditional random field autoencoders for unsupervised structured prediction. Proc. in 2014 27th Advances in Neural Information Processing Systems, MIT Press55, MA, United States.
Lee, J., Yi, J. S., and Son, J. (2019). “Development of automatic-extraction model of poisonous clauses in international construction contracts using rule-based NLP”. Journal of Computing in Civil Engineering, 33(3), 04019003.
Seedah, D. P., and Leite, F. (2015). “Information extraction for freight-related natural language queries”. Computing in Civil Engineering, 35(5),427–435.

Information & Authors

Information

Published In

Go to Computing in Civil Engineering 2021
Computing in Civil Engineering 2021
Pages: 180 - 187

History

Published online: May 24, 2022

Permissions

Request permissions for this article.

Authors

Affiliations

Qiyang Chen, S.M.ASCE [email protected]
1Ph.D. Student, Dept. of Civil and Environmental Engineering, Univ. of Illinois at Urbana-Champaign, Urbana, IL. Email: [email protected]
Nora EI-Gohary, A.M.ASCE [email protected]
2Associate Professor, Dept. of Civil and Environmental Engineering, Univ. of Illinois at Urbana-Champaign, Urbana, IL. Email: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Paper
$35.00
Add to cart
Buy E-book
$358.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Paper
$35.00
Add to cart
Buy E-book
$358.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share