Chapter
Jan 25, 2024

Dependency Parsing-Based Information Extraction from Car Crash Narratives to Support Crash Scene Reconstruction

Publication: Computing in Civil Engineering 2023

ABSTRACT

Crash scene reconstruction is essential for reverse-engineering the factors of a crash scene to determine the cause of a crash. It requires automated information extraction (IE) from the textual crash report narratives and their formalization in a computer-processable representation. Natural language processing (NLP) is a powerful computational tool to process texts. This paper presents a dependency parsing (DP)-based NLP system for automated IE of crash events information from crash report narratives. DP-based rules and patterns were leveraged for defining relations between subjects and objects to support the IE algorithm. The proposed IE system was tested on 50 reports collected from the Southeast Michigan Council of Governments (SEMCOG) traffic crash database, which achieved an overall 94.9% precision, 90.2% recall, and 92.5% F1-score. A parallel experiment was conducted with ChatGPT to extract information from crash narratives, where an 88.0% precision and 88.0% recall were obtained.

Get full access to this article

View all available purchase options and get full access to this chapter.

REFERENCES

Alkadi, S. (2017). “Rule-based information extraction for airplane crashes reports.” International Journal of Computational Linguistics (IJCL), 8, 1–36.
Deleger, L., et al. (2013). “Large-scale evaluation of automated clinical note de-identification and its impact on information extraction.” J. Am. Med. Inform. Assoc., 20(1), 84–94.
FARO Technologies. (2022). “Crash Reconstruction | Forensic Analysis and Pre-incident Planning | FARO.” FARO, <https://www.faro.com/en/Application/Forensic-Analysis-and-Pre-incident-Planning/Crash-Reconstruction>(Jan. 17, 2022).
Feng, Y., Chen, Y., Zhang, J., Tian, C., Ren, R., Han, T., and Proctor, R. W. (2023). “Human-centred design of next generation transportation infrastructure with connected and automated vehicles: A system-of-systems perspective.” Theoretical Issues in Ergonomics Science, 1–29.
Gambi, A., Huynh, T., and Fraser, G. (2019). “Automatically reconstructing car crashes from police reports for testing self-driving cars.” 2019 IEEE/ACM 41st International Conference on Software Engineering: Companion Proceedings (ICSE-Companion).
Liu, K., and El-Gohary, N. (2017). “Ontology-based semi-supervised conditional random fields for automated information extraction from Bridge Inspection Reports.” Automation in Construction, 81, 313–327.
Liu, K., and El-Gohary, N. (2016). “Ontology-based sequence labelling for automated information extraction for supporting bridge data analytics.” Procedia Engineering, 145, 504–510.
OpenAI. (2023). “CHATGPT: Optimizing language models for dialogue.” OpenAI, OpenAI, <https://openai.com/blog/chatgpt/>(Feb. 13, 2023).
Ren, R., Li, H., Han, T., Tian, C., Zhang, C., Zhang, J., Proctor, R., Chen, Y., and Feng, Y. (2023). “Vehicle crash simulations for safety: Introduction of connected and automated vehicles on the roadways.” Accident Analysis & Prevention, 186(June 2023), 1–13.
SEMCOG (Southeast Michigan Council of Governments). (2022). “Southeast Michigan Council of Governments.” SEMCOG, <https://semcog.org/>(Dec. 8, 2022).
spaCy. (2022). “Spacy · industrial-strength natural language processing in python.” Industrial-strength Natural Language Processing in Python, <https://spacy.io/>(Feb. 9, 2023).
Yehia, E., Boshnak, H., AbdelGaber, S., Abdo, A., and Elzanfaly, D. S. (2019). “Ontology-based clinical information extraction from physician’s free-text notes.” Journal of Biomedical Informatics, 98, 103276.
Zhang, J., and El-Gohary, N. (2016). “Semantic NLP-based information extraction from construction regulatory documents for automated compliance checking.” Journal of Computing in Civil Engineering, 30(2).
Zhang, J., and El-Gohary, N. M. (2015). “Automated Information Transformation for automated regulatory compliance checking in construction.” Journal of Computing in Civil Engineering, 29(4).
Zhou, P., and El-Gohary, N. (2017). “Ontology-based automated information extraction from building energy conservation codes.” Automation in Construction, 74(Feb. 2017), 103–117.

Information & Authors

Information

Published In

Go to Computing in Civil Engineering 2023
Computing in Civil Engineering 2023
Pages: 249 - 256

History

Published online: Jan 25, 2024

Permissions

Request permissions for this article.

ASCE Technical Topics:

Authors

Affiliations

Hang Li, S.M.ASCE [email protected]
1School of Construction Management Technology, Purdue Univ., West Lafayette, IN. Email: [email protected]
Jiansong Zhang, Ph.D., A.M.ASCE [email protected]
2School of Construction Management Technology, Purdue Univ., West Lafayette, IN. Email: [email protected]
Yunfeng Chen, Ph.D. [email protected]
3School of Construction Management Technology, Purdue Univ., West Lafayette, IN. Email: [email protected]
Yiheng Feng, Ph.D. [email protected]
4Lyles School of Civil Engineering, Purdue Univ., West Lafayette, IN. Email: [email protected]
Robert W. Proctor, Ph.D. [email protected]
5Dept. of Psychological Sciences, Purdue Univ., West Lafayette, IN. Email: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Paper
$35.00
Add to cart
Buy E-book
$198.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Paper
$35.00
Add to cart
Buy E-book
$198.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share