Automated Procedure to Assess Civil Infrastructure Data Quality: Method and Validation

Buchheit, Rebecca Bari; Garrett, James H.; McNeil, Sue; Chen, Ping

doi:10.1061/(ASCE)1076-0342(2005)11:3(180)

TECHNICAL PAPERS

Sep 1, 2005

Automated Procedure to Assess Civil Infrastructure Data Quality: Method and Validation

Authors: Rebecca Bari Buchheit, James H. Garrett Jr., M.ASCE [email protected], Sue McNeil, M.ASCE [email protected], and Ping Chen [email protected]Author Affiliations

Publication: Journal of Infrastructure Systems

Volume 11, Issue 3

https://doi.org/10.1061/(ASCE)1076-0342(2005)11:3(180)

Get Access

Abstract

Monitoring data are collected to measure the condition, environment, usage, and performance of civil infrastructure. High quality monitoring data are necessary for decision-support systems, design analysis, and research. However, little work has been done in the area of generic, automated data quality assessment and cleansing procedures. We have developed an automated, two-level data quality assessment procedure to address this deficiency. In the first level of our procedure, several different data quality assessment methods are used in a voting scheme to identify concentrations of anomalies in aggregate data. In the second level, differences between anomalies and normal data at the individual data level are identified; combined with domain knowledge, these differences can be used to identify different types of errors, such as missing data and calibration errors. In our case studies, we have been able to effectively cleanse the data using the results from our data quality assessment procedure. We have also developed a test bench to explore the sensitivity of the data quality assessment algorithms used in our approach. The test bench introduces a known error into a clean, artificial data set and then evaluates how well each assessment method identifies the error. The test bench results show that our approach is able to effectively identify anomalies, even those with small magnitudes of error.

Get full access to this article

View all available purchase options and get full access to this article.

Get Access

Acknowledgments

This material is based upon work supported by the National Science Foundation under Grant No. NSFCMS-9987871 and partially supported by Illinois Department of Transportation through the Metropolitan Transportation Support Initiative (METSI) at University of Illinois, Chicago. The writers would also like to thank Margaret H. Chalkline and the Minnesota Department of Transportation for giving us the opportunity to study their weigh-in-motion data.

References

Agrawal, R., Imielinski, T., and Swami, A. (1993). “Mining association rules between sets of items in large databases.” Proc., ACM SIGMOD Int. Conf., Association of Computing Machinery, Washington, D.C., 207–216.

Abstract

Get full access to this article

Acknowledgments

References

Information

Published In

Copyright

History

Permissions

Authors

Affiliations

Metrics

Citations

Download citation

Cited by

Get Access

Access content

Purchase

ASCE Library Card (5 downloads)

ASCE Library Card (5 downloads)

ASCE Library Card (20 downloads)

ASCE Library Card (20 downloads)

Buy Single Article

Buy Single Article

Get Access

Access content

Purchase

ASCE Library Card (5 downloads)

ASCE Library Card (5 downloads)

ASCE Library Card (20 downloads)

ASCE Library Card (20 downloads)

Buy Single Article

Buy Single Article

Figures

Other

Share

Copy the content Link

Share with email

Share

Request Username

Create a new account

Change Password

Password Changed Successfully

Verify Phone

Congrats!