Innovative Missing Data Replacement Methods Using Time Series Models
Publication: World Environmental and Water Resources Congress 2008: Ahupua'A
Abstract
Missing data commonly exist in operational records, especially in the influent and effluent water quality records in water and wastewater treatment plants. For example, unexpected events cause the failure of measurements of plant data, and holidays or less experienced personnel shifts make the measurements unavailable. Traditional missing data replacement methods, such as average (AVE) method and average of nearest observations (ANO) method, depend on a MCAR (missing completely at random) assumption, which means the probability that an observation is missing shall not be related with its data structure. Unfortunately, MCAR observations are less likely in reality because they are commonly correlated with time (for example, most of missing data happen on weekends and holidays). In this paper, two innovative methods (TES and TESWN) are developed based on commonly used time series models that can accurately capture time series' statistical characteristics: trend, lag, and/or seasonality. The TES method is a combined two directional exponential smoothing method, where forward ES and backward ES are applied independently and the mean values of those two forecasts are used to replace the missing values. The TESWN method is known as the TES method with white noise added. It begins with the TES method, then adds a white noise term to account for random effects observed in the data but not captured by the autocorrelation function. These two innovative methods, together with AVE and ANO methods, are applied to real water and wastewater treatment plants' influent data (flow and concentrations). The results indicate that the TES method is the best overall performance method because it has more similar statistical characteristics (mean, standard deviation, percentiles, etc.) with the original data. The TESWN method can also be recommended if the goal is to capture the overall variability in the distribution instead of obtaining a close match to the exact time series.
Get full access to this chapter
View all available purchase options and get full access to this chapter.
Information & Authors
Information
Published In
Copyright
© 2008 American Society of Civil Engineers.
History
Published online: Apr 26, 2012
ASCE Technical Topics:
- Business management
- Engineering fundamentals
- Environmental engineering
- Inflow
- Influents
- Innovation
- Mathematics
- Model accuracy
- Models (by type)
- Practice and Profession
- River engineering
- Rivers and streams
- Statistics
- Time series analysis
- Wastewater treatment plants
- Water and water resources
- Water quality
- Water treatment
- Water treatment plants
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.