A Feature-Based Procedure for Detecting Technical Outliers in Water-Quality Data From In Situ Sensors
File version
Version of Record (VoR)
Author(s)
Talagala, Priyanga Dilini
Hyndman, Rob J
Leigh, Catherine
Mengersen, Kerrie
Smith-Miles, Kate
Abstract
Outliers due to technical errors in water-quality data from in situ sensors can reduce data quality and have a direct impact on inference drawn from subsequent data analysis. However, outlier detection through manual monitoring is infeasible given the volume and velocity of data the sensors produce. Here we introduce an automated procedure, named oddwater, that provides early detection of outliers in water-quality data from in situ sensors caused by technical issues. Our oddwater procedure is used to first identify the data features that differentiate outlying instances from typical behaviors. Then, statistical transformations are applied to make the outlying instances stand out in a transformed data space. Unsupervised outlier scoring techniques are applied to the transformed data space, and an approach based on extreme value theory is used to calculate a threshold for each potential outlier. Using two data sets obtained from in situ sensors in rivers flowing into the Great Barrier Reef lagoon, Australia, we show that oddwater successfully identifies outliers involving abrupt changes in turbidity, conductivity, and river level, including sudden spikes, sudden isolated drops, and level shifts, while maintaining very low false detection rates. We have implemented this oddwater procedure in the open source R package oddwater.
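The three stages named in the abstract (transformation, unsupervised scoring, tail-based threshold) can be illustrated with a minimal Python sketch. This is not the oddwater package or its API. Every function name here is hypothetical, the log first difference is assumed as one plausible transformation, the nearest-neighbour distance is one possible unsupervised score, and the exponential-tail rule is a simplified stand-in for the extreme value theory approach the abstract refers to. The turbidity series is synthetic, with one spike injected for illustration.

```python
import math
import statistics


def log_first_difference(series):
    """Transform a positive series so that abrupt changes become large values."""
    return [math.log(b) - math.log(a) for a, b in zip(series, series[1:])]


def knn_scores(values, k=3):
    """Score each value by its mean distance to its k nearest neighbours."""
    scores = []
    for i, v in enumerate(values):
        dists = sorted(abs(v - w) for j, w in enumerate(values) if j != i)
        scores.append(sum(dists[:k]) / k)
    return scores


def tail_threshold(scores, base_quantile=0.9, alpha=0.001):
    """Simplified stand-in for an extreme-value threshold (an assumption).

    Fits an exponential tail to the scores exceeding a high quantile and
    returns the score whose estimated exceedance probability is alpha.
    The scale uses the median of the exceedances so that a few very large
    scores do not inflate the threshold.
    """
    ordered = sorted(scores)
    u = ordered[int(base_quantile * (len(ordered) - 1))]
    exceedances = [s - u for s in ordered if s > u]
    if not exceedances:
        return u
    scale = statistics.median(exceedances) / math.log(2)
    return u + scale * math.log((1 - base_quantile) / alpha)


# Synthetic turbidity-like series (not real sensor data) with one injected spike.
turbidity = [10 + 2 * math.sin(t / 10) + 0.05 * math.cos(7 * t) for t in range(200)]
turbidity[120] = 60.0

changes = log_first_difference(turbidity)
scores = knn_scores(changes)
threshold = tail_threshold(scores)

# Index i in `changes` describes the step from observation i to observation i + 1.
flagged = [i for i, s in enumerate(scores) if s > threshold]
print(flagged)
```

In this sketch a single spike produces two large changes, one on the way up and one on the way down, so both steps adjacent to the spike are expected to exceed the threshold. The median-based scale is a design choice made here for robustness, not something taken from the paper.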
Journal Title
Water Resources Research
Note
This publication has been entered into Griffith Research Online as an Advanced Online Version.
Subject
Physical geography and environmental geoscience
Civil engineering
Environmental engineering
Citation
Talagala, PD; Hyndman, RJ; Leigh, C; Mengersen, K; Smith-Miles, K, A Feature-Based Procedure for Detecting Technical Outliers in Water-Quality Data From In Situ Sensors, Water Resources Research, 2019