Cargando…

Reconstructing Missing and Anomalous Data Collected from High-Frequency In-Situ Sensors in Fresh Waters

In situ sensors that collect high-frequency data are used increasingly to monitor aquatic environments. These sensors are prone to technical errors, resulting in unrecorded observations and/or anomalous values that are subsequently removed and create gaps in time series data. We present a framework...

Descripción completa

Detalles Bibliográficos
Autores principales: Kermorvant, Claire, Liquet, Benoit, Litt, Guy, Jones, Jeremy B., Mengersen, Kerrie, Peterson, Erin E., Hyndman, Rob J., Leigh, Catherine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8657025/
https://www.ncbi.nlm.nih.gov/pubmed/34886529
http://dx.doi.org/10.3390/ijerph182312803
_version_ 1784612416858882048
author Kermorvant, Claire
Liquet, Benoit
Litt, Guy
Jones, Jeremy B.
Mengersen, Kerrie
Peterson, Erin E.
Hyndman, Rob J.
Leigh, Catherine
author_facet Kermorvant, Claire
Liquet, Benoit
Litt, Guy
Jones, Jeremy B.
Mengersen, Kerrie
Peterson, Erin E.
Hyndman, Rob J.
Leigh, Catherine
author_sort Kermorvant, Claire
collection PubMed
description In situ sensors that collect high-frequency data are used increasingly to monitor aquatic environments. These sensors are prone to technical errors, resulting in unrecorded observations and/or anomalous values that are subsequently removed and create gaps in time series data. We present a framework based on generalized additive and auto-regressive models to recover these missing data. To mimic sporadically missing (i) single observations and (ii) periods of contiguous observations, we randomly removed (i) point data and (ii) day- and week-long sequences of data from a two-year time series of nitrate concentration data collected from Arikaree River, USA, where synoptically collected water temperature, turbidity, conductance, elevation, and dissolved oxygen data were available. In 72% of cases with missing point data, predicted values were within the sensor precision interval of the original value, although predictive ability declined when sequences of missing data occurred. Precision also depended on the availability of other water quality covariates. When covariates were available, even a sudden, event-based peak in nitrate concentration was reconstructed well. By providing a promising method for accurate prediction of missing data, the utility and confidence in summary statistics and statistical trends will increase, thereby assisting the effective monitoring and management of fresh waters and other at-risk ecosystems.
format Online
Article
Text
id pubmed-8657025
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86570252021-12-10 Reconstructing Missing and Anomalous Data Collected from High-Frequency In-Situ Sensors in Fresh Waters Kermorvant, Claire Liquet, Benoit Litt, Guy Jones, Jeremy B. Mengersen, Kerrie Peterson, Erin E. Hyndman, Rob J. Leigh, Catherine Int J Environ Res Public Health Article In situ sensors that collect high-frequency data are used increasingly to monitor aquatic environments. These sensors are prone to technical errors, resulting in unrecorded observations and/or anomalous values that are subsequently removed and create gaps in time series data. We present a framework based on generalized additive and auto-regressive models to recover these missing data. To mimic sporadically missing (i) single observations and (ii) periods of contiguous observations, we randomly removed (i) point data and (ii) day- and week-long sequences of data from a two-year time series of nitrate concentration data collected from Arikaree River, USA, where synoptically collected water temperature, turbidity, conductance, elevation, and dissolved oxygen data were available. In 72% of cases with missing point data, predicted values were within the sensor precision interval of the original value, although predictive ability declined when sequences of missing data occurred. Precision also depended on the availability of other water quality covariates. When covariates were available, even a sudden, event-based peak in nitrate concentration was reconstructed well. By providing a promising method for accurate prediction of missing data, the utility and confidence in summary statistics and statistical trends will increase, thereby assisting the effective monitoring and management of fresh waters and other at-risk ecosystems. MDPI 2021-12-04 /pmc/articles/PMC8657025/ /pubmed/34886529 http://dx.doi.org/10.3390/ijerph182312803 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kermorvant, Claire
Liquet, Benoit
Litt, Guy
Jones, Jeremy B.
Mengersen, Kerrie
Peterson, Erin E.
Hyndman, Rob J.
Leigh, Catherine
Reconstructing Missing and Anomalous Data Collected from High-Frequency In-Situ Sensors in Fresh Waters
title Reconstructing Missing and Anomalous Data Collected from High-Frequency In-Situ Sensors in Fresh Waters
title_full Reconstructing Missing and Anomalous Data Collected from High-Frequency In-Situ Sensors in Fresh Waters
title_fullStr Reconstructing Missing and Anomalous Data Collected from High-Frequency In-Situ Sensors in Fresh Waters
title_full_unstemmed Reconstructing Missing and Anomalous Data Collected from High-Frequency In-Situ Sensors in Fresh Waters
title_short Reconstructing Missing and Anomalous Data Collected from High-Frequency In-Situ Sensors in Fresh Waters
title_sort reconstructing missing and anomalous data collected from high-frequency in-situ sensors in fresh waters
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8657025/
https://www.ncbi.nlm.nih.gov/pubmed/34886529
http://dx.doi.org/10.3390/ijerph182312803
work_keys_str_mv AT kermorvantclaire reconstructingmissingandanomalousdatacollectedfromhighfrequencyinsitusensorsinfreshwaters
AT liquetbenoit reconstructingmissingandanomalousdatacollectedfromhighfrequencyinsitusensorsinfreshwaters
AT littguy reconstructingmissingandanomalousdatacollectedfromhighfrequencyinsitusensorsinfreshwaters
AT jonesjeremyb reconstructingmissingandanomalousdatacollectedfromhighfrequencyinsitusensorsinfreshwaters
AT mengersenkerrie reconstructingmissingandanomalousdatacollectedfromhighfrequencyinsitusensorsinfreshwaters
AT petersonerine reconstructingmissingandanomalousdatacollectedfromhighfrequencyinsitusensorsinfreshwaters
AT hyndmanrobj reconstructingmissingandanomalousdatacollectedfromhighfrequencyinsitusensorsinfreshwaters
AT leighcatherine reconstructingmissingandanomalousdatacollectedfromhighfrequencyinsitusensorsinfreshwaters