Cargando…
Reconstructing Missing and Anomalous Data Collected from High-Frequency In-Situ Sensors in Fresh Waters
In situ sensors that collect high-frequency data are used increasingly to monitor aquatic environments. These sensors are prone to technical errors, resulting in unrecorded observations and/or anomalous values that are subsequently removed and create gaps in time series data. We present a framework...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8657025/ https://www.ncbi.nlm.nih.gov/pubmed/34886529 http://dx.doi.org/10.3390/ijerph182312803 |
_version_ | 1784612416858882048 |
---|---|
author | Kermorvant, Claire Liquet, Benoit Litt, Guy Jones, Jeremy B. Mengersen, Kerrie Peterson, Erin E. Hyndman, Rob J. Leigh, Catherine |
author_facet | Kermorvant, Claire Liquet, Benoit Litt, Guy Jones, Jeremy B. Mengersen, Kerrie Peterson, Erin E. Hyndman, Rob J. Leigh, Catherine |
author_sort | Kermorvant, Claire |
collection | PubMed |
description | In situ sensors that collect high-frequency data are used increasingly to monitor aquatic environments. These sensors are prone to technical errors, resulting in unrecorded observations and/or anomalous values that are subsequently removed and create gaps in time series data. We present a framework based on generalized additive and auto-regressive models to recover these missing data. To mimic sporadically missing (i) single observations and (ii) periods of contiguous observations, we randomly removed (i) point data and (ii) day- and week-long sequences of data from a two-year time series of nitrate concentration data collected from Arikaree River, USA, where synoptically collected water temperature, turbidity, conductance, elevation, and dissolved oxygen data were available. In 72% of cases with missing point data, predicted values were within the sensor precision interval of the original value, although predictive ability declined when sequences of missing data occurred. Precision also depended on the availability of other water quality covariates. When covariates were available, even a sudden, event-based peak in nitrate concentration was reconstructed well. By providing a promising method for accurate prediction of missing data, the utility and confidence in summary statistics and statistical trends will increase, thereby assisting the effective monitoring and management of fresh waters and other at-risk ecosystems. |
format | Online Article Text |
id | pubmed-8657025 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-86570252021-12-10 Reconstructing Missing and Anomalous Data Collected from High-Frequency In-Situ Sensors in Fresh Waters Kermorvant, Claire Liquet, Benoit Litt, Guy Jones, Jeremy B. Mengersen, Kerrie Peterson, Erin E. Hyndman, Rob J. Leigh, Catherine Int J Environ Res Public Health Article In situ sensors that collect high-frequency data are used increasingly to monitor aquatic environments. These sensors are prone to technical errors, resulting in unrecorded observations and/or anomalous values that are subsequently removed and create gaps in time series data. We present a framework based on generalized additive and auto-regressive models to recover these missing data. To mimic sporadically missing (i) single observations and (ii) periods of contiguous observations, we randomly removed (i) point data and (ii) day- and week-long sequences of data from a two-year time series of nitrate concentration data collected from Arikaree River, USA, where synoptically collected water temperature, turbidity, conductance, elevation, and dissolved oxygen data were available. In 72% of cases with missing point data, predicted values were within the sensor precision interval of the original value, although predictive ability declined when sequences of missing data occurred. Precision also depended on the availability of other water quality covariates. When covariates were available, even a sudden, event-based peak in nitrate concentration was reconstructed well. By providing a promising method for accurate prediction of missing data, the utility and confidence in summary statistics and statistical trends will increase, thereby assisting the effective monitoring and management of fresh waters and other at-risk ecosystems. MDPI 2021-12-04 /pmc/articles/PMC8657025/ /pubmed/34886529 http://dx.doi.org/10.3390/ijerph182312803 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Kermorvant, Claire Liquet, Benoit Litt, Guy Jones, Jeremy B. Mengersen, Kerrie Peterson, Erin E. Hyndman, Rob J. Leigh, Catherine Reconstructing Missing and Anomalous Data Collected from High-Frequency In-Situ Sensors in Fresh Waters |
title | Reconstructing Missing and Anomalous Data Collected from High-Frequency In-Situ Sensors in Fresh Waters |
title_full | Reconstructing Missing and Anomalous Data Collected from High-Frequency In-Situ Sensors in Fresh Waters |
title_fullStr | Reconstructing Missing and Anomalous Data Collected from High-Frequency In-Situ Sensors in Fresh Waters |
title_full_unstemmed | Reconstructing Missing and Anomalous Data Collected from High-Frequency In-Situ Sensors in Fresh Waters |
title_short | Reconstructing Missing and Anomalous Data Collected from High-Frequency In-Situ Sensors in Fresh Waters |
title_sort | reconstructing missing and anomalous data collected from high-frequency in-situ sensors in fresh waters |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8657025/ https://www.ncbi.nlm.nih.gov/pubmed/34886529 http://dx.doi.org/10.3390/ijerph182312803 |
work_keys_str_mv | AT kermorvantclaire reconstructingmissingandanomalousdatacollectedfromhighfrequencyinsitusensorsinfreshwaters AT liquetbenoit reconstructingmissingandanomalousdatacollectedfromhighfrequencyinsitusensorsinfreshwaters AT littguy reconstructingmissingandanomalousdatacollectedfromhighfrequencyinsitusensorsinfreshwaters AT jonesjeremyb reconstructingmissingandanomalousdatacollectedfromhighfrequencyinsitusensorsinfreshwaters AT mengersenkerrie reconstructingmissingandanomalousdatacollectedfromhighfrequencyinsitusensorsinfreshwaters AT petersonerine reconstructingmissingandanomalousdatacollectedfromhighfrequencyinsitusensorsinfreshwaters AT hyndmanrobj reconstructingmissingandanomalousdatacollectedfromhighfrequencyinsitusensorsinfreshwaters AT leighcatherine reconstructingmissingandanomalousdatacollectedfromhighfrequencyinsitusensorsinfreshwaters |