Cargando…

Analyzing time‐ordered event data with missed observations

A common problem with observational datasets is that not all events of interest may be detected. For example, observing animals in the wild can difficult when animals move, hide, or cannot be closely approached. We consider time series of events recorded in conditions where events are occasionally m...

Descripción completa

Detalles Bibliográficos
Autores principales: Dokter, Adriaan M., van Loon, E. Emiel, Fokkema, Wimke, Lameris, Thomas K., Nolet, Bart A., van der Jeugd, Henk P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5606873/
https://www.ncbi.nlm.nih.gov/pubmed/28944022
http://dx.doi.org/10.1002/ece3.3281
_version_ 1783265198591705088
author Dokter, Adriaan M.
van Loon, E. Emiel
Fokkema, Wimke
Lameris, Thomas K.
Nolet, Bart A.
van der Jeugd, Henk P.
author_facet Dokter, Adriaan M.
van Loon, E. Emiel
Fokkema, Wimke
Lameris, Thomas K.
Nolet, Bart A.
van der Jeugd, Henk P.
author_sort Dokter, Adriaan M.
collection PubMed
description A common problem with observational datasets is that not all events of interest may be detected. For example, observing animals in the wild can difficult when animals move, hide, or cannot be closely approached. We consider time series of events recorded in conditions where events are occasionally missed by observers or observational devices. These time series are not restricted to behavioral protocols, but can be any cyclic or recurring process where discrete outcomes are observed. Undetected events cause biased inferences on the process of interest, and statistical analyses are needed that can identify and correct the compromised detection processes. Missed observations in time series lead to observed time intervals between events at multiples of the true inter‐event time, which conveys information on their detection probability. We derive the theoretical probability density function for observed intervals between events that includes a probability of missed detection. Methodology and software tools are provided for analysis of event data with potential observation bias and its removal. The methodology was applied to simulation data and a case study of defecation rate estimation in geese, which is commonly used to estimate their digestive throughput and energetic uptake, or to calculate goose usage of a feeding site from dropping density. Simulations indicate that at a moderate chance to miss arrival events (p = 0.3), uncorrected arrival intervals were biased upward by up to a factor 3, while parameter values corrected for missed observations were within 1% of their true simulated value. A field case study shows that not accounting for missed observations leads to substantial underestimates of the true defecation rate in geese, and spurious rate differences between sites, which are introduced by differences in observational conditions. These results show that the derived methodology can be used to effectively remove observational biases in time‐ordered event data.
format Online
Article
Text
id pubmed-5606873
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-56068732017-09-24 Analyzing time‐ordered event data with missed observations Dokter, Adriaan M. van Loon, E. Emiel Fokkema, Wimke Lameris, Thomas K. Nolet, Bart A. van der Jeugd, Henk P. Ecol Evol Original Research A common problem with observational datasets is that not all events of interest may be detected. For example, observing animals in the wild can difficult when animals move, hide, or cannot be closely approached. We consider time series of events recorded in conditions where events are occasionally missed by observers or observational devices. These time series are not restricted to behavioral protocols, but can be any cyclic or recurring process where discrete outcomes are observed. Undetected events cause biased inferences on the process of interest, and statistical analyses are needed that can identify and correct the compromised detection processes. Missed observations in time series lead to observed time intervals between events at multiples of the true inter‐event time, which conveys information on their detection probability. We derive the theoretical probability density function for observed intervals between events that includes a probability of missed detection. Methodology and software tools are provided for analysis of event data with potential observation bias and its removal. The methodology was applied to simulation data and a case study of defecation rate estimation in geese, which is commonly used to estimate their digestive throughput and energetic uptake, or to calculate goose usage of a feeding site from dropping density. Simulations indicate that at a moderate chance to miss arrival events (p = 0.3), uncorrected arrival intervals were biased upward by up to a factor 3, while parameter values corrected for missed observations were within 1% of their true simulated value. A field case study shows that not accounting for missed observations leads to substantial underestimates of the true defecation rate in geese, and spurious rate differences between sites, which are introduced by differences in observational conditions. These results show that the derived methodology can be used to effectively remove observational biases in time‐ordered event data. John Wiley and Sons Inc. 2017-08-09 /pmc/articles/PMC5606873/ /pubmed/28944022 http://dx.doi.org/10.1002/ece3.3281 Text en © 2017 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. This is an open access article under the terms of the Creative Commons Attribution (http://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research
Dokter, Adriaan M.
van Loon, E. Emiel
Fokkema, Wimke
Lameris, Thomas K.
Nolet, Bart A.
van der Jeugd, Henk P.
Analyzing time‐ordered event data with missed observations
title Analyzing time‐ordered event data with missed observations
title_full Analyzing time‐ordered event data with missed observations
title_fullStr Analyzing time‐ordered event data with missed observations
title_full_unstemmed Analyzing time‐ordered event data with missed observations
title_short Analyzing time‐ordered event data with missed observations
title_sort analyzing time‐ordered event data with missed observations
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5606873/
https://www.ncbi.nlm.nih.gov/pubmed/28944022
http://dx.doi.org/10.1002/ece3.3281
work_keys_str_mv AT dokteradriaanm analyzingtimeorderedeventdatawithmissedobservations
AT vanlooneemiel analyzingtimeorderedeventdatawithmissedobservations
AT fokkemawimke analyzingtimeorderedeventdatawithmissedobservations
AT lameristhomask analyzingtimeorderedeventdatawithmissedobservations
AT noletbarta analyzingtimeorderedeventdatawithmissedobservations
AT vanderjeugdhenkp analyzingtimeorderedeventdatawithmissedobservations