Cargando…

Expectation-Maximization Binary Clustering for Behavioural Annotation

The growing capacity to process and store animal tracks has spurred the development of new methods to segment animal trajectories into elementary units of movement. Key challenges for movement trajectory segmentation are to (i) minimize the need of supervision, (ii) reduce computational costs, (iii)...

Descripción completa

Detalles Bibliográficos
Autores principales: Garriga, Joan, Palmer, John R. B., Oltra, Aitana, Bartumeus, Frederic
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4803255/
https://www.ncbi.nlm.nih.gov/pubmed/27002631
http://dx.doi.org/10.1371/journal.pone.0151984
_version_ 1782422855573569536
author Garriga, Joan
Palmer, John R. B.
Oltra, Aitana
Bartumeus, Frederic
author_facet Garriga, Joan
Palmer, John R. B.
Oltra, Aitana
Bartumeus, Frederic
author_sort Garriga, Joan
collection PubMed
description The growing capacity to process and store animal tracks has spurred the development of new methods to segment animal trajectories into elementary units of movement. Key challenges for movement trajectory segmentation are to (i) minimize the need of supervision, (ii) reduce computational costs, (iii) minimize the need of prior assumptions (e.g. simple parametrizations), and (iv) capture biologically meaningful semantics, useful across a broad range of species. We introduce the Expectation-Maximization binary Clustering (EMbC), a general purpose, unsupervised approach to multivariate data clustering. The EMbC is a variant of the Expectation-Maximization Clustering (EMC), a clustering algorithm based on the maximum likelihood estimation of a Gaussian mixture model. This is an iterative algorithm with a closed form step solution and hence a reasonable computational cost. The method looks for a good compromise between statistical soundness and ease and generality of use (by minimizing prior assumptions and favouring the semantic interpretation of the final clustering). Here we focus on the suitability of the EMbC algorithm for behavioural annotation of movement data. We show and discuss the EMbC outputs in both simulated trajectories and empirical movement trajectories including different species and different tracking methodologies. We use synthetic trajectories to assess the performance of EMbC compared to classic EMC and Hidden Markov Models. Empirical trajectories allow us to explore the robustness of the EMbC to data loss and data inaccuracies, and assess the relationship between EMbC output and expert label assignments. Additionally, we suggest a smoothing procedure to account for temporal correlations among labels, and a proper visualization of the output for movement trajectories. Our algorithm is available as an R-package with a set of complementary functions to ease the analysis.
format Online
Article
Text
id pubmed-4803255
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-48032552016-03-25 Expectation-Maximization Binary Clustering for Behavioural Annotation Garriga, Joan Palmer, John R. B. Oltra, Aitana Bartumeus, Frederic PLoS One Research Article The growing capacity to process and store animal tracks has spurred the development of new methods to segment animal trajectories into elementary units of movement. Key challenges for movement trajectory segmentation are to (i) minimize the need of supervision, (ii) reduce computational costs, (iii) minimize the need of prior assumptions (e.g. simple parametrizations), and (iv) capture biologically meaningful semantics, useful across a broad range of species. We introduce the Expectation-Maximization binary Clustering (EMbC), a general purpose, unsupervised approach to multivariate data clustering. The EMbC is a variant of the Expectation-Maximization Clustering (EMC), a clustering algorithm based on the maximum likelihood estimation of a Gaussian mixture model. This is an iterative algorithm with a closed form step solution and hence a reasonable computational cost. The method looks for a good compromise between statistical soundness and ease and generality of use (by minimizing prior assumptions and favouring the semantic interpretation of the final clustering). Here we focus on the suitability of the EMbC algorithm for behavioural annotation of movement data. We show and discuss the EMbC outputs in both simulated trajectories and empirical movement trajectories including different species and different tracking methodologies. We use synthetic trajectories to assess the performance of EMbC compared to classic EMC and Hidden Markov Models. Empirical trajectories allow us to explore the robustness of the EMbC to data loss and data inaccuracies, and assess the relationship between EMbC output and expert label assignments. Additionally, we suggest a smoothing procedure to account for temporal correlations among labels, and a proper visualization of the output for movement trajectories. Our algorithm is available as an R-package with a set of complementary functions to ease the analysis. Public Library of Science 2016-03-22 /pmc/articles/PMC4803255/ /pubmed/27002631 http://dx.doi.org/10.1371/journal.pone.0151984 Text en © 2016 Garriga et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Garriga, Joan
Palmer, John R. B.
Oltra, Aitana
Bartumeus, Frederic
Expectation-Maximization Binary Clustering for Behavioural Annotation
title Expectation-Maximization Binary Clustering for Behavioural Annotation
title_full Expectation-Maximization Binary Clustering for Behavioural Annotation
title_fullStr Expectation-Maximization Binary Clustering for Behavioural Annotation
title_full_unstemmed Expectation-Maximization Binary Clustering for Behavioural Annotation
title_short Expectation-Maximization Binary Clustering for Behavioural Annotation
title_sort expectation-maximization binary clustering for behavioural annotation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4803255/
https://www.ncbi.nlm.nih.gov/pubmed/27002631
http://dx.doi.org/10.1371/journal.pone.0151984
work_keys_str_mv AT garrigajoan expectationmaximizationbinaryclusteringforbehaviouralannotation
AT palmerjohnrb expectationmaximizationbinaryclusteringforbehaviouralannotation
AT oltraaitana expectationmaximizationbinaryclusteringforbehaviouralannotation
AT bartumeusfrederic expectationmaximizationbinaryclusteringforbehaviouralannotation