Cargando…

A hierarchical machine learning framework for the analysis of large scale animal movement data

BACKGROUND: In recent years the field of movement ecology has been revolutionized by our ability to collect high-accuracy, fine scale telemetry data from individual animals and groups. This growth in our data collection capacity has led to the development of statistical techniques that integrate tel...

Descripción completa

Detalles Bibliográficos
Autores principales: Torney, Colin J., Morales, Juan M., Husmeier, Dirk
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7893961/
https://www.ncbi.nlm.nih.gov/pubmed/33602302
http://dx.doi.org/10.1186/s40462-021-00242-0
_version_ 1783653152953729024
author Torney, Colin J.
Morales, Juan M.
Husmeier, Dirk
author_facet Torney, Colin J.
Morales, Juan M.
Husmeier, Dirk
author_sort Torney, Colin J.
collection PubMed
description BACKGROUND: In recent years the field of movement ecology has been revolutionized by our ability to collect high-accuracy, fine scale telemetry data from individual animals and groups. This growth in our data collection capacity has led to the development of statistical techniques that integrate telemetry data with random walk models to infer key parameters of the movement dynamics. While much progress has been made in the use of these models, several challenges remain. Notably robust and scalable methods are required for quantifying parameter uncertainty, coping with intermittent location fixes, and analysing the very large volumes of data being generated. METHODS: In this work we implement a novel approach to movement modelling through the use of multilevel Gaussian processes. The hierarchical structure of the method enables the inference of continuous latent behavioural states underlying movement processes. For efficient inference on large data sets, we approximate the full likelihood using trajectory segmentation and sample from posterior distributions using gradient-based Markov chain Monte Carlo methods. RESULTS: While formally equivalent to many continuous-time movement models, our Gaussian process approach provides flexible, powerful models that can detect multiscale patterns and trends in movement trajectory data. We illustrate a further advantage to our approach in that inference can be performed using highly efficient, GPU-accelerated machine learning libraries. CONCLUSIONS: Multilevel Gaussian process models offer efficient inference for large-volume movement data sets, along with the fitting of complex flexible models. Applications of this approach include inferring the mean location of a migration route and quantifying significant changes, detecting diurnal activity patterns, or identifying the onset of directed persistent movements. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s40462-021-00242-0).
format Online
Article
Text
id pubmed-7893961
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-78939612021-02-22 A hierarchical machine learning framework for the analysis of large scale animal movement data Torney, Colin J. Morales, Juan M. Husmeier, Dirk Mov Ecol Methodology Article BACKGROUND: In recent years the field of movement ecology has been revolutionized by our ability to collect high-accuracy, fine scale telemetry data from individual animals and groups. This growth in our data collection capacity has led to the development of statistical techniques that integrate telemetry data with random walk models to infer key parameters of the movement dynamics. While much progress has been made in the use of these models, several challenges remain. Notably robust and scalable methods are required for quantifying parameter uncertainty, coping with intermittent location fixes, and analysing the very large volumes of data being generated. METHODS: In this work we implement a novel approach to movement modelling through the use of multilevel Gaussian processes. The hierarchical structure of the method enables the inference of continuous latent behavioural states underlying movement processes. For efficient inference on large data sets, we approximate the full likelihood using trajectory segmentation and sample from posterior distributions using gradient-based Markov chain Monte Carlo methods. RESULTS: While formally equivalent to many continuous-time movement models, our Gaussian process approach provides flexible, powerful models that can detect multiscale patterns and trends in movement trajectory data. We illustrate a further advantage to our approach in that inference can be performed using highly efficient, GPU-accelerated machine learning libraries. CONCLUSIONS: Multilevel Gaussian process models offer efficient inference for large-volume movement data sets, along with the fitting of complex flexible models. Applications of this approach include inferring the mean location of a migration route and quantifying significant changes, detecting diurnal activity patterns, or identifying the onset of directed persistent movements. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s40462-021-00242-0). BioMed Central 2021-02-18 /pmc/articles/PMC7893961/ /pubmed/33602302 http://dx.doi.org/10.1186/s40462-021-00242-0 Text en © The Author(s) 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Torney, Colin J.
Morales, Juan M.
Husmeier, Dirk
A hierarchical machine learning framework for the analysis of large scale animal movement data
title A hierarchical machine learning framework for the analysis of large scale animal movement data
title_full A hierarchical machine learning framework for the analysis of large scale animal movement data
title_fullStr A hierarchical machine learning framework for the analysis of large scale animal movement data
title_full_unstemmed A hierarchical machine learning framework for the analysis of large scale animal movement data
title_short A hierarchical machine learning framework for the analysis of large scale animal movement data
title_sort hierarchical machine learning framework for the analysis of large scale animal movement data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7893961/
https://www.ncbi.nlm.nih.gov/pubmed/33602302
http://dx.doi.org/10.1186/s40462-021-00242-0
work_keys_str_mv AT torneycolinj ahierarchicalmachinelearningframeworkfortheanalysisoflargescaleanimalmovementdata
AT moralesjuanm ahierarchicalmachinelearningframeworkfortheanalysisoflargescaleanimalmovementdata
AT husmeierdirk ahierarchicalmachinelearningframeworkfortheanalysisoflargescaleanimalmovementdata
AT torneycolinj hierarchicalmachinelearningframeworkfortheanalysisoflargescaleanimalmovementdata
AT moralesjuanm hierarchicalmachinelearningframeworkfortheanalysisoflargescaleanimalmovementdata
AT husmeierdirk hierarchicalmachinelearningframeworkfortheanalysisoflargescaleanimalmovementdata