Cargando…

The impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics

Vocalizations in animals, particularly birds, are critically important behaviors that influence their reproductive fitness. While recordings of bioacoustic data have been captured and stored in collections for decades, the automated extraction of data from these recordings has only recently been fac...

Descripción completa

Detalles Bibliográficos
Autores principales: Provost, Kaiya L., Yang, Jiaying, Carstens, Bryan C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9728902/
https://www.ncbi.nlm.nih.gov/pubmed/36477744
http://dx.doi.org/10.1371/journal.pone.0278522
_version_ 1784845369845219328
author Provost, Kaiya L.
Yang, Jiaying
Carstens, Bryan C.
author_facet Provost, Kaiya L.
Yang, Jiaying
Carstens, Bryan C.
author_sort Provost, Kaiya L.
collection PubMed
description Vocalizations in animals, particularly birds, are critically important behaviors that influence their reproductive fitness. While recordings of bioacoustic data have been captured and stored in collections for decades, the automated extraction of data from these recordings has only recently been facilitated by artificial intelligence methods. These have yet to be evaluated with respect to accuracy of different automation strategies and features. Here, we use a recently published machine learning framework to extract syllables from ten bird species ranging in their phylogenetic relatedness from 1 to 85 million years, to compare how phylogenetic relatedness influences accuracy. We also evaluate the utility of applying trained models to novel species. Our results indicate that model performance is best on conspecifics, with accuracy progressively decreasing as phylogenetic distance increases between taxa. However, we also find that the application of models trained on multiple distantly related species can improve the overall accuracy to levels near that of training and analyzing a model on the same species. When planning big-data bioacoustics studies, care must be taken in sample design to maximize sample size and minimize human labor without sacrificing accuracy.
format Online
Article
Text
id pubmed-9728902
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-97289022022-12-08 The impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics Provost, Kaiya L. Yang, Jiaying Carstens, Bryan C. PLoS One Research Article Vocalizations in animals, particularly birds, are critically important behaviors that influence their reproductive fitness. While recordings of bioacoustic data have been captured and stored in collections for decades, the automated extraction of data from these recordings has only recently been facilitated by artificial intelligence methods. These have yet to be evaluated with respect to accuracy of different automation strategies and features. Here, we use a recently published machine learning framework to extract syllables from ten bird species ranging in their phylogenetic relatedness from 1 to 85 million years, to compare how phylogenetic relatedness influences accuracy. We also evaluate the utility of applying trained models to novel species. Our results indicate that model performance is best on conspecifics, with accuracy progressively decreasing as phylogenetic distance increases between taxa. However, we also find that the application of models trained on multiple distantly related species can improve the overall accuracy to levels near that of training and analyzing a model on the same species. When planning big-data bioacoustics studies, care must be taken in sample design to maximize sample size and minimize human labor without sacrificing accuracy. Public Library of Science 2022-12-07 /pmc/articles/PMC9728902/ /pubmed/36477744 http://dx.doi.org/10.1371/journal.pone.0278522 Text en © 2022 Provost et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Provost, Kaiya L.
Yang, Jiaying
Carstens, Bryan C.
The impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics
title The impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics
title_full The impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics
title_fullStr The impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics
title_full_unstemmed The impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics
title_short The impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics
title_sort impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9728902/
https://www.ncbi.nlm.nih.gov/pubmed/36477744
http://dx.doi.org/10.1371/journal.pone.0278522
work_keys_str_mv AT provostkaiyal theimpactsoffinetuningphylogeneticdistanceandsamplesizeonbigdatabioacoustics
AT yangjiaying theimpactsoffinetuningphylogeneticdistanceandsamplesizeonbigdatabioacoustics
AT carstensbryanc theimpactsoffinetuningphylogeneticdistanceandsamplesizeonbigdatabioacoustics
AT provostkaiyal impactsoffinetuningphylogeneticdistanceandsamplesizeonbigdatabioacoustics
AT yangjiaying impactsoffinetuningphylogeneticdistanceandsamplesizeonbigdatabioacoustics
AT carstensbryanc impactsoffinetuningphylogeneticdistanceandsamplesizeonbigdatabioacoustics