Cargando…
The impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics
Vocalizations in animals, particularly birds, are critically important behaviors that influence their reproductive fitness. While recordings of bioacoustic data have been captured and stored in collections for decades, the automated extraction of data from these recordings has only recently been fac...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9728902/ https://www.ncbi.nlm.nih.gov/pubmed/36477744 http://dx.doi.org/10.1371/journal.pone.0278522 |
_version_ | 1784845369845219328 |
---|---|
author | Provost, Kaiya L. Yang, Jiaying Carstens, Bryan C. |
author_facet | Provost, Kaiya L. Yang, Jiaying Carstens, Bryan C. |
author_sort | Provost, Kaiya L. |
collection | PubMed |
description | Vocalizations in animals, particularly birds, are critically important behaviors that influence their reproductive fitness. While recordings of bioacoustic data have been captured and stored in collections for decades, the automated extraction of data from these recordings has only recently been facilitated by artificial intelligence methods. These have yet to be evaluated with respect to accuracy of different automation strategies and features. Here, we use a recently published machine learning framework to extract syllables from ten bird species ranging in their phylogenetic relatedness from 1 to 85 million years, to compare how phylogenetic relatedness influences accuracy. We also evaluate the utility of applying trained models to novel species. Our results indicate that model performance is best on conspecifics, with accuracy progressively decreasing as phylogenetic distance increases between taxa. However, we also find that the application of models trained on multiple distantly related species can improve the overall accuracy to levels near that of training and analyzing a model on the same species. When planning big-data bioacoustics studies, care must be taken in sample design to maximize sample size and minimize human labor without sacrificing accuracy. |
format | Online Article Text |
id | pubmed-9728902 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-97289022022-12-08 The impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics Provost, Kaiya L. Yang, Jiaying Carstens, Bryan C. PLoS One Research Article Vocalizations in animals, particularly birds, are critically important behaviors that influence their reproductive fitness. While recordings of bioacoustic data have been captured and stored in collections for decades, the automated extraction of data from these recordings has only recently been facilitated by artificial intelligence methods. These have yet to be evaluated with respect to accuracy of different automation strategies and features. Here, we use a recently published machine learning framework to extract syllables from ten bird species ranging in their phylogenetic relatedness from 1 to 85 million years, to compare how phylogenetic relatedness influences accuracy. We also evaluate the utility of applying trained models to novel species. Our results indicate that model performance is best on conspecifics, with accuracy progressively decreasing as phylogenetic distance increases between taxa. However, we also find that the application of models trained on multiple distantly related species can improve the overall accuracy to levels near that of training and analyzing a model on the same species. When planning big-data bioacoustics studies, care must be taken in sample design to maximize sample size and minimize human labor without sacrificing accuracy. Public Library of Science 2022-12-07 /pmc/articles/PMC9728902/ /pubmed/36477744 http://dx.doi.org/10.1371/journal.pone.0278522 Text en © 2022 Provost et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Provost, Kaiya L. Yang, Jiaying Carstens, Bryan C. The impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics |
title | The impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics |
title_full | The impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics |
title_fullStr | The impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics |
title_full_unstemmed | The impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics |
title_short | The impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics |
title_sort | impacts of fine-tuning, phylogenetic distance, and sample size on big-data bioacoustics |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9728902/ https://www.ncbi.nlm.nih.gov/pubmed/36477744 http://dx.doi.org/10.1371/journal.pone.0278522 |
work_keys_str_mv | AT provostkaiyal theimpactsoffinetuningphylogeneticdistanceandsamplesizeonbigdatabioacoustics AT yangjiaying theimpactsoffinetuningphylogeneticdistanceandsamplesizeonbigdatabioacoustics AT carstensbryanc theimpactsoffinetuningphylogeneticdistanceandsamplesizeonbigdatabioacoustics AT provostkaiyal impactsoffinetuningphylogeneticdistanceandsamplesizeonbigdatabioacoustics AT yangjiaying impactsoffinetuningphylogeneticdistanceandsamplesizeonbigdatabioacoustics AT carstensbryanc impactsoffinetuningphylogeneticdistanceandsamplesizeonbigdatabioacoustics |