Cargando…

The Dawn of Open Access to Phylogenetic Data

The scientific enterprise depends critically on the preservation of and open access to published data. This basic tenet applies acutely to phylogenies (estimates of evolutionary relationships among species). Increasingly, phylogenies are estimated from increasingly large, genome-scale datasets using...

Descripción completa

Detalles Bibliográficos
Autores principales: Magee, Andrew F., May, Michael R., Moore, Brian R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4208793/
https://www.ncbi.nlm.nih.gov/pubmed/25343725
http://dx.doi.org/10.1371/journal.pone.0110268
_version_ 1782341176833081344
author Magee, Andrew F.
May, Michael R.
Moore, Brian R.
author_facet Magee, Andrew F.
May, Michael R.
Moore, Brian R.
author_sort Magee, Andrew F.
collection PubMed
description The scientific enterprise depends critically on the preservation of and open access to published data. This basic tenet applies acutely to phylogenies (estimates of evolutionary relationships among species). Increasingly, phylogenies are estimated from increasingly large, genome-scale datasets using increasingly complex statistical methods that require increasing levels of expertise and computational investment. Moreover, the resulting phylogenetic data provide an explicit historical perspective that critically informs research in a vast and growing number of scientific disciplines. One such use is the study of changes in rates of lineage diversification (speciation – extinction) through time. As part of a meta-analysis in this area, we sought to collect phylogenetic data (comprising nucleotide sequence alignment and tree files) from 217 studies published in 46 journals over a 13-year period. We document our attempts to procure those data (from online archives and by direct request to corresponding authors), and report results of analyses (using Bayesian logistic regression) to assess the impact of various factors on the success of our efforts. Overall, complete phylogenetic data for [Image: see text] of these studies are effectively lost to science. Our study indicates that phylogenetic data are more likely to be deposited in online archives and/or shared upon request when: (1) the publishing journal has a strong data-sharing policy; (2) the publishing journal has a higher impact factor, and; (3) the data are requested from faculty rather than students. Importantly, our survey spans recent policy initiatives and infrastructural changes; our analyses indicate that the positive impact of these community initiatives has been both dramatic and immediate. Although the results of our study indicate that the situation is dire, our findings also reveal tremendous recent progress in the sharing and preservation of phylogenetic data.
format Online
Article
Text
id pubmed-4208793
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-42087932014-10-27 The Dawn of Open Access to Phylogenetic Data Magee, Andrew F. May, Michael R. Moore, Brian R. PLoS One Research Article The scientific enterprise depends critically on the preservation of and open access to published data. This basic tenet applies acutely to phylogenies (estimates of evolutionary relationships among species). Increasingly, phylogenies are estimated from increasingly large, genome-scale datasets using increasingly complex statistical methods that require increasing levels of expertise and computational investment. Moreover, the resulting phylogenetic data provide an explicit historical perspective that critically informs research in a vast and growing number of scientific disciplines. One such use is the study of changes in rates of lineage diversification (speciation – extinction) through time. As part of a meta-analysis in this area, we sought to collect phylogenetic data (comprising nucleotide sequence alignment and tree files) from 217 studies published in 46 journals over a 13-year period. We document our attempts to procure those data (from online archives and by direct request to corresponding authors), and report results of analyses (using Bayesian logistic regression) to assess the impact of various factors on the success of our efforts. Overall, complete phylogenetic data for [Image: see text] of these studies are effectively lost to science. Our study indicates that phylogenetic data are more likely to be deposited in online archives and/or shared upon request when: (1) the publishing journal has a strong data-sharing policy; (2) the publishing journal has a higher impact factor, and; (3) the data are requested from faculty rather than students. Importantly, our survey spans recent policy initiatives and infrastructural changes; our analyses indicate that the positive impact of these community initiatives has been both dramatic and immediate. Although the results of our study indicate that the situation is dire, our findings also reveal tremendous recent progress in the sharing and preservation of phylogenetic data. Public Library of Science 2014-10-24 /pmc/articles/PMC4208793/ /pubmed/25343725 http://dx.doi.org/10.1371/journal.pone.0110268 Text en © 2014 Magee et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Magee, Andrew F.
May, Michael R.
Moore, Brian R.
The Dawn of Open Access to Phylogenetic Data
title The Dawn of Open Access to Phylogenetic Data
title_full The Dawn of Open Access to Phylogenetic Data
title_fullStr The Dawn of Open Access to Phylogenetic Data
title_full_unstemmed The Dawn of Open Access to Phylogenetic Data
title_short The Dawn of Open Access to Phylogenetic Data
title_sort dawn of open access to phylogenetic data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4208793/
https://www.ncbi.nlm.nih.gov/pubmed/25343725
http://dx.doi.org/10.1371/journal.pone.0110268
work_keys_str_mv AT mageeandrewf thedawnofopenaccesstophylogeneticdata
AT maymichaelr thedawnofopenaccesstophylogeneticdata
AT moorebrianr thedawnofopenaccesstophylogeneticdata
AT mageeandrewf dawnofopenaccesstophylogeneticdata
AT maymichaelr dawnofopenaccesstophylogeneticdata
AT moorebrianr dawnofopenaccesstophylogeneticdata