Cargando…

Sharing and re-use of phylogenetic trees (and associated data) to facilitate synthesis

BACKGROUND: Recently, various evolution-related journals adopted policies to encourage or require archiving of phylogenetic trees and associated data. Such attention to practices that promote sharing of data reflects rapidly improving information technology, and rapidly expanding potential to use th...

Descripción completa

Detalles Bibliográficos
Autores principales: Stoltzfus, Arlin, O'Meara, Brian, Whitacre, Jamie, Mounce, Ross, Gillespie, Emily L, Kumar, Sudhir, Rosauer, Dan F, Vos, Rutger A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3583491/
https://www.ncbi.nlm.nih.gov/pubmed/23088596
http://dx.doi.org/10.1186/1756-0500-5-574
_version_ 1782475434085056512
author Stoltzfus, Arlin
O'Meara, Brian
Whitacre, Jamie
Mounce, Ross
Gillespie, Emily L
Kumar, Sudhir
Rosauer, Dan F
Vos, Rutger A
author_facet Stoltzfus, Arlin
O'Meara, Brian
Whitacre, Jamie
Mounce, Ross
Gillespie, Emily L
Kumar, Sudhir
Rosauer, Dan F
Vos, Rutger A
author_sort Stoltzfus, Arlin
collection PubMed
description BACKGROUND: Recently, various evolution-related journals adopted policies to encourage or require archiving of phylogenetic trees and associated data. Such attention to practices that promote sharing of data reflects rapidly improving information technology, and rapidly expanding potential to use this technology to aggregate and link data from previously published research. Nevertheless, little is known about current practices, or best practices, for publishing trees and associated data so as to promote re-use. FINDINGS: Here we summarize results of an ongoing analysis of current practices for archiving phylogenetic trees and associated data, current practices of re-use, and current barriers to re-use. We find that the technical infrastructure is available to support rudimentary archiving, but the frequency of archiving is low. Currently, most phylogenetic knowledge is not easily re-used due to a lack of archiving, lack of awareness of best practices, and lack of community-wide standards for formatting data, naming entities, and annotating data. Most attempts at data re-use seem to end in disappointment. Nevertheless, we find many positive examples of data re-use, particularly those that involve customized species trees generated by grafting to, and pruning from, a much larger tree. CONCLUSIONS: The technologies and practices that facilitate data re-use can catalyze synthetic and integrative research. However, success will require engagement from various stakeholders including individual scientists who produce or consume shareable data, publishers, policy-makers, technology developers and resource-providers. The critical challenges for facilitating re-use of phylogenetic trees and associated data, we suggest, include: a broader commitment to public archiving; more extensive use of globally meaningful identifiers; development of user-friendly technology for annotating, submitting, searching, and retrieving data and their metadata; and development of a minimum reporting standard (MIAPA) indicating which kinds of data and metadata are most important for a re-useable phylogenetic record.
format Online
Article
Text
id pubmed-3583491
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35834912013-02-28 Sharing and re-use of phylogenetic trees (and associated data) to facilitate synthesis Stoltzfus, Arlin O'Meara, Brian Whitacre, Jamie Mounce, Ross Gillespie, Emily L Kumar, Sudhir Rosauer, Dan F Vos, Rutger A BMC Res Notes Data Note BACKGROUND: Recently, various evolution-related journals adopted policies to encourage or require archiving of phylogenetic trees and associated data. Such attention to practices that promote sharing of data reflects rapidly improving information technology, and rapidly expanding potential to use this technology to aggregate and link data from previously published research. Nevertheless, little is known about current practices, or best practices, for publishing trees and associated data so as to promote re-use. FINDINGS: Here we summarize results of an ongoing analysis of current practices for archiving phylogenetic trees and associated data, current practices of re-use, and current barriers to re-use. We find that the technical infrastructure is available to support rudimentary archiving, but the frequency of archiving is low. Currently, most phylogenetic knowledge is not easily re-used due to a lack of archiving, lack of awareness of best practices, and lack of community-wide standards for formatting data, naming entities, and annotating data. Most attempts at data re-use seem to end in disappointment. Nevertheless, we find many positive examples of data re-use, particularly those that involve customized species trees generated by grafting to, and pruning from, a much larger tree. CONCLUSIONS: The technologies and practices that facilitate data re-use can catalyze synthetic and integrative research. However, success will require engagement from various stakeholders including individual scientists who produce or consume shareable data, publishers, policy-makers, technology developers and resource-providers. The critical challenges for facilitating re-use of phylogenetic trees and associated data, we suggest, include: a broader commitment to public archiving; more extensive use of globally meaningful identifiers; development of user-friendly technology for annotating, submitting, searching, and retrieving data and their metadata; and development of a minimum reporting standard (MIAPA) indicating which kinds of data and metadata are most important for a re-useable phylogenetic record. BioMed Central 2012-10-22 /pmc/articles/PMC3583491/ /pubmed/23088596 http://dx.doi.org/10.1186/1756-0500-5-574 Text en Copyright ©2012 Stoltzfus et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Data Note
Stoltzfus, Arlin
O'Meara, Brian
Whitacre, Jamie
Mounce, Ross
Gillespie, Emily L
Kumar, Sudhir
Rosauer, Dan F
Vos, Rutger A
Sharing and re-use of phylogenetic trees (and associated data) to facilitate synthesis
title Sharing and re-use of phylogenetic trees (and associated data) to facilitate synthesis
title_full Sharing and re-use of phylogenetic trees (and associated data) to facilitate synthesis
title_fullStr Sharing and re-use of phylogenetic trees (and associated data) to facilitate synthesis
title_full_unstemmed Sharing and re-use of phylogenetic trees (and associated data) to facilitate synthesis
title_short Sharing and re-use of phylogenetic trees (and associated data) to facilitate synthesis
title_sort sharing and re-use of phylogenetic trees (and associated data) to facilitate synthesis
topic Data Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3583491/
https://www.ncbi.nlm.nih.gov/pubmed/23088596
http://dx.doi.org/10.1186/1756-0500-5-574
work_keys_str_mv AT stoltzfusarlin sharingandreuseofphylogenetictreesandassociateddatatofacilitatesynthesis
AT omearabrian sharingandreuseofphylogenetictreesandassociateddatatofacilitatesynthesis
AT whitacrejamie sharingandreuseofphylogenetictreesandassociateddatatofacilitatesynthesis
AT mounceross sharingandreuseofphylogenetictreesandassociateddatatofacilitatesynthesis
AT gillespieemilyl sharingandreuseofphylogenetictreesandassociateddatatofacilitatesynthesis
AT kumarsudhir sharingandreuseofphylogenetictreesandassociateddatatofacilitatesynthesis
AT rosauerdanf sharingandreuseofphylogenetictreesandassociateddatatofacilitatesynthesis
AT vosrutgera sharingandreuseofphylogenetictreesandassociateddatatofacilitatesynthesis