Cargando…

Phylesystem: a git-based data store for community-curated phylogenetic estimates

Motivation: Phylogenetic estimates from published studies can be archived using general platforms like Dryad (Vision, 2010) or TreeBASE (Sanderson et al., 1994). Such services fulfill a crucial role in ensuring transparency and reproducibility in phylogenetic research. However, digital tree data fil...

Descripción completa

Detalles Bibliográficos
Autores principales: McTavish, Emily Jane, Hinchliff, Cody E., Allman, James F., Brown, Joseph W., Cranston, Karen A., Holder, Mark T., Rees, Jonathan A., Smith, Stephen A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4547614/
https://www.ncbi.nlm.nih.gov/pubmed/25940563
http://dx.doi.org/10.1093/bioinformatics/btv276
_version_ 1782387083805982720
author McTavish, Emily Jane
Hinchliff, Cody E.
Allman, James F.
Brown, Joseph W.
Cranston, Karen A.
Holder, Mark T.
Rees, Jonathan A.
Smith, Stephen A.
author_facet McTavish, Emily Jane
Hinchliff, Cody E.
Allman, James F.
Brown, Joseph W.
Cranston, Karen A.
Holder, Mark T.
Rees, Jonathan A.
Smith, Stephen A.
author_sort McTavish, Emily Jane
collection PubMed
description Motivation: Phylogenetic estimates from published studies can be archived using general platforms like Dryad (Vision, 2010) or TreeBASE (Sanderson et al., 1994). Such services fulfill a crucial role in ensuring transparency and reproducibility in phylogenetic research. However, digital tree data files often require some editing (e.g. rerooting) to improve the accuracy and reusability of the phylogenetic statements. Furthermore, establishing the mapping between tip labels used in a tree and taxa in a single common taxonomy dramatically improves the ability of other researchers to reuse phylogenetic estimates. As the process of curating a published phylogenetic estimate is not error-free, retaining a full record of the provenance of edits to a tree is crucial for openness, allowing editors to receive credit for their work and making errors introduced during curation easier to correct. Results: Here, we report the development of software infrastructure to support the open curation of phylogenetic data by the community of biologists. The backend of the system provides an interface for the standard database operations of creating, reading, updating and deleting records by making commits to a git repository. The record of the history of edits to a tree is preserved by git’s version control features. Hosting this data store on GitHub (http://github.com/) provides open access to the data store using tools familiar to many developers. We have deployed a server running the ‘phylesystem-api’, which wraps the interactions with git and GitHub. The Open Tree of Life project has also developed and deployed a JavaScript application that uses the phylesystem-api and other web services to enable input and curation of published phylogenetic statements. Availability and implementation: Source code for the web service layer is available at https://github.com/OpenTreeOfLife/phylesystem-api. The data store can be cloned from: https://github.com/OpenTreeOfLife/phylesystem. A web application that uses the phylesystem web services is deployed at http://tree.opentreeoflife.org/curator. Code for that tool is available from https://github.com/OpenTreeOfLife/opentree. Contact: mtholder@gmail.com
format Online
Article
Text
id pubmed-4547614
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-45476142015-08-25 Phylesystem: a git-based data store for community-curated phylogenetic estimates McTavish, Emily Jane Hinchliff, Cody E. Allman, James F. Brown, Joseph W. Cranston, Karen A. Holder, Mark T. Rees, Jonathan A. Smith, Stephen A. Bioinformatics Original Papers Motivation: Phylogenetic estimates from published studies can be archived using general platforms like Dryad (Vision, 2010) or TreeBASE (Sanderson et al., 1994). Such services fulfill a crucial role in ensuring transparency and reproducibility in phylogenetic research. However, digital tree data files often require some editing (e.g. rerooting) to improve the accuracy and reusability of the phylogenetic statements. Furthermore, establishing the mapping between tip labels used in a tree and taxa in a single common taxonomy dramatically improves the ability of other researchers to reuse phylogenetic estimates. As the process of curating a published phylogenetic estimate is not error-free, retaining a full record of the provenance of edits to a tree is crucial for openness, allowing editors to receive credit for their work and making errors introduced during curation easier to correct. Results: Here, we report the development of software infrastructure to support the open curation of phylogenetic data by the community of biologists. The backend of the system provides an interface for the standard database operations of creating, reading, updating and deleting records by making commits to a git repository. The record of the history of edits to a tree is preserved by git’s version control features. Hosting this data store on GitHub (http://github.com/) provides open access to the data store using tools familiar to many developers. We have deployed a server running the ‘phylesystem-api’, which wraps the interactions with git and GitHub. The Open Tree of Life project has also developed and deployed a JavaScript application that uses the phylesystem-api and other web services to enable input and curation of published phylogenetic statements. Availability and implementation: Source code for the web service layer is available at https://github.com/OpenTreeOfLife/phylesystem-api. The data store can be cloned from: https://github.com/OpenTreeOfLife/phylesystem. A web application that uses the phylesystem web services is deployed at http://tree.opentreeoflife.org/curator. Code for that tool is available from https://github.com/OpenTreeOfLife/opentree. Contact: mtholder@gmail.com Oxford University Press 2015-09-01 2015-05-04 /pmc/articles/PMC4547614/ /pubmed/25940563 http://dx.doi.org/10.1093/bioinformatics/btv276 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
McTavish, Emily Jane
Hinchliff, Cody E.
Allman, James F.
Brown, Joseph W.
Cranston, Karen A.
Holder, Mark T.
Rees, Jonathan A.
Smith, Stephen A.
Phylesystem: a git-based data store for community-curated phylogenetic estimates
title Phylesystem: a git-based data store for community-curated phylogenetic estimates
title_full Phylesystem: a git-based data store for community-curated phylogenetic estimates
title_fullStr Phylesystem: a git-based data store for community-curated phylogenetic estimates
title_full_unstemmed Phylesystem: a git-based data store for community-curated phylogenetic estimates
title_short Phylesystem: a git-based data store for community-curated phylogenetic estimates
title_sort phylesystem: a git-based data store for community-curated phylogenetic estimates
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4547614/
https://www.ncbi.nlm.nih.gov/pubmed/25940563
http://dx.doi.org/10.1093/bioinformatics/btv276
work_keys_str_mv AT mctavishemilyjane phylesystemagitbaseddatastoreforcommunitycuratedphylogeneticestimates
AT hinchliffcodye phylesystemagitbaseddatastoreforcommunitycuratedphylogeneticestimates
AT allmanjamesf phylesystemagitbaseddatastoreforcommunitycuratedphylogeneticestimates
AT brownjosephw phylesystemagitbaseddatastoreforcommunitycuratedphylogeneticestimates
AT cranstonkarena phylesystemagitbaseddatastoreforcommunitycuratedphylogeneticestimates
AT holdermarkt phylesystemagitbaseddatastoreforcommunitycuratedphylogeneticestimates
AT reesjonathana phylesystemagitbaseddatastoreforcommunitycuratedphylogeneticestimates
AT smithstephena phylesystemagitbaseddatastoreforcommunitycuratedphylogeneticestimates