Cargando…

Introducing W.A.T.E.R.S.: a Workflow for the Alignment, Taxonomy, and Ecology of Ribosomal Sequences

BACKGROUND: For more than two decades microbiologists have used a highly conserved microbial gene as a phylogenetic marker for bacteria and archaea. The small-subunit ribosomal RNA gene, also known as 16 S rRNA, is encoded by ribosomal DNA, 16 S rDNA, and has provided a powerful comparative tool to...

Descripción completa

Detalles Bibliográficos
Autores principales: Hartman, Amber L, Riddle, Sean, McPhillips, Timothy, Ludäscher, Bertram, Eisen, Jonathan A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2898799/
https://www.ncbi.nlm.nih.gov/pubmed/20540779
http://dx.doi.org/10.1186/1471-2105-11-317
_version_ 1782183522585280512
author Hartman, Amber L
Riddle, Sean
McPhillips, Timothy
Ludäscher, Bertram
Eisen, Jonathan A
author_facet Hartman, Amber L
Riddle, Sean
McPhillips, Timothy
Ludäscher, Bertram
Eisen, Jonathan A
author_sort Hartman, Amber L
collection PubMed
description BACKGROUND: For more than two decades microbiologists have used a highly conserved microbial gene as a phylogenetic marker for bacteria and archaea. The small-subunit ribosomal RNA gene, also known as 16 S rRNA, is encoded by ribosomal DNA, 16 S rDNA, and has provided a powerful comparative tool to microbial ecologists. Over time, the microbial ecology field has matured from small-scale studies in a select number of environments to massive collections of sequence data that are paired with dozens of corresponding collection variables. As the complexity of data and tool sets have grown, the need for flexible automation and maintenance of the core processes of 16 S rDNA sequence analysis has increased correspondingly. RESULTS: We present WATERS, an integrated approach for 16 S rDNA analysis that bundles a suite of publicly available 16 S rDNA analysis software tools into a single software package. The "toolkit" includes sequence alignment, chimera removal, OTU determination, taxonomy assignment, phylogentic tree construction as well as a host of ecological analysis and visualization tools. WATERS employs a flexible, collection-oriented 'workflow' approach using the open-source Kepler system as a platform. CONCLUSIONS: By packaging available software tools into a single automated workflow, WATERS simplifies 16 S rDNA analyses, especially for those without specialized bioinformatics, programming expertise. In addition, WATERS, like some of the newer comprehensive rRNA analysis tools, allows researchers to minimize the time dedicated to carrying out tedious informatics steps and to focus their attention instead on the biological interpretation of the results. One advantage of WATERS over other comprehensive tools is that the use of the Kepler workflow system facilitates result interpretation and reproducibility via a data provenance sub-system. Furthermore, new "actors" can be added to the workflow as desired and we see WATERS as an initial seed for a sizeable and growing repository of interoperable, easy-to-combine tools for asking increasingly complex microbial ecology questions.
format Text
id pubmed-2898799
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28987992010-07-08 Introducing W.A.T.E.R.S.: a Workflow for the Alignment, Taxonomy, and Ecology of Ribosomal Sequences Hartman, Amber L Riddle, Sean McPhillips, Timothy Ludäscher, Bertram Eisen, Jonathan A BMC Bioinformatics Software BACKGROUND: For more than two decades microbiologists have used a highly conserved microbial gene as a phylogenetic marker for bacteria and archaea. The small-subunit ribosomal RNA gene, also known as 16 S rRNA, is encoded by ribosomal DNA, 16 S rDNA, and has provided a powerful comparative tool to microbial ecologists. Over time, the microbial ecology field has matured from small-scale studies in a select number of environments to massive collections of sequence data that are paired with dozens of corresponding collection variables. As the complexity of data and tool sets have grown, the need for flexible automation and maintenance of the core processes of 16 S rDNA sequence analysis has increased correspondingly. RESULTS: We present WATERS, an integrated approach for 16 S rDNA analysis that bundles a suite of publicly available 16 S rDNA analysis software tools into a single software package. The "toolkit" includes sequence alignment, chimera removal, OTU determination, taxonomy assignment, phylogentic tree construction as well as a host of ecological analysis and visualization tools. WATERS employs a flexible, collection-oriented 'workflow' approach using the open-source Kepler system as a platform. CONCLUSIONS: By packaging available software tools into a single automated workflow, WATERS simplifies 16 S rDNA analyses, especially for those without specialized bioinformatics, programming expertise. In addition, WATERS, like some of the newer comprehensive rRNA analysis tools, allows researchers to minimize the time dedicated to carrying out tedious informatics steps and to focus their attention instead on the biological interpretation of the results. One advantage of WATERS over other comprehensive tools is that the use of the Kepler workflow system facilitates result interpretation and reproducibility via a data provenance sub-system. Furthermore, new "actors" can be added to the workflow as desired and we see WATERS as an initial seed for a sizeable and growing repository of interoperable, easy-to-combine tools for asking increasingly complex microbial ecology questions. BioMed Central 2010-06-12 /pmc/articles/PMC2898799/ /pubmed/20540779 http://dx.doi.org/10.1186/1471-2105-11-317 Text en Copyright ©2010 Hartman et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Hartman, Amber L
Riddle, Sean
McPhillips, Timothy
Ludäscher, Bertram
Eisen, Jonathan A
Introducing W.A.T.E.R.S.: a Workflow for the Alignment, Taxonomy, and Ecology of Ribosomal Sequences
title Introducing W.A.T.E.R.S.: a Workflow for the Alignment, Taxonomy, and Ecology of Ribosomal Sequences
title_full Introducing W.A.T.E.R.S.: a Workflow for the Alignment, Taxonomy, and Ecology of Ribosomal Sequences
title_fullStr Introducing W.A.T.E.R.S.: a Workflow for the Alignment, Taxonomy, and Ecology of Ribosomal Sequences
title_full_unstemmed Introducing W.A.T.E.R.S.: a Workflow for the Alignment, Taxonomy, and Ecology of Ribosomal Sequences
title_short Introducing W.A.T.E.R.S.: a Workflow for the Alignment, Taxonomy, and Ecology of Ribosomal Sequences
title_sort introducing w.a.t.e.r.s.: a workflow for the alignment, taxonomy, and ecology of ribosomal sequences
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2898799/
https://www.ncbi.nlm.nih.gov/pubmed/20540779
http://dx.doi.org/10.1186/1471-2105-11-317
work_keys_str_mv AT hartmanamberl introducingwatersaworkflowforthealignmenttaxonomyandecologyofribosomalsequences
AT riddlesean introducingwatersaworkflowforthealignmenttaxonomyandecologyofribosomalsequences
AT mcphillipstimothy introducingwatersaworkflowforthealignmenttaxonomyandecologyofribosomalsequences
AT ludascherbertram introducingwatersaworkflowforthealignmenttaxonomyandecologyofribosomalsequences
AT eisenjonathana introducingwatersaworkflowforthealignmenttaxonomyandecologyofribosomalsequences