Cargando…
RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits
BACKGROUND: Until today, analysis of 16S ribosomal RNA (rRNA) sequences has been the de-facto gold standard for the assessment of phylogenetic relationships among prokaryotes. However, the branching order of the individual phlya is not well-resolved in 16S rRNA-based trees. In search of an improveme...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1421441/ https://www.ncbi.nlm.nih.gov/pubmed/16476165 http://dx.doi.org/10.1186/1471-2105-7-66 |
_version_ | 1782127179330486272 |
---|---|
author | Teeling, Hanno Gloeckner, Frank Oliver |
author_facet | Teeling, Hanno Gloeckner, Frank Oliver |
author_sort | Teeling, Hanno |
collection | PubMed |
description | BACKGROUND: Until today, analysis of 16S ribosomal RNA (rRNA) sequences has been the de-facto gold standard for the assessment of phylogenetic relationships among prokaryotes. However, the branching order of the individual phlya is not well-resolved in 16S rRNA-based trees. In search of an improvement, new phylogenetic methods have been developed alongside with the growing availability of complete genome sequences. Unfortunately, only a few genes in prokaryotic genomes qualify as universal phylogenetic markers and almost all of them have a lower information content than the 16S rRNA gene. Therefore, emphasis has been placed on methods that are based on multiple genes or even entire genomes. The concatenation of ribosomal protein sequences is one method which has been ascribed an improved resolution. Since there is neither a comprehensive database for ribosomal protein sequences nor a tool that assists in sequence retrieval and generation of respective input files for phylogenetic reconstruction programs, RibAlign has been developed to fill this gap. RESULTS: RibAlign serves two purposes: First, it provides a fast and scalable database that has been specifically adapted to eubacterial ribosomal protein sequences and second, it provides sophisticated import and export capabilities. This includes semi-automatic extraction of ribosomal protein sequences from whole-genome GenBank and FASTA files as well as exporting aligned, concatenated and filtered sequence files that can directly be used in conjunction with the PHYLIP and MrBayes phylogenetic reconstruction programs. CONCLUSION: Up to now, phylogeny based on concatenated ribosomal protein sequences is hampered by the limited set of sequenced genomes and high computational requirements. However, hundreds of full and draft genome sequencing projects are on the way, and advances in cluster-computing and algorithms make phylogenetic reconstructions feasible even with large alignments of concatenated marker genes. RibAlign is a first step in this direction and may be particularly interesting to scientists involved in whole genome sequencing of representatives of new or sparsely studied eubacterial phyla. RibAlign is available at |
format | Text |
id | pubmed-1421441 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-14214412006-04-01 RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits Teeling, Hanno Gloeckner, Frank Oliver BMC Bioinformatics Software BACKGROUND: Until today, analysis of 16S ribosomal RNA (rRNA) sequences has been the de-facto gold standard for the assessment of phylogenetic relationships among prokaryotes. However, the branching order of the individual phlya is not well-resolved in 16S rRNA-based trees. In search of an improvement, new phylogenetic methods have been developed alongside with the growing availability of complete genome sequences. Unfortunately, only a few genes in prokaryotic genomes qualify as universal phylogenetic markers and almost all of them have a lower information content than the 16S rRNA gene. Therefore, emphasis has been placed on methods that are based on multiple genes or even entire genomes. The concatenation of ribosomal protein sequences is one method which has been ascribed an improved resolution. Since there is neither a comprehensive database for ribosomal protein sequences nor a tool that assists in sequence retrieval and generation of respective input files for phylogenetic reconstruction programs, RibAlign has been developed to fill this gap. RESULTS: RibAlign serves two purposes: First, it provides a fast and scalable database that has been specifically adapted to eubacterial ribosomal protein sequences and second, it provides sophisticated import and export capabilities. This includes semi-automatic extraction of ribosomal protein sequences from whole-genome GenBank and FASTA files as well as exporting aligned, concatenated and filtered sequence files that can directly be used in conjunction with the PHYLIP and MrBayes phylogenetic reconstruction programs. CONCLUSION: Up to now, phylogeny based on concatenated ribosomal protein sequences is hampered by the limited set of sequenced genomes and high computational requirements. However, hundreds of full and draft genome sequencing projects are on the way, and advances in cluster-computing and algorithms make phylogenetic reconstructions feasible even with large alignments of concatenated marker genes. RibAlign is a first step in this direction and may be particularly interesting to scientists involved in whole genome sequencing of representatives of new or sparsely studied eubacterial phyla. RibAlign is available at BioMed Central 2006-02-13 /pmc/articles/PMC1421441/ /pubmed/16476165 http://dx.doi.org/10.1186/1471-2105-7-66 Text en Copyright © 2006 Teeling and Gloeckner; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software Teeling, Hanno Gloeckner, Frank Oliver RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits |
title | RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits |
title_full | RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits |
title_fullStr | RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits |
title_full_unstemmed | RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits |
title_short | RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits |
title_sort | ribalign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1421441/ https://www.ncbi.nlm.nih.gov/pubmed/16476165 http://dx.doi.org/10.1186/1471-2105-7-66 |
work_keys_str_mv | AT teelinghanno ribalignasoftwaretoolanddatabaseforeubacterialphylogenybasedonconcatenatedribosomalproteinsubunits AT gloecknerfrankoliver ribalignasoftwaretoolanddatabaseforeubacterialphylogenybasedonconcatenatedribosomalproteinsubunits |