
RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits

BACKGROUND: Until today, analysis of 16S ribosomal RNA (rRNA) sequences has been the de facto gold standard for the assessment of phylogenetic relationships among prokaryotes. However, the branching order of the individual phyla is not well resolved in 16S rRNA-based trees. In search of an improveme...


Bibliographic Details
Main authors: Teeling, Hanno; Gloeckner, Frank Oliver
Format: Text
Language: English
Published: BioMed Central 2006
Subjects: Software
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1421441/
https://www.ncbi.nlm.nih.gov/pubmed/16476165
http://dx.doi.org/10.1186/1471-2105-7-66
_version_ 1782127179330486272
author Teeling, Hanno
Gloeckner, Frank Oliver
author_facet Teeling, Hanno
Gloeckner, Frank Oliver
author_sort Teeling, Hanno
collection PubMed
description BACKGROUND: Until today, analysis of 16S ribosomal RNA (rRNA) sequences has been the de facto gold standard for the assessment of phylogenetic relationships among prokaryotes. However, the branching order of the individual phyla is not well resolved in 16S rRNA-based trees. In search of an improvement, new phylogenetic methods have been developed alongside the growing availability of complete genome sequences. Unfortunately, only a few genes in prokaryotic genomes qualify as universal phylogenetic markers and almost all of them have a lower information content than the 16S rRNA gene. Therefore, emphasis has been placed on methods that are based on multiple genes or even entire genomes. The concatenation of ribosomal protein sequences is one method that is credited with improved resolution. Since there is neither a comprehensive database for ribosomal protein sequences nor a tool that assists in sequence retrieval and generation of respective input files for phylogenetic reconstruction programs, RibAlign has been developed to fill this gap. RESULTS: RibAlign serves two purposes: First, it provides a fast and scalable database that has been specifically adapted to eubacterial ribosomal protein sequences and second, it provides sophisticated import and export capabilities. This includes semi-automatic extraction of ribosomal protein sequences from whole-genome GenBank and FASTA files as well as exporting aligned, concatenated and filtered sequence files that can directly be used in conjunction with the PHYLIP and MrBayes phylogenetic reconstruction programs. CONCLUSION: Up to now, phylogeny based on concatenated ribosomal protein sequences has been hampered by the limited set of sequenced genomes and high computational requirements.
However, hundreds of full and draft genome sequencing projects are under way, and advances in cluster computing and algorithms make phylogenetic reconstructions feasible even with large alignments of concatenated marker genes. RibAlign is a first step in this direction and may be particularly interesting to scientists involved in whole genome sequencing of representatives of new or sparsely studied eubacterial phyla. RibAlign is available at
format Text
id pubmed-1421441
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-1421441 2006-04-01 RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits Teeling, Hanno Gloeckner, Frank Oliver BMC Bioinformatics Software BACKGROUND: Until today, analysis of 16S ribosomal RNA (rRNA) sequences has been the de facto gold standard for the assessment of phylogenetic relationships among prokaryotes. However, the branching order of the individual phyla is not well resolved in 16S rRNA-based trees. In search of an improvement, new phylogenetic methods have been developed alongside the growing availability of complete genome sequences. Unfortunately, only a few genes in prokaryotic genomes qualify as universal phylogenetic markers and almost all of them have a lower information content than the 16S rRNA gene. Therefore, emphasis has been placed on methods that are based on multiple genes or even entire genomes. The concatenation of ribosomal protein sequences is one method that is credited with improved resolution. Since there is neither a comprehensive database for ribosomal protein sequences nor a tool that assists in sequence retrieval and generation of respective input files for phylogenetic reconstruction programs, RibAlign has been developed to fill this gap. RESULTS: RibAlign serves two purposes: First, it provides a fast and scalable database that has been specifically adapted to eubacterial ribosomal protein sequences and second, it provides sophisticated import and export capabilities. This includes semi-automatic extraction of ribosomal protein sequences from whole-genome GenBank and FASTA files as well as exporting aligned, concatenated and filtered sequence files that can directly be used in conjunction with the PHYLIP and MrBayes phylogenetic reconstruction programs. CONCLUSION: Up to now, phylogeny based on concatenated ribosomal protein sequences has been hampered by the limited set of sequenced genomes and high computational requirements.
However, hundreds of full and draft genome sequencing projects are under way, and advances in cluster computing and algorithms make phylogenetic reconstructions feasible even with large alignments of concatenated marker genes. RibAlign is a first step in this direction and may be particularly interesting to scientists involved in whole genome sequencing of representatives of new or sparsely studied eubacterial phyla. RibAlign is available at BioMed Central 2006-02-13 /pmc/articles/PMC1421441/ /pubmed/16476165 http://dx.doi.org/10.1186/1471-2105-7-66 Text en Copyright © 2006 Teeling and Gloeckner; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Teeling, Hanno
Gloeckner, Frank Oliver
RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits
title RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits
title_full RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits
title_fullStr RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits
title_full_unstemmed RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits
title_short RibAlign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits
title_sort ribalign: a software tool and database for eubacterial phylogeny based on concatenated ribosomal protein subunits
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1421441/
https://www.ncbi.nlm.nih.gov/pubmed/16476165
http://dx.doi.org/10.1186/1471-2105-7-66
work_keys_str_mv AT teelinghanno ribalignasoftwaretoolanddatabaseforeubacterialphylogenybasedonconcatenatedribosomalproteinsubunits
AT gloecknerfrankoliver ribalignasoftwaretoolanddatabaseforeubacterialphylogenybasedonconcatenatedribosomalproteinsubunits