Cargando…

Two accurate sequence, structure, and phylogenetic template-based RNA alignment systems

BACKGROUND: The analysis of RNA sequences, once a small niche field for a small collection of scientists whose primary emphasis was the structure and function of a few RNA molecules, has grown most significantly with the realizations that 1) RNA is implicated in many more functions within the cell,...

Descripción completa

Detalles Bibliográficos
Autores principales: Shang, Lei, Gardner, David P, Xu, Weijia, Cannone, Jamie J, Miranker, Daniel P, Ozer, Stuart, Gutell, Robin R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3854672/
https://www.ncbi.nlm.nih.gov/pubmed/24565058
http://dx.doi.org/10.1186/1752-0509-7-S4-S13
_version_ 1782294844392079360
author Shang, Lei
Gardner, David P
Xu, Weijia
Cannone, Jamie J
Miranker, Daniel P
Ozer, Stuart
Gutell, Robin R
author_facet Shang, Lei
Gardner, David P
Xu, Weijia
Cannone, Jamie J
Miranker, Daniel P
Ozer, Stuart
Gutell, Robin R
author_sort Shang, Lei
collection PubMed
description BACKGROUND: The analysis of RNA sequences, once a small niche field for a small collection of scientists whose primary emphasis was the structure and function of a few RNA molecules, has grown most significantly with the realizations that 1) RNA is implicated in many more functions within the cell, and 2) the analysis of ribosomal RNA sequences is revealing more about the microbial ecology within all biological and environmental systems. The accurate and rapid alignment of these RNA sequences is essential to decipher the maximum amount of information from this data. METHODS: Two computer systems that utilize the Gutell lab's RNA Comparative Analysis Database (rCAD) were developed to align sequences to an existing template alignment available at the Gutell lab's Comparative RNA Web (CRW) Site. Multiple dimensions of cross-indexed information are contained within the relational database - rCAD, including sequence alignments, the NCBI phylogenetic tree, and comparative secondary structure information for each aligned sequence. The first program, CRWAlign-1 creates a phylogenetic-based sequence profile for each column in the alignment. The second program, CRWAlign-2 creates a profile based on phylogenetic, secondary structure, and sequence information. Both programs utilize their profiles to align new sequences into the template alignment. RESULTS: The accuracies of the two CRWAlign programs were compared with the best template-based rRNA alignment programs and the best de-novo alignment programs. We have compared our programs with a total of eight alternative alignment methods on different sets of 16S rRNA alignments with sequence percent identities ranging from 50% to 100%. Both CRWAlign programs were superior to these other programs in accuracy and speed. CONCLUSIONS: Both CRWAlign programs can be used to align the very extensive amount of RNA sequencing that is generated due to the rapid next-generation sequencing technology. This latter technology is augmenting the new paradigm that RNA is intimately implicated in a significant number of functions within the cell. In addition, the use of bacterial 16S rRNA sequencing in the identification of the microbiome in many different environmental systems creates a need for rapid and highly accurate alignment of bacterial 16S rRNA sequences.
format Online
Article
Text
id pubmed-3854672
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38546722013-12-16 Two accurate sequence, structure, and phylogenetic template-based RNA alignment systems Shang, Lei Gardner, David P Xu, Weijia Cannone, Jamie J Miranker, Daniel P Ozer, Stuart Gutell, Robin R BMC Syst Biol Research BACKGROUND: The analysis of RNA sequences, once a small niche field for a small collection of scientists whose primary emphasis was the structure and function of a few RNA molecules, has grown most significantly with the realizations that 1) RNA is implicated in many more functions within the cell, and 2) the analysis of ribosomal RNA sequences is revealing more about the microbial ecology within all biological and environmental systems. The accurate and rapid alignment of these RNA sequences is essential to decipher the maximum amount of information from this data. METHODS: Two computer systems that utilize the Gutell lab's RNA Comparative Analysis Database (rCAD) were developed to align sequences to an existing template alignment available at the Gutell lab's Comparative RNA Web (CRW) Site. Multiple dimensions of cross-indexed information are contained within the relational database - rCAD, including sequence alignments, the NCBI phylogenetic tree, and comparative secondary structure information for each aligned sequence. The first program, CRWAlign-1 creates a phylogenetic-based sequence profile for each column in the alignment. The second program, CRWAlign-2 creates a profile based on phylogenetic, secondary structure, and sequence information. Both programs utilize their profiles to align new sequences into the template alignment. RESULTS: The accuracies of the two CRWAlign programs were compared with the best template-based rRNA alignment programs and the best de-novo alignment programs. We have compared our programs with a total of eight alternative alignment methods on different sets of 16S rRNA alignments with sequence percent identities ranging from 50% to 100%. Both CRWAlign programs were superior to these other programs in accuracy and speed. CONCLUSIONS: Both CRWAlign programs can be used to align the very extensive amount of RNA sequencing that is generated due to the rapid next-generation sequencing technology. This latter technology is augmenting the new paradigm that RNA is intimately implicated in a significant number of functions within the cell. In addition, the use of bacterial 16S rRNA sequencing in the identification of the microbiome in many different environmental systems creates a need for rapid and highly accurate alignment of bacterial 16S rRNA sequences. BioMed Central 2013-10-23 /pmc/articles/PMC3854672/ /pubmed/24565058 http://dx.doi.org/10.1186/1752-0509-7-S4-S13 Text en Copyright © 2013 Shang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Shang, Lei
Gardner, David P
Xu, Weijia
Cannone, Jamie J
Miranker, Daniel P
Ozer, Stuart
Gutell, Robin R
Two accurate sequence, structure, and phylogenetic template-based RNA alignment systems
title Two accurate sequence, structure, and phylogenetic template-based RNA alignment systems
title_full Two accurate sequence, structure, and phylogenetic template-based RNA alignment systems
title_fullStr Two accurate sequence, structure, and phylogenetic template-based RNA alignment systems
title_full_unstemmed Two accurate sequence, structure, and phylogenetic template-based RNA alignment systems
title_short Two accurate sequence, structure, and phylogenetic template-based RNA alignment systems
title_sort two accurate sequence, structure, and phylogenetic template-based rna alignment systems
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3854672/
https://www.ncbi.nlm.nih.gov/pubmed/24565058
http://dx.doi.org/10.1186/1752-0509-7-S4-S13
work_keys_str_mv AT shanglei twoaccuratesequencestructureandphylogenetictemplatebasedrnaalignmentsystems
AT gardnerdavidp twoaccuratesequencestructureandphylogenetictemplatebasedrnaalignmentsystems
AT xuweijia twoaccuratesequencestructureandphylogenetictemplatebasedrnaalignmentsystems
AT cannonejamiej twoaccuratesequencestructureandphylogenetictemplatebasedrnaalignmentsystems
AT mirankerdanielp twoaccuratesequencestructureandphylogenetictemplatebasedrnaalignmentsystems
AT ozerstuart twoaccuratesequencestructureandphylogenetictemplatebasedrnaalignmentsystems
AT gutellrobinr twoaccuratesequencestructureandphylogenetictemplatebasedrnaalignmentsystems