Cargando…
Impact of RNA structure on the prediction of donor and acceptor splice sites
BACKGROUND: gene identification in genomic DNA sequences by computational methods has become an important task in bioinformatics and computational gene prediction tools are now essential components of every genome sequencing project. Prediction of splice sites is a key step of all gene structural pr...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1526458/ https://www.ncbi.nlm.nih.gov/pubmed/16772025 http://dx.doi.org/10.1186/1471-2105-7-297 |
_version_ | 1782128901717229568 |
---|---|
author | Marashi, Sayed-Amir Eslahchi, Changiz Pezeshk, Hamid Sadeghi, Mehdi |
author_facet | Marashi, Sayed-Amir Eslahchi, Changiz Pezeshk, Hamid Sadeghi, Mehdi |
author_sort | Marashi, Sayed-Amir |
collection | PubMed |
description | BACKGROUND: gene identification in genomic DNA sequences by computational methods has become an important task in bioinformatics and computational gene prediction tools are now essential components of every genome sequencing project. Prediction of splice sites is a key step of all gene structural prediction algorithms. RESULTS: we sought the role of mRNA secondary structures and their information contents for five vertebrate and plant splice site datasets. We selected 900-nucleotide sequences centered at each (real or decoy) donor and acceptor sites, and predicted their corresponding RNA structures by Vienna software. Then, based on whether the nucleotide is in a stem or not, the conventional four-letter nucleotide alphabet was translated into an eight-letter alphabet. Zero-, first- and second-order Markov models were selected as the signal detection methods. It is shown that applying the eight-letter alphabet compared to the four-letter alphabet considerably increases the accuracy of both donor and acceptor site predictions in case of higher order Markov models. CONCLUSION: Our results imply that RNA structure contains important data and future gene prediction programs can take advantage of such information. |
format | Text |
id | pubmed-1526458 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-15264582006-08-07 Impact of RNA structure on the prediction of donor and acceptor splice sites Marashi, Sayed-Amir Eslahchi, Changiz Pezeshk, Hamid Sadeghi, Mehdi BMC Bioinformatics Research Article BACKGROUND: gene identification in genomic DNA sequences by computational methods has become an important task in bioinformatics and computational gene prediction tools are now essential components of every genome sequencing project. Prediction of splice sites is a key step of all gene structural prediction algorithms. RESULTS: we sought the role of mRNA secondary structures and their information contents for five vertebrate and plant splice site datasets. We selected 900-nucleotide sequences centered at each (real or decoy) donor and acceptor sites, and predicted their corresponding RNA structures by Vienna software. Then, based on whether the nucleotide is in a stem or not, the conventional four-letter nucleotide alphabet was translated into an eight-letter alphabet. Zero-, first- and second-order Markov models were selected as the signal detection methods. It is shown that applying the eight-letter alphabet compared to the four-letter alphabet considerably increases the accuracy of both donor and acceptor site predictions in case of higher order Markov models. CONCLUSION: Our results imply that RNA structure contains important data and future gene prediction programs can take advantage of such information. BioMed Central 2006-06-13 /pmc/articles/PMC1526458/ /pubmed/16772025 http://dx.doi.org/10.1186/1471-2105-7-297 Text en Copyright © 2006 Marashi et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Marashi, Sayed-Amir Eslahchi, Changiz Pezeshk, Hamid Sadeghi, Mehdi Impact of RNA structure on the prediction of donor and acceptor splice sites |
title | Impact of RNA structure on the prediction of donor and acceptor splice sites |
title_full | Impact of RNA structure on the prediction of donor and acceptor splice sites |
title_fullStr | Impact of RNA structure on the prediction of donor and acceptor splice sites |
title_full_unstemmed | Impact of RNA structure on the prediction of donor and acceptor splice sites |
title_short | Impact of RNA structure on the prediction of donor and acceptor splice sites |
title_sort | impact of rna structure on the prediction of donor and acceptor splice sites |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1526458/ https://www.ncbi.nlm.nih.gov/pubmed/16772025 http://dx.doi.org/10.1186/1471-2105-7-297 |
work_keys_str_mv | AT marashisayedamir impactofrnastructureonthepredictionofdonorandacceptorsplicesites AT eslahchichangiz impactofrnastructureonthepredictionofdonorandacceptorsplicesites AT pezeshkhamid impactofrnastructureonthepredictionofdonorandacceptorsplicesites AT sadeghimehdi impactofrnastructureonthepredictionofdonorandacceptorsplicesites |