Cargando…

Impact of RNA structure on the prediction of donor and acceptor splice sites

BACKGROUND: gene identification in genomic DNA sequences by computational methods has become an important task in bioinformatics and computational gene prediction tools are now essential components of every genome sequencing project. Prediction of splice sites is a key step of all gene structural pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Marashi, Sayed-Amir, Eslahchi, Changiz, Pezeshk, Hamid, Sadeghi, Mehdi
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1526458/
https://www.ncbi.nlm.nih.gov/pubmed/16772025
http://dx.doi.org/10.1186/1471-2105-7-297
_version_ 1782128901717229568
author Marashi, Sayed-Amir
Eslahchi, Changiz
Pezeshk, Hamid
Sadeghi, Mehdi
author_facet Marashi, Sayed-Amir
Eslahchi, Changiz
Pezeshk, Hamid
Sadeghi, Mehdi
author_sort Marashi, Sayed-Amir
collection PubMed
description BACKGROUND: gene identification in genomic DNA sequences by computational methods has become an important task in bioinformatics and computational gene prediction tools are now essential components of every genome sequencing project. Prediction of splice sites is a key step of all gene structural prediction algorithms. RESULTS: we sought the role of mRNA secondary structures and their information contents for five vertebrate and plant splice site datasets. We selected 900-nucleotide sequences centered at each (real or decoy) donor and acceptor sites, and predicted their corresponding RNA structures by Vienna software. Then, based on whether the nucleotide is in a stem or not, the conventional four-letter nucleotide alphabet was translated into an eight-letter alphabet. Zero-, first- and second-order Markov models were selected as the signal detection methods. It is shown that applying the eight-letter alphabet compared to the four-letter alphabet considerably increases the accuracy of both donor and acceptor site predictions in case of higher order Markov models. CONCLUSION: Our results imply that RNA structure contains important data and future gene prediction programs can take advantage of such information.
format Text
id pubmed-1526458
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15264582006-08-07 Impact of RNA structure on the prediction of donor and acceptor splice sites Marashi, Sayed-Amir Eslahchi, Changiz Pezeshk, Hamid Sadeghi, Mehdi BMC Bioinformatics Research Article BACKGROUND: gene identification in genomic DNA sequences by computational methods has become an important task in bioinformatics and computational gene prediction tools are now essential components of every genome sequencing project. Prediction of splice sites is a key step of all gene structural prediction algorithms. RESULTS: we sought the role of mRNA secondary structures and their information contents for five vertebrate and plant splice site datasets. We selected 900-nucleotide sequences centered at each (real or decoy) donor and acceptor sites, and predicted their corresponding RNA structures by Vienna software. Then, based on whether the nucleotide is in a stem or not, the conventional four-letter nucleotide alphabet was translated into an eight-letter alphabet. Zero-, first- and second-order Markov models were selected as the signal detection methods. It is shown that applying the eight-letter alphabet compared to the four-letter alphabet considerably increases the accuracy of both donor and acceptor site predictions in case of higher order Markov models. CONCLUSION: Our results imply that RNA structure contains important data and future gene prediction programs can take advantage of such information. BioMed Central 2006-06-13 /pmc/articles/PMC1526458/ /pubmed/16772025 http://dx.doi.org/10.1186/1471-2105-7-297 Text en Copyright © 2006 Marashi et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Marashi, Sayed-Amir
Eslahchi, Changiz
Pezeshk, Hamid
Sadeghi, Mehdi
Impact of RNA structure on the prediction of donor and acceptor splice sites
title Impact of RNA structure on the prediction of donor and acceptor splice sites
title_full Impact of RNA structure on the prediction of donor and acceptor splice sites
title_fullStr Impact of RNA structure on the prediction of donor and acceptor splice sites
title_full_unstemmed Impact of RNA structure on the prediction of donor and acceptor splice sites
title_short Impact of RNA structure on the prediction of donor and acceptor splice sites
title_sort impact of rna structure on the prediction of donor and acceptor splice sites
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1526458/
https://www.ncbi.nlm.nih.gov/pubmed/16772025
http://dx.doi.org/10.1186/1471-2105-7-297
work_keys_str_mv AT marashisayedamir impactofrnastructureonthepredictionofdonorandacceptorsplicesites
AT eslahchichangiz impactofrnastructureonthepredictionofdonorandacceptorsplicesites
AT pezeshkhamid impactofrnastructureonthepredictionofdonorandacceptorsplicesites
AT sadeghimehdi impactofrnastructureonthepredictionofdonorandacceptorsplicesites