Cargando…

Features generated for computational splice-site prediction correspond to functional elements

BACKGROUND: Accurate selection of splice sites during the splicing of precursors to messenger RNA requires both relatively well-characterized signals at the splice sites and auxiliary signals in the adjacent exons and introns. We previously described a feature generation algorithm (FGA) that is capa...

Descripción completa

Detalles Bibliográficos
Autores principales: Dogan, Rezarta Islamaj, Getoor, Lise, Wilbur, W John, Mount, Stephen M
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2241647/
https://www.ncbi.nlm.nih.gov/pubmed/17958908
http://dx.doi.org/10.1186/1471-2105-8-410
_version_ 1782150528359202816
author Dogan, Rezarta Islamaj
Getoor, Lise
Wilbur, W John
Mount, Stephen M
author_facet Dogan, Rezarta Islamaj
Getoor, Lise
Wilbur, W John
Mount, Stephen M
author_sort Dogan, Rezarta Islamaj
collection PubMed
description BACKGROUND: Accurate selection of splice sites during the splicing of precursors to messenger RNA requires both relatively well-characterized signals at the splice sites and auxiliary signals in the adjacent exons and introns. We previously described a feature generation algorithm (FGA) that is capable of achieving high classification accuracy on human 3' splice sites. In this paper, we extend the splice-site prediction to 5' splice sites and explore the generated features for biologically meaningful splicing signals. RESULTS: We present examples from the observed features that correspond to known signals, both core signals (including the branch site and pyrimidine tract) and auxiliary signals (including GGG triplets and exon splicing enhancers). We present evidence that features identified by FGA include splicing signals not found by other methods. CONCLUSION: Our generated features capture known biological signals in the expected sequence interval flanking splice sites. The method can be easily applied to other species and to similar classification problems, such as tissue-specific regulatory elements, polyadenylation sites, promoters, etc.
format Text
id pubmed-2241647
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22416472008-02-13 Features generated for computational splice-site prediction correspond to functional elements Dogan, Rezarta Islamaj Getoor, Lise Wilbur, W John Mount, Stephen M BMC Bioinformatics Research Article BACKGROUND: Accurate selection of splice sites during the splicing of precursors to messenger RNA requires both relatively well-characterized signals at the splice sites and auxiliary signals in the adjacent exons and introns. We previously described a feature generation algorithm (FGA) that is capable of achieving high classification accuracy on human 3' splice sites. In this paper, we extend the splice-site prediction to 5' splice sites and explore the generated features for biologically meaningful splicing signals. RESULTS: We present examples from the observed features that correspond to known signals, both core signals (including the branch site and pyrimidine tract) and auxiliary signals (including GGG triplets and exon splicing enhancers). We present evidence that features identified by FGA include splicing signals not found by other methods. CONCLUSION: Our generated features capture known biological signals in the expected sequence interval flanking splice sites. The method can be easily applied to other species and to similar classification problems, such as tissue-specific regulatory elements, polyadenylation sites, promoters, etc. BioMed Central 2007-10-24 /pmc/articles/PMC2241647/ /pubmed/17958908 http://dx.doi.org/10.1186/1471-2105-8-410 Text en Copyright © 2007 Dogan et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Dogan, Rezarta Islamaj
Getoor, Lise
Wilbur, W John
Mount, Stephen M
Features generated for computational splice-site prediction correspond to functional elements
title Features generated for computational splice-site prediction correspond to functional elements
title_full Features generated for computational splice-site prediction correspond to functional elements
title_fullStr Features generated for computational splice-site prediction correspond to functional elements
title_full_unstemmed Features generated for computational splice-site prediction correspond to functional elements
title_short Features generated for computational splice-site prediction correspond to functional elements
title_sort features generated for computational splice-site prediction correspond to functional elements
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2241647/
https://www.ncbi.nlm.nih.gov/pubmed/17958908
http://dx.doi.org/10.1186/1471-2105-8-410
work_keys_str_mv AT doganrezartaislamaj featuresgeneratedforcomputationalsplicesitepredictioncorrespondtofunctionalelements
AT getoorlise featuresgeneratedforcomputationalsplicesitepredictioncorrespondtofunctionalelements
AT wilburwjohn featuresgeneratedforcomputationalsplicesitepredictioncorrespondtofunctionalelements
AT mountstephenm featuresgeneratedforcomputationalsplicesitepredictioncorrespondtofunctionalelements