Cargando…
Features generated for computational splice-site prediction correspond to functional elements
BACKGROUND: Accurate selection of splice sites during the splicing of precursors to messenger RNA requires both relatively well-characterized signals at the splice sites and auxiliary signals in the adjacent exons and introns. We previously described a feature generation algorithm (FGA) that is capa...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2241647/ https://www.ncbi.nlm.nih.gov/pubmed/17958908 http://dx.doi.org/10.1186/1471-2105-8-410 |
_version_ | 1782150528359202816 |
---|---|
author | Dogan, Rezarta Islamaj Getoor, Lise Wilbur, W John Mount, Stephen M |
author_facet | Dogan, Rezarta Islamaj Getoor, Lise Wilbur, W John Mount, Stephen M |
author_sort | Dogan, Rezarta Islamaj |
collection | PubMed |
description | BACKGROUND: Accurate selection of splice sites during the splicing of precursors to messenger RNA requires both relatively well-characterized signals at the splice sites and auxiliary signals in the adjacent exons and introns. We previously described a feature generation algorithm (FGA) that is capable of achieving high classification accuracy on human 3' splice sites. In this paper, we extend the splice-site prediction to 5' splice sites and explore the generated features for biologically meaningful splicing signals. RESULTS: We present examples from the observed features that correspond to known signals, both core signals (including the branch site and pyrimidine tract) and auxiliary signals (including GGG triplets and exon splicing enhancers). We present evidence that features identified by FGA include splicing signals not found by other methods. CONCLUSION: Our generated features capture known biological signals in the expected sequence interval flanking splice sites. The method can be easily applied to other species and to similar classification problems, such as tissue-specific regulatory elements, polyadenylation sites, promoters, etc. |
format | Text |
id | pubmed-2241647 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-22416472008-02-13 Features generated for computational splice-site prediction correspond to functional elements Dogan, Rezarta Islamaj Getoor, Lise Wilbur, W John Mount, Stephen M BMC Bioinformatics Research Article BACKGROUND: Accurate selection of splice sites during the splicing of precursors to messenger RNA requires both relatively well-characterized signals at the splice sites and auxiliary signals in the adjacent exons and introns. We previously described a feature generation algorithm (FGA) that is capable of achieving high classification accuracy on human 3' splice sites. In this paper, we extend the splice-site prediction to 5' splice sites and explore the generated features for biologically meaningful splicing signals. RESULTS: We present examples from the observed features that correspond to known signals, both core signals (including the branch site and pyrimidine tract) and auxiliary signals (including GGG triplets and exon splicing enhancers). We present evidence that features identified by FGA include splicing signals not found by other methods. CONCLUSION: Our generated features capture known biological signals in the expected sequence interval flanking splice sites. The method can be easily applied to other species and to similar classification problems, such as tissue-specific regulatory elements, polyadenylation sites, promoters, etc. BioMed Central 2007-10-24 /pmc/articles/PMC2241647/ /pubmed/17958908 http://dx.doi.org/10.1186/1471-2105-8-410 Text en Copyright © 2007 Dogan et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Dogan, Rezarta Islamaj Getoor, Lise Wilbur, W John Mount, Stephen M Features generated for computational splice-site prediction correspond to functional elements |
title | Features generated for computational splice-site prediction correspond to functional elements |
title_full | Features generated for computational splice-site prediction correspond to functional elements |
title_fullStr | Features generated for computational splice-site prediction correspond to functional elements |
title_full_unstemmed | Features generated for computational splice-site prediction correspond to functional elements |
title_short | Features generated for computational splice-site prediction correspond to functional elements |
title_sort | features generated for computational splice-site prediction correspond to functional elements |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2241647/ https://www.ncbi.nlm.nih.gov/pubmed/17958908 http://dx.doi.org/10.1186/1471-2105-8-410 |
work_keys_str_mv | AT doganrezartaislamaj featuresgeneratedforcomputationalsplicesitepredictioncorrespondtofunctionalelements AT getoorlise featuresgeneratedforcomputationalsplicesitepredictioncorrespondtofunctionalelements AT wilburwjohn featuresgeneratedforcomputationalsplicesitepredictioncorrespondtofunctionalelements AT mountstephenm featuresgeneratedforcomputationalsplicesitepredictioncorrespondtofunctionalelements |