Cargando…

Accurate prediction of NAGNAG alternative splicing

Alternative splicing (AS) involving NAGNAG tandem acceptors is an evolutionarily widespread class of AS. Recent predictions of alternative acceptor usage reported better results for acceptors separated by larger distances, than for NAGNAGs. To improve the latter, we aimed at the use of Bayesian netw...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sinha, Rileen, Nikolajewa, Swetlana, Szafranski, Karol, Hiller, Michael, Jahn, Niels, Huse, Klaus, Platzer, Matthias, Backofen, Rolf
Formato:	Texto
Lenguaje:	English
Publicado:	Oxford University Press 2009
Materias:	Computational Biology
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2699507/ https://www.ncbi.nlm.nih.gov/pubmed/19359358 http://dx.doi.org/10.1093/nar/gkp220

_version_	1782168500629929984
author	Sinha, Rileen Nikolajewa, Swetlana Szafranski, Karol Hiller, Michael Jahn, Niels Huse, Klaus Platzer, Matthias Backofen, Rolf
author_facet	Sinha, Rileen Nikolajewa, Swetlana Szafranski, Karol Hiller, Michael Jahn, Niels Huse, Klaus Platzer, Matthias Backofen, Rolf
author_sort	Sinha, Rileen
collection	PubMed
description	Alternative splicing (AS) involving NAGNAG tandem acceptors is an evolutionarily widespread class of AS. Recent predictions of alternative acceptor usage reported better results for acceptors separated by larger distances, than for NAGNAGs. To improve the latter, we aimed at the use of Bayesian networks (BN), and extensive experimental validation of the predictions. Using carefully constructed training and test datasets, a balanced sensitivity and specificity of ≥92% was achieved. A BN trained on the combined dataset was then used to make predictions, and 81% (38/47) of the experimentally tested predictions were verified. Using a BN learned on human data on six other genomes, we show that while the performance for the vertebrate genomes matches that achieved on human data, there is a slight drop for Drosophila and worm. Lastly, using the prediction accuracy according to experimental validation, we estimate the number of yet undiscovered alternative NAGNAGs. State of the art classifiers can produce highly accurate prediction of AS at NAGNAGs, indicating that we have identified the major features of the ‘NAGNAG-splicing code’ within the splice site and its immediate neighborhood. Our results suggest that the mechanism behind NAGNAG AS is simple, stochastic, and conserved among vertebrates and beyond.
format	Text
id	pubmed-2699507
institution	National Center for Biotechnology Information
language	English
publishDate	2009
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-26995072009-06-22 Accurate prediction of NAGNAG alternative splicing Sinha, Rileen Nikolajewa, Swetlana Szafranski, Karol Hiller, Michael Jahn, Niels Huse, Klaus Platzer, Matthias Backofen, Rolf Nucleic Acids Res Computational Biology Alternative splicing (AS) involving NAGNAG tandem acceptors is an evolutionarily widespread class of AS. Recent predictions of alternative acceptor usage reported better results for acceptors separated by larger distances, than for NAGNAGs. To improve the latter, we aimed at the use of Bayesian networks (BN), and extensive experimental validation of the predictions. Using carefully constructed training and test datasets, a balanced sensitivity and specificity of ≥92% was achieved. A BN trained on the combined dataset was then used to make predictions, and 81% (38/47) of the experimentally tested predictions were verified. Using a BN learned on human data on six other genomes, we show that while the performance for the vertebrate genomes matches that achieved on human data, there is a slight drop for Drosophila and worm. Lastly, using the prediction accuracy according to experimental validation, we estimate the number of yet undiscovered alternative NAGNAGs. State of the art classifiers can produce highly accurate prediction of AS at NAGNAGs, indicating that we have identified the major features of the ‘NAGNAG-splicing code’ within the splice site and its immediate neighborhood. Our results suggest that the mechanism behind NAGNAG AS is simple, stochastic, and conserved among vertebrates and beyond. Oxford University Press 2009-06 2009-04-09 /pmc/articles/PMC2699507/ /pubmed/19359358 http://dx.doi.org/10.1093/nar/gkp220 Text en © 2009 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Computational Biology Sinha, Rileen Nikolajewa, Swetlana Szafranski, Karol Hiller, Michael Jahn, Niels Huse, Klaus Platzer, Matthias Backofen, Rolf Accurate prediction of NAGNAG alternative splicing
title	Accurate prediction of NAGNAG alternative splicing
title_full	Accurate prediction of NAGNAG alternative splicing
title_fullStr	Accurate prediction of NAGNAG alternative splicing
title_full_unstemmed	Accurate prediction of NAGNAG alternative splicing
title_short	Accurate prediction of NAGNAG alternative splicing
title_sort	accurate prediction of nagnag alternative splicing
topic	Computational Biology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2699507/ https://www.ncbi.nlm.nih.gov/pubmed/19359358 http://dx.doi.org/10.1093/nar/gkp220
work_keys_str_mv	AT sinharileen accuratepredictionofnagnagalternativesplicing AT nikolajewaswetlana accuratepredictionofnagnagalternativesplicing AT szafranskikarol accuratepredictionofnagnagalternativesplicing AT hillermichael accuratepredictionofnagnagalternativesplicing AT jahnniels accuratepredictionofnagnagalternativesplicing AT huseklaus accuratepredictionofnagnagalternativesplicing AT platzermatthias accuratepredictionofnagnagalternativesplicing AT backofenrolf accuratepredictionofnagnagalternativesplicing

Accurate prediction of NAGNAG alternative splicing

Ejemplares similares