Cargando…
A sequence-based, deep learning model accurately predicts RNA splicing branchpoints
Experimental detection of RNA splicing branchpoints is difficult. To date, high-confidence experimental annotations exist for 18% of 3′ splice sites in the human genome. We develop a deep-learning-based branchpoint predictor, LaBranchoR, which predicts a correct branchpoint for at least 75% of 3′ sp...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6239175/ https://www.ncbi.nlm.nih.gov/pubmed/30224349 http://dx.doi.org/10.1261/rna.066290.118 |
_version_ | 1783371517925523456 |
---|---|
author | Paggi, Joseph M. Bejerano, Gill |
author_facet | Paggi, Joseph M. Bejerano, Gill |
author_sort | Paggi, Joseph M. |
collection | PubMed |
description | Experimental detection of RNA splicing branchpoints is difficult. To date, high-confidence experimental annotations exist for 18% of 3′ splice sites in the human genome. We develop a deep-learning-based branchpoint predictor, LaBranchoR, which predicts a correct branchpoint for at least 75% of 3′ splice sites genome-wide. Detailed analysis of cases in which our predicted branchpoint deviates from experimental data suggests a correct branchpoint is predicted in over 90% of cases. We use our predicted branchpoints to identify a novel sequence element upstream of branchpoints consistent with extended U2 snRNA base-pairing, show an association between weak branchpoints and alternative splicing, and explore the effects of genetic variants on branchpoints. We provide genome-wide branchpoint annotations and in silico mutagenesis scores at http://bejerano.stanford.edu/labranchor. |
format | Online Article Text |
id | pubmed-6239175 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-62391752019-12-01 A sequence-based, deep learning model accurately predicts RNA splicing branchpoints Paggi, Joseph M. Bejerano, Gill RNA Bioinformatics Experimental detection of RNA splicing branchpoints is difficult. To date, high-confidence experimental annotations exist for 18% of 3′ splice sites in the human genome. We develop a deep-learning-based branchpoint predictor, LaBranchoR, which predicts a correct branchpoint for at least 75% of 3′ splice sites genome-wide. Detailed analysis of cases in which our predicted branchpoint deviates from experimental data suggests a correct branchpoint is predicted in over 90% of cases. We use our predicted branchpoints to identify a novel sequence element upstream of branchpoints consistent with extended U2 snRNA base-pairing, show an association between weak branchpoints and alternative splicing, and explore the effects of genetic variants on branchpoints. We provide genome-wide branchpoint annotations and in silico mutagenesis scores at http://bejerano.stanford.edu/labranchor. Cold Spring Harbor Laboratory Press 2018-12 /pmc/articles/PMC6239175/ /pubmed/30224349 http://dx.doi.org/10.1261/rna.066290.118 Text en © 2018 Paggi and Bejerano; Published by Cold Spring Harbor Laboratory Press for the RNA Society http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by the RNA Society for the first 12 months after the full-issue publication date (see http://rnajournal.cshlp.org/site/misc/terms.xhtml). After 12 months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/. |
spellingShingle | Bioinformatics Paggi, Joseph M. Bejerano, Gill A sequence-based, deep learning model accurately predicts RNA splicing branchpoints |
title | A sequence-based, deep learning model accurately predicts RNA splicing branchpoints |
title_full | A sequence-based, deep learning model accurately predicts RNA splicing branchpoints |
title_fullStr | A sequence-based, deep learning model accurately predicts RNA splicing branchpoints |
title_full_unstemmed | A sequence-based, deep learning model accurately predicts RNA splicing branchpoints |
title_short | A sequence-based, deep learning model accurately predicts RNA splicing branchpoints |
title_sort | sequence-based, deep learning model accurately predicts rna splicing branchpoints |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6239175/ https://www.ncbi.nlm.nih.gov/pubmed/30224349 http://dx.doi.org/10.1261/rna.066290.118 |
work_keys_str_mv | AT paggijosephm asequencebaseddeeplearningmodelaccuratelypredictsrnasplicingbranchpoints AT bejeranogill asequencebaseddeeplearningmodelaccuratelypredictsrnasplicingbranchpoints AT paggijosephm sequencebaseddeeplearningmodelaccuratelypredictsrnasplicingbranchpoints AT bejeranogill sequencebaseddeeplearningmodelaccuratelypredictsrnasplicingbranchpoints |