Cargando…

Empirical prediction of variant-activated cryptic splice donors using population-based RNA-Seq data

Predicting which cryptic-donors may be activated by a splicing variant in patient DNA is notoriously difficult. Through analysis of 5145 cryptic-donors (versus 86,963 decoy-donors not used; any GT or GC), we define an empirical method predicting cryptic-donor activation with 87% sensitivity and 95%...

Descripción completa

Detalles Bibliográficos
Autores principales: Dawes, Ruebena, Joshi, Himanshu, Cooper, Sandra T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8964760/
https://www.ncbi.nlm.nih.gov/pubmed/35351883
http://dx.doi.org/10.1038/s41467-022-29271-y
_version_ 1784678289495818240
author Dawes, Ruebena
Joshi, Himanshu
Cooper, Sandra T.
author_facet Dawes, Ruebena
Joshi, Himanshu
Cooper, Sandra T.
author_sort Dawes, Ruebena
collection PubMed
description Predicting which cryptic-donors may be activated by a splicing variant in patient DNA is notoriously difficult. Through analysis of 5145 cryptic-donors (versus 86,963 decoy-donors not used; any GT or GC), we define an empirical method predicting cryptic-donor activation with 87% sensitivity and 95% specificity. Strength (according to four algorithms) and proximity to the annotated-donor appear important determinants of cryptic-donor activation. However, other factors such as splicing regulatory elements, which are difficult to identify, play an important role and are likely responsible for current prediction inaccuracies. We find that the most frequently recurring natural mis-splicing events at each exon-intron junction, summarised over 40,233 RNA-sequencing samples (40K-RNA), predict with accuracy which cryptic-donor will be activated in rare disease. 40K-RNA provides an accurate, evidence-based method to predict variant-activated cryptic-donors in genetic disorders, assisting pathology consideration of possible consequences of a variant for the encoded protein and RNA diagnostic testing strategies.
format Online
Article
Text
id pubmed-8964760
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-89647602022-04-20 Empirical prediction of variant-activated cryptic splice donors using population-based RNA-Seq data Dawes, Ruebena Joshi, Himanshu Cooper, Sandra T. Nat Commun Article Predicting which cryptic-donors may be activated by a splicing variant in patient DNA is notoriously difficult. Through analysis of 5145 cryptic-donors (versus 86,963 decoy-donors not used; any GT or GC), we define an empirical method predicting cryptic-donor activation with 87% sensitivity and 95% specificity. Strength (according to four algorithms) and proximity to the annotated-donor appear important determinants of cryptic-donor activation. However, other factors such as splicing regulatory elements, which are difficult to identify, play an important role and are likely responsible for current prediction inaccuracies. We find that the most frequently recurring natural mis-splicing events at each exon-intron junction, summarised over 40,233 RNA-sequencing samples (40K-RNA), predict with accuracy which cryptic-donor will be activated in rare disease. 40K-RNA provides an accurate, evidence-based method to predict variant-activated cryptic-donors in genetic disorders, assisting pathology consideration of possible consequences of a variant for the encoded protein and RNA diagnostic testing strategies. Nature Publishing Group UK 2022-03-29 /pmc/articles/PMC8964760/ /pubmed/35351883 http://dx.doi.org/10.1038/s41467-022-29271-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Dawes, Ruebena
Joshi, Himanshu
Cooper, Sandra T.
Empirical prediction of variant-activated cryptic splice donors using population-based RNA-Seq data
title Empirical prediction of variant-activated cryptic splice donors using population-based RNA-Seq data
title_full Empirical prediction of variant-activated cryptic splice donors using population-based RNA-Seq data
title_fullStr Empirical prediction of variant-activated cryptic splice donors using population-based RNA-Seq data
title_full_unstemmed Empirical prediction of variant-activated cryptic splice donors using population-based RNA-Seq data
title_short Empirical prediction of variant-activated cryptic splice donors using population-based RNA-Seq data
title_sort empirical prediction of variant-activated cryptic splice donors using population-based rna-seq data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8964760/
https://www.ncbi.nlm.nih.gov/pubmed/35351883
http://dx.doi.org/10.1038/s41467-022-29271-y
work_keys_str_mv AT dawesruebena empiricalpredictionofvariantactivatedcrypticsplicedonorsusingpopulationbasedrnaseqdata
AT joshihimanshu empiricalpredictionofvariantactivatedcrypticsplicedonorsusingpopulationbasedrnaseqdata
AT coopersandrat empiricalpredictionofvariantactivatedcrypticsplicedonorsusingpopulationbasedrnaseqdata