Cargando…
Empirical prediction of variant-activated cryptic splice donors using population-based RNA-Seq data
Predicting which cryptic-donors may be activated by a splicing variant in patient DNA is notoriously difficult. Through analysis of 5145 cryptic-donors (versus 86,963 decoy-donors not used; any GT or GC), we define an empirical method predicting cryptic-donor activation with 87% sensitivity and 95%...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8964760/ https://www.ncbi.nlm.nih.gov/pubmed/35351883 http://dx.doi.org/10.1038/s41467-022-29271-y |
_version_ | 1784678289495818240 |
---|---|
author | Dawes, Ruebena Joshi, Himanshu Cooper, Sandra T. |
author_facet | Dawes, Ruebena Joshi, Himanshu Cooper, Sandra T. |
author_sort | Dawes, Ruebena |
collection | PubMed |
description | Predicting which cryptic-donors may be activated by a splicing variant in patient DNA is notoriously difficult. Through analysis of 5145 cryptic-donors (versus 86,963 decoy-donors not used; any GT or GC), we define an empirical method predicting cryptic-donor activation with 87% sensitivity and 95% specificity. Strength (according to four algorithms) and proximity to the annotated-donor appear important determinants of cryptic-donor activation. However, other factors such as splicing regulatory elements, which are difficult to identify, play an important role and are likely responsible for current prediction inaccuracies. We find that the most frequently recurring natural mis-splicing events at each exon-intron junction, summarised over 40,233 RNA-sequencing samples (40K-RNA), predict with accuracy which cryptic-donor will be activated in rare disease. 40K-RNA provides an accurate, evidence-based method to predict variant-activated cryptic-donors in genetic disorders, assisting pathology consideration of possible consequences of a variant for the encoded protein and RNA diagnostic testing strategies. |
format | Online Article Text |
id | pubmed-8964760 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-89647602022-04-20 Empirical prediction of variant-activated cryptic splice donors using population-based RNA-Seq data Dawes, Ruebena Joshi, Himanshu Cooper, Sandra T. Nat Commun Article Predicting which cryptic-donors may be activated by a splicing variant in patient DNA is notoriously difficult. Through analysis of 5145 cryptic-donors (versus 86,963 decoy-donors not used; any GT or GC), we define an empirical method predicting cryptic-donor activation with 87% sensitivity and 95% specificity. Strength (according to four algorithms) and proximity to the annotated-donor appear important determinants of cryptic-donor activation. However, other factors such as splicing regulatory elements, which are difficult to identify, play an important role and are likely responsible for current prediction inaccuracies. We find that the most frequently recurring natural mis-splicing events at each exon-intron junction, summarised over 40,233 RNA-sequencing samples (40K-RNA), predict with accuracy which cryptic-donor will be activated in rare disease. 40K-RNA provides an accurate, evidence-based method to predict variant-activated cryptic-donors in genetic disorders, assisting pathology consideration of possible consequences of a variant for the encoded protein and RNA diagnostic testing strategies. Nature Publishing Group UK 2022-03-29 /pmc/articles/PMC8964760/ /pubmed/35351883 http://dx.doi.org/10.1038/s41467-022-29271-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Dawes, Ruebena Joshi, Himanshu Cooper, Sandra T. Empirical prediction of variant-activated cryptic splice donors using population-based RNA-Seq data |
title | Empirical prediction of variant-activated cryptic splice donors using population-based RNA-Seq data |
title_full | Empirical prediction of variant-activated cryptic splice donors using population-based RNA-Seq data |
title_fullStr | Empirical prediction of variant-activated cryptic splice donors using population-based RNA-Seq data |
title_full_unstemmed | Empirical prediction of variant-activated cryptic splice donors using population-based RNA-Seq data |
title_short | Empirical prediction of variant-activated cryptic splice donors using population-based RNA-Seq data |
title_sort | empirical prediction of variant-activated cryptic splice donors using population-based rna-seq data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8964760/ https://www.ncbi.nlm.nih.gov/pubmed/35351883 http://dx.doi.org/10.1038/s41467-022-29271-y |
work_keys_str_mv | AT dawesruebena empiricalpredictionofvariantactivatedcrypticsplicedonorsusingpopulationbasedrnaseqdata AT joshihimanshu empiricalpredictionofvariantactivatedcrypticsplicedonorsusingpopulationbasedrnaseqdata AT coopersandrat empiricalpredictionofvariantactivatedcrypticsplicedonorsusingpopulationbasedrnaseqdata |