Cargando…

The Experimentally Obtained Functional Impact Assessments of 5' Splice Site GT'GC Variants Differ Markedly from Those Predicted

INTRODUCTION: 5' splice site GT>GC or +2T>C variants have been frequently reported to cause human genetic disease and are routinely scored as pathogenic splicing mutations. However, we have recently demonstrated that such variants in human disease genes may not invariably be pathogenic. M...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Jian-Min, Lin, Jin-Huan, Masson, Emmanuelle, Liao, Zhuan, Férec, Claude, Cooper, David N., Hayden, Matthew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Bentham Science Publishers 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324893/
https://www.ncbi.nlm.nih.gov/pubmed/32655299
http://dx.doi.org/10.2174/1389202921666200210141701
_version_ 1783552046921678848
author Chen, Jian-Min
Lin, Jin-Huan
Masson, Emmanuelle
Liao, Zhuan
Férec, Claude
Cooper, David N.
Hayden, Matthew
author_facet Chen, Jian-Min
Lin, Jin-Huan
Masson, Emmanuelle
Liao, Zhuan
Férec, Claude
Cooper, David N.
Hayden, Matthew
author_sort Chen, Jian-Min
collection PubMed
description INTRODUCTION: 5' splice site GT>GC or +2T>C variants have been frequently reported to cause human genetic disease and are routinely scored as pathogenic splicing mutations. However, we have recently demonstrated that such variants in human disease genes may not invariably be pathogenic. Moreover, we found that no splicing prediction tools appear to be capable of reliably distinguishing those +2T>C variants that generate wild-type transcripts from those that do not. METHODOLOGY: Herein, we evaluated the performance of a novel deep learning-based tool, SpliceAI, in the context of three datasets of +2T>C variants, all of which had been characterized functionally in terms of their impact on pre-mRNA splicing. The first two datasets refer to our recently described “in vivo” dataset of 45 known disease-causing +2T>C variants and the “in vitro” dataset of 103 +2T>C substitutions subjected to full-length gene splicing assay. The third dataset comprised 12 BRCA1 +2T>C variants that were recently analyzed by saturation genome editing. RESULTS: Comparison of the SpliceAI-predicted and experimentally obtained functional impact assessments of these variants (and smaller datasets of +2T>A and +2T>G variants) revealed that although SpliceAI performed rather better than other prediction tools, it was still far from perfect. A key issue was that the impact of those +2T>C (and +2T>A) variants that generated wild-type transcripts represents a quantitative change that can vary from barely detectable to an almost full expression of wild-type transcripts, with wild-type transcripts often co-existing with aberrantly spliced transcripts. CONCLUSION: Our findings highlight the challenges that we still face in attempting to accurately identify splice-altering variants.
format Online
Article
Text
id pubmed-7324893
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Bentham Science Publishers
record_format MEDLINE/PubMed
spelling pubmed-73248932020-07-10 The Experimentally Obtained Functional Impact Assessments of 5' Splice Site GT'GC Variants Differ Markedly from Those Predicted Chen, Jian-Min Lin, Jin-Huan Masson, Emmanuelle Liao, Zhuan Férec, Claude Cooper, David N. Hayden, Matthew Curr Genomics Article INTRODUCTION: 5' splice site GT>GC or +2T>C variants have been frequently reported to cause human genetic disease and are routinely scored as pathogenic splicing mutations. However, we have recently demonstrated that such variants in human disease genes may not invariably be pathogenic. Moreover, we found that no splicing prediction tools appear to be capable of reliably distinguishing those +2T>C variants that generate wild-type transcripts from those that do not. METHODOLOGY: Herein, we evaluated the performance of a novel deep learning-based tool, SpliceAI, in the context of three datasets of +2T>C variants, all of which had been characterized functionally in terms of their impact on pre-mRNA splicing. The first two datasets refer to our recently described “in vivo” dataset of 45 known disease-causing +2T>C variants and the “in vitro” dataset of 103 +2T>C substitutions subjected to full-length gene splicing assay. The third dataset comprised 12 BRCA1 +2T>C variants that were recently analyzed by saturation genome editing. RESULTS: Comparison of the SpliceAI-predicted and experimentally obtained functional impact assessments of these variants (and smaller datasets of +2T>A and +2T>G variants) revealed that although SpliceAI performed rather better than other prediction tools, it was still far from perfect. A key issue was that the impact of those +2T>C (and +2T>A) variants that generated wild-type transcripts represents a quantitative change that can vary from barely detectable to an almost full expression of wild-type transcripts, with wild-type transcripts often co-existing with aberrantly spliced transcripts. CONCLUSION: Our findings highlight the challenges that we still face in attempting to accurately identify splice-altering variants. Bentham Science Publishers 2020-01 2020-01 /pmc/articles/PMC7324893/ /pubmed/32655299 http://dx.doi.org/10.2174/1389202921666200210141701 Text en © 2020 Bentham Science Publishers https://creativecommons.org/licenses/by-nc/4.0/legalcode This is an open access article licensed under the terms of the Creative Commons Attribution-Non-Commercial 4.0 International Public License (CC BY-NC 4.0) (https://creativecommons.org/licenses/by-nc/4.0/legalcode), which permits unrestricted, non-commercial use, distribution and reproduction in any medium, provided the work is properly cited.
spellingShingle Article
Chen, Jian-Min
Lin, Jin-Huan
Masson, Emmanuelle
Liao, Zhuan
Férec, Claude
Cooper, David N.
Hayden, Matthew
The Experimentally Obtained Functional Impact Assessments of 5' Splice Site GT'GC Variants Differ Markedly from Those Predicted
title The Experimentally Obtained Functional Impact Assessments of 5' Splice Site GT'GC Variants Differ Markedly from Those Predicted
title_full The Experimentally Obtained Functional Impact Assessments of 5' Splice Site GT'GC Variants Differ Markedly from Those Predicted
title_fullStr The Experimentally Obtained Functional Impact Assessments of 5' Splice Site GT'GC Variants Differ Markedly from Those Predicted
title_full_unstemmed The Experimentally Obtained Functional Impact Assessments of 5' Splice Site GT'GC Variants Differ Markedly from Those Predicted
title_short The Experimentally Obtained Functional Impact Assessments of 5' Splice Site GT'GC Variants Differ Markedly from Those Predicted
title_sort experimentally obtained functional impact assessments of 5' splice site gt'gc variants differ markedly from those predicted
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324893/
https://www.ncbi.nlm.nih.gov/pubmed/32655299
http://dx.doi.org/10.2174/1389202921666200210141701
work_keys_str_mv AT chenjianmin theexperimentallyobtainedfunctionalimpactassessmentsof5splicesitegtgcvariantsdiffermarkedlyfromthosepredicted
AT linjinhuan theexperimentallyobtainedfunctionalimpactassessmentsof5splicesitegtgcvariantsdiffermarkedlyfromthosepredicted
AT massonemmanuelle theexperimentallyobtainedfunctionalimpactassessmentsof5splicesitegtgcvariantsdiffermarkedlyfromthosepredicted
AT liaozhuan theexperimentallyobtainedfunctionalimpactassessmentsof5splicesitegtgcvariantsdiffermarkedlyfromthosepredicted
AT ferecclaude theexperimentallyobtainedfunctionalimpactassessmentsof5splicesitegtgcvariantsdiffermarkedlyfromthosepredicted
AT cooperdavidn theexperimentallyobtainedfunctionalimpactassessmentsof5splicesitegtgcvariantsdiffermarkedlyfromthosepredicted
AT haydenmatthew theexperimentallyobtainedfunctionalimpactassessmentsof5splicesitegtgcvariantsdiffermarkedlyfromthosepredicted
AT chenjianmin experimentallyobtainedfunctionalimpactassessmentsof5splicesitegtgcvariantsdiffermarkedlyfromthosepredicted
AT linjinhuan experimentallyobtainedfunctionalimpactassessmentsof5splicesitegtgcvariantsdiffermarkedlyfromthosepredicted
AT massonemmanuelle experimentallyobtainedfunctionalimpactassessmentsof5splicesitegtgcvariantsdiffermarkedlyfromthosepredicted
AT liaozhuan experimentallyobtainedfunctionalimpactassessmentsof5splicesitegtgcvariantsdiffermarkedlyfromthosepredicted
AT ferecclaude experimentallyobtainedfunctionalimpactassessmentsof5splicesitegtgcvariantsdiffermarkedlyfromthosepredicted
AT cooperdavidn experimentallyobtainedfunctionalimpactassessmentsof5splicesitegtgcvariantsdiffermarkedlyfromthosepredicted
AT haydenmatthew experimentallyobtainedfunctionalimpactassessmentsof5splicesitegtgcvariantsdiffermarkedlyfromthosepredicted