Cargando…
Massive computational identification of somatic variants in exonic splicing enhancers using The Cancer Genome Atlas
Owing to the development of next‐generation sequencing (NGS) technologies, a large number of somatic variants have been identified in various types of cancer. However, the functional significance of most somatic variants remains unknown. Somatic variants that occur in exonic splicing enhancer (ESE)...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6885893/ https://www.ncbi.nlm.nih.gov/pubmed/31631560 http://dx.doi.org/10.1002/cam4.2619 |
_version_ | 1783474814188519424 |
---|---|
author | Tanimoto, Kousuke Muramatsu, Tomoki Inazawa, Johji |
author_facet | Tanimoto, Kousuke Muramatsu, Tomoki Inazawa, Johji |
author_sort | Tanimoto, Kousuke |
collection | PubMed |
description | Owing to the development of next‐generation sequencing (NGS) technologies, a large number of somatic variants have been identified in various types of cancer. However, the functional significance of most somatic variants remains unknown. Somatic variants that occur in exonic splicing enhancer (ESE) regions are thought to prevent serine and arginine‐rich (SR) proteins from binding to ESE sequence motifs, which leads to exon skipping. We computationally identified somatic variants in ESEs by compiling numerous open‐access datasets from The Cancer Genome Atlas (TCGA). Using somatic variants and RNA‐seq data from 9635 patients across 32 TCGA projects, we identified 646 ESE‐disrupting variants. The false positive rate of our method, estimated using a permutation test, was approximately 1%. Of these ESE‐disrupting variants, approximately 71% were located in the binding motifs of four classical SR proteins. ESE‐disrupting variants occurred in proportion to the number of somatic variants, but not necessarily in the specific genes associated with the biological processes of cancer. Existing bioinformatics tools could not predict the pathogenicity of ESE‐disrupting variants identified in this study, although these variants could cause exon skipping. We demonstrated that ESE‐disrupting nonsense variants tended to escape nonsense‐mediated decay surveillance. Using integrated analyses of open access data, we could specifically identify ESE‐disrupting variants. We have generated a powerful tool, which can handle datasets without normal samples or raw data, and thus contribute to reducing variants of uncertain significance because our statistical approach only uses the exon‐junction read counts from the tumor samples. |
format | Online Article Text |
id | pubmed-6885893 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-68858932019-12-09 Massive computational identification of somatic variants in exonic splicing enhancers using The Cancer Genome Atlas Tanimoto, Kousuke Muramatsu, Tomoki Inazawa, Johji Cancer Med Cancer Biology Owing to the development of next‐generation sequencing (NGS) technologies, a large number of somatic variants have been identified in various types of cancer. However, the functional significance of most somatic variants remains unknown. Somatic variants that occur in exonic splicing enhancer (ESE) regions are thought to prevent serine and arginine‐rich (SR) proteins from binding to ESE sequence motifs, which leads to exon skipping. We computationally identified somatic variants in ESEs by compiling numerous open‐access datasets from The Cancer Genome Atlas (TCGA). Using somatic variants and RNA‐seq data from 9635 patients across 32 TCGA projects, we identified 646 ESE‐disrupting variants. The false positive rate of our method, estimated using a permutation test, was approximately 1%. Of these ESE‐disrupting variants, approximately 71% were located in the binding motifs of four classical SR proteins. ESE‐disrupting variants occurred in proportion to the number of somatic variants, but not necessarily in the specific genes associated with the biological processes of cancer. Existing bioinformatics tools could not predict the pathogenicity of ESE‐disrupting variants identified in this study, although these variants could cause exon skipping. We demonstrated that ESE‐disrupting nonsense variants tended to escape nonsense‐mediated decay surveillance. Using integrated analyses of open access data, we could specifically identify ESE‐disrupting variants. We have generated a powerful tool, which can handle datasets without normal samples or raw data, and thus contribute to reducing variants of uncertain significance because our statistical approach only uses the exon‐junction read counts from the tumor samples. John Wiley and Sons Inc. 2019-10-21 /pmc/articles/PMC6885893/ /pubmed/31631560 http://dx.doi.org/10.1002/cam4.2619 Text en © 2019 The Authors. Cancer Medicine published by John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Cancer Biology Tanimoto, Kousuke Muramatsu, Tomoki Inazawa, Johji Massive computational identification of somatic variants in exonic splicing enhancers using The Cancer Genome Atlas |
title | Massive computational identification of somatic variants in exonic splicing enhancers using The Cancer Genome Atlas |
title_full | Massive computational identification of somatic variants in exonic splicing enhancers using The Cancer Genome Atlas |
title_fullStr | Massive computational identification of somatic variants in exonic splicing enhancers using The Cancer Genome Atlas |
title_full_unstemmed | Massive computational identification of somatic variants in exonic splicing enhancers using The Cancer Genome Atlas |
title_short | Massive computational identification of somatic variants in exonic splicing enhancers using The Cancer Genome Atlas |
title_sort | massive computational identification of somatic variants in exonic splicing enhancers using the cancer genome atlas |
topic | Cancer Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6885893/ https://www.ncbi.nlm.nih.gov/pubmed/31631560 http://dx.doi.org/10.1002/cam4.2619 |
work_keys_str_mv | AT tanimotokousuke massivecomputationalidentificationofsomaticvariantsinexonicsplicingenhancersusingthecancergenomeatlas AT muramatsutomoki massivecomputationalidentificationofsomaticvariantsinexonicsplicingenhancersusingthecancergenomeatlas AT inazawajohji massivecomputationalidentificationofsomaticvariantsinexonicsplicingenhancersusingthecancergenomeatlas |