Cargando…

CScape-somatic: distinguishing driver and passenger point mutations in the cancer genome

MOTIVATION: Next-generation sequencing technologies have accelerated the discovery of single nucleotide variants in the human genome, stimulating the development of predictors for classifying which of these variants are likely functional in disease, and which neutral. Recently, we proposed CScape, a...

Descripción completa

Detalles Bibliográficos
Autores principales: Rogers, Mark F, Gaunt, Tom R, Campbell, Colin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7320610/
https://www.ncbi.nlm.nih.gov/pubmed/32282885
http://dx.doi.org/10.1093/bioinformatics/btaa242
_version_ 1783551278044938240
author Rogers, Mark F
Gaunt, Tom R
Campbell, Colin
author_facet Rogers, Mark F
Gaunt, Tom R
Campbell, Colin
author_sort Rogers, Mark F
collection PubMed
description MOTIVATION: Next-generation sequencing technologies have accelerated the discovery of single nucleotide variants in the human genome, stimulating the development of predictors for classifying which of these variants are likely functional in disease, and which neutral. Recently, we proposed CScape, a method for discriminating between cancer driver mutations and presumed benign variants. For the neutral class, this method relied on benign germline variants found in the 1000 Genomes Project database. Discrimination could, therefore, be influenced by the distinction of germline versus somatic, rather than neutral versus disease driver. This motivates this article in which we consider predictive discrimination between recurrent and rare somatic single point mutations based solely on using cancer data, and the distinction between these two somatic classes and germline single point mutations. RESULTS: For somatic point mutations in coding and non-coding regions of the genome, we propose CScape-somatic, an integrative classifier for predictively discriminating between recurrent and rare variants in the human cancer genome. In this study, we use purely cancer genome data and investigate the distinction between minimal occurrence and significantly recurrent somatic single point mutations in the human cancer genome. We show that this type of predictive distinction can give novel insight, and may deliver more meaningful prediction in both coding and non-coding regions of the cancer genome. Tested on somatic mutations, CScape-somatic outperforms alternative methods, reaching 74% balanced accuracy in coding regions and 69% in non-coding regions, whereas even higher accuracy may be achieved using thresholds to isolate high-confidence predictions. AVAILABILITY AND IMPLEMENTATION: Predictions and software are available at http://CScape-somatic.biocompute.org.uk/. CONTACT: mark.f.rogers.phd@gmail.com or C.Campbell@bristol.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-7320610
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-73206102020-07-01 CScape-somatic: distinguishing driver and passenger point mutations in the cancer genome Rogers, Mark F Gaunt, Tom R Campbell, Colin Bioinformatics Original Papers MOTIVATION: Next-generation sequencing technologies have accelerated the discovery of single nucleotide variants in the human genome, stimulating the development of predictors for classifying which of these variants are likely functional in disease, and which neutral. Recently, we proposed CScape, a method for discriminating between cancer driver mutations and presumed benign variants. For the neutral class, this method relied on benign germline variants found in the 1000 Genomes Project database. Discrimination could, therefore, be influenced by the distinction of germline versus somatic, rather than neutral versus disease driver. This motivates this article in which we consider predictive discrimination between recurrent and rare somatic single point mutations based solely on using cancer data, and the distinction between these two somatic classes and germline single point mutations. RESULTS: For somatic point mutations in coding and non-coding regions of the genome, we propose CScape-somatic, an integrative classifier for predictively discriminating between recurrent and rare variants in the human cancer genome. In this study, we use purely cancer genome data and investigate the distinction between minimal occurrence and significantly recurrent somatic single point mutations in the human cancer genome. We show that this type of predictive distinction can give novel insight, and may deliver more meaningful prediction in both coding and non-coding regions of the cancer genome. Tested on somatic mutations, CScape-somatic outperforms alternative methods, reaching 74% balanced accuracy in coding regions and 69% in non-coding regions, whereas even higher accuracy may be achieved using thresholds to isolate high-confidence predictions. AVAILABILITY AND IMPLEMENTATION: Predictions and software are available at http://CScape-somatic.biocompute.org.uk/. CONTACT: mark.f.rogers.phd@gmail.com or C.Campbell@bristol.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-06-15 2020-04-13 /pmc/articles/PMC7320610/ /pubmed/32282885 http://dx.doi.org/10.1093/bioinformatics/btaa242 Text en © The Author(s) 2020. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Rogers, Mark F
Gaunt, Tom R
Campbell, Colin
CScape-somatic: distinguishing driver and passenger point mutations in the cancer genome
title CScape-somatic: distinguishing driver and passenger point mutations in the cancer genome
title_full CScape-somatic: distinguishing driver and passenger point mutations in the cancer genome
title_fullStr CScape-somatic: distinguishing driver and passenger point mutations in the cancer genome
title_full_unstemmed CScape-somatic: distinguishing driver and passenger point mutations in the cancer genome
title_short CScape-somatic: distinguishing driver and passenger point mutations in the cancer genome
title_sort cscape-somatic: distinguishing driver and passenger point mutations in the cancer genome
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7320610/
https://www.ncbi.nlm.nih.gov/pubmed/32282885
http://dx.doi.org/10.1093/bioinformatics/btaa242
work_keys_str_mv AT rogersmarkf cscapesomaticdistinguishingdriverandpassengerpointmutationsinthecancergenome
AT gaunttomr cscapesomaticdistinguishingdriverandpassengerpointmutationsinthecancergenome
AT campbellcolin cscapesomaticdistinguishingdriverandpassengerpointmutationsinthecancergenome