Cargando…

SomInaClust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering

BACKGROUND: With the advances in high throughput technologies, increasing amounts of cancer somatic mutation data are being generated and made available. Only a small number of (driver) mutations occur in driver genes and are responsible for carcinogenesis, while the majority of (passenger) mutation...

Descripción completa

Detalles Bibliográficos
Autores principales: Van den Eynden, Jimmy, Fierro, Ana Carolina, Verbeke, Lieven PC, Marchal, Kathleen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4410004/
https://www.ncbi.nlm.nih.gov/pubmed/25903787
http://dx.doi.org/10.1186/s12859-015-0555-7
_version_ 1782368265567207424
author Van den Eynden, Jimmy
Fierro, Ana Carolina
Verbeke, Lieven PC
Marchal, Kathleen
author_facet Van den Eynden, Jimmy
Fierro, Ana Carolina
Verbeke, Lieven PC
Marchal, Kathleen
author_sort Van den Eynden, Jimmy
collection PubMed
description BACKGROUND: With the advances in high throughput technologies, increasing amounts of cancer somatic mutation data are being generated and made available. Only a small number of (driver) mutations occur in driver genes and are responsible for carcinogenesis, while the majority of (passenger) mutations do not influence tumour biology. In this study, SomInaClust is introduced, a method that accurately identifies driver genes based on their mutation pattern across tumour samples and then classifies them into oncogenes or tumour suppressor genes respectively. RESULTS: SomInaClust starts from the observation that oncogenes mainly contain mutations that, due to positive selection, cluster at similar positions in a gene across patient samples, whereas tumour suppressor genes contain a high number of protein-truncating mutations throughout the entire gene length. The method was shown to prioritize driver genes in 9 different solid cancers. Furthermore it was found to be complementary to existing similar-purpose methods with the additional advantages that it has a higher sensitivity, also for rare mutations (occurring in less than 1% of all samples), and it accurately classifies candidate driver genes in putative oncogenes and tumour suppressor genes. Pathway enrichment analysis showed that the identified genes belong to known cancer signalling pathways, and that the distinction between oncogenes and tumour suppressor genes is biologically relevant. CONCLUSIONS: SomInaClust was shown to detect candidate driver genes based on somatic mutation patterns of inactivation and clustering and to distinguish oncogenes from tumour suppressor genes. The method could be used for the identification of new cancer genes or to filter mutation data for further data-integration purposes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0555-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4410004
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44100042015-04-27 SomInaClust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering Van den Eynden, Jimmy Fierro, Ana Carolina Verbeke, Lieven PC Marchal, Kathleen BMC Bioinformatics Methodology Article BACKGROUND: With the advances in high throughput technologies, increasing amounts of cancer somatic mutation data are being generated and made available. Only a small number of (driver) mutations occur in driver genes and are responsible for carcinogenesis, while the majority of (passenger) mutations do not influence tumour biology. In this study, SomInaClust is introduced, a method that accurately identifies driver genes based on their mutation pattern across tumour samples and then classifies them into oncogenes or tumour suppressor genes respectively. RESULTS: SomInaClust starts from the observation that oncogenes mainly contain mutations that, due to positive selection, cluster at similar positions in a gene across patient samples, whereas tumour suppressor genes contain a high number of protein-truncating mutations throughout the entire gene length. The method was shown to prioritize driver genes in 9 different solid cancers. Furthermore it was found to be complementary to existing similar-purpose methods with the additional advantages that it has a higher sensitivity, also for rare mutations (occurring in less than 1% of all samples), and it accurately classifies candidate driver genes in putative oncogenes and tumour suppressor genes. Pathway enrichment analysis showed that the identified genes belong to known cancer signalling pathways, and that the distinction between oncogenes and tumour suppressor genes is biologically relevant. CONCLUSIONS: SomInaClust was shown to detect candidate driver genes based on somatic mutation patterns of inactivation and clustering and to distinguish oncogenes from tumour suppressor genes. The method could be used for the identification of new cancer genes or to filter mutation data for further data-integration purposes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0555-7) contains supplementary material, which is available to authorized users. BioMed Central 2015-04-23 /pmc/articles/PMC4410004/ /pubmed/25903787 http://dx.doi.org/10.1186/s12859-015-0555-7 Text en © Van den Eynden et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Van den Eynden, Jimmy
Fierro, Ana Carolina
Verbeke, Lieven PC
Marchal, Kathleen
SomInaClust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering
title SomInaClust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering
title_full SomInaClust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering
title_fullStr SomInaClust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering
title_full_unstemmed SomInaClust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering
title_short SomInaClust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering
title_sort sominaclust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4410004/
https://www.ncbi.nlm.nih.gov/pubmed/25903787
http://dx.doi.org/10.1186/s12859-015-0555-7
work_keys_str_mv AT vandeneyndenjimmy sominaclustdetectionofcancergenesbasedonsomaticmutationpatternsofinactivationandclustering
AT fierroanacarolina sominaclustdetectionofcancergenesbasedonsomaticmutationpatternsofinactivationandclustering
AT verbekelievenpc sominaclustdetectionofcancergenesbasedonsomaticmutationpatternsofinactivationandclustering
AT marchalkathleen sominaclustdetectionofcancergenesbasedonsomaticmutationpatternsofinactivationandclustering