Cargando…
SomInaClust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering
BACKGROUND: With the advances in high throughput technologies, increasing amounts of cancer somatic mutation data are being generated and made available. Only a small number of (driver) mutations occur in driver genes and are responsible for carcinogenesis, while the majority of (passenger) mutation...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4410004/ https://www.ncbi.nlm.nih.gov/pubmed/25903787 http://dx.doi.org/10.1186/s12859-015-0555-7 |
_version_ | 1782368265567207424 |
---|---|
author | Van den Eynden, Jimmy Fierro, Ana Carolina Verbeke, Lieven PC Marchal, Kathleen |
author_facet | Van den Eynden, Jimmy Fierro, Ana Carolina Verbeke, Lieven PC Marchal, Kathleen |
author_sort | Van den Eynden, Jimmy |
collection | PubMed |
description | BACKGROUND: With the advances in high throughput technologies, increasing amounts of cancer somatic mutation data are being generated and made available. Only a small number of (driver) mutations occur in driver genes and are responsible for carcinogenesis, while the majority of (passenger) mutations do not influence tumour biology. In this study, SomInaClust is introduced, a method that accurately identifies driver genes based on their mutation pattern across tumour samples and then classifies them into oncogenes or tumour suppressor genes respectively. RESULTS: SomInaClust starts from the observation that oncogenes mainly contain mutations that, due to positive selection, cluster at similar positions in a gene across patient samples, whereas tumour suppressor genes contain a high number of protein-truncating mutations throughout the entire gene length. The method was shown to prioritize driver genes in 9 different solid cancers. Furthermore it was found to be complementary to existing similar-purpose methods with the additional advantages that it has a higher sensitivity, also for rare mutations (occurring in less than 1% of all samples), and it accurately classifies candidate driver genes in putative oncogenes and tumour suppressor genes. Pathway enrichment analysis showed that the identified genes belong to known cancer signalling pathways, and that the distinction between oncogenes and tumour suppressor genes is biologically relevant. CONCLUSIONS: SomInaClust was shown to detect candidate driver genes based on somatic mutation patterns of inactivation and clustering and to distinguish oncogenes from tumour suppressor genes. The method could be used for the identification of new cancer genes or to filter mutation data for further data-integration purposes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0555-7) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4410004 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-44100042015-04-27 SomInaClust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering Van den Eynden, Jimmy Fierro, Ana Carolina Verbeke, Lieven PC Marchal, Kathleen BMC Bioinformatics Methodology Article BACKGROUND: With the advances in high throughput technologies, increasing amounts of cancer somatic mutation data are being generated and made available. Only a small number of (driver) mutations occur in driver genes and are responsible for carcinogenesis, while the majority of (passenger) mutations do not influence tumour biology. In this study, SomInaClust is introduced, a method that accurately identifies driver genes based on their mutation pattern across tumour samples and then classifies them into oncogenes or tumour suppressor genes respectively. RESULTS: SomInaClust starts from the observation that oncogenes mainly contain mutations that, due to positive selection, cluster at similar positions in a gene across patient samples, whereas tumour suppressor genes contain a high number of protein-truncating mutations throughout the entire gene length. The method was shown to prioritize driver genes in 9 different solid cancers. Furthermore it was found to be complementary to existing similar-purpose methods with the additional advantages that it has a higher sensitivity, also for rare mutations (occurring in less than 1% of all samples), and it accurately classifies candidate driver genes in putative oncogenes and tumour suppressor genes. Pathway enrichment analysis showed that the identified genes belong to known cancer signalling pathways, and that the distinction between oncogenes and tumour suppressor genes is biologically relevant. CONCLUSIONS: SomInaClust was shown to detect candidate driver genes based on somatic mutation patterns of inactivation and clustering and to distinguish oncogenes from tumour suppressor genes. The method could be used for the identification of new cancer genes or to filter mutation data for further data-integration purposes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0555-7) contains supplementary material, which is available to authorized users. BioMed Central 2015-04-23 /pmc/articles/PMC4410004/ /pubmed/25903787 http://dx.doi.org/10.1186/s12859-015-0555-7 Text en © Van den Eynden et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Van den Eynden, Jimmy Fierro, Ana Carolina Verbeke, Lieven PC Marchal, Kathleen SomInaClust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering |
title | SomInaClust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering |
title_full | SomInaClust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering |
title_fullStr | SomInaClust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering |
title_full_unstemmed | SomInaClust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering |
title_short | SomInaClust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering |
title_sort | sominaclust: detection of cancer genes based on somatic mutation patterns of inactivation and clustering |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4410004/ https://www.ncbi.nlm.nih.gov/pubmed/25903787 http://dx.doi.org/10.1186/s12859-015-0555-7 |
work_keys_str_mv | AT vandeneyndenjimmy sominaclustdetectionofcancergenesbasedonsomaticmutationpatternsofinactivationandclustering AT fierroanacarolina sominaclustdetectionofcancergenesbasedonsomaticmutationpatternsofinactivationandclustering AT verbekelievenpc sominaclustdetectionofcancergenesbasedonsomaticmutationpatternsofinactivationandclustering AT marchalkathleen sominaclustdetectionofcancergenesbasedonsomaticmutationpatternsofinactivationandclustering |