Cargando…
Identifying overlapping mutated driver pathways by constructing gene networks in cancer
BACKGROUND: Large-scale cancer genomic projects are providing lots of data on genomic, epigenomic and gene expression aberrations in many cancer types. One key challenge is to detect functional driver pathways and to filter out nonfunctional passenger genes in cancer genomics. Vandin et al. introduc...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4402595/ https://www.ncbi.nlm.nih.gov/pubmed/25859819 http://dx.doi.org/10.1186/1471-2105-16-S5-S3 |
_version_ | 1782367284688322560 |
---|---|
author | Wu, Hao Gao, Lin Li, Feng Song, Fei Yang, Xiaofei Kasabov, Nikola |
author_facet | Wu, Hao Gao, Lin Li, Feng Song, Fei Yang, Xiaofei Kasabov, Nikola |
author_sort | Wu, Hao |
collection | PubMed |
description | BACKGROUND: Large-scale cancer genomic projects are providing lots of data on genomic, epigenomic and gene expression aberrations in many cancer types. One key challenge is to detect functional driver pathways and to filter out nonfunctional passenger genes in cancer genomics. Vandin et al. introduced the Maximum Weight Sub-matrix Problem to find driver pathways and showed that it is an NP-hard problem. METHODS: To find a better solution and solve the problem more efficiently, we present a network-based method (NBM) to detect overlapping driver pathways automatically. This algorithm can directly find driver pathways or gene sets de novo from somatic mutation data utilizing two combinatorial properties, high coverage and high exclusivity, without any prior information. We firstly construct gene networks based on the approximate exclusivity between each pair of genes using somatic mutation data from many cancer patients. Secondly, we present a new greedy strategy to add or remove genes for obtaining overlapping gene sets with driver mutations according to the properties of high exclusivity and high coverage. RESULTS: To assess the efficiency of the proposed NBM, we apply the method on simulated data and compare results obtained from the NBM, RME, Dendrix and Multi-Dendrix. NBM obtains optimal results in less than nine seconds on a conventional computer and the time complexity is much less than the three other methods. To further verify the performance of NBM, we apply the method to analyze somatic mutation data from five real biological data sets such as the mutation profiles of 90 glioblastoma tumor samples and 163 lung carcinoma samples. NBM detects groups of genes which overlap with known pathways, including P53, RB and RTK/RAS/PI(3)K signaling pathways. New gene sets with p-value less than 1e-3 are found from the somatic mutation data. CONCLUSIONS: NBM can detect more biologically relevant gene sets. Results show that NBM outperforms other algorithms for detecting driver pathways or gene sets. Further research will be conducted with the use of novel machine learning techniques. |
format | Online Article Text |
id | pubmed-4402595 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-44025952015-04-29 Identifying overlapping mutated driver pathways by constructing gene networks in cancer Wu, Hao Gao, Lin Li, Feng Song, Fei Yang, Xiaofei Kasabov, Nikola BMC Bioinformatics Proceedings BACKGROUND: Large-scale cancer genomic projects are providing lots of data on genomic, epigenomic and gene expression aberrations in many cancer types. One key challenge is to detect functional driver pathways and to filter out nonfunctional passenger genes in cancer genomics. Vandin et al. introduced the Maximum Weight Sub-matrix Problem to find driver pathways and showed that it is an NP-hard problem. METHODS: To find a better solution and solve the problem more efficiently, we present a network-based method (NBM) to detect overlapping driver pathways automatically. This algorithm can directly find driver pathways or gene sets de novo from somatic mutation data utilizing two combinatorial properties, high coverage and high exclusivity, without any prior information. We firstly construct gene networks based on the approximate exclusivity between each pair of genes using somatic mutation data from many cancer patients. Secondly, we present a new greedy strategy to add or remove genes for obtaining overlapping gene sets with driver mutations according to the properties of high exclusivity and high coverage. RESULTS: To assess the efficiency of the proposed NBM, we apply the method on simulated data and compare results obtained from the NBM, RME, Dendrix and Multi-Dendrix. NBM obtains optimal results in less than nine seconds on a conventional computer and the time complexity is much less than the three other methods. To further verify the performance of NBM, we apply the method to analyze somatic mutation data from five real biological data sets such as the mutation profiles of 90 glioblastoma tumor samples and 163 lung carcinoma samples. NBM detects groups of genes which overlap with known pathways, including P53, RB and RTK/RAS/PI(3)K signaling pathways. New gene sets with p-value less than 1e-3 are found from the somatic mutation data. CONCLUSIONS: NBM can detect more biologically relevant gene sets. Results show that NBM outperforms other algorithms for detecting driver pathways or gene sets. Further research will be conducted with the use of novel machine learning techniques. BioMed Central 2015-03-18 /pmc/articles/PMC4402595/ /pubmed/25859819 http://dx.doi.org/10.1186/1471-2105-16-S5-S3 Text en Copyright © 2015 Wu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Proceedings Wu, Hao Gao, Lin Li, Feng Song, Fei Yang, Xiaofei Kasabov, Nikola Identifying overlapping mutated driver pathways by constructing gene networks in cancer |
title | Identifying overlapping mutated driver pathways by constructing gene networks in cancer |
title_full | Identifying overlapping mutated driver pathways by constructing gene networks in cancer |
title_fullStr | Identifying overlapping mutated driver pathways by constructing gene networks in cancer |
title_full_unstemmed | Identifying overlapping mutated driver pathways by constructing gene networks in cancer |
title_short | Identifying overlapping mutated driver pathways by constructing gene networks in cancer |
title_sort | identifying overlapping mutated driver pathways by constructing gene networks in cancer |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4402595/ https://www.ncbi.nlm.nih.gov/pubmed/25859819 http://dx.doi.org/10.1186/1471-2105-16-S5-S3 |
work_keys_str_mv | AT wuhao identifyingoverlappingmutateddriverpathwaysbyconstructinggenenetworksincancer AT gaolin identifyingoverlappingmutateddriverpathwaysbyconstructinggenenetworksincancer AT lifeng identifyingoverlappingmutateddriverpathwaysbyconstructinggenenetworksincancer AT songfei identifyingoverlappingmutateddriverpathwaysbyconstructinggenenetworksincancer AT yangxiaofei identifyingoverlappingmutateddriverpathwaysbyconstructinggenenetworksincancer AT kasabovnikola identifyingoverlappingmutateddriverpathwaysbyconstructinggenenetworksincancer |