Cargando…
A new machine learning method for cancer mutation analysis
It is complicated to identify cancer-causing mutations. The recurrence of a mutation in patients remains one of the most reliable features of mutation driver status. However, some mutations are more likely to happen than others for various reasons. Different sequencing analysis has revealed that can...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9612828/ https://www.ncbi.nlm.nih.gov/pubmed/36251702 http://dx.doi.org/10.1371/journal.pcbi.1010332 |
_version_ | 1784819851290738688 |
---|---|
author | Habibi, Mahnaz Taheri, Golnaz |
author_facet | Habibi, Mahnaz Taheri, Golnaz |
author_sort | Habibi, Mahnaz |
collection | PubMed |
description | It is complicated to identify cancer-causing mutations. The recurrence of a mutation in patients remains one of the most reliable features of mutation driver status. However, some mutations are more likely to happen than others for various reasons. Different sequencing analysis has revealed that cancer driver genes operate across complex pathways and networks, with mutations often arising in a mutually exclusive pattern. Genes with low-frequency mutations are understudied as cancer-related genes, especially in the context of networks. Here we propose a machine learning method to study the functionality of mutually exclusive genes in the networks derived from mutation associations, gene-gene interactions, and graph clustering. These networks have indicated critical biological components in the essential pathways, especially those mutated at low frequency. Studying the network and not just the impact of a single gene significantly increases the statistical power of clinical analysis. The proposed method identified important driver genes with different frequencies. We studied the function and the associated pathways in which the candidate driver genes participate. By introducing lower-frequency genes, we recognized less studied cancer-related pathways. We also proposed a novel clustering method to specify driver modules. We evaluated each driver module with different criteria, including the terms of biological processes and the number of simultaneous mutations in each cancer. Materials and implementations are available at: https://github.com/MahnazHabibi/MutationAnalysis. |
format | Online Article Text |
id | pubmed-9612828 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-96128282022-10-28 A new machine learning method for cancer mutation analysis Habibi, Mahnaz Taheri, Golnaz PLoS Comput Biol Research Article It is complicated to identify cancer-causing mutations. The recurrence of a mutation in patients remains one of the most reliable features of mutation driver status. However, some mutations are more likely to happen than others for various reasons. Different sequencing analysis has revealed that cancer driver genes operate across complex pathways and networks, with mutations often arising in a mutually exclusive pattern. Genes with low-frequency mutations are understudied as cancer-related genes, especially in the context of networks. Here we propose a machine learning method to study the functionality of mutually exclusive genes in the networks derived from mutation associations, gene-gene interactions, and graph clustering. These networks have indicated critical biological components in the essential pathways, especially those mutated at low frequency. Studying the network and not just the impact of a single gene significantly increases the statistical power of clinical analysis. The proposed method identified important driver genes with different frequencies. We studied the function and the associated pathways in which the candidate driver genes participate. By introducing lower-frequency genes, we recognized less studied cancer-related pathways. We also proposed a novel clustering method to specify driver modules. We evaluated each driver module with different criteria, including the terms of biological processes and the number of simultaneous mutations in each cancer. Materials and implementations are available at: https://github.com/MahnazHabibi/MutationAnalysis. Public Library of Science 2022-10-17 /pmc/articles/PMC9612828/ /pubmed/36251702 http://dx.doi.org/10.1371/journal.pcbi.1010332 Text en © 2022 Habibi, Taheri https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Habibi, Mahnaz Taheri, Golnaz A new machine learning method for cancer mutation analysis |
title | A new machine learning method for cancer mutation analysis |
title_full | A new machine learning method for cancer mutation analysis |
title_fullStr | A new machine learning method for cancer mutation analysis |
title_full_unstemmed | A new machine learning method for cancer mutation analysis |
title_short | A new machine learning method for cancer mutation analysis |
title_sort | new machine learning method for cancer mutation analysis |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9612828/ https://www.ncbi.nlm.nih.gov/pubmed/36251702 http://dx.doi.org/10.1371/journal.pcbi.1010332 |
work_keys_str_mv | AT habibimahnaz anewmachinelearningmethodforcancermutationanalysis AT taherigolnaz anewmachinelearningmethodforcancermutationanalysis AT habibimahnaz newmachinelearningmethodforcancermutationanalysis AT taherigolnaz newmachinelearningmethodforcancermutationanalysis |