Cargando…

A new machine learning method for cancer mutation analysis

It is complicated to identify cancer-causing mutations. The recurrence of a mutation in patients remains one of the most reliable features of mutation driver status. However, some mutations are more likely to happen than others for various reasons. Different sequencing analysis has revealed that can...

Descripción completa

Detalles Bibliográficos
Autores principales: Habibi, Mahnaz, Taheri, Golnaz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9612828/
https://www.ncbi.nlm.nih.gov/pubmed/36251702
http://dx.doi.org/10.1371/journal.pcbi.1010332
_version_ 1784819851290738688
author Habibi, Mahnaz
Taheri, Golnaz
author_facet Habibi, Mahnaz
Taheri, Golnaz
author_sort Habibi, Mahnaz
collection PubMed
description It is complicated to identify cancer-causing mutations. The recurrence of a mutation in patients remains one of the most reliable features of mutation driver status. However, some mutations are more likely to happen than others for various reasons. Different sequencing analysis has revealed that cancer driver genes operate across complex pathways and networks, with mutations often arising in a mutually exclusive pattern. Genes with low-frequency mutations are understudied as cancer-related genes, especially in the context of networks. Here we propose a machine learning method to study the functionality of mutually exclusive genes in the networks derived from mutation associations, gene-gene interactions, and graph clustering. These networks have indicated critical biological components in the essential pathways, especially those mutated at low frequency. Studying the network and not just the impact of a single gene significantly increases the statistical power of clinical analysis. The proposed method identified important driver genes with different frequencies. We studied the function and the associated pathways in which the candidate driver genes participate. By introducing lower-frequency genes, we recognized less studied cancer-related pathways. We also proposed a novel clustering method to specify driver modules. We evaluated each driver module with different criteria, including the terms of biological processes and the number of simultaneous mutations in each cancer. Materials and implementations are available at: https://github.com/MahnazHabibi/MutationAnalysis.
format Online
Article
Text
id pubmed-9612828
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-96128282022-10-28 A new machine learning method for cancer mutation analysis Habibi, Mahnaz Taheri, Golnaz PLoS Comput Biol Research Article It is complicated to identify cancer-causing mutations. The recurrence of a mutation in patients remains one of the most reliable features of mutation driver status. However, some mutations are more likely to happen than others for various reasons. Different sequencing analysis has revealed that cancer driver genes operate across complex pathways and networks, with mutations often arising in a mutually exclusive pattern. Genes with low-frequency mutations are understudied as cancer-related genes, especially in the context of networks. Here we propose a machine learning method to study the functionality of mutually exclusive genes in the networks derived from mutation associations, gene-gene interactions, and graph clustering. These networks have indicated critical biological components in the essential pathways, especially those mutated at low frequency. Studying the network and not just the impact of a single gene significantly increases the statistical power of clinical analysis. The proposed method identified important driver genes with different frequencies. We studied the function and the associated pathways in which the candidate driver genes participate. By introducing lower-frequency genes, we recognized less studied cancer-related pathways. We also proposed a novel clustering method to specify driver modules. We evaluated each driver module with different criteria, including the terms of biological processes and the number of simultaneous mutations in each cancer. Materials and implementations are available at: https://github.com/MahnazHabibi/MutationAnalysis. Public Library of Science 2022-10-17 /pmc/articles/PMC9612828/ /pubmed/36251702 http://dx.doi.org/10.1371/journal.pcbi.1010332 Text en © 2022 Habibi, Taheri https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Habibi, Mahnaz
Taheri, Golnaz
A new machine learning method for cancer mutation analysis
title A new machine learning method for cancer mutation analysis
title_full A new machine learning method for cancer mutation analysis
title_fullStr A new machine learning method for cancer mutation analysis
title_full_unstemmed A new machine learning method for cancer mutation analysis
title_short A new machine learning method for cancer mutation analysis
title_sort new machine learning method for cancer mutation analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9612828/
https://www.ncbi.nlm.nih.gov/pubmed/36251702
http://dx.doi.org/10.1371/journal.pcbi.1010332
work_keys_str_mv AT habibimahnaz anewmachinelearningmethodforcancermutationanalysis
AT taherigolnaz anewmachinelearningmethodforcancermutationanalysis
AT habibimahnaz newmachinelearningmethodforcancermutationanalysis
AT taherigolnaz newmachinelearningmethodforcancermutationanalysis