Cargando…

Pan-cancer integrative analysis of whole-genome De novo somatic point mutations reveals 17 cancer types

BACKGROUND: The advent of high throughput sequencing has enabled researchers to systematically evaluate the genetic variations in cancer, identifying many cancer-associated genes. Although cancers in the same tissue are widely categorized in the same group, they demonstrate many differences concerni...

Descripción completa

Detalles Bibliográficos
Autores principales: Ghareyazi, Amin, Kazemi, Amirreza, Hamidieh, Kimia, Dashti, Hamed, Tahaei, Maedeh Sadat, Rabiee, Hamid R., Alinejad-Rokny, Hamid, Dehzangi, Iman
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9316662/
https://www.ncbi.nlm.nih.gov/pubmed/35879674
http://dx.doi.org/10.1186/s12859-022-04840-6
_version_ 1784754870314598400
author Ghareyazi, Amin
Kazemi, Amirreza
Hamidieh, Kimia
Dashti, Hamed
Tahaei, Maedeh Sadat
Rabiee, Hamid R.
Alinejad-Rokny, Hamid
Dehzangi, Iman
author_facet Ghareyazi, Amin
Kazemi, Amirreza
Hamidieh, Kimia
Dashti, Hamed
Tahaei, Maedeh Sadat
Rabiee, Hamid R.
Alinejad-Rokny, Hamid
Dehzangi, Iman
author_sort Ghareyazi, Amin
collection PubMed
description BACKGROUND: The advent of high throughput sequencing has enabled researchers to systematically evaluate the genetic variations in cancer, identifying many cancer-associated genes. Although cancers in the same tissue are widely categorized in the same group, they demonstrate many differences concerning their mutational profiles. Hence, there is no definitive treatment for most cancer types. This reveals the importance of developing new pipelines to identify cancer-associated genes accurately and re-classify patients with similar mutational profiles. Classification of cancer patients with similar mutational profiles may help discover subtypes of cancer patients who might benefit from specific treatment types. RESULTS: In this study, we propose a new machine learning pipeline to identify protein-coding genes mutated in many samples to identify cancer subtypes. We apply our pipeline to 12,270 samples collected from the international cancer genome consortium, covering 19 cancer types. As a result, we identify 17 different cancer subtypes. Comprehensive phenotypic and genotypic analysis indicates distinguishable properties, including unique cancer-related signaling pathways. CONCLUSIONS: This new subtyping approach offers a novel opportunity for cancer drug development based on the mutational profile of patients. Additionally, we analyze the mutational signatures for samples in each subtype, which provides important insight into their active molecular mechanisms. Some of the pathways we identified in most subtypes, including the cell cycle and the Axon guidance pathways, are frequently observed in cancer disease. Interestingly, we also identified several mutated genes and different rates of mutation in multiple cancer subtypes. In addition, our study on “gene-motif” suggests the importance of considering both the context of the mutations and mutational processes in identifying cancer-associated genes. The source codes for our proposed clustering pipeline and analysis are publicly available at: https://github.com/bcb-sut/Pan-Cancer. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04840-6.
format Online
Article
Text
id pubmed-9316662
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-93166622022-07-27 Pan-cancer integrative analysis of whole-genome De novo somatic point mutations reveals 17 cancer types Ghareyazi, Amin Kazemi, Amirreza Hamidieh, Kimia Dashti, Hamed Tahaei, Maedeh Sadat Rabiee, Hamid R. Alinejad-Rokny, Hamid Dehzangi, Iman BMC Bioinformatics Research BACKGROUND: The advent of high throughput sequencing has enabled researchers to systematically evaluate the genetic variations in cancer, identifying many cancer-associated genes. Although cancers in the same tissue are widely categorized in the same group, they demonstrate many differences concerning their mutational profiles. Hence, there is no definitive treatment for most cancer types. This reveals the importance of developing new pipelines to identify cancer-associated genes accurately and re-classify patients with similar mutational profiles. Classification of cancer patients with similar mutational profiles may help discover subtypes of cancer patients who might benefit from specific treatment types. RESULTS: In this study, we propose a new machine learning pipeline to identify protein-coding genes mutated in many samples to identify cancer subtypes. We apply our pipeline to 12,270 samples collected from the international cancer genome consortium, covering 19 cancer types. As a result, we identify 17 different cancer subtypes. Comprehensive phenotypic and genotypic analysis indicates distinguishable properties, including unique cancer-related signaling pathways. CONCLUSIONS: This new subtyping approach offers a novel opportunity for cancer drug development based on the mutational profile of patients. Additionally, we analyze the mutational signatures for samples in each subtype, which provides important insight into their active molecular mechanisms. Some of the pathways we identified in most subtypes, including the cell cycle and the Axon guidance pathways, are frequently observed in cancer disease. Interestingly, we also identified several mutated genes and different rates of mutation in multiple cancer subtypes. In addition, our study on “gene-motif” suggests the importance of considering both the context of the mutations and mutational processes in identifying cancer-associated genes. The source codes for our proposed clustering pipeline and analysis are publicly available at: https://github.com/bcb-sut/Pan-Cancer. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04840-6. BioMed Central 2022-07-25 /pmc/articles/PMC9316662/ /pubmed/35879674 http://dx.doi.org/10.1186/s12859-022-04840-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Ghareyazi, Amin
Kazemi, Amirreza
Hamidieh, Kimia
Dashti, Hamed
Tahaei, Maedeh Sadat
Rabiee, Hamid R.
Alinejad-Rokny, Hamid
Dehzangi, Iman
Pan-cancer integrative analysis of whole-genome De novo somatic point mutations reveals 17 cancer types
title Pan-cancer integrative analysis of whole-genome De novo somatic point mutations reveals 17 cancer types
title_full Pan-cancer integrative analysis of whole-genome De novo somatic point mutations reveals 17 cancer types
title_fullStr Pan-cancer integrative analysis of whole-genome De novo somatic point mutations reveals 17 cancer types
title_full_unstemmed Pan-cancer integrative analysis of whole-genome De novo somatic point mutations reveals 17 cancer types
title_short Pan-cancer integrative analysis of whole-genome De novo somatic point mutations reveals 17 cancer types
title_sort pan-cancer integrative analysis of whole-genome de novo somatic point mutations reveals 17 cancer types
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9316662/
https://www.ncbi.nlm.nih.gov/pubmed/35879674
http://dx.doi.org/10.1186/s12859-022-04840-6
work_keys_str_mv AT ghareyaziamin pancancerintegrativeanalysisofwholegenomedenovosomaticpointmutationsreveals17cancertypes
AT kazemiamirreza pancancerintegrativeanalysisofwholegenomedenovosomaticpointmutationsreveals17cancertypes
AT hamidiehkimia pancancerintegrativeanalysisofwholegenomedenovosomaticpointmutationsreveals17cancertypes
AT dashtihamed pancancerintegrativeanalysisofwholegenomedenovosomaticpointmutationsreveals17cancertypes
AT tahaeimaedehsadat pancancerintegrativeanalysisofwholegenomedenovosomaticpointmutationsreveals17cancertypes
AT rabieehamidr pancancerintegrativeanalysisofwholegenomedenovosomaticpointmutationsreveals17cancertypes
AT alinejadroknyhamid pancancerintegrativeanalysisofwholegenomedenovosomaticpointmutationsreveals17cancertypes
AT dehzangiiman pancancerintegrativeanalysisofwholegenomedenovosomaticpointmutationsreveals17cancertypes