Cargando…

Machine Learning-Based Analysis of Glioma Grades Reveals Co-Enrichment

SIMPLE SUMMARY: Gliomas are heterogenous types of cancer, therefore the therapy should be personalized and targeted toward specific pathways. We developed a methodology that corrected strong batch effects from The Cancer Genome Atlas datasets and estimated glioma grade-specific co-enrichment mechani...

Descripción completa

Detalles Bibliográficos
Autores principales: Garbulowski, Mateusz, Smolinska, Karolina, Çabuk, Uğur, Yones, Sara A., Celli, Ludovica, Yaz, Esma Nur, Barrenäs, Fredrik, Diamanti, Klev, Wadelius, Claes, Komorowski, Jan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8870250/
https://www.ncbi.nlm.nih.gov/pubmed/35205761
http://dx.doi.org/10.3390/cancers14041014
_version_ 1784656694376136704
author Garbulowski, Mateusz
Smolinska, Karolina
Çabuk, Uğur
Yones, Sara A.
Celli, Ludovica
Yaz, Esma Nur
Barrenäs, Fredrik
Diamanti, Klev
Wadelius, Claes
Komorowski, Jan
author_facet Garbulowski, Mateusz
Smolinska, Karolina
Çabuk, Uğur
Yones, Sara A.
Celli, Ludovica
Yaz, Esma Nur
Barrenäs, Fredrik
Diamanti, Klev
Wadelius, Claes
Komorowski, Jan
author_sort Garbulowski, Mateusz
collection PubMed
description SIMPLE SUMMARY: Gliomas are heterogenous types of cancer, therefore the therapy should be personalized and targeted toward specific pathways. We developed a methodology that corrected strong batch effects from The Cancer Genome Atlas datasets and estimated glioma grade-specific co-enrichment mechanisms using machine learning. Our findings created hypotheses for annotations, e.g., pathways, that should be considered as therapeutic targets. ABSTRACT: Gliomas develop and grow in the brain and central nervous system. Examining glioma grading processes is valuable for improving therapeutic challenges. One of the most extensive repositories storing transcriptomics data for gliomas is The Cancer Genome Atlas (TCGA). However, such big cohorts should be processed with caution and evaluated thoroughly as they can contain batch and other effects. Furthermore, biological mechanisms of cancer contain interactions among biomarkers. Thus, we applied an interpretable machine learning approach to discover such relationships. This type of transparent learning provides not only good predictability, but also reveals co-predictive mechanisms among features. In this study, we corrected the strong and confounded batch effect in the TCGA glioma data. We further used the corrected datasets to perform comprehensive machine learning analysis applied on single-sample gene set enrichment scores using collections from the Molecular Signature Database. Furthermore, using rule-based classifiers, we displayed networks of co-enrichment related to glioma grades. Moreover, we validated our results using the external glioma cohorts. We believe that utilizing corrected glioma cohorts from TCGA may improve the application and validation of any future studies. Finally, the co-enrichment and survival analysis provided detailed explanations for glioma progression and consequently, it should support the targeted treatment.
format Online
Article
Text
id pubmed-8870250
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-88702502022-02-25 Machine Learning-Based Analysis of Glioma Grades Reveals Co-Enrichment Garbulowski, Mateusz Smolinska, Karolina Çabuk, Uğur Yones, Sara A. Celli, Ludovica Yaz, Esma Nur Barrenäs, Fredrik Diamanti, Klev Wadelius, Claes Komorowski, Jan Cancers (Basel) Article SIMPLE SUMMARY: Gliomas are heterogenous types of cancer, therefore the therapy should be personalized and targeted toward specific pathways. We developed a methodology that corrected strong batch effects from The Cancer Genome Atlas datasets and estimated glioma grade-specific co-enrichment mechanisms using machine learning. Our findings created hypotheses for annotations, e.g., pathways, that should be considered as therapeutic targets. ABSTRACT: Gliomas develop and grow in the brain and central nervous system. Examining glioma grading processes is valuable for improving therapeutic challenges. One of the most extensive repositories storing transcriptomics data for gliomas is The Cancer Genome Atlas (TCGA). However, such big cohorts should be processed with caution and evaluated thoroughly as they can contain batch and other effects. Furthermore, biological mechanisms of cancer contain interactions among biomarkers. Thus, we applied an interpretable machine learning approach to discover such relationships. This type of transparent learning provides not only good predictability, but also reveals co-predictive mechanisms among features. In this study, we corrected the strong and confounded batch effect in the TCGA glioma data. We further used the corrected datasets to perform comprehensive machine learning analysis applied on single-sample gene set enrichment scores using collections from the Molecular Signature Database. Furthermore, using rule-based classifiers, we displayed networks of co-enrichment related to glioma grades. Moreover, we validated our results using the external glioma cohorts. We believe that utilizing corrected glioma cohorts from TCGA may improve the application and validation of any future studies. Finally, the co-enrichment and survival analysis provided detailed explanations for glioma progression and consequently, it should support the targeted treatment. MDPI 2022-02-17 /pmc/articles/PMC8870250/ /pubmed/35205761 http://dx.doi.org/10.3390/cancers14041014 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Garbulowski, Mateusz
Smolinska, Karolina
Çabuk, Uğur
Yones, Sara A.
Celli, Ludovica
Yaz, Esma Nur
Barrenäs, Fredrik
Diamanti, Klev
Wadelius, Claes
Komorowski, Jan
Machine Learning-Based Analysis of Glioma Grades Reveals Co-Enrichment
title Machine Learning-Based Analysis of Glioma Grades Reveals Co-Enrichment
title_full Machine Learning-Based Analysis of Glioma Grades Reveals Co-Enrichment
title_fullStr Machine Learning-Based Analysis of Glioma Grades Reveals Co-Enrichment
title_full_unstemmed Machine Learning-Based Analysis of Glioma Grades Reveals Co-Enrichment
title_short Machine Learning-Based Analysis of Glioma Grades Reveals Co-Enrichment
title_sort machine learning-based analysis of glioma grades reveals co-enrichment
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8870250/
https://www.ncbi.nlm.nih.gov/pubmed/35205761
http://dx.doi.org/10.3390/cancers14041014
work_keys_str_mv AT garbulowskimateusz machinelearningbasedanalysisofgliomagradesrevealscoenrichment
AT smolinskakarolina machinelearningbasedanalysisofgliomagradesrevealscoenrichment
AT cabukugur machinelearningbasedanalysisofgliomagradesrevealscoenrichment
AT yonessaraa machinelearningbasedanalysisofgliomagradesrevealscoenrichment
AT celliludovica machinelearningbasedanalysisofgliomagradesrevealscoenrichment
AT yazesmanur machinelearningbasedanalysisofgliomagradesrevealscoenrichment
AT barrenasfredrik machinelearningbasedanalysisofgliomagradesrevealscoenrichment
AT diamantiklev machinelearningbasedanalysisofgliomagradesrevealscoenrichment
AT wadeliusclaes machinelearningbasedanalysisofgliomagradesrevealscoenrichment
AT komorowskijan machinelearningbasedanalysisofgliomagradesrevealscoenrichment