Cargando…

Identifying New Potential Biomarkers in Adrenocortical Tumors Based on mRNA Expression Data Using Machine Learning

SIMPLE SUMMARY: Using a visual-based clustering method on the TCGA RNA sequencing data of a large adrenocortical carcinoma (ACC) cohort, we were able to classify these tumors in two distinct clusters largely overlapping with previously identified ones. As previously shown, the identified clusters al...

Descripción completa

Detalles Bibliográficos
Autores principales: Marquardt, André, Landwehr, Laura-Sophie, Ronchi, Cristina L., di Dalmazi, Guido, Riester, Anna, Kollmannsberger, Philip, Altieri, Barbara, Fassnacht, Martin, Sbiera, Silviu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8469239/
https://www.ncbi.nlm.nih.gov/pubmed/34572898
http://dx.doi.org/10.3390/cancers13184671
_version_ 1784573879874748416
author Marquardt, André
Landwehr, Laura-Sophie
Ronchi, Cristina L.
di Dalmazi, Guido
Riester, Anna
Kollmannsberger, Philip
Altieri, Barbara
Fassnacht, Martin
Sbiera, Silviu
author_facet Marquardt, André
Landwehr, Laura-Sophie
Ronchi, Cristina L.
di Dalmazi, Guido
Riester, Anna
Kollmannsberger, Philip
Altieri, Barbara
Fassnacht, Martin
Sbiera, Silviu
author_sort Marquardt, André
collection PubMed
description SIMPLE SUMMARY: Using a visual-based clustering method on the TCGA RNA sequencing data of a large adrenocortical carcinoma (ACC) cohort, we were able to classify these tumors in two distinct clusters largely overlapping with previously identified ones. As previously shown, the identified clusters also correlated with patient survival. Applying the visual clustering method to a second dataset also including benign adrenocortical samples additionally revealed that one of the ACC clusters is more closely located to the benign samples, providing a possible explanation for the better survival of this ACC cluster. Furthermore, the subsequent use of machine learning identified new possible biomarker genes with prognostic potential for this rare disease, that are significantly differentially expressed in the different survival clusters and should be further evaluated. ABSTRACT: Adrenocortical carcinoma (ACC) is a rare disease, associated with poor survival. Several “multiple-omics” studies characterizing ACC on a molecular level identified two different clusters correlating with patient survival (C1A and C1B). We here used the publicly available transcriptome data from the TCGA-ACC dataset (n = 79), applying machine learning (ML) methods to classify the ACC based on expression pattern in an unbiased manner. UMAP (uniform manifold approximation and projection)-based clustering resulted in two distinct groups, ACC-UMAP1 and ACC-UMAP2, that largely overlap with clusters C1B and C1A, respectively. However, subsequent use of random-forest-based learning revealed a set of new possible marker genes showing significant differential expression in the described clusters (e.g., SOAT1, EIF2A1). For validation purposes, we used a secondary dataset based on a previous study from our group, consisting of 4 normal adrenal glands and 52 benign and 7 malignant tumor samples. The results largely confirmed those obtained for the TCGA-ACC cohort. In addition, the ENSAT dataset showed a correlation between benign adrenocortical tumors and the good prognosis ACC cluster ACC-UMAP1/C1B. In conclusion, the use of ML approaches re-identified and redefined known prognostic ACC subgroups. On the other hand, the subsequent use of random-forest-based learning identified new possible prognostic marker genes for ACC.
format Online
Article
Text
id pubmed-8469239
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-84692392021-09-27 Identifying New Potential Biomarkers in Adrenocortical Tumors Based on mRNA Expression Data Using Machine Learning Marquardt, André Landwehr, Laura-Sophie Ronchi, Cristina L. di Dalmazi, Guido Riester, Anna Kollmannsberger, Philip Altieri, Barbara Fassnacht, Martin Sbiera, Silviu Cancers (Basel) Article SIMPLE SUMMARY: Using a visual-based clustering method on the TCGA RNA sequencing data of a large adrenocortical carcinoma (ACC) cohort, we were able to classify these tumors in two distinct clusters largely overlapping with previously identified ones. As previously shown, the identified clusters also correlated with patient survival. Applying the visual clustering method to a second dataset also including benign adrenocortical samples additionally revealed that one of the ACC clusters is more closely located to the benign samples, providing a possible explanation for the better survival of this ACC cluster. Furthermore, the subsequent use of machine learning identified new possible biomarker genes with prognostic potential for this rare disease, that are significantly differentially expressed in the different survival clusters and should be further evaluated. ABSTRACT: Adrenocortical carcinoma (ACC) is a rare disease, associated with poor survival. Several “multiple-omics” studies characterizing ACC on a molecular level identified two different clusters correlating with patient survival (C1A and C1B). We here used the publicly available transcriptome data from the TCGA-ACC dataset (n = 79), applying machine learning (ML) methods to classify the ACC based on expression pattern in an unbiased manner. UMAP (uniform manifold approximation and projection)-based clustering resulted in two distinct groups, ACC-UMAP1 and ACC-UMAP2, that largely overlap with clusters C1B and C1A, respectively. However, subsequent use of random-forest-based learning revealed a set of new possible marker genes showing significant differential expression in the described clusters (e.g., SOAT1, EIF2A1). For validation purposes, we used a secondary dataset based on a previous study from our group, consisting of 4 normal adrenal glands and 52 benign and 7 malignant tumor samples. The results largely confirmed those obtained for the TCGA-ACC cohort. In addition, the ENSAT dataset showed a correlation between benign adrenocortical tumors and the good prognosis ACC cluster ACC-UMAP1/C1B. In conclusion, the use of ML approaches re-identified and redefined known prognostic ACC subgroups. On the other hand, the subsequent use of random-forest-based learning identified new possible prognostic marker genes for ACC. MDPI 2021-09-17 /pmc/articles/PMC8469239/ /pubmed/34572898 http://dx.doi.org/10.3390/cancers13184671 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Marquardt, André
Landwehr, Laura-Sophie
Ronchi, Cristina L.
di Dalmazi, Guido
Riester, Anna
Kollmannsberger, Philip
Altieri, Barbara
Fassnacht, Martin
Sbiera, Silviu
Identifying New Potential Biomarkers in Adrenocortical Tumors Based on mRNA Expression Data Using Machine Learning
title Identifying New Potential Biomarkers in Adrenocortical Tumors Based on mRNA Expression Data Using Machine Learning
title_full Identifying New Potential Biomarkers in Adrenocortical Tumors Based on mRNA Expression Data Using Machine Learning
title_fullStr Identifying New Potential Biomarkers in Adrenocortical Tumors Based on mRNA Expression Data Using Machine Learning
title_full_unstemmed Identifying New Potential Biomarkers in Adrenocortical Tumors Based on mRNA Expression Data Using Machine Learning
title_short Identifying New Potential Biomarkers in Adrenocortical Tumors Based on mRNA Expression Data Using Machine Learning
title_sort identifying new potential biomarkers in adrenocortical tumors based on mrna expression data using machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8469239/
https://www.ncbi.nlm.nih.gov/pubmed/34572898
http://dx.doi.org/10.3390/cancers13184671
work_keys_str_mv AT marquardtandre identifyingnewpotentialbiomarkersinadrenocorticaltumorsbasedonmrnaexpressiondatausingmachinelearning
AT landwehrlaurasophie identifyingnewpotentialbiomarkersinadrenocorticaltumorsbasedonmrnaexpressiondatausingmachinelearning
AT ronchicristinal identifyingnewpotentialbiomarkersinadrenocorticaltumorsbasedonmrnaexpressiondatausingmachinelearning
AT didalmaziguido identifyingnewpotentialbiomarkersinadrenocorticaltumorsbasedonmrnaexpressiondatausingmachinelearning
AT riesteranna identifyingnewpotentialbiomarkersinadrenocorticaltumorsbasedonmrnaexpressiondatausingmachinelearning
AT kollmannsbergerphilip identifyingnewpotentialbiomarkersinadrenocorticaltumorsbasedonmrnaexpressiondatausingmachinelearning
AT altieribarbara identifyingnewpotentialbiomarkersinadrenocorticaltumorsbasedonmrnaexpressiondatausingmachinelearning
AT fassnachtmartin identifyingnewpotentialbiomarkersinadrenocorticaltumorsbasedonmrnaexpressiondatausingmachinelearning
AT sbierasilviu identifyingnewpotentialbiomarkersinadrenocorticaltumorsbasedonmrnaexpressiondatausingmachinelearning