Cargando…
Identifying New Potential Biomarkers in Adrenocortical Tumors Based on mRNA Expression Data Using Machine Learning
SIMPLE SUMMARY: Using a visual-based clustering method on the TCGA RNA sequencing data of a large adrenocortical carcinoma (ACC) cohort, we were able to classify these tumors in two distinct clusters largely overlapping with previously identified ones. As previously shown, the identified clusters al...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8469239/ https://www.ncbi.nlm.nih.gov/pubmed/34572898 http://dx.doi.org/10.3390/cancers13184671 |
_version_ | 1784573879874748416 |
---|---|
author | Marquardt, André Landwehr, Laura-Sophie Ronchi, Cristina L. di Dalmazi, Guido Riester, Anna Kollmannsberger, Philip Altieri, Barbara Fassnacht, Martin Sbiera, Silviu |
author_facet | Marquardt, André Landwehr, Laura-Sophie Ronchi, Cristina L. di Dalmazi, Guido Riester, Anna Kollmannsberger, Philip Altieri, Barbara Fassnacht, Martin Sbiera, Silviu |
author_sort | Marquardt, André |
collection | PubMed |
description | SIMPLE SUMMARY: Using a visual-based clustering method on the TCGA RNA sequencing data of a large adrenocortical carcinoma (ACC) cohort, we were able to classify these tumors in two distinct clusters largely overlapping with previously identified ones. As previously shown, the identified clusters also correlated with patient survival. Applying the visual clustering method to a second dataset also including benign adrenocortical samples additionally revealed that one of the ACC clusters is more closely located to the benign samples, providing a possible explanation for the better survival of this ACC cluster. Furthermore, the subsequent use of machine learning identified new possible biomarker genes with prognostic potential for this rare disease, that are significantly differentially expressed in the different survival clusters and should be further evaluated. ABSTRACT: Adrenocortical carcinoma (ACC) is a rare disease, associated with poor survival. Several “multiple-omics” studies characterizing ACC on a molecular level identified two different clusters correlating with patient survival (C1A and C1B). We here used the publicly available transcriptome data from the TCGA-ACC dataset (n = 79), applying machine learning (ML) methods to classify the ACC based on expression pattern in an unbiased manner. UMAP (uniform manifold approximation and projection)-based clustering resulted in two distinct groups, ACC-UMAP1 and ACC-UMAP2, that largely overlap with clusters C1B and C1A, respectively. However, subsequent use of random-forest-based learning revealed a set of new possible marker genes showing significant differential expression in the described clusters (e.g., SOAT1, EIF2A1). For validation purposes, we used a secondary dataset based on a previous study from our group, consisting of 4 normal adrenal glands and 52 benign and 7 malignant tumor samples. The results largely confirmed those obtained for the TCGA-ACC cohort. In addition, the ENSAT dataset showed a correlation between benign adrenocortical tumors and the good prognosis ACC cluster ACC-UMAP1/C1B. In conclusion, the use of ML approaches re-identified and redefined known prognostic ACC subgroups. On the other hand, the subsequent use of random-forest-based learning identified new possible prognostic marker genes for ACC. |
format | Online Article Text |
id | pubmed-8469239 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-84692392021-09-27 Identifying New Potential Biomarkers in Adrenocortical Tumors Based on mRNA Expression Data Using Machine Learning Marquardt, André Landwehr, Laura-Sophie Ronchi, Cristina L. di Dalmazi, Guido Riester, Anna Kollmannsberger, Philip Altieri, Barbara Fassnacht, Martin Sbiera, Silviu Cancers (Basel) Article SIMPLE SUMMARY: Using a visual-based clustering method on the TCGA RNA sequencing data of a large adrenocortical carcinoma (ACC) cohort, we were able to classify these tumors in two distinct clusters largely overlapping with previously identified ones. As previously shown, the identified clusters also correlated with patient survival. Applying the visual clustering method to a second dataset also including benign adrenocortical samples additionally revealed that one of the ACC clusters is more closely located to the benign samples, providing a possible explanation for the better survival of this ACC cluster. Furthermore, the subsequent use of machine learning identified new possible biomarker genes with prognostic potential for this rare disease, that are significantly differentially expressed in the different survival clusters and should be further evaluated. ABSTRACT: Adrenocortical carcinoma (ACC) is a rare disease, associated with poor survival. Several “multiple-omics” studies characterizing ACC on a molecular level identified two different clusters correlating with patient survival (C1A and C1B). We here used the publicly available transcriptome data from the TCGA-ACC dataset (n = 79), applying machine learning (ML) methods to classify the ACC based on expression pattern in an unbiased manner. UMAP (uniform manifold approximation and projection)-based clustering resulted in two distinct groups, ACC-UMAP1 and ACC-UMAP2, that largely overlap with clusters C1B and C1A, respectively. However, subsequent use of random-forest-based learning revealed a set of new possible marker genes showing significant differential expression in the described clusters (e.g., SOAT1, EIF2A1). For validation purposes, we used a secondary dataset based on a previous study from our group, consisting of 4 normal adrenal glands and 52 benign and 7 malignant tumor samples. The results largely confirmed those obtained for the TCGA-ACC cohort. In addition, the ENSAT dataset showed a correlation between benign adrenocortical tumors and the good prognosis ACC cluster ACC-UMAP1/C1B. In conclusion, the use of ML approaches re-identified and redefined known prognostic ACC subgroups. On the other hand, the subsequent use of random-forest-based learning identified new possible prognostic marker genes for ACC. MDPI 2021-09-17 /pmc/articles/PMC8469239/ /pubmed/34572898 http://dx.doi.org/10.3390/cancers13184671 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Marquardt, André Landwehr, Laura-Sophie Ronchi, Cristina L. di Dalmazi, Guido Riester, Anna Kollmannsberger, Philip Altieri, Barbara Fassnacht, Martin Sbiera, Silviu Identifying New Potential Biomarkers in Adrenocortical Tumors Based on mRNA Expression Data Using Machine Learning |
title | Identifying New Potential Biomarkers in Adrenocortical Tumors Based on mRNA Expression Data Using Machine Learning |
title_full | Identifying New Potential Biomarkers in Adrenocortical Tumors Based on mRNA Expression Data Using Machine Learning |
title_fullStr | Identifying New Potential Biomarkers in Adrenocortical Tumors Based on mRNA Expression Data Using Machine Learning |
title_full_unstemmed | Identifying New Potential Biomarkers in Adrenocortical Tumors Based on mRNA Expression Data Using Machine Learning |
title_short | Identifying New Potential Biomarkers in Adrenocortical Tumors Based on mRNA Expression Data Using Machine Learning |
title_sort | identifying new potential biomarkers in adrenocortical tumors based on mrna expression data using machine learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8469239/ https://www.ncbi.nlm.nih.gov/pubmed/34572898 http://dx.doi.org/10.3390/cancers13184671 |
work_keys_str_mv | AT marquardtandre identifyingnewpotentialbiomarkersinadrenocorticaltumorsbasedonmrnaexpressiondatausingmachinelearning AT landwehrlaurasophie identifyingnewpotentialbiomarkersinadrenocorticaltumorsbasedonmrnaexpressiondatausingmachinelearning AT ronchicristinal identifyingnewpotentialbiomarkersinadrenocorticaltumorsbasedonmrnaexpressiondatausingmachinelearning AT didalmaziguido identifyingnewpotentialbiomarkersinadrenocorticaltumorsbasedonmrnaexpressiondatausingmachinelearning AT riesteranna identifyingnewpotentialbiomarkersinadrenocorticaltumorsbasedonmrnaexpressiondatausingmachinelearning AT kollmannsbergerphilip identifyingnewpotentialbiomarkersinadrenocorticaltumorsbasedonmrnaexpressiondatausingmachinelearning AT altieribarbara identifyingnewpotentialbiomarkersinadrenocorticaltumorsbasedonmrnaexpressiondatausingmachinelearning AT fassnachtmartin identifyingnewpotentialbiomarkersinadrenocorticaltumorsbasedonmrnaexpressiondatausingmachinelearning AT sbierasilviu identifyingnewpotentialbiomarkersinadrenocorticaltumorsbasedonmrnaexpressiondatausingmachinelearning |