Cargando…

Deep learning identified glioblastoma subtypes based on internal genomic expression ranks

BACKGROUND: Glioblastoma (GBM) can be divided into subtypes according to their genomic features, including Proneural (PN), Neural (NE), Classical (CL) and Mesenchymal (ME). However, it is a difficult task to unify various genomic expression profiles which were standardized with various procedures fr...

Descripción completa

Detalles Bibliográficos
Autores principales:	Mao, Xing-gang, Xue, Xiao-yan, Wang, Ling, Lin, Wei, Zhang, Xiang
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2022
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8780813/ https://www.ncbi.nlm.nih.gov/pubmed/35057766 http://dx.doi.org/10.1186/s12885-022-09191-2

_version_	1784637936202940416
author	Mao, Xing-gang Xue, Xiao-yan Wang, Ling Lin, Wei Zhang, Xiang
author_facet	Mao, Xing-gang Xue, Xiao-yan Wang, Ling Lin, Wei Zhang, Xiang
author_sort	Mao, Xing-gang
collection	PubMed
description	BACKGROUND: Glioblastoma (GBM) can be divided into subtypes according to their genomic features, including Proneural (PN), Neural (NE), Classical (CL) and Mesenchymal (ME). However, it is a difficult task to unify various genomic expression profiles which were standardized with various procedures from different studies and to manually classify a given GBM sample into a subtype. METHODS: An algorithm was developed to unify the genomic profiles of GBM samples into a standardized normal distribution (SND), based on their internal expression ranks. Deep neural networks (DNN) and convolutional DNN (CDNN) models were trained on original and SND data. In addition, expanded SND data by combining various The Cancer Genome Atlas (TCGA) datasets were used to improve the robustness and generalization capacity of the CDNN models. RESULTS: The SND data kept unimodal distribution similar to their original data, and also kept the internal expression ranks of all genes for each sample. CDNN models trained on the SND data showed significantly higher accuracy compared to DNN and CDNN models trained on primary expression data. Interestingly, the CDNN models classified the NE subtype with the lowest accuracy in the GBM datasets, expanded datasets and in IDH wide type GBMs, consistent with the recent studies that NE subtype should be excluded. Furthermore, the CDNN models also recognized independent GBM datasets, even with small set of genomic expressions. CONCLUSIONS: The GBM expression profiles can be transformed into unified SND data, which can be used to train CDNN models with high accuracy and generalization capacity. These models suggested NE subtype may be not compatible with the 4 subtypes classification system. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12885-022-09191-2.
format	Online Article Text
id	pubmed-8780813
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-87808132022-01-21 Deep learning identified glioblastoma subtypes based on internal genomic expression ranks Mao, Xing-gang Xue, Xiao-yan Wang, Ling Lin, Wei Zhang, Xiang BMC Cancer Research BACKGROUND: Glioblastoma (GBM) can be divided into subtypes according to their genomic features, including Proneural (PN), Neural (NE), Classical (CL) and Mesenchymal (ME). However, it is a difficult task to unify various genomic expression profiles which were standardized with various procedures from different studies and to manually classify a given GBM sample into a subtype. METHODS: An algorithm was developed to unify the genomic profiles of GBM samples into a standardized normal distribution (SND), based on their internal expression ranks. Deep neural networks (DNN) and convolutional DNN (CDNN) models were trained on original and SND data. In addition, expanded SND data by combining various The Cancer Genome Atlas (TCGA) datasets were used to improve the robustness and generalization capacity of the CDNN models. RESULTS: The SND data kept unimodal distribution similar to their original data, and also kept the internal expression ranks of all genes for each sample. CDNN models trained on the SND data showed significantly higher accuracy compared to DNN and CDNN models trained on primary expression data. Interestingly, the CDNN models classified the NE subtype with the lowest accuracy in the GBM datasets, expanded datasets and in IDH wide type GBMs, consistent with the recent studies that NE subtype should be excluded. Furthermore, the CDNN models also recognized independent GBM datasets, even with small set of genomic expressions. CONCLUSIONS: The GBM expression profiles can be transformed into unified SND data, which can be used to train CDNN models with high accuracy and generalization capacity. These models suggested NE subtype may be not compatible with the 4 subtypes classification system. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12885-022-09191-2. BioMed Central 2022-01-20 /pmc/articles/PMC8780813/ /pubmed/35057766 http://dx.doi.org/10.1186/s12885-022-09191-2 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Mao, Xing-gang Xue, Xiao-yan Wang, Ling Lin, Wei Zhang, Xiang Deep learning identified glioblastoma subtypes based on internal genomic expression ranks
title	Deep learning identified glioblastoma subtypes based on internal genomic expression ranks
title_full	Deep learning identified glioblastoma subtypes based on internal genomic expression ranks
title_fullStr	Deep learning identified glioblastoma subtypes based on internal genomic expression ranks
title_full_unstemmed	Deep learning identified glioblastoma subtypes based on internal genomic expression ranks
title_short	Deep learning identified glioblastoma subtypes based on internal genomic expression ranks
title_sort	deep learning identified glioblastoma subtypes based on internal genomic expression ranks
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8780813/ https://www.ncbi.nlm.nih.gov/pubmed/35057766 http://dx.doi.org/10.1186/s12885-022-09191-2
work_keys_str_mv	AT maoxinggang deeplearningidentifiedglioblastomasubtypesbasedoninternalgenomicexpressionranks AT xuexiaoyan deeplearningidentifiedglioblastomasubtypesbasedoninternalgenomicexpressionranks AT wangling deeplearningidentifiedglioblastomasubtypesbasedoninternalgenomicexpressionranks AT linwei deeplearningidentifiedglioblastomasubtypesbasedoninternalgenomicexpressionranks AT zhangxiang deeplearningidentifiedglioblastomasubtypesbasedoninternalgenomicexpressionranks

Deep learning identified glioblastoma subtypes based on internal genomic expression ranks

Ejemplares similares