Cargando…

HSSG: Identification of Cancer Subtypes Based on Heterogeneity Score of A Single Gene

Cancer is a highly heterogeneous disease, which leads to the fact that even the same cancer can be further classified into different subtypes according to its pathology. With the multi-omics data widely used in cancer subtypes identification, effective feature selection is essential for accurately i...

Descripción completa

Detalles Bibliográficos
Autores principales: Pang, Shanchen, Wu, Wenhao, Zhang, Yuanyuan, Wang, Shudong, Niu, Muyuan, Zhang, Kuijie, Yin, Wenjing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9368717/
https://www.ncbi.nlm.nih.gov/pubmed/35954300
http://dx.doi.org/10.3390/cells11152456
_version_ 1784766226869780480
author Pang, Shanchen
Wu, Wenhao
Zhang, Yuanyuan
Wang, Shudong
Niu, Muyuan
Zhang, Kuijie
Yin, Wenjing
author_facet Pang, Shanchen
Wu, Wenhao
Zhang, Yuanyuan
Wang, Shudong
Niu, Muyuan
Zhang, Kuijie
Yin, Wenjing
author_sort Pang, Shanchen
collection PubMed
description Cancer is a highly heterogeneous disease, which leads to the fact that even the same cancer can be further classified into different subtypes according to its pathology. With the multi-omics data widely used in cancer subtypes identification, effective feature selection is essential for accurately identifying cancer subtypes. However, the feature selection in the existing cancer subtypes identification methods has the problem that the most helpful features cannot be selected from a biomolecular perspective, and the relationship between the selected features cannot be reflected. To solve this problem, we propose a method for feature selection to identify cancer subtypes based on the heterogeneity score of a single gene: HSSG. In the proposed method, the sample-similarity network of a single gene is constructed, and pseudo-F statistics calculates the heterogeneity score for cancer subtypes identification of each gene. Finally, we construct gene-gene networks using genes with higher heterogeneity scores and mine essential genes from the networks. From the seven TCGA data sets for three experiments, including cancer subtypes identification in single-omics data, the performance in feature selection of multi-omics data, and the effectiveness and stability of the selected features, HSSG achieves good performance in all. This indicates that HSSG can effectively select features for subtypes identification.
format Online
Article
Text
id pubmed-9368717
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-93687172022-08-12 HSSG: Identification of Cancer Subtypes Based on Heterogeneity Score of A Single Gene Pang, Shanchen Wu, Wenhao Zhang, Yuanyuan Wang, Shudong Niu, Muyuan Zhang, Kuijie Yin, Wenjing Cells Article Cancer is a highly heterogeneous disease, which leads to the fact that even the same cancer can be further classified into different subtypes according to its pathology. With the multi-omics data widely used in cancer subtypes identification, effective feature selection is essential for accurately identifying cancer subtypes. However, the feature selection in the existing cancer subtypes identification methods has the problem that the most helpful features cannot be selected from a biomolecular perspective, and the relationship between the selected features cannot be reflected. To solve this problem, we propose a method for feature selection to identify cancer subtypes based on the heterogeneity score of a single gene: HSSG. In the proposed method, the sample-similarity network of a single gene is constructed, and pseudo-F statistics calculates the heterogeneity score for cancer subtypes identification of each gene. Finally, we construct gene-gene networks using genes with higher heterogeneity scores and mine essential genes from the networks. From the seven TCGA data sets for three experiments, including cancer subtypes identification in single-omics data, the performance in feature selection of multi-omics data, and the effectiveness and stability of the selected features, HSSG achieves good performance in all. This indicates that HSSG can effectively select features for subtypes identification. MDPI 2022-08-08 /pmc/articles/PMC9368717/ /pubmed/35954300 http://dx.doi.org/10.3390/cells11152456 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Pang, Shanchen
Wu, Wenhao
Zhang, Yuanyuan
Wang, Shudong
Niu, Muyuan
Zhang, Kuijie
Yin, Wenjing
HSSG: Identification of Cancer Subtypes Based on Heterogeneity Score of A Single Gene
title HSSG: Identification of Cancer Subtypes Based on Heterogeneity Score of A Single Gene
title_full HSSG: Identification of Cancer Subtypes Based on Heterogeneity Score of A Single Gene
title_fullStr HSSG: Identification of Cancer Subtypes Based on Heterogeneity Score of A Single Gene
title_full_unstemmed HSSG: Identification of Cancer Subtypes Based on Heterogeneity Score of A Single Gene
title_short HSSG: Identification of Cancer Subtypes Based on Heterogeneity Score of A Single Gene
title_sort hssg: identification of cancer subtypes based on heterogeneity score of a single gene
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9368717/
https://www.ncbi.nlm.nih.gov/pubmed/35954300
http://dx.doi.org/10.3390/cells11152456
work_keys_str_mv AT pangshanchen hssgidentificationofcancersubtypesbasedonheterogeneityscoreofasinglegene
AT wuwenhao hssgidentificationofcancersubtypesbasedonheterogeneityscoreofasinglegene
AT zhangyuanyuan hssgidentificationofcancersubtypesbasedonheterogeneityscoreofasinglegene
AT wangshudong hssgidentificationofcancersubtypesbasedonheterogeneityscoreofasinglegene
AT niumuyuan hssgidentificationofcancersubtypesbasedonheterogeneityscoreofasinglegene
AT zhangkuijie hssgidentificationofcancersubtypesbasedonheterogeneityscoreofasinglegene
AT yinwenjing hssgidentificationofcancersubtypesbasedonheterogeneityscoreofasinglegene