Cargando…
HSSG: Identification of Cancer Subtypes Based on Heterogeneity Score of A Single Gene
Cancer is a highly heterogeneous disease, which leads to the fact that even the same cancer can be further classified into different subtypes according to its pathology. With the multi-omics data widely used in cancer subtypes identification, effective feature selection is essential for accurately i...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9368717/ https://www.ncbi.nlm.nih.gov/pubmed/35954300 http://dx.doi.org/10.3390/cells11152456 |
_version_ | 1784766226869780480 |
---|---|
author | Pang, Shanchen Wu, Wenhao Zhang, Yuanyuan Wang, Shudong Niu, Muyuan Zhang, Kuijie Yin, Wenjing |
author_facet | Pang, Shanchen Wu, Wenhao Zhang, Yuanyuan Wang, Shudong Niu, Muyuan Zhang, Kuijie Yin, Wenjing |
author_sort | Pang, Shanchen |
collection | PubMed |
description | Cancer is a highly heterogeneous disease, which leads to the fact that even the same cancer can be further classified into different subtypes according to its pathology. With the multi-omics data widely used in cancer subtypes identification, effective feature selection is essential for accurately identifying cancer subtypes. However, the feature selection in the existing cancer subtypes identification methods has the problem that the most helpful features cannot be selected from a biomolecular perspective, and the relationship between the selected features cannot be reflected. To solve this problem, we propose a method for feature selection to identify cancer subtypes based on the heterogeneity score of a single gene: HSSG. In the proposed method, the sample-similarity network of a single gene is constructed, and pseudo-F statistics calculates the heterogeneity score for cancer subtypes identification of each gene. Finally, we construct gene-gene networks using genes with higher heterogeneity scores and mine essential genes from the networks. From the seven TCGA data sets for three experiments, including cancer subtypes identification in single-omics data, the performance in feature selection of multi-omics data, and the effectiveness and stability of the selected features, HSSG achieves good performance in all. This indicates that HSSG can effectively select features for subtypes identification. |
format | Online Article Text |
id | pubmed-9368717 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-93687172022-08-12 HSSG: Identification of Cancer Subtypes Based on Heterogeneity Score of A Single Gene Pang, Shanchen Wu, Wenhao Zhang, Yuanyuan Wang, Shudong Niu, Muyuan Zhang, Kuijie Yin, Wenjing Cells Article Cancer is a highly heterogeneous disease, which leads to the fact that even the same cancer can be further classified into different subtypes according to its pathology. With the multi-omics data widely used in cancer subtypes identification, effective feature selection is essential for accurately identifying cancer subtypes. However, the feature selection in the existing cancer subtypes identification methods has the problem that the most helpful features cannot be selected from a biomolecular perspective, and the relationship between the selected features cannot be reflected. To solve this problem, we propose a method for feature selection to identify cancer subtypes based on the heterogeneity score of a single gene: HSSG. In the proposed method, the sample-similarity network of a single gene is constructed, and pseudo-F statistics calculates the heterogeneity score for cancer subtypes identification of each gene. Finally, we construct gene-gene networks using genes with higher heterogeneity scores and mine essential genes from the networks. From the seven TCGA data sets for three experiments, including cancer subtypes identification in single-omics data, the performance in feature selection of multi-omics data, and the effectiveness and stability of the selected features, HSSG achieves good performance in all. This indicates that HSSG can effectively select features for subtypes identification. MDPI 2022-08-08 /pmc/articles/PMC9368717/ /pubmed/35954300 http://dx.doi.org/10.3390/cells11152456 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Pang, Shanchen Wu, Wenhao Zhang, Yuanyuan Wang, Shudong Niu, Muyuan Zhang, Kuijie Yin, Wenjing HSSG: Identification of Cancer Subtypes Based on Heterogeneity Score of A Single Gene |
title | HSSG: Identification of Cancer Subtypes Based on Heterogeneity Score of A Single Gene |
title_full | HSSG: Identification of Cancer Subtypes Based on Heterogeneity Score of A Single Gene |
title_fullStr | HSSG: Identification of Cancer Subtypes Based on Heterogeneity Score of A Single Gene |
title_full_unstemmed | HSSG: Identification of Cancer Subtypes Based on Heterogeneity Score of A Single Gene |
title_short | HSSG: Identification of Cancer Subtypes Based on Heterogeneity Score of A Single Gene |
title_sort | hssg: identification of cancer subtypes based on heterogeneity score of a single gene |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9368717/ https://www.ncbi.nlm.nih.gov/pubmed/35954300 http://dx.doi.org/10.3390/cells11152456 |
work_keys_str_mv | AT pangshanchen hssgidentificationofcancersubtypesbasedonheterogeneityscoreofasinglegene AT wuwenhao hssgidentificationofcancersubtypesbasedonheterogeneityscoreofasinglegene AT zhangyuanyuan hssgidentificationofcancersubtypesbasedonheterogeneityscoreofasinglegene AT wangshudong hssgidentificationofcancersubtypesbasedonheterogeneityscoreofasinglegene AT niumuyuan hssgidentificationofcancersubtypesbasedonheterogeneityscoreofasinglegene AT zhangkuijie hssgidentificationofcancersubtypesbasedonheterogeneityscoreofasinglegene AT yinwenjing hssgidentificationofcancersubtypesbasedonheterogeneityscoreofasinglegene |