Cargando…
A Cascade Flexible Neural Forest Model for Cancer Subtypes Classification on Gene Expression Data
The correct classification of cancer subtypes is of great significance for the in-depth study of cancer pathogenesis and the realization of accurate treatment for cancer patients. In recent years, the classification of cancer subtypes using deep neural networks and gene expression data has become a...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8510824/ https://www.ncbi.nlm.nih.gov/pubmed/34650605 http://dx.doi.org/10.1155/2021/6480456 |
_version_ | 1784582656659292160 |
---|---|
author | Zhong, Lianxin Meng, Qingfang Chen, Yuehui |
author_facet | Zhong, Lianxin Meng, Qingfang Chen, Yuehui |
author_sort | Zhong, Lianxin |
collection | PubMed |
description | The correct classification of cancer subtypes is of great significance for the in-depth study of cancer pathogenesis and the realization of accurate treatment for cancer patients. In recent years, the classification of cancer subtypes using deep neural networks and gene expression data has become a hot topic. However, most classifiers may face the challenges of overfitting and low classification accuracy when dealing with small sample size and high-dimensional biological data. In this paper, the Cascade Flexible Neural Forest (CFNForest) Model was proposed to accomplish cancer subtype classification. CFNForest extended the traditional flexible neural tree structure to FNT Group Forest exploiting a bagging ensemble strategy and could automatically generate the model's structure and parameters. In order to deepen the FNT Group Forest without introducing new hyperparameters, the multilayer cascade framework was exploited to design the FNT Group Forest model, which transformed features between levels and improved the performance of the model. The proposed CFNForest model also improved the operational efficiency and the robustness of the model by sample selection mechanism between layers and setting different weights for the output of each layer. To accomplish cancer subtype classification, FNT Group Forest with different feature sets was used to enrich the structural diversity of the model, which make it more suitable for processing small sample size datasets. The experiments on RNA-seq gene expression data showed that CFNForest effectively improves the accuracy of cancer subtype classification. The classification results have good robustness. |
format | Online Article Text |
id | pubmed-8510824 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-85108242021-10-13 A Cascade Flexible Neural Forest Model for Cancer Subtypes Classification on Gene Expression Data Zhong, Lianxin Meng, Qingfang Chen, Yuehui Comput Intell Neurosci Research Article The correct classification of cancer subtypes is of great significance for the in-depth study of cancer pathogenesis and the realization of accurate treatment for cancer patients. In recent years, the classification of cancer subtypes using deep neural networks and gene expression data has become a hot topic. However, most classifiers may face the challenges of overfitting and low classification accuracy when dealing with small sample size and high-dimensional biological data. In this paper, the Cascade Flexible Neural Forest (CFNForest) Model was proposed to accomplish cancer subtype classification. CFNForest extended the traditional flexible neural tree structure to FNT Group Forest exploiting a bagging ensemble strategy and could automatically generate the model's structure and parameters. In order to deepen the FNT Group Forest without introducing new hyperparameters, the multilayer cascade framework was exploited to design the FNT Group Forest model, which transformed features between levels and improved the performance of the model. The proposed CFNForest model also improved the operational efficiency and the robustness of the model by sample selection mechanism between layers and setting different weights for the output of each layer. To accomplish cancer subtype classification, FNT Group Forest with different feature sets was used to enrich the structural diversity of the model, which make it more suitable for processing small sample size datasets. The experiments on RNA-seq gene expression data showed that CFNForest effectively improves the accuracy of cancer subtype classification. The classification results have good robustness. Hindawi 2021-10-05 /pmc/articles/PMC8510824/ /pubmed/34650605 http://dx.doi.org/10.1155/2021/6480456 Text en Copyright © 2021 Lianxin Zhong et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Zhong, Lianxin Meng, Qingfang Chen, Yuehui A Cascade Flexible Neural Forest Model for Cancer Subtypes Classification on Gene Expression Data |
title | A Cascade Flexible Neural Forest Model for Cancer Subtypes Classification on Gene Expression Data |
title_full | A Cascade Flexible Neural Forest Model for Cancer Subtypes Classification on Gene Expression Data |
title_fullStr | A Cascade Flexible Neural Forest Model for Cancer Subtypes Classification on Gene Expression Data |
title_full_unstemmed | A Cascade Flexible Neural Forest Model for Cancer Subtypes Classification on Gene Expression Data |
title_short | A Cascade Flexible Neural Forest Model for Cancer Subtypes Classification on Gene Expression Data |
title_sort | cascade flexible neural forest model for cancer subtypes classification on gene expression data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8510824/ https://www.ncbi.nlm.nih.gov/pubmed/34650605 http://dx.doi.org/10.1155/2021/6480456 |
work_keys_str_mv | AT zhonglianxin acascadeflexibleneuralforestmodelforcancersubtypesclassificationongeneexpressiondata AT mengqingfang acascadeflexibleneuralforestmodelforcancersubtypesclassificationongeneexpressiondata AT chenyuehui acascadeflexibleneuralforestmodelforcancersubtypesclassificationongeneexpressiondata AT zhonglianxin cascadeflexibleneuralforestmodelforcancersubtypesclassificationongeneexpressiondata AT mengqingfang cascadeflexibleneuralforestmodelforcancersubtypesclassificationongeneexpressiondata AT chenyuehui cascadeflexibleneuralforestmodelforcancersubtypesclassificationongeneexpressiondata |