Cargando…

A Cascade Flexible Neural Forest Model for Cancer Subtypes Classification on Gene Expression Data

The correct classification of cancer subtypes is of great significance for the in-depth study of cancer pathogenesis and the realization of accurate treatment for cancer patients. In recent years, the classification of cancer subtypes using deep neural networks and gene expression data has become a...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhong, Lianxin, Meng, Qingfang, Chen, Yuehui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8510824/
https://www.ncbi.nlm.nih.gov/pubmed/34650605
http://dx.doi.org/10.1155/2021/6480456
_version_ 1784582656659292160
author Zhong, Lianxin
Meng, Qingfang
Chen, Yuehui
author_facet Zhong, Lianxin
Meng, Qingfang
Chen, Yuehui
author_sort Zhong, Lianxin
collection PubMed
description The correct classification of cancer subtypes is of great significance for the in-depth study of cancer pathogenesis and the realization of accurate treatment for cancer patients. In recent years, the classification of cancer subtypes using deep neural networks and gene expression data has become a hot topic. However, most classifiers may face the challenges of overfitting and low classification accuracy when dealing with small sample size and high-dimensional biological data. In this paper, the Cascade Flexible Neural Forest (CFNForest) Model was proposed to accomplish cancer subtype classification. CFNForest extended the traditional flexible neural tree structure to FNT Group Forest exploiting a bagging ensemble strategy and could automatically generate the model's structure and parameters. In order to deepen the FNT Group Forest without introducing new hyperparameters, the multilayer cascade framework was exploited to design the FNT Group Forest model, which transformed features between levels and improved the performance of the model. The proposed CFNForest model also improved the operational efficiency and the robustness of the model by sample selection mechanism between layers and setting different weights for the output of each layer. To accomplish cancer subtype classification, FNT Group Forest with different feature sets was used to enrich the structural diversity of the model, which make it more suitable for processing small sample size datasets. The experiments on RNA-seq gene expression data showed that CFNForest effectively improves the accuracy of cancer subtype classification. The classification results have good robustness.
format Online
Article
Text
id pubmed-8510824
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-85108242021-10-13 A Cascade Flexible Neural Forest Model for Cancer Subtypes Classification on Gene Expression Data Zhong, Lianxin Meng, Qingfang Chen, Yuehui Comput Intell Neurosci Research Article The correct classification of cancer subtypes is of great significance for the in-depth study of cancer pathogenesis and the realization of accurate treatment for cancer patients. In recent years, the classification of cancer subtypes using deep neural networks and gene expression data has become a hot topic. However, most classifiers may face the challenges of overfitting and low classification accuracy when dealing with small sample size and high-dimensional biological data. In this paper, the Cascade Flexible Neural Forest (CFNForest) Model was proposed to accomplish cancer subtype classification. CFNForest extended the traditional flexible neural tree structure to FNT Group Forest exploiting a bagging ensemble strategy and could automatically generate the model's structure and parameters. In order to deepen the FNT Group Forest without introducing new hyperparameters, the multilayer cascade framework was exploited to design the FNT Group Forest model, which transformed features between levels and improved the performance of the model. The proposed CFNForest model also improved the operational efficiency and the robustness of the model by sample selection mechanism between layers and setting different weights for the output of each layer. To accomplish cancer subtype classification, FNT Group Forest with different feature sets was used to enrich the structural diversity of the model, which make it more suitable for processing small sample size datasets. The experiments on RNA-seq gene expression data showed that CFNForest effectively improves the accuracy of cancer subtype classification. The classification results have good robustness. Hindawi 2021-10-05 /pmc/articles/PMC8510824/ /pubmed/34650605 http://dx.doi.org/10.1155/2021/6480456 Text en Copyright © 2021 Lianxin Zhong et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhong, Lianxin
Meng, Qingfang
Chen, Yuehui
A Cascade Flexible Neural Forest Model for Cancer Subtypes Classification on Gene Expression Data
title A Cascade Flexible Neural Forest Model for Cancer Subtypes Classification on Gene Expression Data
title_full A Cascade Flexible Neural Forest Model for Cancer Subtypes Classification on Gene Expression Data
title_fullStr A Cascade Flexible Neural Forest Model for Cancer Subtypes Classification on Gene Expression Data
title_full_unstemmed A Cascade Flexible Neural Forest Model for Cancer Subtypes Classification on Gene Expression Data
title_short A Cascade Flexible Neural Forest Model for Cancer Subtypes Classification on Gene Expression Data
title_sort cascade flexible neural forest model for cancer subtypes classification on gene expression data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8510824/
https://www.ncbi.nlm.nih.gov/pubmed/34650605
http://dx.doi.org/10.1155/2021/6480456
work_keys_str_mv AT zhonglianxin acascadeflexibleneuralforestmodelforcancersubtypesclassificationongeneexpressiondata
AT mengqingfang acascadeflexibleneuralforestmodelforcancersubtypesclassificationongeneexpressiondata
AT chenyuehui acascadeflexibleneuralforestmodelforcancersubtypesclassificationongeneexpressiondata
AT zhonglianxin cascadeflexibleneuralforestmodelforcancersubtypesclassificationongeneexpressiondata
AT mengqingfang cascadeflexibleneuralforestmodelforcancersubtypesclassificationongeneexpressiondata
AT chenyuehui cascadeflexibleneuralforestmodelforcancersubtypesclassificationongeneexpressiondata