Cargando…

Pan-Cancer Classification Based on Self-Normalizing Neural Networks and Feature Selection

Cancer is a one of the severest diseases and cancer classification plays an important role in cancer diagnosis and treatment. Some different cancers even have similar molecular features such as DNA copy number variant. Pan-cancer classification is still non-trivial at molecular level. Herein, we pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Junyi, Xu, Qingzhe, Wu, Mingxiao, Huang, Tao, Wang, Yadong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7417299/
https://www.ncbi.nlm.nih.gov/pubmed/32850695
http://dx.doi.org/10.3389/fbioe.2020.00766
_version_ 1783569463076978688
author Li, Junyi
Xu, Qingzhe
Wu, Mingxiao
Huang, Tao
Wang, Yadong
author_facet Li, Junyi
Xu, Qingzhe
Wu, Mingxiao
Huang, Tao
Wang, Yadong
author_sort Li, Junyi
collection PubMed
description Cancer is a one of the severest diseases and cancer classification plays an important role in cancer diagnosis and treatment. Some different cancers even have similar molecular features such as DNA copy number variant. Pan-cancer classification is still non-trivial at molecular level. Herein, we propose a computational method to classify cancer types by using the self-normalizing neural network (SNN) for analyzing pan-cancer copy number variation data. Since the dimension of the copy number variation features is high, the Monte Carlo feature selection method was used to rank these features. Then a classifier was built by SNN and feature selection method to select features. Three thousand six hundred ninety-four features were chosen for the prediction model, which yields the accuracy value is 0.798 and macro F1 is 0.789. We compared our model to random forest method. Results show the accuracy and macro F1 obtained by our classifier are higher than those obtained by random forest classifier, indicating the good predictive power of our method in distinguishing four different cancer types. This method is also extendable to pan-cancer classification for other molecular features.
format Online
Article
Text
id pubmed-7417299
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-74172992020-08-25 Pan-Cancer Classification Based on Self-Normalizing Neural Networks and Feature Selection Li, Junyi Xu, Qingzhe Wu, Mingxiao Huang, Tao Wang, Yadong Front Bioeng Biotechnol Bioengineering and Biotechnology Cancer is a one of the severest diseases and cancer classification plays an important role in cancer diagnosis and treatment. Some different cancers even have similar molecular features such as DNA copy number variant. Pan-cancer classification is still non-trivial at molecular level. Herein, we propose a computational method to classify cancer types by using the self-normalizing neural network (SNN) for analyzing pan-cancer copy number variation data. Since the dimension of the copy number variation features is high, the Monte Carlo feature selection method was used to rank these features. Then a classifier was built by SNN and feature selection method to select features. Three thousand six hundred ninety-four features were chosen for the prediction model, which yields the accuracy value is 0.798 and macro F1 is 0.789. We compared our model to random forest method. Results show the accuracy and macro F1 obtained by our classifier are higher than those obtained by random forest classifier, indicating the good predictive power of our method in distinguishing four different cancer types. This method is also extendable to pan-cancer classification for other molecular features. Frontiers Media S.A. 2020-08-04 /pmc/articles/PMC7417299/ /pubmed/32850695 http://dx.doi.org/10.3389/fbioe.2020.00766 Text en Copyright © 2020 Li, Xu, Wu, Huang and Wang. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioengineering and Biotechnology
Li, Junyi
Xu, Qingzhe
Wu, Mingxiao
Huang, Tao
Wang, Yadong
Pan-Cancer Classification Based on Self-Normalizing Neural Networks and Feature Selection
title Pan-Cancer Classification Based on Self-Normalizing Neural Networks and Feature Selection
title_full Pan-Cancer Classification Based on Self-Normalizing Neural Networks and Feature Selection
title_fullStr Pan-Cancer Classification Based on Self-Normalizing Neural Networks and Feature Selection
title_full_unstemmed Pan-Cancer Classification Based on Self-Normalizing Neural Networks and Feature Selection
title_short Pan-Cancer Classification Based on Self-Normalizing Neural Networks and Feature Selection
title_sort pan-cancer classification based on self-normalizing neural networks and feature selection
topic Bioengineering and Biotechnology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7417299/
https://www.ncbi.nlm.nih.gov/pubmed/32850695
http://dx.doi.org/10.3389/fbioe.2020.00766
work_keys_str_mv AT lijunyi pancancerclassificationbasedonselfnormalizingneuralnetworksandfeatureselection
AT xuqingzhe pancancerclassificationbasedonselfnormalizingneuralnetworksandfeatureselection
AT wumingxiao pancancerclassificationbasedonselfnormalizingneuralnetworksandfeatureselection
AT huangtao pancancerclassificationbasedonselfnormalizingneuralnetworksandfeatureselection
AT wangyadong pancancerclassificationbasedonselfnormalizingneuralnetworksandfeatureselection