Cargando…
Pan-Cancer Classification Based on Self-Normalizing Neural Networks and Feature Selection
Cancer is a one of the severest diseases and cancer classification plays an important role in cancer diagnosis and treatment. Some different cancers even have similar molecular features such as DNA copy number variant. Pan-cancer classification is still non-trivial at molecular level. Herein, we pro...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7417299/ https://www.ncbi.nlm.nih.gov/pubmed/32850695 http://dx.doi.org/10.3389/fbioe.2020.00766 |
_version_ | 1783569463076978688 |
---|---|
author | Li, Junyi Xu, Qingzhe Wu, Mingxiao Huang, Tao Wang, Yadong |
author_facet | Li, Junyi Xu, Qingzhe Wu, Mingxiao Huang, Tao Wang, Yadong |
author_sort | Li, Junyi |
collection | PubMed |
description | Cancer is a one of the severest diseases and cancer classification plays an important role in cancer diagnosis and treatment. Some different cancers even have similar molecular features such as DNA copy number variant. Pan-cancer classification is still non-trivial at molecular level. Herein, we propose a computational method to classify cancer types by using the self-normalizing neural network (SNN) for analyzing pan-cancer copy number variation data. Since the dimension of the copy number variation features is high, the Monte Carlo feature selection method was used to rank these features. Then a classifier was built by SNN and feature selection method to select features. Three thousand six hundred ninety-four features were chosen for the prediction model, which yields the accuracy value is 0.798 and macro F1 is 0.789. We compared our model to random forest method. Results show the accuracy and macro F1 obtained by our classifier are higher than those obtained by random forest classifier, indicating the good predictive power of our method in distinguishing four different cancer types. This method is also extendable to pan-cancer classification for other molecular features. |
format | Online Article Text |
id | pubmed-7417299 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-74172992020-08-25 Pan-Cancer Classification Based on Self-Normalizing Neural Networks and Feature Selection Li, Junyi Xu, Qingzhe Wu, Mingxiao Huang, Tao Wang, Yadong Front Bioeng Biotechnol Bioengineering and Biotechnology Cancer is a one of the severest diseases and cancer classification plays an important role in cancer diagnosis and treatment. Some different cancers even have similar molecular features such as DNA copy number variant. Pan-cancer classification is still non-trivial at molecular level. Herein, we propose a computational method to classify cancer types by using the self-normalizing neural network (SNN) for analyzing pan-cancer copy number variation data. Since the dimension of the copy number variation features is high, the Monte Carlo feature selection method was used to rank these features. Then a classifier was built by SNN and feature selection method to select features. Three thousand six hundred ninety-four features were chosen for the prediction model, which yields the accuracy value is 0.798 and macro F1 is 0.789. We compared our model to random forest method. Results show the accuracy and macro F1 obtained by our classifier are higher than those obtained by random forest classifier, indicating the good predictive power of our method in distinguishing four different cancer types. This method is also extendable to pan-cancer classification for other molecular features. Frontiers Media S.A. 2020-08-04 /pmc/articles/PMC7417299/ /pubmed/32850695 http://dx.doi.org/10.3389/fbioe.2020.00766 Text en Copyright © 2020 Li, Xu, Wu, Huang and Wang. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Bioengineering and Biotechnology Li, Junyi Xu, Qingzhe Wu, Mingxiao Huang, Tao Wang, Yadong Pan-Cancer Classification Based on Self-Normalizing Neural Networks and Feature Selection |
title | Pan-Cancer Classification Based on Self-Normalizing Neural Networks and Feature Selection |
title_full | Pan-Cancer Classification Based on Self-Normalizing Neural Networks and Feature Selection |
title_fullStr | Pan-Cancer Classification Based on Self-Normalizing Neural Networks and Feature Selection |
title_full_unstemmed | Pan-Cancer Classification Based on Self-Normalizing Neural Networks and Feature Selection |
title_short | Pan-Cancer Classification Based on Self-Normalizing Neural Networks and Feature Selection |
title_sort | pan-cancer classification based on self-normalizing neural networks and feature selection |
topic | Bioengineering and Biotechnology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7417299/ https://www.ncbi.nlm.nih.gov/pubmed/32850695 http://dx.doi.org/10.3389/fbioe.2020.00766 |
work_keys_str_mv | AT lijunyi pancancerclassificationbasedonselfnormalizingneuralnetworksandfeatureselection AT xuqingzhe pancancerclassificationbasedonselfnormalizingneuralnetworksandfeatureselection AT wumingxiao pancancerclassificationbasedonselfnormalizingneuralnetworksandfeatureselection AT huangtao pancancerclassificationbasedonselfnormalizingneuralnetworksandfeatureselection AT wangyadong pancancerclassificationbasedonselfnormalizingneuralnetworksandfeatureselection |