Cargando…
Wx: a neural network-based feature selection algorithm for transcriptomic data
Next-generation sequencing (NGS), which allows the simultaneous sequencing of billions of DNA fragments simultaneously, has revolutionized how we study genomics and molecular biology by generating genome-wide molecular maps of molecules of interest. However, the amount of information produced by NGS...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6642261/ https://www.ncbi.nlm.nih.gov/pubmed/31324856 http://dx.doi.org/10.1038/s41598-019-47016-8 |
_version_ | 1783436945283612672 |
---|---|
author | Park, Sungsoo Shin, Bonggun Sang Shim, Won Choi, Yoonjung Kang, Kilsoo Kang, Keunsoo |
author_facet | Park, Sungsoo Shin, Bonggun Sang Shim, Won Choi, Yoonjung Kang, Kilsoo Kang, Keunsoo |
author_sort | Park, Sungsoo |
collection | PubMed |
description | Next-generation sequencing (NGS), which allows the simultaneous sequencing of billions of DNA fragments simultaneously, has revolutionized how we study genomics and molecular biology by generating genome-wide molecular maps of molecules of interest. However, the amount of information produced by NGS has made it difficult for researchers to choose the optimal set of genes. We have sought to resolve this issue by developing a neural network-based feature (gene) selection algorithm called Wx. The Wx algorithm ranks genes based on the discriminative index (DI) score that represents the classification power for distinguishing given groups. With a gene list ranked by DI score, researchers can institutively select the optimal set of genes from the highest-ranking ones. We applied the Wx algorithm to a TCGA pan-cancer gene-expression cohort to identify an optimal set of gene-expression biomarker candidates that can distinguish cancer samples from normal samples for 12 different types of cancer. The 14 gene-expression biomarker candidates identified by Wx were comparable to or outperformed previously reported universal gene expression biomarkers, highlighting the usefulness of the Wx algorithm for next-generation sequencing data. Thus, we anticipate that the Wx algorithm can complement current state-of-the-art analytical applications for the identification of biomarker candidates as an alternative method. The stand-alone and web versions of the Wx algorithm are available at https://github.com/deargen/DearWXpub and https://wx.deargendev.me/, respectively. |
format | Online Article Text |
id | pubmed-6642261 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-66422612019-07-26 Wx: a neural network-based feature selection algorithm for transcriptomic data Park, Sungsoo Shin, Bonggun Sang Shim, Won Choi, Yoonjung Kang, Kilsoo Kang, Keunsoo Sci Rep Article Next-generation sequencing (NGS), which allows the simultaneous sequencing of billions of DNA fragments simultaneously, has revolutionized how we study genomics and molecular biology by generating genome-wide molecular maps of molecules of interest. However, the amount of information produced by NGS has made it difficult for researchers to choose the optimal set of genes. We have sought to resolve this issue by developing a neural network-based feature (gene) selection algorithm called Wx. The Wx algorithm ranks genes based on the discriminative index (DI) score that represents the classification power for distinguishing given groups. With a gene list ranked by DI score, researchers can institutively select the optimal set of genes from the highest-ranking ones. We applied the Wx algorithm to a TCGA pan-cancer gene-expression cohort to identify an optimal set of gene-expression biomarker candidates that can distinguish cancer samples from normal samples for 12 different types of cancer. The 14 gene-expression biomarker candidates identified by Wx were comparable to or outperformed previously reported universal gene expression biomarkers, highlighting the usefulness of the Wx algorithm for next-generation sequencing data. Thus, we anticipate that the Wx algorithm can complement current state-of-the-art analytical applications for the identification of biomarker candidates as an alternative method. The stand-alone and web versions of the Wx algorithm are available at https://github.com/deargen/DearWXpub and https://wx.deargendev.me/, respectively. Nature Publishing Group UK 2019-07-19 /pmc/articles/PMC6642261/ /pubmed/31324856 http://dx.doi.org/10.1038/s41598-019-47016-8 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Park, Sungsoo Shin, Bonggun Sang Shim, Won Choi, Yoonjung Kang, Kilsoo Kang, Keunsoo Wx: a neural network-based feature selection algorithm for transcriptomic data |
title | Wx: a neural network-based feature selection algorithm for transcriptomic data |
title_full | Wx: a neural network-based feature selection algorithm for transcriptomic data |
title_fullStr | Wx: a neural network-based feature selection algorithm for transcriptomic data |
title_full_unstemmed | Wx: a neural network-based feature selection algorithm for transcriptomic data |
title_short | Wx: a neural network-based feature selection algorithm for transcriptomic data |
title_sort | wx: a neural network-based feature selection algorithm for transcriptomic data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6642261/ https://www.ncbi.nlm.nih.gov/pubmed/31324856 http://dx.doi.org/10.1038/s41598-019-47016-8 |
work_keys_str_mv | AT parksungsoo wxaneuralnetworkbasedfeatureselectionalgorithmfortranscriptomicdata AT shinbonggun wxaneuralnetworkbasedfeatureselectionalgorithmfortranscriptomicdata AT sangshimwon wxaneuralnetworkbasedfeatureselectionalgorithmfortranscriptomicdata AT choiyoonjung wxaneuralnetworkbasedfeatureselectionalgorithmfortranscriptomicdata AT kangkilsoo wxaneuralnetworkbasedfeatureselectionalgorithmfortranscriptomicdata AT kangkeunsoo wxaneuralnetworkbasedfeatureselectionalgorithmfortranscriptomicdata |