Cargando…

Wx: a neural network-based feature selection algorithm for transcriptomic data

Next-generation sequencing (NGS), which allows the simultaneous sequencing of billions of DNA fragments simultaneously, has revolutionized how we study genomics and molecular biology by generating genome-wide molecular maps of molecules of interest. However, the amount of information produced by NGS...

Descripción completa

Detalles Bibliográficos
Autores principales: Park, Sungsoo, Shin, Bonggun, Sang Shim, Won, Choi, Yoonjung, Kang, Kilsoo, Kang, Keunsoo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6642261/
https://www.ncbi.nlm.nih.gov/pubmed/31324856
http://dx.doi.org/10.1038/s41598-019-47016-8
_version_ 1783436945283612672
author Park, Sungsoo
Shin, Bonggun
Sang Shim, Won
Choi, Yoonjung
Kang, Kilsoo
Kang, Keunsoo
author_facet Park, Sungsoo
Shin, Bonggun
Sang Shim, Won
Choi, Yoonjung
Kang, Kilsoo
Kang, Keunsoo
author_sort Park, Sungsoo
collection PubMed
description Next-generation sequencing (NGS), which allows the simultaneous sequencing of billions of DNA fragments simultaneously, has revolutionized how we study genomics and molecular biology by generating genome-wide molecular maps of molecules of interest. However, the amount of information produced by NGS has made it difficult for researchers to choose the optimal set of genes. We have sought to resolve this issue by developing a neural network-based feature (gene) selection algorithm called Wx. The Wx algorithm ranks genes based on the discriminative index (DI) score that represents the classification power for distinguishing given groups. With a gene list ranked by DI score, researchers can institutively select the optimal set of genes from the highest-ranking ones. We applied the Wx algorithm to a TCGA pan-cancer gene-expression cohort to identify an optimal set of gene-expression biomarker candidates that can distinguish cancer samples from normal samples for 12 different types of cancer. The 14 gene-expression biomarker candidates identified by Wx were comparable to or outperformed previously reported universal gene expression biomarkers, highlighting the usefulness of the Wx algorithm for next-generation sequencing data. Thus, we anticipate that the Wx algorithm can complement current state-of-the-art analytical applications for the identification of biomarker candidates as an alternative method. The stand-alone and web versions of the Wx algorithm are available at https://github.com/deargen/DearWXpub and https://wx.deargendev.me/, respectively.
format Online
Article
Text
id pubmed-6642261
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-66422612019-07-26 Wx: a neural network-based feature selection algorithm for transcriptomic data Park, Sungsoo Shin, Bonggun Sang Shim, Won Choi, Yoonjung Kang, Kilsoo Kang, Keunsoo Sci Rep Article Next-generation sequencing (NGS), which allows the simultaneous sequencing of billions of DNA fragments simultaneously, has revolutionized how we study genomics and molecular biology by generating genome-wide molecular maps of molecules of interest. However, the amount of information produced by NGS has made it difficult for researchers to choose the optimal set of genes. We have sought to resolve this issue by developing a neural network-based feature (gene) selection algorithm called Wx. The Wx algorithm ranks genes based on the discriminative index (DI) score that represents the classification power for distinguishing given groups. With a gene list ranked by DI score, researchers can institutively select the optimal set of genes from the highest-ranking ones. We applied the Wx algorithm to a TCGA pan-cancer gene-expression cohort to identify an optimal set of gene-expression biomarker candidates that can distinguish cancer samples from normal samples for 12 different types of cancer. The 14 gene-expression biomarker candidates identified by Wx were comparable to or outperformed previously reported universal gene expression biomarkers, highlighting the usefulness of the Wx algorithm for next-generation sequencing data. Thus, we anticipate that the Wx algorithm can complement current state-of-the-art analytical applications for the identification of biomarker candidates as an alternative method. The stand-alone and web versions of the Wx algorithm are available at https://github.com/deargen/DearWXpub and https://wx.deargendev.me/, respectively. Nature Publishing Group UK 2019-07-19 /pmc/articles/PMC6642261/ /pubmed/31324856 http://dx.doi.org/10.1038/s41598-019-47016-8 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Park, Sungsoo
Shin, Bonggun
Sang Shim, Won
Choi, Yoonjung
Kang, Kilsoo
Kang, Keunsoo
Wx: a neural network-based feature selection algorithm for transcriptomic data
title Wx: a neural network-based feature selection algorithm for transcriptomic data
title_full Wx: a neural network-based feature selection algorithm for transcriptomic data
title_fullStr Wx: a neural network-based feature selection algorithm for transcriptomic data
title_full_unstemmed Wx: a neural network-based feature selection algorithm for transcriptomic data
title_short Wx: a neural network-based feature selection algorithm for transcriptomic data
title_sort wx: a neural network-based feature selection algorithm for transcriptomic data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6642261/
https://www.ncbi.nlm.nih.gov/pubmed/31324856
http://dx.doi.org/10.1038/s41598-019-47016-8
work_keys_str_mv AT parksungsoo wxaneuralnetworkbasedfeatureselectionalgorithmfortranscriptomicdata
AT shinbonggun wxaneuralnetworkbasedfeatureselectionalgorithmfortranscriptomicdata
AT sangshimwon wxaneuralnetworkbasedfeatureselectionalgorithmfortranscriptomicdata
AT choiyoonjung wxaneuralnetworkbasedfeatureselectionalgorithmfortranscriptomicdata
AT kangkilsoo wxaneuralnetworkbasedfeatureselectionalgorithmfortranscriptomicdata
AT kangkeunsoo wxaneuralnetworkbasedfeatureselectionalgorithmfortranscriptomicdata