Cargando…

Determination of biomarkers from microarray data using graph neural network and spectral clustering

In bioinformatics, the rapid development of gene sequencing technology has produced an increasing amount of microarray data. This type of data shares the typical characteristics of small sample size and high feature dimensions. Searching for biomarkers from microarray data, which expression features...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Kun, Xie, Weidong, Wang, Linjie, Zhang, Shoujia, Li, Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8668890/
https://www.ncbi.nlm.nih.gov/pubmed/34903818
http://dx.doi.org/10.1038/s41598-021-03316-6
_version_ 1784614674685231104
author Yu, Kun
Xie, Weidong
Wang, Linjie
Zhang, Shoujia
Li, Wei
author_facet Yu, Kun
Xie, Weidong
Wang, Linjie
Zhang, Shoujia
Li, Wei
author_sort Yu, Kun
collection PubMed
description In bioinformatics, the rapid development of gene sequencing technology has produced an increasing amount of microarray data. This type of data shares the typical characteristics of small sample size and high feature dimensions. Searching for biomarkers from microarray data, which expression features of various diseases, is essential for the disease classification. feature selection has therefore became fundemental for the analysis of microarray data, which designs to remove irrelevant and redundant features. There are a large number of redundant features and irrelevant features in microarray data, which severely degrade the classification effectiveness. We propose an innovative feature selection method with the goal of obtaining feature dependencies from a priori knowledge and removing redundant features using spectral clustering. In this paper, the graph structure is firstly constructed by using the gene interaction network as a priori knowledge, and then a link prediction method based on graph neural network is proposed to enhance the graph structure data. Finally, a feature selection method based on spectral clustering is proposed to determine biomarkers. The classification accuracy on DLBCL and Prostate can be improved by 10.90% and 16.22% compared to traditional methods. Link prediction provides an average classification accuracy improvement of 1.96% and 1.31%, and is up to 16.98% higher than the published method. The results show that the proposed method can have full use of a priori knowledge to effectively select disease prediction biomarkers with high classification accuracy.
format Online
Article
Text
id pubmed-8668890
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-86688902021-12-15 Determination of biomarkers from microarray data using graph neural network and spectral clustering Yu, Kun Xie, Weidong Wang, Linjie Zhang, Shoujia Li, Wei Sci Rep Article In bioinformatics, the rapid development of gene sequencing technology has produced an increasing amount of microarray data. This type of data shares the typical characteristics of small sample size and high feature dimensions. Searching for biomarkers from microarray data, which expression features of various diseases, is essential for the disease classification. feature selection has therefore became fundemental for the analysis of microarray data, which designs to remove irrelevant and redundant features. There are a large number of redundant features and irrelevant features in microarray data, which severely degrade the classification effectiveness. We propose an innovative feature selection method with the goal of obtaining feature dependencies from a priori knowledge and removing redundant features using spectral clustering. In this paper, the graph structure is firstly constructed by using the gene interaction network as a priori knowledge, and then a link prediction method based on graph neural network is proposed to enhance the graph structure data. Finally, a feature selection method based on spectral clustering is proposed to determine biomarkers. The classification accuracy on DLBCL and Prostate can be improved by 10.90% and 16.22% compared to traditional methods. Link prediction provides an average classification accuracy improvement of 1.96% and 1.31%, and is up to 16.98% higher than the published method. The results show that the proposed method can have full use of a priori knowledge to effectively select disease prediction biomarkers with high classification accuracy. Nature Publishing Group UK 2021-12-13 /pmc/articles/PMC8668890/ /pubmed/34903818 http://dx.doi.org/10.1038/s41598-021-03316-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Yu, Kun
Xie, Weidong
Wang, Linjie
Zhang, Shoujia
Li, Wei
Determination of biomarkers from microarray data using graph neural network and spectral clustering
title Determination of biomarkers from microarray data using graph neural network and spectral clustering
title_full Determination of biomarkers from microarray data using graph neural network and spectral clustering
title_fullStr Determination of biomarkers from microarray data using graph neural network and spectral clustering
title_full_unstemmed Determination of biomarkers from microarray data using graph neural network and spectral clustering
title_short Determination of biomarkers from microarray data using graph neural network and spectral clustering
title_sort determination of biomarkers from microarray data using graph neural network and spectral clustering
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8668890/
https://www.ncbi.nlm.nih.gov/pubmed/34903818
http://dx.doi.org/10.1038/s41598-021-03316-6
work_keys_str_mv AT yukun determinationofbiomarkersfrommicroarraydatausinggraphneuralnetworkandspectralclustering
AT xieweidong determinationofbiomarkersfrommicroarraydatausinggraphneuralnetworkandspectralclustering
AT wanglinjie determinationofbiomarkersfrommicroarraydatausinggraphneuralnetworkandspectralclustering
AT zhangshoujia determinationofbiomarkersfrommicroarraydatausinggraphneuralnetworkandspectralclustering
AT liwei determinationofbiomarkersfrommicroarraydatausinggraphneuralnetworkandspectralclustering