Cargando…
Determination of biomarkers from microarray data using graph neural network and spectral clustering
In bioinformatics, the rapid development of gene sequencing technology has produced an increasing amount of microarray data. This type of data shares the typical characteristics of small sample size and high feature dimensions. Searching for biomarkers from microarray data, which expression features...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8668890/ https://www.ncbi.nlm.nih.gov/pubmed/34903818 http://dx.doi.org/10.1038/s41598-021-03316-6 |
_version_ | 1784614674685231104 |
---|---|
author | Yu, Kun Xie, Weidong Wang, Linjie Zhang, Shoujia Li, Wei |
author_facet | Yu, Kun Xie, Weidong Wang, Linjie Zhang, Shoujia Li, Wei |
author_sort | Yu, Kun |
collection | PubMed |
description | In bioinformatics, the rapid development of gene sequencing technology has produced an increasing amount of microarray data. This type of data shares the typical characteristics of small sample size and high feature dimensions. Searching for biomarkers from microarray data, which expression features of various diseases, is essential for the disease classification. feature selection has therefore became fundemental for the analysis of microarray data, which designs to remove irrelevant and redundant features. There are a large number of redundant features and irrelevant features in microarray data, which severely degrade the classification effectiveness. We propose an innovative feature selection method with the goal of obtaining feature dependencies from a priori knowledge and removing redundant features using spectral clustering. In this paper, the graph structure is firstly constructed by using the gene interaction network as a priori knowledge, and then a link prediction method based on graph neural network is proposed to enhance the graph structure data. Finally, a feature selection method based on spectral clustering is proposed to determine biomarkers. The classification accuracy on DLBCL and Prostate can be improved by 10.90% and 16.22% compared to traditional methods. Link prediction provides an average classification accuracy improvement of 1.96% and 1.31%, and is up to 16.98% higher than the published method. The results show that the proposed method can have full use of a priori knowledge to effectively select disease prediction biomarkers with high classification accuracy. |
format | Online Article Text |
id | pubmed-8668890 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-86688902021-12-15 Determination of biomarkers from microarray data using graph neural network and spectral clustering Yu, Kun Xie, Weidong Wang, Linjie Zhang, Shoujia Li, Wei Sci Rep Article In bioinformatics, the rapid development of gene sequencing technology has produced an increasing amount of microarray data. This type of data shares the typical characteristics of small sample size and high feature dimensions. Searching for biomarkers from microarray data, which expression features of various diseases, is essential for the disease classification. feature selection has therefore became fundemental for the analysis of microarray data, which designs to remove irrelevant and redundant features. There are a large number of redundant features and irrelevant features in microarray data, which severely degrade the classification effectiveness. We propose an innovative feature selection method with the goal of obtaining feature dependencies from a priori knowledge and removing redundant features using spectral clustering. In this paper, the graph structure is firstly constructed by using the gene interaction network as a priori knowledge, and then a link prediction method based on graph neural network is proposed to enhance the graph structure data. Finally, a feature selection method based on spectral clustering is proposed to determine biomarkers. The classification accuracy on DLBCL and Prostate can be improved by 10.90% and 16.22% compared to traditional methods. Link prediction provides an average classification accuracy improvement of 1.96% and 1.31%, and is up to 16.98% higher than the published method. The results show that the proposed method can have full use of a priori knowledge to effectively select disease prediction biomarkers with high classification accuracy. Nature Publishing Group UK 2021-12-13 /pmc/articles/PMC8668890/ /pubmed/34903818 http://dx.doi.org/10.1038/s41598-021-03316-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Yu, Kun Xie, Weidong Wang, Linjie Zhang, Shoujia Li, Wei Determination of biomarkers from microarray data using graph neural network and spectral clustering |
title | Determination of biomarkers from microarray data using graph neural network and spectral clustering |
title_full | Determination of biomarkers from microarray data using graph neural network and spectral clustering |
title_fullStr | Determination of biomarkers from microarray data using graph neural network and spectral clustering |
title_full_unstemmed | Determination of biomarkers from microarray data using graph neural network and spectral clustering |
title_short | Determination of biomarkers from microarray data using graph neural network and spectral clustering |
title_sort | determination of biomarkers from microarray data using graph neural network and spectral clustering |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8668890/ https://www.ncbi.nlm.nih.gov/pubmed/34903818 http://dx.doi.org/10.1038/s41598-021-03316-6 |
work_keys_str_mv | AT yukun determinationofbiomarkersfrommicroarraydatausinggraphneuralnetworkandspectralclustering AT xieweidong determinationofbiomarkersfrommicroarraydatausinggraphneuralnetworkandspectralclustering AT wanglinjie determinationofbiomarkersfrommicroarraydatausinggraphneuralnetworkandspectralclustering AT zhangshoujia determinationofbiomarkersfrommicroarraydatausinggraphneuralnetworkandspectralclustering AT liwei determinationofbiomarkersfrommicroarraydatausinggraphneuralnetworkandspectralclustering |