Cargando…

A Graph-Theoretic Approach for Identifying Non-Redundant and Relevant Gene Markers from Microarray Data Using Multiobjective Binary PSO

The purpose of feature selection is to identify the relevant and non-redundant features from a dataset. In this article, the feature selection problem is organized as a graph-theoretic problem where a feature-dissimilarity graph is shaped from the data matrix. The nodes represent features and the ed...

Descripción completa

Detalles Bibliográficos
Autores principales: Mandal, Monalisa, Mukhopadhyay, Anirban
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3953335/
https://www.ncbi.nlm.nih.gov/pubmed/24625895
http://dx.doi.org/10.1371/journal.pone.0090949
_version_ 1782307335140540416
author Mandal, Monalisa
Mukhopadhyay, Anirban
author_facet Mandal, Monalisa
Mukhopadhyay, Anirban
author_sort Mandal, Monalisa
collection PubMed
description The purpose of feature selection is to identify the relevant and non-redundant features from a dataset. In this article, the feature selection problem is organized as a graph-theoretic problem where a feature-dissimilarity graph is shaped from the data matrix. The nodes represent features and the edges represent their dissimilarity. Both nodes and edges are given weight according to the feature’s relevance and dissimilarity among the features, respectively. The problem of finding relevant and non-redundant features is then mapped into densest subgraph finding problem. We have proposed a multiobjective particle swarm optimization (PSO)-based algorithm that optimizes average node-weight and average edge-weight of the candidate subgraph simultaneously. The proposed algorithm is applied for identifying relevant and non-redundant disease-related genes from microarray gene expression data. The performance of the proposed method is compared with that of several other existing feature selection techniques on different real-life microarray gene expression datasets.
format Online
Article
Text
id pubmed-3953335
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39533352014-03-18 A Graph-Theoretic Approach for Identifying Non-Redundant and Relevant Gene Markers from Microarray Data Using Multiobjective Binary PSO Mandal, Monalisa Mukhopadhyay, Anirban PLoS One Research Article The purpose of feature selection is to identify the relevant and non-redundant features from a dataset. In this article, the feature selection problem is organized as a graph-theoretic problem where a feature-dissimilarity graph is shaped from the data matrix. The nodes represent features and the edges represent their dissimilarity. Both nodes and edges are given weight according to the feature’s relevance and dissimilarity among the features, respectively. The problem of finding relevant and non-redundant features is then mapped into densest subgraph finding problem. We have proposed a multiobjective particle swarm optimization (PSO)-based algorithm that optimizes average node-weight and average edge-weight of the candidate subgraph simultaneously. The proposed algorithm is applied for identifying relevant and non-redundant disease-related genes from microarray gene expression data. The performance of the proposed method is compared with that of several other existing feature selection techniques on different real-life microarray gene expression datasets. Public Library of Science 2014-03-13 /pmc/articles/PMC3953335/ /pubmed/24625895 http://dx.doi.org/10.1371/journal.pone.0090949 Text en © 2014 Mandal, Mukhopadhyay http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Mandal, Monalisa
Mukhopadhyay, Anirban
A Graph-Theoretic Approach for Identifying Non-Redundant and Relevant Gene Markers from Microarray Data Using Multiobjective Binary PSO
title A Graph-Theoretic Approach for Identifying Non-Redundant and Relevant Gene Markers from Microarray Data Using Multiobjective Binary PSO
title_full A Graph-Theoretic Approach for Identifying Non-Redundant and Relevant Gene Markers from Microarray Data Using Multiobjective Binary PSO
title_fullStr A Graph-Theoretic Approach for Identifying Non-Redundant and Relevant Gene Markers from Microarray Data Using Multiobjective Binary PSO
title_full_unstemmed A Graph-Theoretic Approach for Identifying Non-Redundant and Relevant Gene Markers from Microarray Data Using Multiobjective Binary PSO
title_short A Graph-Theoretic Approach for Identifying Non-Redundant and Relevant Gene Markers from Microarray Data Using Multiobjective Binary PSO
title_sort graph-theoretic approach for identifying non-redundant and relevant gene markers from microarray data using multiobjective binary pso
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3953335/
https://www.ncbi.nlm.nih.gov/pubmed/24625895
http://dx.doi.org/10.1371/journal.pone.0090949
work_keys_str_mv AT mandalmonalisa agraphtheoreticapproachforidentifyingnonredundantandrelevantgenemarkersfrommicroarraydatausingmultiobjectivebinarypso
AT mukhopadhyayanirban agraphtheoreticapproachforidentifyingnonredundantandrelevantgenemarkersfrommicroarraydatausingmultiobjectivebinarypso
AT mandalmonalisa graphtheoreticapproachforidentifyingnonredundantandrelevantgenemarkersfrommicroarraydatausingmultiobjectivebinarypso
AT mukhopadhyayanirban graphtheoreticapproachforidentifyingnonredundantandrelevantgenemarkersfrommicroarraydatausingmultiobjectivebinarypso