Cargando…

Feature Selection in the Reconstruction of Complex Network Representations of Spectral Data

Complex networks have been extensively used in the last decade to characterize and analyze complex systems, and they have been recently proposed as a novel instrument for the analysis of spectra extracted from biological samples. Yet, the high number of measurements composing spectra, and the conseq...

Descripción completa

Detalles Bibliográficos
Autores principales: Zanin, Massimiliano, Menasalvas, Ernestina, Boccaletti, Stefano, Sousa, Pedro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3753346/
https://www.ncbi.nlm.nih.gov/pubmed/23991036
http://dx.doi.org/10.1371/journal.pone.0072045
_version_ 1782281820292775936
author Zanin, Massimiliano
Menasalvas, Ernestina
Boccaletti, Stefano
Sousa, Pedro
author_facet Zanin, Massimiliano
Menasalvas, Ernestina
Boccaletti, Stefano
Sousa, Pedro
author_sort Zanin, Massimiliano
collection PubMed
description Complex networks have been extensively used in the last decade to characterize and analyze complex systems, and they have been recently proposed as a novel instrument for the analysis of spectra extracted from biological samples. Yet, the high number of measurements composing spectra, and the consequent high computational cost, make a direct network analysis unfeasible. We here present a comparative analysis of three customary feature selection algorithms, including the binning of spectral data and the use of information theory metrics. Such algorithms are compared by assessing the score obtained in a classification task, where healthy subjects and people suffering from different types of cancers should be discriminated. Results indicate that a feature selection strategy based on Mutual Information outperforms the more classical data binning, while allowing a reduction of the dimensionality of the data set in two orders of magnitude.
format Online
Article
Text
id pubmed-3753346
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37533462013-08-29 Feature Selection in the Reconstruction of Complex Network Representations of Spectral Data Zanin, Massimiliano Menasalvas, Ernestina Boccaletti, Stefano Sousa, Pedro PLoS One Research Article Complex networks have been extensively used in the last decade to characterize and analyze complex systems, and they have been recently proposed as a novel instrument for the analysis of spectra extracted from biological samples. Yet, the high number of measurements composing spectra, and the consequent high computational cost, make a direct network analysis unfeasible. We here present a comparative analysis of three customary feature selection algorithms, including the binning of spectral data and the use of information theory metrics. Such algorithms are compared by assessing the score obtained in a classification task, where healthy subjects and people suffering from different types of cancers should be discriminated. Results indicate that a feature selection strategy based on Mutual Information outperforms the more classical data binning, while allowing a reduction of the dimensionality of the data set in two orders of magnitude. Public Library of Science 2013-08-26 /pmc/articles/PMC3753346/ /pubmed/23991036 http://dx.doi.org/10.1371/journal.pone.0072045 Text en © 2013 Zanin et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Zanin, Massimiliano
Menasalvas, Ernestina
Boccaletti, Stefano
Sousa, Pedro
Feature Selection in the Reconstruction of Complex Network Representations of Spectral Data
title Feature Selection in the Reconstruction of Complex Network Representations of Spectral Data
title_full Feature Selection in the Reconstruction of Complex Network Representations of Spectral Data
title_fullStr Feature Selection in the Reconstruction of Complex Network Representations of Spectral Data
title_full_unstemmed Feature Selection in the Reconstruction of Complex Network Representations of Spectral Data
title_short Feature Selection in the Reconstruction of Complex Network Representations of Spectral Data
title_sort feature selection in the reconstruction of complex network representations of spectral data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3753346/
https://www.ncbi.nlm.nih.gov/pubmed/23991036
http://dx.doi.org/10.1371/journal.pone.0072045
work_keys_str_mv AT zaninmassimiliano featureselectioninthereconstructionofcomplexnetworkrepresentationsofspectraldata
AT menasalvasernestina featureselectioninthereconstructionofcomplexnetworkrepresentationsofspectraldata
AT boccalettistefano featureselectioninthereconstructionofcomplexnetworkrepresentationsofspectraldata
AT sousapedro featureselectioninthereconstructionofcomplexnetworkrepresentationsofspectraldata