Cargando…
Feature Selection in the Reconstruction of Complex Network Representations of Spectral Data
Complex networks have been extensively used in the last decade to characterize and analyze complex systems, and they have been recently proposed as a novel instrument for the analysis of spectra extracted from biological samples. Yet, the high number of measurements composing spectra, and the conseq...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3753346/ https://www.ncbi.nlm.nih.gov/pubmed/23991036 http://dx.doi.org/10.1371/journal.pone.0072045 |
_version_ | 1782281820292775936 |
---|---|
author | Zanin, Massimiliano Menasalvas, Ernestina Boccaletti, Stefano Sousa, Pedro |
author_facet | Zanin, Massimiliano Menasalvas, Ernestina Boccaletti, Stefano Sousa, Pedro |
author_sort | Zanin, Massimiliano |
collection | PubMed |
description | Complex networks have been extensively used in the last decade to characterize and analyze complex systems, and they have been recently proposed as a novel instrument for the analysis of spectra extracted from biological samples. Yet, the high number of measurements composing spectra, and the consequent high computational cost, make a direct network analysis unfeasible. We here present a comparative analysis of three customary feature selection algorithms, including the binning of spectral data and the use of information theory metrics. Such algorithms are compared by assessing the score obtained in a classification task, where healthy subjects and people suffering from different types of cancers should be discriminated. Results indicate that a feature selection strategy based on Mutual Information outperforms the more classical data binning, while allowing a reduction of the dimensionality of the data set in two orders of magnitude. |
format | Online Article Text |
id | pubmed-3753346 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-37533462013-08-29 Feature Selection in the Reconstruction of Complex Network Representations of Spectral Data Zanin, Massimiliano Menasalvas, Ernestina Boccaletti, Stefano Sousa, Pedro PLoS One Research Article Complex networks have been extensively used in the last decade to characterize and analyze complex systems, and they have been recently proposed as a novel instrument for the analysis of spectra extracted from biological samples. Yet, the high number of measurements composing spectra, and the consequent high computational cost, make a direct network analysis unfeasible. We here present a comparative analysis of three customary feature selection algorithms, including the binning of spectral data and the use of information theory metrics. Such algorithms are compared by assessing the score obtained in a classification task, where healthy subjects and people suffering from different types of cancers should be discriminated. Results indicate that a feature selection strategy based on Mutual Information outperforms the more classical data binning, while allowing a reduction of the dimensionality of the data set in two orders of magnitude. Public Library of Science 2013-08-26 /pmc/articles/PMC3753346/ /pubmed/23991036 http://dx.doi.org/10.1371/journal.pone.0072045 Text en © 2013 Zanin et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Zanin, Massimiliano Menasalvas, Ernestina Boccaletti, Stefano Sousa, Pedro Feature Selection in the Reconstruction of Complex Network Representations of Spectral Data |
title | Feature Selection in the Reconstruction of Complex Network Representations of Spectral Data |
title_full | Feature Selection in the Reconstruction of Complex Network Representations of Spectral Data |
title_fullStr | Feature Selection in the Reconstruction of Complex Network Representations of Spectral Data |
title_full_unstemmed | Feature Selection in the Reconstruction of Complex Network Representations of Spectral Data |
title_short | Feature Selection in the Reconstruction of Complex Network Representations of Spectral Data |
title_sort | feature selection in the reconstruction of complex network representations of spectral data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3753346/ https://www.ncbi.nlm.nih.gov/pubmed/23991036 http://dx.doi.org/10.1371/journal.pone.0072045 |
work_keys_str_mv | AT zaninmassimiliano featureselectioninthereconstructionofcomplexnetworkrepresentationsofspectraldata AT menasalvasernestina featureselectioninthereconstructionofcomplexnetworkrepresentationsofspectraldata AT boccalettistefano featureselectioninthereconstructionofcomplexnetworkrepresentationsofspectraldata AT sousapedro featureselectioninthereconstructionofcomplexnetworkrepresentationsofspectraldata |