Cargando…

Feature Selection for Better Identification of Subtypes of Guillain-Barré Syndrome

Guillain-Barré syndrome (GBS) is a neurological disorder which has not been explored using clustering algorithms. Clustering algorithms perform more efficiently when they work only with relevant features. In this work, we applied correlation-based feature selection (CFS), chi-squared, information ga...

Descripción completa

Detalles Bibliográficos
Autores principales: Hernández-Torruco, José, Canul-Reich, Juana, Frausto-Solís, Juan, Méndez-Castillo, Juan José
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4180197/
https://www.ncbi.nlm.nih.gov/pubmed/25302074
http://dx.doi.org/10.1155/2014/432109
_version_ 1782337193211068416
author Hernández-Torruco, José
Canul-Reich, Juana
Frausto-Solís, Juan
Méndez-Castillo, Juan José
author_facet Hernández-Torruco, José
Canul-Reich, Juana
Frausto-Solís, Juan
Méndez-Castillo, Juan José
author_sort Hernández-Torruco, José
collection PubMed
description Guillain-Barré syndrome (GBS) is a neurological disorder which has not been explored using clustering algorithms. Clustering algorithms perform more efficiently when they work only with relevant features. In this work, we applied correlation-based feature selection (CFS), chi-squared, information gain, symmetrical uncertainty, and consistency filter methods to select the most relevant features from a 156-feature real dataset. This dataset contains clinical, serological, and nerve conduction tests data obtained from GBS patients. The most relevant feature subsets, determined with each filter method, were used to identify four subtypes of GBS present in the dataset. We used partitions around medoids (PAM) clustering algorithm to form four clusters, corresponding to the GBS subtypes. We applied the purity of each cluster as evaluation measure. After experimentation, symmetrical uncertainty and information gain determined a feature subset of seven variables. These variables conformed as a dataset were used as input to PAM and reached a purity of 0.7984. This result leads to a first characterization of this syndrome using computational techniques.
format Online
Article
Text
id pubmed-4180197
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-41801972014-10-09 Feature Selection for Better Identification of Subtypes of Guillain-Barré Syndrome Hernández-Torruco, José Canul-Reich, Juana Frausto-Solís, Juan Méndez-Castillo, Juan José Comput Math Methods Med Research Article Guillain-Barré syndrome (GBS) is a neurological disorder which has not been explored using clustering algorithms. Clustering algorithms perform more efficiently when they work only with relevant features. In this work, we applied correlation-based feature selection (CFS), chi-squared, information gain, symmetrical uncertainty, and consistency filter methods to select the most relevant features from a 156-feature real dataset. This dataset contains clinical, serological, and nerve conduction tests data obtained from GBS patients. The most relevant feature subsets, determined with each filter method, were used to identify four subtypes of GBS present in the dataset. We used partitions around medoids (PAM) clustering algorithm to form four clusters, corresponding to the GBS subtypes. We applied the purity of each cluster as evaluation measure. After experimentation, symmetrical uncertainty and information gain determined a feature subset of seven variables. These variables conformed as a dataset were used as input to PAM and reached a purity of 0.7984. This result leads to a first characterization of this syndrome using computational techniques. Hindawi Publishing Corporation 2014 2014-09-15 /pmc/articles/PMC4180197/ /pubmed/25302074 http://dx.doi.org/10.1155/2014/432109 Text en Copyright © 2014 José Hernández-Torruco et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Hernández-Torruco, José
Canul-Reich, Juana
Frausto-Solís, Juan
Méndez-Castillo, Juan José
Feature Selection for Better Identification of Subtypes of Guillain-Barré Syndrome
title Feature Selection for Better Identification of Subtypes of Guillain-Barré Syndrome
title_full Feature Selection for Better Identification of Subtypes of Guillain-Barré Syndrome
title_fullStr Feature Selection for Better Identification of Subtypes of Guillain-Barré Syndrome
title_full_unstemmed Feature Selection for Better Identification of Subtypes of Guillain-Barré Syndrome
title_short Feature Selection for Better Identification of Subtypes of Guillain-Barré Syndrome
title_sort feature selection for better identification of subtypes of guillain-barré syndrome
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4180197/
https://www.ncbi.nlm.nih.gov/pubmed/25302074
http://dx.doi.org/10.1155/2014/432109
work_keys_str_mv AT hernandeztorrucojose featureselectionforbetteridentificationofsubtypesofguillainbarresyndrome
AT canulreichjuana featureselectionforbetteridentificationofsubtypesofguillainbarresyndrome
AT fraustosolisjuan featureselectionforbetteridentificationofsubtypesofguillainbarresyndrome
AT mendezcastillojuanjose featureselectionforbetteridentificationofsubtypesofguillainbarresyndrome