Cargando…

Classification and Visualization Based on Derived Image Features: Application to Genetic Syndromes

Data transformations prior to analysis may be beneficial in classification tasks. In this article we investigate a set of such transformations on 2D graph-data derived from facial images and their effect on classification accuracy in a high-dimensional setting. These transformations are low-variance...

Descripción completa

Detalles Bibliográficos
Autores principales: Balliu, Brunilda, Würtz, Rolf P., Horsthemke, Bernhard, Wieczorek, Dagmar, Böhringer, Stefan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4236018/
https://www.ncbi.nlm.nih.gov/pubmed/25405460
http://dx.doi.org/10.1371/journal.pone.0109033
_version_ 1782345091663265792
author Balliu, Brunilda
Würtz, Rolf P.
Horsthemke, Bernhard
Wieczorek, Dagmar
Böhringer, Stefan
author_facet Balliu, Brunilda
Würtz, Rolf P.
Horsthemke, Bernhard
Wieczorek, Dagmar
Böhringer, Stefan
author_sort Balliu, Brunilda
collection PubMed
description Data transformations prior to analysis may be beneficial in classification tasks. In this article we investigate a set of such transformations on 2D graph-data derived from facial images and their effect on classification accuracy in a high-dimensional setting. These transformations are low-variance in the sense that each involves only a fixed small number of input features. We show that classification accuracy can be improved when penalized regression techniques are employed, as compared to a principal component analysis (PCA) pre-processing step. In our data example classification accuracy improves from 47% to 62% when switching from PCA to penalized regression. A second goal is to visualize the resulting classifiers. We develop importance plots highlighting the influence of coordinates in the original 2D space. Features used for classification are mapped to coordinates in the original images and combined into an importance measure for each pixel. These plots assist in assessing plausibility of classifiers, interpretation of classifiers, and determination of the relative importance of different features.
format Online
Article
Text
id pubmed-4236018
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-42360182014-11-21 Classification and Visualization Based on Derived Image Features: Application to Genetic Syndromes Balliu, Brunilda Würtz, Rolf P. Horsthemke, Bernhard Wieczorek, Dagmar Böhringer, Stefan PLoS One Research Article Data transformations prior to analysis may be beneficial in classification tasks. In this article we investigate a set of such transformations on 2D graph-data derived from facial images and their effect on classification accuracy in a high-dimensional setting. These transformations are low-variance in the sense that each involves only a fixed small number of input features. We show that classification accuracy can be improved when penalized regression techniques are employed, as compared to a principal component analysis (PCA) pre-processing step. In our data example classification accuracy improves from 47% to 62% when switching from PCA to penalized regression. A second goal is to visualize the resulting classifiers. We develop importance plots highlighting the influence of coordinates in the original 2D space. Features used for classification are mapped to coordinates in the original images and combined into an importance measure for each pixel. These plots assist in assessing plausibility of classifiers, interpretation of classifiers, and determination of the relative importance of different features. Public Library of Science 2014-11-18 /pmc/articles/PMC4236018/ /pubmed/25405460 http://dx.doi.org/10.1371/journal.pone.0109033 Text en © 2014 Balliu et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Balliu, Brunilda
Würtz, Rolf P.
Horsthemke, Bernhard
Wieczorek, Dagmar
Böhringer, Stefan
Classification and Visualization Based on Derived Image Features: Application to Genetic Syndromes
title Classification and Visualization Based on Derived Image Features: Application to Genetic Syndromes
title_full Classification and Visualization Based on Derived Image Features: Application to Genetic Syndromes
title_fullStr Classification and Visualization Based on Derived Image Features: Application to Genetic Syndromes
title_full_unstemmed Classification and Visualization Based on Derived Image Features: Application to Genetic Syndromes
title_short Classification and Visualization Based on Derived Image Features: Application to Genetic Syndromes
title_sort classification and visualization based on derived image features: application to genetic syndromes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4236018/
https://www.ncbi.nlm.nih.gov/pubmed/25405460
http://dx.doi.org/10.1371/journal.pone.0109033
work_keys_str_mv AT balliubrunilda classificationandvisualizationbasedonderivedimagefeaturesapplicationtogeneticsyndromes
AT wurtzrolfp classificationandvisualizationbasedonderivedimagefeaturesapplicationtogeneticsyndromes
AT horsthemkebernhard classificationandvisualizationbasedonderivedimagefeaturesapplicationtogeneticsyndromes
AT wieczorekdagmar classificationandvisualizationbasedonderivedimagefeaturesapplicationtogeneticsyndromes
AT bohringerstefan classificationandvisualizationbasedonderivedimagefeaturesapplicationtogeneticsyndromes