Cargando…

A Projection and Density Estimation Method for Knowledge Discovery

A key ingredient to modern data analysis is probability density estimation. However, it is well known that the curse of dimensionality prevents a proper estimation of densities in high dimensions. The problem is typically circumvented by using a fixed set of assumptions about the data, e.g., by assu...

Descripción completa

Detalles Bibliográficos
Autores principales: Stanski, Adam, Hellwich, Olaf
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3462203/
https://www.ncbi.nlm.nih.gov/pubmed/23049675
http://dx.doi.org/10.1371/journal.pone.0044495
_version_ 1782245158440402944
author Stanski, Adam
Hellwich, Olaf
author_facet Stanski, Adam
Hellwich, Olaf
author_sort Stanski, Adam
collection PubMed
description A key ingredient to modern data analysis is probability density estimation. However, it is well known that the curse of dimensionality prevents a proper estimation of densities in high dimensions. The problem is typically circumvented by using a fixed set of assumptions about the data, e.g., by assuming partial independence of features, data on a manifold or a customized kernel. These fixed assumptions limit the applicability of a method. In this paper we propose a framework that uses a flexible set of assumptions instead. It allows to tailor a model to various problems by means of 1d-decompositions. The approach achieves a fast runtime and is not limited by the curse of dimensionality as all estimations are performed in 1d-space. The wide range of applications is demonstrated at two very different real world examples. The first is a data mining software that allows the fully automatic discovery of patterns. The software is publicly available for evaluation. As a second example an image segmentation method is realized. It achieves state of the art performance on a benchmark dataset although it uses only a fraction of the training data and very simple features.
format Online
Article
Text
id pubmed-3462203
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-34622032012-10-05 A Projection and Density Estimation Method for Knowledge Discovery Stanski, Adam Hellwich, Olaf PLoS One Research Article A key ingredient to modern data analysis is probability density estimation. However, it is well known that the curse of dimensionality prevents a proper estimation of densities in high dimensions. The problem is typically circumvented by using a fixed set of assumptions about the data, e.g., by assuming partial independence of features, data on a manifold or a customized kernel. These fixed assumptions limit the applicability of a method. In this paper we propose a framework that uses a flexible set of assumptions instead. It allows to tailor a model to various problems by means of 1d-decompositions. The approach achieves a fast runtime and is not limited by the curse of dimensionality as all estimations are performed in 1d-space. The wide range of applications is demonstrated at two very different real world examples. The first is a data mining software that allows the fully automatic discovery of patterns. The software is publicly available for evaluation. As a second example an image segmentation method is realized. It achieves state of the art performance on a benchmark dataset although it uses only a fraction of the training data and very simple features. Public Library of Science 2012-10-01 /pmc/articles/PMC3462203/ /pubmed/23049675 http://dx.doi.org/10.1371/journal.pone.0044495 Text en © 2012 Stanski, Hellwich http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Stanski, Adam
Hellwich, Olaf
A Projection and Density Estimation Method for Knowledge Discovery
title A Projection and Density Estimation Method for Knowledge Discovery
title_full A Projection and Density Estimation Method for Knowledge Discovery
title_fullStr A Projection and Density Estimation Method for Knowledge Discovery
title_full_unstemmed A Projection and Density Estimation Method for Knowledge Discovery
title_short A Projection and Density Estimation Method for Knowledge Discovery
title_sort projection and density estimation method for knowledge discovery
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3462203/
https://www.ncbi.nlm.nih.gov/pubmed/23049675
http://dx.doi.org/10.1371/journal.pone.0044495
work_keys_str_mv AT stanskiadam aprojectionanddensityestimationmethodforknowledgediscovery
AT hellwicholaf aprojectionanddensityestimationmethodforknowledgediscovery
AT stanskiadam projectionanddensityestimationmethodforknowledgediscovery
AT hellwicholaf projectionanddensityestimationmethodforknowledgediscovery