Cargando…

Contingent Kernel Density Estimation

Kernel density estimation is a widely used method for estimating a distribution based on a sample of points drawn from that distribution. Generally, in practice some form of error contaminates the sample of observed points. Such error can be the result of imprecise measurements or observation bias....

Descripción completa

Detalles Bibliográficos
Autores principales: Fortmann-Roe, Scott, Starfield, Richard, Getz, Wayne M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3286465/
https://www.ncbi.nlm.nih.gov/pubmed/22383966
http://dx.doi.org/10.1371/journal.pone.0030549
_version_ 1782224564267253760
author Fortmann-Roe, Scott
Starfield, Richard
Getz, Wayne M.
author_facet Fortmann-Roe, Scott
Starfield, Richard
Getz, Wayne M.
author_sort Fortmann-Roe, Scott
collection PubMed
description Kernel density estimation is a widely used method for estimating a distribution based on a sample of points drawn from that distribution. Generally, in practice some form of error contaminates the sample of observed points. Such error can be the result of imprecise measurements or observation bias. Often this error is negligible and may be disregarded in analysis. In cases where the error is non-negligible, estimation methods should be adjusted to reduce resulting bias. Several modifications of kernel density estimation have been developed to address specific forms of errors. One form of error that has not yet been addressed is the case where observations are nominally placed at the centers of areas from which the points are assumed to have been drawn, where these areas are of varying sizes. In this scenario, the bias arises because the size of the error can vary among points and some subset of points can be known to have smaller error than another subset or the form of the error may change among points. This paper proposes a “contingent kernel density estimation” technique to address this form of error. This new technique adjusts the standard kernel on a point-by-point basis in an adaptive response to changing structure and magnitude of error. In this paper, equations for our contingent kernel technique are derived, the technique is validated using numerical simulations, and an example using the geographic locations of social networking users is worked to demonstrate the utility of the method.
format Online
Article
Text
id pubmed-3286465
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-32864652012-03-01 Contingent Kernel Density Estimation Fortmann-Roe, Scott Starfield, Richard Getz, Wayne M. PLoS One Research Article Kernel density estimation is a widely used method for estimating a distribution based on a sample of points drawn from that distribution. Generally, in practice some form of error contaminates the sample of observed points. Such error can be the result of imprecise measurements or observation bias. Often this error is negligible and may be disregarded in analysis. In cases where the error is non-negligible, estimation methods should be adjusted to reduce resulting bias. Several modifications of kernel density estimation have been developed to address specific forms of errors. One form of error that has not yet been addressed is the case where observations are nominally placed at the centers of areas from which the points are assumed to have been drawn, where these areas are of varying sizes. In this scenario, the bias arises because the size of the error can vary among points and some subset of points can be known to have smaller error than another subset or the form of the error may change among points. This paper proposes a “contingent kernel density estimation” technique to address this form of error. This new technique adjusts the standard kernel on a point-by-point basis in an adaptive response to changing structure and magnitude of error. In this paper, equations for our contingent kernel technique are derived, the technique is validated using numerical simulations, and an example using the geographic locations of social networking users is worked to demonstrate the utility of the method. Public Library of Science 2012-02-24 /pmc/articles/PMC3286465/ /pubmed/22383966 http://dx.doi.org/10.1371/journal.pone.0030549 Text en Fortmann-Roe et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Fortmann-Roe, Scott
Starfield, Richard
Getz, Wayne M.
Contingent Kernel Density Estimation
title Contingent Kernel Density Estimation
title_full Contingent Kernel Density Estimation
title_fullStr Contingent Kernel Density Estimation
title_full_unstemmed Contingent Kernel Density Estimation
title_short Contingent Kernel Density Estimation
title_sort contingent kernel density estimation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3286465/
https://www.ncbi.nlm.nih.gov/pubmed/22383966
http://dx.doi.org/10.1371/journal.pone.0030549
work_keys_str_mv AT fortmannroescott contingentkerneldensityestimation
AT starfieldrichard contingentkerneldensityestimation
AT getzwaynem contingentkerneldensityestimation