Cargando…
Contingent Kernel Density Estimation
Kernel density estimation is a widely used method for estimating a distribution based on a sample of points drawn from that distribution. Generally, in practice some form of error contaminates the sample of observed points. Such error can be the result of imprecise measurements or observation bias....
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3286465/ https://www.ncbi.nlm.nih.gov/pubmed/22383966 http://dx.doi.org/10.1371/journal.pone.0030549 |
_version_ | 1782224564267253760 |
---|---|
author | Fortmann-Roe, Scott Starfield, Richard Getz, Wayne M. |
author_facet | Fortmann-Roe, Scott Starfield, Richard Getz, Wayne M. |
author_sort | Fortmann-Roe, Scott |
collection | PubMed |
description | Kernel density estimation is a widely used method for estimating a distribution based on a sample of points drawn from that distribution. Generally, in practice some form of error contaminates the sample of observed points. Such error can be the result of imprecise measurements or observation bias. Often this error is negligible and may be disregarded in analysis. In cases where the error is non-negligible, estimation methods should be adjusted to reduce resulting bias. Several modifications of kernel density estimation have been developed to address specific forms of errors. One form of error that has not yet been addressed is the case where observations are nominally placed at the centers of areas from which the points are assumed to have been drawn, where these areas are of varying sizes. In this scenario, the bias arises because the size of the error can vary among points and some subset of points can be known to have smaller error than another subset or the form of the error may change among points. This paper proposes a “contingent kernel density estimation” technique to address this form of error. This new technique adjusts the standard kernel on a point-by-point basis in an adaptive response to changing structure and magnitude of error. In this paper, equations for our contingent kernel technique are derived, the technique is validated using numerical simulations, and an example using the geographic locations of social networking users is worked to demonstrate the utility of the method. |
format | Online Article Text |
id | pubmed-3286465 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-32864652012-03-01 Contingent Kernel Density Estimation Fortmann-Roe, Scott Starfield, Richard Getz, Wayne M. PLoS One Research Article Kernel density estimation is a widely used method for estimating a distribution based on a sample of points drawn from that distribution. Generally, in practice some form of error contaminates the sample of observed points. Such error can be the result of imprecise measurements or observation bias. Often this error is negligible and may be disregarded in analysis. In cases where the error is non-negligible, estimation methods should be adjusted to reduce resulting bias. Several modifications of kernel density estimation have been developed to address specific forms of errors. One form of error that has not yet been addressed is the case where observations are nominally placed at the centers of areas from which the points are assumed to have been drawn, where these areas are of varying sizes. In this scenario, the bias arises because the size of the error can vary among points and some subset of points can be known to have smaller error than another subset or the form of the error may change among points. This paper proposes a “contingent kernel density estimation” technique to address this form of error. This new technique adjusts the standard kernel on a point-by-point basis in an adaptive response to changing structure and magnitude of error. In this paper, equations for our contingent kernel technique are derived, the technique is validated using numerical simulations, and an example using the geographic locations of social networking users is worked to demonstrate the utility of the method. Public Library of Science 2012-02-24 /pmc/articles/PMC3286465/ /pubmed/22383966 http://dx.doi.org/10.1371/journal.pone.0030549 Text en Fortmann-Roe et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Fortmann-Roe, Scott Starfield, Richard Getz, Wayne M. Contingent Kernel Density Estimation |
title | Contingent Kernel Density Estimation |
title_full | Contingent Kernel Density Estimation |
title_fullStr | Contingent Kernel Density Estimation |
title_full_unstemmed | Contingent Kernel Density Estimation |
title_short | Contingent Kernel Density Estimation |
title_sort | contingent kernel density estimation |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3286465/ https://www.ncbi.nlm.nih.gov/pubmed/22383966 http://dx.doi.org/10.1371/journal.pone.0030549 |
work_keys_str_mv | AT fortmannroescott contingentkerneldensityestimation AT starfieldrichard contingentkerneldensityestimation AT getzwaynem contingentkerneldensityestimation |