Cargando…

A Bayesian Motivated Two-Sample Test Based on Kernel Density Estimates

A new nonparametric test of equality of two densities is investigated. The test statistic is an average of log-Bayes factors, each of which is constructed from a kernel density estimate. Prior densities for the bandwidths of the kernel estimates are required, and it is shown how to choose priors so...

Descripción completa

Detalles Bibliográficos
Autores principales: Merchant, Naveed, Hart, Jeffrey D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9407360/
https://www.ncbi.nlm.nih.gov/pubmed/36010735
http://dx.doi.org/10.3390/e24081071
_version_ 1784774345314271232
author Merchant, Naveed
Hart, Jeffrey D.
author_facet Merchant, Naveed
Hart, Jeffrey D.
author_sort Merchant, Naveed
collection PubMed
description A new nonparametric test of equality of two densities is investigated. The test statistic is an average of log-Bayes factors, each of which is constructed from a kernel density estimate. Prior densities for the bandwidths of the kernel estimates are required, and it is shown how to choose priors so that the log-Bayes factors can be calculated exactly. Critical values of the test statistic are determined by a permutation distribution, conditional on the data. An attractive property of the methodology is that a critical value of 0 leads to a test for which both type I and II error probabilities tend to 0 as sample sizes tend to ∞. Existing results on Kullback–Leibler loss of kernel estimates are crucial to obtaining these asymptotic results, and also imply that the proposed test works best with heavy-tailed kernels. Finite sample characteristics of the test are studied via simulation, and extensions to multivariate data are straightforward, as illustrated by an application to bivariate connectionist data.
format Online
Article
Text
id pubmed-9407360
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-94073602022-08-26 A Bayesian Motivated Two-Sample Test Based on Kernel Density Estimates Merchant, Naveed Hart, Jeffrey D. Entropy (Basel) Article A new nonparametric test of equality of two densities is investigated. The test statistic is an average of log-Bayes factors, each of which is constructed from a kernel density estimate. Prior densities for the bandwidths of the kernel estimates are required, and it is shown how to choose priors so that the log-Bayes factors can be calculated exactly. Critical values of the test statistic are determined by a permutation distribution, conditional on the data. An attractive property of the methodology is that a critical value of 0 leads to a test for which both type I and II error probabilities tend to 0 as sample sizes tend to ∞. Existing results on Kullback–Leibler loss of kernel estimates are crucial to obtaining these asymptotic results, and also imply that the proposed test works best with heavy-tailed kernels. Finite sample characteristics of the test are studied via simulation, and extensions to multivariate data are straightforward, as illustrated by an application to bivariate connectionist data. MDPI 2022-08-03 /pmc/articles/PMC9407360/ /pubmed/36010735 http://dx.doi.org/10.3390/e24081071 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Merchant, Naveed
Hart, Jeffrey D.
A Bayesian Motivated Two-Sample Test Based on Kernel Density Estimates
title A Bayesian Motivated Two-Sample Test Based on Kernel Density Estimates
title_full A Bayesian Motivated Two-Sample Test Based on Kernel Density Estimates
title_fullStr A Bayesian Motivated Two-Sample Test Based on Kernel Density Estimates
title_full_unstemmed A Bayesian Motivated Two-Sample Test Based on Kernel Density Estimates
title_short A Bayesian Motivated Two-Sample Test Based on Kernel Density Estimates
title_sort bayesian motivated two-sample test based on kernel density estimates
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9407360/
https://www.ncbi.nlm.nih.gov/pubmed/36010735
http://dx.doi.org/10.3390/e24081071
work_keys_str_mv AT merchantnaveed abayesianmotivatedtwosampletestbasedonkerneldensityestimates
AT hartjeffreyd abayesianmotivatedtwosampletestbasedonkerneldensityestimates
AT merchantnaveed bayesianmotivatedtwosampletestbasedonkerneldensityestimates
AT hartjeffreyd bayesianmotivatedtwosampletestbasedonkerneldensityestimates