Cargando…

Semiparametric maximum likelihood probability density estimation

A comprehensive methodology for semiparametric probability density estimation is introduced and explored. The probability density is modelled by sequences of mostly regular or steep exponential families generated by flexible sets of basis functions, possibly including boundary terms. Parameters are...

Descripción completa

Detalles Bibliográficos
Autor principal: Kwasniok, Frank
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8577774/
https://www.ncbi.nlm.nih.gov/pubmed/34752460
http://dx.doi.org/10.1371/journal.pone.0259111
_version_ 1784596129680195584
author Kwasniok, Frank
author_facet Kwasniok, Frank
author_sort Kwasniok, Frank
collection PubMed
description A comprehensive methodology for semiparametric probability density estimation is introduced and explored. The probability density is modelled by sequences of mostly regular or steep exponential families generated by flexible sets of basis functions, possibly including boundary terms. Parameters are estimated by global maximum likelihood without any roughness penalty. A statistically orthogonal formulation of the inference problem and a numerically stable and fast convex optimization algorithm for its solution are presented. Automatic model selection over the type and number of basis functions is performed with the Bayesian information criterion. The methodology can naturally be applied to densities supported on bounded, infinite or semi-infinite domains without boundary bias. Relationships to the truncated moment problem and the moment-constrained maximum entropy principle are discussed and a new theorem on the existence of solutions is contributed. The new technique compares very favourably to kernel density estimation, the diffusion estimator, finite mixture models and local likelihood density estimation across a diverse range of simulation and observation data sets. The semiparametric estimator combines a very small mean integrated squared error with a high degree of smoothness which allows for a robust and reliable detection of the modality of the probability density in terms of the number of modes and bumps.
format Online
Article
Text
id pubmed-8577774
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-85777742021-11-10 Semiparametric maximum likelihood probability density estimation Kwasniok, Frank PLoS One Research Article A comprehensive methodology for semiparametric probability density estimation is introduced and explored. The probability density is modelled by sequences of mostly regular or steep exponential families generated by flexible sets of basis functions, possibly including boundary terms. Parameters are estimated by global maximum likelihood without any roughness penalty. A statistically orthogonal formulation of the inference problem and a numerically stable and fast convex optimization algorithm for its solution are presented. Automatic model selection over the type and number of basis functions is performed with the Bayesian information criterion. The methodology can naturally be applied to densities supported on bounded, infinite or semi-infinite domains without boundary bias. Relationships to the truncated moment problem and the moment-constrained maximum entropy principle are discussed and a new theorem on the existence of solutions is contributed. The new technique compares very favourably to kernel density estimation, the diffusion estimator, finite mixture models and local likelihood density estimation across a diverse range of simulation and observation data sets. The semiparametric estimator combines a very small mean integrated squared error with a high degree of smoothness which allows for a robust and reliable detection of the modality of the probability density in terms of the number of modes and bumps. Public Library of Science 2021-11-09 /pmc/articles/PMC8577774/ /pubmed/34752460 http://dx.doi.org/10.1371/journal.pone.0259111 Text en © 2021 Frank Kwasniok https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Kwasniok, Frank
Semiparametric maximum likelihood probability density estimation
title Semiparametric maximum likelihood probability density estimation
title_full Semiparametric maximum likelihood probability density estimation
title_fullStr Semiparametric maximum likelihood probability density estimation
title_full_unstemmed Semiparametric maximum likelihood probability density estimation
title_short Semiparametric maximum likelihood probability density estimation
title_sort semiparametric maximum likelihood probability density estimation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8577774/
https://www.ncbi.nlm.nih.gov/pubmed/34752460
http://dx.doi.org/10.1371/journal.pone.0259111
work_keys_str_mv AT kwasniokfrank semiparametricmaximumlikelihoodprobabilitydensityestimation