Cargando…

A Balanced Approach to Adaptive Probability Density Estimation

Our development of a Fast (Mutual) Information Matching (FIM) of molecular dynamics time series data led us to the general problem of how to accurately estimate the probability density function of a random variable, especially in cases of very uneven samples. Here, we propose a novel Balanced Adapti...

Descripción completa

Detalles Bibliográficos
Autores principales: Kovacs, Julio A., Helmick, Cailee, Wriggers, Willy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5404644/
https://www.ncbi.nlm.nih.gov/pubmed/28487858
http://dx.doi.org/10.3389/fmolb.2017.00025
_version_ 1783231625643950080
author Kovacs, Julio A.
Helmick, Cailee
Wriggers, Willy
author_facet Kovacs, Julio A.
Helmick, Cailee
Wriggers, Willy
author_sort Kovacs, Julio A.
collection PubMed
description Our development of a Fast (Mutual) Information Matching (FIM) of molecular dynamics time series data led us to the general problem of how to accurately estimate the probability density function of a random variable, especially in cases of very uneven samples. Here, we propose a novel Balanced Adaptive Density Estimation (BADE) method that effectively optimizes the amount of smoothing at each point. To do this, BADE relies on an efficient nearest-neighbor search which results in good scaling for large data sizes. Our tests on simulated data show that BADE exhibits equal or better accuracy than existing methods, and visual tests on univariate and bivariate experimental data show that the results are also aesthetically pleasing. This is due in part to the use of a visual criterion for setting the smoothing level of the density estimate. Our results suggest that BADE offers an attractive new take on the fundamental density estimation problem in statistics. We have applied it on molecular dynamics simulations of membrane pore formation. We also expect BADE to be generally useful for low-dimensional applications in other statistical application domains such as bioinformatics, signal processing and econometrics.
format Online
Article
Text
id pubmed-5404644
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-54046442017-05-09 A Balanced Approach to Adaptive Probability Density Estimation Kovacs, Julio A. Helmick, Cailee Wriggers, Willy Front Mol Biosci Molecular Biosciences Our development of a Fast (Mutual) Information Matching (FIM) of molecular dynamics time series data led us to the general problem of how to accurately estimate the probability density function of a random variable, especially in cases of very uneven samples. Here, we propose a novel Balanced Adaptive Density Estimation (BADE) method that effectively optimizes the amount of smoothing at each point. To do this, BADE relies on an efficient nearest-neighbor search which results in good scaling for large data sizes. Our tests on simulated data show that BADE exhibits equal or better accuracy than existing methods, and visual tests on univariate and bivariate experimental data show that the results are also aesthetically pleasing. This is due in part to the use of a visual criterion for setting the smoothing level of the density estimate. Our results suggest that BADE offers an attractive new take on the fundamental density estimation problem in statistics. We have applied it on molecular dynamics simulations of membrane pore formation. We also expect BADE to be generally useful for low-dimensional applications in other statistical application domains such as bioinformatics, signal processing and econometrics. Frontiers Media S.A. 2017-04-25 /pmc/articles/PMC5404644/ /pubmed/28487858 http://dx.doi.org/10.3389/fmolb.2017.00025 Text en Copyright © 2017 Kovacs, Helmick and Wriggers. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Molecular Biosciences
Kovacs, Julio A.
Helmick, Cailee
Wriggers, Willy
A Balanced Approach to Adaptive Probability Density Estimation
title A Balanced Approach to Adaptive Probability Density Estimation
title_full A Balanced Approach to Adaptive Probability Density Estimation
title_fullStr A Balanced Approach to Adaptive Probability Density Estimation
title_full_unstemmed A Balanced Approach to Adaptive Probability Density Estimation
title_short A Balanced Approach to Adaptive Probability Density Estimation
title_sort balanced approach to adaptive probability density estimation
topic Molecular Biosciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5404644/
https://www.ncbi.nlm.nih.gov/pubmed/28487858
http://dx.doi.org/10.3389/fmolb.2017.00025
work_keys_str_mv AT kovacsjulioa abalancedapproachtoadaptiveprobabilitydensityestimation
AT helmickcailee abalancedapproachtoadaptiveprobabilitydensityestimation
AT wriggerswilly abalancedapproachtoadaptiveprobabilitydensityestimation
AT kovacsjulioa balancedapproachtoadaptiveprobabilitydensityestimation
AT helmickcailee balancedapproachtoadaptiveprobabilitydensityestimation
AT wriggerswilly balancedapproachtoadaptiveprobabilitydensityestimation