Cargando…

Maximum Entropy Estimation of Probability Distribution of Variables in Higher Dimensions from Lower Dimensional Data

A common statistical situation concerns inferring an unknown distribution Q(x) from a known distribution P(y), where X (dimension n), and Y (dimension m) have a known functional relationship. Most commonly, n ≤ m, and the task is relatively straightforward for well-defined functional relationships....

Descripción completa

Detalles Bibliográficos
Autores principales: Das, Jayajit, Mukherjee, Sayak, Hodge, Susan E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4734403/
https://www.ncbi.nlm.nih.gov/pubmed/26843809
http://dx.doi.org/10.3390/e17074986
_version_ 1782412914441846784
author Das, Jayajit
Mukherjee, Sayak
Hodge, Susan E.
author_facet Das, Jayajit
Mukherjee, Sayak
Hodge, Susan E.
author_sort Das, Jayajit
collection PubMed
description A common statistical situation concerns inferring an unknown distribution Q(x) from a known distribution P(y), where X (dimension n), and Y (dimension m) have a known functional relationship. Most commonly, n ≤ m, and the task is relatively straightforward for well-defined functional relationships. For example, if Y(1) and Y(2) are independent random variables, each uniform on [0, 1], one can determine the distribution of X = Y(1) + Y(2); here m = 2 and n = 1. However, biological and physical situations can arise where n > m and the functional relation Y→X is non-unique. In general, in the absence of additional information, there is no unique solution to Q in those cases. Nevertheless, one may still want to draw some inferences about Q. To this end, we propose a novel maximum entropy (MaxEnt) approach that estimates Q(x) based only on the available data, namely, P(y). The method has the additional advantage that one does not need to explicitly calculate the Lagrange multipliers. In this paper we develop the approach, for both discrete and continuous probability distributions, and demonstrate its validity. We give an intuitive justification as well, and we illustrate with examples.
format Online
Article
Text
id pubmed-4734403
institution National Center for Biotechnology Information
language English
publishDate 2015
record_format MEDLINE/PubMed
spelling pubmed-47344032016-02-01 Maximum Entropy Estimation of Probability Distribution of Variables in Higher Dimensions from Lower Dimensional Data Das, Jayajit Mukherjee, Sayak Hodge, Susan E. Entropy (Basel) Article A common statistical situation concerns inferring an unknown distribution Q(x) from a known distribution P(y), where X (dimension n), and Y (dimension m) have a known functional relationship. Most commonly, n ≤ m, and the task is relatively straightforward for well-defined functional relationships. For example, if Y(1) and Y(2) are independent random variables, each uniform on [0, 1], one can determine the distribution of X = Y(1) + Y(2); here m = 2 and n = 1. However, biological and physical situations can arise where n > m and the functional relation Y→X is non-unique. In general, in the absence of additional information, there is no unique solution to Q in those cases. Nevertheless, one may still want to draw some inferences about Q. To this end, we propose a novel maximum entropy (MaxEnt) approach that estimates Q(x) based only on the available data, namely, P(y). The method has the additional advantage that one does not need to explicitly calculate the Lagrange multipliers. In this paper we develop the approach, for both discrete and continuous probability distributions, and demonstrate its validity. We give an intuitive justification as well, and we illustrate with examples. 2015-07-15 2015-07 /pmc/articles/PMC4734403/ /pubmed/26843809 http://dx.doi.org/10.3390/e17074986 Text en licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Das, Jayajit
Mukherjee, Sayak
Hodge, Susan E.
Maximum Entropy Estimation of Probability Distribution of Variables in Higher Dimensions from Lower Dimensional Data
title Maximum Entropy Estimation of Probability Distribution of Variables in Higher Dimensions from Lower Dimensional Data
title_full Maximum Entropy Estimation of Probability Distribution of Variables in Higher Dimensions from Lower Dimensional Data
title_fullStr Maximum Entropy Estimation of Probability Distribution of Variables in Higher Dimensions from Lower Dimensional Data
title_full_unstemmed Maximum Entropy Estimation of Probability Distribution of Variables in Higher Dimensions from Lower Dimensional Data
title_short Maximum Entropy Estimation of Probability Distribution of Variables in Higher Dimensions from Lower Dimensional Data
title_sort maximum entropy estimation of probability distribution of variables in higher dimensions from lower dimensional data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4734403/
https://www.ncbi.nlm.nih.gov/pubmed/26843809
http://dx.doi.org/10.3390/e17074986
work_keys_str_mv AT dasjayajit maximumentropyestimationofprobabilitydistributionofvariablesinhigherdimensionsfromlowerdimensionaldata
AT mukherjeesayak maximumentropyestimationofprobabilitydistributionofvariablesinhigherdimensionsfromlowerdimensionaldata
AT hodgesusane maximumentropyestimationofprobabilitydistributionofvariablesinhigherdimensionsfromlowerdimensionaldata