Cargando…

Integral, mean and covariance of the simplex-truncated multivariate normal distribution

Compositional data, which is data consisting of fractions or probabilities, is common in many fields including ecology, economics, physical science and political science. If these data would otherwise be normally distributed, their spread can be conveniently represented by a multivariate normal dist...

Descripción completa

Detalles Bibliográficos
Autor principal: Adams, Matthew P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9307197/
https://www.ncbi.nlm.nih.gov/pubmed/35867671
http://dx.doi.org/10.1371/journal.pone.0272014
_version_ 1784752707164176384
author Adams, Matthew P.
author_facet Adams, Matthew P.
author_sort Adams, Matthew P.
collection PubMed
description Compositional data, which is data consisting of fractions or probabilities, is common in many fields including ecology, economics, physical science and political science. If these data would otherwise be normally distributed, their spread can be conveniently represented by a multivariate normal distribution truncated to the non-negative space under a unit simplex. Here this distribution is called the simplex-truncated multivariate normal distribution. For calculations on truncated distributions, it is often useful to obtain rapid estimates of their integral, mean and covariance; these quantities characterising the truncated distribution will generally possess different values to the corresponding non-truncated distribution. In this paper, three different approaches that can estimate the integral, mean and covariance of any simplex-truncated multivariate normal distribution are described and compared. These three approaches are (1) naive rejection sampling, (2) a method described by Gessner et al. that unifies subset simulation and the Holmes-Diaconis-Ross algorithm with an analytical version of elliptical slice sampling, and (3) a semi-analytical method that expresses the integral, mean and covariance in terms of integrals of hyperrectangularly-truncated multivariate normal distributions, the latter of which are readily computed in modern mathematical and statistical packages. Strong agreement is demonstrated between all three approaches, but the most computationally efficient approach depends strongly both on implementation details and the dimension of the simplex-truncated multivariate normal distribution. For computations in low-dimensional distributions, the semi-analytical method is fast and thus should be considered. As the dimension increases, the Gessner et al. method becomes the only practically efficient approach of the methods tested here.
format Online
Article
Text
id pubmed-9307197
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-93071972022-07-23 Integral, mean and covariance of the simplex-truncated multivariate normal distribution Adams, Matthew P. PLoS One Research Article Compositional data, which is data consisting of fractions or probabilities, is common in many fields including ecology, economics, physical science and political science. If these data would otherwise be normally distributed, their spread can be conveniently represented by a multivariate normal distribution truncated to the non-negative space under a unit simplex. Here this distribution is called the simplex-truncated multivariate normal distribution. For calculations on truncated distributions, it is often useful to obtain rapid estimates of their integral, mean and covariance; these quantities characterising the truncated distribution will generally possess different values to the corresponding non-truncated distribution. In this paper, three different approaches that can estimate the integral, mean and covariance of any simplex-truncated multivariate normal distribution are described and compared. These three approaches are (1) naive rejection sampling, (2) a method described by Gessner et al. that unifies subset simulation and the Holmes-Diaconis-Ross algorithm with an analytical version of elliptical slice sampling, and (3) a semi-analytical method that expresses the integral, mean and covariance in terms of integrals of hyperrectangularly-truncated multivariate normal distributions, the latter of which are readily computed in modern mathematical and statistical packages. Strong agreement is demonstrated between all three approaches, but the most computationally efficient approach depends strongly both on implementation details and the dimension of the simplex-truncated multivariate normal distribution. For computations in low-dimensional distributions, the semi-analytical method is fast and thus should be considered. As the dimension increases, the Gessner et al. method becomes the only practically efficient approach of the methods tested here. Public Library of Science 2022-07-22 /pmc/articles/PMC9307197/ /pubmed/35867671 http://dx.doi.org/10.1371/journal.pone.0272014 Text en © 2022 Matthew P. Adams https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Adams, Matthew P.
Integral, mean and covariance of the simplex-truncated multivariate normal distribution
title Integral, mean and covariance of the simplex-truncated multivariate normal distribution
title_full Integral, mean and covariance of the simplex-truncated multivariate normal distribution
title_fullStr Integral, mean and covariance of the simplex-truncated multivariate normal distribution
title_full_unstemmed Integral, mean and covariance of the simplex-truncated multivariate normal distribution
title_short Integral, mean and covariance of the simplex-truncated multivariate normal distribution
title_sort integral, mean and covariance of the simplex-truncated multivariate normal distribution
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9307197/
https://www.ncbi.nlm.nih.gov/pubmed/35867671
http://dx.doi.org/10.1371/journal.pone.0272014
work_keys_str_mv AT adamsmatthewp integralmeanandcovarianceofthesimplextruncatedmultivariatenormaldistribution