Statistics > Methodology
[Submitted on 20 Apr 2022 (v1), last revised 15 Jul 2022 (this version, v2)]
Title:Integral, mean and covariance of the simplex-truncated multivariate normal distribution
View PDFAbstract:Compositional data, which is data consisting of fractions or probabilities, is common in many fields including ecology, economics, physical science and political science. If these data would otherwise be normally distributed, their spread can be conveniently represented by a multivariate normal distribution truncated to the non-negative space under a unit simplex. Here this distribution is called the simplex-truncated multivariate normal distribution. For calculations on truncated distributions, it is often useful to obtain rapid estimates of their integral, mean and covariance; these quantities characterising the truncated distribution will generally possess different values to the corresponding non-truncated distribution. In this paper, three different approaches that can estimate the integral, mean and covariance of any simplex-truncated multivariate normal distribution are described and compared. These three approaches are (1) naive rejection sampling, (2) a method described by Gessner et al. that unifies subset simulation and the Holmes-Diaconis-Ross algorithm with an analytical version of elliptical slice sampling, and (3) a semi-analytical method that expresses the integral, mean and covariance in terms of integrals of hyperrectangularly-truncated multivariate normal distributions, the latter of which are readily computed in modern mathematical and statistical packages. Strong agreement is demonstrated between all three approaches, but the most computationally efficient approach depends strongly both on implementation details and the dimension of the simplex-truncated multivariate normal distribution. For computations in low-dimensional distributions, the semi-analytical method is fast and thus should be considered. As the dimension increases, the Gessner et al. method becomes the only practically efficient approach of the methods tested here.
Submission history
From: Matthew Adams [view email][v1] Wed, 20 Apr 2022 06:40:23 UTC (1,274 KB)
[v2] Fri, 15 Jul 2022 00:36:01 UTC (1,276 KB)
Ancillary-file links:
Ancillary files (details):
- Overarching_Comparison_of_Methods_Part_1_Calculations.m
- Overarching_Comparison_of_Methods_Part_2_Figures.m
- Results_Dimension_10.mat
- Results_Dimension_2.mat
- Results_Dimension_3.mat
- Results_Dimension_4.mat
- Results_Dimension_5.mat
- Results_Dimension_6.mat
- Results_Dimension_7.mat
- Results_Dimension_8.mat
- Results_Dimension_9.mat
- subcode_STMND_Using_Gessner_et_al_Method.m
- subcode_STMND_Using_Naive_Rejection_Sampling.m
- subcode_STMND_Using_Semi_Analytical_Method.m
- subcode_STMND_Using_Semi_Analytical_Method_Bivariate_Only.m
References & Citations
Bibliographic and Citation Tools
Bibliographic Explorer (What is the Explorer?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Code, Data and Media Associated with this Article
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)
Demos
Recommenders and Search Tools
Influence Flower (What are Influence Flowers?)
Connected Papers (What is Connected Papers?)
CORE Recommender (What is CORE?)
arXivLabs: experimental projects with community collaborators
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.