Showing 1–2 of 2 results for author: Barin-Pacela, V

Search v0.5.6 released 2020-02-24

arXiv:2306.16334 [pdf, other]

cs.LG cs.AI

On the Identifiability of Quantized Factors

Authors: Vitória Barin-Pacela, Kartik Ahuja, Simon Lacoste-Julien, Pascal Vincent

Abstract: Disentanglement aims to recover meaningful latent ground-truth factors from the observed distribution solely, and is formalized through the theory of identifiability. The identifiability of independent latent factors is proven to be impossible in the unsupervised i.i.d. setting under a general nonlinear map from factors to observations. In this work, however, we demonstrate that it is possible to recover quantized latent factors under a generic nonlinear diffeomorphism. We only assume that the latent factors have independent discontinuities in their density, without requiring the factors to be statistically independent. We introduce this novel form of identifiability, termed quantized factor identifiability, and provide a comprehensive proof of the recovery of the quantized factors.

Submitted 12 March, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

Comments: Appears in: 3rd Conference on Causal Learning and Reasoning (CLeaR 2024). 39 pages

arXiv:2111.15431 [pdf, other]

cs.LG stat.ML

Binary Independent Component Analysis: A Non-stationarity-based Approach

Authors: Antti Hyttinen, Vitória Barin-Pacela, Aapo Hyvärinen

Abstract: We consider independent component analysis of binary data. While fundamental in practice, this case has been much less developed than ICA for continuous data. We start by assuming a linear mixing model in a continuous-valued latent space, followed by a binary observation model. Importantly, we assume that the sources are non-stationary; this is necessary since any non-Gaussianity would essentially be destroyed by the binarization. Interestingly, the model allows for closed-form likelihood by employing the cumulative distribution function of the multivariate Gaussian distribution. In stark contrast to the continuous-valued case, we prove non-identifiability of the model with few observed variables; our empirical results imply identifiability when the number of observed variables is higher. We present a practical method for binary ICA that uses only pairwise marginals, which are faster to compute than the full multivariate likelihood. Experiments give insight into the requirements for the number of observed variables, segments, and latent sources that allow the model to be estimated.

Submitted 2 August, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

Comments: This is an updated version (including a slight name change) which was published at UAI2022
