Showing 1–6 of 6 results for author: Javaloy, A

Searching in archive cs.
  1. arXiv:2306.05415  [pdf, other]

    cs.LG cs.AI stat.ME stat.ML

    Causal normalizing flows: from theory to practice

    Authors: Adrián Javaloy, Pablo Sánchez-Martín, Isabel Valera

    Abstract: In this work, we deepen on the use of normalizing flows for causal reasoning. Specifically, we first leverage recent results on non-linear ICA to show that causal models are identifiable from observational data given a causal ordering, and thus can be recovered using autoregressive normalizing flows (NFs). Second, we analyze different design and learning choices for causal normalizing flows to cap…

    Submitted 8 December, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: 32 pages, 15 figures. Accepted as an Oral presentation at NeurIPS 2023

  2. arXiv:2211.11853  [pdf, other]

    cs.LG

    Learnable Graph Convolutional Attention Networks

    Authors: Adrián Javaloy, Pablo Sanchez-Martin, Amit Levi, Isabel Valera

    Abstract: Existing Graph Neural Networks (GNNs) compute the message exchange between nodes by either aggregating uniformly (convolving) the features of all the neighboring nodes, or by applying a non-uniform score (attending) to the features. Recent works have shown the strengths and weaknesses of the resulting GNN architectures, respectively, GCNs and GATs. In this work, we aim at exploiting the strengths…

    Submitted 28 February, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: Published as a conference paper at ICLR 2023. 35 pages, 5 figures

  3. arXiv:2206.04496  [pdf, other]

    cs.LG

    Mitigating Modality Collapse in Multimodal VAEs via Impartial Optimization

    Authors: Adrián Javaloy, Maryam Meghdadi, Isabel Valera

    Abstract: A number of variational autoencoders (VAEs) have recently emerged with the aim of modeling multimodal data, e.g., to jointly model images and their corresponding captions. Still, multimodal VAEs tend to focus solely on a subset of the modalities, e.g., by fitting the image while neglecting the caption. We refer to this limitation as modality collapse. In this work, we argue that this effect is a c…

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: Accepted as a Spotlight paper at ICML 2022. 27 pages, 10 figures

  4. arXiv:2103.02631  [pdf, other]

    cs.LG

    RotoGrad: Gradient Homogenization in Multitask Learning

    Authors: Adrián Javaloy, Isabel Valera

    Abstract: Multitask learning is being increasingly adopted in applications domains like computer vision and reinforcement learning. However, optimally exploiting its advantages remains a major challenge due to the effect of negative transfer. Previous works have tracked down this issue to the disparities in gradient magnitudes and directions across tasks, when optimizing the shared network parameters. While…

    Submitted 16 February, 2022; v1 submitted 3 March, 2021; originally announced March 2021.

    Comments: Spotlight at ICLR 2022. 24 pages, 9 figures

  5. arXiv:2006.15090  [pdf, other]

    stat.ML cs.LG

    Relative gradient optimization of the Jacobian term in unsupervised deep learning

    Authors: Luigi Gresele, Giancarlo Fissore, Adrián Javaloy, Bernhard Schölkopf, Aapo Hyvärinen

    Abstract: Learning expressive probabilistic models correctly describing the data is a ubiquitous problem in machine learning. A popular approach for solving it is mapping the observations into a representation space with a simple joint distribution, which can typically be written as a product of its marginals -- thus drawing a connection with the field of nonlinear independent component analysis. Deep densi…

    Submitted 26 October, 2020; v1 submitted 26 June, 2020; originally announced June 2020.

  6. arXiv:2002.11369  [pdf, other]

    cs.LG stat.ML

    Lipschitz standardization for multivariate learning

    Authors: Adrián Javaloy, Isabel Valera

    Abstract: Probabilistic learning is increasingly being tackled as an optimization problem, with gradient-based approaches as predominant methods. When modelling multivariate likelihoods, a usual but undesirable outcome is that the learned model fits only a subset of the observed variables, overlooking the rest. In this work, we study this problem through the lens of multitask learning (MTL), where similar e…

    Submitted 21 October, 2020; v1 submitted 26 February, 2020; originally announced February 2020.