Showing 1–6 of 6 results for author: Daunhawer, I

  1. arXiv:2401.13555

    cs.CV cs.AI cs.LG

    Benchmarking the Fairness of Image Upsampling Methods

    Authors: Mike Laszkiewicz, Imant Daunhawer, Julia E. Vogt, Asja Fischer, Johannes Lederer

    Abstract: Recent years have witnessed a rapid development of deep generative models for creating synthetic media, such as images and videos. While the practical applications of these models in everyday tasks are enticing, it is crucial to assess the inherent risks regarding their fairness. In this work, we introduce a comprehensive framework for benchmarking the performance and fairness of conditional gener…

    Submitted 29 April, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published at the 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT '24)

  2. arXiv:2303.09166

    cs.LG stat.ML

    Identifiability Results for Multimodal Contrastive Learning

    Authors: Imant Daunhawer, Alice Bizeul, Emanuele Palumbo, Alexander Marx, Julia E. Vogt

    Abstract: Contrastive learning is a cornerstone underlying recent progress in multi-view and multimodal learning, e.g., in representation learning with image/caption pairs. While its effectiveness is not yet fully understood, a line of recent work reveals that contrastive learning can invert the data generating process and recover ground truth latent factors shared between views. In this work, we present ne…

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: ICLR 2023 camera-ready version

  3. arXiv:2206.08871

    cs.LG stat.ML

    How Robust is Unsupervised Representation Learning to Distribution Shift?

    Authors: Yuge Shi, Imant Daunhawer, Julia E. Vogt, Philip H. S. Torr, Amartya Sanyal

    Abstract: The robustness of machine learning algorithms to distribution shift is primarily discussed in the context of supervised learning (SL). As such, there is a lack of insight on the robustness of the representations learned from unsupervised methods, such as self-supervised learning (SSL) and auto-encoder based algorithms (AE), to distribution shift. We posit that the input-driven objectives of unsup…

    Submitted 16 December, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

  4. arXiv:2110.04121

    cs.LG

    On the Limitations of Multimodal VAEs

    Authors: Imant Daunhawer, Thomas M. Sutter, Kieran Chin-Cheong, Emanuele Palumbo, Julia E. Vogt

    Abstract: Multimodal variational autoencoders (VAEs) have shown promise as efficient generative models for weakly-supervised data. Yet, despite their advantage of weak supervision, they exhibit a gap in generative quality compared to unimodal VAEs, which are completely unsupervised. In an attempt to explain this gap, we uncover a fundamental limitation that applies to a large family of mixture-based multimo…

    Submitted 7 April, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: ICLR 2022 camera-ready version

  5. arXiv:2105.02470

    cs.LG stat.ML

    Generalized Multimodal ELBO

    Authors: Thomas M. Sutter, Imant Daunhawer, Julia E. Vogt

    Abstract: Multiple data types naturally co-occur when describing real-world phenomena and learning from them is a long-standing goal in machine learning research. However, existing self-supervised generative models approximating an ELBO are not able to fulfill all desired requirements of multimodal models: their posterior approximation functions lead to a trade-off between the semantic coherence and the abi…

    Submitted 25 June, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: 2021 ICLR

  6. arXiv:2006.08242

    cs.LG stat.ML

    Multimodal Generative Learning Utilizing Jensen-Shannon-Divergence

    Authors: Thomas M. Sutter, Imant Daunhawer, Julia E. Vogt

    Abstract: Learning from different data types is a long-standing goal in machine learning research, as multiple information sources co-occur when describing natural phenomena. However, existing generative models that approximate a multimodal ELBO rely on difficult or inefficient training schemes to learn a joint distribution and the dependencies between modalities. In this work, we propose a novel, efficient…

    Submitted 2 November, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: Accepted at NeurIPS 2020, camera-ready version

    Journal ref: 34th Conference on Neural Information Processing Systems (NeurIPS 2020)