Showing 1–29 of 29 results for author: Bouchacourt, D

Searching in archive cs.
  1. arXiv:2406.05183

    cs.LG cs.AI cs.CL

    The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More

    Authors: Ouail Kitouni, Niklas Nolte, Diane Bouchacourt, Adina Williams, Mike Rabbat, Mark Ibrahim

    Abstract: Today's best language models still struggle with hallucinations: factually incorrect generations, which impede their ability to reliably retrieve information seen during training. The reversal curse, where models cannot recall information when probed in a different order than was encountered during training, exemplifies this in information retrieval. We reframe the reversal curse as a factorizatio…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 18 pages, 7 figures

  2. arXiv:2405.17247

    cs.LG

    An Introduction to Vision-Language Modeling

    Authors: Florian Bordes, Richard Yuanzhe Pang, Anurag Ajay, Alexander C. Li, Adrien Bardes, Suzanne Petryk, Oscar Mañas, Zhiqiu Lin, Anas Mahmoud, Bargav Jayaraman, Mark Ibrahim, Melissa Hall, Yunyang Xiong, Jonathan Lebensold, Candace Ross, Srihari Jayakumar, Chuan Guo, Diane Bouchacourt, Haider Al-Tahan, Karthik Padthe, Vasu Sharma, Hu Xu, Xiaoqing Ellen Tan, Megan Richards, Samuel Lavoie, et al. (16 additional authors not shown)

    Abstract: Following the recent popularity of Large Language Models (LLMs), several attempts have been made to extend them to the visual domain. From having a visual assistant that could guide us through unfamiliar environments to generative models that produce images using only a high-level text description, the vision-language model (VLM) applications will significantly impact our relationship with technol…

    Submitted 27 May, 2024; originally announced May 2024.

  3. arXiv:2404.16717

    cs.CV cs.AI cs.HC

    Embracing Diversity: Interpretable Zero-shot classification beyond one vector per class

    Authors: Mazda Moayeri, Michael Rabbat, Mark Ibrahim, Diane Bouchacourt

    Abstract: Vision-language models enable open-world classification of objects without the need for any retraining. While this zero-shot paradigm marks a significant advance, even today's best models exhibit skewed performance when objects are dissimilar from their typical depiction. Real world objects such as pears appear in a variety of forms -- from diced to whole, on a table or in a bowl -- yet standard V…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted to FAccT 2024

  4. arXiv:2401.01764

    cs.CV cs.LG

    Understanding the Detrimental Class-level Effects of Data Augmentation

    Authors: Polina Kirichenko, Mark Ibrahim, Randall Balestriero, Diane Bouchacourt, Ramakrishna Vedantam, Hamed Firooz, Andrew Gordon Wilson

    Abstract: Data augmentation (DA) encodes invariance and provides implicit regularization critical to a model's performance in image classification tasks. However, while DA improves average accuracy, recent studies have shown that its impact can be highly class dependent: achieving optimal average accuracy comes at the cost of significantly hurting individual class accuracy by as much as 20% on ImageNet. The…

    Submitted 7 December, 2023; originally announced January 2024.

    Comments: Neural Information Processing Systems (NeurIPS), 2023

  5. arXiv:2311.08815

    cs.LG cs.AI cs.CV stat.ML

    Self-Supervised Disentanglement by Leveraging Structure in Data Augmentations

    Authors: Cian Eastwood, Julius von Kügelgen, Linus Ericsson, Diane Bouchacourt, Pascal Vincent, Bernhard Schölkopf, Mark Ibrahim

    Abstract: Self-supervised representation learning often uses data augmentations to induce some invariance to "style" attributes of the data. However, with downstream tasks generally unknown at training time, it is difficult to deduce a priori which attributes of the data are indeed "style" and can be safely discarded. To address this, we introduce a more principled approach that seeks to disentangle style f…

    Submitted 15 November, 2023; originally announced November 2023.

  6. arXiv:2309.16748

    cs.LG cs.AI stat.ML

    Discovering environments with XRM

    Authors: Mohammad Pezeshki, Diane Bouchacourt, Mark Ibrahim, Nicolas Ballas, Pascal Vincent, David Lopez-Paz

    Abstract: Successful out-of-distribution generalization requires environment annotations. Unfortunately, these are resource-intensive to obtain, and their relevance to model performance is limited by the expectations and perceptual biases of human annotators. Therefore, to enable robust AI systems across applications, we must develop algorithms to automatically discover environments inducing broad generaliz…

    Submitted 28 September, 2023; originally announced September 2023.

  7. arXiv:2308.03977

    cs.CV cs.LG

    PUG: Photorealistic and Semantically Controllable Synthetic Data for Representation Learning

    Authors: Florian Bordes, Shashank Shekhar, Mark Ibrahim, Diane Bouchacourt, Pascal Vincent, Ari S. Morcos

    Abstract: Synthetic image datasets offer unmatched advantages for designing and evaluating deep neural networks: they make it possible to (i) render as many data samples as needed, (ii) precisely control each scene and yield granular ground truth labels (and captions), (iii) precisely control distribution shifts between training and testing to isolate variables of interest for sound experimentation. Despite…

    Submitted 12 December, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

  8. arXiv:2307.13136

    cs.CV cs.LG

    Does Progress On Object Recognition Benchmarks Improve Real-World Generalization?

    Authors: Megan Richards, Polina Kirichenko, Diane Bouchacourt, Mark Ibrahim

    Abstract: For more than a decade, researchers have measured progress in object recognition on ImageNet-based generalization benchmarks such as ImageNet-A, -C, and -R. Recent advances in foundation models, trained on orders of magnitude more data, have begun to saturate these standard benchmarks, but remain brittle in practice. This suggests standard benchmarks, which tend to focus on predefined or synthetic…

    Submitted 24 July, 2023; originally announced July 2023.

  9. arXiv:2306.00802

    stat.ML cs.CL cs.LG

    Birth of a Transformer: A Memory Viewpoint

    Authors: Alberto Bietti, Vivien Cabannes, Diane Bouchacourt, Herve Jegou, Leon Bottou

    Abstract: Large language models based on transformers have achieved great empirical successes. However, as they are deployed more widely, there is a growing need to better understand their internal mechanisms in order to make them more reliable. These models appear to store vast amounts of knowledge from their training data, and to adapt quickly to new information provided in their context or prompt. We stu…

    Submitted 6 November, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

  10. arXiv:2304.05391

    cs.CV

    Pinpointing Why Object Recognition Performance Degrades Across Income Levels and Geographies

    Authors: Laura Gustafson, Megan Richards, Melissa Hall, Caner Hazirbas, Diane Bouchacourt, Mark Ibrahim

    Abstract: Despite impressive advances in object-recognition, deep learning systems' performance degrades significantly across geographies and lower income levels raising pressing concerns of inequity. Addressing such performance gaps remains a challenge, as little is understood about why performance degrades across incomes or geographies. We take a step in this direction by annotating images from Dollar Str…

    Submitted 11 April, 2023; originally announced April 2023.

  11. arXiv:2211.01866

    cs.CV cs.LG

    ImageNet-X: Understanding Model Mistakes with Factor of Variation Annotations

    Authors: Badr Youbi Idrissi, Diane Bouchacourt, Randall Balestriero, Ivan Evtimov, Caner Hazirbas, Nicolas Ballas, Pascal Vincent, Michal Drozdzal, David Lopez-Paz, Mark Ibrahim

    Abstract: Deep learning vision systems are widely deployed across applications where reliability is critical. However, even today's best models can fail to recognize an object when its pose, lighting, or background varies. While existing benchmarks surface examples challenging for models, they do not explain why such mistakes arise. To address this need, we introduce ImageNet-X, a set of sixteen human annot…

    Submitted 3 November, 2022; originally announced November 2022.

  12. arXiv:2210.13604

    cs.CV cs.LG

    The Robustness Limits of SoTA Vision Models to Natural Variation

    Authors: Mark Ibrahim, Quentin Garrido, Ari Morcos, Diane Bouchacourt

    Abstract: Recent state-of-the-art vision models introduced new architectures, learning paradigms, and larger pretraining data, leading to impressive performance on tasks such as classification. While previous generations of vision models were shown to lack robustness to factors such as pose, it's unclear the extent to which this next generation of models are more robust. To study this question, we develop a…

    Submitted 24 October, 2022; originally announced October 2022.

  13. arXiv:2210.13356

    cs.CV cs.LG

    Robust Self-Supervised Learning with Lie Groups

    Authors: Mark Ibrahim, Diane Bouchacourt, Ari Morcos

    Abstract: Deep learning has led to remarkable advances in computer vision. Even so, today's best models are brittle when presented with variations that differ even slightly from those seen during training. Minor shifts in the pose, color, or illumination of an object can lead to catastrophic misclassifications. State-of-the-art models struggle to understand how a set of variations can affect different objec…

    Submitted 24 October, 2022; originally announced October 2022.

  14. arXiv:2210.07347

    cs.LG stat.ML

    Disentanglement of Correlated Factors via Hausdorff Factorized Support

    Authors: Karsten Roth, Mark Ibrahim, Zeynep Akata, Pascal Vincent, Diane Bouchacourt

    Abstract: A grand goal in deep learning research is to learn representations capable of generalizing across distribution shifts. Disentanglement is one promising direction aimed at aligning a model's representation with the underlying factors generating the data (e.g. color or background). Existing disentanglement methods, however, rely on an often unrealistic assumption: that factors are statistically inde…

    Submitted 25 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted to ICLR 2023

  15. arXiv:2207.09960

    stat.ML cs.CY cs.LG

    Measuring and signing fairness as performance under multiple stakeholder distributions

    Authors: David Lopez-Paz, Diane Bouchacourt, Levent Sagun, Nicolas Usunier

    Abstract: As learning machines increase their influence on decisions concerning human lives, analyzing their fairness properties becomes a subject of central importance. Yet, our best tools for measuring the fairness of learning systems are rigid fairness metrics encapsulated as mathematical one-liners, offer limited power to the stakeholders involved in the prediction task, and are easy to manipulate when…

    Submitted 20 July, 2022; originally announced July 2022.

  16. arXiv:2106.05121

    cs.CV

    Grounding inductive biases in natural images: invariance stems from variations in data

    Authors: Diane Bouchacourt, Mark Ibrahim, Ari S. Morcos

    Abstract: To perform well on unseen and potentially out-of-distribution samples, it is desirable for machine learning models to have a predictable response with respect to transformations affecting the factors of variation of the input. Here, we study the relative importance of several types of inductive biases towards such predictable behavior: the choice of data, their augmentations, and model architectur…

    Submitted 16 November, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

  17. arXiv:2102.05623

    cs.LG cs.AI

    Addressing the Topological Defects of Disentanglement via Distributed Operators

    Authors: Diane Bouchacourt, Mark Ibrahim, Stéphane Deny

    Abstract: A core challenge in Machine Learning is to learn to disentangle natural factors of variation in data (e.g. object shape vs. pose). A popular approach to disentanglement consists in learning to map each of these factors to distinct subspaces of a model's latent representation. However, this approach has shown limited empirical success to date. Here, we show that, for a broad family of transformatio…

    Submitted 10 February, 2021; originally announced February 2021.

  18. arXiv:2009.13962

    cs.LG stat.ML

    Think before you act: A simple baseline for compositional generalization

    Authors: Christina Heinze-Deml, Diane Bouchacourt

    Abstract: Contrarily to humans who have the ability to recombine familiar expressions to create novel ones, modern neural networks struggle to do so. This has been emphasized recently with the introduction of the benchmark dataset "gSCAN" (Ruis et al. 2020), aiming to evaluate models' performance at compositional generalization in grounded language understanding. In this work, we challenge the gSCAN benchma…

    Submitted 1 October, 2020; v1 submitted 29 September, 2020; originally announced September 2020.

  19. arXiv:2004.09124

    cs.CL cs.AI cs.LG

    Compositionality and Generalization in Emergent Languages

    Authors: Rahma Chaabouni, Eugene Kharitonov, Diane Bouchacourt, Emmanuel Dupoux, Marco Baroni

    Abstract: Natural language allows us to refer to novel composite concepts by combining expressions denoting their parts according to systematic rules, a property known as compositionality. In this paper, we study whether the language emerging in deep multi-agent simulations possesses a similar ability to refer to novel primitive combinations, and whether it accomplishes this feat by strategies akin t…

    Submitted 20 April, 2020; originally announced April 2020.

  20. arXiv:2003.05161

    cs.CL cs.AI cs.LG

    A Benchmark for Systematic Generalization in Grounded Language Understanding

    Authors: Laura Ruis, Jacob Andreas, Marco Baroni, Diane Bouchacourt, Brenden M. Lake

    Abstract: Humans easily interpret expressions that describe unfamiliar situations composed from familiar parts ("greet the pink brontosaurus by the ferris wheel"). Modern neural networks, by contrast, struggle to interpret novel compositions. In this paper, we introduce a new benchmark, gSCAN, for evaluating compositional generalization in situated language understanding. Going beyond a related benchmark th…

    Submitted 17 October, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

    Comments: accepted at NeurIPS 2020

  21. arXiv:1911.01892

    cs.CL cs.AI

    Focus on What's Informative and Ignore What's not: Communication Strategies in a Referential Game

    Authors: Roberto Dessì, Diane Bouchacourt, Davide Crepaldi, Marco Baroni

    Abstract: Research in multi-agent cooperation has shown that artificial agents are able to learn to play a simple referential game while developing a shared lexicon. This lexicon is not easy to analyze, as it does not show many properties of a natural language. In a simple referential game with two neural network-based agents, we analyze the object-symbol mapping trying to understand what kind of strategy w…

    Submitted 5 November, 2019; originally announced November 2019.

    Comments: 3rd NeurIPS Workshop on Emergent Communication

  22. arXiv:1908.05135

    cs.CL cs.AI cs.MA

    Mastering emergent language: learning to guide in simulated navigation

    Authors: Mathijs Mul, Diane Bouchacourt, Elia Bruni

    Abstract: To cooperate with humans effectively, virtual agents need to be able to understand and execute language instructions. A typical setup to achieve this is with a scripted teacher which guides a virtual agent using language instructions. However, such setup has clear limitations in scalability and, more importantly, it is not interactive. Here, we introduce an autonomous agent that uses discrete comm…

    Submitted 14 August, 2019; originally announced August 2019.

  23. arXiv:1907.00852

    cs.CL cs.AI

    EGG: a toolkit for research on Emergence of lanGuage in Games

    Authors: Eugene Kharitonov, Rahma Chaabouni, Diane Bouchacourt, Marco Baroni

    Abstract: There is renewed interest in simulating language emergence among deep neural agents that communicate to jointly solve a task, spurred by the practical aim to develop language-enabled interactive AIs, as well as by theoretical questions about the evolution of human language. However, optimizing deep architectures connected by a discrete communication channel (such as that in which language emerges)…

    Submitted 13 October, 2019; v1 submitted 1 July, 2019; originally announced July 2019.

    Comments: EMNLP 2019 Demo paper

  24. arXiv:1905.13687

    cs.CL cs.LG

    Entropy Minimization In Emergent Languages

    Authors: Eugene Kharitonov, Rahma Chaabouni, Diane Bouchacourt, Marco Baroni

    Abstract: There is growing interest in studying the languages that emerge when neural agents are jointly trained to solve tasks requiring communication through a discrete channel. We investigate here the information-theoretic complexity of such languages, focusing on the basic two-agent, one-exchange setup. We find that, under common training procedures, the emergent languages are subject to an entropy mini…

    Submitted 26 June, 2020; v1 submitted 31 May, 2019; originally announced May 2019.

    Comments: Accepted at ICML 2020

  25. arXiv:1905.11871

    cs.CL cs.MA

    Miss Tools and Mr Fruit: Emergent communication in agents learning about object affordances

    Authors: Diane Bouchacourt, Marco Baroni

    Abstract: Recent research studies communication emergence in communities of deep network agents assigned a joint task, hoping to gain insights on human language evolution. We propose here a new task capturing crucial aspects of the human environment, such as natural object affordances, and of human conversation, such as full symmetry among the participants. By conducting a thorough pragmatic and semantic an…

    Submitted 28 May, 2019; originally announced May 2019.

    Comments: Association for Computational Linguistics

  26. arXiv:1905.11852

    cs.LG stat.ML

    EDUCE: Explaining model Decisions through Unsupervised Concepts Extraction

    Authors: Diane Bouchacourt, Ludovic Denoyer

    Abstract: Providing explanations along with predictions is crucial in some text processing tasks. Therefore, we propose a new self-interpretable model that performs output prediction and simultaneously provides an explanation in terms of the presence of particular concepts in the input. To do so, our model's prediction relies solely on a low-dimensional binary representation of the input, where each feature…

    Submitted 27 September, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

  27. arXiv:1808.10696

    cs.CL cs.LG

    How agents see things: On visual representations in an emergent language game

    Authors: Diane Bouchacourt, Marco Baroni

    Abstract: There is growing interest in the language developed by agents interacting in emergent-communication settings. Earlier studies have focused on the agents' symbol usage, rather than on their representation of visual input. In this paper, we consider the referential games of Lazaridou et al. (2017) and investigate the representations the agents develop during their evolving interaction. We find that…

    Submitted 13 September, 2018; v1 submitted 31 August, 2018; originally announced August 2018.

    Comments: 2018 Conference on Empirical Methods in Natural Language Processing

  28. arXiv:1705.08841

    cs.LG stat.ML

    Multi-Level Variational Autoencoder: Learning Disentangled Representations from Grouped Observations

    Authors: Diane Bouchacourt, Ryota Tomioka, Sebastian Nowozin

    Abstract: We would like to learn a representation of the data which decomposes an observation into factors of variation which we can independently control. Specifically, we want to use minimal supervision to learn a latent representation that reflects the semantics behind a specific grouping of the data, where within a group the samples share a common factor of variation. For example, consider a collection…

    Submitted 24 May, 2017; originally announced May 2017.

  29. arXiv:1606.02556

    cs.CV cs.AI

    DISCO Nets: DISsimilarity COefficient Networks

    Authors: Diane Bouchacourt, M. Pawan Kumar, Sebastian Nowozin

    Abstract: We present a new type of probabilistic model which we call DISsimilarity COefficient Networks (DISCO Nets). DISCO Nets allow us to efficiently sample from a posterior distribution parametrised by a neural network. During training, DISCO Nets are learned by minimising the dissimilarity coefficient between the true distribution and the estimated distribution. This allows us to tailor the training to…

    Submitted 28 October, 2016; v1 submitted 8 June, 2016; originally announced June 2016.