Showing 1–11 of 11 results for author: Garrido, Q

  1. arXiv:2405.17247 [pdf, other]

    cs.LG

    An Introduction to Vision-Language Modeling

    Authors: Florian Bordes, Richard Yuanzhe Pang, Anurag Ajay, Alexander C. Li, Adrien Bardes, Suzanne Petryk, Oscar Mañas, Zhiqiu Lin, Anas Mahmoud, Bargav Jayaraman, Mark Ibrahim, Melissa Hall, Yunyang Xiong, Jonathan Lebensold, Candace Ross, Srihari Jayakumar, Chuan Guo, Diane Bouchacourt, Haider Al-Tahan, Karthik Padthe, Vasu Sharma, Hu Xu, Xiaoqing Ellen Tan, Megan Richards, Samuel Lavoie, et al. (16 additional authors not shown)

    Abstract: Following the recent popularity of Large Language Models (LLMs), several attempts have been made to extend them to the visual domain. From having a visual assistant that could guide us through unfamiliar environments to generative models that produce images using only a high-level text description, the vision-language model (VLM) applications will significantly impact our relationship with technol…

    Submitted 27 May, 2024; originally announced May 2024.

  2. arXiv:2404.08471 [pdf, other]

    cs.CV cs.AI cs.LG

    Revisiting Feature Prediction for Learning Visual Representations from Video

    Authors: Adrien Bardes, Quentin Garrido, Jean Ponce, Xinlei Chen, Michael Rabbat, Yann LeCun, Mahmoud Assran, Nicolas Ballas

    Abstract: This paper explores feature prediction as a stand-alone objective for unsupervised learning from video and introduces V-JEPA, a collection of vision models trained solely using a feature prediction objective, without the use of pretrained image encoders, text, negative examples, reconstruction, or other sources of supervision. The models are trained on 2 million videos collected from public datase…

    Submitted 15 February, 2024; originally announced April 2024.

  3. arXiv:2403.00504 [pdf, other]

    cs.CV cs.AI cs.LG

    Learning and Leveraging World Models in Visual Representation Learning

    Authors: Quentin Garrido, Mahmoud Assran, Nicolas Ballas, Adrien Bardes, Laurent Najman, Yann LeCun

    Abstract: Joint-Embedding Predictive Architecture (JEPA) has emerged as a promising self-supervised approach that learns by leveraging a world model. While previously limited to predicting missing parts of an input, we explore how to generalize the JEPA prediction task to a broader set of corruptions. We introduce Image World Models, an approach that goes beyond masked image modeling and learns to predict t…

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 23 pages, 16 figures

  4. arXiv:2307.05432 [pdf, other]

    cs.LG math.NA

    Self-Supervised Learning with Lie Symmetries for Partial Differential Equations

    Authors: Grégoire Mialon, Quentin Garrido, Hannah Lawrence, Danyal Rehman, Yann LeCun, Bobak T. Kiani

    Abstract: Machine learning for differential equations paves the way for computationally efficient alternatives to numerical solvers, with potentially broad impacts in science and engineering. Though current algorithms typically require simulated training data tailored to a given setting, one may instead wish to learn useful information from heterogeneous sources, or from real dynamical systems observations…

    Submitted 14 February, 2024; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023

  5. arXiv:2304.12210 [pdf, other]

    cs.LG cs.CV

    A Cookbook of Self-Supervised Learning

    Authors: Randall Balestriero, Mark Ibrahim, Vlad Sobal, Ari Morcos, Shashank Shekhar, Tom Goldstein, Florian Bordes, Adrien Bardes, Gregoire Mialon, Yuandong Tian, Avi Schwarzschild, Andrew Gordon Wilson, Jonas Geiping, Quentin Garrido, Pierre Fernandez, Amir Bar, Hamed Pirsiavash, Yann LeCun, Micah Goldblum

    Abstract: Self-supervised learning, dubbed the dark matter of intelligence, is a promising path to advance machine learning. Yet, much like cooking, training SSL methods is a delicate art with a high barrier to entry. While many components are familiar, successfully training a SSL method involves a dizzying set of choices from the pretext tasks to training hyper-parameters. Our goal is to lower the barrier…

    Submitted 28 June, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

  6. arXiv:2302.10283 [pdf, other]

    cs.CV cs.AI cs.LG

    Self-supervised learning of Split Invariant Equivariant representations

    Authors: Quentin Garrido, Laurent Najman, Yann LeCun

    Abstract: Recent progress has been made towards learning invariant or equivariant representations with self-supervised learning. While invariant methods are evaluated on large scale datasets, equivariant ones are evaluated in smaller, more controlled, settings. We aim at bridging the gap between the two in order to learn more diverse representations that are suitable for a wide range of tasks. We start by i…

    Submitted 19 June, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Journal ref: The Fortieth International Conference on Machine Learning, 2023, Honolulu, United States

  7. arXiv:2210.13604 [pdf, other]

    cs.CV cs.LG

    The Robustness Limits of SoTA Vision Models to Natural Variation

    Authors: Mark Ibrahim, Quentin Garrido, Ari Morcos, Diane Bouchacourt

    Abstract: Recent state-of-the-art vision models introduced new architectures, learning paradigms, and larger pretraining data, leading to impressive performance on tasks such as classification. While previous generations of vision models were shown to lack robustness to factors such as pose, it's unclear the extent to which this next generation of models are more robust. To study this question, we develop a…

    Submitted 24 October, 2022; originally announced October 2022.

  8. arXiv:2210.02885 [pdf, other]

    cs.LG cs.AI cs.CV

    RankMe: Assessing the downstream performance of pretrained self-supervised representations by their rank

    Authors: Quentin Garrido, Randall Balestriero, Laurent Najman, Yann LeCun

    Abstract: Joint-Embedding Self Supervised Learning (JE-SSL) has seen a rapid development, with the emergence of many method variations but only few principled guidelines that would help practitioners to successfully deploy them. The main reason for that pitfall comes from JE-SSL's core principle of not employing any input reconstruction therefore lacking visual cues of unsuccessful training. Adding non info…

    Submitted 26 June, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Journal ref: The Fortieth International Conference on Machine Learning, 2023, Honolulu, United States

  9. arXiv:2206.13378 [pdf, other]

    cs.LG

    Guillotine Regularization: Why removing layers is needed to improve generalization in Self-Supervised Learning

    Authors: Florian Bordes, Randall Balestriero, Quentin Garrido, Adrien Bardes, Pascal Vincent

    Abstract: One unexpected technique that emerged in recent years consists in training a Deep Network (DN) with a Self-Supervised Learning (SSL) method, and using this network on downstream tasks but with its last few projector layers entirely removed. This trick of throwing away the projector is actually critical for SSL methods to display competitive performances on ImageNet for which more than 30 percentag…

    Submitted 9 June, 2023; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: Accepted at TMLR 2023

  10. On the duality between contrastive and non-contrastive self-supervised learning

    Authors: Quentin Garrido, Yubei Chen, Adrien Bardes, Laurent Najman, Yann LeCun

    Abstract: Recent approaches in self-supervised learning of image representations can be categorized into different families of methods and, in particular, can be divided into contrastive and non-contrastive approaches. While differences between the two families have been thoroughly discussed to motivate new approaches, we focus more on the theoretical similarities between them. By designing contrastive and…

    Submitted 26 June, 2023; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: The Eleventh International Conference on Learning Representations, 2023, Kigali, Rwanda

  11. arXiv:2102.05892 [pdf, other]

    q-bio.QM cs.NE

    Visualizing hierarchies in scRNA-seq data using a density tree-biased autoencoder

    Authors: Quentin Garrido, Sebastian Damrich, Alexander Jäger, Dario Cerletti, Manfred Claassen, Laurent Najman, Fred Hamprecht

    Abstract: Motivation: Single cell RNA sequencing (scRNA-seq) data makes studying the development of cells possible at unparalleled resolution. Given that many cellular differentiation processes are hierarchical, their scRNA-seq data is expected to be approximately tree-shaped in gene expression space. Inference and representation of this tree-structure in two dimensions is highly desirable for biological in…

    Submitted 22 April, 2022; v1 submitted 11 February, 2021; originally announced February 2021.

    Journal ref: Bioinformatics, Oxford University Press (OUP), In press