Showing 1–9 of 9 results for author: Eastwood, C

  1. arXiv:2312.02116

    cs.CV

    GIVT: Generative Infinite-Vocabulary Transformers

    Authors: Michael Tschannen, Cian Eastwood, Fabian Mentzer

    Abstract: We introduce generative infinite-vocabulary transformers (GIVT) which generate vector sequences with real-valued entries, instead of discrete tokens from a finite vocabulary. To this end, we propose two surprisingly simple modifications to decoder-only transformers: 1) at the input, we replace the finite-vocabulary lookup table with a linear projection of the input vectors; and 2) at the output, w…

    Submitted 21 March, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: v2: add related NLP work, loss details. v3: Improved GMM formulation, added adapter module, larger models, better image generation results. Code and model checkpoints are available at: https://github.com/google-research/big_vision

  2. arXiv:2311.08815

    cs.LG cs.AI cs.CV stat.ML

    Self-Supervised Disentanglement by Leveraging Structure in Data Augmentations

    Authors: Cian Eastwood, Julius von Kügelgen, Linus Ericsson, Diane Bouchacourt, Pascal Vincent, Bernhard Schölkopf, Mark Ibrahim

    Abstract: Self-supervised representation learning often uses data augmentations to induce some invariance to "style" attributes of the data. However, with downstream tasks generally unknown at training time, it is difficult to deduce a priori which attributes of the data are indeed "style" and can be safely discarded. To address this, we introduce a more principled approach that seeks to disentangle style f…

    Submitted 15 November, 2023; originally announced November 2023.

  3. arXiv:2307.09933

    cs.LG cs.AI cs.CV stat.ML

    Spuriosity Didn't Kill the Classifier: Using Invariant Predictions to Harness Spurious Features

    Authors: Cian Eastwood, Shashank Singh, Andrei Liviu Nicolicioiu, Marin Vlastelica, Julius von Kügelgen, Bernhard Schölkopf

    Abstract: To avoid failures on out-of-distribution data, recent works have sought to extract features that have an invariant or stable relationship with the label across domains, discarding "spurious" or unstable features whose relationship with the label changes across domains. However, unstable features often carry complementary information that could boost performance if used correctly in the test domain…

    Submitted 8 November, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023 Camera-Ready

  4. arXiv:2210.00364

    cs.LG cs.AI cs.CV stat.ML

    DCI-ES: An Extended Disentanglement Framework with Connections to Identifiability

    Authors: Cian Eastwood, Andrei Liviu Nicolicioiu, Julius von Kügelgen, Armin Kekić, Frederik Träuble, Andrea Dittadi, Bernhard Schölkopf

    Abstract: In representation learning, a common approach is to seek representations which disentangle the underlying factors of variation. Eastwood & Williams (2018) proposed three metrics for quantifying the quality of such disentangled representations: disentanglement (D), completeness (C) and informativeness (I). In this work, we first connect this DCI framework to two common notions of linear and nonline…

    Submitted 16 February, 2023; v1 submitted 1 October, 2022; originally announced October 2022.

    Comments: Accepted to ICLR 2023

  5. arXiv:2207.09944

    stat.ML cs.AI cs.CV cs.LG

    Probable Domain Generalization via Quantile Risk Minimization

    Authors: Cian Eastwood, Alexander Robey, Shashank Singh, Julius von Kügelgen, Hamed Hassani, George J. Pappas, Bernhard Schölkopf

    Abstract: Domain generalization (DG) seeks predictors which perform well on unseen test distributions by leveraging data drawn from multiple related training distributions or domains. To achieve this, DG is commonly formulated as an average- or worst-case problem over the set of possible domains. However, predictors that perform well on average lack robustness while predictors that perform well in the worst…

    Submitted 22 August, 2023; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: NeurIPS 2022 camera-ready (+ minor corrections)

  6. arXiv:2203.04694

    cs.CV cs.AI cs.LG stat.ML

    Align-Deform-Subtract: An Interventional Framework for Explaining Object Differences

    Authors: Cian Eastwood, Li Nanbo, Christopher K. I. Williams

    Abstract: Given two object images, how can we explain their differences in terms of the underlying object properties? To address this question, we propose Align-Deform-Subtract (ADS) -- an interventional framework for explaining object differences. By leveraging semantic alignments in image-space as counterfactual interventions on the underlying object properties, ADS iteratively quantifies and removes diff…

    Submitted 20 July, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: ICLR 2022 Workshop on Objects, Structure and Causality

  7. arXiv:2111.07117

    cs.CV cs.LG

    Learning Object-Centric Representations of Multi-Object Scenes from Multiple Views

    Authors: Li Nanbo, Cian Eastwood, Robert B. Fisher

    Abstract: Learning object-centric representations of multi-object scenes is a promising approach towards machine intelligence, facilitating high-level reasoning and control from visual sensory data. However, current approaches for unsupervised object-centric scene representation are incapable of aggregating information from multiple observations of a scene. As a result, these "single-view" methods form thei…

    Submitted 13 November, 2021; originally announced November 2021.

    Comments: Accepted at NeurIPS 2020 (Spotlight)

  8. arXiv:2107.05446

    cs.LG cs.AI cs.CV stat.ML

    Source-Free Adaptation to Measurement Shift via Bottom-Up Feature Restoration

    Authors: Cian Eastwood, Ian Mason, Christopher K. I. Williams, Bernhard Schölkopf

    Abstract: Source-free domain adaptation (SFDA) aims to adapt a model trained on labelled data in a source domain to unlabelled data in a target domain without access to the source-domain data during adaptation. Existing methods for SFDA leverage entropy-minimization techniques which: (i) apply only to classification; (ii) destroy model calibration; and (iii) rely on the source model achieving a good level o…

    Submitted 17 March, 2022; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: ICLR 2022 (Spotlight)

  9. arXiv:2007.12989

    cs.AI

    Information Fusion on Belief Networks

    Authors: Shawn C. Eastwood, Svetlana N. Yanushkevich

    Abstract: This paper will focus on the process of 'fusing' several observations or models of uncertainty into a single resultant model. Many existing approaches to fusion use subjective quantities such as 'strengths of belief' and process these quantities with heuristic algorithms. This paper argues in favor of quantities that can be objectively measured, as opposed to the subjective 'strength of belief' va…

    Submitted 25 July, 2020; originally announced July 2020.

    Comments: 25 pages, pages of Appendix, 3 figures