Showing 1–6 of 6 results for author: Gritsenko, A A

  1. arXiv:2401.09865  [pdf, other]

    cs.CV cs.AI cs.LG

    Improving fine-grained understanding in image-text pre-training

    Authors: Ioana Bica, Anastasija Ilić, Matthias Bauer, Goker Erdogan, Matko Bošnjak, Christos Kaplanis, Alexey A. Gritsenko, Matthias Minderer, Charles Blundell, Razvan Pascanu, Jovana Mitrović

    Abstract: We introduce SPARse Fine-grained Contrastive Alignment (SPARC), a simple method for pretraining more fine-grained multimodal representations from image-text pairs. Given that multiple image patches often correspond to single words, we propose to learn a grouping of image patches for every token in the caption. To achieve this, we use a sparse similarity metric between image patches and language to…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: 26 pages

  2. arXiv:2110.02037  [pdf, other]

    cs.LG stat.ML

    Autoregressive Diffusion Models

    Authors: Emiel Hoogeboom, Alexey A. Gritsenko, Jasmijn Bastings, Ben Poole, Rianne van den Berg, Tim Salimans

    Abstract: We introduce Autoregressive Diffusion Models (ARDMs), a model class encompassing and generalizing order-agnostic autoregressive models (Uria et al., 2014) and absorbing discrete diffusion (Austin et al., 2021), which we show are special cases of ARDMs under mild assumptions. ARDMs are simple to implement and easy to train. Unlike standard ARMs, they do not require causal masking of model represent…

    Submitted 1 February, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: Published as a conference paper at International Conference on Learning Representations (ICLR) 2022

  3. arXiv:2107.07002  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV cs.IR

    The Benchmark Lottery

    Authors: Mostafa Dehghani, Yi Tay, Alexey A. Gritsenko, Zhe Zhao, Neil Houlsby, Fernando Diaz, Donald Metzler, Oriol Vinyals

    Abstract: The world of empirical machine learning (ML) strongly relies on benchmarks in order to determine the relative effectiveness of different algorithms and methods. This paper proposes the notion of "a benchmark lottery" that describes the overall fragility of the ML benchmarking process. The benchmark lottery postulates that many factors, other than fundamental algorithmic superiority, may lead to a…

    Submitted 14 July, 2021; originally announced July 2021.

  4. arXiv:2008.01160  [pdf, other]

    eess.AS cs.LG cs.SD stat.ML

    A Spectral Energy Distance for Parallel Speech Synthesis

    Authors: Alexey A. Gritsenko, Tim Salimans, Rianne van den Berg, Jasper Snoek, Nal Kalchbrenner

    Abstract: Speech synthesis is an important practical generative modeling problem that has seen great progress over the last few years, with likelihood-based autoregressive neural models now outperforming traditional concatenative systems. A downside of such autoregressive models is that they require executing tens of thousands of sequential operations per second of generated audio, making them ill-suited fo…

    Submitted 23 October, 2020; v1 submitted 3 August, 2020; originally announced August 2020.

  5. arXiv:2006.12459  [pdf, other]

    cs.LG stat.ML

    IDF++: Analyzing and Improving Integer Discrete Flows for Lossless Compression

    Authors: Rianne van den Berg, Alexey A. Gritsenko, Mostafa Dehghani, Casper Kaae Sønderby, Tim Salimans

    Abstract: In this paper we analyse and improve integer discrete flows for lossless compression. Integer discrete flows are a recently proposed class of models that learn invertible transformations for integer-valued random variables. Their discrete nature makes them particularly suitable for lossless compression with entropy coding schemes. We start by investigating a recent theoretical claim that states th…

    Submitted 23 March, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: Accepted as a conference paper at the Ninth International Conference on Learning Representations (ICLR) 2021

  6. arXiv:1812.06869  [pdf, other]

    cs.LG cs.CV stat.ML

    BriarPatches: Pixel-Space Interventions for Inducing Demographic Parity

    Authors: Alexey A. Gritsenko, Alex D'Amour, James Atwood, Yoni Halpern, D. Sculley

    Abstract: We introduce the BriarPatch, a pixel-space intervention that obscures sensitive attributes from representations encoded in pre-trained classifiers. The patches encourage internal model representations not to encode sensitive information, which has the effect of pushing downstream predictors towards exhibiting demographic parity with respect to the sensitive information. The net result is that thes…

    Submitted 17 December, 2018; originally announced December 2018.

    Comments: 6 pages, 5 figures, NeurIPS Workshop on Ethical, Social and Governance Issues in AI