Skip to main content

Showing 1–11 of 11 results for author: Bińkowski, M

.
  1. arXiv:2204.14198  [pdf, other

    cs.CV cs.AI cs.LG

    Flamingo: a Visual Language Model for Few-Shot Learning

    Authors: Jean-Baptiste Alayrac, Jeff Donahue, Pauline Luc, Antoine Miech, Iain Barr, Yana Hasson, Karel Lenc, Arthur Mensch, Katie Millican, Malcolm Reynolds, Roman Ring, Eliza Rutherford, Serkan Cabi, Tengda Han, Zhitao Gong, Sina Samangooei, Marianne Monteiro, Jacob Menick, Sebastian Borgeaud, Andrew Brock, Aida Nematzadeh, Sahand Sharifzadeh, Mikolaj Binkowski, Ricardo Barreira, Oriol Vinyals , et al. (2 additional authors not shown)

    Abstract: Building models that can be rapidly adapted to novel tasks using only a handful of annotated examples is an open challenge for multimodal machine learning research. We introduce Flamingo, a family of Visual Language Models (VLM) with this ability. We propose key architectural innovations to: (i) bridge powerful pretrained vision-only and language-only models, (ii) handle sequences of arbitrarily i… ▽ More

    Submitted 15 November, 2022; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: 54 pages. In Proceedings of Neural Information Processing Systems (NeurIPS) 2022

  2. arXiv:2112.06749  [pdf, other

    cs.CL cs.LG

    Step-unrolled Denoising Autoencoders for Text Generation

    Authors: Nikolay Savinov, Junyoung Chung, Mikolaj Binkowski, Erich Elsen, Aaron van den Oord

    Abstract: In this paper we propose a new generative model of text, Step-unrolled Denoising Autoencoder (SUNDAE), that does not rely on autoregressive models. Similarly to denoising diffusion techniques, SUNDAE is repeatedly applied on a sequence of tokens, starting from random inputs and improving them each time until convergence. We present a simple new improvement operator that converges in fewer iteratio… ▽ More

    Submitted 19 April, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: Accepted to ICLR 2022

  3. arXiv:2102.05182  [pdf, other

    astro-ph.GA cs.LG

    A Deep Learning Approach for Characterizing Major Galaxy Mergers

    Authors: Skanda Koppula, Victor Bapst, Marc Huertas-Company, Sam Blackwell, Agnieszka Grabska-Barwinska, Sander Dieleman, Andrea Huber, Natasha Antropova, Mikolaj Binkowski, Hannah Openshaw, Adria Recasens, Fernando Caro, Avishai Deke, Yohan Dubois, Jesus Vega Ferrero, David C. Koo, Joel R. Primack, Trevor Back

    Abstract: Fine-grained estimation of galaxy merger stages from observations is a key problem useful for validation of our current theoretical understanding of galaxy formation. To this end, we demonstrate a CNN-based regression model that is able to predict, for the first time, using a single image, the merger stage relative to the first perigee passage with a median error of 38.3 million years (Myrs) over… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: Third Workshop on Machine Learning and the Physical Sciences (NeurIPS 2020), Vancouver, Canada

  4. arXiv:2006.03575  [pdf, other

    cs.SD cs.LG eess.AS

    End-to-End Adversarial Text-to-Speech

    Authors: Jeff Donahue, Sander Dieleman, Mikołaj Bińkowski, Erich Elsen, Karen Simonyan

    Abstract: Modern text-to-speech synthesis pipelines typically involve multiple processing stages, each of which is designed or learnt independently from the rest. In this work, we take on the challenging task of learning to synthesise speech from normalised text or phonemes in an end-to-end manner, resulting in models which operate directly on character or phoneme input sequences and produce raw speech audi… ▽ More

    Submitted 17 March, 2021; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: 23 pages. In proceedings of ICLR 2021

  5. arXiv:1909.11646  [pdf, other

    cs.SD cs.LG eess.AS

    High Fidelity Speech Synthesis with Adversarial Networks

    Authors: Mikołaj Bińkowski, Jeff Donahue, Sander Dieleman, Aidan Clark, Erich Elsen, Norman Casagrande, Luis C. Cobo, Karen Simonyan

    Abstract: Generative adversarial networks have seen rapid development in recent years and have led to remarkable improvements in generative modelling of images. However, their application in the audio domain has received limited attention, and autoregressive models, such as WaveNet, remain the state of the art in generative modelling of audio signals such as human speech. To address this paucity, we introdu… ▽ More

    Submitted 26 September, 2019; v1 submitted 25 September, 2019; originally announced September 2019.

  6. arXiv:1905.12760  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Batch weight for domain adaptation with mass shift

    Authors: Mikołaj Bińkowski, R Devon Hjelm, Aaron Courville

    Abstract: Unsupervised domain transfer is the task of transferring or translating samples from a source distribution to a different target distribution. Current solutions unsupervised domain transfer often operate on data on which the modes of the distribution are well-matched, for instance have the same frequencies of classes between source and target distributions. However, these models do not perform wel… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

  7. arXiv:1811.03766  [pdf, other

    q-fin.TR

    Endogeneous Dynamics of Intraday Liquidity

    Authors: Mikołaj Bińkowski, Charles-Albert Lehalle

    Abstract: In this paper we investigate the endogenous information contained in four liquidity variables at a five minutes time scale on equity markets around the world: the traded volume, the bid-ask spread, the volatility and the volume at first limits of the orderbook. In the spirit of Granger causality, we measure the level of information by the level of accuracy of linear autoregressive models. This emp… ▽ More

    Submitted 8 November, 2018; originally announced November 2018.

  8. arXiv:1805.11565  [pdf, other

    stat.ML cs.LG

    On gradient regularizers for MMD GANs

    Authors: Michael Arbel, Danica J. Sutherland, Mikołaj Bińkowski, Arthur Gretton

    Abstract: We propose a principled method for gradient-based regularization of the critic of GAN-like models trained by adversarially optimizing the kernel of a Maximum Mean Discrepancy (MMD). We show that controlling the gradient of the critic is vital to having a sensible loss function, and devise a method to enforce exact, analytical gradient constraints at no additional cost compared to existing approxim… ▽ More

    Submitted 14 January, 2021; v1 submitted 29 May, 2018; originally announced May 2018.

    Comments: Code at https://github.com/MichaelArbel/Scaled-MMD-GAN

    Journal ref: Advances in Neural Information Processing Systems 31 (NeurIPS 2018), 6700-6710

  9. arXiv:1801.01401  [pdf, other

    stat.ML cs.LG

    Demystifying MMD GANs

    Authors: Mikołaj Bińkowski, Danica J. Sutherland, Michael Arbel, Arthur Gretton

    Abstract: We investigate the training and performance of generative adversarial networks using the Maximum Mean Discrepancy (MMD) as critic, termed MMD GANs. As our main theoretical contribution, we clarify the situation with bias in GAN loss functions raised by recent work: we show that gradient estimators used in the optimization process for both MMD GANs and Wasserstein GANs are unbiased, but learning a… ▽ More

    Submitted 14 January, 2021; v1 submitted 4 January, 2018; originally announced January 2018.

    Comments: Published at ICLR 2018: https://openreview.net/forum?id=r1lUOzWCW

  10. arXiv:1703.04122  [pdf, other

    cs.LG

    Autoregressive Convolutional Neural Networks for Asynchronous Time Series

    Authors: Mikołaj Bińkowski, Gautier Marti, Philippe Donnat

    Abstract: We propose Significance-Offset Convolutional Neural Network, a deep convolutional network architecture for regression of multivariate asynchronous time series. The model is inspired by standard autoregressive (AR) models and gating mechanisms used in recurrent neural networks. It involves an AR-like weighting system, where the final predictor is obtained as a weighted sum of adjusted regressors, w… ▽ More

    Submitted 12 June, 2018; v1 submitted 12 March, 2017; originally announced March 2017.

    Comments: Proceedings of The 35th International Conference on Machine Learning (ICML), Stockholm, Sweden, 2018, to appear

  11. A review of two decades of correlations, hierarchies, networks and clustering in financial markets

    Authors: Gautier Marti, Frank Nielsen, Mikołaj Bińkowski, Philippe Donnat

    Abstract: We review the state of the art of clustering financial time series and the study of their correlations alongside other interaction networks. The aim of this review is to gather in one place the relevant material from different fields, e.g. machine learning, information geometry, econophysics, statistical physics, econometrics, behavioral finance. We hope it will help researchers to use more effect… ▽ More

    Submitted 3 November, 2020; v1 submitted 1 March, 2017; originally announced March 2017.

    Journal ref: Chapter in Progress in Information Geometry: Theory and Applications, 245-274, 2021