Showing 1–14 of 14 results for author: Arjovsky, M

  1. arXiv:2205.11672

    stat.ML cs.LG

    Why does Throwing Away Data Improve Worst-Group Error?

    Authors: Kamalika Chaudhuri, Kartik Ahuja, Martin Arjovsky, David Lopez-Paz

    Abstract: When facing data with imbalanced classes or groups, practitioners follow an intriguing strategy to achieve best results. They throw away examples until the classes or groups are balanced in size, and then perform empirical risk minimization on the reduced training set. This opposes common wisdom in learning theory, where the expected error is supposed to decrease as the dataset grows in size. In t…

    Submitted 21 February, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

  2. arXiv:2110.14503

    cs.LG cs.AI cs.CR

    Simple data balancing achieves competitive worst-group-accuracy

    Authors: Badr Youbi Idrissi, Martin Arjovsky, Mohammad Pezeshki, David Lopez-Paz

    Abstract: We study the problem of learning classifiers that perform well across (known or unknown) groups of data. After observing that common worst-group-accuracy datasets suffer from substantial imbalances, we set out to compare state-of-the-art methods to simple balancing of classes and groups by either subsampling or reweighting data. Our results show that these data balancing baselines achieve state-of…

    Submitted 18 February, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: Accepted at CLeaR (Causal Learning and Reasoning) 2022

  3. arXiv:2103.02667

    stat.ML cs.LG

    Out of Distribution Generalization in Machine Learning

    Authors: Martin Arjovsky

    Abstract: Machine learning has achieved tremendous success in a variety of domains in recent years. However, a lot of these success stories have been in places where the training and the testing distributions are extremely similar to each other. In everyday situations when models are tested in slightly different data than they were trained on, ML algorithms can fail spectacularly. This research attempts to…

    Submitted 3 March, 2021; originally announced March 2021.

  4. arXiv:2102.10867

    cs.LG cs.AI

    Linear unit-tests for invariance discovery

    Authors: Benjamin Aubin, Agnieszka Słowik, Martin Arjovsky, Leon Bottou, David Lopez-Paz

    Abstract: There is an increasing interest in algorithms to learn invariant correlations across training environments. A big share of the current proposals find theoretical support in the causality literature but, how useful are they in practice? The purpose of this note is to propose six linear low-dimensional problems -- unit tests -- to evaluate different types of out-of-distribution generalization in a p…

    Submitted 22 February, 2021; originally announced February 2021.

    Comments: 5 pages, Causal Discovery & Causality-Inspired Machine Learning Workshop at Neural Information Processing Systems

  5. arXiv:2006.05394

    stat.ML cs.LG

    Low Distortion Block-Resampling with Spatially Stochastic Networks

    Authors: Sarah Jane Hong, Martin Arjovsky, Darryl Barnhart, Ian Thompson

    Abstract: We formalize and attack the problem of generating new images from old ones that are as diverse as possible, only allowing them to change without restrictions in certain parts of the image while remaining globally consistent. This encompasses the typical situation found in generative modelling, where we are happy with parts of the generated data, but would like to resample others ("I like this gene…

    Submitted 10 December, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

  6. arXiv:2002.06038

    cs.LG stat.ML

    Never Give Up: Learning Directed Exploration Strategies

    Authors: Adrià Puigdomènech Badia, Pablo Sprechmann, Alex Vitvitskyi, Daniel Guo, Bilal Piot, Steven Kapturowski, Olivier Tieleman, Martín Arjovsky, Alexander Pritzel, Andrew Bolt, Charles Blundell

    Abstract: We propose a reinforcement learning agent to solve hard exploration games by learning a range of directed exploratory policies. We construct an episodic memory-based intrinsic reward using k-nearest neighbors over the agent's recent experience to train the directed exploratory policies, thereby encouraging the agent to repeatedly revisit all states in its environment. A self-supervised inverse dyn…

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: Published as a conference paper in ICLR 2020

  7. arXiv:1909.13334

    cs.LG stat.ML

    Symplectic Recurrent Neural Networks

    Authors: Zhengdao Chen, Jianyu Zhang, Martin Arjovsky, Léon Bottou

    Abstract: We propose Symplectic Recurrent Neural Networks (SRNNs) as learning algorithms that capture the dynamics of physical systems from observed trajectories. An SRNN models the Hamiltonian function of the system by a neural network and furthermore leverages symplectic integration, multiple-step training and initial state optimization to address the challenging numerical issues associated with Hamiltoni…

    Submitted 25 April, 2020; v1 submitted 29 September, 2019; originally announced September 2019.

    Comments: Added link to GitHub repository

    Journal ref: 8th International Conference on Learning Representations (ICLR 2020)

  8. arXiv:1907.02893

    stat.ML cs.AI cs.LG

    Invariant Risk Minimization

    Authors: Martin Arjovsky, Léon Bottou, Ishaan Gulrajani, David Lopez-Paz

    Abstract: We introduce Invariant Risk Minimization (IRM), a learning paradigm to estimate invariant correlations across multiple training distributions. To achieve this goal, IRM learns a data representation such that the optimal classifier, on top of that data representation, matches for all training distributions. Through theory and experiments, we show how the invariances learned by IRM relate to the cau…

    Submitted 27 March, 2020; v1 submitted 5 July, 2019; originally announced July 2019.

  9. arXiv:1712.07822

    stat.ML cs.AI cs.LG

    Geometrical Insights for Implicit Generative Modeling

    Authors: Leon Bottou, Martin Arjovsky, David Lopez-Paz, Maxime Oquab

    Abstract: Learning algorithms for implicit generative models can optimize a variety of criteria that measure how the data distribution differs from the implicit model distribution, including the Wasserstein distance, the Energy distance, and the Maximum Mean Discrepancy criterion. A careful look at the geometries induced by these distances on the space of probability measures reveals interesting differences…

    Submitted 21 August, 2019; v1 submitted 21 December, 2017; originally announced December 2017.

    Comments: this version fixes a typo in a definition

  10. arXiv:1704.00028

    cs.LG stat.ML

    Improved Training of Wasserstein GANs

    Authors: Ishaan Gulrajani, Faruk Ahmed, Martin Arjovsky, Vincent Dumoulin, Aaron Courville

    Abstract: Generative Adversarial Networks (GANs) are powerful generative models, but suffer from training instability. The recently proposed Wasserstein GAN (WGAN) makes progress toward stable training of GANs, but sometimes can still generate only low-quality samples or fail to converge. We find that these problems are often due to the use of weight clipping in WGAN to enforce a Lipschitz constraint on the…

    Submitted 25 December, 2017; v1 submitted 31 March, 2017; originally announced April 2017.

    Comments: NIPS camera-ready

  11. arXiv:1701.07875

    stat.ML cs.LG

    Wasserstein GAN

    Authors: Martin Arjovsky, Soumith Chintala, Léon Bottou

    Abstract: We introduce a new algorithm named WGAN, an alternative to traditional GAN training. In this new model, we show that we can improve the stability of learning, get rid of problems like mode collapse, and provide meaningful learning curves useful for debugging and hyperparameter searches. Furthermore, we show that the corresponding optimization problem is sound, and provide extensive theoretical wor…

    Submitted 6 December, 2017; v1 submitted 26 January, 2017; originally announced January 2017.

  12. arXiv:1701.04862

    stat.ML cs.LG

    Towards Principled Methods for Training Generative Adversarial Networks

    Authors: Martin Arjovsky, Léon Bottou

    Abstract: The goal of this paper is not to introduce a single algorithm or method, but to make theoretical steps towards fully understanding the training dynamics of generative adversarial networks. In order to substantiate our theoretical analysis, we perform targeted experiments to verify our assumptions, illustrate our claims, and quantify the phenomena. This paper is divided into three sections. The fir…

    Submitted 17 January, 2017; originally announced January 2017.

  13. arXiv:1606.00704

    stat.ML cs.LG

    Adversarially Learned Inference

    Authors: Vincent Dumoulin, Ishmael Belghazi, Ben Poole, Olivier Mastropietro, Alex Lamb, Martin Arjovsky, Aaron Courville

    Abstract: We introduce the adversarially learned inference (ALI) model, which jointly learns a generation network and an inference network using an adversarial process. The generation network maps samples from stochastic latent variables to the data space while the inference network maps training examples in data space to the space of latent variables. An adversarial game is cast between these two networks…

    Submitted 21 February, 2017; v1 submitted 2 June, 2016; originally announced June 2016.

  14. arXiv:1511.06464

    cs.LG cs.NE stat.ML

    Unitary Evolution Recurrent Neural Networks

    Authors: Martin Arjovsky, Amar Shah, Yoshua Bengio

    Abstract: Recurrent neural networks (RNNs) are notoriously difficult to train. When the eigenvalues of the hidden to hidden weight matrix deviate from absolute value 1, optimization becomes difficult due to the well studied issue of vanishing and exploding gradients, especially when trying to learn long-term dependencies. To circumvent this problem, we propose a new architecture that learns a unitary weight…

    Submitted 25 May, 2016; v1 submitted 19 November, 2015; originally announced November 2015.