Skip to main content

Showing 1–13 of 13 results for author: Arjovsky, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2205.11672  [pdf, other

    stat.ML cs.LG

    Why does Throwing Away Data Improve Worst-Group Error?

    Authors: Kamalika Chaudhuri, Kartik Ahuja, Martin Arjovsky, David Lopez-Paz

    Abstract: When facing data with imbalanced classes or groups, practitioners follow an intriguing strategy to achieve best results. They throw away examples until the classes or groups are balanced in size, and then perform empirical risk minimization on the reduced training set. This opposes common wisdom in learning theory, where the expected error is supposed to decrease as the dataset grows in size. In t… ▽ More

    Submitted 21 February, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

  2. arXiv:2103.02667  [pdf, other

    stat.ML cs.LG

    Out of Distribution Generalization in Machine Learning

    Authors: Martin Arjovsky

    Abstract: Machine learning has achieved tremendous success in a variety of domains in recent years. However, a lot of these success stories have been in places where the training and the testing distributions are extremely similar to each other. In everyday situations when models are tested in slightly different data than they were trained on, ML algorithms can fail spectacularly. This research attempts to… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

  3. arXiv:2006.05394  [pdf, other

    stat.ML cs.LG

    Low Distortion Block-Resampling with Spatially Stochastic Networks

    Authors: Sarah Jane Hong, Martin Arjovsky, Darryl Barnhart, Ian Thompson

    Abstract: We formalize and attack the problem of generating new images from old ones that are as diverse as possible, only allowing them to change without restrictions in certain parts of the image while remaining globally consistent. This encompasses the typical situation found in generative modelling, where we are happy with parts of the generated data, but would like to resample others ("I like this gene… ▽ More

    Submitted 10 December, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

  4. arXiv:2002.06038  [pdf, other

    cs.LG stat.ML

    Never Give Up: Learning Directed Exploration Strategies

    Authors: Adrià Puigdomènech Badia, Pablo Sprechmann, Alex Vitvitskyi, Daniel Guo, Bilal Piot, Steven Kapturowski, Olivier Tieleman, Martín Arjovsky, Alexander Pritzel, Andew Bolt, Charles Blundell

    Abstract: We propose a reinforcement learning agent to solve hard exploration games by learning a range of directed exploratory policies. We construct an episodic memory-based intrinsic reward using k-nearest neighbors over the agent's recent experience to train the directed exploratory policies, thereby encouraging the agent to repeatedly revisit all states in its environment. A self-supervised inverse dyn… ▽ More

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: Published as a conference paper in ICLR 2020

  5. arXiv:1909.13334  [pdf, other

    cs.LG stat.ML

    Symplectic Recurrent Neural Networks

    Authors: Zhengdao Chen, Jianyu Zhang, Martin Arjovsky, Léon Bottou

    Abstract: We propose Symplectic Recurrent Neural Networks (SRNNs) as learning algorithms that capture the dynamics of physical systems from observed trajectories. An SRNN models the Hamiltonian function of the system by a neural network and furthermore leverages symplectic integration, multiple-step training and initial state optimization to address the challenging numerical issues associated with Hamiltoni… ▽ More

    Submitted 25 April, 2020; v1 submitted 29 September, 2019; originally announced September 2019.

    Comments: Added link to GitHub repository

    Journal ref: 8th International Conference on Learning Representations (ICLR 2020)

  6. arXiv:1907.02893  [pdf, other

    stat.ML cs.AI cs.LG

    Invariant Risk Minimization

    Authors: Martin Arjovsky, Léon Bottou, Ishaan Gulrajani, David Lopez-Paz

    Abstract: We introduce Invariant Risk Minimization (IRM), a learning paradigm to estimate invariant correlations across multiple training distributions. To achieve this goal, IRM learns a data representation such that the optimal classifier, on top of that data representation, matches for all training distributions. Through theory and experiments, we show how the invariances learned by IRM relate to the cau… ▽ More

    Submitted 27 March, 2020; v1 submitted 5 July, 2019; originally announced July 2019.

  7. arXiv:1712.07822  [pdf, other

    stat.ML cs.AI cs.LG

    Geometrical Insights for Implicit Generative Modeling

    Authors: Leon Bottou, Martin Arjovsky, David Lopez-Paz, Maxime Oquab

    Abstract: Learning algorithms for implicit generative models can optimize a variety of criteria that measure how the data distribution differs from the implicit model distribution, including the Wasserstein distance, the Energy distance, and the Maximum Mean Discrepancy criterion. A careful look at the geometries induced by these distances on the space of probability measures reveals interesting differences… ▽ More

    Submitted 21 August, 2019; v1 submitted 21 December, 2017; originally announced December 2017.

    Comments: this version fixes a typo in a definition

  8. arXiv:1704.00028  [pdf, other

    cs.LG stat.ML

    Improved Training of Wasserstein GANs

    Authors: Ishaan Gulrajani, Faruk Ahmed, Martin Arjovsky, Vincent Dumoulin, Aaron Courville

    Abstract: Generative Adversarial Networks (GANs) are powerful generative models, but suffer from training instability. The recently proposed Wasserstein GAN (WGAN) makes progress toward stable training of GANs, but sometimes can still generate only low-quality samples or fail to converge. We find that these problems are often due to the use of weight clip** in WGAN to enforce a Lipschitz constraint on the… ▽ More

    Submitted 25 December, 2017; v1 submitted 31 March, 2017; originally announced April 2017.

    Comments: NIPS camera-ready

  9. arXiv:1701.07875  [pdf, other

    stat.ML cs.LG

    Wasserstein GAN

    Authors: Martin Arjovsky, Soumith Chintala, Léon Bottou

    Abstract: We introduce a new algorithm named WGAN, an alternative to traditional GAN training. In this new model, we show that we can improve the stability of learning, get rid of problems like mode collapse, and provide meaningful learning curves useful for debugging and hyperparameter searches. Furthermore, we show that the corresponding optimization problem is sound, and provide extensive theoretical wor… ▽ More

    Submitted 6 December, 2017; v1 submitted 26 January, 2017; originally announced January 2017.

  10. arXiv:1701.04862  [pdf, other

    stat.ML cs.LG

    Towards Principled Methods for Training Generative Adversarial Networks

    Authors: Martin Arjovsky, Léon Bottou

    Abstract: The goal of this paper is not to introduce a single algorithm or method, but to make theoretical steps towards fully understanding the training dynamics of generative adversarial networks. In order to substantiate our theoretical analysis, we perform targeted experiments to verify our assumptions, illustrate our claims, and quantify the phenomena. This paper is divided into three sections. The fir… ▽ More

    Submitted 17 January, 2017; originally announced January 2017.

  11. arXiv:1606.00704  [pdf, other

    stat.ML cs.LG

    Adversarially Learned Inference

    Authors: Vincent Dumoulin, Ishmael Belghazi, Ben Poole, Olivier Mastropietro, Alex Lamb, Martin Arjovsky, Aaron Courville

    Abstract: We introduce the adversarially learned inference (ALI) model, which jointly learns a generation network and an inference network using an adversarial process. The generation network maps samples from stochastic latent variables to the data space while the inference network maps training examples in data space to the space of latent variables. An adversarial game is cast between these two networks… ▽ More

    Submitted 21 February, 2017; v1 submitted 2 June, 2016; originally announced June 2016.

  12. arXiv:1511.06464  [pdf, other

    cs.LG cs.NE stat.ML

    Unitary Evolution Recurrent Neural Networks

    Authors: Martin Arjovsky, Amar Shah, Yoshua Bengio

    Abstract: Recurrent neural networks (RNNs) are notoriously difficult to train. When the eigenvalues of the hidden to hidden weight matrix deviate from absolute value 1, optimization becomes difficult due to the well studied issue of vanishing and exploding gradients, especially when trying to learn long-term dependencies. To circumvent this problem, we propose a new architecture that learns a unitary weight… ▽ More

    Submitted 25 May, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

  13. arXiv:1506.00059  [pdf, ps, other

    math.NA stat.ML

    Saddle-free Hessian-free Optimization

    Authors: Martin Arjovsky

    Abstract: Nonconvex optimization problems such as the ones in training deep neural networks suffer from a phenomenon called saddle point proliferation. This means that there are a vast number of high error saddle points present in the loss function. Second order methods have been tremendously successful and widely adopted in the convex optimization community, while their usefulness in deep learning remains… ▽ More

    Submitted 5 November, 2016; v1 submitted 29 May, 2015; originally announced June 2015.

    Comments: NIPS 2016 Workshop on Nonconvex Optimization for Machine Learning: Theory and Practice