Skip to main content

Showing 1–11 of 11 results for author: Gulrajani, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.18619  [pdf, other

    cs.CL cs.LG

    Likelihood-Based Diffusion Language Models

    Authors: Ishaan Gulrajani, Tatsunori B. Hashimoto

    Abstract: Despite a growing interest in diffusion-based language models, existing work has not shown that these models can attain nontrivial likelihoods on standard language modeling benchmarks. In this work, we take the first steps towards closing the likelihood gap between autoregressive and diffusion-based language models, with the goal of building and releasing a diffusion model which outperforms a smal… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  2. arXiv:2305.14387  [pdf, other

    cs.LG cs.AI cs.CL

    AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback

    Authors: Yann Dubois, Xuechen Li, Rohan Taori, Tianyi Zhang, Ishaan Gulrajani, Jimmy Ba, Carlos Guestrin, Percy Liang, Tatsunori B. Hashimoto

    Abstract: Large language models (LLMs) such as ChatGPT have seen widespread adoption due to their strong instruction-following abilities. Develo** these LLMs involves a complex yet poorly understood workflow requiring training with human feedback. Replicating and understanding this instruction-following requires tackling three major challenges: the high cost of data collection, the lack of trustworthy eva… ▽ More

    Submitted 7 January, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Spotlight at NeurIPS 2023

  3. arXiv:2205.14217  [pdf, other

    cs.CL cs.AI cs.LG

    Diffusion-LM Improves Controllable Text Generation

    Authors: Xiang Lisa Li, John Thickstun, Ishaan Gulrajani, Percy Liang, Tatsunori B. Hashimoto

    Abstract: Controlling the behavior of language models (LMs) without re-training is a major open problem in natural language generation. While recent works have demonstrated successes on controlling simple sentence attributes (e.g., sentiment), there has been little progress on complex, fine-grained controls (e.g., syntactic structure). To address this challenge, we develop a new non-autoregressive language… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

  4. arXiv:2007.01434  [pdf, other

    cs.LG stat.ML

    In Search of Lost Domain Generalization

    Authors: Ishaan Gulrajani, David Lopez-Paz

    Abstract: The goal of domain generalization algorithms is to predict well on distributions different from those seen during training. While a myriad of domain generalization algorithms exist, inconsistencies in experimental conditions -- datasets, architectures, and model selection criteria -- render fair and realistic comparisons difficult. In this paper, we are interested in understanding how useful domai… ▽ More

    Submitted 2 July, 2020; originally announced July 2020.

  5. arXiv:2001.03653  [pdf, other

    cs.LG stat.ML

    Towards GAN Benchmarks Which Require Generalization

    Authors: Ishaan Gulrajani, Colin Raffel, Luke Metz

    Abstract: For many evaluation metrics commonly used as benchmarks for unconditional image generation, trivially memorizing the training set attains a better score than models which are considered state-of-the-art; we consider this problematic. We clarify a necessary condition for an evaluation metric not to behave this way: estimating the function must require a large sample from the model. In search of suc… ▽ More

    Submitted 10 January, 2020; originally announced January 2020.

    Comments: ICLR 2019 conference paper

  6. arXiv:1907.02893  [pdf, other

    stat.ML cs.AI cs.LG

    Invariant Risk Minimization

    Authors: Martin Arjovsky, Léon Bottou, Ishaan Gulrajani, David Lopez-Paz

    Abstract: We introduce Invariant Risk Minimization (IRM), a learning paradigm to estimate invariant correlations across multiple training distributions. To achieve this goal, IRM learns a data representation such that the optimal classifier, on top of that data representation, matches for all training distributions. Through theory and experiments, we show how the invariances learned by IRM relate to the cau… ▽ More

    Submitted 27 March, 2020; v1 submitted 5 July, 2019; originally announced July 2019.

  7. arXiv:1902.08710  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    GANSynth: Adversarial Neural Audio Synthesis

    Authors: Jesse Engel, Kumar Krishna Agrawal, Shuo Chen, Ishaan Gulrajani, Chris Donahue, Adam Roberts

    Abstract: Efficient audio synthesis is an inherently difficult machine learning task, as human perception is sensitive to both global structure and fine-scale waveform coherence. Autoregressive models, such as WaveNet, model local structure at the expense of global latent structure and slow iterative sampling, while Generative Adversarial Networks (GANs), have global latent conditioning and efficient parall… ▽ More

    Submitted 14 April, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

    Comments: Colab Notebook: http://goo.gl/magenta/gansynth-demo

  8. arXiv:1704.00028  [pdf, other

    cs.LG stat.ML

    Improved Training of Wasserstein GANs

    Authors: Ishaan Gulrajani, Faruk Ahmed, Martin Arjovsky, Vincent Dumoulin, Aaron Courville

    Abstract: Generative Adversarial Networks (GANs) are powerful generative models, but suffer from training instability. The recently proposed Wasserstein GAN (WGAN) makes progress toward stable training of GANs, but sometimes can still generate only low-quality samples or fail to converge. We find that these problems are often due to the use of weight clip** in WGAN to enforce a Lipschitz constraint on the… ▽ More

    Submitted 25 December, 2017; v1 submitted 31 March, 2017; originally announced April 2017.

    Comments: NIPS camera-ready

  9. arXiv:1612.07837  [pdf, other

    cs.SD cs.AI

    SampleRNN: An Unconditional End-to-End Neural Audio Generation Model

    Authors: Soroush Mehri, Kundan Kumar, Ishaan Gulrajani, Rithesh Kumar, Shubham Jain, Jose Sotelo, Aaron Courville, Yoshua Bengio

    Abstract: In this paper we propose a novel model for unconditional audio generation based on generating one audio sample at a time. We show that our model, which profits from combining memory-less modules, namely autoregressive multilayer perceptrons, and stateful recurrent neural networks in a hierarchical structure is able to capture underlying sources of variations in the temporal sequences over very lon… ▽ More

    Submitted 11 February, 2017; v1 submitted 22 December, 2016; originally announced December 2016.

    Comments: Published as a conference paper at ICLR 2017

  10. arXiv:1611.05013  [pdf, other

    cs.LG

    PixelVAE: A Latent Variable Model for Natural Images

    Authors: Ishaan Gulrajani, Kundan Kumar, Faruk Ahmed, Adrien Ali Taiga, Francesco Visin, David Vazquez, Aaron Courville

    Abstract: Natural image modeling is a landmark challenge of unsupervised learning. Variational Autoencoders (VAEs) learn a useful latent representation and model global structure well but have difficulty capturing small details. PixelCNN models details very well, but lacks a latent code and is difficult to scale for capturing large structures. We present PixelVAE, a VAE model with an autoregressive decoder… ▽ More

    Submitted 15 November, 2016; originally announced November 2016.

  11. arXiv:1506.07285  [pdf, other

    cs.CL cs.LG cs.NE

    Ask Me Anything: Dynamic Memory Networks for Natural Language Processing

    Authors: Ankit Kumar, Ozan Irsoy, Peter Ondruska, Mohit Iyyer, James Bradbury, Ishaan Gulrajani, Victor Zhong, Romain Paulus, Richard Socher

    Abstract: Most tasks in natural language processing can be cast into question answering (QA) problems over language input. We introduce the dynamic memory network (DMN), a neural network architecture which processes input sequences and questions, forms episodic memories, and generates relevant answers. Questions trigger an iterative attention process which allows the model to condition its attention on the… ▽ More

    Submitted 5 March, 2016; v1 submitted 24 June, 2015; originally announced June 2015.