Showing 51–100 of 184 results for author: Courville, A

  1. arXiv:2106.02719  [pdf, other]

    cs.CV

    Hierarchical Video Generation for Complex Data

    Authors: Lluis Castrejon, Nicolas Ballas, Aaron Courville

    Abstract: Videos can often be created by first outlining a global description of the scene and then adding local details. Inspired by this we propose a hierarchical model for video generation which follows a coarse to fine approach. First our model generates a low resolution video, establishing the global scene structure, that is then refined by subsequent levels in the hierarchy. We train each level in our…

    Submitted 4 June, 2021; originally announced June 2021.

  2. arXiv:2105.03519  [pdf, other]

    cs.CL

    Understanding by Understanding Not: Modeling Negation in Language Models

    Authors: Arian Hosseini, Siva Reddy, Dzmitry Bahdanau, R Devon Hjelm, Alessandro Sordoni, Aaron Courville

    Abstract: Negation is a core construction in natural language. Despite being very successful on many tasks, state-of-the-art pre-trained language models often handle negation incorrectly. To improve language models in this regard, we propose to augment the language modeling objective with an unlikelihood objective that is based on negated generic sentences from a raw text corpus. By training BERT with the r…

    Submitted 7 May, 2021; originally announced May 2021.

  3. arXiv:2105.01119  [pdf, other]

    cs.LG

    Iterated learning for emergent systematicity in VQA

    Authors: Ankit Vani, Max Schwarzer, Yuchen Lu, Eeshan Dhekane, Aaron Courville

    Abstract: Although neural module networks have an architectural bias towards compositionality, they require gold standard layouts to generalize systematically in practice. When instead learning layouts and modules jointly, compositionality does not arise automatically and an explicit pressure is necessary for the emergence of layouts exhibiting the right structure. We propose to address this problem using i…

    Submitted 3 May, 2021; originally announced May 2021.

    Comments: Published as a conference paper at ICLR 2021. 9 pages main, 21 pages total including references and appendix

    ACM Class: I.2.6

    Journal ref: 9th International Conference on Learning Representations (ICLR 2021)

  4. arXiv:2104.00442  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO

    Touch-based Curiosity for Sparse-Reward Tasks

    Authors: Sai Rajeswar, Cyril Ibrahim, Nitin Surya, Florian Golemo, David Vazquez, Aaron Courville, Pedro O. Pinheiro

    Abstract: Robots in many real-world settings have access to force/torque sensors in their gripper and tactile sensing is often necessary in tasks that involve contact-rich motion. In this work, we leverage surprise from mismatches in touch feedback to guide exploration in hard sparse-reward reinforcement learning tasks. Our approach, Touch-based Curiosity (ToC), learns what visible objects interactions are…

    Submitted 26 June, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

  5. arXiv:2103.10972  [pdf, other]

    cs.LG cs.AI

    Learning Task Decomposition with Ordered Memory Policy Network

    Authors: Yuchen Lu, Yikang Shen, Siyuan Zhou, Aaron Courville, Joshua B. Tenenbaum, Chuang Gan

    Abstract: Many complex real-world tasks are composed of several levels of sub-tasks. Humans leverage these hierarchical structures to accelerate the learning process and achieve better generalization. In this work, we study the inductive bias and propose Ordered Memory Policy Network (OMPN) to discover subtask hierarchy by learning from demonstration. The discovered subtask hierarchy could be used to perfor…

    Submitted 19 March, 2021; originally announced March 2021.

  6. arXiv:2103.03216  [pdf, other]

    cs.LG cs.AI cs.MA

    Continuous Coordination As a Realistic Scenario for Lifelong Learning

    Authors: Hadi Nekoei, Akilesh Badrinaaraayanan, Aaron Courville, Sarath Chandar

    Abstract: Current deep reinforcement learning (RL) algorithms are still highly task-specific and lack the ability to generalize to new environments. Lifelong learning (LLL), however, aims at solving multiple tasks sequentially by efficiently transferring and using knowledge between tasks. Despite a surge of interest in lifelong RL in recent years, the lack of a realistic testbed makes robust evaluation of L…

    Submitted 14 June, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: 19 pages with supplementary materials. Added results for Lifelong RL methods and some future work. Accepted to ICML 2021

  7. arXiv:2101.10276  [pdf, other]

    cs.LG cs.AI cs.MA

    Emergent Communication under Competition

    Authors: Michael Noukhovitch, Travis LaCroix, Angeliki Lazaridou, Aaron Courville

    Abstract: The literature in modern machine learning has only negative results for learning to communicate between competitive agents using standard RL. We introduce a modified sender-receiver game to study the spectrum of partially-competitive scenarios and show communication can indeed emerge in a competitive setting. We empirically demonstrate three key takeaways for future research. First, we show that c…

    Submitted 25 January, 2021; originally announced January 2021.

    Comments: To be presented at AAMAS 2021

  8. arXiv:2012.05942  [pdf, other]

    cs.LG math.OC

    Convex Potential Flows: Universal Probability Distributions with Optimal Transport and Convex Optimization

    Authors: Chin-Wei Huang, Ricky T. Q. Chen, Christos Tsirigotis, Aaron Courville

    Abstract: Flow-based models are powerful tools for designing probabilistic models with tractable density. This paper introduces Convex Potential Flows (CP-Flow), a natural and efficient parameterization of invertible models inspired by the optimal transport (OT) theory. CP-Flows are the gradient map of a strongly convex neural potential function. The convexity implies invertibility and allows us to resort t…

    Submitted 23 February, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

  9. arXiv:2012.00857  [pdf, other]

    cs.CL cs.AI cs.LG

    StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling

    Authors: Yikang Shen, Yi Tay, Che Zheng, Dara Bahri, Donald Metzler, Aaron Courville

    Abstract: There are two major classes of natural language grammar -- the dependency grammar that models one-to-one correspondences between words and the constituency grammar that models the assembly of one or several corresponded words. While previous unsupervised parsing methods mostly focus on only inducing one class of grammars, we introduce a novel model, StructFormer, that can simultaneously induce dep…

    Submitted 10 July, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

    Comments: Published as a conference paper at ACL 2021

  10. arXiv:2011.09468  [pdf, other]

    cs.LG math.DS stat.ML

    Gradient Starvation: A Learning Proclivity in Neural Networks

    Authors: Mohammad Pezeshki, Sékou-Oumar Kaba, Yoshua Bengio, Aaron Courville, Doina Precup, Guillaume Lajoie

    Abstract: We identify and formalize a fundamental gradient descent phenomenon resulting in a learning proclivity in over-parameterized neural networks. Gradient Starvation arises when cross-entropy loss is minimized by capturing only a subset of features relevant for the task, despite the presence of other predictive features that fail to be discovered. This work provides a theoretical explanation for the e…

    Submitted 24 November, 2021; v1 submitted 18 November, 2020; originally announced November 2020.

    Comments: Proceedings of NeurIPS 2021

  11. arXiv:2011.07960  [pdf, other]

    cs.CL cs.LG

    Explicitly Modeling Syntax in Language Models with Incremental Parsing and a Dynamic Oracle

    Authors: Yikang Shen, Shawn Tan, Alessandro Sordoni, Siva Reddy, Aaron Courville

    Abstract: Syntax is fundamental to our thinking about language. Failing to capture the structure of input language could lead to generalization problems and over-parametrization. In the present work, we propose a new syntax-aware language model: Syntactic Ordered Memory (SOM). The model explicitly models the structure with an incremental parser and maintains the conditional probability setting of a standard…

    Submitted 10 May, 2021; v1 submitted 21 October, 2020; originally announced November 2020.

    Comments: 12 pages, 10 figures

    Journal ref: NAACL 2021

  12. arXiv:2011.05499  [pdf, other]

    cs.CV

    Unsupervised Learning of Dense Visual Representations

    Authors: Pedro O. Pinheiro, Amjad Almahairi, Ryan Y. Benmalek, Florian Golemo, Aaron Courville

    Abstract: Contrastive self-supervised learning has emerged as a promising approach to unsupervised visual representation learning. In general, these methods learn global (image-level) representations that are invariant to different views (i.e., compositions of data augmentation) of the same image. However, many visual understanding tasks require dense (pixel-level) representations. In this paper, we propose…

    Submitted 7 December, 2020; v1 submitted 10 November, 2020; originally announced November 2020.

  13. arXiv:2010.11362  [pdf, other]

    cs.SD cs.AI cs.CL cs.LG eess.AS

    NU-GAN: High resolution neural upsampling with GAN

    Authors: Rithesh Kumar, Kundan Kumar, Vicki Anand, Yoshua Bengio, Aaron Courville

    Abstract: In this paper, we propose NU-GAN, a new method for resampling audio from lower to higher sampling rates (upsampling). Audio upsampling is an important problem since productionizing generative speech technology requires operating at high sampling rates. Such applications use audio at a resolution of 44.1 kHz or 48 kHz, whereas current speech synthesis methods are equipped to handle a maximum of 24…

    Submitted 21 October, 2020; originally announced October 2020.

  14. arXiv:2010.10079  [pdf, other]

    stat.ML cs.AI cs.LG stat.AP

    Neural Approximate Sufficient Statistics for Implicit Models

    Authors: Yanzhi Chen, Dinghuai Zhang, Michael Gutmann, Aaron Courville, Zhanxing Zhu

    Abstract: We consider the fundamental problem of how to automatically construct summary statistics for implicit generative models where the evaluation of the likelihood function is intractable, but sampling data from the model is possible. The idea is to frame the task of constructing sufficient statistics as learning mutual information maximizing representations of the data with the help of deep neural net…

    Submitted 30 March, 2021; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: ICLR 2021 spotlight

  15. arXiv:2010.04704  [pdf, other]

    cs.CL cs.LG

    Recursive Top-Down Production for Sentence Generation with Latent Trees

    Authors: Shawn Tan, Yikang Shen, Timothy J. O'Donnell, Alessandro Sordoni, Aaron Courville

    Abstract: We model the recursive production property of context-free grammars for natural and synthetic languages. To this end, we present a dynamic programming algorithm that marginalises over latent binary tree structures with $N$ leaves, allowing us to compute the likelihood of a sequence of $N$ tokens under a latent tree model, which we maximise to train a recursive neural function. We demonstrate perfo…

    Submitted 9 October, 2020; originally announced October 2020.

  16. arXiv:2010.02975  [pdf, other]

    cs.CL

    Supervised Seeded Iterated Learning for Interactive Language Learning

    Authors: Yuchen Lu, Soumye Singhal, Florian Strub, Olivier Pietquin, Aaron Courville

    Abstract: Language drift has been one of the major obstacles to train language models through interaction. When word-based conversational agents are trained towards completing a task, they tend to invent their language rather than leveraging natural language. In recent literature, two general methods partially counter this phenomenon: Supervised Selfplay (S2P) and Seeded Iterated Learning (SIL). While S2P j…

    Submitted 6 October, 2020; originally announced October 2020.

  17. arXiv:2010.01262  [pdf, other]

    cs.LG stat.ML

    Integrating Categorical Semantics into Unsupervised Domain Translation

    Authors: Samuel Lavoie, Faruk Ahmed, Aaron Courville

    Abstract: While unsupervised domain translation (UDT) has seen a lot of success recently, we argue that mediating its translation via categorical semantic features could broaden its applicability. In particular, we demonstrate that categorical semantics improves the translation between perceptually different domains sharing multiple object categories. We propose a method to learn, in an unsupervised manner,…

    Submitted 16 March, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: 22 pages. In submission to the International Conference on Learning Representations (ICLR) 2021

  18. arXiv:2007.05929  [pdf, other]

    cs.LG stat.ML

    Data-Efficient Reinforcement Learning with Self-Predictive Representations

    Authors: Max Schwarzer, Ankesh Anand, Rishab Goel, R Devon Hjelm, Aaron Courville, Philip Bachman

    Abstract: While deep reinforcement learning excels at solving tasks where large amounts of data can be collected through virtually unlimited interaction with the environment, learning from limited interaction remains a key challenge. We posit that an agent can learn more efficiently if we augment reward maximization with self-supervised objectives based on structure in its visual input and sequential intera…

    Submitted 20 May, 2021; v1 submitted 12 July, 2020; originally announced July 2020.

    Comments: The first two authors contributed equally to this work. v4 includes new ablations and reformatting for ICLR camera ready

  19. arXiv:2007.05756  [pdf, other]

    cs.CV cs.LG stat.ML

    Generative Compositional Augmentations for Scene Graph Prediction

    Authors: Boris Knyazev, Harm de Vries, Cătălina Cangea, Graham W. Taylor, Aaron Courville, Eugene Belilovsky

    Abstract: Inferring objects and their relationships from an image in the form of a scene graph is useful in many applications at the intersection of vision and language. We consider a challenging problem of compositional generalization that emerges in this task due to a long tail data distribution. Current scene graph generation models are trained on a tiny fraction of the distribution corresponding to the…

    Submitted 1 October, 2021; v1 submitted 11 July, 2020; originally announced July 2020.

    Comments: ICCV 2021 camera ready. Added more baselines, combining GANs with Neural Motifs and t-SNE visualizations. Code is available at https://github.com/bknyaz/sgg

  20. arXiv:2006.05164  [pdf, other]

    cs.LG stat.ML

    AR-DAE: Towards Unbiased Neural Entropy Gradient Estimation

    Authors: Jae Hyun Lim, Aaron Courville, Christopher Pal, Chin-Wei Huang

    Abstract: Entropy is ubiquitous in machine learning, but it is in general intractable to compute the entropy of the distribution of an arbitrary continuous random variable. In this paper, we propose the amortized residual denoising autoencoder (AR-DAE) to approximate the gradient of the log density function, which can be used to estimate the gradient of entropy. Amortization allows us to significantly reduc…

    Submitted 9 June, 2020; originally announced June 2020.

    Comments: Accepted at ICML 2020

  21. arXiv:2005.08230  [pdf, other]

    cs.CV cs.LG

    Graph Density-Aware Losses for Novel Compositions in Scene Graph Generation

    Authors: Boris Knyazev, Harm de Vries, Cătălina Cangea, Graham W. Taylor, Aaron Courville, Eugene Belilovsky

    Abstract: Scene graph generation (SGG) aims to predict graph-structured descriptions of input images, in the form of objects and relationships between them. This task is becoming increasingly useful for progress at the interface of vision and language. Here, it is important - yet challenging - to perform well on novel (zero-shot) or rare (few-shot) compositions of objects and relationships. In this paper, w…

    Submitted 17 August, 2020; v1 submitted 17 May, 2020; originally announced May 2020.

    Comments: Accepted at BMVC 2020. Code is available at https://github.com/bknyaz/sgg

  22. arXiv:2005.06616  [pdf, other]

    cs.CY cs.AI cs.CL cs.HC cs.LG

    A Large-Scale, Open-Domain, Mixed-Interface Dialogue-Based ITS for STEM

    Authors: Iulian Vlad Serban, Varun Gupta, Ekaterina Kochmar, Dung D. Vu, Robert Belfer, Joelle Pineau, Aaron Courville, Laurent Charlin, Yoshua Bengio

    Abstract: We present Korbit, a large-scale, open-domain, mixed-interface, dialogue-based intelligent tutoring system (ITS). Korbit uses machine learning, natural language processing and reinforcement learning to provide interactive, personalized learning online. Korbit has been designed to easily scale to thousands of subjects, by automating, standardizing and simplifying the content creation process. Unlik…

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: 6 pages, 1 figure, 1 table, accepted for publication in the 21st International Conference on Artificial Intelligence in Education (AIED 2020)

    ACM Class: I.2.0; I.2.1; I.2.7; K.3.1; G.4

  23. arXiv:2003.14166  [pdf, other]

    cs.CV cs.LG stat.ML

    Pix2Shape: Towards Unsupervised Learning of 3D Scenes from Images using a View-based Representation

    Authors: Sai Rajeswar, Fahim Mannan, Florian Golemo, Jérôme Parent-Lévesque, David Vazquez, Derek Nowrouzezahrai, Aaron Courville

    Abstract: We infer and generate three-dimensional (3D) scene information from a single input image and without supervision. This problem is under-explored, with most prior work relying on supervision from, e.g., 3D ground-truth, multiple images of a scene, image silhouettes or key-points. We propose Pix2Shape, an approach to solve this problem with four components: (i) an encoder that infers the latent 3D r…

    Submitted 17 April, 2020; v1 submitted 22 March, 2020; originally announced March 2020.

    Comments: This is a pre-print of an article published in International Journal of Computer Vision. The final authenticated version is available online at: https://doi.org/10.1007/s11263-020-01322-1

    Journal ref: International Journal of Computer Vision, (2020), 1-16

  24. arXiv:2003.12694  [pdf, other]

    cs.AI cs.CL

    Countering Language Drift with Seeded Iterated Learning

    Authors: Yuchen Lu, Soumye Singhal, Florian Strub, Olivier Pietquin, Aaron Courville

    Abstract: Pretraining on human corpus and then finetuning in a simulator has become a standard pipeline for training a goal-oriented dialogue agent. Nevertheless, as soon as the agents are finetuned to maximize task completion, they suffer from the so-called language drift phenomenon: they slowly lose syntactic and semantic properties of language as they only focus on solving the task. In this paper, we pro…

    Submitted 24 August, 2020; v1 submitted 27 March, 2020; originally announced March 2020.

  25. arXiv:2003.00688  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Out-of-Distribution Generalization via Risk Extrapolation (REx)

    Authors: David Krueger, Ethan Caballero, Joern-Henrik Jacobsen, Amy Zhang, Jonathan Binas, Dinghuai Zhang, Remi Le Priol, Aaron Courville

    Abstract: Distributional shift is one of the major obstacles when transferring machine learning prediction systems from the lab to the real world. To tackle this problem, we assume that variation across training domains is representative of the variation we might encounter at test time, but also that shifts at test time may be more extreme in magnitude. In particular, we show that reducing differences in ri…

    Submitted 25 February, 2021; v1 submitted 2 March, 2020; originally announced March 2020.

  26. arXiv:2002.07101  [pdf, other]

    cs.LG stat.ML

    Augmented Normalizing Flows: Bridging the Gap Between Generative Flows and Latent Variable Models

    Authors: Chin-Wei Huang, Laurent Dinh, Aaron Courville

    Abstract: In this work, we propose a new family of generative flows on an augmented data space, with an aim to improve expressivity without drastically increasing the computational cost of sampling and evaluation of a lower bound on the likelihood. Theoretically, we prove the proposed flow can approximate a Hamiltonian ODE as a universal transport map. Empirically, we demonstrate state-of-the-art performanc…

    Submitted 17 February, 2020; originally announced February 2020.

    Comments: 27 pages, 12 figures

  27. arXiv:1912.05783  [pdf, other]

    cs.AI cs.LG

    CLOSURE: Assessing Systematic Generalization of CLEVR Models

    Authors: Dzmitry Bahdanau, Harm de Vries, Timothy J. O'Donnell, Shikhar Murty, Philippe Beaudoin, Yoshua Bengio, Aaron Courville

    Abstract: The CLEVR dataset of natural-looking questions about 3D-rendered scenes has recently received much attention from the research community. A number of models have been proposed for this task, many of which achieved very high accuracies of around 97-99%. In this work, we study how systematic the generalization of such models is, that is, to what extent they are capable of handling novel combinations…

    Submitted 17 October, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

    Comments: Technical report

  28. arXiv:1911.05248  [pdf, other]

    cs.LG cs.AI cs.CV cs.HC stat.ML

    What Do Compressed Deep Neural Networks Forget?

    Authors: Sara Hooker, Aaron Courville, Gregory Clark, Yann Dauphin, Andrea Frome

    Abstract: Deep neural network pruning and quantization techniques have demonstrated it is possible to achieve high levels of compression with surprisingly little degradation to test set accuracy. However, this measure of performance conceals significant differences in how different classes and images are impacted by model compression techniques. We find that models with radically different numbers of weight…

    Submitted 5 September, 2021; v1 submitted 12 November, 2019; originally announced November 2019.

  29. arXiv:1910.13466  [pdf, other]

    cs.LG cs.CL

    Ordered Memory

    Authors: Yikang Shen, Shawn Tan, Arian Hosseini, Zhouhan Lin, Alessandro Sordoni, Aaron Courville

    Abstract: Stack-augmented recurrent neural networks (RNNs) have been of interest to the deep learning community for some time. However, the difficulty of training memory models remains a problem obstructing the widespread use of such models. In this paper, we propose the Ordered Memory architecture. Inspired by Ordered Neurons (Shen et al., 2018), we introduce a new attention-based mechanism and use its cum…

    Submitted 3 November, 2019; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: Published in NeurIPS 2019

  30. arXiv:1910.09570  [pdf, other]

    q-bio.QM cs.CV eess.SP stat.AP stat.ML

    Icentia11K: An Unsupervised Representation Learning Dataset for Arrhythmia Subtype Discovery

    Authors: Shawn Tan, Guillaume Androz, Ahmad Chamseddine, Pierre Fecteau, Aaron Courville, Yoshua Bengio, Joseph Paul Cohen

    Abstract: We release the largest public ECG dataset of continuous raw signals for representation learning containing 11 thousand patients and 2 billion labelled beats. Our goal is to enable semi-supervised ECG models to be made as well as to discover unknown subtypes of arrhythmia and anomalous ECG signal events. To this end, we propose an unsupervised representation learning task, evaluated in a semi-super…

    Submitted 21 October, 2019; originally announced October 2019.

    Comments: Under Review

  31. arXiv:1910.06711  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis

    Authors: Kundan Kumar, Rithesh Kumar, Thibault de Boissiere, Lucas Gestin, Wei Zhen Teoh, Jose Sotelo, Alexandre de Brebisson, Yoshua Bengio, Aaron Courville

    Abstract: Previous works (Donahue et al., 2018a; Engel et al., 2019a) have found that generating coherent raw audio waveforms with GANs is challenging. In this paper, we show that it is possible to train GANs reliably to generate high quality coherent waveforms by introducing a set of architectural changes and simple training techniques. Subjective evaluation metric (Mean Opinion Score, or MOS) shows the ef…

    Submitted 8 December, 2019; v1 submitted 8 October, 2019; originally announced October 2019.

  32. arXiv:1909.02128  [pdf, other]

    cs.AI cs.LG cs.MA

    No Press Diplomacy: Modeling Multi-Agent Gameplay

    Authors: Philip Paquette, Yuchen Lu, Steven Bocco, Max O. Smith, Satya Ortiz-Gagne, Jonathan K. Kummerfeld, Satinder Singh, Joelle Pineau, Aaron Courville

    Abstract: Diplomacy is a seven-player non-stochastic, non-cooperative game, where agents acquire resources through a mix of teamwork and betrayal. Reliance on trust and coordination makes Diplomacy the first non-cooperative multi-agent benchmark for complex sequential social dilemmas in a rich environment. In this work, we focus on training an agent that learns to play the No Press version of Diplomacy wher…

    Submitted 19 November, 2019; v1 submitted 4 September, 2019; originally announced September 2019.

    Comments: Accepted at NeurIPS 2019

  33. arXiv:1908.04950  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    VideoNavQA: Bridging the Gap between Visual and Embodied Question Answering

    Authors: Cătălina Cangea, Eugene Belilovsky, Pietro Liò, Aaron Courville

    Abstract: Embodied Question Answering (EQA) is a recently proposed task, where an agent is placed in a rich 3D environment and must act based solely on its egocentric input to answer a given question. The desired outcome is that the agent learns to combine capabilities such as scene understanding, navigation and language understanding in order to perform complex reasoning in the visual world. However, initi…

    Submitted 14 August, 2019; originally announced August 2019.

    Comments: To appear at BMVC 2019. 15 pages, 5 figures

  34. arXiv:1908.04388  [pdf, other]

    cs.CV cs.LG

    Detecting semantic anomalies

    Authors: Faruk Ahmed, Aaron Courville

    Abstract: We critically appraise the recent interest in out-of-distribution (OOD) detection and question the practical relevance of existing benchmarks. While the currently prevalent trend is to consider different datasets as OOD, we argue that out-distributions of practical interest are ones where the distinction is semantic in nature for a specified context, and that evaluative tasks should reflect this m…

    Submitted 21 November, 2019; v1 submitted 13 August, 2019; originally announced August 2019.

    Comments: Preprint for AAAI '20 publication

  35. arXiv:1908.02388  [pdf, other]

    cs.LG stat.ML

    Benchmarking Bonus-Based Exploration Methods on the Arcade Learning Environment

    Authors: Adrien Ali Taïga, William Fedus, Marlos C. Machado, Aaron Courville, Marc G. Bellemare

    Abstract: This paper provides an empirical evaluation of recently developed exploration algorithms within the Arcade Learning Environment (ALE). We study the use of different reward bonuses that incentivize exploration in reinforcement learning. We do so by fixing the learning algorithm used and focusing only on the impact of the different exploration bonuses in the agent's performance. We use Rainbow, the s…

    Submitted 24 September, 2021; v1 submitted 6 August, 2019; originally announced August 2019.

    Comments: Accepted at the second Exploration in Reinforcement Learning Workshop at the 36th International Conference on Machine Learning, Long Beach, California. The full version arxiv.longhoe.net/abs/2109.11052 was published as a conference paper at ICLR 2020

  36. arXiv:1906.09691  [pdf, other]

    cs.LG stat.ML

    Adversarial Computation of Optimal Transport Maps

    Authors: Jacob Leygonie, Jennifer She, Amjad Almahairi, Sai Rajeswar, Aaron Courville

    Abstract: Computing optimal transport maps between high-dimensional and continuous distributions is a challenging problem in optimal transport (OT). Generative adversarial networks (GANs) are powerful generative models which have been successfully applied to learn maps across high-dimensional domains. However, little is known about the nature of the map learned with a GAN objective. To address this problem,…

    Submitted 23 June, 2019; originally announced June 2019.

  37. arXiv:1906.09635  [pdf, other]

    cs.CL cs.LG

    Investigating Biases in Textual Entailment Datasets

    Authors: Shawn Tan, Yikang Shen, Chin-Wei Huang, Aaron Courville

    Abstract: The ability to understand logical relationships between sentences is an important task in language understanding. To aid in progress for this task, researchers have collected datasets for machine learning and evaluation of current systems. However, like in the crowdsourced Visual Question Answering (VQA) task, some biases in the data inevitably occur. In our experiments, we find that performing cl…

    Submitted 23 June, 2019; originally announced June 2019.

  38. arXiv:1906.04282  [pdf, other]

    cs.LG stat.ML

    Stochastic Neural Network with Kronecker Flow

    Authors: Chin-Wei Huang, Ahmed Touati, Pascal Vincent, Gintare Karolina Dziugaite, Alexandre Lacoste, Aaron Courville

    Abstract: Recent advances in variational inference enable the modelling of highly structured joint distributions, but are limited in their capacity to scale to the high-dimensional setting of stochastic neural networks. This limitation motivates a need for scalable parameterizations of the noise generation process, in a manner that adequately captures the dependencies among the various parameters. In this w…

    Submitted 13 February, 2020; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS) 2020

  39. arXiv:1906.03708  [pdf, other]

    cs.LG stat.ML

    Note on the bias and variance of variational inference

    Authors: Chin-Wei Huang, Aaron Courville

    Abstract: In this note, we study the relationship between the variational gap and the variance of the (log) likelihood ratio. We show that the gap can be upper bounded by some form of dispersion measure of the likelihood ratio, which suggests the bias of variational inference can be reduced by making the distribution of the likelihood ratio more concentrated, such as via averaging and variance reduction.

    Submitted 9 June, 2019; originally announced June 2019.

    Comments: 5 pages

  40. arXiv:1905.12760  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Batch weight for domain adaptation with mass shift

    Authors: Mikołaj Bińkowski, R Devon Hjelm, Aaron Courville

    Abstract: Unsupervised domain transfer is the task of transferring or translating samples from a source distribution to a different target distribution. Current solutions to unsupervised domain transfer often operate on data on which the modes of the distribution are well-matched, for instance have the same frequencies of classes between source and target distributions. However, these models do not perform wel…

    Submitted 29 May, 2019; originally announced May 2019.

  41. arXiv:1905.04866  [pdf, other]

    cs.LG stat.ML

    Hierarchical Importance Weighted Autoencoders

    Authors: Chin-Wei Huang, Kris Sankaran, Eeshan Dhekane, Alexandre Lacoste, Aaron Courville

    Abstract: Importance weighted variational inference (Burda et al., 2015) uses multiple i.i.d. samples to have a tighter variational lower bound. We believe a joint proposal has the potential of reducing the number of redundant samples, and introduce a hierarchical structure to induce correlation. The hope is that the proposals would coordinate to make up for the error made by one another to reduce the varia…

    Submitted 13 May, 2019; originally announced May 2019.

    Comments: Accepted by ICML 2019. 17 pages

  42. arXiv:1904.12165  [pdf, other]

    cs.CV cs.LG

    Improved Conditional VRNNs for Video Prediction

    Authors: Lluis Castrejon, Nicolas Ballas, Aaron Courville

    Abstract: Predicting future frames for a video sequence is a challenging generative modeling task. Promising approaches include probabilistic latent variable models such as the Variational Auto-Encoder. While VAEs can handle uncertainty and model multiple possible future outcomes, they have a tendency to produce blurry predictions. In this work we argue that this is a sign of underfitting. To address this i…

    Submitted 27 April, 2019; originally announced April 2019.

    Comments: Project page: https://sites.google.com/view/videovrnn

  43. arXiv:1903.07227  [pdf, other]

    cs.LG cs.SD eess.AS stat.ML

    Counterpoint by Convolution

    Authors: Cheng-Zhi Anna Huang, Tim Cooijmans, Adam Roberts, Aaron Courville, Douglas Eck

    Abstract: Machine learning models of music typically break up the task of composition into a chronological process, composing a piece of music in a single pass from beginning to end. On the contrary, human composers write music in a nonlinear fashion, scribbling motifs here and there, often revisiting choices previously made. In order to better approximate this process, we train a convolutional neural netwo…

    Submitted 17 March, 2019; originally announced March 2019.

    Comments: Proceedings of the 18th International Society for Music Information Retrieval Conference, ISMIR 2017

    ACM Class: H.5.5; I.2

  44. arXiv:1901.08508  [pdf, other]

    cs.LG cs.AI stat.ML

    Maximum Entropy Generators for Energy-Based Models

    Authors: Rithesh Kumar, Sherjil Ozair, Anirudh Goyal, Aaron Courville, Yoshua Bengio

    Abstract: Maximum likelihood estimation of energy-based models is a challenging problem due to the intractability of the log-likelihood gradient. In this work, we propose learning both the energy function and an amortized approximate sampling mechanism using a neural generator network, which provides an efficient approximation of the log-likelihood gradient. The resulting objective requires maximizing entro…

    Submitted 27 May, 2019; v1 submitted 24 January, 2019; originally announced January 2019.

  45. arXiv:1812.01180  [pdf, other]

    cs.CV

    Deep Generative Modeling of LiDAR Data

    Authors: Lucas Caccia, Herke van Hoof, Aaron Courville, Joelle Pineau

    Abstract: Building models capable of generating structured output is a key challenge for AI and robotics. While generative models have been explored on many types of data, little work has been done on synthesizing lidar scans, which play a key role in robot mapping and localization. In this work, we show that one can adapt deep generative models for this task by unravelling lidar scans into a 2D point map.…

    Submitted 2 December, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

    Comments: Presented at IROS 2019

  46. arXiv:1811.12889  [pdf, other]

    cs.CL cs.AI

    Systematic Generalization: What Is Required and Can It Be Learned?

    Authors: Dzmitry Bahdanau, Shikhar Murty, Michael Noukhovitch, Thien Huu Nguyen, Harm de Vries, Aaron Courville

    Abstract: Numerous models for grounded language understanding have been recently proposed, including (i) generic models that can be easily adapted to any given task and (ii) intuitively appealing modular models that require background knowledge to be instantiated. We compare both types of models in how much they lend themselves to a particular form of systematic generalization. Using a synthetic VQA test, w…

    Submitted 21 April, 2019; v1 submitted 30 November, 2018; originally announced November 2018.

    Comments: Published as a conference paper at ICLR 2019

  47. arXiv:1811.10097  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    Planning in Dynamic Environments with Conditional Autoregressive Models

    Authors: Johanna Hansen, Kyle Kastner, Aaron Courville, Gregory Dudek

    Abstract: We demonstrate the use of conditional autoregressive generative models (van den Oord et al., 2016a) over a discrete latent space (van den Oord et al., 2017b) for forward planning with MCTS. In order to test this method, we introduce a new environment featuring varying difficulty levels, along with moving goals and obstacles. The combination of high-quality frame generation and classical planning a…

    Submitted 25 November, 2018; originally announced November 2018.

    Comments: 6 pages, 1 figure, in Proceedings of the Prediction and Generative Modeling in Reinforcement Learning Workshop at the International Conference on Machine Learning (ICML) in 2018

  48. arXiv:1811.07426  [pdf, other]

    cs.SD cs.LG eess.AS stat.ML

    Harmonic Recomposition using Conditional Autoregressive Modeling

    Authors: Kyle Kastner, Rithesh Kumar, Tim Cooijmans, Aaron Courville

    Abstract: We demonstrate a conditional autoregressive pipeline for efficient music recomposition, based on methods presented in van den Oord et al. (2017). Recomposition (Casal & Casey, 2010) focuses on reworking existing musical pieces, adhering to structure at a high level while also re-imagining other aspects of the work. This can involve reuse of pre-existing themes or parts of the original piece, while…

    Submitted 18 November, 2018; originally announced November 2018.

    Comments: 3 pages, 2 figures. In Proceedings of The Joint Workshop on Machine Learning for Music, ICML 2018

  49. arXiv:1811.07240  [pdf, other]

    cs.LG cs.CL cs.SD eess.AS stat.ML

    Representation Mixing for TTS Synthesis

    Authors: Kyle Kastner, João Felipe Santos, Yoshua Bengio, Aaron Courville

    Abstract: Recent character and phoneme-based parametric TTS systems using deep learning have shown strong performance in natural speech generation. However, the choice between character or phoneme input can create serious limitations for practical deployment, as direct control of pronunciation is crucial in certain cases. We demonstrate a simple method for combining multiple types of linguistic information…

    Submitted 24 November, 2018; v1 submitted 17 November, 2018; originally announced November 2018.

    Comments: 5 pages, 3 figures

  50. arXiv:1811.05013  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Blindfold Baselines for Embodied QA

    Authors: Ankesh Anand, Eugene Belilovsky, Kyle Kastner, Hugo Larochelle, Aaron Courville

    Abstract: We explore blindfold (question-only) baselines for Embodied Question Answering. The EmbodiedQA task requires an agent to answer a question by intelligently navigating in a simulated environment, gathering necessary visual information only through first-person vision before finally answering. Consequently, a blindfold baseline which ignores the environment and visual information is a degenerate sol…

    Submitted 12 November, 2018; originally announced November 2018.

    Comments: NIPS 2018 Visually-Grounded Interaction and Language (ViGIL) Workshop