Skip to main content

Showing 1–17 of 17 results for author: Warde-Farley, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.02101  [pdf, other

    cs.AI cs.LG cs.LO

    Solving MaxSAT with Matrix Multiplication

    Authors: David Warde-Farley, Vinod Nair, Yujia Li, Ivan Lobov, Felix Gimeno, Simon Osindero

    Abstract: We propose an incomplete algorithm for Maximum Satisfiability (MaxSAT) specifically designed to run on neural network accelerators such as GPUs and TPUs. Given a MaxSAT problem instance in conjunctive normal form, our procedure constructs a Restricted Boltzmann Machine (RBM) with an equilibrium distribution wherein the probability of a Boolean assignment is exponential in the number of clauses it… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  2. arXiv:2107.14226  [pdf, other

    cs.LG cs.AI stat.ML

    Learning more skills through optimistic exploration

    Authors: DJ Strouse, Kate Baumli, David Warde-Farley, Vlad Mnih, Steven Hansen

    Abstract: Unsupervised skill learning objectives (Gregor et al., 2016, Eysenbach et al., 2018) allow agents to learn rich repertoires of behavior in the absence of extrinsic rewards. They work by simultaneously training a policy to produce distinguishable latent-conditioned trajectories, and a discriminator to evaluate distinguishability by trying to infer latents from trajectories. The hope is for the agen… ▽ More

    Submitted 12 May, 2022; v1 submitted 29 July, 2021; originally announced July 2021.

    Comments: Accepted at ICLR 2022 (spotlight)

  3. arXiv:2012.07827  [pdf, other

    cs.AI cs.LG

    Relative Variational Intrinsic Control

    Authors: Kate Baumli, David Warde-Farley, Steven Hansen, Volodymyr Mnih

    Abstract: In the absence of external rewards, agents can still learn useful behaviors by identifying and mastering a set of diverse skills within their environment. Existing skill learning methods use mutual information objectives to incentivize each skill to be diverse and distinguishable from the rest. However, if care is not taken to constrain the ways in which the skills are diverse, trivially diverse s… ▽ More

    Submitted 14 December, 2020; originally announced December 2020.

    Comments: Accepted by AAAI2021

  4. arXiv:2001.08116  [pdf, other

    cs.LG cs.AI stat.ML

    Q-Learning in enormous action spaces via amortized approximate maximization

    Authors: Tom Van de Wiele, David Warde-Farley, Andriy Mnih, Volodymyr Mnih

    Abstract: Applying Q-learning to high-dimensional or continuous action spaces can be difficult due to the required maximization over the set of possible actions. Motivated by techniques from amortized inference, we replace the expensive maximization over all actions with a maximization over a small subset of possible actions sampled from a learned proposal distribution. The resulting approach, which we dub… ▽ More

    Submitted 22 January, 2020; originally announced January 2020.

    Comments: A previous version of this work appeared at the Deep Reinforcement Learning Workshop, NeurIPS 2018

  5. arXiv:1906.05030  [pdf, other

    cs.LG cs.AI stat.ML

    Fast Task Inference with Variational Intrinsic Successor Features

    Authors: Steven Hansen, Will Dabney, Andre Barreto, Tom Van de Wiele, David Warde-Farley, Volodymyr Mnih

    Abstract: It has been established that diverse behaviors spanning the controllable subspace of an Markov decision process can be trained by rewarding a policy for being distinguishable from other policies \citep{gregor2016variational, eysenbach2018diversity, warde2018unsupervised}. However, one limitation of this formulation is generalizing behaviors beyond the finite set being explicitly learned, as is nee… ▽ More

    Submitted 27 January, 2020; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: Accepted for publication at ICLR 2020

  6. arXiv:1811.11359  [pdf, other

    cs.LG cs.AI stat.ML

    Unsupervised Control Through Non-Parametric Discriminative Rewards

    Authors: David Warde-Farley, Tom Van de Wiele, Tejas Kulkarni, Catalin Ionescu, Steven Hansen, Volodymyr Mnih

    Abstract: Learning to control an environment without hand-crafted rewards or expert data remains challenging and is at the frontier of reinforcement learning research. We present an unsupervised learning algorithm to train agents to achieve perceptually-specified goals using only a stream of observations and actions. Our agent simultaneously learns a goal-conditioned policy and a goal achievement reward fun… ▽ More

    Submitted 27 November, 2018; originally announced November 2018.

    Comments: 10 pages + references & 5 page appendix

  7. arXiv:1706.04987  [pdf, other

    stat.ML cs.LG

    Variational Approaches for Auto-Encoding Generative Adversarial Networks

    Authors: Mihaela Rosca, Balaji Lakshminarayanan, David Warde-Farley, Shakir Mohamed

    Abstract: Auto-encoding generative adversarial networks (GANs) combine the standard GAN algorithm, which discriminates between real and model-generated data, with a reconstruction loss given by an auto-encoder. Such models aim to prevent mode collapse in the learned generative model by ensuring that it is grounded in all the available training data. In this paper, we develop a principle upon which auto-enco… ▽ More

    Submitted 21 October, 2017; v1 submitted 15 June, 2017; originally announced June 2017.

  8. arXiv:1605.02688  [pdf, other

    cs.SC cs.LG cs.MS

    Theano: A Python framework for fast computation of mathematical expressions

    Authors: The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano , et al. (88 additional authors not shown)

    Abstract: Theano is a Python library that allows to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements. Theano is being actively and continuously developed since 2008, mu… ▽ More

    Submitted 9 May, 2016; originally announced May 2016.

    Comments: 19 pages, 5 figures

  9. arXiv:1506.00619  [pdf, ps, other

    cs.LG cs.NE stat.ML

    Blocks and Fuel: Frameworks for deep learning

    Authors: Bart van Merriënboer, Dzmitry Bahdanau, Vincent Dumoulin, Dmitriy Serdyuk, David Warde-Farley, Jan Chorowski, Yoshua Bengio

    Abstract: We introduce two Python frameworks to train neural networks on large datasets: Blocks and Fuel. Blocks is based on Theano, a linear algebra compiler with CUDA-support. It facilitates the training of complex neural network models by providing parametrized Theano operations, attaching metadata to Theano's symbolic computational graph, and providing an extensive set of utilities to assist training th… ▽ More

    Submitted 1 June, 2015; originally announced June 2015.

  10. Brain Tumor Segmentation with Deep Neural Networks

    Authors: Mohammad Havaei, Axel Davy, David Warde-Farley, Antoine Biard, Aaron Courville, Yoshua Bengio, Chris Pal, Pierre-Marc Jodoin, Hugo Larochelle

    Abstract: In this paper, we present a fully automatic brain tumor segmentation method based on Deep Neural Networks (DNNs). The proposed networks are tailored to glioblastomas (both low and high grade) pictured in MR images. By their very nature, these tumors can appear anywhere in the brain and have almost any kind of shape, size, and contrast. These reasons motivate our exploration of a machine learning s… ▽ More

    Submitted 20 May, 2016; v1 submitted 13 May, 2015; originally announced May 2015.

  11. arXiv:1503.01800  [pdf, other

    cs.LG cs.CV

    EmoNets: Multimodal deep learning approaches for emotion recognition in video

    Authors: Samira Ebrahimi Kahou, Xavier Bouthillier, Pascal Lamblin, Caglar Gulcehre, Vincent Michalski, Kishore Konda, Sébastien Jean, Pierre Froumenty, Yann Dauphin, Nicolas Boulanger-Lewandowski, Raul Chandias Ferrari, Mehdi Mirza, David Warde-Farley, Aaron Courville, Pascal Vincent, Roland Memisevic, Christopher Pal, Yoshua Bengio

    Abstract: The task of the emotion recognition in the wild (EmotiW) Challenge is to assign one of seven emotions to short video clips extracted from Hollywood style movies. The videos depict acted-out emotions under realistic conditions with a large degree of variation in attributes such as pose and illumination, making it worthwhile to explore approaches which consider combinations of features from multiple… ▽ More

    Submitted 29 March, 2015; v1 submitted 5 March, 2015; originally announced March 2015.

  12. arXiv:1412.6563  [pdf, other

    stat.ML cs.CV cs.LG cs.NE

    Self-informed neural network structure learning

    Authors: David Warde-Farley, Andrew Rabinovich, Dragomir Anguelov

    Abstract: We study the problem of large scale, multi-label visual recognition with a large number of possible classes. We propose a method for augmenting a trained neural network classifier with auxiliary capacity in a manner designed to significantly improve upon an already well-performing model, while minimally impacting its computational footprint. Using the predictions of the network itself as a descrip… ▽ More

    Submitted 13 April, 2015; v1 submitted 19 December, 2014; originally announced December 2014.

    Comments: Updated with accepted workshop contribution header

  13. arXiv:1406.2661  [pdf, other

    stat.ML cs.LG

    Generative Adversarial Networks

    Authors: Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio

    Abstract: We propose a new framework for estimating generative models via an adversarial process, in which we simultaneously train two models: a generative model G that captures the data distribution, and a discriminative model D that estimates the probability that a sample came from the training data rather than G. The training procedure for G is to maximize the probability of D making a mistake. This fram… ▽ More

    Submitted 10 June, 2014; originally announced June 2014.

  14. arXiv:1312.6197  [pdf, other

    stat.ML cs.LG cs.NE

    An empirical analysis of dropout in piecewise linear networks

    Authors: David Warde-Farley, Ian J. Goodfellow, Aaron Courville, Yoshua Bengio

    Abstract: The recently introduced dropout training criterion for neural networks has been the subject of much attention due to its simplicity and remarkable effectiveness as a regularizer, as well as its interpretation as a training procedure for an exponentially large ensemble of networks that share parameters. In this work we empirically investigate several questions related to the efficacy of dropout, sp… ▽ More

    Submitted 2 January, 2014; v1 submitted 20 December, 2013; originally announced December 2013.

    Comments: Extensive updates; 8 pages plus acknowledgements/references

  15. arXiv:1308.4214  [pdf, ps, other

    stat.ML cs.LG cs.MS

    Pylearn2: a machine learning research library

    Authors: Ian J. Goodfellow, David Warde-Farley, Pascal Lamblin, Vincent Dumoulin, Mehdi Mirza, Razvan Pascanu, James Bergstra, Frédéric Bastien, Yoshua Bengio

    Abstract: Pylearn2 is a machine learning research library. This does not just mean that it is a collection of machine learning algorithms that share a common API; it means that it has been designed for flexibility and extensibility in order to facilitate research projects that involve new or unusual use cases. In this paper we give a brief history of the library, an overview of its basic philosophy, a summa… ▽ More

    Submitted 19 August, 2013; originally announced August 2013.

    Comments: 9 pages

  16. arXiv:1302.4389  [pdf, other

    stat.ML cs.LG

    Maxout Networks

    Authors: Ian J. Goodfellow, David Warde-Farley, Mehdi Mirza, Aaron Courville, Yoshua Bengio

    Abstract: We consider the problem of designing models to leverage a recently introduced approximate model averaging technique called dropout. We define a simple new model called maxout (so named because its output is the max of a set of inputs, and because it is a natural companion to dropout) designed to both facilitate optimization by dropout and improve the accuracy of dropout's fast approximate model av… ▽ More

    Submitted 20 September, 2013; v1 submitted 18 February, 2013; originally announced February 2013.

    Comments: This is the version of the paper that appears in ICML 2013

    Journal ref: JMLR WCP 28 (3): 1319-1327, 2013

  17. arXiv:1211.5590  [pdf, other

    cs.SC cs.LG

    Theano: new features and speed improvements

    Authors: Frédéric Bastien, Pascal Lamblin, Razvan Pascanu, James Bergstra, Ian Goodfellow, Arnaud Bergeron, Nicolas Bouchard, David Warde-Farley, Yoshua Bengio

    Abstract: Theano is a linear algebra compiler that optimizes a user's symbolically-specified mathematical computations to produce efficient low-level implementations. In this paper, we present new features and efficiency improvements to Theano, and benchmarks demonstrating Theano's performance relative to Torch7, a recently introduced machine learning library, and to RNNLM, a C++ library targeted at recurre… ▽ More

    Submitted 23 November, 2012; originally announced November 2012.

    Comments: Presented at the Deep Learning Workshop, NIPS 2012