Showing 1–12 of 12 results for author: de Brébisson, A

Searching in archive cs.
  1. arXiv:1910.06711  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    MelGAN: Generative Adversarial Networks for Conditional Waveform Synthesis

    Authors: Kundan Kumar, Rithesh Kumar, Thibault de Boissiere, Lucas Gestin, Wei Zhen Teoh, Jose Sotelo, Alexandre de Brebisson, Yoshua Bengio, Aaron Courville

    Abstract: Previous works (Donahue et al., 2018a; Engel et al., 2019a) have found that generating coherent raw audio waveforms with GANs is challenging. In this paper, we show that it is possible to train GANs reliably to generate high quality coherent waveforms by introducing a set of architectural changes and simple training techniques. Subjective evaluation metric (Mean Opinion Score, or MOS) shows the ef…

    Submitted 8 December, 2019; v1 submitted 8 October, 2019; originally announced October 2019.

  2. arXiv:1801.06700  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE stat.ML

    A Deep Reinforcement Learning Chatbot (Short Version)

    Authors: Iulian V. Serban, Chinnadhurai Sankar, Mathieu Germain, Saizheng Zhang, Zhouhan Lin, Sandeep Subramanian, Taesup Kim, Michael Pieper, Sarath Chandar, Nan Rosemary Ke, Sai Rajeswar, Alexandre de Brebisson, Jose M. R. Sotelo, Dendi Suhubdy, Vincent Michalski, Alexandre Nguyen, Joelle Pineau, Yoshua Bengio

    Abstract: We present MILABOT: a deep reinforcement learning chatbot developed by the Montreal Institute for Learning Algorithms (MILA) for the Amazon Alexa Prize competition. MILABOT is capable of conversing with humans on popular small talk topics through both speech and text. The system consists of an ensemble of natural language generation and retrieval models, including neural network and template-based…

    Submitted 20 January, 2018; originally announced January 2018.

    Comments: 9 pages, 1 figure, 2 tables; presented at NIPS 2017, Conversational AI: "Today's Practice and Tomorrow's Potential" Workshop

    ACM Class: I.5.1; I.2.7

  3. arXiv:1801.01442  [pdf, other]

    cs.CV

    ObamaNet: Photo-realistic lip-sync from text

    Authors: Rithesh Kumar, Jose Sotelo, Kundan Kumar, Alexandre de Brebisson, Yoshua Bengio

    Abstract: We present ObamaNet, the first architecture that generates both audio and synchronized photo-realistic lip-sync videos from any new text. Contrary to other published lip-sync approaches, ours is only composed of fully trainable neural modules and does not rely on any traditional computer graphics methods. More precisely, we use three main modules: a text-to-speech network based on Char2Wav, a time…

    Submitted 6 December, 2017; originally announced January 2018.

  4. arXiv:1709.02349  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE stat.ML

    A Deep Reinforcement Learning Chatbot

    Authors: Iulian V. Serban, Chinnadhurai Sankar, Mathieu Germain, Saizheng Zhang, Zhouhan Lin, Sandeep Subramanian, Taesup Kim, Michael Pieper, Sarath Chandar, Nan Rosemary Ke, Sai Rajeshwar, Alexandre de Brebisson, Jose M. R. Sotelo, Dendi Suhubdy, Vincent Michalski, Alexandre Nguyen, Joelle Pineau, Yoshua Bengio

    Abstract: We present MILABOT: a deep reinforcement learning chatbot developed by the Montreal Institute for Learning Algorithms (MILA) for the Amazon Alexa Prize competition. MILABOT is capable of conversing with humans on popular small talk topics through both speech and text. The system consists of an ensemble of natural language generation and retrieval models, including template-based models, bag-of-wor…

    Submitted 5 November, 2017; v1 submitted 7 September, 2017; originally announced September 2017.

    Comments: 40 pages, 9 figures, 11 tables

    ACM Class: I.5.1; I.2.7

  5. arXiv:1609.05866  [pdf, other]

    cs.LG cs.IR cs.NE stat.ML

    A Cheap Linear Attention Mechanism with Fast Lookups and Fixed-Size Representations

    Authors: Alexandre de Brébisson, Pascal Vincent

    Abstract: The softmax content-based attention mechanism has proven to be very beneficial in many applications of recurrent neural networks. Nevertheless it suffers from two major computational limitations. First, its computations for an attention lookup scale linearly in the size of the attended sequence. Second, it does not encode the sequence into a fixed-size representation but instead requires to memori…

    Submitted 19 September, 2016; originally announced September 2016.

  6. arXiv:1606.08061  [pdf, other]

    cs.NE cs.LG

    Exact gradient updates in time independent of output size for the spherical loss family

    Authors: Pascal Vincent, Alexandre de Brébisson, Xavier Bouthillier

    Abstract: An important class of problems involves training deep neural networks with sparse prediction targets of very high dimension D. These occur naturally in e.g. neural language models or the learning of word-embeddings, often posed as predicting the probability of next words among a vocabulary of size D (e.g. 200,000). Computing the equally large, but typically non-sparse D-dimensional output vector f…

    Submitted 26 June, 2016; originally announced June 2016.

    Comments: Expanded journal version of our NIPS-2015 conference paper arXiv:1412.7091 with full algorithm generalized to the spherical family

  7. arXiv:1605.02688  [pdf, other]

    cs.SC cs.LG cs.MS

    Theano: A Python framework for fast computation of mathematical expressions

    Authors: The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano, et al. (88 additional authors not shown)

    Abstract: Theano is a Python library that allows to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements. Theano is being actively and continuously developed since 2008, mu…

    Submitted 9 May, 2016; originally announced May 2016.

    Comments: 19 pages, 5 figures

  8. arXiv:1604.08859  [pdf, other]

    cs.LG cs.AI stat.ML

    The Z-loss: a shift and scale invariant classification loss belonging to the Spherical Family

    Authors: Alexandre de Brébisson, Pascal Vincent

    Abstract: Despite being the standard loss function to train multi-class neural networks, the log-softmax has two potential limitations. First, it involves computations that scale linearly with the number of output classes, which can restrict the size of problems we are able to tackle with current hardware. Second, it remains unclear how close it matches the task loss such as the top-k error rate or other no…

    Submitted 27 May, 2016; v1 submitted 29 April, 2016; originally announced April 2016.

  9. arXiv:1511.05042  [pdf, ps, other]

    cs.NE cs.LG stat.ML

    An Exploration of Softmax Alternatives Belonging to the Spherical Loss Family

    Authors: Alexandre de Brébisson, Pascal Vincent

    Abstract: In a multi-class classification problem, it is standard to model the output of a neural network as a categorical distribution conditioned on the inputs. The output must therefore be positive and sum to one, which is traditionally enforced by a softmax. This probabilistic mapping allows to use the maximum likelihood principle, which leads to the well-known log-softmax loss. However the choice of th…

    Submitted 28 February, 2016; v1 submitted 16 November, 2015; originally announced November 2015.

    Comments: Published at ICLR 2016

  10. arXiv:1508.00021  [pdf, other]

    cs.LG cs.NE

    Artificial Neural Networks Applied to Taxi Destination Prediction

    Authors: Alexandre de Brébisson, Étienne Simon, Alex Auvolat, Pascal Vincent, Yoshua Bengio

    Abstract: We describe our first-place solution to the ECML/PKDD discovery challenge on taxi destination prediction. The task consisted in predicting the destination of a taxi based on the beginning of its trajectory, represented as a variable-length sequence of GPS points, and diverse associated meta-information, such as the departure time, the driver id and client information. Contrary to most published co…

    Submitted 21 September, 2015; v1 submitted 31 July, 2015; originally announced August 2015.

    Comments: ECML/PKDD discovery challenge

  11. arXiv:1502.02445  [pdf, other]

    cs.CV cs.LG stat.AP stat.ML

    Deep Neural Networks for Anatomical Brain Segmentation

    Authors: Alexandre de Brebisson, Giovanni Montana

    Abstract: We present a novel approach to automatically segment magnetic resonance (MR) images of the human brain into anatomical regions. Our methodology is based on a deep artificial neural network that assigns each voxel in an MR image of the brain to its corresponding anatomical region. The inputs of the network capture information at different scales around the voxel of interest: 3D and orthogonal 2D in…

    Submitted 25 June, 2015; v1 submitted 9 February, 2015; originally announced February 2015.

  12. arXiv:1412.7091  [pdf, other]

    cs.NE cs.CL cs.LG

    Efficient Exact Gradient Update for training Deep Networks with Very Large Sparse Targets

    Authors: Pascal Vincent, Alexandre de Brébisson, Xavier Bouthillier

    Abstract: An important class of problems involves training deep neural networks with sparse prediction targets of very high dimension D. These occur naturally in e.g. neural language models or the learning of word-embeddings, often posed as predicting the probability of next words among a vocabulary of size D (e.g. 200 000). Computing the equally large, but typically non-sparse D-dimensional output vector f…

    Submitted 13 July, 2015; v1 submitted 22 December, 2014; originally announced December 2014.

    Comments: 15 pages technical report version