Skip to main content

Showing 51–82 of 82 results for author: Larochelle, H

.
  1. arXiv:1605.07427  [pdf, other

    stat.ML cs.CL cs.LG cs.NE

    Hierarchical Memory Networks

    Authors: Sarath Chandar, Sung** Ahn, Hugo Larochelle, Pascal Vincent, Gerald Tesauro, Yoshua Bengio

    Abstract: Memory networks are neural networks with an explicit memory component that can be both read and written to by the network. The memory is often addressed in a soft way using a softmax function, making end-to-end training with backpropagation possible. However, this is not computationally scalable for applications which require the network to read from extremely large memories. On the other hand, it… ▽ More

    Submitted 24 May, 2016; originally announced May 2016.

    Comments: 10 pages

  2. arXiv:1605.03705  [pdf, other

    cs.CV cs.CL

    Movie Description

    Authors: Anna Rohrbach, Atousa Torabi, Marcus Rohrbach, Niket Tandon, Christopher Pal, Hugo Larochelle, Aaron Courville, Bernt Schiele

    Abstract: Audio Description (AD) provides linguistic descriptions of movies and allows visually impaired people to follow a movie along with their peers. Such descriptions are by design mainly visual and thus naturally form an interesting data source for computer vision and computational linguistics. In this work we propose a novel dataset which contains transcribed ADs, which are temporally aligned to full… ▽ More

    Submitted 12 May, 2016; originally announced May 2016.

  3. arXiv:1605.02226  [pdf, other

    cs.LG

    Neural Autoregressive Distribution Estimation

    Authors: Benigno Uria, Marc-Alexandre Côté, Karol Gregor, Iain Murray, Hugo Larochelle

    Abstract: We present Neural Autoregressive Distribution Estimation (NADE) models, which are neural network architectures applied to the problem of unsupervised distribution and density estimation. They leverage the probability product rule and a weight sharing scheme inspired from restricted Boltzmann machines, to yield an estimator that is both tractable and has good generalization performance. We discuss… ▽ More

    Submitted 27 May, 2016; v1 submitted 7 May, 2016; originally announced May 2016.

  4. arXiv:1603.08199  [pdf, other

    cs.CV

    Recurrent Mixture Density Network for Spatiotemporal Visual Attention

    Authors: Loris Bazzani, Hugo Larochelle, Lorenzo Torresani

    Abstract: In many computer vision tasks, the relevant information to solve the problem at hand is mixed to irrelevant, distracting information. This has motivated researchers to design attentional models that can dynamically focus on parts of images or videos that are salient, e.g., by down-weighting irrelevant pixels. In this work, we propose a spatiotemporal attentional model that learns where to look in… ▽ More

    Submitted 11 February, 2017; v1 submitted 27 March, 2016; originally announced March 2016.

    Comments: ICLR 2017

  5. arXiv:1603.05962  [pdf, other

    cs.LG cs.CL

    Document Neural Autoregressive Distribution Estimation

    Authors: Stanislas Lauly, Yin Zheng, Alexandre Allauzen, Hugo Larochelle

    Abstract: We present an approach based on feed-forward neural networks for learning the distribution of textual documents. This approach is inspired by the Neural Autoregressive Distribution Estimator(NADE) model, which has been shown to be a good estimator of the distribution of discrete-valued igh-dimensional vectors. In this paper, we present how NADE can successfully be adapted to the case of textual da… ▽ More

    Submitted 18 March, 2016; originally announced March 2016.

  6. arXiv:1512.09300  [pdf, other

    cs.LG cs.CV stat.ML

    Autoencoding beyond pixels using a learned similarity metric

    Authors: Anders Boesen Lindbo Larsen, Søren Kaae Sønderby, Hugo Larochelle, Ole Winther

    Abstract: We present an autoencoder that leverages learned representations to better measure similarities in data space. By combining a variational autoencoder with a generative adversarial network we can use learned feature representations in the GAN discriminator as basis for the VAE reconstruction objective. Thereby, we replace element-wise errors with feature-wise errors to better capture the data distr… ▽ More

    Submitted 10 February, 2016; v1 submitted 31 December, 2015; originally announced December 2015.

  7. arXiv:1511.07838  [pdf, other

    cs.LG cs.NE

    Dynamic Capacity Networks

    Authors: Amjad Almahairi, Nicolas Ballas, Tim Cooijmans, Yin Zheng, Hugo Larochelle, Aaron Courville

    Abstract: We introduce the Dynamic Capacity Network (DCN), a neural network that can adaptively assign its capacity across different portions of the input data. This is achieved by combining modules of two types: low-capacity sub-networks and high-capacity sub-networks. The low-capacity sub-networks are applied across most of the input, but also provide a guide to select a few portions of the input on which… ▽ More

    Submitted 22 May, 2016; v1 submitted 24 November, 2015; originally announced November 2015.

    Comments: ICML 2016

  8. Within-Brain Classification for Brain Tumor Segmentation

    Authors: Mohammad Havaei, Hugo Larochelle, Philippe Poulin, Pierre-Marc Jodoin

    Abstract: Purpose: In this paper, we investigate a framework for interactive brain tumor segmentation which, at its core, treats the problem of interactive brain tumor segmentation as a machine learning problem. Methods: This method has an advantage over typical machine learning methods for this task where generalization is made across brains. The problem with these methods is that they need to deal with… ▽ More

    Submitted 5 October, 2015; originally announced October 2015.

  9. arXiv:1507.05910  [pdf, other

    cs.LG cs.CL stat.ML

    Clustering is Efficient for Approximate Maximum Inner Product Search

    Authors: Alex Auvolat, Sarath Chandar, Pascal Vincent, Hugo Larochelle, Yoshua Bengio

    Abstract: Efficient Maximum Inner Product Search (MIPS) is an important task that has a wide applicability in recommendation systems and classification with a large number of classes. Solutions based on locality-sensitive hashing (LSH) as well as tree-based solutions have been investigated in the recent literature, to perform approximate MIPS in sublinear time. In this paper, we compare these to another ext… ▽ More

    Submitted 29 November, 2015; v1 submitted 21 July, 2015; originally announced July 2015.

    Comments: 10 pages, Under review at ICLR 2016

  10. arXiv:1505.07818  [pdf, other

    stat.ML cs.LG cs.NE

    Domain-Adversarial Training of Neural Networks

    Authors: Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, François Laviolette, Mario Marchand, Victor Lempitsky

    Abstract: We introduce a new representation learning approach for domain adaptation, in which data at training and test time come from similar but different distributions. Our approach is directly inspired by the theory on domain adaptation suggesting that, for effective domain transfer to be achieved, predictions must be made based on features that cannot discriminate between the training (source) and test… ▽ More

    Submitted 26 May, 2016; v1 submitted 28 May, 2015; originally announced May 2015.

    Comments: Published in JMLR: http://jmlr.org/papers/v17/15-239.html

    Journal ref: Journal of Machine Learning Research 2016, vol. 17, p. 1-35

  11. Brain Tumor Segmentation with Deep Neural Networks

    Authors: Mohammad Havaei, Axel Davy, David Warde-Farley, Antoine Biard, Aaron Courville, Yoshua Bengio, Chris Pal, Pierre-Marc Jodoin, Hugo Larochelle

    Abstract: In this paper, we present a fully automatic brain tumor segmentation method based on Deep Neural Networks (DNNs). The proposed networks are tailored to glioblastomas (both low and high grade) pictured in MR images. By their very nature, these tumors can appear anywhere in the brain and have almost any kind of shape, size, and contrast. These reasons motivate our exploration of a machine learning s… ▽ More

    Submitted 20 May, 2016; v1 submitted 13 May, 2015; originally announced May 2015.

  12. arXiv:1504.07225  [pdf, other

    cs.CL cs.LG cs.NE stat.ML

    Correlational Neural Networks

    Authors: Sarath Chandar, Mitesh M. Khapra, Hugo Larochelle, Balaraman Ravindran

    Abstract: Common Representation Learning (CRL), wherein different descriptions (or views) of the data are embedded in a common subspace, is receiving a lot of attention recently. Two popular paradigms here are Canonical Correlation Analysis (CCA) based approaches and Autoencoder (AE) based approaches. CCA based approaches learn a joint representation by maximizing correlation of the views when projected to… ▽ More

    Submitted 12 October, 2015; v1 submitted 27 April, 2015; originally announced April 2015.

    Comments: 27 pages. To Appear in Neural Computation

  13. arXiv:1503.01070  [pdf, other

    cs.CV cs.AI

    Using Descriptive Video Services to Create a Large Data Source for Video Annotation Research

    Authors: Atousa Torabi, Christopher Pal, Hugo Larochelle, Aaron Courville

    Abstract: In this work, we introduce a dataset of video annotated with high quality natural language phrases describing the visual content in a given segment of time. Our dataset is based on the Descriptive Video Service (DVS) that is now encoded on many digital media products such as DVDs. DVS is an audio narration describing the visual elements and actions in a movie for the visually impaired. It is tempo… ▽ More

    Submitted 3 March, 2015; originally announced March 2015.

    Comments: 7 pages

  14. arXiv:1502.08029  [pdf, other

    stat.ML cs.AI cs.CL cs.CV cs.LG

    Describing Videos by Exploiting Temporal Structure

    Authors: Li Yao, Atousa Torabi, Kyunghyun Cho, Nicolas Ballas, Christopher Pal, Hugo Larochelle, Aaron Courville

    Abstract: Recent progress in using recurrent neural networks (RNNs) for image description has motivated the exploration of their application for video description. However, while images are static, working with videos requires modeling their dynamic temporal structure and then properly integrating that information into a natural language description. In this context, we propose an approach that successfully… ▽ More

    Submitted 30 September, 2015; v1 submitted 27 February, 2015; originally announced February 2015.

    Comments: Accepted to ICCV15. This version comes with code release and supplementary material

  15. arXiv:1502.03509  [pdf, other

    cs.LG cs.NE stat.ML

    MADE: Masked Autoencoder for Distribution Estimation

    Authors: Mathieu Germain, Karol Gregor, Iain Murray, Hugo Larochelle

    Abstract: There has been a lot of recent interest in designing neural network models to estimate a distribution from a set of examples. We introduce a simple modification for autoencoder neural networks that yields powerful generative models. Our method masks the autoencoder's parameters to respect autoregressive constraints: each input is reconstructed only from previous inputs in a given ordering. Constra… ▽ More

    Submitted 5 June, 2015; v1 submitted 11 February, 2015; originally announced February 2015.

    Comments: 9 pages and 1 page of supplementary material. Updated to match published version

    Journal ref: Proceedings of the 32nd International Conference on Machine Learning, JMLR W&CP 37:881-889, 2015

  16. arXiv:1502.02476  [pdf, other

    cs.LG

    An Infinite Restricted Boltzmann Machine

    Authors: Marc-Alexandre Côté, Hugo Larochelle

    Abstract: We present a mathematical construction for the restricted Boltzmann machine (RBM) that doesn't require specifying the number of hidden units. In fact, the hidden layer size is adaptive and can grow during training. This is obtained by first extending the RBM to be sensitive to the ordering of its hidden units. Then, thanks to a carefully chosen definition of the energy function, we show that the l… ▽ More

    Submitted 18 March, 2016; v1 submitted 9 February, 2015; originally announced February 2015.

    Comments: 25 pages, 8 figures

  17. arXiv:1412.4446  [pdf, other

    stat.ML cs.LG cs.NE

    Domain-Adversarial Neural Networks

    Authors: Hana Ajakan, Pascal Germain, Hugo Larochelle, François Laviolette, Mario Marchand

    Abstract: We introduce a new representation learning algorithm suited to the context of domain adaptation, in which data at training and test time come from similar but different distributions. Our algorithm is directly inspired by theory on domain adaptation suggesting that, for effective domain transfer to be achieved, predictions must be made based on a data representation that cannot discriminate betwee… ▽ More

    Submitted 9 February, 2015; v1 submitted 14 December, 2014; originally announced December 2014.

    Comments: The first version of this paper was accepted at the "Second Workshop on Transfer and Multi-Task Learning: Theory meets Practice" (NIPS 2014, Montreal, Canada). See: https://sites.google.com/site/multitaskwsnips2014/

  18. arXiv:1409.3970  [pdf, other

    cs.CV cs.IR cs.LG cs.NE

    A Deep and Autoregressive Approach for Topic Modeling of Multimodal Data

    Authors: Yin Zheng, Yu-** Zhang, Hugo Larochelle

    Abstract: Topic modeling based on latent Dirichlet allocation (LDA) has been a framework of choice to deal with multimodal data, such as in image annotation tasks. Another popular approach to model the multimodal data is through deep neural networks, such as the deep Boltzmann machine (DBM). Recently, a new type of topic model called the Document Neural Autoregressive Distribution Estimator (DocNADE) was pr… ▽ More

    Submitted 31 December, 2015; v1 submitted 13 September, 2014; originally announced September 2014.

    Comments: 24 pages, 10 figures. A version has been accepted by TPAMI on Aug 4th, 2015. Add footnote about how to train the model in practice in Section 5.1. arXiv admin note: substantial text overlap with arXiv:1305.5306

  19. arXiv:1402.1454  [pdf, other

    cs.CL cs.LG stat.ML

    An Autoencoder Approach to Learning Bilingual Word Representations

    Authors: Sarath Chandar A P, Stanislas Lauly, Hugo Larochelle, Mitesh M. Khapra, Balaraman Ravindran, Vikas Raykar, Amrita Saha

    Abstract: Cross-language learning allows us to use training data from one language to build models for a different language. Many approaches to bilingual learning require that we have word-level alignment of sentences from parallel corpora. In this work we explore the use of autoencoder-based methods for cross-language learning of vectorial word representations that are aligned between two languages, while… ▽ More

    Submitted 6 February, 2014; originally announced February 2014.

    Comments: 10 pages

  20. arXiv:1402.0796  [pdf, other

    cs.LG stat.ML

    Sequential Model-Based Ensemble Optimization

    Authors: Alexandre Lacoste, Hugo Larochelle, François Laviolette, Mario Marchand

    Abstract: One of the most tedious tasks in the application of machine learning is model selection, i.e. hyperparameter selection. Fortunately, recent progress has been made in the automation of this process, through the use of sequential model-based optimization (SMBO) methods. This can be used to optimize a cross-validation performance of a learning algorithm over the value of its hyperparameters. However,… ▽ More

    Submitted 4 February, 2014; originally announced February 2014.

  21. arXiv:1401.1803  [pdf, other

    cs.CL cs.LG stat.ML

    Learning Multilingual Word Representations using a Bag-of-Words Autoencoder

    Authors: Stanislas Lauly, Alex Boulanger, Hugo Larochelle

    Abstract: Recent work on learning multilingual word representations usually relies on the use of word-level alignements (e.g. infered with the help of GIZA++) between translated sentences, in order to align the word embeddings in different languages. In this workshop paper, we investigate an autoencoder model for learning multilingual word representations that does without such word-level alignements. The a… ▽ More

    Submitted 8 January, 2014; originally announced January 2014.

    Comments: This workshop paper was accepted on Octoble 30 2013 at the NIPS 2013 workshop on deep learning (https://sites.google.com/site/deeplearningworkshopnips2013/accepted-papers)

  22. arXiv:1310.1757  [pdf, other

    stat.ML cs.LG

    A Deep and Tractable Density Estimator

    Authors: Benigno Uria, Iain Murray, Hugo Larochelle

    Abstract: The Neural Autoregressive Distribution Estimator (NADE) and its real-valued version RNADE are competitive density models of multidimensional data across a variety of domains. These models use a fixed, arbitrary ordering of the data dimensions. One can easily condition on variables at the beginning of the ordering, and marginalize out variables at the end of the ordering, however other inference ta… ▽ More

    Submitted 11 January, 2014; v1 submitted 7 October, 2013; originally announced October 2013.

    Comments: 9 pages, 4 tables, 1 algorithm, 5 figures. To appear ICML 2014, JMLR W&CP volume 32

  23. arXiv:1306.0186  [pdf, other

    stat.ML cs.LG

    RNADE: The real-valued neural autoregressive density-estimator

    Authors: Benigno Uria, Iain Murray, Hugo Larochelle

    Abstract: We introduce RNADE, a new model for joint density estimation of real-valued vectors. Our model calculates the density of a datapoint as the product of one-dimensional conditionals modeled using mixture density networks with shared parameters. RNADE learns a distributed representation of the data, while having a tractable expression for the calculation of densities. A tractable likelihood allows di… ▽ More

    Submitted 9 January, 2014; v1 submitted 2 June, 2013; originally announced June 2013.

    Comments: 12 pages, 3 figures, 3 tables, 2 algorithms. Merges the published paper and supplementary material into one document

    Journal ref: Advances in Neural Information Processing Systems 26:2175-2183, 2013

  24. arXiv:1305.5306  [pdf, other

    cs.CV cs.LG stat.ML

    A Supervised Neural Autoregressive Topic Model for Simultaneous Image Classification and Annotation

    Authors: Yin Zheng, Yu-** Zhang, Hugo Larochelle

    Abstract: Topic modeling based on latent Dirichlet allocation (LDA) has been a framework of choice to perform scene recognition and annotation. Recently, a new type of topic model called the Document Neural Autoregressive Distribution Estimator (DocNADE) was proposed and demonstrated state-of-the-art performance for document modeling. In this work, we show how to successfully apply and extend this model to… ▽ More

    Submitted 22 May, 2013; originally announced May 2013.

    Comments: 13 pages, 5 figures

  25. arXiv:1206.2944  [pdf, other

    stat.ML cs.LG

    Practical Bayesian Optimization of Machine Learning Algorithms

    Authors: Jasper Snoek, Hugo Larochelle, Ryan P. Adams

    Abstract: Machine learning algorithms frequently require careful tuning of model hyperparameters, regularization terms, and optimization parameters. Unfortunately, this tuning is often a "black art" that requires expert experience, unwritten rules of thumb, or sometimes brute-force search. Much more appealing is the idea of develo** automatic approaches which can optimize the performance of a given learni… ▽ More

    Submitted 29 August, 2012; v1 submitted 13 June, 2012; originally announced June 2012.

  26. arXiv:1202.5695  [pdf, other

    cs.LG stat.ML

    Training Restricted Boltzmann Machines on Word Observations

    Authors: George E. Dahl, Ryan P. Adams, Hugo Larochelle

    Abstract: The restricted Boltzmann machine (RBM) is a flexible tool for modeling complex data, however there have been significant computational difficulties in using RBMs to model high-dimensional multinomial observations. In natural language processing applications, words are naturally modeled by K-ary discrete distributions, where K is determined by the vocabulary size and can easily be in the hundreds o… ▽ More

    Submitted 5 July, 2012; v1 submitted 25 February, 2012; originally announced February 2012.

  27. arXiv:1202.3748  [pdf

    cs.LG stat.ML

    Conditional Restricted Boltzmann Machines for Structured Output Prediction

    Authors: Volodymyr Mnih, Hugo Larochelle, Geoffrey E. Hinton

    Abstract: Conditional Restricted Boltzmann Machines (CRBMs) are rich probabilistic models that have recently been applied to a wide range of problems, including collaborative filtering, classification, and modeling motion capture data. While much progress has been made in training non-conditional RBMs, these algorithms are not applicable to conditional models and there has been almost no work on training an… ▽ More

    Submitted 14 February, 2012; originally announced February 2012.

    Report number: UAI-P-2011-PG-514-522

  28. arXiv:1109.3737  [pdf, other

    cs.AI

    Learning where to Attend with Deep Architectures for Image Tracking

    Authors: Misha Denil, Loris Bazzani, Hugo Larochelle, Nando de Freitas

    Abstract: We discuss an attentional model for simultaneous object tracking and recognition that is driven by gaze data. Motivated by theories of perception, the model consists of two interacting pathways: identity and control, intended to mirror the what and where pathways in neuroscience models. The identity pathway models object appearance and performs classification using deep (factored)-Restricted Boltz… ▽ More

    Submitted 16 September, 2011; originally announced September 2011.

  29. arXiv:1107.1805  [pdf, other

    stat.ML cs.AI

    Loss-sensitive Training of Probabilistic Conditional Random Fields

    Authors: Maksims N. Volkovs, Hugo Larochelle, Richard S. Zemel

    Abstract: We consider the problem of training probabilistic conditional random fields (CRFs) in the context of a task where performance is measured using a specific loss function. While maximum likelihood is the most common approach to training CRFs, it ignores the inherent structure of the task's loss function. We describe alternatives to maximum likelihood which take that loss into account. These include… ▽ More

    Submitted 9 July, 2011; originally announced July 2011.

  30. arXiv:1103.4896  [pdf, other

    cs.LG stat.ML

    Classification of Sets using Restricted Boltzmann Machines

    Authors: Jérôme Louradour, Hugo Larochelle

    Abstract: We consider the problem of classification when inputs correspond to sets of vectors. This setting occurs in many problems such as the classification of pieces of mail containing several pages, of web sites with several sections or of images that have been pre-segmented into smaller regions. We propose generalizations of the restricted Boltzmann machine (RBM) that are appropriate in this context an… ▽ More

    Submitted 24 March, 2011; originally announced March 2011.

    Comments: 17 pages, 4 figures

  31. arXiv:1103.2832  [pdf, other

    cs.LG cs.IR cs.SD

    Autotagging music with conditional restricted Boltzmann machines

    Authors: Michael Mandel, Razvan Pascanu, Hugo Larochelle, Yoshua Bengio

    Abstract: This paper describes two applications of conditional restricted Boltzmann machines (CRBMs) to the task of autotagging music. The first consists of training a CRBM to predict tags that a user would apply to a clip of a song based on tags already applied by other users. By learning the relationships between tags, this model is able to pre-process training data to significantly improve the performanc… ▽ More

    Submitted 14 March, 2011; originally announced March 2011.

  32. arXiv:1102.1492  [pdf, other

    stat.ML

    On Nonparametric Guidance for Learning Autoencoder Representations

    Authors: Jasper Snoek, Ryan Prescott Adams, Hugo Larochelle

    Abstract: Unsupervised discovery of latent representations, in addition to being useful for density modeling, visualisation and exploratory data analysis, is also increasingly important for learning features relevant to discriminative tasks. Autoencoders, in particular, have proven to be an effective way to learn latent codes that reflect meaningful variations in data. A continuing challenge, however, is gu… ▽ More

    Submitted 25 October, 2011; v1 submitted 7 February, 2011; originally announced February 2011.

    Comments: 9 pages, 12 figures