Showing 1–44 of 44 results for author: Hjelm, D

Searching in archive cs.
  1. arXiv:2407.02917  [pdf, ps, other]

    cs.CL cs.AI

    Towards Negotiative Dialogue for the Talkamatic Dialogue Manager

    Authors: Staffan Larsson, Alexander Berman, David Hjelm

    Abstract: The paper describes a number of dialogue phenomena associated with negotiative dialogue, as implemented in a development version of the Talkamatic Dialogue Manager (TDM). This implementation is an initial step towards full coverage of general features of negotiative dialogue in TDM.

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2406.07904  [pdf, other]

    cs.LG

    Grounding Multimodal Large Language Models in Actions

    Authors: Andrew Szot, Bogdan Mazoure, Harsh Agrawal, Devon Hjelm, Zsolt Kira, Alexander Toshev

    Abstract: Multimodal Large Language Models (MLLMs) have demonstrated a wide range of capabilities across many domains, including Embodied AI. In this work, we study how to best ground an MLLM into different embodiments and their associated action spaces, with the goal of leveraging the multimodal world knowledge of the MLLM. We first generalize a number of methods through a unified architecture and the lens…

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2403.05490  [pdf, other]

    cs.LG cs.AI cs.CV cs.IT stat.ML

    Poly-View Contrastive Learning

    Authors: Amitis Shidani, Devon Hjelm, Jason Ramapuram, Russ Webb, Eeshan Gunesh Dhekane, Dan Busbridge

    Abstract: Contrastive learning typically matches pairs of related views among a number of unrelated negative views. Views can be generated (e.g. by augmentations) or be observed. We investigate matching when there are more than two related views which we call poly-view tasks, and derive new representation learning objectives using information maximization and sufficient statistics. We show that with unlimit…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted to ICLR 2024. 42 pages, 7 figures, 3 tables, loss pseudo-code included in appendix

  4. arXiv:2310.17722  [pdf, other]

    cs.LG cs.AI cs.CL

    Large Language Models as Generalizable Policies for Embodied Tasks

    Authors: Andrew Szot, Max Schwarzer, Harsh Agrawal, Bogdan Mazoure, Walter Talbott, Katherine Metcalf, Natalie Mackraz, Devon Hjelm, Alexander Toshev

    Abstract: We show that large language models (LLMs) can be adapted to be generalizable policies for embodied visual tasks. Our approach, called Large LAnguage model Reinforcement Learning Policy (LLaRP), adapts a pre-trained frozen LLM to take as input text instructions and visual egocentric observations and output actions directly in the environment. Using reinforcement learning, we train LLaRP to see and…

    Submitted 16 April, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

  5. arXiv:2306.07290  [pdf, other]

    cs.LG cs.AI

    Value function estimation using conditional diffusion models for control

    Authors: Bogdan Mazoure, Walter Talbott, Miguel Angel Bautista, Devon Hjelm, Alexander Toshev, Josh Susskind

    Abstract: A fairly reliable trend in deep reinforcement learning is that the performance scales with the number of parameters, provided a complementary scaling in amount of training data. As the appetite for large models increases, it is imperative to address, sooner rather than later, the potential problem of running out of high-quality demonstrations. In this case, instead of collecting only new data via costly…

    Submitted 9 June, 2023; originally announced June 2023.

  6. arXiv:2211.14449  [pdf, other]

    cs.CV cs.AI cs.LG

    PatchBlender: A Motion Prior for Video Transformers

    Authors: Gabriele Prato, Yale Song, Janarthanan Rajendran, R Devon Hjelm, Neel Joshi, Sarath Chandar

    Abstract: Transformers have become one of the dominant architectures in the field of computer vision. However, there are yet several challenges when applying such architectures to video data. Most notably, these models struggle to model the temporal patterns of video data effectively. Directly targeting this issue, we introduce PatchBlender, a learnable blending function that operates over patch embeddings…

    Submitted 10 February, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

  7. arXiv:2209.02876  [pdf, other]

    cs.LG eess.IV q-bio.NC

    Self-supervised multimodal neuroimaging yields predictive representations for a spectrum of Alzheimer's phenotypes

    Authors: Alex Fedorov, Eloy Geenjaar, Lei Wu, Tristan Sylvain, Thomas P. DeRamus, Margaux Luck, Maria Misiura, R Devon Hjelm, Sergey M. Plis, Vince D. Calhoun

    Abstract: Recent neuroimaging studies that focus on predicting brain disorders via modern machine learning approaches commonly include a single modality and rely on supervised over-parameterized models. However, a single modality provides only a limited view of the highly complex brain. Critically, supervised models in clinical settings lack accurate diagnostic labels for training. Coarse labels do not captu…

    Submitted 6 September, 2022; originally announced September 2022.

  8. arXiv:2203.10351  [pdf, other]

    cs.LG

    The Sandbox Environment for Generalizable Agent Research (SEGAR)

    Authors: R Devon Hjelm, Bogdan Mazoure, Florian Golemo, Felipe Frujeri, Mihai Jalobeanu, Andrey Kolobov

    Abstract: A broad challenge of research on generalization for sequential decision-making tasks in interactive environments is designing benchmarks that clearly landmark progress. While there has been notable headway, current benchmarks either do not provide suitable exposure nor intuitive control of the underlying factors, are not easy-to-implement, customizable, or extensible, or are computationally expens…

    Submitted 19 March, 2022; originally announced March 2022.

  9. arXiv:2201.04309  [pdf, other]

    cs.CV cs.LG

    Robust Contrastive Learning against Noisy Views

    Authors: Ching-Yao Chuang, R Devon Hjelm, Xin Wang, Vibhav Vineet, Neel Joshi, Antonio Torralba, Stefanie Jegelka, Yale Song

    Abstract: Contrastive learning relies on an assumption that positive pairs contain related views, e.g., patches of an image or co-occurring multimodal signals of a video, that share certain underlying information about an instance. But what if this assumption is violated? The literature suggests that contrastive learning produces suboptimal representations in the presence of noisy views, e.g., false positiv…

    Submitted 12 January, 2022; originally announced January 2022.

  10. arXiv:2106.08365  [pdf, other]

    cs.LG cs.AI stat.ML

    Test Sample Accuracy Scales with Training Sample Density in Neural Networks

    Authors: Xu Ji, Razvan Pascanu, Devon Hjelm, Balaji Lakshminarayanan, Andrea Vedaldi

    Abstract: Intuitively, one would expect accuracy of a trained neural network's prediction on test samples to correlate with how densely the samples are surrounded by seen training samples in representation space. We find that a bound on empirical training error smoothed across linear activation regions scales inversely with training sample density in representation space. Empirically, we verify this bound i…

    Submitted 28 July, 2022; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: CoLLAs 2022 oral

  11. arXiv:2106.04799  [pdf, other]

    cs.LG

    Pretraining Representations for Data-Efficient Reinforcement Learning

    Authors: Max Schwarzer, Nitarshan Rajkumar, Michael Noukhovitch, Ankesh Anand, Laurent Charlin, Devon Hjelm, Philip Bachman, Aaron Courville

    Abstract: Data efficiency is a key challenge for deep reinforcement learning. We address this problem by using unlabeled data to pretrain an encoder which is then finetuned on a small amount of task-specific data. To encourage learning representations which capture diverse aspects of the underlying MDP, we employ a combination of latent dynamics modelling and unsupervised goal-conditioned RL. When limited t…

    Submitted 9 June, 2021; originally announced June 2021.

  12. arXiv:2106.02193  [pdf, other]

    cs.LG cs.AI

    Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL

    Authors: Bogdan Mazoure, Ahmed M. Ahmed, Patrick MacAlpine, R Devon Hjelm, Andrey Kolobov

    Abstract: A highly desirable property of a reinforcement learning (RL) agent -- and a major difficulty for deep RL approaches -- is the ability to generalize policies learned on a few tasks over a high-dimensional observation space to similar tasks not seen during training. Many promising approaches to this challenge consider RL as a process of training two functions simultaneously: a complex nonlinear enco…

    Submitted 16 March, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: ICLR 2022

  13. arXiv:2105.03519  [pdf, other]

    cs.CL

    Understanding by Understanding Not: Modeling Negation in Language Models

    Authors: Arian Hosseini, Siva Reddy, Dzmitry Bahdanau, R Devon Hjelm, Alessandro Sordoni, Aaron Courville

    Abstract: Negation is a core construction in natural language. Despite being very successful on many tasks, state-of-the-art pre-trained language models often handle negation incorrectly. To improve language models in this regard, we propose to augment the language modeling objective with an unlikelihood objective that is based on negated generic sentences from a raw text corpus. By training BERT with the r…

    Submitted 7 May, 2021; originally announced May 2021.

  14. arXiv:2010.13320  [pdf, other]

    cs.CV cs.LG

    Zero-Shot Learning from scratch (ZFS): leveraging local compositional representations

    Authors: Tristan Sylvain, Linda Petrini, R Devon Hjelm

    Abstract: Zero-shot classification is a generalization task where no instance from the target classes is seen during training. To allow for test-time transfer, each class is annotated with semantic information, commonly in the form of attributes or text descriptions. While classical zero-shot learning does not explicitly forbid using information from other datasets, the approaches that achieve the best abso…

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: ICML 2019 Workshop on Understanding and Improving Generalization in Deep Learning, Long Beach, California, 2019 Spotlight presentation. arXiv admin note: text overlap with arXiv:1912.12179

  15. arXiv:2010.10593  [pdf, other]

    cs.CV cs.AI

    Cross-Modal Information Maximization for Medical Imaging: CMIM

    Authors: Tristan Sylvain, Francis Dutil, Tess Berthier, Lisa Di Jorio, Margaux Luck, Devon Hjelm, Yoshua Bengio

    Abstract: In hospitals, data are siloed to specific information systems that make the same information available under different modalities such as the different medical imaging exams the patient undergoes (CT scans, MRI, PET, Ultrasound, etc.) and their associated radiology reports. This offers unique opportunities to obtain and use at train-time those multiple views of the same information that might not…

    Submitted 1 February, 2021; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: ICASSP 2021

  16. arXiv:2008.00938  [pdf, other]

    cs.LG stat.ML

    Implicit Regularization via Neural Feature Alignment

    Authors: Aristide Baratin, Thomas George, César Laurent, R Devon Hjelm, Guillaume Lajoie, Pascal Vincent, Simon Lacoste-Julien

    Abstract: We approach the problem of implicit regularization in deep learning from a geometrical viewpoint. We highlight a regularization effect induced by a dynamical alignment of the neural tangent features introduced by Jacot et al, along a small number of task-relevant directions. This can be interpreted as a combined mechanism of feature selection and compression. By extrapolating a new analysis of Rad…

    Submitted 16 March, 2021; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: AISTATS 2021

  17. arXiv:2007.13278  [pdf, other]

    cs.CV cs.LG

    Representation Learning with Video Deep InfoMax

    Authors: R Devon Hjelm, Philip Bachman

    Abstract: Self-supervised learning has made unsupervised pretraining relevant again for difficult computer vision tasks. The most effective self-supervised methods involve prediction tasks based on features extracted from diverse views of the data. DeepInfoMax (DIM) is a self-supervised method which leverages the internal structure of deep networks to construct such views, forming prediction tasks between l…

    Submitted 27 July, 2020; v1 submitted 26 July, 2020; originally announced July 2020.

  18. arXiv:2007.05929  [pdf, other]

    cs.LG stat.ML

    Data-Efficient Reinforcement Learning with Self-Predictive Representations

    Authors: Max Schwarzer, Ankesh Anand, Rishab Goel, R Devon Hjelm, Aaron Courville, Philip Bachman

    Abstract: While deep reinforcement learning excels at solving tasks where large amounts of data can be collected through virtually unlimited interaction with the environment, learning from limited interaction remains a key challenge. We posit that an agent can learn more efficiently if we augment reward maximization with self-supervised objectives based on structure in its visual input and sequential intera…

    Submitted 20 May, 2021; v1 submitted 12 July, 2020; originally announced July 2020.

    Comments: The first two authors contributed equally to this work. v4 includes new ablations and reformatting for ICLR camera ready

  19. arXiv:2006.07217  [pdf, other]

    cs.LG stat.ML

    Deep Reinforcement and InfoMax Learning

    Authors: Bogdan Mazoure, Remi Tachet des Combes, Thang Doan, Philip Bachman, R Devon Hjelm

    Abstract: We begin with the hypothesis that a model-free agent whose representations are predictive of properties of future states (beyond expected rewards) will be more capable of solving and adapting to new RL problems. To test that hypothesis, we introduce an objective based on Deep InfoMax (DIM) which trains the agent to predict the future by maximizing the mutual information between its internal repres…

    Submitted 16 November, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: NeurIPS 2020

  20. arXiv:2003.07449  [pdf, other]

    cs.CV cs.LG eess.IV

    Object-Centric Image Generation from Layouts

    Authors: Tristan Sylvain, Pengchuan Zhang, Yoshua Bengio, R Devon Hjelm, Shikhar Sharma

    Abstract: Despite recent impressive results on single-object and single-domain image generation, the generation of complex scenes with multiple objects remains challenging. In this paper, we start with the idea that a model must be able to understand individual objects and relationships between objects in order to generate complex scenes well. Our layout-to-image-generation method, which we call Object-Cent…

    Submitted 3 December, 2020; v1 submitted 16 March, 2020; originally announced March 2020.

    Comments: AAAI 2021

  21. arXiv:2002.09469  [pdf, other]

    cs.LG stat.ML

    An end-to-end approach for the verification problem: learning the right distance

    Authors: Joao Monteiro, Isabela Albuquerque, Jahangir Alam, R Devon Hjelm, Tiago Falk

    Abstract: In this contribution, we augment the metric learning setting by introducing a parametric pseudo-distance, trained jointly with the encoder. Several interpretations are thus drawn for the learned distance-like model's output. We first show it approximates a likelihood ratio which can be used for hypothesis tests, and that it further induces a large divergence across the joint distributions of pairs…

    Submitted 14 August, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: ICML 2020 final camera ready. Code is available at: https://github.com/joaomonteirof/e2e_verification

  22. arXiv:1912.12179  [pdf, other]

    cs.CV cs.LG

    Locality and compositionality in zero-shot learning

    Authors: Tristan Sylvain, Linda Petrini, Devon Hjelm

    Abstract: In this work we study locality and compositionality in the context of learning representations for Zero Shot Learning (ZSL). In order to well-isolate the importance of these properties in learned representations, we impose the additional constraint that, differently from most recent work in ZSL, no pre-training on different datasets (e.g. ImageNet) is performed. The results of our experiments show…

    Submitted 20 December, 2019; originally announced December 2019.

    Comments: Published at ICLR 2020

  23. arXiv:1909.07543  [pdf, other]

    cs.LG cs.AI cs.MA stat.ML

    Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning

    Authors: Thang Doan, Bogdan Mazoure, Moloud Abdar, Audrey Durand, Joelle Pineau, R Devon Hjelm

    Abstract: Continuous control tasks in reinforcement learning are important because they provide an important framework for learning in high-dimensional state spaces with deceptive rewards, where the agent can easily become trapped into suboptimal solutions. One way to avoid local optima is to use a population of agents to ensure coverage of the policy space, yet learning a population with the "best" coverag…

    Submitted 9 July, 2020; v1 submitted 16 September, 2019; originally announced September 2019.

  24. arXiv:1906.08226  [pdf, other]

    cs.LG stat.ML

    Unsupervised State Representation Learning in Atari

    Authors: Ankesh Anand, Evan Racah, Sherjil Ozair, Yoshua Bengio, Marc-Alexandre Côté, R Devon Hjelm

    Abstract: State representation learning, or the ability to capture latent generative factors of an environment, is crucial for building intelligent agents that can perform a wide variety of tasks. Learning such representations without supervision from rewards is a challenging open problem. We introduce a method that learns state representations by maximizing mutual information across spatially and temporall…

    Submitted 5 November, 2020; v1 submitted 19 June, 2019; originally announced June 2019.

    Comments: NeurIPS 2019; v6 fixes a broken figure reference

  25. arXiv:1906.00910  [pdf, other]

    cs.LG stat.ML

    Learning Representations by Maximizing Mutual Information Across Views

    Authors: Philip Bachman, R Devon Hjelm, William Buchwalter

    Abstract: We propose an approach to self-supervised representation learning based on maximizing mutual information between features extracted from multiple views of a shared context. For example, one could produce multiple views of a local spatio-temporal context by observing it from different locations (e.g., camera positions within a scene), and via different modalities (e.g., tactile, auditory, or visual…

    Submitted 8 July, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

  26. arXiv:1905.12760  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Batch weight for domain adaptation with mass shift

    Authors: Mikołaj Bińkowski, R Devon Hjelm, Aaron Courville

    Abstract: Unsupervised domain transfer is the task of transferring or translating samples from a source distribution to a different target distribution. Current solutions to unsupervised domain transfer often operate on data on which the modes of the distribution are well-matched, for instance have the same frequencies of classes between source and target distributions. However, these models do not perform wel…

    Submitted 29 May, 2019; originally announced May 2019.

  27. arXiv:1905.06893  [pdf, other]

    cs.LG stat.ML

    Leveraging exploration in off-policy algorithms via normalizing flows

    Authors: Bogdan Mazoure, Thang Doan, Audrey Durand, R Devon Hjelm, Joelle Pineau

    Abstract: The ability to discover approximately optimal policies in domains with sparse rewards is crucial to applying reinforcement learning (RL) in many real-world scenarios. Approaches such as neural density models and continuous exploration (e.g., Go-Explore) have been proposed to maintain the high exploration rate necessary to find high performing and generalizable policies. Soft actor-critic (SAC) is a…

    Submitted 24 September, 2019; v1 submitted 16 May, 2019; originally announced May 2019.

    Comments: Accepted to 3rd Conference on Robot Learning (CoRL 2019); Keywords: Exploration, soft actor-critic, normalizing flow, off-policy; maximum entropy, reinforcement learning; deceptive reward; sparse reward; inverse autoregressive flow

  28. arXiv:1904.10931  [pdf, ps, other]

    cs.LG cs.NE stat.ML

    Prediction of Progression to Alzheimer's disease with Deep InfoMax

    Authors: Alex Fedorov, R Devon Hjelm, Anees Abrol, Zening Fu, Yuhui Du, Sergey Plis, Vince D. Calhoun

    Abstract: Arguably, unsupervised learning plays a crucial role in the majority of algorithms for processing brain imaging. A recently introduced unsupervised approach Deep InfoMax (DIM) is a promising tool for exploring brain structure in a flexible non-linear way. In this paper, we investigate the use of variants of DIM in a setting of progression to Alzheimer's disease in comparison with supervised AlexNe…

    Submitted 30 April, 2019; v1 submitted 24 April, 2019; originally announced April 2019.

    Comments: Accepted to 2019 IEEE Biomedical and Health Informatics (BHI) as a conference paper

  29. arXiv:1904.06316  [pdf, other]

    cs.LG stat.ML

    Spatio-Temporal Deep Graph Infomax

    Authors: Felix L. Opolka, Aaron Solomon, Cătălina Cangea, Petar Veličković, Pietro Liò, R Devon Hjelm

    Abstract: Spatio-temporal graphs such as traffic networks or gene regulatory systems present challenges for the existing deep learning methods due to the complexity of structural changes over time. To address these issues, we introduce Spatio-Temporal Deep Graph Infomax (STDGI)---a fully unsupervised node representation learning approach based on mutual information maximization that exploits both the tempor…

    Submitted 12 April, 2019; originally announced April 2019.

    Comments: 6 pages, 2 figures, Representation Learning on Graphs and Manifolds Workshop of the International Conference on Learning Representations (ICLR)

  30. arXiv:1903.02709  [pdf, other]

    stat.ML cs.LG

    On Adversarial Mixup Resynthesis

    Authors: Christopher Beckham, Sina Honari, Vikas Verma, Alex Lamb, Farnoosh Ghadiri, R Devon Hjelm, Yoshua Bengio, Christopher Pal

    Abstract: In this paper, we explore new approaches to combining information encoded within the learned representations of auto-encoders. We explore models that are capable of combining the attributes of multiple inputs such that a resynthesised output is trained to fool an adversarial discriminator for real versus synthesised data. Furthermore, we explore the use of such an architecture in the context of se…

    Submitted 23 October, 2019; v1 submitted 6 March, 2019; originally announced March 2019.

    Comments: 'Camera-ready draft'

  31. arXiv:1811.09845  [pdf, other]

    cs.CV

    Tell, Draw, and Repeat: Generating and Modifying Images Based on Continual Linguistic Instruction

    Authors: Alaaeldin El-Nouby, Shikhar Sharma, Hannes Schulz, Devon Hjelm, Layla El Asri, Samira Ebrahimi Kahou, Yoshua Bengio, Graham W. Taylor

    Abstract: Conditional text-to-image generation is an active area of research, with many possible applications. Existing research has primarily focused on generating a single image from available conditioning information in one step. One practical extension beyond one-step generation is a system that generates an image iteratively, conditioned on ongoing linguistic input or feedback. This is significantly mo…

    Submitted 23 September, 2019; v1 submitted 24 November, 2018; originally announced November 2018.

    Comments: Accepted at ICCV 2019

    Journal ref: Proceedings of the 2019 IEEE International Conference on Computer Vision (ICCV)

  32. arXiv:1809.10341  [pdf, other]

    stat.ML cs.IT cs.LG cs.SI

    Deep Graph Infomax

    Authors: Petar Veličković, William Fedus, William L. Hamilton, Pietro Liò, Yoshua Bengio, R Devon Hjelm

    Abstract: We present Deep Graph Infomax (DGI), a general approach for learning node representations within graph-structured data in an unsupervised manner. DGI relies on maximizing mutual information between patch representations and corresponding high-level summaries of graphs---both derived using established graph convolutional network architectures. The learnt patch representations summarize subgraphs ce…

    Submitted 21 December, 2018; v1 submitted 27 September, 2018; originally announced September 2018.

    Comments: To appear at ICLR 2019. 17 pages, 8 figures

  33. arXiv:1808.06670  [pdf, other]

    stat.ML cs.LG

    Learning deep representations by mutual information estimation and maximization

    Authors: R Devon Hjelm, Alex Fedorov, Samuel Lavoie-Marchildon, Karan Grewal, Phil Bachman, Adam Trischler, Yoshua Bengio

    Abstract: In this work, we perform unsupervised learning of representations by maximizing mutual information between an input and the output of a deep neural network encoder. Importantly, we show that structure matters: incorporating knowledge about locality of the input to the objective can greatly influence a representation's suitability for downstream tasks. We further control characteristics of the repr…

    Submitted 22 February, 2019; v1 submitted 20 August, 2018; originally announced August 2018.

    Comments: Accepted as an oral presentation at the International Conference for Learning Representations (ICLR), 2019

  34. arXiv:1808.00020  [pdf, other]

    cs.LG stat.ML

    On-line Adaptative Curriculum Learning for GANs

    Authors: Thang Doan, Joao Monteiro, Isabela Albuquerque, Bogdan Mazoure, Audrey Durand, Joelle Pineau, R Devon Hjelm

    Abstract: Generative Adversarial Networks (GANs) can successfully approximate a probability distribution and produce realistic samples. However, open questions such as sufficient convergence conditions and mode collapse still persist. In this paper, we build on existing work in the area by proposing a novel framework for training the generator against an ensemble of discriminator networks, which can be seen…

    Submitted 11 March, 2019; v1 submitted 31 July, 2018; originally announced August 2018.

    Comments: Accepted to the Thirty-Third AAAI Conference On Artificial Intelligence, 2019 (Added 128x128 CelebA samples to the end of the appendix)

    Journal ref: Proceedings of 33rd AAAI Conference on Artificial Intelligence (AAAI 2019)

  35. arXiv:1801.04062  [pdf, other]

    cs.LG stat.ML

    MINE: Mutual Information Neural Estimation

    Authors: Mohamed Ishmael Belghazi, Aristide Baratin, Sai Rajeswar, Sherjil Ozair, Yoshua Bengio, Aaron Courville, R Devon Hjelm

    Abstract: We argue that the estimation of mutual information between high dimensional continuous random variables can be achieved by gradient descent over neural networks. We present a Mutual Information Neural Estimator (MINE) that is linearly scalable in dimensionality as well as in sample size, trainable through back-prop, and strongly consistent. We present a handful of applications on which MINE can be…

    Submitted 14 August, 2021; v1 submitted 12 January, 2018; originally announced January 2018.

    Comments: 19 pages, 6 figures

    Journal ref: ICML 2018

  36. arXiv:1712.04120  [pdf, other]

    stat.ML cs.LG

    GibbsNet: Iterative Adversarial Inference for Deep Graphical Models

    Authors: Alex Lamb, Devon Hjelm, Yaroslav Ganin, Joseph Paul Cohen, Aaron Courville, Yoshua Bengio

    Abstract: Directed latent variable models that formulate the joint distribution as $p(x,z) = p(z) p(x \mid z)$ have the advantage of fast and exact sampling. However, these models have the weakness of needing to specify $p(z)$, often with a simple fixed prior that limits the expressiveness of the model. Undirected latent variable models discard the requirement that $p(z)$ be specified with a prior, yet samp…

    Submitted 11 December, 2017; originally announced December 2017.

    Comments: NIPS 2017

  37. arXiv:1711.04755  [pdf, other]

    stat.ML cs.LG

    ACtuAL: Actor-Critic Under Adversarial Learning

    Authors: Anirudh Goyal, Nan Rosemary Ke, Alex Lamb, R Devon Hjelm, Chris Pal, Joelle Pineau, Yoshua Bengio

    Abstract: Generative Adversarial Networks (GANs) are a powerful framework for deep generative modeling. Posed as a two-player minimax problem, GANs are typically trained end-to-end on real-valued data and can be used to train a generator of high-dimensional and realistic images. However, a major limitation of GANs is that training relies on passing gradients from the discriminator through the generator via…

    Submitted 13 November, 2017; originally announced November 2017.

  38. arXiv:1707.00309   

    stat.ML cs.LG

    Variance Regularizing Adversarial Learning

    Authors: Karan Grewal, R Devon Hjelm, Yoshua Bengio

    Abstract: We introduce a novel approach for training adversarial models by replacing the discriminator score with a bi-modal Gaussian distribution over the real/fake indicator variables. In order to do this, we train the Gaussian classifier to match the target bi-modal distribution implicitly through meta-adversarial training. We hypothesize that this approach ensures a non-zero gradient to the generator, e…

    Submitted 19 August, 2018; v1 submitted 2 July, 2017; originally announced July 2017.

    Comments: Method is out of date and some results are incorrect

  39. arXiv:1702.08431  [pdf, other]

    stat.ML cs.LG

    Boundary-Seeking Generative Adversarial Networks

    Authors: R Devon Hjelm, Athul Paul Jacob, Tong Che, Adam Trischler, Kyunghyun Cho, Yoshua Bengio

    Abstract: Generative adversarial networks (GANs) are a learning framework that relies on training a discriminator to estimate a measure of difference between a target and generated distributions. GANs, as normally formulated, rely on the generated samples being completely differentiable w.r.t. the generative parameters, and thus do not work for discrete data. We introduce a method for training GANs with discr…

    Submitted 21 February, 2018; v1 submitted 27 February, 2017; originally announced February 2017.

  40. arXiv:1702.07983  [pdf, other]

    cs.AI cs.CL cs.LG

    Maximum-Likelihood Augmented Discrete Generative Adversarial Networks

    Authors: Tong Che, Yanran Li, Ruixiang Zhang, R Devon Hjelm, Wenjie Li, Yangqiu Song, Yoshua Bengio

    Abstract: Despite the successes in capturing continuous distributions, the application of generative adversarial networks (GANs) to discrete settings, like natural language tasks, is rather restricted. The fundamental reason is the difficulty of back-propagation through discrete random variables combined with the inherent instability of the GAN training objective. To address these problems, we propose Maxim…

    Submitted 25 February, 2017; originally announced February 2017.

    Comments: 11 pages, 3 figures

  41. arXiv:1611.00864  [pdf, other]

    cs.NE q-bio.NC

    Spatio-temporal Dynamics of Intrinsic Networks in Functional Magnetic Imaging Data Using Recurrent Neural Networks

    Authors: R Devon Hjelm, Eswar Damaraju, Kyunghyun Cho, Helmut Laufs, Sergey M. Plis, Vince Calhoun

    Abstract: We introduce a novel recurrent neural network (RNN) approach to account for temporal dynamics and dependencies in brain networks observed via functional magnetic resonance imaging (fMRI). Our approach directly parameterizes temporal dynamics through recurrent connections, which can be used to formulate blind source separation with a conditional (rather than marginal) independence assumption, which…

    Submitted 27 August, 2018; v1 submitted 2 November, 2016; originally announced November 2016.

    Comments: Accepted to Frontiers of Neuroscience

  42. arXiv:1603.06624  [pdf, other]

    cs.LG cs.NE stat.ML

    Variational Autoencoders for Feature Detection of Magnetic Resonance Imaging Data

    Authors: R. Devon Hjelm, Sergey M. Plis, Vince C. Calhoun

    Abstract: Independent component analysis (ICA), as an approach to the blind source-separation (BSS) problem, has become the de-facto standard in many medical imaging settings. Despite successes and a large ongoing research effort, the limitation of ICA to square linear transformations has not been overcome, so that general INFOMAX is still far from being realized. As an alternative, we present feature anal…

    Submitted 21 March, 2016; originally announced March 2016.

  43. arXiv:1511.06382  [pdf, other]

    cs.LG stat.ML

    Iterative Refinement of the Approximate Posterior for Directed Belief Networks

    Authors: R Devon Hjelm, Kyunghyun Cho, Junyoung Chung, Russ Salakhutdinov, Vince Calhoun, Nebojsa Jojic

    Abstract: Variational methods that rely on a recognition network to approximate the posterior of directed graphical models offer better inference and learning than previous methods. Recent advances that exploit the capacity and flexibility in this approach have expanded what kinds of models can be trained. However, as a proposal for the posterior, the capacity of the recognition network is limited, which ca…

    Submitted 20 February, 2018; v1 submitted 19 November, 2015; originally announced November 2015.

  44. arXiv:1312.5847  [pdf, other]

    cs.NE cs.LG stat.ML

    Deep learning for neuroimaging: a validation study

    Authors: Sergey M. Plis, Devon R. Hjelm, Ruslan Salakhutdinov, Vince D. Calhoun

    Abstract: Deep learning methods have recently made notable advances in the tasks of classification and representation learning. These tasks are important for brain imaging and neuroscience discovery, making the methods attractive for porting to a neuroimager's toolbox. Success of these methods is, in part, explained by the flexibility of deep learning models. However, this flexibility makes the process of p…

    Submitted 19 February, 2014; v1 submitted 20 December, 2013; originally announced December 2013.

    Comments: ICLR 2014 revisions