Skip to main content

Showing 1–25 of 25 results for author: Hjelm, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2403.05490  [pdf, other

    cs.LG cs.AI cs.CV cs.IT stat.ML

    Poly-View Contrastive Learning

    Authors: Amitis Shidani, Devon Hjelm, Jason Ramapuram, Russ Webb, Eeshan Gunesh Dhekane, Dan Busbridge

    Abstract: Contrastive learning typically matches pairs of related views among a number of unrelated negative views. Views can be generated (e.g. by augmentations) or be observed. We investigate matching when there are more than two related views which we call poly-view tasks, and derive new representation learning objectives using information maximization and sufficient statistics. We show that with unlimit… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted to ICLR 2024. 42 pages, 7 figures, 3 tables, loss pseudo-code included in appendix

  2. arXiv:2106.08365  [pdf, other

    cs.LG cs.AI stat.ML

    Test Sample Accuracy Scales with Training Sample Density in Neural Networks

    Authors: Xu Ji, Razvan Pascanu, Devon Hjelm, Balaji Lakshminarayanan, Andrea Vedaldi

    Abstract: Intuitively, one would expect accuracy of a trained neural network's prediction on test samples to correlate with how densely the samples are surrounded by seen training samples in representation space. We find that a bound on empirical training error smoothed across linear activation regions scales inversely with training sample density in representation space. Empirically, we verify this bound i… ▽ More

    Submitted 28 July, 2022; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: CoLLAs 2022 oral

  3. arXiv:2008.00938  [pdf, other

    cs.LG stat.ML

    Implicit Regularization via Neural Feature Alignment

    Authors: Aristide Baratin, Thomas George, César Laurent, R Devon Hjelm, Guillaume Lajoie, Pascal Vincent, Simon Lacoste-Julien

    Abstract: We approach the problem of implicit regularization in deep learning from a geometrical viewpoint. We highlight a regularization effect induced by a dynamical alignment of the neural tangent features introduced by Jacot et al, along a small number of task-relevant directions. This can be interpreted as a combined mechanism of feature selection and compression. By extrapolating a new analysis of Rad… ▽ More

    Submitted 16 March, 2021; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: AISTATS 2021

  4. arXiv:2007.05929  [pdf, other

    cs.LG stat.ML

    Data-Efficient Reinforcement Learning with Self-Predictive Representations

    Authors: Max Schwarzer, Ankesh Anand, Rishab Goel, R Devon Hjelm, Aaron Courville, Philip Bachman

    Abstract: While deep reinforcement learning excels at solving tasks where large amounts of data can be collected through virtually unlimited interaction with the environment, learning from limited interaction remains a key challenge. We posit that an agent can learn more efficiently if we augment reward maximization with self-supervised objectives based on structure in its visual input and sequential intera… ▽ More

    Submitted 20 May, 2021; v1 submitted 12 July, 2020; originally announced July 2020.

    Comments: The first two authors contributed equally to this work. v4 includes new ablations and reformatting for ICLR camera ready

  5. arXiv:2006.07217  [pdf, other

    cs.LG stat.ML

    Deep Reinforcement and InfoMax Learning

    Authors: Bogdan Mazoure, Remi Tachet des Combes, Thang Doan, Philip Bachman, R Devon Hjelm

    Abstract: We begin with the hypothesis that a model-free agent whose representations are predictive of properties of future states (beyond expected rewards) will be more capable of solving and adapting to new RL problems. To test that hypothesis, we introduce an objective based on Deep InfoMax (DIM) which trains the agent to predict the future by maximizing the mutual information between its internal repres… ▽ More

    Submitted 16 November, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: NeurIPS 2020

  6. arXiv:2002.09469  [pdf, other

    cs.LG stat.ML

    An end-to-end approach for the verification problem: learning the right distance

    Authors: Joao Monteiro, Isabela Albuquerque, Jahangir Alam, R Devon Hjelm, Tiago Falk

    Abstract: In this contribution, we augment the metric learning setting by introducing a parametric pseudo-distance, trained jointly with the encoder. Several interpretations are thus drawn for the learned distance-like model's output. We first show it approximates a likelihood ratio which can be used for hypothesis tests, and that it further induces a large divergence across the joint distributions of pairs… ▽ More

    Submitted 14 August, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: ICML 2020 final camera ready. Code is available at: https://github.com/joaomonteirof/e2e_verification

  7. arXiv:1909.07543  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning

    Authors: Thang Doan, Bogdan Mazoure, Moloud Abdar, Audrey Durand, Joelle Pineau, R Devon Hjelm

    Abstract: Continuous control tasks in reinforcement learning are important because they provide an important framework for learning in high-dimensional state spaces with deceptive rewards, where the agent can easily become trapped into suboptimal solutions. One way to avoid local optima is to use a population of agents to ensure coverage of the policy space, yet learning a population with the "best" coverag… ▽ More

    Submitted 9 July, 2020; v1 submitted 16 September, 2019; originally announced September 2019.

  8. arXiv:1906.08226  [pdf, other

    cs.LG stat.ML

    Unsupervised State Representation Learning in Atari

    Authors: Ankesh Anand, Evan Racah, Sherjil Ozair, Yoshua Bengio, Marc-Alexandre Côté, R Devon Hjelm

    Abstract: State representation learning, or the ability to capture latent generative factors of an environment, is crucial for building intelligent agents that can perform a wide variety of tasks. Learning such representations without supervision from rewards is a challenging open problem. We introduce a method that learns state representations by maximizing mutual information across spatially and temporall… ▽ More

    Submitted 5 November, 2020; v1 submitted 19 June, 2019; originally announced June 2019.

    Comments: NeurIPS 2019; v6 fixes a broken figure reference

  9. arXiv:1906.00910  [pdf, other

    cs.LG stat.ML

    Learning Representations by Maximizing Mutual Information Across Views

    Authors: Philip Bachman, R Devon Hjelm, William Buchwalter

    Abstract: We propose an approach to self-supervised representation learning based on maximizing mutual information between features extracted from multiple views of a shared context. For example, one could produce multiple views of a local spatio-temporal context by observing it from different locations (e.g., camera positions within a scene), and via different modalities (e.g., tactile, auditory, or visual… ▽ More

    Submitted 8 July, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

  10. arXiv:1905.12760  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Batch weight for domain adaptation with mass shift

    Authors: Mikołaj Bińkowski, R Devon Hjelm, Aaron Courville

    Abstract: Unsupervised domain transfer is the task of transferring or translating samples from a source distribution to a different target distribution. Current solutions unsupervised domain transfer often operate on data on which the modes of the distribution are well-matched, for instance have the same frequencies of classes between source and target distributions. However, these models do not perform wel… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

  11. arXiv:1905.06893  [pdf, other

    cs.LG stat.ML

    Leveraging exploration in off-policy algorithms via normalizing flows

    Authors: Bogdan Mazoure, Thang Doan, Audrey Durand, R Devon Hjelm, Joelle Pineau

    Abstract: The ability to discover approximately optimal policies in domains with sparse rewards is crucial to applying reinforcement learning (RL) in many real-world scenarios. Approaches such as neural density models and continuous exploration (e.g., Go-Explore) have been proposed to maintain the high exploration rate necessary to find high performing and generalizable policies. Soft actor-critic(SAC) is a… ▽ More

    Submitted 24 September, 2019; v1 submitted 16 May, 2019; originally announced May 2019.

    Comments: Accepted to 3rd Conference on Robot Learning (CoRL 2019); Keywords: Exploration, soft actor-critic, normalizing flow, off-policy; maximum entropy, reinforcement learning; deceptive reward; sparse reward; inverse autoregressive flow

  12. arXiv:1904.10931  [pdf, ps, other

    cs.LG cs.NE stat.ML

    Prediction of Progression to Alzheimer's disease with Deep InfoMax

    Authors: Alex Fedorov, R Devon Hjelm, Anees Abrol, Zening Fu, Yuhui Du, Sergey Plis, Vince D. Calhoun

    Abstract: Arguably, unsupervised learning plays a crucial role in the majority of algorithms for processing brain imaging. A recently introduced unsupervised approach Deep InfoMax (DIM) is a promising tool for exploring brain structure in a flexible non-linear way. In this paper, we investigate the use of variants of DIM in a setting of progression to Alzheimer's disease in comparison with supervised AlexNe… ▽ More

    Submitted 30 April, 2019; v1 submitted 24 April, 2019; originally announced April 2019.

    Comments: Accepted to 2019 IEEE Biomedical and Health Informatics (BHI) as a conference paper

  13. arXiv:1904.06316  [pdf, other

    cs.LG stat.ML

    Spatio-Temporal Deep Graph Infomax

    Authors: Felix L. Opolka, Aaron Solomon, Cătălina Cangea, Petar Veličković, Pietro Liò, R Devon Hjelm

    Abstract: Spatio-temporal graphs such as traffic networks or gene regulatory systems present challenges for the existing deep learning methods due to the complexity of structural changes over time. To address these issues, we introduce Spatio-Temporal Deep Graph Infomax (STDGI)---a fully unsupervised node representation learning approach based on mutual information maximization that exploits both the tempor… ▽ More

    Submitted 12 April, 2019; originally announced April 2019.

    Comments: 6 pages, 2 figures, Representation Learning on Graphs and Manifolds Workshop of the International Conference on Learning Representations (ICLR)

  14. arXiv:1903.02709  [pdf, other

    stat.ML cs.LG

    On Adversarial Mixup Resynthesis

    Authors: Christopher Beckham, Sina Honari, Vikas Verma, Alex Lamb, Farnoosh Ghadiri, R Devon Hjelm, Yoshua Bengio, Christopher Pal

    Abstract: In this paper, we explore new approaches to combining information encoded within the learned representations of auto-encoders. We explore models that are capable of combining the attributes of multiple inputs such that a resynthesised output is trained to fool an adversarial discriminator for real versus synthesised data. Furthermore, we explore the use of such an architecture in the context of se… ▽ More

    Submitted 23 October, 2019; v1 submitted 6 March, 2019; originally announced March 2019.

    Comments: 'Camera-ready draft'

  15. arXiv:1809.10341  [pdf, other

    stat.ML cs.IT cs.LG cs.SI

    Deep Graph Infomax

    Authors: Petar Veličković, William Fedus, William L. Hamilton, Pietro Liò, Yoshua Bengio, R Devon Hjelm

    Abstract: We present Deep Graph Infomax (DGI), a general approach for learning node representations within graph-structured data in an unsupervised manner. DGI relies on maximizing mutual information between patch representations and corresponding high-level summaries of graphs---both derived using established graph convolutional network architectures. The learnt patch representations summarize subgraphs ce… ▽ More

    Submitted 21 December, 2018; v1 submitted 27 September, 2018; originally announced September 2018.

    Comments: To appear at ICLR 2019. 17 pages, 8 figures

  16. arXiv:1808.06670  [pdf, other

    stat.ML cs.LG

    Learning deep representations by mutual information estimation and maximization

    Authors: R Devon Hjelm, Alex Fedorov, Samuel Lavoie-Marchildon, Karan Grewal, Phil Bachman, Adam Trischler, Yoshua Bengio

    Abstract: In this work, we perform unsupervised learning of representations by maximizing mutual information between an input and the output of a deep neural network encoder. Importantly, we show that structure matters: incorporating knowledge about locality of the input to the objective can greatly influence a representation's suitability for downstream tasks. We further control characteristics of the repr… ▽ More

    Submitted 22 February, 2019; v1 submitted 20 August, 2018; originally announced August 2018.

    Comments: Accepted as an oral presentation at the International Conference for Learning Representations (ICLR), 2019

  17. arXiv:1808.00020  [pdf, other

    cs.LG stat.ML

    On-line Adaptative Curriculum Learning for GANs

    Authors: Thang Doan, Joao Monteiro, Isabela Albuquerque, Bogdan Mazoure, Audrey Durand, Joelle Pineau, R Devon Hjelm

    Abstract: Generative Adversarial Networks (GANs) can successfully approximate a probability distribution and produce realistic samples. However, open questions such as sufficient convergence conditions and mode collapse still persist. In this paper, we build on existing work in the area by proposing a novel framework for training the generator against an ensemble of discriminator networks, which can be seen… ▽ More

    Submitted 11 March, 2019; v1 submitted 31 July, 2018; originally announced August 2018.

    Comments: Accepted to the Thirty-Third AAAI Conference On Artificial Intelligence, 2019 (Added 128x128 CelebA samples to the end of the appendix)

    Journal ref: Proceedings of 33rd AAAI Conference on Artificial Intelligence (AAAI 2019)

  18. arXiv:1801.04062  [pdf, other

    cs.LG stat.ML

    MINE: Mutual Information Neural Estimation

    Authors: Mohamed Ishmael Belghazi, Aristide Baratin, Sai Rajeswar, Sherjil Ozair, Yoshua Bengio, Aaron Courville, R Devon Hjelm

    Abstract: We argue that the estimation of mutual information between high dimensional continuous random variables can be achieved by gradient descent over neural networks. We present a Mutual Information Neural Estimator (MINE) that is linearly scalable in dimensionality as well as in sample size, trainable through back-prop, and strongly consistent. We present a handful of applications on which MINE can be… ▽ More

    Submitted 14 August, 2021; v1 submitted 12 January, 2018; originally announced January 2018.

    Comments: 19 pages, 6 figures

    Journal ref: ICML 2018

  19. arXiv:1712.04120  [pdf, other

    stat.ML cs.LG

    GibbsNet: Iterative Adversarial Inference for Deep Graphical Models

    Authors: Alex Lamb, Devon Hjelm, Yaroslav Ganin, Joseph Paul Cohen, Aaron Courville, Yoshua Bengio

    Abstract: Directed latent variable models that formulate the joint distribution as $p(x,z) = p(z) p(x \mid z)$ have the advantage of fast and exact sampling. However, these models have the weakness of needing to specify $p(z)$, often with a simple fixed prior that limits the expressiveness of the model. Undirected latent variable models discard the requirement that $p(z)$ be specified with a prior, yet samp… ▽ More

    Submitted 11 December, 2017; originally announced December 2017.

    Comments: NIPS 2017

  20. arXiv:1711.04755  [pdf, other

    stat.ML cs.LG

    ACtuAL: Actor-Critic Under Adversarial Learning

    Authors: Anirudh Goyal, Nan Rosemary Ke, Alex Lamb, R Devon Hjelm, Chris Pal, Joelle Pineau, Yoshua Bengio

    Abstract: Generative Adversarial Networks (GANs) are a powerful framework for deep generative modeling. Posed as a two-player minimax problem, GANs are typically trained end-to-end on real-valued data and can be used to train a generator of high-dimensional and realistic images. However, a major limitation of GANs is that training relies on passing gradients from the discriminator through the generator via… ▽ More

    Submitted 13 November, 2017; originally announced November 2017.

  21. arXiv:1707.00309   

    stat.ML cs.LG

    Variance Regularizing Adversarial Learning

    Authors: Karan Grewal, R Devon Hjelm, Yoshua Bengio

    Abstract: We introduce a novel approach for training adversarial models by replacing the discriminator score with a bi-modal Gaussian distribution over the real/fake indicator variables. In order to do this, we train the Gaussian classifier to match the target bi-modal distribution implicitly through meta-adversarial training. We hypothesize that this approach ensures a non-zero gradient to the generator, e… ▽ More

    Submitted 19 August, 2018; v1 submitted 2 July, 2017; originally announced July 2017.

    Comments: Method is out of date and some results are incorrect

  22. arXiv:1702.08431  [pdf, other

    stat.ML cs.LG

    Boundary-Seeking Generative Adversarial Networks

    Authors: R Devon Hjelm, Athul Paul Jacob, Tong Che, Adam Trischler, Kyunghyun Cho, Yoshua Bengio

    Abstract: Generative adversarial networks (GANs) are a learning framework that rely on training a discriminator to estimate a measure of difference between a target and generated distributions. GANs, as normally formulated, rely on the generated samples being completely differentiable w.r.t. the generative parameters, and thus do not work for discrete data. We introduce a method for training GANs with discr… ▽ More

    Submitted 21 February, 2018; v1 submitted 27 February, 2017; originally announced February 2017.

  23. arXiv:1603.06624  [pdf, other

    cs.LG cs.NE stat.ML

    Variational Autoencoders for Feature Detection of Magnetic Resonance Imaging Data

    Authors: R. Devon Hjelm, Sergey M. Plis, Vince C. Calhoun

    Abstract: Independent component analysis (ICA), as an approach to the blind source-separation (BSS) problem, has become the de-facto standard in many medical imaging settings. Despite successes and a large ongoing research effort, the limitation of ICA to square linear transformations have not been overcome, so that general INFOMAX is still far from being realized. As an alternative, we present feature anal… ▽ More

    Submitted 21 March, 2016; originally announced March 2016.

  24. arXiv:1511.06382  [pdf, other

    cs.LG stat.ML

    Iterative Refinement of the Approximate Posterior for Directed Belief Networks

    Authors: R Devon Hjelm, Kyunghyun Cho, Junyoung Chung, Russ Salakhutdinov, Vince Calhoun, Nebojsa Jojic

    Abstract: Variational methods that rely on a recognition network to approximate the posterior of directed graphical models offer better inference and learning than previous methods. Recent advances that exploit the capacity and flexibility in this approach have expanded what kinds of models can be trained. However, as a proposal for the posterior, the capacity of the recognition network is limited, which ca… ▽ More

    Submitted 20 February, 2018; v1 submitted 19 November, 2015; originally announced November 2015.

  25. arXiv:1312.5847  [pdf, other

    cs.NE cs.LG stat.ML

    Deep learning for neuroimaging: a validation study

    Authors: Sergey M. Plis, Devon R. Hjelm, Ruslan Salakhutdinov, Vince D. Calhoun

    Abstract: Deep learning methods have recently made notable advances in the tasks of classification and representation learning. These tasks are important for brain imaging and neuroscience discovery, making the methods attractive for porting to a neuroimager's toolbox. Success of these methods is, in part, explained by the flexibility of deep learning models. However, this flexibility makes the process of p… ▽ More

    Submitted 19 February, 2014; v1 submitted 20 December, 2013; originally announced December 2013.

    Comments: ICLR 2014 revisions