Skip to main content

Showing 1–29 of 29 results for author: Deleu, T

.
  1. arXiv:2402.10309  [pdf, other

    cs.LG

    Discrete Probabilistic Inference as Control in Multi-path Environments

    Authors: Tristan Deleu, Padideh Nouri, Nikolay Malkin, Doina Precup, Yoshua Bengio

    Abstract: We consider the problem of sampling from a discrete and structured distribution as a sequential decision problem, where the objective is to find a stochastic policy such that objects are sampled at the end of this sequential process proportionally to some predefined reward. While we could use maximum entropy Reinforcement Learning (MaxEnt RL) to solve this problem for some distributions, it has be… ▽ More

    Submitted 27 May, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  2. arXiv:2307.04988  [pdf, other

    cs.LG stat.ME

    Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment Effect Estimation

    Authors: Chris Chinenye Emezue, Alexandre Drouin, Tristan Deleu, Stefan Bauer, Yoshua Bengio

    Abstract: The practical utility of causality in decision-making is widespread and brought about by the intertwining of causal discovery and causal inference. Nevertheless, a notable gap exists in the evaluation of causal discovery methods, where insufficient emphasis is placed on downstream inference. To address this gap, we evaluate seven established baseline causal discovery methods including a newly prop… ▽ More

    Submitted 30 July, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

    Comments: Peer-reviewed and Accepted to ICML 2023 Workshop on Structured Probabilistic Inference & Generative Modeling

  3. arXiv:2307.01422  [pdf, other

    cs.LG

    Generative Flow Networks: a Markov Chain Perspective

    Authors: Tristan Deleu, Yoshua Bengio

    Abstract: While Markov chain Monte Carlo methods (MCMC) provide a general framework to sample from a probability distribution defined up to normalization, they often suffer from slow convergence to the target distribution when the latter is highly multi-modal. Recently, Generative Flow Networks (GFlowNets) have been proposed as an alternative framework to mitigate this issue when samples have a clear compos… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

  4. arXiv:2306.15058  [pdf, other

    cs.LG stat.ML

    BatchGFN: Generative Flow Networks for Batch Active Learning

    Authors: Shreshth A. Malik, Salem Lahlou, Andrew Jesson, Moksh Jain, Nikolay Malkin, Tristan Deleu, Yoshua Bengio, Yarin Gal

    Abstract: We introduce BatchGFN -- a novel approach for pool-based active learning that uses generative flow networks to sample sets of data points proportional to a batch reward. With an appropriate reward function to quantify the utility of acquiring a batch, such as the joint mutual information between the batch and the model parameters, BatchGFN is able to construct highly informative batches for active… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted at the Structured Probabilistic Inference & Generative Modeling workshop, ICML 2023

  5. arXiv:2305.19366  [pdf, other

    cs.LG stat.ML

    Joint Bayesian Inference of Graphical Structure and Parameters with a Single Generative Flow Network

    Authors: Tristan Deleu, Mizu Nishikawa-Toomey, Jithendaraa Subramanian, Nikolay Malkin, Laurent Charlin, Yoshua Bengio

    Abstract: Generative Flow Networks (GFlowNets), a class of generative models over discrete and structured sample spaces, have been previously applied to the problem of inferring the marginal posterior distribution over the directed acyclic graph (DAG) of a Bayesian Network, given a dataset of observations. Based on recent advances extending this framework to non-discrete sample spaces, we propose in this pa… ▽ More

    Submitted 30 October, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  6. GFlowNets for AI-Driven Scientific Discovery

    Authors: Moksh Jain, Tristan Deleu, Jason Hartford, Cheng-Hao Liu, Alex Hernandez-Garcia, Yoshua Bengio

    Abstract: Tackling the most pressing problems for humanity, such as the climate crisis and the threat of global pandemics, requires accelerating the pace of scientific discovery. While science has traditionally relied on trial and error and even serendipity to a large extent, the last few decades have seen a surge of data-driven scientific discoveries. However, in order to truly leverage large-scale data se… ▽ More

    Submitted 27 June, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: 31 pages, 5 figures. Updated with camera-ready changes

  7. arXiv:2301.12594  [pdf, other

    cs.LG stat.ML

    A theory of continuous generative flow networks

    Authors: Salem Lahlou, Tristan Deleu, Pablo Lemos, Dinghuai Zhang, Alexandra Volokhova, Alex Hernández-García, Léna Néhale Ezzine, Yoshua Bengio, Nikolay Malkin

    Abstract: Generative flow networks (GFlowNets) are amortized variational inference algorithms that are trained to sample from unnormalized target distributions over compositional objects. A key limitation of GFlowNets until this time has been that they are restricted to discrete spaces. We present a theory for generalized GFlowNets, which encompasses both existing discrete GFlowNets and ones with continuous… ▽ More

    Submitted 25 May, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

    Comments: ICML 2023; 32 pages; code: https://github.com/saleml/continuous-gfn

  8. arXiv:2211.14666  [pdf, other

    cs.LG stat.ML

    Synergies between Disentanglement and Sparsity: Generalization and Identifiability in Multi-Task Learning

    Authors: Sébastien Lachapelle, Tristan Deleu, Divyat Mahajan, Ioannis Mitliagkas, Yoshua Bengio, Simon Lacoste-Julien, Quentin Bertrand

    Abstract: Although disentangled representations are often said to be beneficial for downstream tasks, current empirical and theoretical understanding is limited. In this work, we provide evidence that disentangled representations coupled with sparse base-predictors improve generalization. In the context of multi-task learning, we prove a new identifiability result that provides conditions under which maxima… ▽ More

    Submitted 6 June, 2023; v1 submitted 26 November, 2022; originally announced November 2022.

    Comments: Appears in: Fortieth International Conference on Machine Learning (ICML 2023). 36 pages

    ACM Class: I.2.6; I.5.1

  9. arXiv:2211.02763  [pdf, other

    cs.LG stat.ML

    Bayesian learning of Causal Structure and Mechanisms with GFlowNets and Variational Bayes

    Authors: Mizu Nishikawa-Toomey, Tristan Deleu, Jithendaraa Subramanian, Yoshua Bengio, Laurent Charlin

    Abstract: Bayesian causal structure learning aims to learn a posterior distribution over directed acyclic graphs (DAGs), and the mechanisms that define the relationship between parent and child variables. By taking a Bayesian approach, it is possible to reason about the uncertainty of the causal model. The notion of modelling the uncertainty over models is particularly crucial for causal structure learning… ▽ More

    Submitted 3 June, 2024; v1 submitted 4 November, 2022; originally announced November 2022.

  10. arXiv:2210.13583  [pdf, other

    cs.LG cs.AI stat.ME

    Learning Latent Structural Causal Models

    Authors: Jithendaraa Subramanian, Yashas Annadani, Ivaxi Sheth, Nan Rosemary Ke, Tristan Deleu, Stefan Bauer, Derek Nowrouzezahrai, Samira Ebrahimi Kahou

    Abstract: Causal learning has long concerned itself with the accurate recovery of underlying causal mechanisms. Such causal modelling enables better explanations of out-of-distribution data. Prior works on causal learning assume that the high-level causal variables are given. However, in machine learning tasks, one often operates on low-level data like image pixels or high-dimensional vectors. In such setti… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: 21 pages, 19 figures

  11. arXiv:2210.00580  [pdf, other

    cs.LG stat.ML

    GFlowNets and variational inference

    Authors: Nikolay Malkin, Salem Lahlou, Tristan Deleu, Xu Ji, Edward Hu, Katie Everett, Dinghuai Zhang, Yoshua Bengio

    Abstract: This paper builds bridges between two families of probabilistic algorithms: (hierarchical) variational inference (VI), which is typically used to model distributions over continuous spaces, and generative flow networks (GFlowNets), which have been used for distributions over discrete structures such as graphs. We demonstrate that, in certain cases, VI algorithms are equivalent to special cases of… ▽ More

    Submitted 2 March, 2023; v1 submitted 2 October, 2022; originally announced October 2022.

    Comments: ICLR 2023 final version; code: https://github.com/GFNOrg/GFN_vs_HVI

  12. arXiv:2203.01443  [pdf, other

    cs.LG

    Continuous-Time Meta-Learning with Forward Mode Differentiation

    Authors: Tristan Deleu, David Kanaa, Leo Feng, Giancarlo Kerg, Yoshua Bengio, Guillaume Lajoie, Pierre-Luc Bacon

    Abstract: Drawing inspiration from gradient-based meta-learning methods with infinitely small gradient steps, we introduce Continuous-Time Meta-Learning (COMLN), a meta-learning algorithm where adaptation follows the dynamics of a gradient vector field. Specifically, representations of the inputs are meta-learned such that a task-specific linear classifier is obtained as a solution of an ordinary differenti… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

  13. arXiv:2202.13903  [pdf, other

    cs.LG stat.ML

    Bayesian Structure Learning with Generative Flow Networks

    Authors: Tristan Deleu, António Góis, Chris Emezue, Mansi Rankawat, Simon Lacoste-Julien, Stefan Bauer, Yoshua Bengio

    Abstract: In Bayesian structure learning, we are interested in inferring a distribution over the directed acyclic graph (DAG) structure of Bayesian networks, from data. Defining such a distribution is very challenging, due to the combinatorially large sample space, and approximations based on MCMC are often required. Recently, a novel class of probabilistic models, called Generative Flow Networks (GFlowNets… ▽ More

    Submitted 28 June, 2022; v1 submitted 28 February, 2022; originally announced February 2022.

  14. arXiv:2201.11783  [pdf, other

    cs.LG

    Boosting Exploration in Multi-Task Reinforcement Learning using Adversarial Networks

    Authors: Ramnath Kumar, Tristan Deleu, Yoshua Bengio

    Abstract: Advancements in reinforcement learning (RL) have been remarkable in recent years. However, the limitations of traditional training methods have become increasingly evident, particularly in meta-RL settings where agents face new, unseen tasks. Conventional training approaches are susceptible to failure in such situations as they need more robustness to adversity. Our proposed adversarial training r… ▽ More

    Submitted 6 February, 2023; v1 submitted 27 January, 2022; originally announced January 2022.

  15. arXiv:2201.11775  [pdf, other

    cs.LG

    The Effect of Diversity in Meta-Learning

    Authors: Ramnath Kumar, Tristan Deleu, Yoshua Bengio

    Abstract: Recent studies show that task distribution plays a vital role in the meta-learner's performance. Conventional wisdom is that task diversity should improve the performance of meta-learning. In this work, we find evidence to the contrary; (i) our experiments draw into question the efficacy of our learned models: similar manifolds can be learned with a subset of the data (lower task diversity). This… ▽ More

    Submitted 24 November, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: Accepted at AAAI 23

  16. arXiv:2111.09266  [pdf, other

    cs.LG cs.AI stat.ML

    GFlowNet Foundations

    Authors: Yoshua Bengio, Salem Lahlou, Tristan Deleu, Edward J. Hu, Mo Tiwari, Emmanuel Bengio

    Abstract: Generative Flow Networks (GFlowNets) have been introduced as a method to sample a diverse set of candidates in an active learning context, with a training objective that makes them approximately sample in proportion to a given reward function. In this paper, we show a number of additional theoretical properties of GFlowNets. They can be used to estimate joint probability distributions and the corr… ▽ More

    Submitted 10 July, 2023; v1 submitted 17 November, 2021; originally announced November 2021.

  17. arXiv:2102.03869  [pdf, other

    cs.LG math.OC

    Structured Sparsity Inducing Adaptive Optimizers for Deep Learning

    Authors: Tristan Deleu, Yoshua Bengio

    Abstract: The parameters of a neural network are naturally organized in groups, some of which might not contribute to its overall performance. To prune out unimportant groups of parameters, we can include some non-differentiable penalty to the objective function, and minimize it using proximal gradient methods. In this paper, we derive the weighted proximal operator, which is a necessary component of these… ▽ More

    Submitted 5 January, 2023; v1 submitted 7 February, 2021; originally announced February 2021.

  18. arXiv:2010.16004  [pdf, other

    cs.CY cs.LG cs.MA cs.SI

    COVI-AgentSim: an Agent-based Model for Evaluating Methods of Digital Contact Tracing

    Authors: Prateek Gupta, Tegan Maharaj, Martin Weiss, Nasim Rahaman, Hannah Alsdurf, Abhinav Sharma, Nanor Minoyan, Soren Harnois-Leblanc, Victor Schmidt, Pierre-Luc St. Charles, Tristan Deleu, Andrew Williams, Akshay Patel, Meng Qu, Olexa Bilaniuk, Gaétan Marceau Caron, Pierre Luc Carrier, Satya Ortiz-Gagné, Marc-Andre Rousseau, David Buckeridge, Joumana Ghosn, Yang Zhang, Bernhard Schölkopf, Jian Tang, Irina Rish , et al. (4 additional authors not shown)

    Abstract: The rapid global spread of COVID-19 has led to an unprecedented demand for effective methods to mitigate the spread of the disease, and various digital contact tracing (DCT) methods have emerged as a component of the solution. In order to make informed public health choices, there is a need for tools which allow evaluation and comparison of DCT methods. We introduce an agent-based compartmental si… ▽ More

    Submitted 29 October, 2020; originally announced October 2020.

  19. arXiv:2010.12536  [pdf, other

    cs.LG cs.AI cs.MA cs.SI

    Predicting Infectiousness for Proactive Contact Tracing

    Authors: Yoshua Bengio, Prateek Gupta, Tegan Maharaj, Nasim Rahaman, Martin Weiss, Tristan Deleu, Eilif Muller, Meng Qu, Victor Schmidt, Pierre-Luc St-Charles, Hannah Alsdurf, Olexa Bilanuik, David Buckeridge, Gáetan Marceau Caron, Pierre-Luc Carrier, Joumana Ghosn, Satya Ortiz-Gagne, Chris Pal, Irina Rish, Bernhard Schölkopf, Abhinav Sharma, Jian Tang, Andrew Williams

    Abstract: The COVID-19 pandemic has spread rapidly worldwide, overwhelming manual contact tracing in many countries and resulting in widespread lockdowns for emergency containment. Large-scale digital contact tracing (DCT) has emerged as a potential solution to resume economic and social activity while minimizing spread of the virus. Various DCT methods have been proposed, each making trade-offs between pri… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

  20. arXiv:2005.08502  [pdf, other

    cs.CR cs.AI cs.CY

    COVI White Paper

    Authors: Hannah Alsdurf, Edmond Belliveau, Yoshua Bengio, Tristan Deleu, Prateek Gupta, Daphne Ippolito, Richard Janda, Max Jarvie, Tyler Kolody, Sekoul Krastev, Tegan Maharaj, Robert Obryk, Dan Pilat, Valerie Pisano, Benjamin Prud'homme, Meng Qu, Nasim Rahaman, Irina Rish, Jean-Francois Rousseau, Abhinav Sharma, Brooke Struck, Jian Tang, Martin Weiss, Yun William Yu

    Abstract: The SARS-CoV-2 (Covid-19) pandemic has caused significant strain on public health institutions around the world. Contact tracing is an essential tool to change the course of the Covid-19 pandemic. Manual contact tracing of Covid-19 cases has significant challenges that limit the ability of public health authorities to minimize community infections. Personalized peer-to-peer contact tracing through… ▽ More

    Submitted 27 July, 2020; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: 64 pages, 1 figure

  21. arXiv:2002.07956  [pdf, other

    cs.LG cs.AI stat.ML

    Curriculum in Gradient-Based Meta-Reinforcement Learning

    Authors: Bhairav Mehta, Tristan Deleu, Sharath Chandra Raparthy, Chris J. Pal, Liam Paull

    Abstract: Gradient-based meta-learners such as Model-Agnostic Meta-Learning (MAML) have shown strong few-shot performance in supervised and reinforcement learning settings. However, specifically in the case of meta-reinforcement learning (meta-RL), we can show that gradient-based meta-learners are sensitive to task distributions. With the wrong curriculum, agents suffer the effects of meta-overfitting, shal… ▽ More

    Submitted 18 February, 2020; originally announced February 2020.

    Comments: 11 pages, 10 figures

  22. arXiv:1910.08636  [pdf, other

    cs.LG q-bio.QM stat.ML

    The TCGA Meta-Dataset Clinical Benchmark

    Authors: Mandana Samiei, Tobias Würfl, Tristan Deleu, Martin Weiss, Francis Dutil, Thomas Fevens, Geneviève Boucher, Sebastien Lemieux, Joseph Paul Cohen

    Abstract: Machine learning is bringing a paradigm shift to healthcare by changing the process of disease diagnosis and prognosis in clinics and hospitals. This development equips doctors and medical staff with tools to evaluate their hypotheses and hence make more precise decisions. Although most current research in the literature seeks to develop techniques and methods for predicting one particular clinica… ▽ More

    Submitted 18 October, 2019; originally announced October 2019.

    Comments: 5 Pages, Submitted to MLCB 2019

  23. arXiv:1909.06576  [pdf, ps, other

    cs.LG stat.ML

    Torchmeta: A Meta-Learning library for PyTorch

    Authors: Tristan Deleu, Tobias Würfl, Mandana Samiei, Joseph Paul Cohen, Yoshua Bengio

    Abstract: The constant introduction of standardized benchmarks in the literature has helped accelerating the recent advances in meta-learning research. They offer a way to get a fair comparison between different algorithms, and the wide range of datasets available allows full control over the complexity of this evaluation. However, for a large majority of code available online, the data pipeline is often sp… ▽ More

    Submitted 14 September, 2019; originally announced September 2019.

  24. arXiv:1906.04355  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Powerful Policies by Using Consistent Dynamics Model

    Authors: Shagun Sodhani, Anirudh Goyal, Tristan Deleu, Yoshua Bengio, Sergey Levine, Jian Tang

    Abstract: Model-based Reinforcement Learning approaches have the promise of being sample efficient. Much of the progress in learning dynamics models in RL has been made by learning models via supervised learning. But traditional model-based approaches lead to `compounding errors' when the model is unrolled step by step. Essentially, the state transitions that the learner predicts (by unrolling the model for… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: Accpted at RLDM 2019

  25. arXiv:1906.02226  [pdf, other

    cs.LG stat.ML

    Gradient-Based Neural DAG Learning

    Authors: Sébastien Lachapelle, Philippe Brouillard, Tristan Deleu, Simon Lacoste-Julien

    Abstract: We propose a novel score-based approach to learning a directed acyclic graph (DAG) from observational data. We adapt a recently proposed continuous constrained optimization formulation to allow for nonlinear relationships between variables using neural networks. This extension allows to model complex interactions while avoiding the combinatorial nature of the problem. In addition to comparing our… ▽ More

    Submitted 18 February, 2020; v1 submitted 5 June, 2019; originally announced June 2019.

    Comments: Appears in: Proceedings of the Eighth International Conference on Learning Representations (ICLR 2020). 23 pages

    ACM Class: I.2.6; I.5.1

  26. arXiv:1901.10912  [pdf, other

    cs.LG stat.ML

    A Meta-Transfer Objective for Learning to Disentangle Causal Mechanisms

    Authors: Yoshua Bengio, Tristan Deleu, Nasim Rahaman, Rosemary Ke, Sébastien Lachapelle, Olexa Bilaniuk, Anirudh Goyal, Christopher Pal

    Abstract: We propose to meta-learn causal structures based on how fast a learner adapts to new distributions arising from sparse distributional changes, e.g. due to interventions, actions of agents and other sources of non-stationarities. We show that under this assumption, the correct causal structural choices lead to faster adaptation to modified distributions because the changes are concentrated in one o… ▽ More

    Submitted 4 February, 2019; v1 submitted 30 January, 2019; originally announced January 2019.

  27. arXiv:1812.02159  [pdf, other

    cs.LG stat.ML

    The effects of negative adaptation in Model-Agnostic Meta-Learning

    Authors: Tristan Deleu, Yoshua Bengio

    Abstract: The capacity of meta-learning algorithms to quickly adapt to a variety of tasks, including ones they did not experience during meta-training, has been a key factor in the recent success of these methods on few-shot learning problems. This particular advantage of using meta-learning over standard supervised or reinforcement learning is only well founded under the assumption that the adaptation phas… ▽ More

    Submitted 5 December, 2018; originally announced December 2018.

    Comments: Workshop on Meta-Learning - 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montreal, Canada

  28. arXiv:1612.00827  [pdf, ps, other

    cs.LG

    Learning Operations on a Stack with Neural Turing Machines

    Authors: Tristan Deleu, Joseph Dureau

    Abstract: Multiple extensions of Recurrent Neural Networks (RNNs) have been proposed recently to address the difficulty of storing information over long time periods. In this paper, we experiment with the capacity of Neural Turing Machines (NTMs) to deal with these long-term dependencies on well-balanced strings of parentheses. We show that not only does the NTM emulate a stack with its heads and learn an a… ▽ More

    Submitted 2 December, 2016; originally announced December 2016.

    Comments: 1st Workshop on Neural Abstract Machines & Program Induction (NAMPI), NIPS 2016, Barcelona, Spain

  29. arXiv:1401.4944  [pdf, other

    cs.IT

    Iterative pre-distortion of the non-linear satellite channel

    Authors: Thibault Deleu, Mathieu Dervin, Kenta Kasai, François Horlin

    Abstract: Digital Video Broadcasting - Satellite - Second Generation (DVB-S2) is the current European standard for satellite broadcast and broadband communications. It relies on high order modulations up to 32-amplitude/phase-shift-keying (APSK) in order to increase the system spectral efficiency. Unfortunately, as the modulation order increases, the receiver becomes more sensitive to physical layer impairm… ▽ More

    Submitted 13 May, 2014; v1 submitted 20 January, 2014; originally announced January 2014.