Skip to main content

Showing 1–8 of 8 results for author: Jiralerspong, T

.
  1. arXiv:2407.00957  [pdf, other

    cs.NE q-bio.NC stat.ML

    Expressivity of Neural Networks with Random Weights and Learned Biases

    Authors: Ezekiel Williams, Avery Hee-Woon Ryoo, Thomas Jiralerspong, Alexandre Payeur, Matthew G. Perich, Luca Mazzucato, Guillaume Lajoie

    Abstract: Landmark universal function approximation results for neural networks with trained weights and biases provided impetus for the ubiquitous use of neural networks as learning models in Artificial Intelligence (AI) and neuroscience. Recent work has pushed the bounds of universal approximation by showing that arbitrary functions can similarly be learned by tuning smaller subsets of parameters, for exa… ▽ More

    Submitted 2 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: change to article metadata only: author name typo correction

  2. arXiv:2402.01207  [pdf, other

    cs.LG cs.AI stat.ME

    Efficient Causal Graph Discovery Using Large Language Models

    Authors: Thomas Jiralerspong, Xiaoyin Chen, Yash More, Vedant Shah, Yoshua Bengio

    Abstract: We propose a novel framework that leverages LLMs for full causal graph discovery. While previous LLM-based methods have used a pairwise query approach, this requires a quadratic number of queries which quickly becomes impractical for larger causal graphs. In contrast, the proposed framework uses a breadth-first search (BFS) approach which allows it to use only a linear number of queries. We also s… ▽ More

    Submitted 13 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  3. arXiv:2310.10693  [pdf, other

    cs.SI cs.AI cs.LG

    Network Analysis of the iNaturalist Citizen Science Community

    Authors: Yu Lu Liu, Thomas Jiralerspong

    Abstract: In recent years, citizen science has become a larger and larger part of the scientific community. Its ability to crowd source data and expertise from thousands of citizen scientists makes it invaluable. Despite the field's growing popularity, the interactions and structure of citizen science projects are still poorly understood and under analyzed. We use the iNaturalist citizen science platform as… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

  4. arXiv:2310.09997  [pdf, other

    cs.AI cs.LG eess.SY

    Forecaster: Towards Temporally Abstract Tree-Search Planning from Pixels

    Authors: Thomas Jiralerspong, Flemming Kondrup, Doina Precup, Khimya Khetarpal

    Abstract: The ability to plan at many different levels of abstraction enables agents to envision the long-term repercussions of their decisions and thus enables sample-efficient learning. This becomes particularly beneficial in complex environments from high-dimensional state space such as pixels, where the goal is distant and the reward sparse. We introduce Forecaster, a deep hierarchical reinforcement lea… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

  5. arXiv:2310.02423  [pdf, other

    cs.LG stat.ML

    Delta-AI: Local objectives for amortized inference in sparse graphical models

    Authors: Jean-Pierre Falet, Hae Beom Lee, Nikolay Malkin, Chen Sun, Dragos Secrieru, Thomas Jiralerspong, Dinghuai Zhang, Guillaume Lajoie, Yoshua Bengio

    Abstract: We present a new algorithm for amortized inference in sparse probabilistic graphical models (PGMs), which we call $Δ$-amortized inference ($Δ$-AI). Our approach is based on the observation that when the sampling of variables in a PGM is seen as a sequence of actions taken by an agent, sparsity of the PGM enables local credit assignment in the agent's policy learning objective. This yields a local… ▽ More

    Submitted 13 March, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024; 19 pages, code: https://github.com/GFNOrg/Delta-AI/

  6. arXiv:2308.05711  [pdf, other

    cs.LG eess.SY

    A Comparison of Classical and Deep Reinforcement Learning Methods for HVAC Control

    Authors: Marshall Wang, John Willes, Thomas Jiralerspong, Matin Moezzi

    Abstract: Reinforcement learning (RL) is a promising approach for optimizing HVAC control. RL offers a framework for improving system performance, reducing energy consumption, and enhancing cost efficiency. We benchmark two popular classical and deep RL methods (Q-Learning and Deep-Q-Networks) across multiple HVAC environments and explore the practical consideration of model hyper-parameter selection and re… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  7. arXiv:2210.05845  [pdf, other

    cs.LG cs.AI

    Contrastive Retrospection: honing in on critical steps for rapid learning and generalization in RL

    Authors: Chen Sun, Wannan Yang, Thomas Jiralerspong, Dane Malenfant, Benjamin Alsbury-Nealy, Yoshua Bengio, Blake Richards

    Abstract: In real life, success is often contingent upon multiple critical steps that are distant in time from each other and from the final reward. These critical steps are challenging to identify with traditional reinforcement learning (RL) methods that rely on the Bellman equation for credit assignment. Here, we present a new RL algorithm that uses offline contrastive learning to hone in on these critica… ▽ More

    Submitted 27 October, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

  8. arXiv:2210.02552  [pdf, other

    cs.LG

    Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning

    Authors: Flemming Kondrup, Thomas Jiralerspong, Elaine Lau, Nathan de Lara, Jacob Shkrob, My Duc Tran, Doina Precup, Sumana Basu

    Abstract: Mechanical ventilation is a key form of life support for patients with pulmonary impairment. Healthcare workers are required to continuously adjust ventilator settings for each patient, a challenging and time consuming task. Hence, it would be beneficial to develop an automated decision support tool to optimize ventilation treatment. We present DeepVent, a Conservative Q-Learning (CQL) based offli… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: to be published in IAAI (Innovative Applications of Artificial Intelligence) 2023