Showing 1–2 of 2 results for author: Kondrup, F

  1. arXiv:2310.09997  [pdf, other]

    cs.AI cs.LG eess.SY

    Forecaster: Towards Temporally Abstract Tree-Search Planning from Pixels

    Authors: Thomas Jiralerspong, Flemming Kondrup, Doina Precup, Khimya Khetarpal

    Abstract: The ability to plan at many different levels of abstraction enables agents to envision the long-term repercussions of their decisions and thus enables sample-efficient learning. This becomes particularly beneficial in complex environments from high-dimensional state space such as pixels, where the goal is distant and the reward sparse. We introduce Forecaster, a deep hierarchical reinforcement lea…

    Submitted 15 October, 2023; originally announced October 2023.

  2. arXiv:2210.02552  [pdf, other]

    cs.LG

    Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning

    Authors: Flemming Kondrup, Thomas Jiralerspong, Elaine Lau, Nathan de Lara, Jacob Shkrob, My Duc Tran, Doina Precup, Sumana Basu

    Abstract: Mechanical ventilation is a key form of life support for patients with pulmonary impairment. Healthcare workers are required to continuously adjust ventilator settings for each patient, a challenging and time-consuming task. Hence, it would be beneficial to develop an automated decision support tool to optimize ventilation treatment. We present DeepVent, a Conservative Q-Learning (CQL) based offli…

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: to be published in IAAI (Innovative Applications of Artificial Intelligence) 2023