Showing 1–8 of 8 results for author: As, Y

Searching in archive cs.
  1. arXiv:2406.01163  [pdf, other]

    cs.LG

    When to Sense and Control? A Time-adaptive Approach for Continuous-Time RL

    Authors: Lenart Treven, Bhavya Sukhija, Yarden As, Florian Dörfler, Andreas Krause

    Abstract: Reinforcement learning (RL) excels in optimizing policies for discrete-time Markov decision processes (MDP). However, various systems are inherently continuous in time, making discrete-time MDPs an inexact modeling choice. In many applications, such as greenhouse control or medical treatments, each interaction (measurement or switching of action) involves manual intervention and thus is inherently…

    Submitted 4 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2405.05890  [pdf, other]

    cs.LG cs.AI

    Safe Exploration Using Bayesian World Models and Log-Barrier Optimization

    Authors: Yarden As, Bhavya Sukhija, Andreas Krause

    Abstract: A major challenge in deploying reinforcement learning in online tasks is ensuring that safety is maintained throughout the learning process. In this work, we propose CERL, a new method for solving constrained Markov decision processes while keeping the policy safe during learning. Our method leverages Bayesian world models and suggests policies that are pessimistic w.r.t. the model's epistemic unc…

    Submitted 9 May, 2024; originally announced May 2024.

  3. arXiv:2402.15898  [pdf, other]

    cs.LG cs.AI

    Transductive Active Learning: Theory and Applications

    Authors: Jonas Hübotter, Bhavya Sukhija, Lenart Treven, Yarden As, Andreas Krause

    Abstract: We generalize active learning to address real-world settings with concrete prediction targets where sampling is restricted to an accessible region of the domain, while prediction targets may lie outside this region. We analyze a family of decision rules that sample adaptively to minimize uncertainty about prediction targets. We are the first to show, under general regularity assumptions, that such…

    Submitted 22 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2402.15441

  4. arXiv:2402.15441  [pdf, other]

    cs.LG cs.AI

    Active Few-Shot Fine-Tuning

    Authors: Jonas Hübotter, Bhavya Sukhija, Lenart Treven, Yarden As, Andreas Krause

    Abstract: We study the question: How can we select the right data for fine-tuning to a specific task? We call this data selection problem active fine-tuning and show that it is an instance of transductive active learning, a novel generalization of classical active learning. We propose ITL, short for information-based transductive learning, an approach which samples adaptively to maximize information gained…

    Submitted 21 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  5. arXiv:2311.07558  [pdf, other]

    cs.LG cs.RO

    Data-Efficient Task Generalization via Probabilistic Model-based Meta Reinforcement Learning

    Authors: Arjun Bhardwaj, Jonas Rothfuss, Bhavya Sukhija, Yarden As, Marco Hutter, Stelian Coros, Andreas Krause

    Abstract: We introduce PACOH-RL, a novel model-based Meta-Reinforcement Learning (Meta-RL) algorithm designed to efficiently adapt control policies to changing dynamics. PACOH-RL meta-learns priors for the dynamics model, allowing swift adaptation to new dynamics with minimal interaction data. Existing Meta-RL methods require abundant meta-learning data, limiting their applicability in settings such as robo…

    Submitted 6 February, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

  6. arXiv:2305.05354  [pdf, other]

    cs.RO cs.AI

    Safe Deep RL for Intraoperative Planning of Pedicle Screw Placement

    Authors: Yunke Ao, Hooman Esfandiari, Fabio Carrillo, Yarden As, Mazda Farshad, Benjamin F. Grewe, Andreas Krause, Philipp Fuernstahl

    Abstract: Spinal fusion surgery requires highly accurate implantation of pedicle screw implants, which must be conducted in critical proximity to vital structures with a limited view of anatomy. Robotic surgery systems have been proposed to improve placement accuracy, however, state-of-the-art systems suffer from the limitations of open-loop approaches, as they follow traditional concepts of preoperative pl…

    Submitted 10 May, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 10 pages, 4 figures

  7. arXiv:2207.10415  [pdf, other]

    math.OC cs.LG

    Log Barriers for Safe Black-box Optimization with Application to Safe Reinforcement Learning

    Authors: Ilnura Usmanova, Yarden As, Maryam Kamgarpour, Andreas Krause

    Abstract: Optimizing noisy functions online, when evaluating the objective requires experiments on a deployed system, is a crucial task arising in manufacturing, robotics and many others. Often, constraints on safe inputs are unknown ahead of time, and we only obtain noisy information, indicating how close we are to violating the constraints. Yet, safety must be guaranteed at all times, not only for the fin…

    Submitted 2 June, 2023; v1 submitted 21 July, 2022; originally announced July 2022.

    Comments: 36 pages, 9 pages of appendix

  8. arXiv:2201.09802  [pdf, other]

    cs.LG cs.AI cs.RO

    Constrained Policy Optimization via Bayesian World Models

    Authors: Yarden As, Ilnura Usmanova, Sebastian Curi, Andreas Krause

    Abstract: Improving sample-efficiency and safety are crucial challenges when deploying reinforcement learning in high-stakes real world applications. We propose LAMBDA, a novel model-based approach for policy optimization in safety critical tasks modeled via constrained Markov decision processes. Our approach utilizes Bayesian world models, and harnesses the resulting uncertainty to maximize optimistic uppe…

    Submitted 6 February, 2022; v1 submitted 24 January, 2022; originally announced January 2022.