Showing 1–8 of 8 results for author: Luis, C E

  1. arXiv:2402.15347

    cs.LG cs.AI stat.ML

    Information-Theoretic Safe Bayesian Optimization

    Authors: Alessandro G. Bottero, Carlos E. Luis, Julia Vinogradska, Felix Berkenkamp, Jan Peters

    Abstract: We consider a sequential decision making task, where the goal is to optimize an unknown function without evaluating parameters that violate an a priori unknown (safety) constraint. A common approach is to place a Gaussian process prior on the unknown functions and allow evaluations only in regions that are safe with high probability. Most current methods rely on a discretization of the domain and…

    Submitted 10 May, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2212.04914

  2. arXiv:2312.04386

    cs.LG cs.AI

    Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization

    Authors: Carlos E. Luis, Alessandro G. Bottero, Julia Vinogradska, Felix Berkenkamp, Jan Peters

    Abstract: We consider the problem of quantifying uncertainty over expected cumulative rewards in model-based reinforcement learning. In particular, we focus on characterizing the variance over values induced by a distribution over MDPs. Previous work upper bounds the posterior variance over values by solving a so-called uncertainty Bellman equation (UBE), but the over-approximation may result in inefficient…

    Submitted 13 December, 2023; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2302.12526

  3. arXiv:2308.06590

    cs.LG cs.AI

    Value-Distributional Model-Based Reinforcement Learning

    Authors: Carlos E. Luis, Alessandro G. Bottero, Julia Vinogradska, Felix Berkenkamp, Jan Peters

    Abstract: Quantifying uncertainty about a policy's long-term performance is important to solve sequential decision-making tasks. We study the problem from a model-based Bayesian reinforcement learning perspective, where the goal is to learn the posterior distribution over value functions induced by parameter (epistemic) uncertainty of the Markov decision process. Previous work restricts the analysis to a fe…

    Submitted 12 August, 2023; originally announced August 2023.

  4. arXiv:2302.12526

    cs.LG cs.AI stat.ML

    Model-Based Uncertainty in Value Functions

    Authors: Carlos E. Luis, Alessandro G. Bottero, Julia Vinogradska, Felix Berkenkamp, Jan Peters

    Abstract: We consider the problem of quantifying uncertainty over expected cumulative rewards in model-based reinforcement learning. In particular, we focus on characterizing the variance over values induced by a distribution over MDPs. Previous work upper bounds the posterior variance over values by solving a so-called uncertainty Bellman equation, but the over-approximation may result in inefficient explo…

    Submitted 7 March, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: AISTATS 2023

  5. arXiv:2212.04914

    cs.LG

    Information-Theoretic Safe Exploration with Gaussian Processes

    Authors: Alessandro G. Bottero, Carlos E. Luis, Julia Vinogradska, Felix Berkenkamp, Jan Peters

    Abstract: We consider a sequential decision making task where we are not allowed to evaluate parameters that violate an a priori unknown (safety) constraint. A common approach is to place a Gaussian process prior on the unknown constraint and allow evaluations only in regions that are safe with high probability. Most current methods rely on a discretization of the domain and cannot be directly extended to t…

    Submitted 9 December, 2022; originally announced December 2022.

    Comments: Submitted to NeurIPS 2022

  6. Online Trajectory Generation with Distributed Model Predictive Control for Multi-Robot Motion Planning

    Authors: Carlos E. Luis, Marijan Vukosavljev, Angela P. Schoellig

    Abstract: We present a distributed model predictive control (DMPC) algorithm to generate trajectories in real-time for multiple robots. We adopted the on-demand collision avoidance method presented in previous work to efficiently compute non-colliding trajectories in transition tasks. An event-triggered replanning strategy is proposed to account for disturbances. Our simulation results show that th…

    Submitted 24 January, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

    Comments: 8 pages, 8 figures

    Journal ref: IEEE Robotics and Automation Letters, vol. 5, iss. 2, pp. 604-611, 2020

  7. arXiv:1810.03572

    cs.RO

    Fast and In Sync: Periodic Swarm Patterns for Quadrotors

    Authors: Xintong Du, Carlos E. Luis, Marijan Vukosavljev, Angela P. Schoellig

    Abstract: This paper aims to design quadrotor swarm performances, where the swarm acts as an integrated, coordinated unit embodying moving and deforming objects. We divide the task of creating a choreography into three basic steps: designing swarm motion primitives, transitioning between those movements, and synchronizing the motion of the drones. The result is a flexible framework for designing choreograph…

    Submitted 2 May, 2019; v1 submitted 8 October, 2018; originally announced October 2018.

    Comments: This work was accepted to ICRA 2019. It is a finalist nominated for the Best Paper Award on Multi-Robot Systems and the Best Paper Award on Uncrewed Aerial Vehicles

  8. Trajectory Generation for Multiagent Point-To-Point Transitions via Distributed Model Predictive Control

    Authors: Carlos E. Luis, Angela P. Schoellig

    Abstract: This paper introduces a novel algorithm for multiagent offline trajectory generation based on distributed model predictive control. Central to the algorithm's scalability and success is the development of an on-demand collision avoidance strategy. By predicting future states and sharing this information with their neighbors, the agents are able to detect and avoid collisions while moving toward th…

    Submitted 15 January, 2019; v1 submitted 11 September, 2018; originally announced September 2018.

    Comments: 8 pages, 7 figures

    Journal ref: IEEE Robotics and Automation Letters, vol. 4, iss. 2, pp. 375-382, 2019