Skip to main content

Showing 1–13 of 13 results for author: Theile, M

.
  1. arXiv:2406.06253  [pdf, other

    eess.SY cs.PL

    PretVM: Predictable, Efficient Virtual Machine for Real-Time Concurrency

    Authors: Shaokai Lin, Erling Jellum, Mirco Theile, Tassilo Tanneberger, Binqi Sun, Chadlia Jerad, Ruomu Xu, Guangyu Feng, Christian Menard, Marten Lohstroh, Jeronimo Castrillon, Sanjit Seshia, Edward Lee

    Abstract: This paper introduces the Precision-Timed Virtual Machine (PretVM), an intermediate platform facilitating the execution of quasi-static schedules compiled from a subset of programs written in the Lingua Franca (LF) coordination language. The subset consists of those programs that in principle should have statically verifiable and predictable timing behavior. The PretVM provides a schedule with wel… ▽ More

    Submitted 25 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  2. arXiv:2403.12856  [pdf, other

    cs.LG cs.RO

    Equivariant Ensembles and Regularization for Reinforcement Learning in Map-based Path Planning

    Authors: Mirco Theile, Hongpeng Cao, Marco Caccamo, Alberto L. Sangiovanni-Vincentelli

    Abstract: In reinforcement learning (RL), exploiting environmental symmetries can significantly enhance efficiency, robustness, and performance. However, ensuring that the deep RL policy and value networks are respectively equivariant and invariant to exploit these symmetries is a substantial challenge. Related works try to design networks that are equivariant and invariant by construction, limiting them to… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: submitted for possible publication. A video can be found here: https://youtu.be/L6NOdvU7n7s

  3. arXiv:2309.03157  [pdf, other

    cs.RO cs.LG

    Learning to Recharge: UAV Coverage Path Planning through Deep Reinforcement Learning

    Authors: Mirco Theile, Harald Bayerlein, Marco Caccamo, Alberto L. Sangiovanni-Vincentelli

    Abstract: Coverage path planning (CPP) is a critical problem in robotics, where the goal is to find an efficient path that covers every point in an area of interest. This work addresses the power-constrained CPP problem with recharge for battery-limited unmanned aerial vehicles (UAVs). In this problem, a notable challenge emerges from integrating recharge journeys into the overall coverage strategy, highlig… ▽ More

    Submitted 7 September, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  4. arXiv:2308.14647  [pdf, other

    cs.LG cs.DC cs.DM math.OC

    Edge Generation Scheduling for DAG Tasks Using Deep Reinforcement Learning

    Authors: Binqi Sun, Mirco Theile, Ziyuan Qin, Daniele Bernardini, Debayan Roy, Andrea Bastoni, Marco Caccamo

    Abstract: Directed acyclic graph (DAG) tasks are currently adopted in the real-time domain to model complex applications from the automotive, avionics, and industrial domains that implement their functionalities through chains of intercommunicating tasks. This paper studies the problem of scheduling real-time DAG tasks by presenting a novel schedulability test based on the concept of trivial schedulability.… ▽ More

    Submitted 10 January, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: Accepted for publication in IEEE Transactions on Computers

  5. Learning to Generate All Feasible Actions

    Authors: Mirco Theile, Daniele Bernardini, Raphael Trumpp, Cristina Piazza, Marco Caccamo, Alberto L. Sangiovanni-Vincentelli

    Abstract: Modern cyber-physical systems are becoming increasingly complex to model, thus motivating data-driven techniques such as reinforcement learning (RL) to find appropriate control agents. However, most systems are subject to hard constraints such as safety or operational bounds. Typically, to learn to satisfy these constraints, the agent must violate them systematically, which is computationally proh… ▽ More

    Submitted 5 July, 2024; v1 submitted 26 January, 2023; originally announced January 2023.

  6. Cloud-Edge Training Architecture for Sim-to-Real Deep Reinforcement Learning

    Authors: Hongpeng Cao, Mirco Theile, Federico G. Wyrwal, Marco Caccamo

    Abstract: Deep reinforcement learning (DRL) is a promising approach to solve complex control tasks by learning policies through interactions with the environment. However, the training of DRL policies requires large amounts of training experiences, making it impractical to learn the policy directly on physical systems. Sim-to-real approaches leverage simulations to pretrain DRL policies and then deploy them… ▽ More

    Submitted 28 July, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

    Comments: Submitted to IROS 2022

  7. arXiv:2107.09973  [pdf, other

    cs.RO cs.MA eess.SY

    Multi-Agent Belief Sharing through Autonomous Hierarchical Multi-Level Clustering

    Authors: Mirco Theile, Jonathan Ponniah, Or Dantsker, Marco Caccamo

    Abstract: Coordination in multi-agent systems is challenging for agile robots such as unmanned aerial vehicles (UAVs), where relative agent positions frequently change due to unconstrained movement. The problem is exacerbated through the individual take-off and landing of agents for battery recharging leading to a varying number of active agents throughout the whole mission. This work proposes autonomous hi… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

    Comments: Submitted to IEEE Transactions on Robotics, article extends on https://doi.org/10.2514/6.2021-0656

  8. arXiv:2010.12461  [pdf, other

    cs.MA cs.IT cs.LG cs.RO eess.SY

    Multi-UAV Path Planning for Wireless Data Harvesting with Deep Reinforcement Learning

    Authors: Harald Bayerlein, Mirco Theile, Marco Caccamo, David Gesbert

    Abstract: Harvesting data from distributed Internet of Things (IoT) devices with multiple autonomous unmanned aerial vehicles (UAVs) is a challenging problem requiring flexible path planning methods. We propose a multi-agent reinforcement learning (MARL) approach that, in contrast to previous work, can adapt to profound changes in the scenario parameters defining the data harvesting mission, such as the num… ▽ More

    Submitted 3 June, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: Modifications: final formatting; Code available under https://github.com/hbayerlein/uav_data_harvesting, article extends on arXiv:2007.00544

    Journal ref: IEEE Open Journal of the Communications Society, vol. 2, pp. 1171-1187, 2021

  9. UAV Path Planning using Global and Local Map Information with Deep Reinforcement Learning

    Authors: Mirco Theile, Harald Bayerlein, Richard Nai, David Gesbert, Marco Caccamo

    Abstract: Path planning methods for autonomous unmanned aerial vehicles (UAVs) are typically designed for one specific type of mission. This work presents a method for autonomous UAV path planning based on deep reinforcement learning (DRL) that can be applied to a wide range of mission scenarios. Specifically, we compare coverage path planning (CPP), where the UAV's goal is to survey an area of interest to… ▽ More

    Submitted 21 October, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

    Comments: ICAR 2021, code available at https://github.com/theilem/uavSim

  10. arXiv:2007.00544  [pdf, other

    cs.LG cs.IT cs.RO eess.SP stat.ML

    UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement Learning Approach

    Authors: Harald Bayerlein, Mirco Theile, Marco Caccamo, David Gesbert

    Abstract: Autonomous deployment of unmanned aerial vehicles (UAVs) supporting next-generation communication networks requires efficient trajectory planning methods. We propose a new end-to-end reinforcement learning (RL) approach to UAV-enabled data collection from Internet of Things (IoT) devices in an urban environment. An autonomous drone is tasked with gathering data from distributed sensor nodes subjec… ▽ More

    Submitted 26 October, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: Code available under https://github.com/hbayerlein/uav_data_harvesting, IEEE Global Communications Conference (GLOBECOM) 2020

  11. UAV Coverage Path Planning under Varying Power Constraints using Deep Reinforcement Learning

    Authors: Mirco Theile, Harald Bayerlein, Richard Nai, David Gesbert, Marco Caccamo

    Abstract: Coverage path planning (CPP) is the task of designing a trajectory that enables a mobile agent to travel over every point of an area of interest. We propose a new method to control an unmanned aerial vehicle (UAV) carrying a camera on a CPP mission with random start positions and multiple options for landing positions in an environment containing no-fly zones. While numerous approaches have been p… ▽ More

    Submitted 12 February, 2021; v1 submitted 5 March, 2020; originally announced March 2020.

    Comments: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  12. Evolutionary Algorithms and Dynamic Programming

    Authors: Benjamin Doerr, Anton Eremeev, Frank Neumann, Madeleine Theile, Christian Thyssen

    Abstract: Recently, it has been proven that evolutionary algorithms produce good results for a wide range of combinatorial optimization problems. Some of the considered problems are tackled by evolutionary algorithms that use a representation which enables them to construct solutions in a dynamic programming fashion. We take a general approach and relate the construction of such algorithms to the developmen… ▽ More

    Submitted 17 January, 2013; originally announced January 2013.

    Comments: This is an updated version of journal publication where few misprints are fixed

    Journal ref: Theoretical Computer Science, Vol. 412, Issue 43, 2011, P.6020-6035

  13. arXiv:1207.0369  [pdf, ps, other

    cs.NE

    More Effective Crossover Operators for the All-Pairs Shortest Path Problem

    Authors: Benjamin Doerr, Daniel Johannsen, Timo Kötzing, Frank Neumann, Madeleine Theile

    Abstract: The all-pairs shortest path problem is the first non-artificial problem for which it was shown that adding crossover can significantly speed up a mutation-only evolutionary algorithm. Recently, the analysis of this algorithm was refined and it was shown to have an expected optimization time (w.r.t. the number of fitness evaluations) of $Θ(n^{3.25}(\log n)^{0.25})$. In contrast to this simple alg… ▽ More

    Submitted 2 July, 2012; originally announced July 2012.