-
Edge Generation Scheduling for DAG Tasks Using Deep Reinforcement Learning
Authors:
Binqi Sun,
Mirco Theile,
Ziyuan Qin,
Daniele Bernardini,
Debayan Roy,
Andrea Bastoni,
Marco Caccamo
Abstract:
Directed acyclic graph (DAG) tasks are currently adopted in the real-time domain to model complex applications from the automotive, avionics, and industrial domains that implement their functionalities through chains of intercommunicating tasks. This paper studies the problem of scheduling real-time DAG tasks by presenting a novel schedulability test based on the concept of trivial schedulability. Using this schedulability test, we propose a new DAG scheduling framework (edge generation scheduling -- EGS) that attempts to minimize the DAG width by iteratively generating edges while guaranteeing the deadline constraint. We study how to efficiently solve the problem of generating edges by developing a deep reinforcement learning algorithm combined with a graph representation neural network to learn an efficient edge generation policy for EGS. We evaluate the effectiveness of the proposed algorithm by comparing it with state-of-the-art DAG scheduling heuristics and an optimal mixed-integer linear programming baseline. Experimental results show that the proposed algorithm outperforms the state-of-the-art by requiring fewer processors to schedule the same DAG tasks. The code is available at https://github.com/binqi-sun/egs.
Submitted 10 January, 2024; v1 submitted 28 August, 2023;
originally announced August 2023.
-
Learning to Generate All Feasible Actions
Authors:
Mirco Theile,
Daniele Bernardini,
Raphael Trumpp,
Cristina Piazza,
Marco Caccamo,
Alberto L. Sangiovanni-Vincentelli
Abstract:
Modern cyber-physical systems are becoming increasingly complex to model, thus motivating data-driven techniques such as reinforcement learning (RL) to find appropriate control agents. However, most systems are subject to hard constraints such as safety or operational bounds. Typically, to learn to satisfy these constraints, the agent must violate them systematically, which is computationally prohibitive in most systems. Recent efforts aim to utilize feasibility models that assess whether a proposed action is feasible to avoid applying the agent's infeasible action proposals to the system. However, these efforts focus on guaranteeing constraint satisfaction rather than the agent's learning efficiency. To improve the learning process, we introduce action mapping, a novel approach that divides the learning process into two steps: first learn feasibility and, subsequently, the objective by mapping actions into the sets of feasible actions. This paper focuses on the feasibility part by learning to generate all feasible actions through self-supervised querying of the feasibility model. We train the agent by formulating the problem as a distribution matching problem and deriving gradient estimators for different divergences. Through an illustrative example, a robotic path planning scenario, and a robotic grasping simulation, we demonstrate the agent's proficiency in generating actions across disconnected feasible action sets. By addressing the feasibility step, this paper makes it possible to focus future work on the objective part of action mapping, paving the way for an RL framework that is both safe and efficient.
Submitted 5 July, 2024; v1 submitted 26 January, 2023;
originally announced January 2023.
-
6IMPOSE: Bridging the Reality Gap in 6D Pose Estimation for Robotic Grasping
Authors:
Hongpeng Cao,
Lukas Dirnberger,
Daniele Bernardini,
Cristina Piazza,
Marco Caccamo
Abstract:
6D pose recognition has been a crucial factor in the success of robotic grasping, and recent deep learning based approaches have achieved remarkable results on benchmarks. However, their generalization capabilities in real-world applications remain unclear. To overcome this gap, we introduce 6IMPOSE, a novel framework for sim-to-real data generation and 6D pose estimation. 6IMPOSE consists of four modules: First, a data generation pipeline that employs the 3D software suite Blender to create synthetic RGBD image datasets with 6D pose annotations. Second, an annotated RGBD dataset of five household objects generated using the proposed pipeline. Third, a real-time two-stage 6D pose estimation approach that integrates the object detector YOLO-V4 and a streamlined, real-time version of the 6D pose estimation algorithm PVN3D optimized for time-sensitive robotics applications. Fourth, a codebase designed to facilitate the integration of the vision system into a robotic grasping experiment. Our approach demonstrates the efficient generation of large amounts of photo-realistic RGBD images and the successful transfer of the trained inference model to robotic grasping experiments, achieving an overall success rate of 87% in grasping five different household objects from cluttered backgrounds under varying lighting conditions. This is made possible by the fine-tuning of data generation and domain randomization techniques, and the optimization of the inference pipeline, overcoming the generalization and performance shortcomings of the original PVN3D algorithm. Finally, we make the code, synthetic dataset, and all the pretrained models available on GitHub.
Submitted 9 March, 2023; v1 submitted 30 August, 2022;
originally announced August 2022.
-
A machine-learning approach to synthesize virtual sensors for parameter-varying systems
Authors:
Daniele Masti,
Daniele Bernardini,
Alberto Bemporad
Abstract:
This paper introduces a novel model-free approach to synthesize virtual sensors for the estimation of dynamical quantities that are unmeasurable at runtime but are available for design purposes on test benches. After collecting a dataset of measurements of such quantities, together with other variables that are also available during on-line operations, the virtual sensor is obtained using machine learning techniques by training a predictor whose inputs are the measured variables and the features extracted by a bank of linear observers fed with the same measures. The approach is applicable to infer the value of quantities such as physical states and other time-varying parameters that affect the dynamics of the system. The proposed virtual sensor architecture - whose structure can be related to the Multiple Model Adaptive Estimation framework - is conceived to keep computational and memory requirements as low as possible, so that it can be efficiently implemented in embedded hardware platforms.
The effectiveness of the approach is shown in different numerical examples, involving the estimation of the scheduling parameter of a nonlinear parameter-varying system, the reconstruction of the mode of a switching linear system, and the estimation of the state of charge (SoC) of a lithium-ion battery.
Submitted 23 March, 2021;
originally announced March 2021.
-
Sigil3D: A Crowdsourcing Platform for Interactive 3D Content
Authors:
Andrea Barillari,
Daniele Bernardini,
Pierluigi Crescenzi
Abstract:
In this paper we propose applying the crowdsourcing approach to a software platform that uses a modern and state-of-the-art 3D game engine. This platform could facilitate the generation and manipulation of interactive 3D environments by a community of users producing different content such as cultural heritage, scientific virtual labs, games, novel art forms and virtual museums.
Submitted 8 March, 2017;
originally announced March 2017.
-
Nonlinear Volterra model of a loudspeaker behavior based on Laser Doppler Vibrometry
Authors:
Alessandro Loriga,
Parvin Moyassari,
Daniele Bernardini,
Gregorio Landi,
Francesca Venturini,
Elisabeth Dumont
Abstract:
We demonstrate the capabilities of nonlinear Volterra models to simulate the behavior of an audio system and compare them to linear filters. In this paper a nonlinear model of an audio system based on Volterra series is presented, and the Normalized Least Mean Square algorithm is used to determine the Volterra series to third order. Training data for the models were collected by measuring a physical speaker using a laser interferometer. We explore several training signals and filter parameters. Results indicate a decrease in Mean Squared Error compared to the linear model, with a dependency on the particular test signal, the order, and the parameters of the model.
Submitted 28 February, 2017;
originally announced March 2017.