-
Range Limited Coverage Control using Air-Ground Multi-Robot Teams
Authors:
Max Rudolph,
Sean Wilson,
Magnus Egerstedt
Abstract:
In this paper, we investigate how heterogeneous multi-robot systems with different sensing capabilities can observe a domain with an a priori unknown density function. Common coverage control techniques are targeted towards homogeneous teams of robots and do not consider what happens when the sensing capabilities of the robots are vastly different. This work proposes an extension to Lloyd's algorithm that fuses coverage information from heterogeneous robots with differing sensing capabilities to effectively observe a domain. Namely, we study a bimodal team of robots consisting of aerial and ground agents. In our problem formulation, we use aerial robots with coarse domain sensors to approximate the number of ground robots needed within their sensing region to effectively cover it. This information is relayed to ground robots, which perform an extension to Lloyd's algorithm that balances a locally focused coverage controller with a globally focused distribution controller. The stability of the extension to Lloyd's algorithm is proven, and its performance is evaluated through simulation and experiments using the Robotarium, a remotely accessible multi-robot testbed.
Submitted 12 June, 2023;
originally announced June 2023.
-
Quadratic Programming for Continuous Control of Safety-Critical Multi-Agent Systems Under Uncertainty
Authors:
Si Wu,
Tengfei Liu,
Magnus Egerstedt,
Zhong-Ping Jiang
Abstract:
This paper studies the control problem for safety-critical multi-agent systems based on quadratic programming (QP). Each controlled agent is modeled as a cascade connection of an integrator and an uncertain nonlinear actuation system. In particular, the integrator represents the position-velocity relation, and the actuation system describes the dynamic response of the actual velocity to the velocity reference signal. The notion of input-to-output stability (IOS) is employed to characterize the essential velocity-tracking capability of the actuation system. The uncertain actuation dynamics may cause infeasibility or discontinuous solutions of QP algorithms for collision avoidance. Also, the interaction between the controlled integrator and the uncertain actuation dynamics may lead to significant robustness issues. By using nonlinear control methods and numerical optimization methods, this paper first contributes a new feasible-set reshaping technique and a refined QP algorithm for feasibility, robustness, and local Lipschitz continuity. Then, we present a nonlinear small-gain analysis to handle the inherent interaction for guaranteed safety of the closed-loop multi-agent system. The proposed methods are illustrated by numerical simulations and a physical experiment.
Submitted 29 November, 2022;
originally announced November 2022.
-
Distributed Coverage Hole Prevention for Visual Environmental Monitoring with Quadcopters via Nonsmooth Control Barrier Functions
Authors:
Riku Funada,
María Santos,
Ryuichi Maniwa,
Junya Yamauchi,
Masayuki Fujita,
Mitsuji Sampei,
Magnus Egerstedt
Abstract:
This paper proposes a distributed coverage control strategy for quadcopters equipped with downward-facing cameras that prevents the appearance of unmonitored areas in between the quadcopters' fields of view (FOVs). We derive a necessary and sufficient condition for eliminating any unsurveilled area that may arise in between the FOVs among a trio of quadcopters by utilizing a power diagram, i.e. a weighted Voronoi diagram defined by radii of FOVs. Because this condition can be described as logically combined constraints, we leverage nonsmooth control barrier functions (NCBFs) to prevent the appearance of unmonitored areas among a team's FOV. We then investigate the symmetric properties of the proposed NCBFs to develop a distributed algorithm. The proposed algorithm can support the switching of the NCBFs caused by changes of the quadcopters composing trios. The existence of the control input satisfying NCBF conditions is analyzed by employing the characteristics of the power diagram. The proposed framework is synthesized with a coverage control law that maximizes the monitoring quality while reducing overlaps of FOVs. The proposed method is demonstrated in simulation and experiment.
Submitted 4 January, 2024; v1 submitted 5 November, 2022;
originally announced November 2022.
-
A Low-complexity Brain-computer Interface for High-complexity Robot Swarm Control
Authors:
Gregory Canal,
Yancy Diaz-Mercado,
Magnus Egerstedt,
Christopher Rozell
Abstract:
A brain-computer interface (BCI) is a system that allows a human operator to use only mental commands in controlling end effectors that interact with the world around them. Such a system consists of a measurement device to record the human user's brain activity, which is then processed into commands that drive a system end effector. BCIs involve either invasive measurements which allow for high-complexity control but are generally infeasible, or noninvasive measurements which offer lower quality signals but are more practical to use. In general, BCI systems have not been developed that efficiently, robustly, and scalably perform high-complexity control while retaining the practicality of noninvasive measurements. Here we leverage recent results from feedback information theory to fill this gap by modeling BCIs as a communications system and deploying a human-implementable interaction algorithm for noninvasive control of a high-complexity robot swarm. We construct a scalable dictionary of robotic behaviors that can be searched simply and efficiently by a BCI user, as we demonstrate through a large-scale user study testing the feasibility of our interaction algorithm, a user test of the full BCI system on (virtual and real) robot swarms, and simulations that verify our results against theoretical models. Our results provide a proof of concept for how a large class of high-complexity effectors (even beyond robotics) can be effectively controlled by a BCI system with low-complexity and noisy inputs.
Submitted 27 May, 2022;
originally announced May 2022.
-
Safe Reinforcement Learning Using Robust Control Barrier Functions
Authors:
Yousef Emam,
Gennaro Notomista,
Paul Glotfelter,
Zsolt Kira,
Magnus Egerstedt
Abstract:
Reinforcement Learning (RL) has been shown to be effective in many scenarios. However, it typically requires the exploration of a sufficiently large number of state-action pairs, some of which may be unsafe. Consequently, its application to safety-critical systems remains a challenge. An increasingly common approach to address safety involves the addition of a safety layer that projects the RL actions onto a safe set of actions. In turn, a difficulty for such frameworks is how to effectively couple RL with the safety layer to improve the learning performance. In this paper, we frame safety as a differentiable robust-control-barrier-function layer in a model-based RL framework. Moreover, we also propose an approach to modularly learn the underlying reward-driven task, independent of safety constraints. We demonstrate that this approach both ensures safety and effectively guides exploration during training in a range of experiments, including zero-shot transfer when the reward is learned in a modular way.
Submitted 22 June, 2022; v1 submitted 11 October, 2021;
originally announced October 2021.
-
Optimal Stochastic Evasive Maneuvers Using the Schrodinger's Equation
Authors:
Farhad Farokhi,
Magnus Egerstedt
Abstract:
In this paper, prey with stochastic evasion policies are considered. The stochasticity adds unpredictable changes to the prey's path to avoid a predator's attacks. The prey's cost function is composed of two terms balancing the unpredictability factor (using stochasticity to make it difficult for the predator to forecast the prey's future positions) and energy consumption (the least amount of energy required for performing a maneuver). The optimal probability density functions of the prey's actions for trading off unpredictability and energy consumption are shown to be characterized by the stationary Schrodinger's equation.
Submitted 10 October, 2021;
originally announced October 2021.
-
Model Free Barrier Functions via Implicit Evading Maneuvers
Authors:
Eric Squires,
Rohit Konda,
Samuel Coogan,
Magnus Egerstedt
Abstract:
This paper demonstrates that the safety override arising from the use of a barrier function can in some cases be needlessly restrictive. In particular, we examine the case of fixed-wing collision avoidance and show that when using a barrier function, there are cases where two fixed-wing aircraft can come closer to colliding than if there were no barrier function at all. In addition, we construct cases where the barrier function labels the system as unsafe even when the vehicles start arbitrarily far apart. In other words, the barrier function ensures safety but with unnecessary costs to performance. We therefore introduce model-free barrier functions, which take a data-driven approach to creating a barrier function. We demonstrate the effectiveness of model-free barrier functions in a collision avoidance simulation of two fixed-wing aircraft.
Submitted 23 September, 2022; v1 submitted 27 July, 2021;
originally announced July 2021.
-
A Resilient and Energy-Aware Task Allocation Framework for Heterogeneous Multi-Robot Systems
Authors:
Gennaro Notomista,
Siddharth Mayya,
Yousef Emam,
Christopher Kroninger,
Addison Bohannon,
Seth Hutchinson,
Magnus Egerstedt
Abstract:
In the context of heterogeneous multi-robot teams deployed for executing multiple tasks, this paper develops an energy-aware framework for allocating tasks to robots in an online fashion. With a primary focus on long-duration autonomy applications, we opt for a survivability-focused approach. Towards this end, the task prioritization and execution, through which the allocation of tasks to robots is effectively realized, are encoded as constraints within an optimization problem aimed at minimizing the energy consumed by the robots at each point in time. In this context, an allocation is interpreted as a prioritization of a task over all others by each of the robots. Furthermore, we present a novel framework to represent the heterogeneous capabilities of the robots by distinguishing between the features available on the robots and the capabilities enabled by these features. By embedding these descriptions within the optimization problem, we make the framework resilient both to situations where environmental conditions make certain features unsuitable to support a capability and to component failures on the robots. We demonstrate the efficacy and resilience of the proposed approach in a variety of use-case scenarios, consisting of simulations and real robot experiments.
Submitted 12 May, 2021;
originally announced May 2021.
-
Data-Driven Robust Barrier Functions for Safe, Long-Term Operation
Authors:
Yousef Emam,
Paul Glotfelter,
Sean Wilson,
Gennaro Notomista,
Magnus Egerstedt
Abstract:
Applications that require multi-robot systems to operate independently for extended periods of time in unknown or unstructured environments face a broad set of challenges, such as hardware degradation, changing weather patterns, or unfamiliar terrain. To operate effectively under these changing conditions, algorithms developed for long-term autonomy applications require a stronger focus on robustness. Consequently, this work considers the ability to satisfy the operation-critical constraints of a disturbed system in a modular fashion, which means compatibility with different system objectives and disturbance representations. Toward this end, this paper introduces a controller-synthesis approach to constraint satisfaction for disturbed control-affine dynamical systems by utilizing Control Barrier Functions (CBFs). The aforementioned framework is constructed by modelling the disturbance as a union of convex hulls and leveraging previous work on CBFs for differential inclusions. This method of disturbance modeling grants compatibility with different disturbance-estimation methods. For example, this work demonstrates how a disturbance learned via a Gaussian process may be utilized in the proposed framework. These estimated disturbances are incorporated into the proposed controller-synthesis framework which is then tested on a fleet of robots in different scenarios.
Submitted 15 April, 2021;
originally announced April 2021.
-
Data-Driven Adaptive Task Allocation for Heterogeneous Multi-Robot Teams Using Robust Control Barrier Functions
Authors:
Yousef Emam,
Gennaro Notomista,
Paul Glotfelter,
Magnus Egerstedt
Abstract:
Multi-robot task allocation is a ubiquitous problem in robotics due to its applicability in a variety of scenarios. Adaptive task-allocation algorithms account for unknown disturbances and unpredicted phenomena in the environment where robots are deployed to execute tasks. However, this adaptivity typically comes at the cost of requiring precise knowledge of robot models in order to evaluate the allocation effectiveness and to adjust the task assignment online. As such, environmental disturbances can significantly degrade the accuracy of the models which in turn negatively affects the quality of the task allocation. In this paper, we leverage Gaussian processes, differential inclusions, and robust control barrier functions to learn environmental disturbances in order to guarantee robust task execution. We show the implementation and the effectiveness of the proposed framework on a real multi-robot system.
Submitted 10 November, 2020; v1 submitted 2 November, 2020;
originally announced November 2020.
-
Safety With Limited Range Sensing Constraints For Fixed Wing Aircraft
Authors:
Eric Squires,
Rohit Konda,
Pietro Pierpaoli,
Samuel Coogan,
Magnus Egerstedt
Abstract:
In this paper we discuss how to use a barrier function that is subject to kinematic constraints and limited sensing in order to guarantee that fixed wing unmanned aerial vehicles (UAVs) will maintain safe distances from each other at all times despite being subject to limited range sensing constraints. Prior work has shown that a barrier function can be used to guarantee safe system operation when the state can be sensed at all times. However, in this paper we show that this construction does not guarantee safety when the UAVs are subject to limited range sensing. To resolve this issue, we introduce a method for constructing a new barrier function that accommodates limited sensing range from a previously existing barrier function that may not necessarily accommodate limited range sensing. We show that, under appropriate conditions, the newly constructed barrier function ensures system safety even in the presence of limited range sensing. We demonstrate the contribution of this paper in a scenario of 20 fixed wing aircraft, where because of the proposed algorithm, the vehicles are able to maintain safe distances from each other even though the vehicles are subject to limited range sensing.
Submitted 9 March, 2021; v1 submitted 21 October, 2020;
originally announced October 2020.
-
Neural Identification for Control
Authors:
Priyabrata Saha,
Magnus Egerstedt,
Saibal Mukhopadhyay
Abstract:
We present a new method for learning a control law that stabilizes an unknown nonlinear dynamical system at an equilibrium point. We formulate a system identification task in a self-supervised learning setting that jointly learns a controller and a corresponding stable closed-loop dynamics hypothesis. The input-output behavior of the unknown dynamical system under random control inputs is used as the supervising signal to train the neural network-based system model and the controller. The proposed method relies on Lyapunov stability theory to generate a stable closed-loop dynamics hypothesis and a corresponding control law. We demonstrate our method on various nonlinear control problems such as n-link pendulum balancing and trajectory tracking, pendulum on cart balancing, and wheeled vehicle path following.
Submitted 15 March, 2022; v1 submitted 24 September, 2020;
originally announced September 2020.
-
Resilient Monitoring in Heterogeneous Multi-robot Systems through Network Reconfiguration
Authors:
Ragesh K. Ramachandran,
Pietro Pierpaoli,
Magnus Egerstedt,
Gaurav S. Sukhatme
Abstract:
We propose a framework for resilience in a networked heterogeneous multi-robot team subject to resource failures. Each robot in the team is equipped with resources that it shares with its neighbors. Additionally, each robot in the team executes a task, whose performance depends on the resources to which it has access. When a resource on a particular robot becomes unavailable (e.g., a camera ceases to function), the team optimally reconfigures its communication network so that the robots affected by the failure can continue their tasks. We focus on a monitoring task, where robots individually estimate the state of an exogenous process. We encode the end-to-end effect of a robot's resource loss on the monitoring performance of the team by defining a new, stronger notion of observability: one-hop observability. By abstracting the impact that low-level individual resources have on the task performance through the notion of one-hop observability, our framework leads to the principled reconfiguration of information flow in the team to effectively replace the lost resource on one robot with information from another, as long as certain conditions are met. Network reconfiguration is converted to the problem of selecting edges to be modified in the system's communication graph after a resource failure has occurred. A controller based on finite-time convergence control barrier functions drives each robot to a spatial location that enables the communication links of the modified graph. We validate the effectiveness of our framework by deploying it on a team of differential-drive robots estimating the position of a group of quadrotors.
Submitted 6 August, 2021; v1 submitted 4 August, 2020;
originally announced August 2020.
-
Integral Control Barrier Functions for Dynamically Defined Control Laws
Authors:
Aaron D. Ames,
Gennaro Notomista,
Yorai Wardi,
Magnus Egerstedt
Abstract:
This paper introduces integral control barrier functions (I-CBFs) as a means to enable the safety-critical integral control of nonlinear systems. Importantly, I-CBFs allow for the holistic encoding of both state constraints and input bounds in a single framework. We demonstrate this by applying them to a dynamically defined tracking controller, thereby enforcing safety in state and input through a minimally invasive I-CBF controller framed as a quadratic program.
Submitted 7 July, 2020; v1 submitted 30 June, 2020;
originally announced June 2020.
-
Intersection-Traffic Control of Autonomous Vehicles using Newton-Raphson Flows and Barrier Functions
Authors:
Shashwat Shivam,
Yorai Wardi,
Magnus Egerstedt,
Aris Kanellopoulos,
Kyriakos G. Vamvoudakis
Abstract:
This paper concerns an application of a recently developed nonlinear tracking technique to trajectory control of autonomous vehicles at traffic intersections. The technique uses a flow version of the Newton-Raphson method for controlling a predicted system output to a future reference target. Its implementations are based on numerical solutions of ordinary differential equations, and it does not specify any particular method for computing its future reference trajectories. Consequently, it can use relatively simple algorithms on crude models for computing the target trajectories, and more accurate models and algorithms for trajectory control in the tight loop. We demonstrate this point by combining an extant predictive traffic planning-and-control method with our tracking technique. Furthermore, we guarantee safety specifications by applying the framework of control barrier functions to the tracking technique.
Submitted 21 April, 2020;
originally announced April 2020.
-
A Receding Horizon Scheduling Approach for Search & Rescue Scenarios
Authors:
Yousef Emam,
Sean Wilson,
Mathias Hakenberg,
Ulrich Munz,
Magnus Egerstedt
Abstract:
Many applications involving complex multi-task problems such as disaster relief, logistics and manufacturing necessitate the deployment and coordination of heterogeneous multi-agent systems due to the sheer number of tasks that must be executed simultaneously. A fundamental requirement for the successful coordination of such systems is leveraging the specialization of each agent within the team. This work presents a Receding Horizon Planning (RHP) framework aimed at scheduling tasks for heterogeneous multi-agent teams in a robust manner. In order to allow for the modular addition and removal of different types of agents to the team, the proposed framework accounts for the capabilities that each agent exhibits (e.g. quadrotors are agile and agnostic to rough terrain but are not suited to transport heavy payloads). An instantiation of the proposed RHP is developed and tested for a search and rescue scenario. Moreover, we present an abstracted search and rescue simulation environment, where a heterogeneous team of agents is deployed to simultaneously explore the environment, find and rescue trapped victims, and extinguish spreading fires as quickly as possible. We validate the effectiveness of our approach through extensive simulations comparing the presented framework with various planning horizons to a greedy task allocation scheme.
Submitted 5 April, 2020;
originally announced April 2020.
-
Distributed Collision-Free Motion Coordination on a Sphere: A Conic Control Barrier Function Approach
Authors:
Tatsuya Ibuki,
Sean Wilson,
Aaron D. Ames,
Magnus Egerstedt
Abstract:
This letter studies a distributed collision avoidance control problem for a group of rigid bodies on a sphere. A rigid body network, consisting of multiple rigid bodies constrained to a spherical surface and an interconnection topology, is first formulated. In this formulation, it is shown that motion coordination on a sphere is equivalent to attitude coordination on the 3-dimensional Special Orthogonal group. Then, an angle-based control barrier function that can handle a geodesic distance constraint on a spherical surface is presented. The proposed control barrier function is then extended to a relative motion case and applied to a collision avoidance problem for a rigid body network operating on a sphere. Each rigid body chooses its control input by solving a distributed optimization problem to achieve a nominal distributed motion coordination strategy while satisfying constraints for collision avoidance. The proposed collision-free motion coordination law is validated via simulation.
Submitted 22 June, 2020; v1 submitted 30 March, 2020;
originally announced March 2020.
-
Adaptive Task Allocation for Heterogeneous Multi-Robot Teams with Evolving and Unknown Robot Capabilities
Authors:
Yousef Emam,
Siddharth Mayya,
Gennaro Notomista,
Addison Bohannon,
Magnus Egerstedt
Abstract:
For multi-robot teams with heterogeneous capabilities, typical task allocation methods assign tasks to robots based on the suitability of the robots to perform certain tasks as well as the requirements of the task itself. However, in real-world deployments of robot teams, the suitability of a robot might be unknown prior to deployment, or might vary due to changing environmental conditions. This paper presents an adaptive task allocation and task execution framework which allows individual robots to prioritize among tasks while explicitly taking into account their efficacy at performing the tasks, the parameters of which might be unknown before deployment and/or might vary over time. Such a specialization parameter, encoding the effectiveness of a given robot towards a task, is updated on-the-fly, allowing our algorithm to reassign tasks among robots with the aim of executing them. The developed framework requires no explicit model of the changing environment or of the unknown robot capabilities; it only takes into account the progress made by the robots at completing the tasks. Simulations and experiments demonstrate the efficacy of the proposed approach during variations in environmental conditions and when robot capabilities are unknown before deployment.
Submitted 6 March, 2020;
originally announced March 2020.
-
Tracking Control by the Newton-Raphson Method with Output Prediction and Controller Speedup
Authors:
Yorai Wardi,
Carla Seatzu,
Jorge Cortes,
Magnus Egerstedt,
Shashwat Shivam,
Ian Buckley
Abstract:
This paper presents a control technique for output tracking of reference signals in continuous-time dynamical systems. The technique is comprised of the following three elements: (i) output prediction which has to track the reference signal, (ii) a controller based on an integrator with variable gain, and (iii) a speedup of the control action for enhancing the tracker's accuracy and, in some cases, guaranteeing stability of the closed-loop system. The technique is suitable for linear and nonlinear systems, implementable by simple algorithms, can track reference points as well as time-dependent reference signals, and may have large, even global domains of attraction. The derived theoretical results include convergence of the tracking controller and error analysis, and are supported by illustrative simulation and laboratory experiments.
Submitted 1 October, 2019;
originally announced October 2019.
-
Path Planning in Unknown Environments Using Optimal Transport Theory
Authors:
Haoyan Zhai,
Magnus Egerstedt,
Haomin Zhou
Abstract:
This paper introduces a graph-based, potential-guided method for path planning problems in unknown environments, where obstacles are unknown until the robots are in close proximity to the obstacle locations. Inspired by optimal transport theory, the proposed method generates a graph connecting the initial and target configurations, and then finds a path over the graph using the available environmental information. The graph and path are updated iteratively when newly encountered obstacle information becomes available. The resulting method is a deterministic procedure proven to be complete, i.e., it is guaranteed to find a feasible path, when one exists, in a finite number of iterations. The method is scalable to high-dimensional problems. In addition, our method does not search the entire domain for the path, instead, the algorithm only explores a sub-region that can be described by the evolution of the Fokker-Planck equation. We demonstrate the performance of our algorithm via several numerical examples with different environments and dimensions, including high-dimensional cases.
Submitted 24 September, 2019;
originally announced September 2019.
-
Inferring and Learning Multi-Robot Policies by Observing an Expert
Authors:
Pietro Pierpaoli,
Harish Ravichandar,
Nicholas Waytowich,
Anqi Li,
Derrik Asher,
Magnus Egerstedt
Abstract:
We present a technique for learning how to solve a multi-robot mission that requires interaction with an external environment by observing an expert system executing the same mission. We define the expert system as a team of robots equipped with a library of controllers, each designed to solve a specific task, supervised by an expert policy that appropriately selects controllers based on the states of robots and environment. The objective is for an un-trained team of robots (i.e., imitator system) equipped with the same library of controllers, but agnostic to the expert policy, to execute the mission, with performances comparable to those of the expert system. From un-annotated observations of the expert system, a multi-hypothesis filtering technique is used to estimate individual controllers executed by the expert policy. Then, the history of estimated controllers and environmental states is used to train a neural network policy for the imitator system. Considering a perimeter protection scenario on a team of differential-drive robots, we show that the learned policy endows the imitator system with performances comparable to those of the expert system.
Submitted 2 March, 2020; v1 submitted 17 September, 2019;
originally announced September 2019.
-
A Reinforcement Learning Framework for Sequencing Multi-Robot Behaviors
Authors:
Pietro Pierpaoli,
Thinh T. Doan,
Justin Romberg,
Magnus Egerstedt
Abstract:
Given a list of behaviors and associated parameterized controllers for solving different individual tasks, we study the problem of selecting an optimal sequence of coordinated behaviors in multi-robot systems for completing a given mission, which could not be handled by any single behavior. In addition, we are interested in the case where partial information of the underlying mission is unknown, therefore, the robots must cooperatively learn this information through their course of actions. Such problem can be formulated as an optimal decision problem in multi-robot systems, however, it is in general intractable due to modeling imperfections and the curse of dimensionality of the decision variables. To circumvent these issues, we first consider an alternate formulation of the original problem through introducing a sequence of behaviors' switching times. Our main contribution is then to propose a novel reinforcement learning based method, that combines Q-learning and online gradient descent, for solving this reformulated problem. In particular, the optimal sequence of the robots' behaviors is found by using Q-learning while the optimal parameters of the associated controllers are obtained through an online gradient descent method. Finally, to illustrate the effectiveness of our proposed method we implement it on a team of differential-drive robots for solving two different missions, namely, convoy protection and object manipulation.
Submitted 13 September, 2019; v1 submitted 12 September, 2019;
originally announced September 2019.
-
Robust Barrier Functions for a Fully Autonomous, Remotely Accessible Swarm-Robotics Testbed
Authors:
Yousef Emam,
Paul Glotfelter,
Magnus Egerstedt
Abstract:
The Robotarium, a remotely accessible swarm-robotics testbed, has provided free, open access to robotics and controls research for hundreds of users in thousands of experiments. This high level of usage requires autonomy in the system, which mainly corresponds to constraint satisfaction in the context of users' submissions. In other words, in case that the users' inputs to the robots may lead to collisions, these inputs must be altered to avoid these collisions automatically. However, these alterations must be minimal so as to preserve the users' objective in the experiment. Toward this end, the system has utilized barrier functions, which admit a minimally invasive controller-synthesis procedure. However, barrier functions are yet to be robustified with respect to unmodeled disturbances (e.g., wheel slip or packet loss) in a manner conducive to real-time synthesis. As such, this paper formulates robust barrier functions for a general class of disturbed control-affine systems that, in turn, is key for the Robotarium to operate fully autonomously (i.e., without human supervision). Experimental results showcase the effectiveness of this robust formulation in a long-term experiment in the Robotarium.
Submitted 6 September, 2019;
originally announced September 2019.
-
Constraint Learning for Control Tasks with Limited Duration Barrier Functions
Authors:
Motoya Ohnishi,
Gennaro Notomista,
Masashi Sugiyama,
Magnus Egerstedt
Abstract:
When deploying autonomous agents in unstructured environments over sustained periods of time, adaptability and robustness oftentimes outweigh optimality as a primary consideration. In other words, safety and survivability constraints play a key role and in this paper, we present a novel, constraint-learning framework for control tasks built on the idea of constraints-driven control. However, since control policies that keep a dynamical agent within state constraints over infinite horizons are not always available, this work instead considers constraints that can be satisfied over some finite time horizon T > 0, which we refer to as limited-duration safety. Consequently, value function learning can be used as a tool to help us find limited-duration safe policies. We show that, in some applications, the existence of limited-duration safe policies is actually sufficient for long-duration autonomy. This idea is illustrated on a swarm of simulated robots that are tasked with covering a given area, but that sporadically need to abandon this task to charge batteries. We show how the battery-charging behavior naturally emerges as a result of the constraints. Additionally, using a cart-pole simulation environment, we show how a control policy can be efficiently transferred from the source task, balancing the pole, to the target task, moving the cart to one direction without letting the pole fall down.
Submitted 6 April, 2021; v1 submitted 26 August, 2019;
originally announced August 2019.
-
A Sequential Composition Framework for Coordinating Multi-Robot Behaviors
Authors:
Pietro Pierpaoli,
Anqi Li,
Mohit Srinivasan,
Xiaoyi Cai,
Samuel Coogan,
Magnus Egerstedt
Abstract:
A number of coordinated behaviors have been proposed for achieving specific tasks for multi-robot systems. However, since most applications require more than one such behavior, one needs to be able to compose together sequences of behaviors while respecting local information flow constraints. Specifically, when the inter-agent communication depends on inter-robot distances, these constraints translate into particular configurations that must be reached in finite time in order for the system to be able to transition between the behaviors. To this end, we develop a distributed framework based on finite-time convergence control barrier functions that enables a team of robots to adjust its configuration in order to meet the communication requirements for the different tasks. In order to demonstrate the significance of the proposed framework, we implemented a full-scale scenario where a team of eight planar robots explore an urban environment in order to localize and rescue a subject.
Submitted 2 March, 2020; v1 submitted 17 July, 2019;
originally announced July 2019.
-
Composition of Safety Constraints For Fixed-Wing Collision Avoidance Amidst Limited Communications
Authors:
Eric Squires,
Pietro Pierpaoli,
Rohit Konda,
Samuel Coogan,
Magnus Egerstedt
Abstract:
This paper considers how to ensure that a system of fixed wing Unmanned Aerial Vehicles (UAVs) can avoid collisions. To do so we develop a novel method for creating a barrier function, which is similar to a Lyapunov function and can be used to ensure that a system can stay safe for all future times. After introducing the general approach, it is shown how to ensure that collision avoidance for two vehicles can be guaranteed for all future times. The construction is then extended to the case of arbitrarily many vehicles by addressing how to satisfy multiple safety objectives simultaneously. We do this while ensuring output actuator commands are within specified limits. Because this formulation requires communication of control values and may therefore reduce throughput of other important messages, we then show how to reformulate the solution without this significant communication overhead while still ensuring safety is maintained and actuator limits are respected. We validate the theoretical developments of this paper in the simulator SCRIMMAGE with a simulation of 20 UAVs that maintain safe distances from each other even though their nominal paths would otherwise cause a collision.
Submitted 21 July, 2021; v1 submitted 9 June, 2019;
originally announced June 2019.
-
Herdability of Linear Systems Based on Sign Patterns and Graph Structures
Authors:
Sebastian F Ruf,
Magnus Egerstedt,
Jeff S. Shamma
Abstract:
We consider the notion of herdability, a set-based reachability condition, which asks whether the state of a system can be controlled to be element-wise larger than a non-negative threshold. First a number of foundational results on herdability of a continuous time, linear time invariant system are presented. These show that the herdability of a linear system can be determined based on certain matrices, such as the controllability matrix, which arise in the study of controllability of linear systems. Second, the relationship between the sign pattern of the underlying graph structure of a system and the herdability properties of the system is investigated. In doing so the notion of sign herdability is introduced which captures classes of systems whose sign pattern determines their herdability. We identify a set of conditions, first on the sign pattern of the controllability matrix and then on the underlying graph structure, that ensure that the system is sign herdable.
Submitted 16 April, 2019;
originally announced April 2019.
-
Passivity-Based Decentralized Control of Multi-Robot Systems With Delays Using Control Barrier Functions
Authors:
Gennaro Notomista,
Xiaoyi Cai,
Junya Yamauchi,
Magnus Egerstedt
Abstract:
In this paper, we present a solution to the problem of coordinating multiple robots across a communication channel that experiences delays. The proposed approach leverages control barrier functions in order to ensure that the multi-robot system remains dissipative. This is achieved by encoding the dissipativity-preserving condition as a set invariance constraint. This constraint is then included in an optimization problem, whose objective is that of modifying, in a minimally invasive fashion, the nominal input to the robots. The formulated optimization problem is decentralized in the sense that, in order to be solved, it does not require the individual robots to have access to global information. Moreover, thanks to its convexity, each robot can solve it using fast and efficient algorithms. The effectiveness of the proposed control framework is demonstrated through the implementation of a formation control algorithm in presence of delays on a team of mobile robots.
Submitted 3 September, 2019; v1 submitted 9 April, 2019;
originally announced April 2019.
-
Collective motion planning for a group of robots using intermittent diffusion
Authors:
Christina Frederick,
Magnus Egerstedt,
Haomin Zhou
Abstract:
In this work we establish a simple yet effective strategy, based on optimal transport theory, for enabling a group of robots to accomplish complex tasks, such as shape formation and assembly. We demonstrate the feasibility of this approach and rigorously prove collision avoidance and convergence properties of the proposed algorithms.
Submitted 16 October, 2020; v1 submitted 4 April, 2019;
originally announced April 2019.
-
Stable, Concurrent Controller Composition for Multi-Objective Robotic Tasks
Authors:
Anqi Li,
Ching-An Cheng,
Byron Boots,
Magnus Egerstedt
Abstract:
Robotic systems often need to consider multiple tasks concurrently. This challenge calls for controller synthesis algorithms that fulfill multiple control specifications while maintaining the stability of the overall system. In this paper, we decompose multi-objective tasks into subtasks, where individual subtask controllers are designed independently and then combined to generate the overall control policy. In particular, we adopt Riemannian Motion Policies (RMPs), a recently proposed controller structure in robotics, and, RMPflow, its associated computational framework for combining RMP controllers. We re-establish and extend the stability results of RMPflow through a rigorous Control Lyapunov Function (CLF) treatment. We then show that RMPflow can stably combine individually designed subtask controllers that satisfy certain CLF constraints. This new insight leads to an efficient CLF-based computational framework to generate stable controllers that consider all the subtasks simultaneously. Compared with the original usage of RMPflow, our framework provides users the flexibility to incorporate design heuristics through nominal controllers for the subtasks. We validate the proposed computational framework through numerical simulation and robotic implementation.
Submitted 30 August, 2019; v1 submitted 29 March, 2019;
originally announced March 2019.
-
From Motions to Emotions: Can the Fundamental Emotions be Expressed in a Robot Swarm?
Authors:
María Santos,
Magnus Egerstedt
Abstract:
This paper explores the expressive capabilities of a swarm of miniature mobile robots within the context of inter-robot interactions and their mapping to the so-called fundamental emotions. In particular, we investigate how motion and shape descriptors that are psychologically associated with different emotions can be incorporated into different swarm behaviors for the purpose of artistic expositions. Based on these characterizations from social psychology, a set of swarm behaviors is created, where each behavior corresponds to a fundamental emotion. The effectiveness of these behaviors is evaluated in a survey in which the participants are asked to associate different swarm behaviors with the fundamental emotions. The results of the survey show that most of the research participants assigned to each video the emotion intended to be portrayed by design. These results confirm that abstract descriptors associated with the different fundamental emotions in social psychology provide useful motion characterizations that can be effectively transformed into expressive behaviors for a swarm of simple ground mobile robots.
Submitted 3 March, 2020; v1 submitted 28 March, 2019;
originally announced March 2019.
-
Control Barrier Functions: Theory and Applications
Authors:
Aaron D. Ames,
Samuel Coogan,
Magnus Egerstedt,
Gennaro Notomista,
Koushil Sreenath,
Paulo Tabuada
Abstract:
This paper provides an introduction and overview of recent work on control barrier functions and their use to verify and enforce safety properties in the context of (optimization based) safety-critical controllers. We survey the main technical results and discuss applications to several domains including robotic systems.
Submitted 26 March, 2019;
originally announced March 2019.
-
An Optimal Task Allocation Strategy for Heterogeneous Multi-Robot Systems
Authors:
Gennaro Notomista,
Siddharth Mayya,
Seth Hutchinson,
Magnus Egerstedt
Abstract:
For a team of heterogeneous robots executing multiple tasks, we propose a novel algorithm to optimally allocate tasks to robots while accounting for their different capabilities. Motivated by the need that robot teams have in many real-world applications of remaining operational for long periods of time, we allow each robot to choose tasks taking into account the energy consumed by executing them, besides the global specifications on the task allocation. The tasks are encoded as constraints in an energy minimization problem solved at each point in time by each robot. The prioritization of a task over others -- effectively signifying the allocation of the task to that particular robot -- occurs via the introduction of slack variables in the task constraints. Moreover, the suitabilities of certain robots towards certain tasks are also taken into account to generate a task allocation algorithm for a team of robots with heterogeneous capabilities. The efficacy of the developed approach is demonstrated both in simulation and on a team of real robots.
Submitted 2 September, 2019; v1 submitted 20 March, 2019;
originally announced March 2019.
-
Persistification of Robotic Tasks
Authors:
Gennaro Notomista,
Magnus Egerstedt
Abstract:
In this paper we propose a control framework that enables robots to execute tasks persistently, i.e., over time horizons much longer than robots' battery life. This is achieved by ensuring that the energy stored in the batteries of the robots is never depleted. This is framed as a set invariance constraint in an optimization problem whose objective is that of minimizing the difference between the robots' control inputs and nominal control inputs corresponding to the task that is to be executed. We refer to this process as the persistification of a robotic task. Forward invariance of subsets of the state space of the robots is turned into a control input constraint by using control barrier functions. The solution of the formulated optimization problem with energy constraints ensures that the robotic task is persistent. To illustrate the operation of the proposed framework, we consider two tasks whose persistent execution is particularly relevant: environment exploration and environment surveillance. We show the persistification of these two tasks both in simulation and on a team of wheeled mobile robots on the Robotarium.
Submitted 31 October, 2019; v1 submitted 14 March, 2019;
originally announced March 2019.
-
A Study of a Class of Vibration-Driven Robots: Modeling, Analysis, Control and Design of the Brushbot
Authors:
Gennaro Notomista,
Siddharth Mayya,
Anirban Mazumdar,
Seth Hutchinson,
Magnus Egerstedt
Abstract:
In this paper we present a study of a specific class of vibration-driven robots: the brushbots. In a bottom-up fashion, we start by deriving dynamic models of the brushes and we discuss the conditions under which these models can be employed to describe the motion of brushbots. Then, we present two designs of brushbots: a fully-actuated platform and a differential-drive-like one. The former is employed to experimentally validate both the developed theoretical models and the devised motion control algorithms. Finally, a coordinated-control algorithm is implemented on a swarm of differential-drive-like brushbots in order to demonstrate the design simplicity and robustness that can be achieved employing a vibration-based locomotion strategy.
Submitted 2 March, 2019; v1 submitted 27 February, 2019;
originally announced February 2019.
-
Non-Uniform Robot Densities in Vibration Driven Swarms Using Phase Separation Theory
Authors:
Siddharth Mayya,
Gennaro Notomista,
Dylan Shell,
Seth Hutchinson,
Magnus Egerstedt
Abstract:
In robot swarms operating under highly restrictive sensing and communication constraints, individuals may need to use direct physical proximity to facilitate information exchange. However, in certain task-related scenarios, this requirement might conflict with the need for robots to spread out in the environment, e.g., for distributed sensing or surveillance applications. This paper demonstrates how a swarm of minimally-equipped robots can form high-density robot aggregates which coexist with lower robot densities in the domain. We envision a scenario where a swarm of vibration-driven robots---which sit atop bristles and achieve directed motion by vibrating them---move somewhat randomly in an environment while colliding with each other. Theoretical techniques from the study of far-from-equilibrium collectives and statistical mechanics clarify the mechanisms underlying the formation of these high and low density regions. Specifically, we capitalize on a transformation that connects the collective properties of a system of self-propelled particles with that of a well-studied molecular fluid system, thereby inheriting the rich theory of equilibrium thermodynamics. This connection is a formal one and is a relatively recent result in studies of motility induced phase separation; it is previously unexplored in the context of robotics. Real robot experiments as well as simulations illustrate how inter-robot collisions can precipitate the formation of non-uniform robot densities in a closed and bounded region.
Submitted 2 March, 2019; v1 submitted 27 February, 2019;
originally announced February 2019.
-
Multi-Objective Policy Generation for Multi-Robot Systems Using Riemannian Motion Policies
Authors:
Anqi Li,
Mustafa Mukadam,
Magnus Egerstedt,
Byron Boots
Abstract:
In many applications, multi-robot systems are required to achieve multiple objectives. For these multi-objective tasks, it is oftentimes hard to design a single control policy that fulfills all the objectives simultaneously. In this paper, we focus on multi-objective tasks that can be decomposed into a set of simple subtasks. Controllers for these subtasks are individually designed and then combined into a control policy for the entire team. One significant feature of our work is that the subtask controllers are designed along with their underlying manifolds. When a controller is combined with other controllers, their associated manifolds are also taken into account. This formulation yields a policy generation framework for multi-robot systems that can combine controllers for a variety of objectives while implicitly handling the interaction among robots and subtasks. To describe controllers on manifolds, we adopt Riemannian Motion Policies (RMPs), and propose a collection of RMPs for common multi-robot subtasks. Centralized and decentralized algorithms are designed to combine these RMPs into a final control policy. Theoretical analysis shows that the system under the control policy is stable. Moreover, we prove that many existing multi-robot controllers can be closely approximated by the framework. The proposed algorithms are validated through both simulated tasks and robotic implementations.
Submitted 30 August, 2019; v1 submitted 13 February, 2019;
originally announced February 2019.
-
Voluntary Retreat for Decentralized Interference Reduction in Robot Swarms
Authors:
Siddharth Mayya,
Pietro Pierpaoli,
Magnus Egerstedt
Abstract:
In densely-packed robot swarms operating in confined regions, spatial interference -- which manifests itself as a competition for physical space -- forces robots to spend more time navigating around each other rather than performing the primary task. This paper develops a decentralized algorithm that enables individual robots to decide whether to stay in the region and contribute to the overall mission, or vacate the region so as to reduce the negative effects that interference has on the overall efficiency of the swarm. We develop this algorithm in the context of a distributed collection task, where a team of robots collect and deposit objects from one set of locations to another in a given region. Robots do not communicate and use only binary information regarding the presence of other robots around them to make the decision to stay or retreat. We illustrate the efficacy of the algorithm with experiments on a team of real robots.
Submitted 10 March, 2019; v1 submitted 5 December, 2018;
originally announced December 2018.
-
Tracking Control by the Newton-Raphson Flow: Applications to Autonomous Vehicles
Authors:
Shashwat Shivam,
Ian Buckley,
Yorai Wardi,
Carla Seatzu,
Magnus Egerstedt
Abstract:
This paper concerns applications of a recently-developed output-tracking technique to trajectory control of autonomous vehicles. The technique is based on three principles: Newton-Raphson flow for solving algebraic equations, output prediction, and controller speedup. Early applications of the technique, made to simple systems of an academic nature, were implemented by simple algorithms requiring modest computational efforts. In contrast, this paper tests it on commonly-used dynamic models to see if it can handle more complex control scenarios. Results are derived from simulations as well as a laboratory setting, and they indicate effective tracking convergence despite the simplicity of the control algorithm.
Submitted 5 March, 2019; v1 submitted 19 November, 2018;
originally announced November 2018.
-
Constraint-Driven Coordinated Control of Multi-Robot Systems
Authors:
Gennaro Notomista,
Magnus Egerstedt
Abstract:
In this paper we present a reformulation--framed as a constrained optimization problem--of multi-robot tasks which are encoded through a cost function that is to be minimized. The advantages of this approach are multiple. The constraint-based formulation provides a natural way of enabling long-term robot autonomy applications, where resilience and adaptability to changing environmental conditions are essential. Moreover, under certain assumptions on the cost function, the resulting controller is guaranteed to be decentralized. Furthermore, finite-time convergence can be achieved, while using local information only, and therefore preserving the decentralized nature of the algorithm. The developed control framework has been tested on a team of ground mobile robots implementing long-term environmental monitoring.
Submitted 2 September, 2019; v1 submitted 4 November, 2018;
originally announced November 2018.
-
Control of Multi-Agent Systems with Finite Time Control Barrier Certificates and Temporal Logic
Authors:
Mohit Srinivasan,
Samuel Coogan,
Magnus Egerstedt
Abstract:
In this paper, a method to synthesize controllers using finite time convergence control barrier functions guided by linear temporal logic specifications for continuous time multi-agent dynamical systems is proposed. Finite time convergence to a desired set in the state space is guaranteed under the existence of a suitable finite time convergence control barrier function. In addition, these barrier functions also guarantee forward invariance once the system converges to the desired set. This allows us to formulate a theoretical framework which synthesizes controllers for the multi-agent system. These properties also enable us to solve the reachability problem in continuous time by formulating a theorem on the composition of multiple finite time convergence control barrier functions. This approach is more flexible than existing methods and also allows for a greater set of feasible control laws. Linear temporal logic is used to specify complex task specifications that need to be satisfied by the multi-agent system. With this solution methodology, a control law is synthesized that satisfies the given temporal logic task specification. Robotic experiments are provided which were performed on the Robotarium multi-robot testbed at Georgia Tech.
Submitted 7 August, 2018;
originally announced August 2018.
-
Fault Tolerant Control for Networked Mobile Robots
Authors:
Pietro Pierpaoli,
Dominique Sauter,
Magnus Egerstedt
Abstract:
Teams of networked autonomous agents have been used in a number of applications, such as mobile sensor networks and intelligent transportation systems. However, in such systems, the effect of faults and errors in one or more of the sub-systems can easily spread throughout the network, quickly degrading the performance of the entire system. In consensus-driven dynamics, the effects of faults are particularly relevant because of the presence of unconstrained rigid modes in the transfer function of the system. Here, we propose a two-stage technique for the identification and accommodation of a biased-measurements agent, in a network of mobile robots with time invariant interaction topology. We assume these interactions to only take place in the form of relative position measurements. A fault identification filter deployed on a single observer agent is used to estimate a single fault occurring anywhere in the network. Once the fault is detected, an optimal leader-based accommodation strategy is initiated. Results are presented by means of numerical simulations and robot experiments.
Submitted 29 September, 2018; v1 submitted 6 June, 2018;
originally announced June 2018.
-
A Distributed Version of the Hungarian Method for Multi-Robot Assignment
Authors:
Smriti Chopra,
Giuseppe Notarstefano,
Matthew Rice,
Magnus Egerstedt
Abstract:
In this paper, we propose a distributed version of the Hungarian Method to solve the well known assignment problem. In the context of multi-robot applications, all robots cooperatively compute a common assignment that optimizes a given global criterion (e.g. the total distance traveled) within a finite set of local computations and communications over a peer-to-peer network. As a motivating application, we consider a class of multi-robot routing problems with "spatio-temporal" constraints, i.e. spatial targets that require servicing at particular time instants. As a means of demonstrating the theory developed in this paper, the robots cooperatively find online, suboptimal routes by applying an iterative version of the proposed algorithm, in a distributed and dynamic setting. As a concrete experimental test-bed, we provide an interactive "multi-robot orchestral" framework in which a team of robots cooperatively plays a piece of music on a so-called orchestral floor.
Submitted 22 May, 2018;
originally announced May 2018.
-
Herdable Systems Over Signed, Directed Graphs
Authors:
Sebastian F. Ruf,
Magnus Egerstedt,
Jeff S. Shamma
Abstract:
This paper considers the notion of herdability, a set-based reachability condition, which asks whether the state of a system can be controlled to be element-wise larger than a non-negative threshold. The basic theory of herdable systems is presented, including a necessary and sufficient condition for herdability. This paper then considers the impact of the underlying graph structure of a linear system on the herdability of the system, for the case where the graph is represented as signed and directed. By classifying nodes based on the length and sign of walks from an input, we find a class of completely herdable systems as well as provide a complete characterization of nodes that can be herded in systems with an underlying graph that is a directed out-branching rooted at a single input.
Submitted 11 April, 2018;
originally announced April 2018.
-
Permissive Barrier Certificates for Safe Stabilization Using Sum-of-squares
Authors:
Li Wang,
Dongkun Han,
Magnus Egerstedt
Abstract:
Motivated by the need to simultaneously guarantee safety and stability of safety-critical dynamical systems, we construct permissive barrier certificates in this paper that explicitly maximize the region where the system can be stabilized without violating safety constraints. An optimization strategy is developed to search for the maximum volume barrier certified region of safe stabilization. The barrier certified region, which is allowed to take any arbitrary shape, is proved to be strictly larger than safe regions generated with Lyapunov sublevel set based methods. The proposed approach effectively unites a Lyapunov function with multiple barrier functions that might not be compatible with each other. Iterative search algorithms are developed using sum-of-squares to compute the most permissive, that is, the maximum volume, barrier certificates. Simulation results of the iterative search algorithm demonstrate the effectiveness of the proposed method.
Submitted 24 February, 2018;
originally announced February 2018.
-
Coverage Control for Wire-Traversing Robots
Authors:
Gennaro Notomista,
Magnus Egerstedt
Abstract:
In this paper we consider the coverage control problem for a team of wire-traversing robots. The two-dimensional motion of robots moving in a planar environment has to be projected to one-dimensional manifolds representing the wires. Starting from Lloyd's descent algorithm for coverage control, a solution that generates continuous motion of the robots on the wires is proposed. This is realized by means of a Continuous Onto Wires (COW) map: the robots' workspace is mapped onto the wires on which the motion of the robots is constrained to be. A final projection step is introduced to ensure that the configuration of the robots on the wires is a local minimizer of the constrained locational cost. An algorithm for the continuous constrained coverage control problem is proposed and it is tested both in simulation and on a team of mobile robots.
Submitted 24 February, 2018;
originally announced February 2018.
-
A Parametric MPC Approach to Balancing the Cost of Abstraction for Differential-Drive Mobile Robots
Authors:
Paul Glotfelter,
Magnus Egerstedt
Abstract:
When designing control strategies for differential-drive mobile robots, one standard tool is the consideration of a point at a fixed distance along a line orthogonal to the wheel axis instead of the full pose of the vehicle. This abstraction supports replacing the non-holonomic, three-state unicycle model with a much simpler two-state single-integrator model (i.e., a velocity-controlled point). Yet this transformation comes at a performance cost, through the robot's precision and maneuverability. This work contains derivations for expressions of these precision and maneuverability costs in terms of the transformation's parameters. Furthermore, these costs show that only selecting the parameter once over the course of an application may cause an undue loss of precision. Model Predictive Control (MPC) represents one such method to ameliorate this condition. However, MPC typically realizes a control signal, rather than a parameter, so this work also proposes a Parametric Model Predictive Control (PMPC) method for parameter and sampling horizon optimization. Experimental results are presented that demonstrate the effects of the parameterization on the deployment of algorithms developed for the single-integrator model on actual differential-drive mobile robots.
Submitted 20 February, 2018;
originally announced February 2018.
-
Barrier-Certified Adaptive Reinforcement Learning with Applications to Brushbot Navigation
Authors:
Motoya Ohnishi,
Li Wang,
Gennaro Notomista,
Magnus Egerstedt
Abstract:
This paper presents a safe learning framework that employs an adaptive model learning algorithm together with barrier certificates for systems with possibly nonstationary agent dynamics. To extract the dynamic structure of the model, we use a sparse optimization technique. We use the learned model in combination with control barrier certificates which constrain policies (feedback controllers) in order to maintain safety, which refers to avoiding particular undesirable regions of the state space. Under certain conditions, recovery of safety in the sense of Lyapunov stability after violations of safety due to the nonstationarity is guaranteed. In addition, we reformulate an action-value function approximation to make any kernel-based nonlinear function estimation method applicable to our adaptive learning framework. Lastly, solutions to the barrier-certified policy optimization are guaranteed to be globally optimal, ensuring the greedy policy improvement under mild conditions. The resulting framework is validated via simulations of a quadrotor, which has previously been used under stationarity assumptions in the safe learnings literature, and is then tested on a real robot, the brushbot, whose dynamics is unknown, highly complex and nonstationary.
Submitted 6 August, 2019; v1 submitted 29 January, 2018;
originally announced January 2018.
-
Safe Learning of Quadrotor Dynamics Using Barrier Certificates
Authors:
Li Wang,
Evangelos A. Theodorou,
Magnus Egerstedt
Abstract:
To effectively control complex dynamical systems, accurate nonlinear models are typically needed. However, these models are not always known. In this paper, we present a data-driven approach based on Gaussian processes that learns models of quadrotors operating in partially unknown environments. What makes this challenging is that if the learning process is not carefully controlled, the system will go unstable, i.e., the quadcopter will crash. To this end, barrier certificates are employed for safe learning. The barrier certificates establish a non-conservative forward invariant safe region, in which high probability safety guarantees are provided based on the statistics of the Gaussian Process. A learning controller is designed to efficiently explore those uncertain states and expand the barrier certified safe region based on an adaptive sampling scheme. In addition, a recursive Gaussian Process prediction method is developed to learn the complex quadrotor dynamics in real-time. Simulation results are provided to demonstrate the effectiveness of the proposed approach.
Submitted 15 October, 2017;
originally announced October 2017.
-
Differentially Private Cloud-Based Multi-Agent Optimization with Constraints
Authors:
Matthew Hale,
Magnus Egerstedt
Abstract:
We present an optimization framework that solves constrained multi-agent optimization problems while keeping each agent's state differentially private. The agents in the network seek to optimize a local objective function in the presence of global constraints. Agents communicate only through a trusted cloud computer and the cloud also performs computations based on global information. The cloud computer modifies the results of such computations before they are sent to the agents in order to guarantee that the agents' states are kept private. We show that under mild conditions each agent's optimization problem converges in mean-square to its unique solution while each agent's state is kept differentially private. A numerical simulation is provided to demonstrate the viability of this approach.
Submitted 28 August, 2017;
originally announced August 2017.