-
Geometric PID Controller for Stabilization of Nonholonomic Mechanical Systems on Lie Groups
Authors:
Rama Seshan,
Ravi N Banavar,
D. H. S. Maithripala,
Arun D. Mahindrakar
Abstract:
The PID controller is an elegant and versatile controller for set point tracking in double integrator systems of which mechanical systems evolving on Euclidean space constitute a large class. But since mechanical systems are typically constrained interconnections of rigid bodies whose configuration space is $SE(3)$, which is not even topologically Euclidean, a geometric PID controller has been dev…
▽ More
The PID controller is an elegant and versatile controller for set point tracking in double integrator systems of which mechanical systems evolving on Euclidean space constitute a large class. But since mechanical systems are typically constrained interconnections of rigid bodies whose configuration space is $SE(3)$, which is not even topologically Euclidean, a geometric PID controller has been developed for mechanical systems evolving on Lie groups. In this work, we extend the framework to such systems which have nonholonomic constraints. It encompasses many practically applicable mechanical systems encountered in robotics as robots are constrained interconnections of rigid bodies where the constraints could either be holonomic or nonholonomic.
△ Less
Submitted 13 November, 2021;
originally announced November 2021.
-
Stabilizing a spherical pendulum on a quadrotor
Authors:
Aradhana Nayak,
Ravi Banavar,
D. H. S Maithripala
Abstract:
In this article we design a backstep** control law based on geometric principles to swing up a spherical pendulum mounted on a moving quadrotor. The available degrees of freedom in the control vector also permit us to position the plane of the quadrotor parallel to the ground. The problem addressed here is, indeed, novel and has many practical applications which arise during the transport of a p…
▽ More
In this article we design a backstep** control law based on geometric principles to swing up a spherical pendulum mounted on a moving quadrotor. The available degrees of freedom in the control vector also permit us to position the plane of the quadrotor parallel to the ground. The problem addressed here is, indeed, novel and has many practical applications which arise during the transport of a payload mounted on top of a quadrotor. The modeling and control law are coordinate-free and thus avoid singularity issues. The geometric treatment of the problem greatly simplifies both the modeling and control law for the system. The control action is verified and supported by numerical experiments for aggressive manoeuvres starting very close to the downward stable equilibrium position of the pendulum.
△ Less
Submitted 22 July, 2021; v1 submitted 13 July, 2020;
originally announced July 2020.
-
A Decentralized Policy with Logarithmic Regret for a Class of Multi-Agent Multi-Armed Bandit Problems with Option Unavailability Constraints and Stochastic Communication Protocols
Authors:
Pathmanathan Pankayaraj,
D. H. S. Maithripala,
J. M. Berg
Abstract:
This paper considers a multi-armed bandit (MAB) problem in which multiple mobile agents receive rewards by sampling from a collection of spatially dispersed stochastic processes, called bandits. The goal is to formulate a decentralized policy for each agent, in order to maximize the total cumulative reward over all agents, subject to option availability and inter-agent communication constraints. T…
▽ More
This paper considers a multi-armed bandit (MAB) problem in which multiple mobile agents receive rewards by sampling from a collection of spatially dispersed stochastic processes, called bandits. The goal is to formulate a decentralized policy for each agent, in order to maximize the total cumulative reward over all agents, subject to option availability and inter-agent communication constraints. The problem formulation is motivated by applications in which a team of autonomous mobile robots cooperates to accomplish an exploration and exploitation task in an uncertain environment. Bandit locations are represented by vertices of the spatial graph. At any time, an agent's option consist of sampling the bandit at its current location, or traveling along an edge of the spatial graph to a new bandit location. Communication constraints are described by a directed, non-stationary, stochastic communication graph. At any time, agents may receive data only from their communication graph in-neighbors. For the case of a single agent on a fully connected spatial graph, it is known that the expected regret for any optimal policy is necessarily bounded below by a function that grows as the logarithm of time. A class of policies called upper confidence bound (UCB) algorithms asymptotically achieve logarithmic regret for the classical MAB problem. In this paper, we propose a UCB-based decentralized motion and option selection policy and a non-stationary stochastic communication protocol that guarantee logarithmic regret. To our knowledge, this is the first such decentralized policy for non-fully connected spatial graphs with communication constraints. When the spatial graph is fully connected and the communication graph is stationary, our decentralized algorithm matches or exceeds the best reported prior results from the literature.
△ Less
Submitted 31 March, 2020; v1 submitted 29 March, 2020;
originally announced March 2020.
-
A Decentralized Communication Policy for Multi Agent Multi Armed Bandit Problems
Authors:
Pathmanathan Pankayaraj,
D. H. S. Maithripala
Abstract:
This paper proposes a novel policy for a group of agents to, individually as well as collectively, solve a multi armed bandit (MAB) problem. The policy relies solely on the information that an agent has obtained through sampling of the options on its own and through communication with neighbors. The option selection policy is based on an Upper Confidence Based (UCB) strategy while the communicatio…
▽ More
This paper proposes a novel policy for a group of agents to, individually as well as collectively, solve a multi armed bandit (MAB) problem. The policy relies solely on the information that an agent has obtained through sampling of the options on its own and through communication with neighbors. The option selection policy is based on an Upper Confidence Based (UCB) strategy while the communication strategy that is proposed forces agents to communicate with other agents who they believe are most likely to be exploring than exploiting. The overall strategy is shown to significantly outperform an independent Erdős-Rényi (ER) graph based random communication policy. The policy is shown to be cost effective in terms of communication and thus to be easily scalable to a large network of agents.
△ Less
Submitted 21 February, 2020; v1 submitted 7 October, 2019;
originally announced October 2019.
-
A bundle framework for observer design on smooth manifolds with symmetry
Authors:
Anant A. Joshi,
D. H. S. Maithripala,
Ravi N. Banavar
Abstract:
The article presents a bundle framework for nonlinear observer design on a manifold with a Lie group action. The group action on the manifold decomposes the manifold to a quotient structure and an orbit space, and the problem of observer design for the entire system gets decomposed to a design over the orbit (the group space) and a design over the quotient space. The emphasis throughout the articl…
▽ More
The article presents a bundle framework for nonlinear observer design on a manifold with a Lie group action. The group action on the manifold decomposes the manifold to a quotient structure and an orbit space, and the problem of observer design for the entire system gets decomposed to a design over the orbit (the group space) and a design over the quotient space. The emphasis throughout the article is on presenting an overarching geometric structure; the special case when the group action is free is given special emphasis. Gradient based observer design on a Lie group is given explicit attention. The concepts developed are illustrated by applying them on well known examples, which include the action of $\mathbb{SO}(3)$ on $\mathbb{R}^3 \setminus \{0\}$ and the simultaneous localisation and map** (SLAM) problem.
△ Less
Submitted 12 July, 2021; v1 submitted 22 July, 2019;
originally announced July 2019.
-
Feedback Regularization and Geometric PID Control for Robust Stabilization of a Planar Three-link Hybrid Bipedal Walking Model
Authors:
W. M. L. T. Weerakoon,
T. W. U. Madhushani,
D. H. S. Maithripala,
J. M. Berg
Abstract:
This paper applies a recently developed geometric PID controller to stabilize a three-link planar bipedal hybrid dynamic walking model. The three links represent the robot torso and two kneeless legs, with an independent control torque available at each hip joint. The geometric PID controller is derived for fully actuated mechanical systems, however in the swing phase the three-link biped robot ha…
▽ More
This paper applies a recently developed geometric PID controller to stabilize a three-link planar bipedal hybrid dynamic walking model. The three links represent the robot torso and two kneeless legs, with an independent control torque available at each hip joint. The geometric PID controller is derived for fully actuated mechanical systems, however in the swing phase the three-link biped robot has three degrees of freedom and only two controls. Following the bipedal walking literature, underactuation is addressed by choosing two "virtual constraints" to enforce, and verifying the stability of the resulting two-dimensional zero dynamics. The resulting controlled dynamics do not have the structure of a mechanical system, however this structure is restored using "feedback regularization," following which geometric PID control is used to provide robust asymptotic regulation of the virtual constraints. The proposed method can tolerate significantly greater variations in inclination, showing the value of the geometric methods, and the benefit of integral action.
△ Less
Submitted 5 October, 2017;
originally announced October 2017.
-
Asymptotic Allocation Rules for a Class of Dynamic Multi-armed Bandit Problems
Authors:
T. W. U. Madhushani,
D. H. S. Maithripala,
N. E. Leonard
Abstract:
This paper presents a class of Dynamic Multi-Armed Bandit problems where the reward can be modeled as the noisy output of a time varying linear stochastic dynamic system that satisfies some boundedness constraints. The class allows many seemingly different problems with time varying option characteristics to be considered in a single framework. It also opens up the possibility of considering many…
▽ More
This paper presents a class of Dynamic Multi-Armed Bandit problems where the reward can be modeled as the noisy output of a time varying linear stochastic dynamic system that satisfies some boundedness constraints. The class allows many seemingly different problems with time varying option characteristics to be considered in a single framework. It also opens up the possibility of considering many new problems of practical importance. For instance it affords the simultaneous consideration of temporal option unavailabilities and the depen- dencies between options with time varying option characteristics in a seamless manner. We show that, for this class of problems, the combination of any Upper Confidence Bound type algorithm with any efficient reward estimator for the expected reward ensures the logarithmic bounding of the expected cumulative regret. We demonstrate the versatility of the approach by the explicit consideration of a new example of practical interest.
△ Less
Submitted 7 October, 2017; v1 submitted 1 October, 2017;
originally announced October 2017.
-
Control Synthesis for an Underactuated Cable Suspended System Using Dynamic Decoupling
Authors:
Siddharth H. Nair,
Ravi N. Banavar,
D. H. S. Maithripala
Abstract:
This article studies the dynamics and control of a novel underactuated system, wherein a plate suspended by cables and with a freely moving mass on top, whose other ends are attached to three quadrotors, is sought to be horizontally stabilized at a certain height, with the ball positioned at the center of mass of the plate. The freely moving mass introduces a 2-degree of underactuation into the sy…
▽ More
This article studies the dynamics and control of a novel underactuated system, wherein a plate suspended by cables and with a freely moving mass on top, whose other ends are attached to three quadrotors, is sought to be horizontally stabilized at a certain height, with the ball positioned at the center of mass of the plate. The freely moving mass introduces a 2-degree of underactuation into the system. The design proceeds through a decoupling of the quadrotors and the plate dynamics. Through a partial feedback linearization approach, the attitude of the plate and the translational height of the plate is initially controlled, while maintaining a bounded velocity along the $y$ and $x$ directions. These inputs are then synthesized through the quadrotors with a backstep** and timescale separation argument based on Tikhonov's theorem.
△ Less
Submitted 23 January, 2018; v1 submitted 3 July, 2017;
originally announced July 2017.
-
A Geometric PID Control Framework for Mechanical Systems
Authors:
D. H. S. Maithripala,
T. W. U. Madhushani,
J. M. Berg
Abstract:
These lectures demonstrate the development of a PID control framework for mechanical systems. Based on the observation that mechanical systems are essentially double integrator systems, we generalize the linear PID controller to mechanical systems that have a non-Euclidean configuration space. Specifically we start by presenting the development of the geometric PID controller for fully actuated me…
▽ More
These lectures demonstrate the development of a PID control framework for mechanical systems. Based on the observation that mechanical systems are essentially double integrator systems, we generalize the linear PID controller to mechanical systems that have a non-Euclidean configuration space. Specifically we start by presenting the development of the geometric PID controller for fully actuated mechanical systems and then extend it to a class of under actuated interconnected mechanical systems of practical significance by introducing the notion of feedback regularization. We show that feedback regularization is the mechanical system equivalent to partial feedback linearization. We apply these results for trajectory tracking for several systems of interest in the field of robotics. First, we demonstrate the robust almost-global stability properties of the geometric PID controller developed for fully actuated mechanical systems using simulations and experiments on a multi-rotor-aerial-vehicle. The extension to the class of under actuated interconnected systems allow one to ensure the semi-almost-global locally exponential tracking of the geometric center of a spherical robot on an inclined plane of unknown angle of inclination. The results are demonstrated using simulations for a hoop rolling on an inclined plane and then for a sphere rolling on an inclined plane. The final extension that we present here is that of geometric PID control for holonomically or non-holonomically constrained mechanical systems on Lie groups. The results are demonstrated by ensuring the robust almost global locally exponential tracking of a nontrivial spherical pendulum.
△ Less
Submitted 14 October, 2016;
originally announced October 2016.
-
Feedback Regularization and Geometric PID Control for Trajectory Tracking of Coupled Mechanical Systems: Hoop Robots on an Inclined Plane
Authors:
T. W. U. Madhushani,
D. H. S. Maithripala,
J. M. Berg
Abstract:
This paper applies geometric PID control for asymptotic tracking of a desired trajectory by a hoop robot in the presence of disturbances and uncertainties. The hoop robot, consisting of a circular body rolling without slip along a one-dimensional surface, is a planar analog of a spherical robot. A variety of coupled mechanical system may be used to actuate the hoop robot. This paper specifically c…
▽ More
This paper applies geometric PID control for asymptotic tracking of a desired trajectory by a hoop robot in the presence of disturbances and uncertainties. The hoop robot, consisting of a circular body rolling without slip along a one-dimensional surface, is a planar analog of a spherical robot. A variety of coupled mechanical system may be used to actuate the hoop robot. This paper specifically considers two different actuators, one a simple pendulum and the other an internal cart. The geometric PID controller requires the plant to be a mechanical system, and the hoop robot does not satisfy this condition. Therefore a geometric inner loop is presented that gives the hoop robot the required structure. This procedure is here referred to as feedback regularization. Feedback regularization--in contrast to feedback linearization--is coordinate independent, and hence reflects the fundamental system structure. Note also that the resulting mechanical system is nonlinear and underactuated. Subsequently, the geometric PID outer loop guarantees almost-semiglobal tracking with locally exponential convergence, and the integral action of the PID guarantees robustness to constant disturbances and parameter uncertainties, including constant inclination of the rolling surface. The complete tracking controller is the composition of the two coordinate-independent loops, and therefore is also coordinate independent.
△ Less
Submitted 26 February, 2017; v1 submitted 29 September, 2016;
originally announced September 2016.
-
Semi-globally Exponential Trajectory Tracking for a Class of Spherical Robots
Authors:
T. W. U. Madhushani,
D. H. S. Maithripala,
J. V. Wijayakulasooriya,
J. M. Berg
Abstract:
A spherical robot consists of an externally spherical rigid body rolling on a two-dimensional surface, actuated by an auxiliary mechanism. For a class of actuation mechanisms, we derive a controller for the geometric center of the sphere to asymptotically track any sufficiently smooth reference trajectory, with robustness to bounded, constant uncertainties in the inertial properties of the sphere…
▽ More
A spherical robot consists of an externally spherical rigid body rolling on a two-dimensional surface, actuated by an auxiliary mechanism. For a class of actuation mechanisms, we derive a controller for the geometric center of the sphere to asymptotically track any sufficiently smooth reference trajectory, with robustness to bounded, constant uncertainties in the inertial properties of the sphere and actuation mechanism, and to constant disturbance forces including, for example, from constant inclination of the rolling surface. The sphere and actuator are modeled as distinct systems, coupled by reaction forces. It is assumed that the actuator can provide three independent control torques, and that the actuator center of mass remains at a constant distance from the geometric center of the sphere. We show that a necessary and sufficient condition for such a controller to exist is that for any constant disturbance torque acting on the sphere there is a constant input such that the sphere and the actuator mechanism has a stable relative equilibrium. A geometric PID controller guarantees robust, semi-global, locally exponential stability for the position tracking error of the geometric center of the sphere, while ensuring that actuator velocities are bounded.
△ Less
Submitted 1 March, 2017; v1 submitted 4 August, 2016;
originally announced August 2016.