Bi-level Trajectory Optimization on Uneven Terrains with Differentiable Wheel-Terrain Interaction Model

Authors: Amith Manoharan, Aditya Sharma, Himani Belsare, Kaustab Pal, K. Madhava Krishna, Arun Kumar Singh

Abstract: Navigation of wheeled vehicles on uneven terrain necessitates going beyond the 2D approaches for trajectory planning. Specifically, it is essential to incorporate the full 6dof variation of vehicle pose and its associated stability cost in the planning process. To this end, most recent works aim to learn a neural network model to predict the vehicle evolution. However, such approaches are data-intensive and fraught with generalization issues. In this paper, we present a purely model-based approach that just requires the digital elevation information of the terrain. Specifically, we express the wheel-terrain interaction and 6dof pose prediction as a non-linear least squares (NLS) problem. As a result, trajectory planning can be viewed as a bi-level optimization. The inner optimization layer predicts the pose on the terrain along a given trajectory, while the outer layer deforms the trajectory itself to reduce the stability and kinematic costs of the pose. We improve the state-of-the-art in the following respects. First, we show that our NLS based pose prediction closely matches the output from a high-fidelity physics engine. This result coupled with the fact that we can query gradients of the NLS solver, makes our pose predictor, a differentiable wheel-terrain interaction model. We further leverage this differentiability to efficiently solve the proposed bi-level trajectory optimization problem. Finally, we perform extensive experiments, and comparison with a baseline to showcase the effectiveness of our approach in obtaining smooth, stable trajectories.

Submitted 11 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

Comments: 8 pages, 7 figures, submitted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)

arXiv:2309.07567 [pdf, other]

Persistence in Active Turbulence

Authors: Amal Manoharan, Sanjay CP, Ashwin Joy

Abstract: Active fluids such as bacterial swarms, self-propelled colloids, and cell tissues can all display complex spatio-temporal vortices that are reminiscent of inertial turbulence. This emergent behavior despite the overdamped nature of these systems is the hallmark of active turbulence. In this letter, using a generalized hydrodynamic model, we present a study of the persistence problem in active turbulence. We report that the persistence time of passive tracers inside the coherent vortices follows a Weibull probability density whose shape and scale are decided by the strength of activity -- contrary to inertial turbulence that displays power-law statistics in this region. In the turbulent background, the persistence time is exponentially distributed that is remindful of inertial turbulence. Finally we show that the driver of persistence inside the coherent vortices is the temporal decorrelation of the topological field, whereas it is the vortex turnover time in the turbulent background.

Submitted 14 September, 2023; originally announced September 2023.

arXiv:2207.09136 [pdf, ps, other]

doi 10.1007/s10846-023-01859-6

Nonlinear Model Predictive Control Framework For Cooperative Three-Agent Target Defense Game

Authors: Amith Manoharan, P. B. Sujit

Abstract: This paper presents cooperative target defense guidance strategies using nonlinear model predictive control (NMPC) framework for a target-attacker-defender (TAD) game. The TAD game consists of an attacker and a cooperative target-defender pair. The attacker's objective is to capture the target, whereas the target-defender team acts together such that the defender can intercept the attacker and ensure target survival. We assume that the cooperative target-defender pair do not have perfect knowledge of the attacker states, and hence the states are estimated using an Extended Kalman Filter (EKF). The capture analysis based on the Apollonius circles is performed to identify the target survival regions. The efficacy of the NMPC-based solution is evaluated through extensive numerical simulations. The results show that the NMPC-based solution offers robustness to the different unknown attacker models and has better performance than CLOS and A-CLOS based strategies.

Submitted 19 July, 2022; originally announced July 2022.

Comments: 16 pages

Journal ref: Journal of Intelligent & Robotic Systems, 108, Article number: 21 (2023)

arXiv:2201.09285 [pdf, other]

Multi-AAV Cooperative Path Planning using Nonlinear Model Predictive Control with Localization Constraints

Authors: Amith Manoharan, Rajnikanth Sharma, P. B. Sujit

Abstract: In this paper, we solve a joint cooperative localization and path planning problem for a group of Autonomous Aerial Vehicles (AAVs) in GPS-denied areas using nonlinear model predictive control (NMPC). A moving horizon estimator (MHE) is used to estimate the vehicle states with the help of relative bearing information to known landmarks and other vehicles. The goal of the NMPC is to devise optimal paths for each vehicle between a given source and destination while maintaining desired localization accuracy. Estimating localization covariance in the NMPC is computationally intensive, hence we develop an approximate analytical closed form expression based on the relationship between covariance and path lengths to landmarks. Using this expression while computing NMPC commands reduces the computational complexity significantly. We present numerical simulations to validate the proposed approach for different numbers of vehicles and landmark configurations. We also compare the results with EKF-based estimation to show the superiority of the proposed closed form approach.

Submitted 23 January, 2022; originally announced January 2022.

arXiv:2110.08318 [pdf, other]

Dynamic probabilistic logic models for effective abstractions in RL

Authors: Harsha Kokel, Arjun Manoharan, Sriraam Natarajan, Balaraman Ravindran, Prasad Tadepalli

Abstract: State abstraction enables sample-efficient learning and better task transfer in complex reinforcement learning environments. Recently, we proposed RePReL (Kokel et al. 2021), a hierarchical framework that leverages a relational planner to provide useful state abstractions for learning. We present a brief overview of this framework and the use of a dynamic probabilistic logic model to design these state abstractions. Our experiments show that RePReL not only achieves better performance and efficient learning on the task at hand but also demonstrates better generalization to unseen tasks.

Submitted 15 October, 2021; originally announced October 2021.

Comments: Accepted at StarAI 2021 (held in conjunction with IJCLR 2021)

arXiv:2108.06276 [pdf, ps, other]

doi 10.1109/LCSYS.2022.3195819

NMPC-Based Cooperative Strategy For A Target Pair To Lure Two Attackers Into Collision

Authors: Amith Manoharan, P. B. Sujit

Abstract: This paper presents a cooperative target defense strategy using nonlinear model-predictive control (NMPC) framework for a two-targets two-attackers (2T2A) game. The 2T2A game consists of two attackers and two targets. Each attacker needs to capture a designated target individually. However, the two targets cooperate to lure the attackers into a collision. We assume that the cooperative target pair do not have perfect knowledge of the attacker states, and hence they estimate the attacker states using an extended Kalman filter (EKF). The NMPC scheme computes closed-loop optimal control commands for the targets while respecting imposed state and control constraints. Theoretical analysis is carried out to determine regions that will lead to the targets' survival, given the initial positions of the attacker and target agents. Numerical simulations are carried out to evaluate the performance of the proposed NMPC-based strategy for different scenarios.

Submitted 13 August, 2021; originally announced August 2021.

Comments: 18 pages, 12 figures

Journal ref: IEEE Control Systems Letters, vol. 7, pp. 496-501, 2023

arXiv:1909.04134 [pdf, other]

Option Encoder: A Framework for Discovering a Policy Basis in Reinforcement Learning

Authors: Arjun Manoharan, Rahul Ramesh, Balaraman Ravindran

Abstract: Option discovery and skill acquisition frameworks are integral to the functioning of a Hierarchically organized Reinforcement learning agent. However, such techniques often yield a large number of options or skills, which can potentially be represented succinctly by filtering out any redundant information. Such a reduction can reduce the required computation while also improving the performance on a target task. In order to compress an array of option policies, we attempt to find a policy basis that accurately captures the set of all options. In this work, we propose Option Encoder, an auto-encoder based framework with intelligently constrained weights, that helps discover a collection of basis policies. The policy basis can be used as a proxy for the original set of skills in a suitable hierarchically organized framework. We demonstrate the efficacy of our method on a collection of grid-worlds and on the high-dimensional Fetch-Reach robotic manipulation task by evaluating the obtained policy basis on a set of downstream tasks.

Submitted 3 July, 2020; v1 submitted 9 September, 2019; originally announced September 2019.

Comments: ECML-PKDD 2020

arXiv:1703.01510 [pdf]

doi 10.1016/j.jmmm.2017.04.070

Tuning the magnetic properties of Fe50-xMnxPt50 thin films

Authors: Ezhil A. Manoharan, Gary Mankey, Yang-Ki Hong

Abstract: The magnetic and structural properties of highly ordered (S ~ 0.82) epitaxial Fe50-xMnxPt50 thin films were investigated. We report the change in the magnetic properties of Mn doped FePt epitaxial thin films. This study differs from the earlier experimental studies on Mn doped FePt based alloys. Ordered L10 Fe50-xMnxPt50 (x=0, 6, 9, 12 and 15) thin films with a constant thickness of 45 nm were prepared by co-sputtering Fe50Pt50 and Mn50Pt50 on to MgO (100) single crystal substrate. We find a significant increase in the coercivity for Fe-Mn-Pt thin films. We have shown that this increase in magnetic properties coincide with the tetragonal distortion, while the recent first principles study of Mn doped FePt showed the sub lattice ordering of ferromagnetically aligned Mn atoms would lead to increase in magnetic properties in the FeMnPt ternary alloy system with fixed Pt concentration. At x=12 the coercivity has increased by 46.4 % when compared to Fe50Pt50. The increase in magnetic properties in Fe50-xMnxPt50 is due to the tetragonal distortion as experimental c/a ratio is larger than the expected c/a ratio for ferromagnetically ordered Mn atoms in the sublattice at the concentration x=12. Thus we show that high temperature deposition and high temperature annealing is one of the methods to achieve large coercivity in Mn doped FePt as it leads to tetragonal distortion.

Submitted 4 March, 2017; originally announced March 2017.

arXiv:quant-ph/0304136 [pdf, ps, other]

Beyond Quantum Computation and Towards Quantum Field Computation

Authors: A. C. Manoharan

Abstract: Because the subject of relativistic quantum field theory (QFT) contains all of non-relativistic quantum mechanics, we expect quantum field computation to contain (non-relativistic) quantum computation. Although we do not yet have a quantum theory of the gravitational field, and are far from a practical implementation of a quantum field computer, some pieces of the puzzle (without gravity) are now available. We consider a general model for computation with quantum field theory, and obtain some results for relativistic quantum computation. Moreover, it is possible to see new connections between principal models of computation, namely, computation over the continuum and computation over the integers (Turing computation). Thus we identify a basic problem in QFT, namely Wightman's computation problem for domains of holomorphy, which we call WHOLO. Inspired by the same analytic functions which are central to the famous CPT theorem of QFT, it is possible to obtain a computational complexity structure for QFT and shed new light on certain complexity classes for this problem WHOLO.

Submitted 19 April, 2003; originally announced April 2003.

Comments: 18 pages

arXiv:quant-ph/0109015 [pdf, ps, other]

The unity between quantum field computation, real computation, and quantum computation

Authors: A. C. Manoharan

Abstract: It is indicated that principal models of computation are indeed significantly related. The quantum field computation model contains the quantum computation model of Feynman. (The term "quantum field computer" was used by Freedman.) Quantum field computation (as enhanced by Wightman's model of quantum field theory) involves computation over the continuum which is remarkably related to the real computation model of Smale. The latter model was established as a generalization of Turing computation. All this is not surprising since it is well known that the physics of quantum field theory (which includes Einstein's special relativity) contains quantum mechanics which in turn contains classical mechanics. The unity of these computing models, which seem to have grown largely independently, could shed new light into questions of computational complexity, into the central P (Polynomial time) versus NP (Non-deterministic Polynomial time) problem of computer science, and also into the description of Nature by fundamental physics theories.

Submitted 3 September, 2001; originally announced September 2001.

arXiv:quant-ph/0002017 [pdf, ps, other]

Quantum Field Symbolic Analog Computation: Relativity Model

Authors: A. C. Manoharan

Abstract: It is natural to consider a quantum system in the continuum limit of space-time configuration. Incorporating also, Einstein's special relativity, leads to the quantum theory of fields. Non-relativistic quantum mechanics and classical mechanics are special cases. By studying vacuum expectation values (Wightman functions W(n; z) where z denotes the set of n complex variables) of products of quantum field operators in a separable Hilbert space, one is led to computation of holomorphy domains for these functions over the space of several complex variables, C^n. Quantum fields were reconstructed from these functions by Wightman. Computer automation has been accomplished as deterministic exact analog computation (computation over "cells" in the continuum of C^n) for obtaining primitive extended tube domains of holomorphy. This is done in a one dimensional space plus one dimensional time model. By considering boundary related semi-algebraic sets, some analytic extensions of these domains are obtained by non-deterministic methods. The novel methods of computation raise interesting issues of computability and complexity. Moreover, the computation is independent of any particular form of Lagrangian or dynamics, and is uniform in n, qualifying for a universal quantum machine over C^infinity.

Submitted 6 February, 2000; originally announced February 2000.

Comments: 7 pages

Showing 1–11 of 11 results for author: Manoharan, A