-
RE-MOVE: An Adaptive Policy Design for Robotic Navigation Tasks in Dynamic Environments via Language-Based Feedback
Authors:
Souradip Chakraborty,
Kasun Weerakoon,
Prithvi Poddar,
Mohamed Elnoor,
Priya Narayanan,
Carl Busart,
Pratap Tokekar,
Amrit Singh Bedi,
Dinesh Manocha
Abstract:
Reinforcement learning-based policies for continuous control robotic navigation tasks often fail to adapt to changes in the environment during real-time deployment, which may result in catastrophic failures. To address this limitation, we propose a novel approach called RE-MOVE (REquest help and MOVE on) to adapt already trained policy to real-time changes in the environment without re-training vi…
▽ More
Reinforcement learning-based policies for continuous control robotic navigation tasks often fail to adapt to changes in the environment during real-time deployment, which may result in catastrophic failures. To address this limitation, we propose a novel approach called RE-MOVE (REquest help and MOVE on) to adapt already trained policy to real-time changes in the environment without re-training via utilizing a language-based feedback. The proposed approach essentially boils down to addressing two main challenges of (1) when to ask for feedback and, if received, (2) how to incorporate feedback into trained policies. RE-MOVE incorporates an epistemic uncertainty-based framework to determine the optimal time to request instructions-based feedback. For the second challenge, we employ a zero-shot learning natural language processing (NLP) paradigm with efficient, prompt design and leverage state-of-the-art GPT-3.5, Llama-2 language models. To show the efficacy of the proposed approach, we performed extensive synthetic and real-world evaluations in several test-time dynamic navigation scenarios. Utilizing RE-MOVE result in up to 80% enhancement in the attainment of successful goals, coupled with a reduction of 13.50% in the normalized trajectory length, as compared to alternative approaches, particularly in demanding real-world environments with perceptual challenges.
△ Less
Submitted 17 September, 2023; v1 submitted 14 March, 2023;
originally announced March 2023.
-
A Model-free Deep Reinforcement Learning Approach To Maneuver A Quadrotor Despite Single Rotor Failure
Authors:
Paras Sharma,
Prithvi Poddar,
P. B. Sujit
Abstract:
Ability to recover from faults and continue mission is desirable for many quadrotor applications. The quadrotor's rotor may fail while performing a mission and it is essential to develop recovery strategies so that the vehicle is not damaged. In this paper, we develop a model-free deep reinforcement learning approach for a quadrotor to recover from a single rotor failure. The approach is based on…
▽ More
Ability to recover from faults and continue mission is desirable for many quadrotor applications. The quadrotor's rotor may fail while performing a mission and it is essential to develop recovery strategies so that the vehicle is not damaged. In this paper, we develop a model-free deep reinforcement learning approach for a quadrotor to recover from a single rotor failure. The approach is based on Soft-actor-critic that enables the vehicle to hover, land, and perform complex maneuvers. Simulation results are presented to validate the proposed approach using a custom simulator. The results show that the proposed approach achieves hover, landing, and path following in 2D and 3D. We also show that the proposed approach is robust to wind disturbances.
△ Less
Submitted 21 September, 2021;
originally announced September 2021.
-
Multi-Agent Deep Reinforcement Learning For Persistent Monitoring With Sensing, Communication, and Localization Constraints
Authors:
Manav Mishra,
Prithvi Poddar,
Rajat Agarwal,
**gxi Chen,
Pratap Tokekar,
P. B. Sujit
Abstract:
Determining multi-robot motion policies for persistently monitoring a region with limited sensing, communication, and localization constraints in non-GPS environments is a challenging problem. To take the localization constraints into account, in this paper, we consider a heterogeneous robotic system consisting of two types of agents: anchor agents with accurate localization capability and auxilia…
▽ More
Determining multi-robot motion policies for persistently monitoring a region with limited sensing, communication, and localization constraints in non-GPS environments is a challenging problem. To take the localization constraints into account, in this paper, we consider a heterogeneous robotic system consisting of two types of agents: anchor agents with accurate localization capability and auxiliary agents with low localization accuracy. To localize itself, the auxiliary agents must be within the communication range of an {anchor}, directly or indirectly. The robotic team's objective is to minimize environmental uncertainty through persistent monitoring. We propose a multi-agent deep reinforcement learning (MARL) based architecture with graph convolution called Graph Localized Proximal Policy Optimization (GALOPP), which incorporates the limited sensor field-of-view, communication, and localization constraints of the agents along with persistent monitoring objectives to determine motion policies for each agent. We evaluate the performance of GALOPP on open maps with obstacles having a different number of anchor and auxiliary agents. We further study (i) the effect of communication range, obstacle density, and sensing range on the performance and (ii) compare the performance of GALOPP with non-RL baselines, namely, greedy search, random search, and random search with communication constraint. For its generalization capability, we also evaluated GALOPP in two different environments -- 2-room and 4-room. The results show that GALOPP learns the policies and monitors the area well. As a proof-of-concept, we perform hardware experiments to demonstrate the performance of GALOPP.
△ Less
Submitted 14 May, 2023; v1 submitted 14 September, 2021;
originally announced September 2021.
-
Coherent Atomically-Thin Superlattices with Engineered Strain
Authors:
Saien Xie,
Lijie Tu,
Yimo Han,
Lujie Huang,
Kibum Kang,
Ka Un Lao,
Preeti Poddar,
David A. Muller,
Robert A. DiStasio Jr,
Jiwoong Park
Abstract:
Epitaxy forms the basis of modern electronics and optoelectronics. We report coherent atomically-thin superlattices, in which different transition metal dichalcogenide monolayers--despite large lattice mismatches--are repeated and integrated without dislocations. Grown by a novel omnidirectional epitaxy, these superlattices display fully-matched lattice constants across heterointerfaces while main…
▽ More
Epitaxy forms the basis of modern electronics and optoelectronics. We report coherent atomically-thin superlattices, in which different transition metal dichalcogenide monolayers--despite large lattice mismatches--are repeated and integrated without dislocations. Grown by a novel omnidirectional epitaxy, these superlattices display fully-matched lattice constants across heterointerfaces while maintaining a surprisingly isotropic lattice structure and triangular symmetry. This strong epitaxial strain is precisely engineered via the nanoscale supercell dimensions, thereby enabling broad tuning of the optical properties and producing photoluminescence peak shifts as large as 250 meV. We present theoretical models to explain this coherent growth as well as the energetic interplay governing the flat-rippled configuration space in these strained monolayers. Such coherent superlattices provide novel building blocks with targeted functionalities at the atomically-thin monolayer limit.
△ Less
Submitted 30 August, 2017;
originally announced August 2017.
-
Discovery of room temperature multiferroicity and magneto-electric coupling in Fe3Se4 nanorods
Authors:
Mousumi Sen Bishwas,
Pankaj Poddar
Abstract:
We report for the first time, that Fe3Se4 is a room temperature, type-II multiferroic with magnetoelectric coupling. We observed the coexistence of coupled ferrimagnetic and ferroelectric ordering in Fe3Se4nanorods well above room temperature, which is a hard magnet with large magnetocrystalline anisotropy. For the first time, we observed spontaneous, reversible ferroelectric polarization in Fe3Se…
▽ More
We report for the first time, that Fe3Se4 is a room temperature, type-II multiferroic with magnetoelectric coupling. We observed the coexistence of coupled ferrimagnetic and ferroelectric ordering in Fe3Se4nanorods well above room temperature, which is a hard magnet with large magnetocrystalline anisotropy. For the first time, we observed spontaneous, reversible ferroelectric polarization in Fe3Se4 nanorods below the magnetic Curie temperature. The coupling is manifested by an anomaly in the dielectric constant and Raman shift at Tc. We do not completely understand the origin of the ferroelectric ordering at this point however the simultaneous presence of magnetic and ferroelectric ordering at room temperature in Fe3Se4 along with hard magnetic properties will open new research areas for devices.
△ Less
Submitted 12 June, 2019; v1 submitted 20 December, 2016;
originally announced December 2016.
-
Automated Objective Surgical Skill Assessment in the Operating Room Using Unstructured Tool Motion
Authors:
Piyush Poddar,
Narges Ahmidi,
S. Swaroop Vedula,
Lisa Ishii,
Gregory D. Hager,
Masaru Ishii
Abstract:
Previous work on surgical skill assessment using intraoperative tool motion in the operating room (OR) has focused on highly-structured surgical tasks such as cholecystectomy. Further, these methods only considered generic motion metrics such as time and number of movements, which are of limited instructive value. In this paper, we developed and evaluated an automated approach to the surgical skil…
▽ More
Previous work on surgical skill assessment using intraoperative tool motion in the operating room (OR) has focused on highly-structured surgical tasks such as cholecystectomy. Further, these methods only considered generic motion metrics such as time and number of movements, which are of limited instructive value. In this paper, we developed and evaluated an automated approach to the surgical skill assessment of nasal septoplasty in the OR. The obstructed field of view and highly unstructured nature of septoplasty precludes trainees from efficiently learning the procedure. We propose a descriptive structure of septoplasty consisting of two types of activity: (1) brushing activity directed away from the septum plane characterizing the consistency of the surgeon's wrist motion and (2) activity along the septal plane characterizing the surgeon's coverage pattern. We derived features related to these two activity types that classify a surgeon's level of training with an average accuracy of about 72%. The features we developed provide surgeons with personalized, actionable feedback regarding their tool motion.
△ Less
Submitted 18 December, 2014;
originally announced December 2014.
-
Manifestation of the Verwey Transition in the Tunneling Spectra of Magnetite Nanocrystals
Authors:
Pankaj Poddar,
Tcipi Fried,
Gil Markovich,
Amos Sharoni,
David Katz,
Tommer Wizansky,
Oded Millo
Abstract:
Tunneling transport measurements performed on single particles and on arrays of Fe3O4 (magnetite) nanocrystals provide strong evidence for the existence of the Verwey metal-insulator transition at the nanoscale. The resistance measurements on nanocrystal arrays show an abrupt increase of the resistance around 100 K, consistent with the Verwey transition, while the current-voltage characteristics…
▽ More
Tunneling transport measurements performed on single particles and on arrays of Fe3O4 (magnetite) nanocrystals provide strong evidence for the existence of the Verwey metal-insulator transition at the nanoscale. The resistance measurements on nanocrystal arrays show an abrupt increase of the resistance around 100 K, consistent with the Verwey transition, while the current-voltage characteristics exhibit a sharp transition from an insulator gap to a peak structure around zero bias voltage. The tunneling spectra obtained on isolated particles using a Scanning Tunneling Microscope reveal an insulator-like gap structure in the density of states below the transition temperature that gradually disappeared with increasing temperature, transforming to a small peak structure at the Fermi energy. These data provide insight into the roles played by long- and short-range charge ordering in the Verwey transition.
△ Less
Submitted 20 July, 2003;
originally announced July 2003.