Search | arXiv e-print repository

RE-MOVE: An Adaptive Policy Design for Robotic Navigation Tasks in Dynamic Environments via Language-Based Feedback

Authors: Souradip Chakraborty, Kasun Weerakoon, Prithvi Poddar, Mohamed Elnoor, Priya Narayanan, Carl Busart, Pratap Tokekar, Amrit Singh Bedi, Dinesh Manocha

Abstract: Reinforcement learning-based policies for continuous control robotic navigation tasks often fail to adapt to changes in the environment during real-time deployment, which may result in catastrophic failures. To address this limitation, we propose a novel approach called RE-MOVE (REquest help and MOVE on) to adapt an already trained policy to real-time changes in the environment without re-training by using language-based feedback. The proposed approach essentially boils down to addressing two main challenges: (1) when to ask for feedback and, if received, (2) how to incorporate feedback into trained policies. RE-MOVE incorporates an epistemic uncertainty-based framework to determine the optimal time to request instruction-based feedback. For the second challenge, we employ a zero-shot learning natural language processing (NLP) paradigm with efficient prompt design and leverage the state-of-the-art GPT-3.5 and Llama-2 language models. To show the efficacy of the proposed approach, we performed extensive synthetic and real-world evaluations in several test-time dynamic navigation scenarios. Utilizing RE-MOVE results in up to 80% enhancement in the attainment of successful goals, coupled with a reduction of 13.50% in the normalized trajectory length, as compared to alternative approaches, particularly in demanding real-world environments with perceptual challenges.

Submitted 17 September, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

arXiv:2109.10488 [pdf, other]

A Model-free Deep Reinforcement Learning Approach To Maneuver A Quadrotor Despite Single Rotor Failure

Authors: Paras Sharma, Prithvi Poddar, P. B. Sujit

Abstract: The ability to recover from faults and continue the mission is desirable for many quadrotor applications. A quadrotor's rotor may fail while performing a mission, and it is essential to develop recovery strategies so that the vehicle is not damaged. In this paper, we develop a model-free deep reinforcement learning approach for a quadrotor to recover from a single rotor failure. The approach is based on Soft Actor-Critic and enables the vehicle to hover, land, and perform complex maneuvers. Simulation results are presented to validate the proposed approach using a custom simulator. The results show that the proposed approach achieves hover, landing, and path following in 2D and 3D. We also show that the proposed approach is robust to wind disturbances.

Submitted 21 September, 2021; originally announced September 2021.

arXiv:2109.06831 [pdf, other]

Multi-Agent Deep Reinforcement Learning For Persistent Monitoring With Sensing, Communication, and Localization Constraints

Authors: Manav Mishra, Prithvi Poddar, Rajat Agarwal, Jingxi Chen, Pratap Tokekar, P. B. Sujit

Abstract: Determining multi-robot motion policies for persistently monitoring a region with limited sensing, communication, and localization constraints in non-GPS environments is a challenging problem. To take the localization constraints into account, in this paper, we consider a heterogeneous robotic system consisting of two types of agents: anchor agents with accurate localization capability and auxiliary agents with low localization accuracy. To localize themselves, the auxiliary agents must be within the communication range of an anchor, directly or indirectly. The robotic team's objective is to minimize environmental uncertainty through persistent monitoring. We propose a multi-agent deep reinforcement learning (MARL) based architecture with graph convolution called Graph Localized Proximal Policy Optimization (GALOPP), which incorporates the limited sensor field-of-view, communication, and localization constraints of the agents along with persistent monitoring objectives to determine motion policies for each agent. We evaluate the performance of GALOPP on open maps with obstacles having a different number of anchor and auxiliary agents. We further (i) study the effect of communication range, obstacle density, and sensing range on the performance and (ii) compare the performance of GALOPP with non-RL baselines, namely, greedy search, random search, and random search with communication constraint. To assess its generalization capability, we also evaluated GALOPP in two different environments: 2-room and 4-room. The results show that GALOPP learns the policies and monitors the area well. As a proof-of-concept, we perform hardware experiments to demonstrate the performance of GALOPP.

Submitted 14 May, 2023; v1 submitted 14 September, 2021; originally announced September 2021.

arXiv:1708.09539 [pdf]

Coherent Atomically-Thin Superlattices with Engineered Strain

Authors: Saien Xie, Lijie Tu, Yimo Han, Lujie Huang, Kibum Kang, Ka Un Lao, Preeti Poddar, David A. Muller, Robert A. DiStasio Jr, Jiwoong Park

Abstract: Epitaxy forms the basis of modern electronics and optoelectronics. We report coherent atomically-thin superlattices, in which different transition metal dichalcogenide monolayers--despite large lattice mismatches--are repeated and integrated without dislocations. Grown by a novel omnidirectional epitaxy, these superlattices display fully-matched lattice constants across heterointerfaces while maintaining a surprisingly isotropic lattice structure and triangular symmetry. This strong epitaxial strain is precisely engineered via the nanoscale supercell dimensions, thereby enabling broad tuning of the optical properties and producing photoluminescence peak shifts as large as 250 meV. We present theoretical models to explain this coherent growth as well as the energetic interplay governing the flat-rippled configuration space in these strained monolayers. Such coherent superlattices provide novel building blocks with targeted functionalities at the atomically-thin monolayer limit.

Submitted 30 August, 2017; originally announced August 2017.

Comments: 4 main figures and 11 supplementary figures

arXiv:1612.06512 [pdf]

Discovery of room temperature multiferroicity and magneto-electric coupling in Fe3Se4 nanorods

Authors: Mousumi Sen Bishwas, Pankaj Poddar

Abstract: We report, for the first time, that Fe3Se4 is a room temperature, type-II multiferroic with magnetoelectric coupling. We observed the coexistence of coupled ferrimagnetic and ferroelectric ordering well above room temperature in Fe3Se4 nanorods, which are hard magnets with large magnetocrystalline anisotropy. For the first time, we observed spontaneous, reversible ferroelectric polarization in Fe3Se4 nanorods below the magnetic Curie temperature. The coupling is manifested by an anomaly in the dielectric constant and Raman shift at Tc. We do not completely understand the origin of the ferroelectric ordering at this point; however, the simultaneous presence of magnetic and ferroelectric ordering at room temperature in Fe3Se4, along with hard magnetic properties, will open new research areas for devices.

Submitted 12 June, 2019; v1 submitted 20 December, 2016; originally announced December 2016.

Comments: 27 Pages, 11 Figures including supplementary information

arXiv:1412.6163 [pdf]

Automated Objective Surgical Skill Assessment in the Operating Room Using Unstructured Tool Motion

Authors: Piyush Poddar, Narges Ahmidi, S. Swaroop Vedula, Lisa Ishii, Gregory D. Hager, Masaru Ishii

Abstract: Previous work on surgical skill assessment using intraoperative tool motion in the operating room (OR) has focused on highly-structured surgical tasks such as cholecystectomy. Further, these methods only considered generic motion metrics such as time and number of movements, which are of limited instructive value. In this paper, we developed and evaluated an automated approach to the surgical skill assessment of nasal septoplasty in the OR. The obstructed field of view and highly unstructured nature of septoplasty preclude trainees from efficiently learning the procedure. We propose a descriptive structure of septoplasty consisting of two types of activity: (1) brushing activity directed away from the septum plane, characterizing the consistency of the surgeon's wrist motion, and (2) activity along the septal plane, characterizing the surgeon's coverage pattern. We derived features related to these two activity types that classify a surgeon's level of training with an average accuracy of about 72%. The features we developed provide surgeons with personalized, actionable feedback regarding their tool motion.

Submitted 18 December, 2014; originally announced December 2014.

arXiv:cond-mat/0307487 [pdf]

doi 10.1209/epl/i2003-00141-0

Manifestation of the Verwey Transition in the Tunneling Spectra of Magnetite Nanocrystals

Authors: Pankaj Poddar, Tcipi Fried, Gil Markovich, Amos Sharoni, David Katz, Tommer Wizansky, Oded Millo

Abstract: Tunneling transport measurements performed on single particles and on arrays of Fe3O4 (magnetite) nanocrystals provide strong evidence for the existence of the Verwey metal-insulator transition at the nanoscale. The resistance measurements on nanocrystal arrays show an abrupt increase of the resistance around 100 K, consistent with the Verwey transition, while the current-voltage characteristics exhibit a sharp transition from an insulator gap to a peak structure around zero bias voltage. The tunneling spectra obtained on isolated particles using a Scanning Tunneling Microscope reveal an insulator-like gap structure in the density of states below the transition temperature that gradually disappeared with increasing temperature, transforming to a small peak structure at the Fermi energy. These data provide insight into the roles played by long- and short-range charge ordering in the Verwey transition.

Submitted 20 July, 2003; originally announced July 2003.

Comments: 8 pages, 3 figs, pdf. Submitted to Europhysics Letters

Showing 1–7 of 7 results for author: Poddar, P