Skip to main content

Showing 1–50 of 79 results for author: Tokekar, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16625  [pdf, other

    cs.RO

    GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection and Defect Detection

    Authors: Harnaik Dhami, Charith Reddy, Vishnu Dutt Sharma, Troi Williams, Pratap Tokekar

    Abstract: We study the problem of visual surface inspection of infrastructure for defects using an Unmanned Aerial Vehicle (UAV). We do not assume that the geometric model of the infrastructure is known beforehand. Our planner, termed GATSBI, plans a path in a receding horizon fashion to inspect all points on the surface of the infrastructure. The input to GATSBI consists of a 3D occupancy map created onlin… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 10 pages, 12 figures, 2 tables. Submitted to IEEE TAES. arXiv admin note: text overlap with arXiv:2012.04803

  2. arXiv:2404.03834  [pdf, other

    cs.RO

    Fast k-connectivity Restoration in Multi-Robot Systems for Robust Communication Maintenance

    Authors: Md Ishat-E-Rabban, Guangyao Shi, Griffin Bonner, Pratap Tokekar

    Abstract: Maintaining a robust communication network plays an important role in the success of a multi-robot team jointly performing an optimization task. A key characteristic of a robust cooperative multi-robot system is the ability to repair the communication topology in the case of robot failure. In this paper, we focus on the Fast k-connectivity Restoration (FCR) problem, which aims to repair a network… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 17 pages, 6 figures, 3 algorithms. arXiv admin note: text overlap with arXiv:2011.00685

  3. arXiv:2403.12891  [pdf, other

    cs.RO cs.AI cs.CV

    Adaptive Visual Imitation Learning for Robotic Assisted Feeding Across Varied Bowl Configurations and Food Types

    Authors: Rui Liu, Amisha Bhaskar, Pratap Tokekar

    Abstract: In this study, we introduce a novel visual imitation network with a spatial attention module for robotic assisted feeding (RAF). The goal is to acquire (i.e., scoop) food items from a bowl. However, achieving robust and adaptive food manipulation is particularly challenging. To deal with this, we propose a framework that integrates visual perception with imitation learning to enable the robot to h… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  4. arXiv:2403.12876  [pdf, other

    cs.RO cs.HC

    LAVA: Long-horizon Visual Action based Food Acquisition

    Authors: Amisha Bhaskar, Rui Liu, Vishnu D. Sharma, Guangyao Shi, Pratap Tokekar

    Abstract: Robotic Assisted Feeding (RAF) addresses the fundamental need for individuals with mobility impairments to regain autonomy in feeding themselves. The goal of RAF is to use a robot arm to acquire and transfer food to individuals from the table. Existing RAF methods primarily focus on solid foods, leaving a gap in manipulation strategies for semi-solid and deformable foods. This study introduces Lon… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 8 pages, 8 figures

  5. arXiv:2403.08955  [pdf, other

    cs.LG cs.AI math.OC

    Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis

    Authors: Rui Liu, Erfaun Noorani, Pratap Tokekar, John S. Baras

    Abstract: Reinforcement Learning (RL) has shown exceptional performance across various applications, enabling autonomous agents to learn optimal policies through interaction with their environments. However, traditional RL frameworks often face challenges in terms of iteration complexity and robustness. Risk-sensitive RL, which balances expected return and risk, has been explored for its potential to yield… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  6. arXiv:2403.08936  [pdf, other

    cs.MA cs.AI cs.RO

    Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning

    Authors: Peihong Yu, Manav Mishra, Alec Koppel, Carl Busart, Priya Narayan, Dinesh Manocha, Amrit Bedi, Pratap Tokekar

    Abstract: Multi-Agent Reinforcement Learning (MARL) algorithms face the challenge of efficient exploration due to the exponential increase in the size of the joint state-action space. While demonstration-guided learning has proven beneficial in single-agent settings, its direct applicability to MARL is hindered by the practical difficulty of obtaining joint expert demonstrations. In this work, we introduce… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  7. arXiv:2312.14436  [pdf, other

    cs.RO cs.LG

    REBEL: A Regularization-Based Solution for Reward Overoptimization in Robotic Reinforcement Learning from Human Feedback

    Authors: Souradip Chakraborty, Anukriti Singh, Amisha Bhaskar, Pratap Tokekar, Dinesh Manocha, Amrit Singh Bedi

    Abstract: The effectiveness of reinforcement learning (RL) agents in continuous control robotics tasks is heavily dependent on the design of the underlying reward function. However, a misalignment between the reward function and user intentions, values, or social norms can be catastrophic in the real world. Current methods to mitigate this misalignment work by learning reward functions from human preference… ▽ More

    Submitted 14 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

  8. arXiv:2311.04740  [pdf, other

    cs.MA cs.LG cs.RO

    Enhancing Multi-Agent Coordination through Common Operating Picture Integration

    Authors: Peihong Yu, Bhoram Lee, Aswin Raghavan, Supun Samarasekara, Pratap Tokekar, James Zachary Hare

    Abstract: In multi-agent systems, agents possess only local observations of the environment. Communication between teammates becomes crucial for enhancing coordination. Past research has primarily focused on encoding local information into embedding messages which are unintelligible to humans. We find that using these messages in agent's policy learning leads to brittle policies when tested on out-of-distri… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: accepted to OODWorkshop@CoRL23; please see https://openreview.net/forum?id=fADcJl0B0P for the paper

  9. arXiv:2310.07621  [pdf, other

    cs.RO

    AG-CVG: Coverage Planning with a Mobile Recharging UGV and an Energy-Constrained UAV

    Authors: Nare Karapetyan, Ahmad Bilal Asghar, Amisha Bhaskar, Guangyao Shi, Dinesh Manocha, Pratap Tokekar

    Abstract: In this paper, we present an approach for coverage path planning for a team of an energy-constrained Unmanned Aerial Vehicle (UAV) and an Unmanned Ground Vehicle (UGV). Both the UAV and the UGV have predefined areas that they have to cover. The goal is to perform complete coverage by both robots while minimizing the coverage time. The UGV can also serve as a mobile recharging station. The UAV and… ▽ More

    Submitted 15 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: ICRA 2024 Proceedings

  10. arXiv:2310.07070  [pdf, other

    cs.RO

    D2M2N: Decentralized Differentiable Memory-Enabled Map** and Navigation for Multiple Robots

    Authors: Md Ishat-E-Rabban, Pratap Tokekar

    Abstract: Recently, a number of learning-based models have been proposed for multi-robot navigation. However, these models lack memory and only rely on the current observations of the robot to plan their actions. They are unable to leverage past observations to plan better paths, especially in complex environments. In this work, we propose a fully differentiable and decentralized memory-enabled architecture… ▽ More

    Submitted 7 April, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: 7 pages, 5 figures, 4 tables

  11. arXiv:2310.07021  [pdf, other

    cs.RO cs.CV

    Pre-Trained Masked Image Model for Mobile Robot Navigation

    Authors: Vishnu Dutt Sharma, Anukriti Singh, Pratap Tokekar

    Abstract: 2D top-down maps are commonly used for the navigation and exploration of mobile robots through unknown areas. Typically, the robot builds the navigation maps incrementally from local observations using onboard sensors. Recent works have shown that predicting the structural patterns in the environment through learning-based approaches can greatly enhance task efficiency. While many such works build… ▽ More

    Submitted 25 March, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted at ICRA 2024

  12. arXiv:2310.01519  [pdf, other

    cs.RO

    Decision-Oriented Learning Using Differentiable Submodular Maximization for Multi-Robot Coordination

    Authors: Guangyao Shi, Chak Lam Shek, Nare Karapetyan, Pratap Tokekar

    Abstract: We present a differentiable, decision-oriented learning framework for cost prediction in a class of multi-robot decision-making problems, in which the robots need to trade off the task performance with the costs of taking actions when they select actions to take. Specifically, we consider the cases where the task performance is measured by a known monotone submodular function (e.g., coverage, mutu… ▽ More

    Submitted 25 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: text overlap with arXiv:2303.01543

  13. arXiv:2310.00834  [pdf, other

    cs.RO

    Energy-Aware Route Planning for a Battery-Constrained Robot with Multiple Charging Depots

    Authors: Ahmad Bilal Asghar, Pratap Tokekar

    Abstract: This paper considers energy-aware route planning for a battery-constrained robot operating in environments with multiple recharging depots. The robot has a battery discharge time $D$, and it should visit the recharging depots at most every $D$ time units to not run out of charge. The objective is to minimize robot's travel time while ensuring it visits all task locations in the environment. We pre… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

  14. arXiv:2310.00481  [pdf, other

    cs.RO

    LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments

    Authors: Chak Lam Shek, Xiyang Wu, Wesley A. Suttle, Carl Busart, Erin Zaroukian, Dinesh Manocha, Pratap Tokekar, Amrit Singh Bedi

    Abstract: Navigating robots through unstructured terrains is challenging, primarily due to the dynamic environmental changes. While humans adeptly navigate such terrains by using context from their observations, creating a similar context-aware navigation system for robots is difficult. The essence of the issue lies in the acquisition and interpretation of contextual information, a task complicated by the i… ▽ More

    Submitted 19 March, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

  15. arXiv:2309.08806  [pdf, other

    cs.RO

    UIVNAV: Underwater Information-driven Vision-based Navigation via Imitation Learning

    Authors: Xiaomin Lin, Nare Karapetyan, Kaustubh Joshi, Tianchen Liu, Nikhil Chopra, Miao Yu, Pratap Tokekar, Yiannis Aloimonos

    Abstract: Autonomous navigation in the underwater environment is challenging due to limited visibility, dynamic changes, and the lack of a cost-efficient accurate localization system. We introduce UIVNav, a novel end-to-end underwater navigation solution designed to drive robots over Objects of Interest (OOI) while avoiding obstacles, without relying on localization. UIVNav uses imitation learning and is in… ▽ More

    Submitted 16 April, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

  16. arXiv:2309.07981  [pdf, other

    cs.RO

    Efficiently Identifying Hotspots in a Spatially Varying Field with Multiple Robots

    Authors: Varun Suryan, Pratap Tokekar

    Abstract: In this paper, we present algorithms to identify environmental hotspots using mobile sensors. We examine two approaches: one involving a single robot and another using multiple robots coordinated through a decentralized robot system. We introduce an adaptive algorithm that does not require precise knowledge of Gaussian Processes (GPs) hyperparameters, making the modeling process more flexible. The… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  17. arXiv:2308.02698  [pdf, other

    cs.RO

    A Survey of Decision-Theoretic Approaches for Robotic Environmental Monitoring

    Authors: Yoonchang Sung, Zhiang Chen, Jnaneshwar Das, Pratap Tokekar

    Abstract: Robotics has dramatically increased our ability to gather data about our environments, creating an opportunity for the robotics and algorithms communities to collaborate on novel solutions to environmental monitoring problems. To understand a taxonomy of problems and methods in this realm, we present the first comprehensive survey of decision-theoretic approaches that enable efficient sampling of… ▽ More

    Submitted 6 November, 2023; v1 submitted 4 August, 2023; originally announced August 2023.

    Comments: 95 pages, 8 figures, Published in Foundations and Trends in Robotics

  18. arXiv:2307.04328  [pdf, other

    cs.RO cs.DM

    Where to Drop Sensors from Aerial Robots to Monitor a Surface-Level Phenomenon?

    Authors: Chak Lam Shek, Guangyao Shi, Ahmad Bilal Asghar, Pratap Tokekar

    Abstract: We consider the problem of routing a team of energy-constrained Unmanned Aerial Vehicles (UAVs) to drop unmovable sensors for monitoring a task area in the presence of stochastic wind disturbances. In prior work on mobile sensor routing problems, sensors and their carrier are one integrated platform, and sensors are assumed to be able to take measurements at exactly desired locations. By contrast,… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

  19. arXiv:2307.04004  [pdf, other

    cs.RO cs.MA

    MAP-NBV: Multi-agent Prediction-guided Next-Best-View Planning for Active 3D Object Reconstruction

    Authors: Harnaik Dhami, Vishnu D. Sharma, Pratap Tokekar

    Abstract: Next-Best View (NBV) planning is a long-standing problem of determining where to obtain the next best view of an object from, by a robot that is viewing the object. There are a number of methods for choosing NBV based on the observed part of the object. In this paper, we investigate how predicting the unobserved part helps with the efficiency of reconstructing the object. We present, Multi-Agent P… ▽ More

    Submitted 24 June, 2024; v1 submitted 8 July, 2023; originally announced July 2023.

    Comments: 8 pages, 7 figures, 1 table. Submitted to IROS 2024

  20. arXiv:2305.05519   

    cs.RO

    ProxMaP: Proximal Occupancy Map Prediction for Efficient Indoor Robot Navigation

    Authors: Vishnu Dutt Sharma, **gxi Chen, Pratap Tokekar

    Abstract: In a typical path planning pipeline for a ground robot, we build a map (e.g., an occupancy grid) of the environment as the robot moves around. While navigating indoors, a ground robot's knowledge about the environment may be limited due to occlusions. Therefore, the map will have many as-yet-unknown regions that may need to be avoided by a conservative planner. Instead, if a robot is able to corre… ▽ More

    Submitted 9 May, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: This is an incremental work over an existing arxiv submission of the author. It will be re-uploaded as a version of that work [arXiv:2203.04177]

  21. arXiv:2304.11465  [pdf, other

    cs.RO

    Pred-NBV: Prediction-guided Next-Best-View for 3D Object Reconstruction

    Authors: Harnaik Dhami, Vishnu D. Sharma, Pratap Tokekar

    Abstract: Prediction-based active perception has shown the potential to improve the navigation efficiency and safety of the robot by anticipating the uncertainty in the unknown environment. The existing works for 3D shape prediction make an implicit assumption about the partial observations and therefore cannot be used for real-world planning and do not consider the control effort for next-best-view plannin… ▽ More

    Submitted 7 August, 2023; v1 submitted 22 April, 2023; originally announced April 2023.

    Comments: 6 pages, 4 figures, 2 tables. Accepted to IROS 2023

  22. arXiv:2303.07622  [pdf, other

    cs.RO cs.AI cs.LG

    RE-MOVE: An Adaptive Policy Design for Robotic Navigation Tasks in Dynamic Environments via Language-Based Feedback

    Authors: Souradip Chakraborty, Kasun Weerakoon, Prithvi Poddar, Mohamed Elnoor, Priya Narayanan, Carl Busart, Pratap Tokekar, Amrit Singh Bedi, Dinesh Manocha

    Abstract: Reinforcement learning-based policies for continuous control robotic navigation tasks often fail to adapt to changes in the environment during real-time deployment, which may result in catastrophic failures. To address this limitation, we propose a novel approach called RE-MOVE (REquest help and MOVE on) to adapt already trained policy to real-time changes in the environment without re-training vi… ▽ More

    Submitted 17 September, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

  23. arXiv:2303.02293  [pdf, other

    cs.RO

    Data-Driven Distributionally Robust Optimal Control with State-Dependent Noise

    Authors: Rui Liu, Guangyao Shi, Pratap Tokekar

    Abstract: Distributionally Robust Optimal Control (DROC) is a technique that enables robust control in a stochastic setting when the true distribution is not known. Traditional DROC approaches require given ambiguity sets or a KL divergence bound to represent the distributional uncertainty. These may not be known a priori and may require hand-crafting. In this paper, we lift this assumption by introducing a… ▽ More

    Submitted 1 August, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

  24. arXiv:2303.01543  [pdf, other

    cs.RO cs.LG

    Decision-Oriented Learning with Differentiable Submodular Maximization for Vehicle Routing Problem

    Authors: Guangyao Shi, Pratap Tokekar

    Abstract: We study the problem of learning a function that maps context observations (input) to parameters of a submodular function (output). Our motivating case study is a specific type of vehicle routing problem, in which a team of Unmanned Ground Vehicles (UGVs) can serve as mobile charging stations to recharge a team of Unmanned Ground Vehicles (UAVs) that execute persistent monitoring tasks. {We want t… ▽ More

    Submitted 25 September, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: camera-ready version for IROS 2023

  25. arXiv:2211.16721  [pdf, other

    cs.RO cs.AI

    Where Am I Now? Dynamically Finding Optimal Sensor States to Minimize Localization Uncertainty for a Perception-Denied Rover

    Authors: Troi Williams, Po-Lun Chen, Sparsh Bhogavilli, Vaibhav Sanjay, Pratap Tokekar

    Abstract: We present DyFOS, an active perception method that dynamically finds optimal states to minimize localization uncertainty while avoiding obstacles and occlusions. We consider the scenario where a perception-denied rover relies on position and uncertainty measurements from a viewer robot to localize itself along an obstacle-filled path. The position uncertainty from the viewer's sensor is a function… ▽ More

    Submitted 25 September, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: 7 pages, 7 figures, Accepted to 2023 IEEE International Symposium on Multi-Robot & Multi-Agent Systems (MRS)

  26. arXiv:2211.04987  [pdf, other

    cs.LG cs.AI

    Interpretable Deep Reinforcement Learning for Green Security Games with Real-Time Information

    Authors: Vishnu Dutt Sharma, John P. Dickerson, Pratap Tokekar

    Abstract: Green Security Games with real-time information (GSG-I) add the real-time information about the agents' movement to the typical GSG formulation. Prior works on GSG-I have used deep reinforcement learning (DRL) to learn the best policy for the agent in such an environment without any need to store the huge number of state representations for GSG-I. However, the decision-making process of DRL method… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

  27. arXiv:2210.08107  [pdf, other

    cs.RO

    Approximation Algorithms for Robot Tours in Random Fields with Guaranteed Estimation Accuracy

    Authors: Shamak Dutta, Nils Wilde, Pratap Tokekar, Stephen L. Smith

    Abstract: We study the sample placement and shortest tour problem for robots tasked with map** environmental phenomena modeled as stationary random fields. The objective is to minimize the resources used (samples or tour length) while guaranteeing estimation accuracy. We give approximation algorithms for both problems in convex environments. These improve previously known results, both in terms of theoret… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

  28. arXiv:2209.09292  [pdf, other

    cs.RO

    D2CoPlan: A Differentiable Decentralized Planner for Multi-Robot Coverage

    Authors: Vishnu Dutt Sharma, Lifeng Zhou, Pratap Tokekar

    Abstract: Centralized approaches for multi-robot coverage planning problems suffer from the lack of scalability. Learning-based distributed algorithms provide a scalable avenue in addition to bringing data-oriented feature generation capabilities to the table, allowing integration with other learning-based approaches. To this end, we present a learning-based, differentiable distributed coverage planner (D2C… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

  29. arXiv:2209.06308  [pdf, other

    cs.RO

    Risk-aware Resource Allocation for Multiple UAVs-UGVs Recharging Rendezvous

    Authors: Ahmad Bilal Asghar, Guangyao Shi, Nare Karapetyan, James Humann, Jean-Paul Reddinger, James Dotterweich, Pratap Tokekar

    Abstract: We study a resource allocation problem for the cooperative aerial-ground vehicle routing application, in which multiple Unmanned Aerial Vehicles (UAVs) with limited battery capacity and multiple Unmanned Ground Vehicles (UGVs) that can also act as a mobile recharging stations need to jointly accomplish a mission such as persistently monitoring a set of points. Due to the limited battery capacity o… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

  30. arXiv:2206.05652  [pdf, other

    cs.LG cs.RO eess.SY

    Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies

    Authors: Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Pratap Tokekar, Dinesh Manocha

    Abstract: In this paper, we present a novel Heavy-Tailed Stochastic Policy Gradient (HT-PSG) algorithm to deal with the challenges of sparse rewards in continuous control problems. Sparse reward is common in continuous control robotics tasks such as manipulation and navigation, and makes the learning problem hard due to non-trivial estimation of value functions over the state space. This demands either rewa… ▽ More

    Submitted 12 June, 2022; originally announced June 2022.

  31. arXiv:2206.01162  [pdf, other

    cs.LG math.OC stat.ML

    Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning

    Authors: Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Brian M. Sadler, Furong Huang, Pratap Tokekar, Dinesh Manocha

    Abstract: Model-based approaches to reinforcement learning (MBRL) exhibit favorable performance in practice, but their theoretical guarantees in large spaces are mostly restricted to the setting when transition model is Gaussian or Lipschitz, and demands a posterior estimate whose representational complexity grows unbounded with time. In this work, we develop a novel MBRL method (i) which relaxes the assump… ▽ More

    Submitted 4 May, 2023; v1 submitted 2 June, 2022; originally announced June 2022.

  32. arXiv:2204.04767  [pdf, other

    cs.RO eess.SY

    Risk-aware UAV-UGV Rendezvous with Chance-Constrained Markov Decision Process

    Authors: Guangyao Shi, Nare Karapetyan, Ahmad Bilal Asghar, Jean-Paul Reddinger, James Dotterweich, James Humann, Pratap Tokekar

    Abstract: We study a chance-constrained variant of the cooperative aerial-ground vehicle routing problem, in which an Unmanned Aerial Vehicle (UAV) with limited battery capacity and an Unmanned Ground Vehicle (UGV) that can also act as a mobile recharging station need to jointly accomplish a mission such as monitoring a set of points. Due to the limited battery capacity of the UAV, two vehicles sometimes ha… ▽ More

    Submitted 10 April, 2022; originally announced April 2022.

  33. arXiv:2203.04177  [pdf, other

    cs.RO

    ProxMaP: Proximal Occupancy Map Prediction for Efficient Indoor Robot Navigation

    Authors: Vishnu Dutt Sharma, **gxi Chen, Pratap Tokekar

    Abstract: Planning a path for a mobile robot typically requires building a map (e.g., an occupancy grid) of the environment as the robot moves around. While navigating in an unknown environment, the map built by the robot online may have many as-yet-unknown regions. A conservative planner may avoid such regions taking a longer time to navigate to the goal. Instead, if a robot is able to correctly predict th… ▽ More

    Submitted 2 August, 2023; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: Accepted at IROS 2023

  34. arXiv:2201.12332  [pdf, other

    cs.LG cs.AI math.OC

    On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces

    Authors: Amrit Singh Bedi, Souradip Chakraborty, Anjaly Parayil, Brian Sadler, Pratap Tokekar, Alec Koppel

    Abstract: We focus on parameterized policy search for reinforcement learning over continuous action spaces. Typically, one assumes the score function associated with a policy is bounded, which fails to hold even for Gaussian policies. To properly address this issue, one must introduce an exploration tolerance parameter to quantify the region in which it is bounded. Doing so incurs a persistent bias that app… ▽ More

    Submitted 30 January, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

  35. arXiv:2112.09203  [pdf, other

    cs.RO

    Intermittent Deployment for Large-Scale Multi-Robot Forage Perception: Data Synthesis, Prediction, and Planning

    Authors: Jun Liu, Murtaza Rangwala, Kulbir Singh Ahluwalia, Shayan Ghajar, Harnaik Singh Dhami, Pratap Tokekar, Benjamin F. Tracy, Ryan K. Williams

    Abstract: Monitoring the health and vigor of grasslands is vital for informing management decisions to optimize rotational grazing in agriculture applications. To take advantage of forage resources and improve land productivity, we require knowledge of pastureland growth patterns that is simply unavailable at state of the art. In this paper, we propose to deploy a team of robots to monitor the evolution of… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: 21 pages, 26 figures, submitted to IEEE Transactions on Automation Science and Engineering

  36. arXiv:2109.06831  [pdf, other

    cs.RO

    Multi-Agent Deep Reinforcement Learning For Persistent Monitoring With Sensing, Communication, and Localization Constraints

    Authors: Manav Mishra, Prithvi Poddar, Rajat Agarwal, **gxi Chen, Pratap Tokekar, P. B. Sujit

    Abstract: Determining multi-robot motion policies for persistently monitoring a region with limited sensing, communication, and localization constraints in non-GPS environments is a challenging problem. To take the localization constraints into account, in this paper, we consider a heterogeneous robotic system consisting of two types of agents: anchor agents with accurate localization capability and auxilia… ▽ More

    Submitted 14 May, 2023; v1 submitted 14 September, 2021; originally announced September 2021.

  37. arXiv:2105.08601  [pdf, other

    cs.RO cs.AI cs.LG cs.MA

    Graph Neural Networks for Decentralized Multi-Robot Submodular Action Selection

    Authors: Lifeng Zhou, Vishnu D. Sharma, Qingbiao Li, Amanda Prorok, Alejandro Ribeiro, Pratap Tokekar, Vijay Kumar

    Abstract: The problem of decentralized multi-robot target tracking asks for jointly selecting actions, e.g., motion primitives, for the robots to maximize target tracking performance with local communications. One major challenge for practical implementations is to make target tracking approaches scalable for large-scale problem instances. In this work, we propose a general-purpose learning architecture tow… ▽ More

    Submitted 14 September, 2022; v1 submitted 18 May, 2021; originally announced May 2021.

  38. arXiv:2105.07305  [pdf, other

    cs.RO cs.DS cs.GT cs.MA

    Distributed Resilient Submodular Action Selection in Adversarial Environments

    Authors: Jun Liu, Lifeng Zhou, Pratap Tokekar, Ryan K. Williams

    Abstract: In this letter, we consider a distributed submodular maximization problem for multi-robot systems when attacked by adversaries. One of the major challenges for multi-robot systems is to increase resilience against failures or attacks. This is particularly important for distributed systems under attack as there is no central point of command that can detect, mitigate, and recover from attacks. Inst… ▽ More

    Submitted 15 May, 2021; originally announced May 2021.

    Journal ref: IEEE Robotics and Automation Letters, 2021

  39. Multi-Robot Coordination and Planning in Uncertain and Adversarial Environments

    Authors: Lifeng Zhou, Pratap Tokekar

    Abstract: Deploying a team of robots that can carefully coordinate their actions can make the entire system robust to individual failures. In this report, we review recent algorithmic development in making multi-robot systems robust to environmental uncertainties, failures, and adversarial attacks. We find the following three trends in the recent research in the area of multi-robot coordination: (1) resil… ▽ More

    Submitted 2 May, 2021; originally announced May 2021.

    Journal ref: Current Robotics Reports, 2021

  40. arXiv:2104.11709  [pdf, other

    cs.RO

    Risk-Aware Path Planning for Ground Vehicles using Occluded Aerial Images

    Authors: Vishnu Dutt Sharma, Pratap Tokekar

    Abstract: We consider scenarios where a ground vehicle plans its path using data gathered by an aerial vehicle. In the aerial images, navigable areas of the scene may be occluded due to obstacles. Naively planning paths using aerial images may result in longer paths as a conservative planner may try to avoid regions that are occluded. We propose a modular, deep learning-based framework that allows the robot… ▽ More

    Submitted 24 April, 2022; v1 submitted 23 April, 2021; originally announced April 2021.

  41. Multi-robot Symmetric Rendezvous Search on the Line

    Authors: Deniz Ozsoyeller, Pratap Tokekar

    Abstract: We study the Symmetric Rendezvous Search Problem for a multi-robot system. There are $n>2$ robots arbitrarily located on a line. Their goal is to meet somewhere on the line as quickly as possible. The robots do not know the initial location of any of the other robots or their own positions on the line. The symmetric version of the problem requires the robots to execute the same search strategy to… ▽ More

    Submitted 28 January, 2021; v1 submitted 13 January, 2021; originally announced January 2021.

  42. GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection

    Authors: Harnaik Dhami, Kevin Yu, Troi Williams, Vineeth Vajipey, Pratap Tokekar

    Abstract: We study the problem of visual surface inspection of a bridge for defects using an Unmanned Aerial Vehicle (UAV). We do not assume that the geometric model of the bridge is known beforehand. Our planner, termed GATSBI, plans a path in a receding horizon fashion to inspect all points on the surface of the bridge. The input to GATSBI consists of a 3D occupancy map created online with LiDAR scans. Oc… ▽ More

    Submitted 24 June, 2024; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: 8 pages, 12 figures, 2 tables. Accepted to ICUAS 2023

  43. arXiv:2011.01476  [pdf, other

    cs.RO

    Communication-Aware Multi-robot Coordination with Submodular Maximization

    Authors: Guangyao Shi, Ishat E Rabban, Lifeng Zhou, Pratap Tokekar

    Abstract: Submodular maximization has been widely used in many multi-robot task planning problems including information gathering, exploration, and target tracking. However, the interplay between submodular maximization and communication is rarely explored in the multi-robot setting. In many cases, maximizing the submodular objective may drive the robots in a way so as to disconnect the communication networ… ▽ More

    Submitted 8 April, 2021; v1 submitted 2 November, 2020; originally announced November 2020.

    Comments: accepted to ICRA2021

  44. arXiv:2011.01129  [pdf, other

    cs.RO cs.AI cs.LG

    Multi-Agent Reinforcement Learning for Visibility-based Persistent Monitoring

    Authors: **gxi Chen, Amrish Baskaran, Zhongshun Zhang, Pratap Tokekar

    Abstract: The Visibility-based Persistent Monitoring (VPM) problem seeks to find a set of trajectories (or controllers) for robots to persistently monitor a changing environment. Each robot has a sensor, such as a camera, with a limited field-of-view that is obstructed by obstacles in the environment. The robots may need to coordinate with each other to ensure no point in the environment is left unmonitored… ▽ More

    Submitted 6 October, 2021; v1 submitted 2 November, 2020; originally announced November 2020.

    Comments: Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021

  45. arXiv:2011.01095  [pdf, other

    cs.RO

    Risk-Aware Submodular Optimization for Multi-objective Travelling Salesperson Problem

    Authors: Rishab Balasubramanian, Lifeng Zhou, Pratap Tokekar, P. B. Sujit

    Abstract: We introduce a risk-aware multi-objective Traveling Salesperson Problem (TSP) variant, where the robot tour cost and tour reward have to be optimized simultaneously. The robot obtains reward along the edges in the graph. We study the case where the rewards and the costs exhibit diminishing marginal gains, i.e., are submodular. Unlike prior work, we focus on the scenario where the costs and the rew… ▽ More

    Submitted 21 September, 2021; v1 submitted 2 November, 2020; originally announced November 2020.

    Comments: 7 pages

    MSC Class: 68; 90; 41 ACM Class: I.2.9

  46. arXiv:2011.00685  [pdf, other

    cs.RO

    Fast Biconnectivity Restoration in Multi-Robot Systems for Robust Communication Maintenance

    Authors: Md Ishat-E-Rabban, Guangyao Shi, Pratap Tokekar

    Abstract: Maintaining a robust communication network plays an important role in the success of a multi-robot team jointly performing an optimization task. A key characteristic of a robust multi-robot system is the ability to repair the communication topology itself in the case of robot failure. In this paper, we focus on the Fast Biconnectivity Restoration (FBR) problem, which aims to repair a connected net… ▽ More

    Submitted 8 April, 2024; v1 submitted 1 November, 2020; originally announced November 2020.

    Comments: updated author affiliation, fixed typos, added references

  47. arXiv:2007.03501  [pdf, other

    cs.RO

    Coverage of an Environment Using Energy-Constrained Unmanned Aerial Vehicles

    Authors: Kevin Yu, Jason M. O'Kane, Pratap Tokekar

    Abstract: We study the problem of covering an environment using an Unmanned Aerial Vehicle (UAV) with limited battery capacity. We consider a scenario where the UAV can land on an Unmanned Ground Vehicle (UGV) and recharge the onboard battery. The UGV can also recharge the UAV while transporting the UAV to the next take-off site. We present an algorithm to solve a new variant of the area coverage problem th… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: 18 pages, 12 figures

  48. arXiv:2007.02204  [pdf, other

    cs.RO

    Failure-Resilient Coverage Maximization with Multiple Robots

    Authors: Ishat E Rabban, Pratap Tokekar

    Abstract: The task of maximizing coverage using multiple robots has several applications such as surveillance, exploration, and environmental monitoring. A major challenge of deploying such multi-robot systems in a practical scenario is to ensure resilience against robot failures. A recent work introduced the Resilient Coverage Maximization (RCM) problem where the goal is to maximize a submodular coverage u… ▽ More

    Submitted 4 March, 2021; v1 submitted 4 July, 2020; originally announced July 2020.

    Comments: 8 pages, 9 figures, 2 algorithms

  49. arXiv:2007.00100  [pdf, other

    cs.RO cs.MA

    Robust Multi-Agent Task Assignment in Failure-Prone and Adversarial Environments

    Authors: Russell Schwartz, Pratap Tokekar

    Abstract: The problem of assigning agents to tasks is a central computational challenge in many multi-agent autonomous systems. However, in the real world, agents are not always perfect and may fail due to a number of reasons. A motivating application is where the agents are robots that operate in the physical world and are susceptible to failures. This paper studies the problem of Robust Multi-Agent Task A… ▽ More

    Submitted 30 June, 2020; originally announced July 2020.

    Comments: 6 pages, 3 figures, 3 algorithms; submitted to the Workshop on Heterogeneous Multi-Robot Task Allocation and Coordination (RSS 2020)

  50. arXiv:2004.06856  [pdf, other

    cs.RO cs.AI

    Combining Geometric and Information-Theoretic Approaches for Multi-Robot Exploration

    Authors: Aravind Preshant Premkumar, Kevin Yu, Pratap Tokekar

    Abstract: We present an algorithm to explore an orthogonal polygon using a team of $p$ robots. This algorithm combines ideas from information-theoretic exploration algorithms and computational geometry based exploration algorithms. We show that the exploration time of our algorithm is competitive (as a function of $p$) with respect to the offline optimal exploration algorithm. The algorithm is based on a si… ▽ More

    Submitted 14 April, 2020; originally announced April 2020.