Showing 1–6 of 6 results for author: Prajapat, M

  1. arXiv:2402.06562

    eess.SY cs.LG cs.RO math.OC

    Safe Guaranteed Exploration for Non-linear Systems

    Authors: Manish Prajapat, Johannes Köhler, Matteo Turchetta, Andreas Krause, Melanie N. Zeilinger

    Abstract: Safely exploring environments with a-priori unknown constraints is a fundamental challenge that restricts the autonomy of robots. While safety is paramount, guarantees on sufficient exploration are also crucial for ensuring autonomous task completion. To address these challenges, we propose a novel safe guaranteed exploration framework using optimal control, which achieves first-of-its-kind result…

    Submitted 9 February, 2024; originally announced February 2024.

  2. arXiv:2307.13372

    cs.LG

    Submodular Reinforcement Learning

    Authors: Manish Prajapat, Mojmír Mutný, Melanie N. Zeilinger, Andreas Krause

    Abstract: In reinforcement learning (RL), rewards of states are typically considered additive, and following the Markov assumption, they are $\textit{independent}$ of states visited previously. In many important applications, such as coverage control, experiment design and informative path planning, rewards naturally have diminishing returns, i.e., their value decreases in light of similar states visited pr…

    Submitted 24 May, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: Spotlight paper at ICLR 2024

  3. arXiv:2210.06380

    cs.LG cs.AI cs.MA cs.RO math.OC

    Near-Optimal Multi-Agent Learning for Safe Coverage Control

    Authors: Manish Prajapat, Matteo Turchetta, Melanie N. Zeilinger, Andreas Krause

    Abstract: In multi-agent coverage control problems, agents navigate their environment to reach locations that maximize the coverage of some density. In practice, the density is rarely known $\textit{a priori}$, further complicating the original NP-hard problem. Moreover, in many applications, agents cannot visit arbitrary locations due to $\textit{a priori}$ unknown safety constraints. In this paper, we aim…

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted at NeurIPS 2022

  4. arXiv:2006.10611

    cs.LG cs.GT cs.MA stat.ML

    Competitive Policy Optimization

    Authors: Manish Prajapat, Kamyar Azizzadenesheli, Alexander Liniger, Yisong Yue, Anima Anandkumar

    Abstract: A core challenge in policy optimization in competitive Markov decision processes is the design of efficient optimization methods with desirable convergence and stability properties. To tackle this, we propose competitive policy optimization (CoPO), a novel policy gradient approach that exploits the game-theoretic nature of competitive games to derive policy updates. Motivated by the competitive gr…

    Submitted 18 June, 2020; originally announced June 2020.

    Comments: 11 pages main paper, 6 pages references, and 31 pages appendix. 14 figures

  5. arXiv:1905.05150

    cs.RO

    AMZ Driverless: The Full Autonomous Racing System

    Authors: Juraj Kabzan, Miguel de la Iglesia Valls, Victor Reijgwart, Hubertus Franciscus Cornelis Hendrikx, Claas Ehmke, Manish Prajapat, Andreas Bühler, Nikhil Gosala, Mehak Gupta, Ramya Sivanesan, Ankit Dhall, Eugenio Chisari, Napat Karnchanachari, Sonja Brits, Manuel Dangel, Inkyu Sa, Renaud Dubé, Abel Gawel, Mark Pfeiffer, Alexander Liniger, John Lygeros, Roland Siegwart

    Abstract: This paper presents the algorithms and system architecture of an autonomous racecar. The introduced vehicle is powered by a software stack designed for robustness, reliability, and extensibility. In order to autonomously race around a previously unknown track, the proposed solution combines state of the art techniques from different fields of robotics. Specifically, perception, estimation, and con…

    Submitted 13 May, 2019; originally announced May 2019.

    Comments: 40 pages, 32 figures, submitted to Journal of Field Robotics

  6. Redundant Perception and State Estimation for Reliable Autonomous Racing

    Authors: Nikhil Bharadwaj Gosala, Andreas Bühler, Manish Prajapat, Claas Ehmke, Mehak Gupta, Ramya Sivanesan, Abel Gawel, Mark Pfeiffer, Mathias Bürki, Inkyu Sa, Renaud Dubé, Roland Siegwart

    Abstract: In autonomous racing, vehicles operate close to the limits of handling and a sensor failure can have critical consequences. To limit the impact of such failures, this paper presents the redundant perception and state estimation approaches developed for an autonomous race car. Redundancy in perception is achieved by estimating the color and position of the track delimiting objects using two sensor…

    Submitted 26 September, 2018; originally announced September 2018.

    Comments: 7 pages, 21 figures, submitted to the International Conference on Robotics and Automation 2019, for accompanying video visit https://www.youtube.com/watch?v=ir_uqEYuT84