Skip to main content

Showing 1–6 of 6 results for author: Sharon, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2210.10999  [pdf, other

    cs.LG cs.AI

    Task Phasing: Automated Curriculum Learning from Demonstrations

    Authors: Vaibhav Bajaj, Guni Sharon, Peter Stone

    Abstract: Applying reinforcement learning (RL) to sparse reward domains is notoriously challenging due to insufficient guiding signals. Common RL techniques for addressing such domains include (1) learning from demonstrations and (2) curriculum learning. While these two approaches have been studied in detail, they have rarely been considered together. This paper aims to do so by introducing a principled tas… ▽ More

    Submitted 27 March, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: 7 pages main paper, 7 figures, 4 pages appendix. Submitted to AAAI 2023 Conference

  2. arXiv:2209.09446  [pdf, other

    cs.LG cs.AI

    A Joint Imitation-Reinforcement Learning Framework for Reduced Baseline Regret

    Authors: Sheelabhadra Dey, Sumedh Pendurkar, Guni Sharon, Josiah P. Hanna

    Abstract: In various control task domains, existing controllers provide a baseline level of performance that -- though possibly suboptimal -- should be maintained. Reinforcement learning (RL) algorithms that rely on extensive exploration of the state and action space can be used to optimize a control policy. However, fully exploratory RL algorithms may decrease performance below a baseline level during trai… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021

  3. arXiv:2209.03393  [pdf, other

    cs.AI cs.LG

    The (Un)Scalability of Heuristic Approximators for NP-Hard Search Problems

    Authors: Sumedh Pendurkar, Taoan Huang, Sven Koenig, Guni Sharon

    Abstract: The A* algorithm is commonly used to solve NP-hard combinatorial optimization problems. When provided with a completely informed heuristic function, A* solves many NP-hard minimum-cost path problems in time polynomial in the branching factor and the number of edges in a minimum-cost path. Thus, approximating their completely informed heuristic functions with high precision is NP-hard. We therefore… ▽ More

    Submitted 7 December, 2022; v1 submitted 7 September, 2022; originally announced September 2022.

    Comments: 10 pages, 5 figures

  4. arXiv:1912.11023  [pdf, other

    cs.LG stat.ML

    Learning an Interpretable Traffic Signal Control Policy

    Authors: James Ault, Josiah P. Hanna, Guni Sharon

    Abstract: Signalized intersections are managed by controllers that assign right of way (green, yellow, and red lights) to non-conflicting directions. Optimizing the actuation policy of such controllers is expected to alleviate traffic congestion and its adverse impact. Given such a safety-critical domain, the affiliated actuation policy is required to be interpretable in a way that can be understood and reg… ▽ More

    Submitted 26 February, 2020; v1 submitted 23 December, 2019; originally announced December 2019.

  5. arXiv:1709.09569  [pdf, other

    cs.MA cs.AI

    Traffic Optimization For a Mixture of Self-interested and Compliant Agents

    Authors: Guni Sharon, Michael Albert, Tarun Rambha, Stephen Boyles, Peter Stone

    Abstract: This paper focuses on two commonly used path assignment policies for agents traversing a congested network: self-interested routing, and system-optimum routing. In the self-interested routing policy each agent selects a path that optimizes its own utility, while the system-optimum routing agents are assigned paths with the goal of maximizing system performance. This paper considers a scenario wher… ▽ More

    Submitted 27 September, 2017; originally announced September 2017.

  6. arXiv:1702.05515  [pdf, other

    cs.AI cs.MA cs.RO

    Overview: Generalizations of Multi-Agent Path Finding to Real-World Scenarios

    Authors: Hang Ma, Sven Koenig, Nora Ayanian, Liron Cohen, Wolfgang Hoenig, T. K. Satish Kumar, Tansel Uras, Hong Xu, Craig Tovey, Guni Sharon

    Abstract: Multi-agent path finding (MAPF) is well-studied in artificial intelligence, robotics, theoretical computer science and operations research. We discuss issues that arise when generalizing MAPF methods to real-world scenarios and four research directions that address them. We emphasize the importance of addressing these issues as opposed to develo** faster methods for the standard formulation of t… ▽ More

    Submitted 17 February, 2017; originally announced February 2017.

    Comments: In IJCAI-16 Workshop on Multi-Agent Path Finding