Skip to main content

Showing 1–14 of 14 results for author: Dibangoye, J

.
  1. arXiv:2406.00054  [pdf, ps, other

    cs.GT cs.LG

    $ε$-Optimally Solving Zero-Sum POSGs

    Authors: Erwan Escudie, Matthia Sabatelli, Jilles Dibangoye

    Abstract: A recent method for solving zero-sum partially observable stochastic games (zs-POSGs) embeds the original game into a new one called the occupancy Markov game. This reformulation allows applying Bellman's principle of optimality to solve zs-POSGs. However, improving a current solution requires solving a linear program with exponentially many potential constraints, which significantly restricts the… ▽ More

    Submitted 29 May, 2024; originally announced June 2024.

  2. arXiv:2402.02954  [pdf, other

    cs.GT cs.LG

    Solving Hierarchical Information-Sharing Dec-POMDPs: An Extensive-Form Game Approach

    Authors: Johan Peralez, Aurélien Delage, Olivier Buffet, Jilles S. Dibangoye

    Abstract: A recent theory shows that a multi-player decentralized partially observable Markov decision process can be transformed into an equivalent single-player game, enabling the application of \citeauthor{bellman}'s principle of optimality to solve the single-player game by breaking it down into single-stage subgames. However, this approach entangles the decision variables of all players at each single-… ▽ More

    Submitted 9 February, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  3. arXiv:2311.09459  [pdf, other

    cs.MA

    On Convex Optimal Value Functions For POSGs

    Authors: Rafael F. Cunha, Jacopo Castellini, Johan Peralez, Jilles S. Dibangoye

    Abstract: Multi-agent planning and reinforcement learning can be challenging when agents cannot see the state of the world or communicate with each other due to communication costs, latency, or noise. Partially Observable Stochastic Games (POSGs) provide a mathematical framework for modelling such scenarios. This paper aims to improve the efficiency of planning and reinforcement learning algorithms for POSG… ▽ More

    Submitted 6 December, 2023; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: Currently under review at JAIR

    MSC Class: I.2.6; I.2.8; I.2.11

  4. arXiv:2302.06414  [pdf, other

    cs.RO cs.AI cs.CV

    LAPTNet-FPN: Multi-scale LiDAR-aided Projective Transform Network for Real Time Semantic Grid Prediction

    Authors: Manuel Alejandro Diaz-Zapata, David Sierra González, Özgür Erkent, Jilles Dibangoye, Christian Laugier

    Abstract: Semantic grids can be useful representations of the scene around an autonomous system. By having information about the layout of the space around itself, a robot can leverage this type of representation for crucial tasks such as navigation or tracking. By fusing information from multiple sensors, robustness can be increased and the computational load for the task can be lowered, achieving real tim… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: 2023 IEEE International Conference on Robotics and Automation (ICRA), IEEE Robotics and Automation Society, May 2023, London, United Kingdom

  5. arXiv:2211.14445  [pdf, other

    cs.CV cs.AI cs.RO

    LAPTNet: LiDAR-Aided Perspective Transform Network

    Authors: Manuel Alejandro Diaz-Zapata, Özgür Erkent, Christian Laugier, Jilles Dibangoye, David Sierra González

    Abstract: Semantic grids are a useful representation of the environment around a robot. They can be used in autonomous vehicles to concisely represent the scene around the car, capturing vital information for downstream tasks like navigation or collision assessment. Information from different sensors can be used to generate these grids. Some methods rely only on RGB images, whereas others choose to incorpor… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: ICARCV 2022 - 17th International Conference on Control, Automation, Robotics and Vision, Dec 2022, Singapore, Singapore

  6. arXiv:2210.14640  [pdf, other

    cs.AI

    HSVI can solve zero-sum Partially Observable Stochastic Games

    Authors: Aurélien Delage, Olivier Buffet, Jilles S. Dibangoye, Abdallah Saffidine

    Abstract: State-of-the-art methods for solving 2-player zero-sum imperfect information games rely on linear programming or regret minimization, though not on dynamic programming (DP) or heuristic search (HS), while the latter are often at the core of state-of-the-art solvers for other sequential decision-making problems. In partially observable or collaborative settings (e.g., POMDPs and Dec- POMDPs), DP an… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: 42 pages, 2 algorithms. arXiv admin note: substantial text overlap with arXiv:2110.14529

  7. arXiv:2110.14529  [pdf, other

    cs.GT cs.AI cs.LG

    HSVI for zs-POSGs using Concavity, Convexity and Lipschitz Properties

    Authors: Aurélien Delage, Olivier Buffet, Jilles Dibangoye

    Abstract: Dynamic programming and heuristic search are at the core of state-of-the-art solvers for sequential decision-making problems. In partially observable or collaborative settings (\eg, POMDPs and Dec-POMDPs), this requires introducing an appropriate statistic that induces a fully observable problem as well as bounding (convex) approximators of the optimal value function. This approach has succeeded i… ▽ More

    Submitted 15 November, 2022; v1 submitted 25 October, 2021; originally announced October 2021.

    Comments: 37 pages, 4 figures, 4 tables, 3 algorithms

  8. arXiv:2007.05270  [pdf, other

    cs.LG cs.AI stat.ML

    Learning to plan with uncertain topological maps

    Authors: Edward Beeching, Jilles Dibangoye, Olivier Simonin, Christian Wolf

    Abstract: We train an agent to navigate in 3D environments using a hierarchical strategy including a high-level graph based planner and a local policy. Our main contribution is a data driven learning based approach for planning under uncertainty in topological maps, requiring an estimate of shortest paths in valued graphs with a probabilistic structure. Whereas classical symbolic algorithms achieve optimal… ▽ More

    Submitted 10 July, 2020; originally announced July 2020.

    Comments: ECCV 2020

  9. arXiv:2006.16395  [pdf, other

    cs.AI cs.GT

    On Bellman's Optimality Principle for zs-POSGs

    Authors: Olivier Buffet, Jilles Dibangoye, Aurélien Delage, Abdallah Saffidine, Vincent Thomas

    Abstract: Many non-trivial sequential decision-making problems are efficiently solved by relying on Bellman's optimality principle, i.e., exploiting the fact that sub-problems are nested recursively within the original problem. Here we show how it can apply to (infinite horizon) 2-player zero-sum partially observable stochastic games (zs-POSGs) by (i) taking a central planner's viewpoint, which can only rea… ▽ More

    Submitted 15 November, 2022; v1 submitted 29 June, 2020; originally announced June 2020.

    Comments: 18 pages, 0 figures, 1 algorithm

    ACM Class: I.2.8

  10. arXiv:2002.02286  [pdf, other

    cs.LG cs.AI

    EgoMap: Projective map** and structured egocentric memory for Deep RL

    Authors: Edward Beeching, Christian Wolf, Jilles Dibangoye, Olivier Simonin

    Abstract: Tasks involving localization, memorization and planning in partially observable 3D environments are an ongoing challenge in Deep Reinforcement Learning. We present EgoMap, a spatially structured neural memory architecture. EgoMap augments a deep reinforcement learning agent's performance in 3D environments on challenging tasks with multi-step objectives. The EgoMap architecture incorporates severa… ▽ More

    Submitted 7 February, 2020; v1 submitted 24 January, 2020; originally announced February 2020.

  11. arXiv:1904.07802  [pdf, other

    cs.LG cs.AI cs.HC

    Learning 3D Navigation Protocols on Touch Interfaces with Cooperative Multi-Agent Reinforcement Learning

    Authors: Quentin Debard, Jilles Steeve Dibangoye, Stéphane Canu, Christian Wolf

    Abstract: Using touch devices to navigate in virtual 3D environments such as computer assisted design (CAD) models or geographical information systems (GIS) is inherently difficult for humans, as the 3D operations have to be performed by the user on a 2D touch surface. This ill-posed problem is classically solved with a fixed and handcrafted interaction protocol, which must be learned by the user. We propos… ▽ More

    Submitted 27 August, 2019; v1 submitted 16 April, 2019; originally announced April 2019.

    Comments: 17 pages, 8 figures. Accepted at The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases 2019 (ECMLPKDD 2019)

  12. arXiv:1904.01806  [pdf, other

    cs.LG stat.ML

    Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer

    Authors: Edward Beeching, Christian Wolf, Jilles Dibangoye, Olivier Simonin

    Abstract: An important goal of research in Deep Reinforcement Learning in mobile robotics is to train agents capable of solving complex tasks, which require a high level of scene understanding and reasoning from an egocentric perspective. When trained from simulations, optimal environments should satisfy a currently unobtainable combination of high-fidelity photographic observations, massive amounts of diff… ▽ More

    Submitted 3 April, 2019; originally announced April 2019.

  13. arXiv:1901.10272  [pdf, other

    cs.RO

    Multi-UAV Visual Coverage of Partially Known 3D Surfaces: Voronoi-based Initialization to Improve Local Optimizers

    Authors: Alessandro Renzaglia, Jilles Dibangoye, Vincent Le Doze, Olivier Simonin

    Abstract: In this paper we study the problem of steering a team of Unmanned Aerial Vehicles (UAVs) toward a static configuration which maximizes the visibility of a 3D environment. The UAVs are assumed to be equipped with visual sensors constrained by a maximum sensing range and the prior knowledge on the environment is considered to be very sparse. To solve this problem on-line, derivative-free measurement… ▽ More

    Submitted 30 January, 2019; v1 submitted 29 January, 2019; originally announced January 2019.

  14. arXiv:1210.4865  [pdf

    cs.AI cs.MA

    Scaling Up Decentralized MDPs Through Heuristic Search

    Authors: Jilles S. Dibangoye, Christopher Amato, Arnoud Doniec

    Abstract: Decentralized partially observable Markov decision processes (Dec-POMDPs) are rich models for cooperative decision-making under uncertainty, but are often intractable to solve optimally (NEXP-complete). The transition and observation independent Dec-MDP is a general subclass that has been shown to have complexity in NP, but optimal algorithms for this subclass are still inefficient in practice. In… ▽ More

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-217-226