Showing 1–23 of 23 results for author: Sartoretti, G

  1. arXiv:2406.16671  [pdf, other]

    cs.RO

    STAR: Swarm Technology for Aerial Robotics Research

    Authors: Jimmy Chiun, Yan Rui Tan, Yuhong Cao, John Tan, Guillaume Sartoretti

    Abstract: In recent years, the field of aerial robotics has witnessed significant progress, finding applications in diverse domains, including post-disaster search and rescue operations. Despite these strides, the prohibitive acquisition costs associated with deploying physical multi-UAV systems have posed challenges, impeding their widespread utilization in research endeavors. To overcome these challenges,…

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2405.17794  [pdf, other]

    cs.RO

    LNS2+RL: Combining Multi-agent Reinforcement Learning with Large Neighborhood Search in Multi-agent Path Finding

    Authors: Yutong Wang, Tanishq Duhan, Jiaoyang Li, Guillaume Sartoretti

    Abstract: Multi-Agent Path Finding (MAPF) is a critical component of logistics and warehouse management, which focuses on planning collision-free paths for a team of robots in a known environment. Recent work introduced a novel MAPF approach, LNS2, which proposed to repair a quickly-obtainable set of infeasible paths via iterative re-planning, by relying on a fast, yet lower-quality, priority-based planner.…

    Submitted 27 May, 2024; originally announced May 2024.

  3. arXiv:2404.17815  [pdf, other]

    cs.RO

    Learning-based Hierarchical Control: Emulating the Central Nervous System for Bio-Inspired Legged Robot Locomotion

    Authors: Ge Sun, Milad Shafiee, Peizhuo Li, Guillaume Bellegarda, Auke Ijspeert, Guillaume Sartoretti

    Abstract: Animals possess a remarkable ability to navigate challenging terrains, achieved through the interplay of various pathways between the brain, central pattern generators (CPGs) in the spinal cord, and musculoskeletal system. Traditional bioinspired control frameworks often rely on a singular control policy that models both higher (supraspinal) and spinal cord functions. In this work, we build upon o…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: Submitted to the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)

  4. arXiv:2403.10833  [pdf, other]

    cs.RO

    Deep Reinforcement Learning-based Large-scale Robot Exploration

    Authors: Yuhong Cao, Rui Zhao, Yizhuo Wang, Bairan Xiang, Guillaume Sartoretti

    Abstract: In this work, we propose a deep reinforcement learning (DRL) based reactive planner to solve large-scale Lidar-based autonomous robot exploration problems in 2D action space. Our DRL-based planner allows the agent to reactively plan its exploration path by making implicit predictions about unknown areas, based on a learned estimation of the underlying transition model of the environment. To this e…

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: © 20XX IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

  5. arXiv:2310.08350  [pdf, other]

    cs.RO

    ALPHA: Attention-based Long-horizon Pathfinding in Highly-structured Areas

    Authors: Chengyang He, Tianze Yang, Tanishq Duhan, Yutong Wang, Guillaume Sartoretti

    Abstract: The multi-agent pathfinding (MAPF) problem seeks collision-free paths for a team of agents from their current positions to their pre-set goals in a known environment, and is an essential problem found at the core of many logistics, transportation, and general robotics applications. Existing learning-based MAPF approaches typically only let each agent make decisions based on a limited field-of-view…

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: Submitted to the IEEE International Conference on Robotics and Automation (ICRA 2024)

  6. arXiv:2310.05714  [pdf, other]

    cs.RO

    DecAP: Decaying Action Priors for Accelerated Learning of Torque-Based Legged Locomotion Policies

    Authors: Shivam Sood, Ge Sun, Peizhuo Li, Guillaume Sartoretti

    Abstract: Optimal Control for legged robots has gone through a paradigm shift from position-based to torque-based control, owing to the latter's compliant and robust nature. In parallel to this shift, the community has also turned to Deep Reinforcement Learning (DRL) as a promising approach to directly learn locomotion policies for complex real-life tasks. However, most end-to-end DRL approaches still opera…

    Submitted 31 March, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Submitted to the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)

  7. arXiv:2305.16145  [pdf, other]

    cs.LG

    SocialLight: Distributed Cooperation Learning towards Network-Wide Traffic Signal Control

    Authors: Harsh Goel, Yifeng Zhang, Mehul Damani, Guillaume Sartoretti

    Abstract: Many recent works have turned to multi-agent reinforcement learning (MARL) for adaptive traffic signal control to optimize the travel time of vehicles over large urban networks. However, achieving effective and scalable cooperation among junctions (agents) remains an open challenge, as existing methods often rely on extensive, non-generalizable reward shaping or on non-scalable centralized learnin…

    Submitted 20 April, 2023; originally announced May 2023.

    Comments: To appear in the International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2023)

  8. arXiv:2303.16865  [pdf, other]

    cs.RO

    Legged Robots for Object Manipulation: A Review

    Authors: Yifeng Gong, Ge Sun, Aditya Nair, Aditya Bidwai, Raghuram CS, John Grezmak, Guillaume Sartoretti, Kathryn A. Daltorio

    Abstract: Legged robots can have a unique role in manipulating objects in dynamic, human-centric, or otherwise inaccessible environments. Although most legged robotics research to date typically focuses on traversing these challenging environments, many legged platform demonstrations have also included "moving an object" as a way of doing tangible work. Legged robots can be designed to manipulate a particul…

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: Preprint of the paper submitted to Frontiers in Mechanical Engineering

  9. arXiv:2303.06350  [pdf, other]

    cs.RO

    Spatio-Temporal Attention Network for Persistent Monitoring of Multiple Mobile Targets

    Authors: Yizhuo Wang, Yutong Wang, Yuhong Cao, Guillaume Sartoretti

    Abstract: This work focuses on the persistent monitoring problem, where a set of targets moving based on an unknown model must be monitored by an autonomous mobile robot with a limited sensing range. To keep each target's position estimate as accurate as possible, the robot needs to adaptively plan its path to (re-)visit all the targets and update its belief from measurements collected along the way. In doi…

    Submitted 11 March, 2023; originally announced March 2023.

    Comments: Submitted to the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023)

  10. arXiv:2303.05351  [pdf, other]

    cs.RO

    Intent-based Deep Reinforcement Learning for Multi-agent Informative Path Planning

    Authors: Tianze Yang, Yuhong Cao, Guillaume Sartoretti

    Abstract: In multi-agent informative path planning (MAIPP), agents must collectively construct a global belief map of an underlying distribution of interest (e.g., gas concentration, light intensity, or pollution levels) over a given domain, based on measurements taken along their trajectory. They must frequently replan their path to balance the exploration of new areas with the exploitation of known high-i…

    Submitted 24 October, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

    Comments: © 20XX IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

  11. arXiv:2303.00605  [pdf, other]

    cs.RO

    SCRIMP: Scalable Communication for Reinforcement- and Imitation-Learning-Based Multi-Agent Pathfinding

    Authors: Yutong Wang, Bairan Xiang, Shinan Huang, Guillaume Sartoretti

    Abstract: Trading off performance guarantees in favor of scalability, the Multi-Agent Path Finding (MAPF) community has recently started to embrace Multi-Agent Reinforcement Learning (MARL), where agents learn to collaboratively generate individual, collision-free (but often suboptimal) paths. Scalability is usually achieved by assuming a local field of view (FOV) around the agents, helping scale to arbitra…

    Submitted 31 August, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: © 20XX IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

  12. arXiv:2301.11575  [pdf, other]

    cs.RO

    ARiADNE: A Reinforcement learning approach using Attention-based Deep Networks for Exploration

    Authors: Yuhong Cao, Tianxiang Hou, Yizhuo Wang, Xian Yi, Guillaume Sartoretti

    Abstract: In autonomous robot exploration tasks, a mobile robot needs to actively explore and map an unknown environment as fast as possible. Since the environment is being revealed during exploration, the robot needs to frequently re-plan its path online, as new information is acquired by onboard sensors and used to update its partial map. While state-of-the-art exploration planners are frontier- and sampl…

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: © 20XX IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

  13. arXiv:2204.03516  [pdf, other]

    cs.RO cs.AI cs.LG cs.MA

    Distributed Reinforcement Learning for Robot Teams: A Review

    Authors: Yutong Wang, Mehul Damani, Pamela Wang, Yuhong Cao, Guillaume Sartoretti

    Abstract: Purpose of review: Recent advances in sensing, actuation, and computation have opened the door to multi-robot systems consisting of hundreds/thousands of robots, with promising applications to automated manufacturing, disaster relief, harvesting, last-mile delivery, port/airport operations, or search and rescue. The community has leveraged model-free multi-agent reinforcement learning (MARL) to de…

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: Preprint of the paper submitted to Springer's Current Robotics Reports

  14. arXiv:2201.11994  [pdf, other]

    cs.RO cs.AI cs.LG cs.MA

    FCMNet: Full Communication Memory Net for Team-Level Cooperation in Multi-Agent Systems

    Authors: Yutong Wang, Guillaume Sartoretti

    Abstract: Decentralized cooperation in partially-observable multi-agent systems requires effective communications among agents. To support this effort, this work focuses on the class of problems where global communications are available but may be unreliable, thus precluding differentiable communication learning methods. We introduce FCMNet, a reinforcement learning based approach that allows agents to simu…

    Submitted 31 January, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: To appear in the International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2022)

  15. A general locomotion control framework for multi-legged locomotors

    Authors: Baxi Chong, Yasemin O. Aydin, Jennifer M. Rieser, Guillaume Sartoretti, Tianyu Wang, Julian Whitman, Abdul Kaba, Enes Aydin, Ciera McFarland, Kelimar Diaz Cruz, Jeffery W. Rankin, Krijn B Michel, Alfredo Nicieza, John R Hutchinson, Howie Choset, Daniel I. Goldman

    Abstract: Serially connected robots are promising candidates for performing tasks in confined spaces such as search-and-rescue in large-scale disasters. Such robots are typically limbless, and we hypothesize that the addition of limbs could improve mobility. However, a challenge in designing and controlling such devices lies in the coordination of high-dimensional redundant modules in a way that improves mo…

    Submitted 3 February, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

  16. arXiv:2109.04205  [pdf, other]

    cs.RO cs.AI cs.MA

    DAN: Decentralized Attention-based Neural Network for the MinMax Multiple Traveling Salesman Problem

    Authors: Yuhong Cao, Zhanhong Sun, Guillaume Sartoretti

    Abstract: The multiple traveling salesman problem (mTSP) is a well-known NP-hard problem with numerous real-world applications. In particular, this work addresses MinMax mTSP, where the objective is to minimize the max tour length among all agents. Many robotic deployments require recomputing potentially large mTSP instances frequently, making the natural trade-off between computing time and solution qualit…

    Submitted 7 July, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: Submitted to the 16th International Symposium on Distributed Autonomous Robotic Systems (DARS 2022)

  17. arXiv:2103.16511  [pdf, other]

    cs.AI cs.LG

    Flatland Competition 2020: MAPF and MARL for Efficient Train Coordination on a Grid World

    Authors: Florian Laurent, Manuel Schneider, Christian Scheller, Jeremy Watson, Jiaoyang Li, Zhe Chen, Yi Zheng, Shao-Hung Chan, Konstantin Makhnev, Oleg Svidchenko, Vladimir Egorov, Dmitry Ivanov, Aleksei Shpilman, Evgenija Spirovska, Oliver Tanevski, Aleksandar Nikov, Ramon Grunder, David Galevski, Jakov Mitrovski, Guillaume Sartoretti, Zhiyao Luo, Mehul Damani, Nilabha Bhattacharya, Shivam Agarwal, Adrian Egli, et al. (2 additional authors not shown)

    Abstract: The Flatland competition aimed at finding novel approaches to solve the vehicle re-scheduling problem (VRSP). The VRSP is concerned with scheduling trips in traffic networks and the re-scheduling of vehicles when disruptions occur, for example the breakdown of a vehicle. While solving the VRSP in various settings has been an active area in operations research (OR) for decades, the ever-growing com…

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: 28 pages, 8 figures

  18. arXiv:2012.05893  [pdf, other]

    cs.AI

    Flatland-RL: Multi-Agent Reinforcement Learning on Trains

    Authors: Sharada Mohanty, Erik Nygren, Florian Laurent, Manuel Schneider, Christian Scheller, Nilabha Bhattacharya, Jeremy Watson, Adrian Egli, Christian Eichenberger, Christian Baumberger, Gereon Vienken, Irene Sturm, Guillaume Sartoretti, Giacomo Spigler

    Abstract: Efficient automated scheduling of trains remains a major challenge for modern railway systems. The underlying vehicle rescheduling problem (VRSP) has been a major focus of Operations Research (OR) for decades. Traditional approaches use complex simulators to study VRSP, where experimenting with a broad range of novel ideas is time consuming and has a huge computational overhead. In this paper, w…

    Submitted 11 December, 2020; v1 submitted 10 December, 2020; originally announced December 2020.

  19. PRIMAL2: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Lifelong

    Authors: Mehul Damani, Zhiyao Luo, Emerson Wenzel, Guillaume Sartoretti

    Abstract: Multi-agent path finding (MAPF) is an indispensable component of large-scale robot deployments in numerous domains ranging from airport management to warehouse automation. In particular, this work addresses lifelong MAPF (LMAPF) - an online variant of the problem where agents are immediately assigned a new goal upon reaching their current one - in dense and highly structured environments, typical…

    Submitted 4 March, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

    Comments: © 20XX IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

  20. arXiv:2006.08152  [pdf, other]

    cs.RO

    ForMIC: Foraging via Multiagent RL with Implicit Communication

    Authors: Samuel Shaw, Emerson Wenzel, Alexis Walker, Guillaume Sartoretti

    Abstract: Multi-agent foraging (MAF) involves distributing a team of agents to search an environment and extract resources from it. Nature provides several examples of highly effective foragers, where individuals within the foraging collective use biological markers (e.g., pheromones) to communicate critical information to others via the environment. In this work, we propose ForMIC, a distributed reinforcem…

    Submitted 12 February, 2022; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: © 20XX IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

  21. Distributed Learning of Decentralized Control Policies for Articulated Mobile Robots

    Authors: Guillaume Sartoretti, William Paivine, Yunfei Shi, Yue Wu, Howie Choset

    Abstract: State-of-the-art distributed algorithms for reinforcement learning rely on multiple independent agents, which simultaneously learn in parallel environments while asynchronously updating a common, shared policy. Moreover, decentralized control architectures (e.g., CPGs) can coordinate spatially distributed portions of an articulated robot to achieve system-level objectives. In this work, we investi…

    Submitted 9 June, 2019; v1 submitted 24 January, 2019; originally announced January 2019.

    Comments: © 20XX IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

  22. PRIMAL: Pathfinding via Reinforcement and Imitation Multi-Agent Learning

    Authors: Guillaume Sartoretti, Justin Kerr, Yunfei Shi, Glenn Wagner, T. K. Satish Kumar, Sven Koenig, Howie Choset

    Abstract: Multi-agent path finding (MAPF) is an essential component of many large-scale, real-world robot deployments, from aerial swarms to warehouse automation. However, despite the community's continued efforts, most state-of-the-art MAPF planners still rely on centralized planning and scale poorly past a few hundred agents. Such planning approaches are maladapted to real-world deployments, where noise a…

    Submitted 20 February, 2019; v1 submitted 10 September, 2018; originally announced September 2018.

    Comments: © 20XX IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

  23. arXiv:1803.01446  [pdf, other]

    cs.RO

    Learning to Sequence Robot Behaviors for Visual Navigation

    Authors: Hadi Salman, Puneet Singhal, Tanmay Shankar, Peng Yin, Ali Salman, William Paivine, Guillaume Sartoretti, Matthew Travers, Howie Choset

    Abstract: Recent literature in the robotics community has focused on learning robot behaviors that abstract out lower-level details of robot control. To fully leverage the efficacy of such behaviors, it is necessary to select and sequence them to achieve a given task. In this paper, we present an approach to both learn and sequence robot behaviors, applied to the problem of visual navigation of mobile robot…

    Submitted 25 March, 2018; v1 submitted 4 March, 2018; originally announced March 2018.