Skip to main content

Showing 1–50 of 52 results for author: Gombolay, M

.
  1. arXiv:2406.05003  [pdf, other

    cs.RO cs.HC

    Designs for Enabling Collaboration in Human-Machine Teaming via Interactive and Explainable Systems

    Authors: Rohan Paleja, Michael Munje, Kimberlee Chang, Reed Jensen, Matthew Gombolay

    Abstract: Collaborative robots and machine learning-based virtual agents are increasingly entering the human workspace with the aim of increasing productivity and enhancing safety. Despite this, we show in a ubiquitous experimental domain, Overcooked-AI, that state-of-the-art techniques for human-machine teaming (HMT), which rely on imitation or reinforcement learning, are brittle and result in a machine ag… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  2. arXiv:2403.16178  [pdf, other

    cs.RO cs.AI

    Mixed-Initiative Human-Robot Teaming under Suboptimality with Online Bayesian Adaptation

    Authors: Manisha Natarajan, Chunyue Xue, Sanne van Waveren, Karen Feigh, Matthew Gombolay

    Abstract: For effective human-agent teaming, robots and other artificial intelligence (AI) agents must infer their human partner's abilities and behavioral response patterns and adapt accordingly. Most prior works make the unrealistic assumption that one or more teammates can act near-optimally. In real-world collaboration, humans and autonomous agents can be suboptimal, especially when each only has partia… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: 8 pages, 4 pages for supplementary

  3. arXiv:2403.10809  [pdf, other

    cs.RO

    Efficient Trajectory Forecasting and Generation with Conditional Flow Matching

    Authors: Sean Ye, Matthew Gombolay

    Abstract: Trajectory prediction and generation are vital for autonomous robots navigating dynamic environments. While prior research has typically focused on either prediction or generation, our approach unifies these tasks to provide a versatile framework and achieve state-of-the-art performance. Diffusion models, which are currently state-of-the-art for learned trajectory generation in long-horizon planni… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  4. arXiv:2403.10794  [pdf, other

    cs.RO cs.LG cs.MA

    Diffusion-Reinforcement Learning Hierarchical Motion Planning in Adversarial Multi-agent Games

    Authors: Zixuan Wu, Sean Ye, Manisha Natarajan, Matthew C. Gombolay

    Abstract: Reinforcement Learning- (RL-)based motion planning has recently shown the potential to outperform traditional approaches from autonomous navigation to robot manipulation. In this work, we focus on a motion planning task for an evasive target in a partially observable multi-agent adversarial pursuit-evasion games (PEG). These pursuit-evasion problems are relevant to various applications, such as se… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: This work has been submitted to the IEEE Robotics and Automation Letters (RA-L) for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  5. arXiv:2401.17185  [pdf, other

    cs.RO cs.CV

    Multi-Camera Asynchronous Ball Localization and Trajectory Prediction with Factor Graphs and Human Poses

    Authors: Qingyu Xiao, Zulfiqar Zaidi, Matthew Gombolay

    Abstract: The rapid and precise localization and prediction of a ball are critical for develo** agile robots in ball sports, particularly in sports like tennis characterized by high-speed ball movements and powerful spins. The Magnus effect induced by spin adds complexity to trajectory prediction during flight and bounce dynamics upon contact with the ground. In this study, we introduce an innovative appr… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted by ICRA 2024

  6. arXiv:2311.10041  [pdf, other

    cs.RO

    Interpretable Reinforcement Learning for Robotics and Continuous Control

    Authors: Rohan Paleja, Letian Chen, Yaru Niu, Andrew Silva, Zhaoxin Li, Songan Zhang, Chace Ritchie, Sugju Choi, Kimberlee Chestnut Chang, Hongtei Eric Tseng, Yan Wang, Subramanya Nageshrao, Matthew Gombolay

    Abstract: Interpretability in machine learning is critical for the safe deployment of learned policies across legally-regulated and safety-critical domains. While gradient-based approaches in reinforcement learning have achieved tremendous success in learning policies for continuous control problems such as robotics and autonomous driving, the lack of interpretability is a fundamental barrier to adoption. W… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2202.02352

  7. arXiv:2309.17046  [pdf, other

    cs.RO

    CrossLoco: Human Motion Driven Control of Legged Robots via Guided Unsupervised Reinforcement Learning

    Authors: Tianyu Li, Hyunyoung Jung, Matthew Gombolay, Yong Kwon Cho, Sehoon Ha

    Abstract: Human motion driven control (HMDC) is an effective approach for generating natural and compelling robot motions while preserving high-level semantics. However, establishing the correspondence between humans and robots with different body structures is not straightforward due to the mismatches in kinematics and dynamics properties, which causes intrinsic ambiguity to the problem. Many previous algo… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

  8. arXiv:2307.06244  [pdf, other

    cs.RO cs.LG cs.MA

    Diffusion Models for Multi-target Adversarial Tracking

    Authors: Sean Ye, Manisha Natarajan, Zixuan Wu, Matthew Gombolay

    Abstract: Target tracking plays a crucial role in real-world scenarios, particularly in drug-trafficking interdiction, where the knowledge of an adversarial target's location is often limited. Improving autonomous tracking systems will enable unmanned aerial, surface, and underwater vehicles to better assist in interdicting smugglers that use manned surface, semi-submersible, and aerial vessels. As unmanned… ▽ More

    Submitted 12 January, 2024; v1 submitted 12 July, 2023; originally announced July 2023.

  9. arXiv:2306.11301  [pdf, other

    cs.LG cs.AI cs.RO

    Adversarial Search and Tracking with Multiagent Reinforcement Learning in Sparsely Observable Environment

    Authors: Zixuan Wu, Sean Ye, Manisha Natarajan, Letian Chen, Rohan Paleja, Matthew C. Gombolay

    Abstract: We study a search and tracking (S&T) problem where a team of dynamic search agents must collaborate to track an adversarial, evasive agent. The heterogeneous search team may only have access to a limited number of past adversary trajectories within a large search space. This problem is challenging for both model-based searching and reinforcement learning (RL) methods since the adversary exhibits r… ▽ More

    Submitted 20 October, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Accepted by IEEE International Symposium on Multi-Robot & Multi-Agent Systems (MRS) 2023

  10. arXiv:2306.11168  [pdf, other

    cs.LG cs.AI cs.MA

    Learning Models of Adversarial Agent Behavior under Partial Observability

    Authors: Sean Ye, Manisha Natarajan, Zixuan Wu, Rohan Paleja, Letian Chen, Matthew C. Gombolay

    Abstract: The need for opponent modeling and tracking arises in several real-world scenarios, such as professional sports, video game design, and drug-trafficking interdiction. In this work, we present Graph based Adversarial Modeling with Mutal Information (GrAMMI) for modeling the behavior of an adversarial opponent agent. GrAMMI is a novel graph neural network (GNN) based approach that uses mutual inform… ▽ More

    Submitted 5 July, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 8 pages, 3 figures, 2 tables

  11. The Effect of Robot Skill Level and Communication in Rapid, Proximate Human-Robot Collaboration

    Authors: Kin Man Lee, Arjun Krishna, Zulfiqar Zaidi, Rohan Paleja, Letian Chen, Erin Hedlund-Botti, Mariah Schrum, Matthew Gombolay

    Abstract: As high-speed, agile robots become more commonplace, these robots will have the potential to better aid and collaborate with humans. However, due to the increased agility and functionality of these robots, close collaboration with humans can create safety concerns that alter team dynamics and degrade task performance. In this work, we aim to enable the deployment of safe and trustworthy agile robo… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Journal ref: HRI '23: Proceedings of the 2023 ACM/IEEE International Conference on Human-Robot Interaction

  12. arXiv:2301.13279  [pdf, other

    cs.AI cs.LG cs.MA cs.RO

    Learning Coordination Policies over Heterogeneous Graphs for Human-Robot Teams via Recurrent Neural Schedule Propagation

    Authors: Batuhan Altundas, Zheyuan Wang, Joshua Bishop, Matthew Gombolay

    Abstract: As human-robot collaboration increases in the workforce, it becomes essential for human-robot teams to coordinate efficiently and intuitively. Traditional approaches for human-robot scheduling either utilize exact methods that are intractable for large-scale problems and struggle to account for stochastic, time varying human task performance, or application-specific heuristics that require expert… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: 8 pages, 2 figures, 3 Tables

    Journal ref: 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  13. arXiv:2301.08595  [pdf, other

    cs.RO cs.AI cs.HC

    MAVERIC: A Data-Driven Approach to Personalized Autonomous Driving

    Authors: Mariah L. Schrum, Emily Sumner, Matthew C. Gombolay, Andrew Best

    Abstract: Personalization of autonomous vehicles (AV) may significantly increase trust, use, and acceptance. In particular, we hypothesize that the similarity of an AV's driving style compared to the end-user's driving style will have a major impact on end-user's willingness to use the AV. To investigate the impact of driving style on user acceptance, we 1) develop a data-driven approach to personalize driv… ▽ More

    Submitted 27 February, 2023; v1 submitted 20 January, 2023; originally announced January 2023.

  14. arXiv:2301.08144  [pdf, other

    cs.IR cs.AI cs.HC

    Towards the design of user-centric strategy recommendation systems for collaborative Human-AI tasks

    Authors: Lakshita Dodeja, Pradyumna Tambwekar, Erin Hedlund-Botti, Matthew Gombolay

    Abstract: Artificial Intelligence is being employed by humans to collaboratively solve complicated tasks for search and rescue, manufacturing, etc. Efficient teamwork can be achieved by understanding user preferences and recommending different strategies for solving the particular task to humans. Prior work has focused on personalization of recommendation systems for relatively well-understood tasks in the… ▽ More

    Submitted 17 January, 2023; originally announced January 2023.

  15. arXiv:2301.05347  [pdf, other

    cs.CY cs.AI

    Towards Reconciling Usability and Usefulness of Explainable AI Methodologies

    Authors: Pradyumna Tambwekar, Matthew Gombolay

    Abstract: Interactive Artificial Intelligence (AI) agents are becoming increasingly prevalent in society. However, application of such systems without understanding them can be problematic. Black-box AI systems can lead to liability and accountability issues when they produce an incorrect decision. Explainable AI (XAI) seeks to bridge the knowledge gap, between developers and end-users, by offering insights… ▽ More

    Submitted 12 January, 2023; originally announced January 2023.

  16. arXiv:2212.14403  [pdf, other

    cs.RO

    Utilizing Human Feedback for Primitive Optimization in Wheelchair Tennis

    Authors: Arjun Krishna, Zulfiqar Zaidi, Letian Chen, Rohan Paleja, Esmaeil Seraj, Matthew Gombolay

    Abstract: Agile robotics presents a difficult challenge with robots moving at high speeds requiring precise and low-latency sensing and control. Creating agile motion that accomplishes the task at hand while being safe to execute is a key requirement for agile robots to gain human trust. This requires designing new approaches that are flexible and maintain knowledge over world constraints. In this paper, we… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

    Comments: Workshop paper at Learning for Agile Robotics Workshop, CoRL 2022

  17. arXiv:2212.02753  [pdf, other

    cs.RO cs.AI cs.LG

    Safe Inverse Reinforcement Learning via Control Barrier Function

    Authors: Yue Yang, Letian Chen, Matthew Gombolay

    Abstract: Learning from Demonstration (LfD) is a powerful method for enabling robots to perform novel tasks as it is often more tractable for a non-roboticist end-user to demonstrate the desired skill and for the robot to efficiently learn from the associated data than for a human to engineer a reward function for the robot to learn the skill via reinforcement learning (RL). Safety issues arise in modern Lf… ▽ More

    Submitted 6 March, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: 6 pages, 3 figures

  18. arXiv:2210.03766  [pdf, other

    cs.CL cs.LG

    FedPC: Federated Learning for Language Generation with Personal and Context Preference Embeddings

    Authors: Andrew Silva, Pradyumna Tambwekar, Matthew Gombolay

    Abstract: Federated learning is a training paradigm that learns from multiple distributed users without aggregating data on a centralized server. Such a paradigm promises the ability to deploy machine-learning at-scale to a diverse population of end-users without first collecting a large, labeled dataset for all possible tasks. As federated learning typically averages learning updates across a decentralized… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: Andrew Silva and Pradyumna Tambwekar contributed equally towards this work

  19. arXiv:2210.02517  [pdf, other

    cs.RO

    Athletic Mobile Manipulator System for Robotic Wheelchair Tennis

    Authors: Zulfiqar Zaidi, Daniel Martin, Nathaniel Belles, Viacheslav Zakharov, Arjun Krishna, Kin Man Lee, Peter Wagstaff, Sumedh Naik, Matthew Sklar, Sugju Choi, Yoshiki Kakehi, Ruturaj Patil, Divya Mallemadugula, Florian Pesce, Peter Wilson, Wendell Hom, Matan Diamond, Bryan Zhao, Nina Moorman, Rohan Paleja, Letian Chen, Esmaeil Seraj, Matthew Gombolay

    Abstract: Athletics are a quintessential and universal expression of humanity. From French monks who in the 12th century invented jeu de paume, the precursor to modern lawn tennis, back to the K'iche' people who played the Maya Ballgame as a form of religious expression over three thousand years ago, humans have sought to train their minds and bodies to excel in sporting contests. Advances in robotics are o… ▽ More

    Submitted 7 February, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: 8 pages, accepted at RA-L, will also be presented at IROS 2023

  20. arXiv:2209.11908  [pdf, other

    cs.LG cs.RO

    Fast Lifelong Adaptive Inverse Reinforcement Learning from Demonstrations

    Authors: Letian Chen, Sravan Jayanthi, Rohan Paleja, Daniel Martin, Viacheslav Zakharov, Matthew Gombolay

    Abstract: Learning from Demonstration (LfD) approaches empower end-users to teach robots novel tasks via demonstrations of the desired behaviors, democratizing access to robotics. However, current LfD frameworks are not capable of fast adaptation to heterogeneous human demonstrations nor the large-scale deployment in ubiquitous robotics applications. In this paper, we propose a novel LfD framework, Fast Lif… ▽ More

    Submitted 12 April, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Journal ref: Proceedings of Conference on Robot Learning (CoRL) 2022

  21. arXiv:2209.03943  [pdf, other

    cs.AI cs.HC

    The Utility of Explainable AI in Ad Hoc Human-Machine Teaming

    Authors: Rohan Paleja, Muyleng Ghuy, Nadun Ranawaka Arachchige, Reed Jensen, Matthew Gombolay

    Abstract: Recent advances in machine learning have led to growing interest in Explainable AI (xAI) to enable humans to gain insight into the decision-making of machine learning models. Despite this recent interest, the utility of xAI techniques has not yet been characterized in human-machine teaming. Importantly, xAI offers the promise of enhancing team situational awareness (SA) and shared mental model dev… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

    Comments: Part of Advances in Neural Information Processing Systems 34 (NeurIPS 2021)

  22. arXiv:2208.08374  [pdf, other

    cs.AI cs.CL cs.HC cs.LG

    A Computational Interface to Translate Strategic Intent from Unstructured Language in a Low-Data Setting

    Authors: Pradyumna Tambwekar, Lakshita Dodeja, Nathan Vaska, Wei Xu, Matthew Gombolay

    Abstract: Many real-world tasks involve a mixed-initiative setup, wherein humans and AI systems collaboratively perform a task. While significant work has been conducted towards enabling humans to specify, through language, exactly how an agent should complete a task (i.e., low-level specification), prior work lacks on interpreting the high-level strategic intent of the human commanders. Parsing strategic i… ▽ More

    Submitted 20 October, 2023; v1 submitted 17 August, 2022; originally announced August 2022.

    Comments: 19 Pages, 7 figures, 8 page appendix

  23. arXiv:2207.11569  [pdf, other

    cs.RO cs.AI cs.CV cs.CY cs.LG

    Robots Enact Malignant Stereotypes

    Authors: Andrew Hundt, William Agnew, Vicky Zeng, Severin Kacianka, Matthew Gombolay

    Abstract: Stereotypes, bias, and discrimination have been extensively documented in Machine Learning (ML) methods such as Computer Vision (CV) [18, 80], Natural Language Processing (NLP) [6], or both, in the case of large image and caption models such as OpenAI CLIP [14]. In this paper, we evaluate how ML bias manifests in robots that physically and autonomously act within the world. We audit one of several… ▽ More

    Submitted 23 July, 2022; originally announced July 2022.

    Comments: 30 pages, 10 figures, 5 tables. Website: https://sites.google.com/view/robots-enact-stereotypes . Published in the 2022 ACM Conference on Fairness, Accountability, and Transparency (FAccT 22), June 21-24, 2022, Seoul, Republic of Korea. ACM, DOI: https://doi.org/10.1145/3531146.3533138 . FAccT22 Submission dates: Abstract Dec 13, 2021; Submitted Jan 22, 2022; Accepted Apr 7, 2022

    Journal ref: In 2022 ACM Conference on Fairness, Accountability, and Transparency (FAccT 22). ACM, New York, NY, USA, 743-756

  24. arXiv:2206.10544  [pdf, other

    cs.RO cs.AI eess.SY

    Multi-UAV Planning for Cooperative Wildfire Coverage and Tracking with Quality-of-Service Guarantees

    Authors: Esmaeil Seraj, Andrew Silva, Matthew Gombolay

    Abstract: In recent years, teams of robot and Unmanned Aerial Vehicles (UAVs) have been commissioned by researchers to enable accurate, online wildfire coverage and tracking. While the majority of prior work focuses on the coordination and control of such multi-robot systems, to date, these UAV teams have not been given the ability to reason about a fire's track (i.e., location and propagation dynamics) to… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

    Comments: To appear in the journal of Autonomous Agents and Multi-Agent Systems (AAMAS)

  25. arXiv:2204.04260  [pdf, other

    cs.RO cs.HC

    Towards Cognitive Robots That People Accept in Their Home

    Authors: Nina Moorman, Erin Hedlund-Botti, Matthew Gombolay

    Abstract: It is intractable for assistive robots to have all functionalities pre-programmed prior to deployment. Rather, it is more realistic for robots to perform supplemental, on-site learning about user's needs and preferences, and particularities of the environment. This additional learning is especially helpful for care robots that assist with individualized caregiver activities in residential or assis… ▽ More

    Submitted 17 October, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: 2022 AAAI Fall Symposium on Artificial Intelligence for Human-Robot Interaction (AI-HRI 2022) workshop paper

  26. arXiv:2203.12774  [pdf, other

    cs.LG cs.AI cs.RO

    Efficient Exploration via First-Person Behavior Cloning Assisted Rapidly-Exploring Random Trees

    Authors: Max Zuo, Logan Schick, Matthew Gombolay, Nakul Gopalan

    Abstract: Modern day computer games have extremely large state and action spaces. To detect bugs in these games' models, human testers play the games repeatedly to explore the game and find errors in the games. Such gameplay is exhaustive and time consuming. Moreover, since robotics simulators depend on similar methods of model specification and debugging, the problem of finding errors in the model is of in… ▽ More

    Submitted 19 April, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: Published in HRI 2022 Workshop - MLHRC. This is a replacement to include broader citations from works in the field

  27. arXiv:2202.07014  [pdf, other

    cs.LG

    Strategy Discovery and Mixture in Lifelong Learning from Heterogeneous Demonstration

    Authors: Sravan Jayanthi, Letian Chen, Matthew Gombolay

    Abstract: Learning from Demonstration (LfD) approaches empower end-users to teach robots novel tasks via demonstrations of the desired behaviors, democratizing access to robotics. A key challenge in LfD research is that users tend to provide heterogeneous demonstrations for the same task due to various strategies and preferences. Therefore, it is essential to develop LfD algorithms that ensure \textit{flexi… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: Accepted at the AAAI-22 Workshop on Interactive Machine Learning (IML@AAAI'22)

  28. arXiv:2202.02352  [pdf, other

    cs.LG cs.RO

    Learning Interpretable, High-Performing Policies for Autonomous Driving

    Authors: Rohan Paleja, Yaru Niu, Andrew Silva, Chace Ritchie, Sugju Choi, Matthew Gombolay

    Abstract: Gradient-based approaches in reinforcement learning (RL) have achieved tremendous success in learning policies for autonomous vehicles. While the performance of these approaches warrants real-world adoption, these policies lack interpretability, limiting deployability in the safety-critical and legally-regulated domain of autonomous driving (AD). AD requires interpretable and verifiable control po… ▽ More

    Submitted 31 July, 2023; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: Robotics Science and Systems 2022

  29. arXiv:2201.08484  [pdf, other

    cs.MA cs.AI cs.LG cs.RO

    Iterated Reasoning with Mutual Information in Cooperative and Byzantine Decentralized Teaming

    Authors: Sachin Konan, Esmaeil Seraj, Matthew Gombolay

    Abstract: Information sharing is key in building team cognition and enables coordination and cooperation. High-performing human teams also benefit from acting strategically with hierarchical levels of iterated communication and rationalizability, meaning a human agent can reason about the actions of their teammates in their decision-making. Yet, the majority of prior work in Multi-Agent Reinforcement Learni… ▽ More

    Submitted 24 June, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: The first two authors contributed equally to this work (Published in ICLR 2022)

    Journal ref: International Conference on Learning Representations 2022

  30. arXiv:2110.04647  [pdf, other

    cs.LG cs.CL

    Learning to Follow Language Instructions with Compositional Policies

    Authors: Vanya Cohen, Geraud Nangue Tasse, Nakul Gopalan, Steven James, Matthew Gombolay, Benjamin Rosman

    Abstract: We propose a framework that learns to execute natural language instructions in an environment consisting of goal-reaching tasks that share components of their task descriptions. Our approach leverages the compositionality of both value functions and language, with the aim of reducing the sample complexity of learning novel tasks. First, we train a reinforcement learning agent to learn value functi… ▽ More

    Submitted 9 October, 2021; originally announced October 2021.

    Comments: Presented at AI-HRI symposium as part of AAAI-FSS 2021 (arXiv:2109.10836)

    Report number: AIHRI/2021/53

  31. arXiv:2110.04347  [pdf, other

    cs.RO cs.LG

    Towards Sample-efficient Apprenticeship Learning from Suboptimal Demonstration

    Authors: Letian Chen, Rohan Paleja, Matthew Gombolay

    Abstract: Learning from Demonstration (LfD) seeks to democratize robotics by enabling non-roboticist end-users to teach robots to perform novel tasks by providing demonstrations. However, as demonstrators are typically non-experts, modern LfD techniques are unable to produce policies much better than the suboptimal demonstration. A previously-proposed framework, SSRR, has shown success in learning from subo… ▽ More

    Submitted 8 October, 2021; originally announced October 2021.

    Comments: Presented at AI-HRI symposium as part of AAAI-FSS 2021 (arXiv:2109.10836)

    Report number: AIHRI/2021/39

  32. arXiv:2110.03134  [pdf, other

    cs.RO

    Improving Robot-Centric Learning from Demonstration via Personalized Embeddings

    Authors: Mariah L. Schrum, Erin Hedlund, Matthew C. Gombolay

    Abstract: Learning from demonstration (LfD) techniques seek to enable novice users to teach robots novel tasks in the real world. However, prior work has shown that robot-centric LfD approaches, such as Dataset Aggregation (DAgger), do not perform well with human teachers. DAgger requires a human demonstrator to provide corrective feedback to the learner either in real-time, which can result in degraded per… ▽ More

    Submitted 6 October, 2021; originally announced October 2021.

    Comments: Presented at AI-HRI symposium as part of AAAI-FSS 2021 (arXiv:2109.10836)

    Report number: AIHRI/2021/29

  33. arXiv:2108.09568  [pdf, other

    cs.MA

    Heterogeneous Graph Attention Networks for Learning Diverse Communication

    Authors: Esmaeil Seraj, Zheyuan Wang, Rohan Paleja, Matthew Sklar, Anirudh Patel, Matthew Gombolay

    Abstract: Multi-agent teaming achieves better performance when there is communication among participating agents allowing them to coordinate their actions for maximizing shared utility. However, when collaborating a team of agents with different action and observation spaces, information sharing is not straightforward and requires customized communication protocols, depending on sender and receiver types. W… ▽ More

    Submitted 28 October, 2021; v1 submitted 21 August, 2021; originally announced August 2021.

  34. arXiv:2101.07140  [pdf, other

    cs.LG cs.AI cs.CL cs.RO

    Natural Language Specification of Reinforcement Learning Policies through Differentiable Decision Trees

    Authors: Pradyumna Tambwekar, Andrew Silva, Nakul Gopalan, Matthew Gombolay

    Abstract: Human-AI policy specification is a novel procedure we define in which humans can collaboratively warm-start a robot's reinforcement learning policy. This procedure is comprised of two steps; (1) Policy Specification, i.e. humans specifying the behavior they would like their companion robot to accomplish, and (2) Policy Optimization, i.e. the robot applying reinforcement learning to improve the ini… ▽ More

    Submitted 20 May, 2023; v1 submitted 18 January, 2021; originally announced January 2021.

  35. arXiv:2012.03898  [pdf, other

    cs.RO

    A Generalized Robotic Handwriting Learning System based on Dynamic Movement Primitives (DMPs)

    Authors: Qian Luo, **g Wu, Matthew Gombolay

    Abstract: Learning from demonstration (LfD) is a powerful learning method to enable a robot to infer how to perform a task given one or more human demonstrations of the desired task. By learning from end-user demonstration rather than requiring that a domain expert manually programming each skill, robots can more readily be applied to a wider range of real-world applications. Writing robots, as one applicat… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

  36. arXiv:2012.01685  [pdf, other

    cs.LG

    Cross-Loss Influence Functions to Explain Deep Network Representations

    Authors: Andrew Silva, Rohit Chopra, Matthew Gombolay

    Abstract: As machine learning is increasingly deployed in the real world, it is paramount that we develop the tools necessary to analyze the decision-making of the models we train and deploy to end-users. Recently, researchers have shown that influence functions, a statistical measure of sample impact, can approximate the effects of training samples on classification accuracy for deep neural networks. Howev… ▽ More

    Submitted 3 May, 2022; v1 submitted 2 December, 2020; originally announced December 2020.

  37. arXiv:2011.00165  [pdf, other

    cs.RO cs.AI cs.HC cs.LG cs.MA

    FireCommander: An Interactive, Probabilistic Multi-agent Environment for Heterogeneous Robot Teams

    Authors: Esmaeil Seraj, Xiyang Wu, Matthew Gombolay

    Abstract: The purpose of this tutorial is to help individuals use the \underline{FireCommander} game environment for research applications. The FireCommander is an interactive, probabilistic joint perception-action reconnaissance environment in which a composite team of agents (e.g., robots) cooperate to fight dynamic, propagating firespots (e.g., targets). In FireCommander game, a team of agents must be ta… ▽ More

    Submitted 27 October, 2021; v1 submitted 30 October, 2020; originally announced November 2020.

  38. arXiv:2010.11723  [pdf, other

    cs.RO cs.LG

    Learning from Suboptimal Demonstration via Self-Supervised Reward Regression

    Authors: Letian Chen, Rohan Paleja, Matthew Gombolay

    Abstract: Learning from Demonstration (LfD) seeks to democratize robotics by enabling non-roboticist end-users to teach robots to perform a task by providing a human demonstration. However, modern LfD techniques, e.g. inverse reinforcement learning (IRL), assume users provide at least stochastically optimal demonstrations. This assumption fails to hold in most real-world scenarios. Recent attempts to learn… ▽ More

    Submitted 23 November, 2020; v1 submitted 17 October, 2020; originally announced October 2020.

    Comments: In Proceedings of the Conference on Robot Learning (CoRL '20)

  39. arXiv:2007.03742  [pdf, other

    cs.LG stat.ML

    Meta-active Learning in Probabilistically-Safe Optimization

    Authors: Mariah L. Schrum, Mark Connolly, Eric Cole, Mihir Ghetiya, Robert Gross, Matthew C. Gombolay

    Abstract: Learning to control a safety-critical system with latent dynamics (e.g. for deep brain stimulation) requires taking calculated risks to gain information as efficiently as possible. To address this problem, we present a probabilistically-safe, meta-active learning approach to efficiently learn system dynamics and optimal configurations. We cast this problem as meta-learning an acquisition function,… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: 9 pages

  40. arXiv:2007.01921  [pdf, other

    cs.RO cs.AI cs.HC eess.SY

    Human-Robot Team Coordination with Dynamic and Latent Human Task Proficiencies: Scheduling with Learning Curves

    Authors: Ruisen Liu, Manisha Natarajan, Matthew Gombolay

    Abstract: As robots become ubiquitous in the workforce, it is essential that human-robot collaboration be both intuitive and adaptive. A robot's quality improves based on its ability to explicitly reason about the time-varying (i.e. learning curves) and stochastic capabilities of its human counterparts, and adjust the joint workload to improve efficiency while factoring human preferences. We introduce a nov… ▽ More

    Submitted 8 July, 2020; v1 submitted 3 July, 2020; originally announced July 2020.

  41. arXiv:2006.07969  [pdf, other

    eess.SY cs.MA cs.RO eess.SP

    Coordinated Control of UAVs for Human-Centered Active Sensing of Wildfires

    Authors: Esmaeil Seraj, Matthew Gombolay

    Abstract: Fighting wildfires is a precarious task, imperiling the lives of engaging firefighters and those who reside in the fire's path. Firefighters need online and dynamic observation of the firefront to anticipate a wildfire's unknown characteristics, such as size, scale, and propagation velocity, and to plan accordingly. In this paper, we propose a distributed control framework to coordinate a team of… ▽ More

    Submitted 14 June, 2020; originally announced June 2020.

  42. Heterogeneous Learning from Demonstration

    Authors: Rohan Paleja, Matthew Gombolay

    Abstract: The development of human-robot systems able to leverage the strengths of both humans and their robotic counterparts has been greatly sought after because of the foreseen, broad-ranging impact across industry and research. We believe the true potential of these systems cannot be reached unless the robot is able to act with a high level of autonomy, reducing the burden of manual tasking or teleopera… ▽ More

    Submitted 14 April, 2020; v1 submitted 26 January, 2020; originally announced January 2020.

    Journal ref: 2019 14th Human-Robot Interaction (HRI) Pioneers Workshop

  43. arXiv:2001.03231  [pdf, other

    cs.HC cs.RO stat.AP stat.ME

    Four Years in Review: Statistical Practices of Likert Scales in Human-Robot Interaction Studies

    Authors: Mariah L. Schrum, Michael Johnson, Muyleng Ghuy, Matthew C. Gombolay

    Abstract: As robots become more prevalent, the importance of the field of human-robot interaction (HRI) grows accordingly. As such, we should endeavor to employ the best statistical practices. Likert scales are commonly used metrics in HRI to measure perceptions and attitudes. Due to misinformation or honest mistakes, most HRI researchers do not adopt best practices when analyzing Likert data. We conduct a… ▽ More

    Submitted 30 January, 2020; v1 submitted 9 January, 2020; originally announced January 2020.

  44. arXiv:2001.00503  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Joint Goal and Strategy Inference across Heterogeneous Demonstrators via Reward Network Distillation

    Authors: Letian Chen, Rohan Paleja, Muyleng Ghuy, Matthew Gombolay

    Abstract: Reinforcement learning (RL) has achieved tremendous success as a general framework for learning how to make decisions. However, this success relies on the interactive hand-tuning of a reward function by RL experts. On the other hand, inverse reinforcement learning (IRL) seeks to learn a reward function from readily-obtained human demonstrations. Yet, IRL suffers from two major limitations: 1) rewa… ▽ More

    Submitted 23 November, 2020; v1 submitted 2 January, 2020; originally announced January 2020.

    Comments: In Proceedings of the 2020 ACM/IEEE In-ternational Conference on Human-Robot Interaction (HRI '20), March 23 to 26, 2020, Cambridge, United Kingdom.ACM, New York, NY, USA, 10 pages

  45. arXiv:1912.08116  [pdf, other

    cs.RO cs.NE

    When Your Robot Breaks: Active Learning During Plant Failure

    Authors: Mariah Schrum, Matthew Gombolay

    Abstract: Detecting and adapting to catastrophic failures in robotic systems requires a robot to learn its new dynamics quickly and safely to best accomplish its goals. To address this challenging problem, we propose probabilistically-safe, online learning techniques to infer the altered dynamics of a robot at the moment a failure (e.g., physical damage) occurs. We combine model predictive control and activ… ▽ More

    Submitted 17 December, 2019; originally announced December 2019.

  46. arXiv:1912.02059  [pdf, other

    cs.RO cs.AI cs.LG

    Learning to Dynamically Coordinate Multi-Robot Teams in Graph Attention Networks

    Authors: Zheyuan Wang, Matthew Gombolay

    Abstract: Increasing interest in integrating advanced robotics within manufacturing has spurred a renewed concentration in develo** real-time scheduling solutions to coordinate human-robot collaboration in this environment. Traditionally, the problem of scheduling agents to complete tasks with temporal and spatial constraints has been approached either with exact algorithms, which are computationally intr… ▽ More

    Submitted 28 June, 2020; v1 submitted 4 December, 2019; originally announced December 2019.

    Comments: This paper has been extended to an article in IEEE Robotics and Automation Letters (DOI: 10.1109/LRA.2020.3002198)

  47. arXiv:1906.06397  [pdf, other

    cs.LG cs.AI stat.ML

    Interpretable and Personalized Apprenticeship Scheduling: Learning Interpretable Scheduling Policies from Heterogeneous User Demonstrations

    Authors: Rohan Paleja, Andrew Silva, Letian Chen, Matthew Gombolay

    Abstract: Resource scheduling and coordination is an NP-hard optimization requiring an efficient allocation of agents to a set of tasks with upper- and lower bound temporal and resource constraints. Due to the large-scale and dynamic nature of resource coordination in hospitals and factories, human domain experts manually plan and adjust schedules on the fly. To perform this job, domain experts leverage het… ▽ More

    Submitted 7 December, 2021; v1 submitted 14 June, 2019; originally announced June 2019.

    Journal ref: Proceedings of the 34th International Conference on Neural Information Processing Systems 2020, 6417-6428

  48. arXiv:1903.09338  [pdf, other

    cs.LG stat.ML

    Optimization Methods for Interpretable Differentiable Decision Trees in Reinforcement Learning

    Authors: Andrew Silva, Taylor Killian, Ivan Dario Jimenez Rodriguez, Sung-Hyun Son, Matthew Gombolay

    Abstract: Decision trees are ubiquitous in machine learning for their ease of use and interpretability. Yet, these models are not typically employed in reinforcement learning as they cannot be updated online via stochastic gradient descent. We overcome this limitation by allowing for a gradient update over the entire tree that improves sample complexity affords interpretable policy extraction. First, we inc… ▽ More

    Submitted 25 June, 2020; v1 submitted 21 March, 2019; originally announced March 2019.

    Journal ref: Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics 2020, 1855-1865

  49. arXiv:1903.06847  [pdf, other

    cs.RO cs.AI cs.HC cs.MA

    Safe Coordination of Human-Robot Firefighting Teams

    Authors: Esmaeil Seraj, Andrew Silva, Matthew Gombolay

    Abstract: Wildfires are destructive and inflict massive, irreversible harm to victims' lives and natural resources. Researchers have proposed commissioning unmanned aerial vehicles (UAVs) to provide firefighters with real-time tracking information; yet, these UAVs are not able to reason about a fire's track, including current location, measurement, and uncertainty, as well as propagation. We propose a model… ▽ More

    Submitted 15 March, 2019; originally announced March 2019.

  50. arXiv:1903.06047  [pdf, other

    cs.LG cs.AI cs.HC stat.ML

    Inferring Personalized Bayesian Embeddings for Learning from Heterogeneous Demonstration

    Authors: Rohan Paleja, Matthew Gombolay

    Abstract: For assistive robots and virtual agents to achieve ubiquity, machines will need to anticipate the needs of their human counterparts. The field of Learning from Demonstration (LfD) has sought to enable machines to infer predictive models of human behavior for autonomous robot control. However, humans exhibit heterogeneity in decision-making, which traditional LfD approaches fail to capture. To over… ▽ More

    Submitted 14 March, 2019; originally announced March 2019.

    Comments: 8 Pages, 7 figures