Skip to main content

Showing 1–19 of 19 results for author: Paleja, R

.
  1. arXiv:2407.02632  [pdf, other

    cs.HC cs.FL

    STL: Still Tricky Logic (for System Validation, Even When Showing Your Work)

    Authors: Isabelle Hurley, Rohan Paleja, Ashley Suh, Jaime D. Peña, Ho Chit Siu

    Abstract: As learned control policies become increasingly common in autonomous systems, there is increasing need to ensure that they are interpretable and can be checked by human stakeholders. Formal specifications have been proposed as ways to produce human-interpretable policies for autonomous systems that can still be learned from examples. Previous work showed that despite claims of interpretability, hu… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2406.05003  [pdf, other

    cs.RO cs.HC

    Designs for Enabling Collaboration in Human-Machine Teaming via Interactive and Explainable Systems

    Authors: Rohan Paleja, Michael Munje, Kimberlee Chang, Reed Jensen, Matthew Gombolay

    Abstract: Collaborative robots and machine learning-based virtual agents are increasingly entering the human workspace with the aim of increasing productivity and enhancing safety. Despite this, we show in a ubiquitous experimental domain, Overcooked-AI, that state-of-the-art techniques for human-machine teaming (HMT), which rely on imitation or reinforcement learning, are brittle and result in a machine ag… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2406.02018  [pdf, other

    cs.CL cs.AI cs.HC

    Why Would You Suggest That? Human Trust in Language Model Responses

    Authors: Manasi Sharma, Ho Chit Siu, Rohan Paleja, Jaime D. Peña

    Abstract: The emergence of Large Language Models (LLMs) has revealed a growing need for human-AI collaboration, especially in creative decision-making scenarios where trust and reliance are paramount. Through human studies and model evaluations on the open-ended News Headline Generation task from the LaMP benchmark, we analyze how the framing and presence of explanations affect user trust and model performa… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  4. arXiv:2311.10041  [pdf, other

    cs.RO

    Interpretable Reinforcement Learning for Robotics and Continuous Control

    Authors: Rohan Paleja, Letian Chen, Yaru Niu, Andrew Silva, Zhaoxin Li, Songan Zhang, Chace Ritchie, Sugju Choi, Kimberlee Chestnut Chang, Hongtei Eric Tseng, Yan Wang, Subramanya Nageshrao, Matthew Gombolay

    Abstract: Interpretability in machine learning is critical for the safe deployment of learned policies across legally-regulated and safety-critical domains. While gradient-based approaches in reinforcement learning have achieved tremendous success in learning policies for continuous control problems such as robotics and autonomous driving, the lack of interpretability is a fundamental barrier to adoption. W… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2202.02352

  5. arXiv:2306.11301  [pdf, other

    cs.LG cs.AI cs.RO

    Adversarial Search and Tracking with Multiagent Reinforcement Learning in Sparsely Observable Environment

    Authors: Zixuan Wu, Sean Ye, Manisha Natarajan, Letian Chen, Rohan Paleja, Matthew C. Gombolay

    Abstract: We study a search and tracking (S&T) problem where a team of dynamic search agents must collaborate to track an adversarial, evasive agent. The heterogeneous search team may only have access to a limited number of past adversary trajectories within a large search space. This problem is challenging for both model-based searching and reinforcement learning (RL) methods since the adversary exhibits r… ▽ More

    Submitted 20 October, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: Accepted by IEEE International Symposium on Multi-Robot & Multi-Agent Systems (MRS) 2023

  6. arXiv:2306.11168  [pdf, other

    cs.LG cs.AI cs.MA

    Learning Models of Adversarial Agent Behavior under Partial Observability

    Authors: Sean Ye, Manisha Natarajan, Zixuan Wu, Rohan Paleja, Letian Chen, Matthew C. Gombolay

    Abstract: The need for opponent modeling and tracking arises in several real-world scenarios, such as professional sports, video game design, and drug-trafficking interdiction. In this work, we present Graph based Adversarial Modeling with Mutal Information (GrAMMI) for modeling the behavior of an adversarial opponent agent. GrAMMI is a novel graph neural network (GNN) based approach that uses mutual inform… ▽ More

    Submitted 5 July, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 8 pages, 3 figures, 2 tables

  7. The Effect of Robot Skill Level and Communication in Rapid, Proximate Human-Robot Collaboration

    Authors: Kin Man Lee, Arjun Krishna, Zulfiqar Zaidi, Rohan Paleja, Letian Chen, Erin Hedlund-Botti, Mariah Schrum, Matthew Gombolay

    Abstract: As high-speed, agile robots become more commonplace, these robots will have the potential to better aid and collaborate with humans. However, due to the increased agility and functionality of these robots, close collaboration with humans can create safety concerns that alter team dynamics and degrade task performance. In this work, we aim to enable the deployment of safe and trustworthy agile robo… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Journal ref: HRI '23: Proceedings of the 2023 ACM/IEEE International Conference on Human-Robot Interaction

  8. arXiv:2212.14403  [pdf, other

    cs.RO

    Utilizing Human Feedback for Primitive Optimization in Wheelchair Tennis

    Authors: Arjun Krishna, Zulfiqar Zaidi, Letian Chen, Rohan Paleja, Esmaeil Seraj, Matthew Gombolay

    Abstract: Agile robotics presents a difficult challenge with robots moving at high speeds requiring precise and low-latency sensing and control. Creating agile motion that accomplishes the task at hand while being safe to execute is a key requirement for agile robots to gain human trust. This requires designing new approaches that are flexible and maintain knowledge over world constraints. In this paper, we… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

    Comments: Workshop paper at Learning for Agile Robotics Workshop, CoRL 2022

  9. arXiv:2210.02517  [pdf, other

    cs.RO

    Athletic Mobile Manipulator System for Robotic Wheelchair Tennis

    Authors: Zulfiqar Zaidi, Daniel Martin, Nathaniel Belles, Viacheslav Zakharov, Arjun Krishna, Kin Man Lee, Peter Wagstaff, Sumedh Naik, Matthew Sklar, Sugju Choi, Yoshiki Kakehi, Ruturaj Patil, Divya Mallemadugula, Florian Pesce, Peter Wilson, Wendell Hom, Matan Diamond, Bryan Zhao, Nina Moorman, Rohan Paleja, Letian Chen, Esmaeil Seraj, Matthew Gombolay

    Abstract: Athletics are a quintessential and universal expression of humanity. From French monks who in the 12th century invented jeu de paume, the precursor to modern lawn tennis, back to the K'iche' people who played the Maya Ballgame as a form of religious expression over three thousand years ago, humans have sought to train their minds and bodies to excel in sporting contests. Advances in robotics are o… ▽ More

    Submitted 7 February, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: 8 pages, accepted at RA-L, will also be presented at IROS 2023

  10. arXiv:2209.11908  [pdf, other

    cs.LG cs.RO

    Fast Lifelong Adaptive Inverse Reinforcement Learning from Demonstrations

    Authors: Letian Chen, Sravan Jayanthi, Rohan Paleja, Daniel Martin, Viacheslav Zakharov, Matthew Gombolay

    Abstract: Learning from Demonstration (LfD) approaches empower end-users to teach robots novel tasks via demonstrations of the desired behaviors, democratizing access to robotics. However, current LfD frameworks are not capable of fast adaptation to heterogeneous human demonstrations nor the large-scale deployment in ubiquitous robotics applications. In this paper, we propose a novel LfD framework, Fast Lif… ▽ More

    Submitted 12 April, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Journal ref: Proceedings of Conference on Robot Learning (CoRL) 2022

  11. arXiv:2209.03943  [pdf, other

    cs.AI cs.HC

    The Utility of Explainable AI in Ad Hoc Human-Machine Teaming

    Authors: Rohan Paleja, Muyleng Ghuy, Nadun Ranawaka Arachchige, Reed Jensen, Matthew Gombolay

    Abstract: Recent advances in machine learning have led to growing interest in Explainable AI (xAI) to enable humans to gain insight into the decision-making of machine learning models. Despite this recent interest, the utility of xAI techniques has not yet been characterized in human-machine teaming. Importantly, xAI offers the promise of enhancing team situational awareness (SA) and shared mental model dev… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

    Comments: Part of Advances in Neural Information Processing Systems 34 (NeurIPS 2021)

  12. arXiv:2202.02352  [pdf, other

    cs.LG cs.RO

    Learning Interpretable, High-Performing Policies for Autonomous Driving

    Authors: Rohan Paleja, Yaru Niu, Andrew Silva, Chace Ritchie, Sugju Choi, Matthew Gombolay

    Abstract: Gradient-based approaches in reinforcement learning (RL) have achieved tremendous success in learning policies for autonomous vehicles. While the performance of these approaches warrants real-world adoption, these policies lack interpretability, limiting deployability in the safety-critical and legally-regulated domain of autonomous driving (AD). AD requires interpretable and verifiable control po… ▽ More

    Submitted 31 July, 2023; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: Robotics Science and Systems 2022

  13. arXiv:2110.04347  [pdf, other

    cs.RO cs.LG

    Towards Sample-efficient Apprenticeship Learning from Suboptimal Demonstration

    Authors: Letian Chen, Rohan Paleja, Matthew Gombolay

    Abstract: Learning from Demonstration (LfD) seeks to democratize robotics by enabling non-roboticist end-users to teach robots to perform novel tasks by providing demonstrations. However, as demonstrators are typically non-experts, modern LfD techniques are unable to produce policies much better than the suboptimal demonstration. A previously-proposed framework, SSRR, has shown success in learning from subo… ▽ More

    Submitted 8 October, 2021; originally announced October 2021.

    Comments: Presented at AI-HRI symposium as part of AAAI-FSS 2021 (arXiv:2109.10836)

    Report number: AIHRI/2021/39

  14. arXiv:2108.09568  [pdf, other

    cs.MA

    Heterogeneous Graph Attention Networks for Learning Diverse Communication

    Authors: Esmaeil Seraj, Zheyuan Wang, Rohan Paleja, Matthew Sklar, Anirudh Patel, Matthew Gombolay

    Abstract: Multi-agent teaming achieves better performance when there is communication among participating agents allowing them to coordinate their actions for maximizing shared utility. However, when collaborating a team of agents with different action and observation spaces, information sharing is not straightforward and requires customized communication protocols, depending on sender and receiver types. W… ▽ More

    Submitted 28 October, 2021; v1 submitted 21 August, 2021; originally announced August 2021.

  15. arXiv:2010.11723  [pdf, other

    cs.RO cs.LG

    Learning from Suboptimal Demonstration via Self-Supervised Reward Regression

    Authors: Letian Chen, Rohan Paleja, Matthew Gombolay

    Abstract: Learning from Demonstration (LfD) seeks to democratize robotics by enabling non-roboticist end-users to teach robots to perform a task by providing a human demonstration. However, modern LfD techniques, e.g. inverse reinforcement learning (IRL), assume users provide at least stochastically optimal demonstrations. This assumption fails to hold in most real-world scenarios. Recent attempts to learn… ▽ More

    Submitted 23 November, 2020; v1 submitted 17 October, 2020; originally announced October 2020.

    Comments: In Proceedings of the Conference on Robot Learning (CoRL '20)

  16. Heterogeneous Learning from Demonstration

    Authors: Rohan Paleja, Matthew Gombolay

    Abstract: The development of human-robot systems able to leverage the strengths of both humans and their robotic counterparts has been greatly sought after because of the foreseen, broad-ranging impact across industry and research. We believe the true potential of these systems cannot be reached unless the robot is able to act with a high level of autonomy, reducing the burden of manual tasking or teleopera… ▽ More

    Submitted 14 April, 2020; v1 submitted 26 January, 2020; originally announced January 2020.

    Journal ref: 2019 14th Human-Robot Interaction (HRI) Pioneers Workshop

  17. arXiv:2001.00503  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Joint Goal and Strategy Inference across Heterogeneous Demonstrators via Reward Network Distillation

    Authors: Letian Chen, Rohan Paleja, Muyleng Ghuy, Matthew Gombolay

    Abstract: Reinforcement learning (RL) has achieved tremendous success as a general framework for learning how to make decisions. However, this success relies on the interactive hand-tuning of a reward function by RL experts. On the other hand, inverse reinforcement learning (IRL) seeks to learn a reward function from readily-obtained human demonstrations. Yet, IRL suffers from two major limitations: 1) rewa… ▽ More

    Submitted 23 November, 2020; v1 submitted 2 January, 2020; originally announced January 2020.

    Comments: In Proceedings of the 2020 ACM/IEEE In-ternational Conference on Human-Robot Interaction (HRI '20), March 23 to 26, 2020, Cambridge, United Kingdom.ACM, New York, NY, USA, 10 pages

  18. arXiv:1906.06397  [pdf, other

    cs.LG cs.AI stat.ML

    Interpretable and Personalized Apprenticeship Scheduling: Learning Interpretable Scheduling Policies from Heterogeneous User Demonstrations

    Authors: Rohan Paleja, Andrew Silva, Letian Chen, Matthew Gombolay

    Abstract: Resource scheduling and coordination is an NP-hard optimization requiring an efficient allocation of agents to a set of tasks with upper- and lower bound temporal and resource constraints. Due to the large-scale and dynamic nature of resource coordination in hospitals and factories, human domain experts manually plan and adjust schedules on the fly. To perform this job, domain experts leverage het… ▽ More

    Submitted 7 December, 2021; v1 submitted 14 June, 2019; originally announced June 2019.

    Journal ref: Proceedings of the 34th International Conference on Neural Information Processing Systems 2020, 6417-6428

  19. arXiv:1903.06047  [pdf, other

    cs.LG cs.AI cs.HC stat.ML

    Inferring Personalized Bayesian Embeddings for Learning from Heterogeneous Demonstration

    Authors: Rohan Paleja, Matthew Gombolay

    Abstract: For assistive robots and virtual agents to achieve ubiquity, machines will need to anticipate the needs of their human counterparts. The field of Learning from Demonstration (LfD) has sought to enable machines to infer predictive models of human behavior for autonomous robot control. However, humans exhibit heterogeneity in decision-making, which traditional LfD approaches fail to capture. To over… ▽ More

    Submitted 14 March, 2019; originally announced March 2019.

    Comments: 8 Pages, 7 figures