Skip to main content

Showing 1–8 of 8 results for author: Chuck, C

.
  1. arXiv:2406.08805  [pdf, other

    cs.LG cs.AI cs.RO

    A Dual Approach to Imitation Learning from Observations with Offline Datasets

    Authors: Harshit Sikchi, Caleb Chuck, Amy Zhang, Scott Niekum

    Abstract: Demonstrations are an effective alternative to task specification for learning agents in settings where designing a reward function is difficult. However, demonstrating expert behavior in the action space of the agent becomes unwieldy when robots have complex, unintuitive morphologies. We consider the practical setting where an agent has a dataset of prior interactions with the environment and is… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Under submission. 23 pages

  2. arXiv:2405.03113  [pdf, other

    cs.RO cs.AI

    Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning

    Authors: Caleb Chuck, Carl Qi, Michael J. Munje, Shuozhe Li, Max Rudolph, Chang Shi, Siddhant Agarwal, Harshit Sikchi, Abhinav Peri, Sarthak Dayal, Evan Kuo, Kavan Mehta, Anthony Wang, Peter Stone, Amy Zhang, Scott Niekum

    Abstract: Reinforcement Learning is a promising tool for learning complex policies even in fast-moving and object-interactive domains where human teleoperation or hard-coded policies might fail. To effectively reflect this challenging category of tasks, we introduce a dynamic, interactive RL testbed based on robot air hockey. By augmenting air hockey with a large family of tasks ranging from easy tasks like… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  3. arXiv:2404.10883  [pdf, other

    cs.AI cs.LG stat.ME

    Automated Discovery of Functional Actual Causes in Complex Environments

    Authors: Caleb Chuck, Sankaran Vaidyanathan, Stephen Giguere, Amy Zhang, David Jensen, Scott Niekum

    Abstract: Reinforcement learning (RL) algorithms often struggle to learn policies that generalize to novel situations due to issues such as causal confusion, overfitting to irrelevant factors, and failure to isolate control of state factors. These issues stem from a common source: a failure to accurately identify and exploit state-specific causal relationships in the environment. While some prior works in R… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  4. arXiv:2403.16369  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Action-based Representations Using Invariance

    Authors: Max Rudolph, Caleb Chuck, Kevin Black, Misha Lvovsky, Scott Niekum, Amy Zhang

    Abstract: Robust reinforcement learning agents using high-dimensional observations must be able to identify relevant state features amidst many exogeneous distractors. A representation that captures controllability identifies these state elements by determining what affects agent control. While methods such as inverse dynamics and mutual information capture controllability for a limited number of timesteps,… ▽ More

    Submitted 24 June, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

    Comments: Published at the Reinforcement Learning Conference 2024

  5. arXiv:2306.09509  [pdf, other

    cs.AI cs.RO

    Granger-Causal Hierarchical Skill Discovery

    Authors: Caleb Chuck, Kevin Black, Aditya Arjun, Yuke Zhu, Scott Niekum

    Abstract: Reinforcement Learning (RL) has demonstrated promising results in learning policies for complex tasks, but it often suffers from low sample efficiency and limited transferability. Hierarchical RL (HRL) methods aim to address the difficulty of learning long-horizon tasks by decomposing policies into skills, abstracting states, and reusing skills in new tasks. However, many HRL methods require some… ▽ More

    Submitted 18 March, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted TMLR 2024

  6. arXiv:2008.10518  [pdf, other

    cs.RO cs.AI

    ScrewNet: Category-Independent Articulation Model Estimation From Depth Images Using Screw Theory

    Authors: A**kya Jain, Rudolf Lioutikov, Caleb Chuck, Scott Niekum

    Abstract: Robots in human environments will need to interact with a wide variety of articulated objects such as cabinets, drawers, and dishwashers while assisting humans in performing day-to-day tasks. Existing methods either require objects to be textured or need to know the articulation model category a priori for estimating the model parameters for an articulated object. We propose ScrewNet, a novel appr… ▽ More

    Submitted 19 July, 2021; v1 submitted 24 August, 2020; originally announced August 2020.

    Comments: Presented at ICRA'21. Project webpage: https://pearl-utexas.github.io/ScrewNet/

  7. arXiv:1906.01408  [pdf, other

    cs.LG cs.AI stat.ML

    Hypothesis-Driven Skill Discovery for Hierarchical Deep Reinforcement Learning

    Authors: Caleb Chuck, Supawit Chockchowwat, Scott Niekum

    Abstract: Deep reinforcement learning (DRL) is capable of learning high-performing policies on a variety of complex high-dimensional tasks, ranging from video games to robotic manipulation. However, standard DRL methods often suffer from poor sample efficiency, partially because they aim to be entirely problem-agnostic. In this work, we introduce a novel approach to exploration and hierarchical skill learni… ▽ More

    Submitted 3 March, 2020; v1 submitted 27 May, 2019; originally announced June 2019.

    Comments: Submitted to IROS 2020

  8. arXiv:1610.00850  [pdf, other

    cs.RO cs.LG

    Comparing Human-Centric and Robot-Centric Sampling for Robot Deep Learning from Demonstrations

    Authors: Michael Laskey, Caleb Chuck, Jonathan Lee, Jeffrey Mahler, Sanjay Krishnan, Kevin Jamieson, Anca Dragan, Ken Goldberg

    Abstract: Motivated by recent advances in Deep Learning for robot control, this paper considers two learning algorithms in terms of how they acquire demonstrations. "Human-Centric" (HC) sampling is the standard supervised learning algorithm, where a human supervisor demonstrates the task by teleoperating the robot to provide trajectories consisting of state-control pairs. "Robot-Centric" (RC) sampling is an… ▽ More

    Submitted 28 March, 2017; v1 submitted 4 October, 2016; originally announced October 2016.

    Comments: Submitted to International Conference on Robotics and Automation (ICRA) 2017