Showing 1–5 of 5 results for author: He, J Z

  1. arXiv:2405.01768

    cs.CL cs.AI

    CoS: Enhancing Personalization and Mitigating Bias with Context Steering

    Authors: Jerry Zhi-Yang He, Sashrika Pandey, Mariah L. Schrum, Anca Dragan

    Abstract: When querying a large language model (LLM), the context, i.e. personal, demographic, and cultural information specific to an end-user, can significantly shape the response of the LLM. For example, asking the model to explain Newton's second law with the context "I am a toddler" yields a different answer compared to the context "I am a physics professor." Proper usage of the context enables the LLM…

    Submitted 2 May, 2024; originally announced May 2024.

  2. arXiv:2310.10610

    cs.AI cs.LG cs.RO

    Quantifying Assistive Robustness Via the Natural-Adversarial Frontier

    Authors: Jerry Zhi-Yang He, Zackory Erickson, Daniel S. Brown, Anca D. Dragan

    Abstract: Our ultimate goal is to build robust policies for robots that assist people. What makes this hard is that people can behave unexpectedly at test time, potentially interacting with the robot outside its training distribution and leading to failures. Even just measuring robustness is a challenge. Adversarial perturbations are the default, but they can paint the wrong picture: they can correspond to…

    Submitted 16 October, 2023; originally announced October 2023.

  3. arXiv:2212.03175

    cs.LG cs.AI cs.RO

    Learning Representations that Enable Generalization in Assistive Tasks

    Authors: Jerry Zhi-Yang He, Aditi Raghunathan, Daniel S. Brown, Zackory Erickson, Anca D. Dragan

    Abstract: Recent work in sim2real has successfully enabled robots to act in physical environments by training in simulation with a diverse "population" of environments (i.e. domain randomization). In this work, we focus on enabling generalization in assistive tasks: tasks in which the robot is acting to assist a user (e.g. helping someone with motor impairments with bathing or with scratching an itch). Su…

    Submitted 5 December, 2022; originally announced December 2022.

  4. arXiv:2204.06601

    cs.LG cs.RO

    Causal Confusion and Reward Misidentification in Preference-Based Reward Learning

    Authors: Jeremy Tien, Jerry Zhi-Yang He, Zackory Erickson, Anca D. Dragan, Daniel S. Brown

    Abstract: Learning policies via preference-based reward learning is an increasingly popular method for customizing agent behavior, but has been shown anecdotally to be prone to spurious correlations and reward hacking behaviors. While much prior work focuses on causal confusion in reinforcement learning and behavioral cloning, we focus on a systematic study of causal confusion and reward misidentification w…

    Submitted 18 March, 2023; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: In the proceedings of the Eleventh International Conference on Learning Representations (ICLR 2023). https://iclr.cc/virtual/2023/poster/10822

  5. arXiv:2111.09884

    cs.RO cs.AI cs.LG

    Assisted Robust Reward Design

    Authors: Jerry Zhi-Yang He, Anca D. Dragan

    Abstract: Real-world robotic tasks require complex reward functions. When we define the problem the robot needs to solve, we pretend that a designer specifies this complex reward exactly, and it is set in stone from then on. In practice, however, reward design is an iterative process: the designer chooses a reward, eventually encounters an "edge-case" environment where the reward incentivizes the wrong beha…

    Submitted 18 November, 2021; originally announced November 2021.

    Comments: 5th Conference on Robot Learning (CoRL 2021)