Skip to main content

Showing 1–4 of 4 results for author: Adeniji, A

.
  1. arXiv:2308.12270  [pdf, other

    cs.LG cs.AI

    Language Reward Modulation for Pretraining Reinforcement Learning

    Authors: Ademi Adeniji, Amber Xie, Carmelo Sferrazza, Younggyo Seo, Stephen James, Pieter Abbeel

    Abstract: Using learned reward functions (LRFs) as a means to solve sparse-reward reinforcement learning (RL) tasks has yielded some steady progress in task-complexity through the years. In this work, we question whether today's LRFs are best-suited as a direct replacement for task rewards. Instead, we propose leveraging the capabilities of LRFs as a pretraining signal for RL. Concretely, we propose… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: Code available at https://github.com/ademiadeniji/lamp

  2. arXiv:2305.14343  [pdf, other

    cs.LG cs.AI cs.CV

    Video Prediction Models as Rewards for Reinforcement Learning

    Authors: Alejandro Escontrela, Ademi Adeniji, Wilson Yan, Ajay Jain, Xue Bin Peng, Ken Goldberg, Youngwoon Lee, Danijar Hafner, Pieter Abbeel

    Abstract: Specifying reward signals that allow agents to learn complex behaviors is a long-standing challenge in reinforcement learning. A promising approach is to extract preferences for behaviors from unlabeled videos, which are widely available on the internet. We present Video Prediction Rewards (VIPER), an algorithm that leverages pretrained video prediction models as action-free reward signals for rei… ▽ More

    Submitted 30 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 22 pages, 18 figures, 4 tables. under review

  3. arXiv:2210.07426  [pdf, other

    cs.LG cs.AI cs.RO

    Skill-Based Reinforcement Learning with Intrinsic Reward Matching

    Authors: Ademi Adeniji, Amber Xie, Pieter Abbeel

    Abstract: While unsupervised skill discovery has shown promise in autonomously acquiring behavioral primitives, there is still a large methodological disconnect between task-agnostic skill pretraining and downstream, task-aware finetuning. We present Intrinsic Reward Matching (IRM), which unifies these two phases of learning via the $\textit{skill discriminator}$, a pretraining model component often discard… ▽ More

    Submitted 25 May, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: 16 pages

  4. arXiv:1901.01994  [pdf, other

    cs.LG cs.AI stat.ML

    Recurrent Control Nets for Deep Reinforcement Learning

    Authors: Vincent Liu, Ademi Adeniji, Nathaniel Lee, Jason Zhao, Mario Srouji

    Abstract: Central Pattern Generators (CPGs) are biological neural circuits capable of producing coordinated rhythmic outputs in the absence of rhythmic input. As a result, they are responsible for most rhythmic motion in living organisms. This rhythmic control is broadly applicable to fields such as locomotive robotics and medical devices. In this paper, we explore the possibility of creating a self-sustain… ▽ More

    Submitted 17 January, 2019; v1 submitted 6 January, 2019; originally announced January 2019.