Showing 1–9 of 9 results for author: Escontrela, A

Searching in archive cs.
  1. arXiv:2312.11752  [pdf, other]

    cs.LG cs.AI

    Learning a Diffusion Model Policy from Rewards via Q-Score Matching

    Authors: Michael Psenka, Alejandro Escontrela, Pieter Abbeel, Yi Ma

    Abstract: Diffusion models have become a popular choice for representing actor policies in behavior cloning and offline reinforcement learning. This is due to their natural ability to optimize an expressive class of distributions over a continuous space. However, previous works fail to exploit the score-based structure of diffusion models, and instead utilize a simple behavior cloning term to train the acto…

    Submitted 14 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: To appear in ICML 2024. 20 pages, 9 figures

  2. arXiv:2305.14654  [pdf, other]

    cs.RO cs.AI

    Barkour: Benchmarking Animal-level Agility with Quadruped Robots

    Authors: Ken Caluwaerts, Atil Iscen, J. Chase Kew, Wenhao Yu, Tingnan Zhang, Daniel Freeman, Kuang-Huei Lee, Lisa Lee, Stefano Saliceti, Vincent Zhuang, Nathan Batchelor, Steven Bohez, Federico Casarini, Jose Enrique Chen, Omar Cortes, Erwin Coumans, Adil Dostmohamed, Gabriel Dulac-Arnold, Alejandro Escontrela, Erik Frey, Roland Hafner, Deepali Jain, Bauyrjan Jyenis, Yuheng Kuang, Edward Lee, et al. (19 additional authors not shown)

    Abstract: Animals have evolved various agile locomotion strategies, such as sprinting, leaping, and jumping. There is a growing interest in developing legged robots that move like their biological counterparts and show various agile skills to navigate complex environments quickly. Despite the interest, the field lacks systematic benchmarks to measure the performance of control policies and hardware in agili…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 17 pages, 19 figures

  3. arXiv:2305.14343  [pdf, other]

    cs.LG cs.AI cs.CV

    Video Prediction Models as Rewards for Reinforcement Learning

    Authors: Alejandro Escontrela, Ademi Adeniji, Wilson Yan, Ajay Jain, Xue Bin Peng, Ken Goldberg, Youngwoon Lee, Danijar Hafner, Pieter Abbeel

    Abstract: Specifying reward signals that allow agents to learn complex behaviors is a long-standing challenge in reinforcement learning. A promising approach is to extract preferences for behaviors from unlabeled videos, which are widely available on the internet. We present Video Prediction Rewards (VIPER), an algorithm that leverages pretrained video prediction models as action-free reward signals for rei…

    Submitted 30 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 22 pages, 18 figures, 4 tables. Under review

  4. arXiv:2207.07813  [pdf, other]

    cs.RO cs.AI

    Autonomously Untangling Long Cables

    Authors: Vainavi Viswanath, Kaushik Shivakumar, Justin Kerr, Brijen Thananjeyan, Ellen Novoseller, Jeffrey Ichnowski, Alejandro Escontrela, Michael Laskey, Joseph E. Gonzalez, Ken Goldberg

    Abstract: Cables are ubiquitous in many settings and it is often useful to untangle them. However, cables are prone to self-occlusions and knots, making them difficult to perceive and manipulate. The challenge increases with cable length: long cables require more complex slack management to facilitate observability and reachability. In this paper, we focus on autonomously untangling cables up to 3 meters in…

    Submitted 31 July, 2022; v1 submitted 15 July, 2022; originally announced July 2022.

  5. arXiv:2206.14176  [pdf, other]

    cs.RO cs.AI cs.LG

    DayDreamer: World Models for Physical Robot Learning

    Authors: Philipp Wu, Alejandro Escontrela, Danijar Hafner, Ken Goldberg, Pieter Abbeel

    Abstract: To solve tasks in complex environments, robots need to learn from experience. Deep reinforcement learning is a common approach to robot learning but requires a large amount of trial and error to learn, limiting its deployment in the physical world. As a consequence, many advances in robot learning rely on simulators. On the other hand, learning inside of simulators fails to capture the complexity…

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: Website: https://danijar.com/daydreamer

  6. arXiv:2203.15103  [pdf, other]

    cs.AI cs.RO

    Adversarial Motion Priors Make Good Substitutes for Complex Reward Functions

    Authors: Alejandro Escontrela, Xue Bin Peng, Wenhao Yu, Tingnan Zhang, Atil Iscen, Ken Goldberg, Pieter Abbeel

    Abstract: Training a high-dimensional simulated agent with an under-specified reward function often leads the agent to learn physically infeasible strategies that are ineffective when deployed in the real world. To mitigate these unnatural behaviors, reinforcement learning practitioners often utilize complex reward functions that encourage physically plausible behaviors. However, a tedious labor-intensive t…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: 8 pages, 6 figures, 3 tables

  7. arXiv:2011.06194  [pdf, other]

    cs.RO

    A Factor-Graph Approach for Optimization Problems with Dynamics Constraints

    Authors: Mandy Xie, Alejandro Escontrela, Frank Dellaert

    Abstract: In this paper, we introduce dynamics factor graphs as a graphical framework to solve dynamics problems and kinodynamic motion planning problems with full consideration of whole-body dynamics and contacts. A factor graph representation of dynamics problems provides an insightful visualization of their mathematical structure and can be used in conjunction with sparse nonlinear optimizers to solve ch…

    Submitted 10 November, 2020; originally announced November 2020.

    Comments: arXiv admin note: text overlap with arXiv:1911.10065

  8. arXiv:2011.05541  [pdf, other]

    cs.RO

    Learning Agile Locomotion Skills with a Mentor

    Authors: Atil Iscen, George Yu, Alejandro Escontrela, Deepali Jain, Jie Tan, Ken Caluwaerts

    Abstract: Developing agile behaviors for legged robots remains a challenging problem. While deep reinforcement learning is a promising approach, learning truly agile behaviors typically requires tedious reward shaping and careful curriculum design. We formulate agile locomotion as a multi-stage learning problem in which a mentor guides the agent throughout the training. The mentor is optimized to place a ch…

    Submitted 10 November, 2020; originally announced November 2020.

  9. arXiv:2011.05513  [pdf, other]

    cs.RO

    Zero-Shot Terrain Generalization for Visual Locomotion Policies

    Authors: Alejandro Escontrela, George Yu, Peng Xu, Atil Iscen, Jie Tan

    Abstract: Legged robots have unparalleled mobility on unstructured terrains. However, it remains an open challenge to design locomotion controllers that can operate in a large variety of environments. In this paper, we address this challenge of automatically learning locomotion controllers that can generalize to a diverse collection of terrains often encountered in the real world. We frame this challenge as…

    Submitted 10 November, 2020; originally announced November 2020.