Skip to main content

Showing 1–3 of 3 results for author: Guadarrama, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2007.12401  [pdf, other

    cs.LG cs.AI cs.IT cs.RO stat.ML

    Predictive Information Accelerates Learning in RL

    Authors: Kuang-Huei Lee, Ian Fischer, Anthony Liu, Yijie Guo, Honglak Lee, John Canny, Sergio Guadarrama

    Abstract: The Predictive Information is the mutual information between the past and the future, I(X_past; X_future). We hypothesize that capturing the predictive information is useful in RL, since the ability to model what will happen next is necessary for success on many tasks. To test our hypothesis, we train Soft Actor-Critic (SAC) agents from pixels with an auxiliary task that learns a compressed repres… ▽ More

    Submitted 25 October, 2020; v1 submitted 24 July, 2020; originally announced July 2020.

    Comments: To appear at NeurIPS 2020

  2. arXiv:1912.05663  [pdf, other

    stat.ML cs.AI cs.LG

    Measuring the Reliability of Reinforcement Learning Algorithms

    Authors: Stephanie C. Y. Chan, Samuel Fishman, John Canny, Anoop Korattikara, Sergio Guadarrama

    Abstract: Lack of reliability is a well-known issue for reinforcement learning (RL) algorithms. This problem has gained increasing attention in recent years, and efforts to improve it have grown substantially. To aid RL researchers and production users with the evaluation and improvement of reliability, we propose a set of metrics that quantitatively measure different aspects of reliability. In this work, w… ▽ More

    Submitted 12 February, 2020; v1 submitted 10 December, 2019; originally announced December 2019.

    Comments: Accepted for publication at ICLR 2020 (spotlight)

  3. arXiv:1902.07742  [pdf, other

    cs.LG stat.ML

    From Language to Goals: Inverse Reinforcement Learning for Vision-Based Instruction Following

    Authors: Justin Fu, Anoop Korattikara, Sergey Levine, Sergio Guadarrama

    Abstract: Reinforcement learning is a promising framework for solving control problems, but its use in practical situations is hampered by the fact that reward functions are often difficult to engineer. Specifying goals and tasks for autonomous machines, such as robots, is a significant challenge: conventionally, reward functions and goal states have been used to communicate objectives. But people can commu… ▽ More

    Submitted 20 February, 2019; originally announced February 2019.