Skip to main content

Showing 1–2 of 2 results for author: GX-Chen, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2208.12345  [pdf, other

    cs.LG cs.AI

    Light-weight probing of unsupervised representations for Reinforcement Learning

    Authors: Wancong Zhang, Anthony GX-Chen, Vlad Sobal, Yann LeCun, Nicolas Carion

    Abstract: Unsupervised visual representation learning offers the opportunity to leverage large corpora of unlabeled trajectories to form useful visual representations, which can benefit the training of reinforcement learning (RL) algorithms. However, evaluating the fitness of such representations requires training RL algorithms which is computationally intensive and has high variance outcomes. Inspired by t… ▽ More

    Submitted 31 May, 2024; v1 submitted 25 August, 2022; originally announced August 2022.

    Comments: To appear in the proceedings of the Reinforcement Learning Conference 2024

  2. arXiv:2201.01836  [pdf, other

    cs.LG cs.AI

    A Generalized Bootstrap Target for Value-Learning, Efficiently Combining Value and Feature Predictions

    Authors: Anthony GX-Chen, Veronica Chelu, Blake A. Richards, Joelle Pineau

    Abstract: Estimating value functions is a core component of reinforcement learning algorithms. Temporal difference (TD) learning algorithms use bootstrap**, i.e. they update the value function toward a learning target using value estimates at subsequent time-steps. Alternatively, the value function can be updated toward a learning target constructed by separately predicting successor features (SF)--a poli… ▽ More

    Submitted 5 January, 2022; originally announced January 2022.

    Comments: 18 pages, 6 figures, 2 tables. Preprint. Accepted by AAAI-22