Skip to main content

Showing 1–3 of 3 results for author: Bagaria, A

.
  1. arXiv:2306.03186  [pdf, other

    cs.LG cs.AI

    Flip** Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning

    Authors: Sam Lobel, Akhil Bagaria, George Konidaris

    Abstract: We propose a new method for count-based exploration in high-dimensional state spaces. Unlike previous work which relies on density models, we show that counts can be derived by averaging samples from the Rademacher distribution (or coin flips). This insight is used to set up a simple supervised learning objective which, when optimized, yields a state's visitation count. We show that our method is… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 11 pages (+9 appendix). Published as a conference paper at ICML 2023. Code available at https://github.com/samlobel/CFN/

  2. arXiv:2302.04693  [pdf, other

    cs.LG cs.AI

    Scaling Goal-based Exploration via Pruning Proto-goals

    Authors: Akhil Bagaria, Ray Jiang, Ramana Kumar, Tom Schaul

    Abstract: One of the gnarliest challenges in reinforcement learning (RL) is exploration that scales to vast domains, where novelty-, or coverage-seeking behaviour falls short. Goal-directed, purposeful behaviours are able to overcome this, but rely on a good goal space. The core challenge in goal discovery is finding the right balance between generality (not hand-crafted) and tractability (useful, not too m… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

  3. arXiv:1412.1140  [pdf, other

    cs.AR

    Sphynx: A Shared Instruction Cache Exporatory Study

    Authors: Dong-hyeon Park, Akhil Bagaria, Fabiha Hannan, Eric Storm, Josef Spjut

    Abstract: The Sphynx project was an exploratory study to discover what might be done to improve the heavy replication of in- structions in independent instruction caches for a massively parallel machine where a single program is executing across all of the cores. While a machine with only many cores (fewer than 50) might not have any issues replicating the instructions for each core, as we approach the era… ▽ More

    Submitted 2 December, 2014; originally announced December 2014.

    Comments: 4 pages, 6 figures

    ACM Class: B.3.2; C.1.2; C.1.4