Skip to main content

Showing 1–4 of 4 results for author: Pašukonis, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.08525  [pdf, other

    cs.AI cs.CV cs.LG cs.RO

    GATS: Gather-Attend-Scatter

    Authors: Konrad Zolna, Serkan Cabi, Yutian Chen, Eric Lau, Claudio Fantacci, Jurgis Pasukonis, Jost Tobias Springenberg, Sergio Gomez Colmenarejo

    Abstract: As the AI community increasingly adopts large-scale models, it is crucial to develop general and flexible tools to integrate them. We introduce Gather-Attend-Scatter (GATS), a novel module that enables seamless combination of pretrained foundation models, both trainable and frozen, into larger multimodal networks. GATS empowers AI systems to process and generate information across multiple modalit… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  2. arXiv:2301.04104  [pdf, other

    cs.AI cs.LG stat.ML

    Mastering Diverse Domains through World Models

    Authors: Danijar Hafner, Jurgis Pasukonis, Jimmy Ba, Timothy Lillicrap

    Abstract: Develo** a general algorithm that learns to solve tasks across a wide range of applications has been a fundamental challenge in artificial intelligence. Although current reinforcement learning algorithms can be readily applied to tasks similar to what they have been developed for, configuring them for new application domains requires significant human expertise and experimentation. We present Dr… ▽ More

    Submitted 17 April, 2024; v1 submitted 10 January, 2023; originally announced January 2023.

    Comments: Website: https://danijar.com/dreamerv3

  3. arXiv:2210.13383  [pdf, other

    cs.AI cs.LG

    Evaluating Long-Term Memory in 3D Mazes

    Authors: Jurgis Pasukonis, Timothy Lillicrap, Danijar Hafner

    Abstract: Intelligent agents need to remember salient information to reason in partially-observed environments. For example, agents with a first-person view should remember the positions of relevant objects even if they go out of view. Similarly, to effectively navigate through rooms agents need to remember the floor plan of how rooms are connected. However, most benchmark tasks in reinforcement learning do… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Project website: https://github.com/jurgisp/memory-maze

  4. arXiv:2103.15332  [pdf, other

    cs.LG cs.AI

    Measuring Sample Efficiency and Generalization in Reinforcement Learning Benchmarks: NeurIPS 2020 Procgen Benchmark

    Authors: Sharada Mohanty, Jyotish Poonganam, Adrien Gaidon, Andrey Kolobov, Blake Wulfe, Dipam Chakraborty, Gražvydas Šemetulskis, João Schapke, Jonas Kubilius, Jurgis Pašukonis, Linas Klimas, Matthew Hausknecht, Patrick MacAlpine, Quang Nhat Tran, Thomas Tumiel, Xiaocheng Tang, Xinwei Chen, Christopher Hesse, Jacob Hilton, William Hebgen Guss, Sahika Genc, John Schulman, Karl Cobbe

    Abstract: The NeurIPS 2020 Procgen Competition was designed as a centralized benchmark with clearly defined tasks for measuring Sample Efficiency and Generalization in Reinforcement Learning. Generalization remains one of the most fundamental challenges in deep reinforcement learning, and yet we do not have enough benchmarks to measure the progress of the community on Generalization in Reinforcement Learnin… ▽ More

    Submitted 29 March, 2021; originally announced March 2021.