Showing 1–13 of 13 results for author: Furuta, H

Searching in archive cs.
  1. arXiv:2402.16726  [pdf, other]

    cs.LG cs.AI

    Interpreting Grokked Transformers in Complex Modular Arithmetic

    Authors: Hiroki Furuta, Gouki Minegishi, Yusuke Iwasawa, Yutaka Matsuo

    Abstract: Grokking has been actively explored to reveal the mystery of delayed generalization. Identifying interpretable algorithms inside the grokked models is a suggestive hint to understanding its mechanism. In this work, beyond the simplest and well-studied modular addition, we observe the internal circuits learned through grokking in complex modular arithmetic via interpretable reverse engineering, whi…

    Submitted 26 February, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Code: https://github.com/frt03/grok_mod_poly

  2. arXiv:2402.09727  [pdf, other]

    cs.CL cs.AI cs.IR

    A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts

    Authors: Kuang-Huei Lee, Xinyun Chen, Hiroki Furuta, John Canny, Ian Fischer

    Abstract: Current Large Language Models (LLMs) are not only limited to some maximum context length, but also are not able to robustly consume long inputs. To address these limitations, we propose ReadAgent, an LLM agent system that increases effective context length up to 20x in our experiments. Inspired by how humans interactively read long documents, we implement ReadAgent as a simple prompting system tha…

    Submitted 23 February, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Website: https://read-agent.github.io

  3. arXiv:2311.18751  [pdf, other]

    cs.LG cs.AI cs.CL

    Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web

    Authors: Hiroki Furuta, Yutaka Matsuo, Aleksandra Faust, Izzeddin Gur

    Abstract: Language model agents (LMA) recently emerged as a promising paradigm on multi-step decision making tasks, often outperforming humans and other reinforcement learning agents. Despite the promise, their performance on real-world applications that often involve combinations of tasks is still underexplored. In this work, we introduce a new benchmark, called CompWoB -- 50 new compositional web automatio…

    Submitted 4 February, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: Code: https://github.com/google-research/google-research/tree/master/compositional_rl/compwob

  4. arXiv:2310.08864  [pdf, other]

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie, et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method…

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  5. arXiv:2307.12856  [pdf, other]

    cs.LG cs.AI cs.CL

    A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis

    Authors: Izzeddin Gur, Hiroki Furuta, Austin Huang, Mustafa Safdari, Yutaka Matsuo, Douglas Eck, Aleksandra Faust

    Abstract: Pre-trained large language models (LLMs) have recently achieved better generalization and sample efficiency in autonomous web automation. However, the performance on real-world websites has still suffered from (1) open domainness, (2) limited context length, and (3) lack of inductive bias on HTML. We introduce WebAgent, an LLM-driven agent that learns from self-experience to complete tasks on real…

    Submitted 25 February, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: Accepted to ICLR 2024 (Oral)

  6. arXiv:2305.11854  [pdf, other]

    cs.LG cs.AI stat.ML

    Multimodal Web Navigation with Instruction-Finetuned Foundation Models

    Authors: Hiroki Furuta, Kuang-Huei Lee, Ofir Nachum, Yutaka Matsuo, Aleksandra Faust, Shixiang Shane Gu, Izzeddin Gur

    Abstract: The progress of autonomous web navigation has been hindered by the dependence on billions of exploratory interactions via online reinforcement learning, and domain-specific model designs that make it difficult to leverage generalization from rich out-of-domain data. In this work, we study data-driven offline training for web agents with vision-language foundation models. We propose an instruction-…

    Submitted 25 February, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Accepted to ICLR 2024. Website: https://sites.google.com/view/mm-webnav/

  7. arXiv:2211.15136  [pdf, other]

    cs.RO cs.AI cs.LG

    Collective Intelligence for 2D Push Manipulations with Mobile Robots

    Authors: So Kuroki, Tatsuya Matsushima, Jumpei Arima, Hiroki Furuta, Yutaka Matsuo, Shixiang Shane Gu, Yujin Tang

    Abstract: While natural systems often present collective intelligence that allows them to self-organize and adapt to changes, the equivalent is missing in most artificial systems. We explore the possibility of such a system in the context of cooperative 2D push manipulations using mobile robots. Although conventional works demonstrate potential solutions for the problem in restricted settings, they have com…

    Submitted 4 April, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: Robotics and Automation Letters (RA-L) 2023

  8. arXiv:2211.14296  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation

    Authors: Hiroki Furuta, Yusuke Iwasawa, Yutaka Matsuo, Shixiang Shane Gu

    Abstract: The rise of generalist large-scale models in natural language and vision has made us expect that a massive data-driven approach could achieve broader generalization in other domains such as continuous control. In this work, we explore a method for learning a single policy that manipulates various forms of agents to solve various tasks by distilling a large amount of proficient behavioral data. In…

    Submitted 4 February, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: Accepted at ICLR2023 (notable-top-25%), Website: https://sites.google.com/view/control-graph

  9. arXiv:2111.10364  [pdf, other]

    cs.LG cs.AI stat.ML

    Generalized Decision Transformer for Offline Hindsight Information Matching

    Authors: Hiroki Furuta, Yutaka Matsuo, Shixiang Shane Gu

    Abstract: How to extract as much learning signal from each trajectory data has been a key problem in reinforcement learning (RL), where sample inefficiency has posed serious challenges for practical applications. Recent works have shown that using expressive policy function approximators and conditioning on future trajectory information -- such as future states in hindsight experience replay or returns-to-g…

    Submitted 4 February, 2022; v1 submitted 19 November, 2021; originally announced November 2021.

    Comments: Accepted to ICLR2022, Spotlight. Website: https://sites.google.com/view/generalizeddt and Code: https://github.com/frt03/generalized_dt

  10. arXiv:2110.04686  [pdf, other]

    cs.LG cs.AI

    Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization

    Authors: Shixiang Shane Gu, Manfred Diaz, Daniel C. Freeman, Hiroki Furuta, Seyed Kamyar Seyed Ghasemipour, Anton Raichuk, Byron David, Erik Frey, Erwin Coumans, Olivier Bachem

    Abstract: The goal of continuous control is to synthesize desired behaviors. In reinforcement learning (RL)-driven approaches, this is often accomplished through careful task reward engineering for efficient exploration and running an off-the-shelf RL algorithm. While reward maximization is at the core of RL, reward engineering is not the only -- sometimes nor the easiest -- way for specifying complex behav…

    Submitted 9 October, 2021; originally announced October 2021.

  11. arXiv:2103.17258  [pdf, other]

    cs.LG cs.AI stat.ML

    Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning

    Authors: Hiroki Furuta, Tadashi Kozuno, Tatsuya Matsushima, Yutaka Matsuo, Shixiang Shane Gu

    Abstract: Recently many algorithms were devised for reinforcement learning (RL) with function approximation. While they have clear algorithmic distinctions, they also have many implementation differences that are algorithm-independent and sometimes under-emphasized. Such mixing of algorithmic novelty and implementation craftsmanship makes rigorous analyses of the sources of performance improvements across a…

    Submitted 25 October, 2021; v1 submitted 31 March, 2021; originally announced March 2021.

    Comments: Accepted at NeurIPS 2021. The implementation is available at: https://github.com/frt03/inference-based-rl

  12. arXiv:2103.12726  [pdf, other]

    cs.LG cs.AI stat.ML

    Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning

    Authors: Hiroki Furuta, Tatsuya Matsushima, Tadashi Kozuno, Yutaka Matsuo, Sergey Levine, Ofir Nachum, Shixiang Shane Gu

    Abstract: Progress in deep reinforcement learning (RL) research is largely enabled by benchmark task environments. However, analyzing the nature of those environments is often overlooked. In particular, we still do not have agreeable ways to measure the difficulty or solvability of a task, given that each has fundamentally different actions, observations, dynamics, rewards, and can be tackled with diverse R…

    Submitted 31 May, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: Accepted to ICML2021. The code is available at: https://github.com/frt03/pic

  13. arXiv:2006.03647  [pdf, other]

    cs.LG cs.AI stat.ML

    Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization

    Authors: Tatsuya Matsushima, Hiroki Furuta, Yutaka Matsuo, Ofir Nachum, Shixiang Gu

    Abstract: Most reinforcement learning (RL) algorithms assume online access to the environment, in which one may readily interleave updates to the policy with experience collection using that policy. However, in many real-world applications such as health, education, dialogue agents, and robotics, the cost or potential risk of deploying a new data-collection policy is high, to the point that it can become pr…

    Submitted 23 June, 2020; v1 submitted 5 June, 2020; originally announced June 2020.