Skip to main content

Showing 1–3 of 3 results for author: Babayan, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.10003  [pdf, other

    cs.CL

    ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent

    Authors: Renat Aksitov, Sobhan Miryoosefi, Zonglin Li, Daliang Li, Sheila Babayan, Kavya Kopparapu, Zachary Fisher, Ruiqi Guo, Sushant Prakash, Pranesh Srinivasan, Manzil Zaheer, Felix Yu, Sanjiv Kumar

    Abstract: Answering complex natural language questions often necessitates multi-step reasoning and integrating external information. Several systems have combined knowledge retrieval with a large language model (LLM) to answer such questions. These systems, however, suffer from various failure cases, and we cannot directly train them end-to-end to fix such failures, as interaction with external knowledge is… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 19 pages, 4 figures, 4 tables, 8 listings

  2. arXiv:2302.00763  [pdf, other

    cs.LG cs.AI cs.CL

    Collaborating with language models for embodied reasoning

    Authors: Ishita Dasgupta, Christine Kaeser-Chen, Kenneth Marino, Arun Ahuja, Sheila Babayan, Felix Hill, Rob Fergus

    Abstract: Reasoning in a complex and ambiguous environment is a key goal for Reinforcement Learning (RL) agents. While some sophisticated RL agents can successfully solve difficult tasks, they require a large amount of training data and often struggle to generalize to new unseen environments and new tasks. On the other hand, Large Scale Language Models (LSLMs) have exhibited strong reasoning ability and the… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

    Comments: Presented at NeurIPS 2022 Language and Reinforcement Learning Workshop (best paper) and NeurIPS 2022 Foundation Models for Decision Making Workshop. 4 pages main; 14 pages total (including references and appendix); 3 figures

  3. arXiv:2211.00177  [pdf, other

    cs.LG cs.IR cs.SI

    Learning to Navigate Wikipedia by Taking Random Walks

    Authors: Manzil Zaheer, Kenneth Marino, Will Grathwohl, John Schultz, Wendy Shang, Sheila Babayan, Arun Ahuja, Ishita Dasgupta, Christine Kaeser-Chen, Rob Fergus

    Abstract: A fundamental ability of an intelligent web-based agent is seeking out and acquiring new information. Internet search engines reliably find the correct vicinity but the top results may be a few links away from the desired target. A complementary approach is navigation via hyperlinks, employing a policy that comprehends local content and selects a link that moves it closer to the target. In this pa… ▽ More

    Submitted 31 October, 2022; originally announced November 2022.

    Journal ref: NeurIPS 2022