Showing 1–32 of 32 results for author: Ammanabrolu, P

  1. arXiv:2310.11564  [pdf, other]

    cs.CL

    Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging

    Authors: Joel Jang, Seungone Kim, Bill Yuchen Lin, Yizhong Wang, Jack Hessel, Luke Zettlemoyer, Hannaneh Hajishirzi, Yejin Choi, Prithviraj Ammanabrolu

    Abstract: While Reinforcement Learning from Human Feedback (RLHF) aligns Large Language Models (LLMs) with general, aggregate human preferences, it is suboptimal for learning diverse, individual perspectives. In this work, we study the Reinforcement Learning from Personalized Human Feedback (RLPHF) problem, wherein LLMs are aligned to multiple (sometimes conflicting) preferences by modeling alignment as a Multi…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Preprint

  2. arXiv:2306.01693  [pdf, other]

    cs.CL

    Fine-Grained Human Feedback Gives Better Rewards for Language Model Training

    Authors: Zeqiu Wu, Yushi Hu, Weijia Shi, Nouha Dziri, Alane Suhr, Prithviraj Ammanabrolu, Noah A. Smith, Mari Ostendorf, Hannaneh Hajishirzi

    Abstract: Language models (LMs) often exhibit undesirable text generation behaviors, including generating false, toxic, or irrelevant outputs. Reinforcement learning from human feedback (RLHF) - where human preference judgments on LM outputs are transformed into a learning signal - has recently shown promise in addressing these issues. However, such holistic feedback conveys limited information on long text…

    Submitted 30 October, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 camera-ready

  3. arXiv:2305.17390  [pdf, other]

    cs.CL cs.AI cs.LG cs.MA cs.RO

    SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks

    Authors: Bill Yuchen Lin, Yicheng Fu, Karina Yang, Faeze Brahman, Shiyu Huang, Chandra Bhagavatula, Prithviraj Ammanabrolu, Yejin Choi, Xiang Ren

    Abstract: We introduce SwiftSage, a novel agent framework inspired by the dual-process theory of human cognition, designed to excel in action planning for complex interactive reasoning tasks. SwiftSage integrates the strengths of behavior cloning and prompting large language models (LLMs) to enhance task completion performance. The framework comprises two primary modules: the Swift module, representing fast…

    Submitted 6 December, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: Accepted to NeurIPS 2023 (spotlight). Project website: https://swiftsage.github.io

  4. arXiv:2305.15065  [pdf, other]

    cs.CL

    Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning

    Authors: Ximing Lu, Faeze Brahman, Peter West, Jaehun Jang, Khyathi Chandu, Abhilasha Ravichander, Lianhui Qin, Prithviraj Ammanabrolu, Liwei Jiang, Sahana Ramnath, Nouha Dziri, Jillian Fisher, Bill Yuchen Lin, Skyler Hallinan, Xiang Ren, Sean Welleck, Yejin Choi

    Abstract: While extreme-scale language models have demonstrated exceptional performance on a variety of language tasks, the degree of control over these language models through pure prompting can often be limited. Directly fine-tuning such language models can be effective for tailoring them, but it can be either extremely costly (e.g., GPT-3) or not even feasible for the broader community (e.g., GPT-4). W…

    Submitted 6 December, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  5. arXiv:2301.12050  [pdf, other]

    cs.LG cs.CL

    Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World Modelling

    Authors: Kolby Nottingham, Prithviraj Ammanabrolu, Alane Suhr, Yejin Choi, Hannaneh Hajishirzi, Sameer Singh, Roy Fox

    Abstract: Reinforcement learning (RL) agents typically learn tabula rasa, without prior knowledge of the world. However, if initialized with knowledge of high-level subgoals and transitions between subgoals, RL agents could utilize this Abstract World Model (AWM) for planning and exploration. We propose using few-shot large language models (LLMs) to hypothesize an AWM that will be verified through world ex…

    Submitted 27 April, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: In Proceedings of ICML 2023

  6. arXiv:2212.10060  [pdf, other]

    cs.CL cs.AI

    I Cast Detect Thoughts: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons

    Authors: Pei Zhou, Andrew Zhu, Jennifer Hu, Jay Pujara, Xiang Ren, Chris Callison-Burch, Yejin Choi, Prithviraj Ammanabrolu

    Abstract: We propose a novel task, G4C, to study teacher-student natural language interactions in a goal-driven and grounded environment. Dungeons and Dragons (D&D), a role-playing game, provides an ideal setting to investigate such interactions. Here, the Dungeon Master (DM), i.e., the teacher, guides the actions of several players -- students, each with their own personas and abilities -- to achieve share…

    Submitted 30 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Accepted to ACL 2023. 18 pages, 11 figures, 5 tables

  7. arXiv:2210.07382  [pdf, other]

    cs.CL cs.AI

    Behavior Cloned Transformers are Neurosymbolic Reasoners

    Authors: Ruoyao Wang, Peter Jansen, Marc-Alexandre Côté, Prithviraj Ammanabrolu

    Abstract: In this work, we explore techniques for augmenting interactive agents with information from symbolic modules, much like humans use tools like calculators and GPS systems to assist with arithmetic and navigation. We test our agent's abilities in text games -- challenging benchmarks for evaluating the multi-step reasoning abilities of game agents in grounded, language-based environments. Our experim…

    Submitted 11 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted to EACL 2023

  8. arXiv:2210.01241  [pdf, other]

    cs.CL cs.LG

    Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization

    Authors: Rajkumar Ramamurthy, Prithviraj Ammanabrolu, Kianté Brantley, Jack Hessel, Rafet Sifa, Christian Bauckhage, Hannaneh Hajishirzi, Yejin Choi

    Abstract: We tackle the problem of aligning pre-trained large language models (LMs) with human preferences. If we view text generation as a sequential decision-making problem, reinforcement learning (RL) appears to be a natural conceptual framework. However, using RL for LM-based generation faces empirical challenges, including training instability due to the combinatorial action space, as well as a lack of…

    Submitted 28 February, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: In Proceedings of ICLR 2023. Code found at https://github.com/allenai/rl4lms and Project website at https://rl4lms.apps.allenai.org/

  9. arXiv:2207.00746  [pdf, other]

    cs.CL

    INSCIT: Information-Seeking Conversations with Mixed-Initiative Interactions

    Authors: Zeqiu Wu, Ryu Parish, Hao Cheng, Sewon Min, Prithviraj Ammanabrolu, Mari Ostendorf, Hannaneh Hajishirzi

    Abstract: In an information-seeking conversation, a user may ask questions that are under-specified or unanswerable. An ideal agent would interact by initiating different response types according to the available knowledge sources. However, most current studies either fail to or artificially incorporate such agent-side initiative. This work presents InSCIt, a dataset for Information-Seeking Conversations wi…

    Submitted 22 June, 2023; v1 submitted 2 July, 2022; originally announced July 2022.

    Comments: TACL 2023

  10. arXiv:2205.13636  [pdf, other]

    cs.CL cs.LG

    Quark: Controllable Text Generation with Reinforced Unlearning

    Authors: Ximing Lu, Sean Welleck, Jack Hessel, Liwei Jiang, Lianhui Qin, Peter West, Prithviraj Ammanabrolu, Yejin Choi

    Abstract: Large-scale language models often learn behaviors that are misaligned with user expectations. Generated text may contain offensive or toxic language, contain significant repetition, or be of a different sentiment than desired by the user. We consider the task of unlearning these misalignments by fine-tuning the language model on signals of what not to do. We introduce Quantized Reward Konditioning…

    Submitted 16 November, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

    Journal ref: NeurIPS 2022 (Oral Selection)

  11. arXiv:2205.12630  [pdf, other]

    cs.CL cs.CV

    Multimodal Knowledge Alignment with Reinforcement Learning

    Authors: Youngjae Yu, Jiwan Chung, Heeseung Yun, Jack Hessel, JaeSung Park, Ximing Lu, Prithviraj Ammanabrolu, Rowan Zellers, Ronan Le Bras, Gunhee Kim, Yejin Choi

    Abstract: Large language models readily adapt to novel settings, even without task-specific training data. Can their zero-shot capacity be extended to multimodal inputs? In this work, we propose ESPER which extends language-only zero-shot models to unseen multimodal tasks, like image and audio captioning. Our key novelty is to use reinforcement learning to align multimodal inputs to language model generatio…

    Submitted 25 May, 2022; originally announced May 2022.

    ACM Class: I.2.7; I.4.9

  12. arXiv:2205.01975  [pdf, other]

    cs.CL cs.AI

    Aligning to Social Norms and Values in Interactive Narratives

    Authors: Prithviraj Ammanabrolu, Liwei Jiang, Maarten Sap, Hannaneh Hajishirzi, Yejin Choi

    Abstract: We focus on creating agents that act in alignment with socially beneficial norms and values in interactive narratives or text-based games -- environments wherein an agent perceives and interacts with a world through natural language. Such interactive agents are often trained via reinforcement learning to optimize task performance, even when such rewards may lead to agent behaviors that violate soc…

    Submitted 4 May, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: In Proceedings of NAACL-2022

  13. arXiv:2203.07540  [pdf, other]

    cs.CL cs.AI

    ScienceWorld: Is your Agent Smarter than a 5th Grader?

    Authors: Ruoyao Wang, Peter Jansen, Marc-Alexandre Côté, Prithviraj Ammanabrolu

    Abstract: We present ScienceWorld, a benchmark to test agents' scientific reasoning abilities in a new interactive text environment at the level of a standard elementary school science curriculum. Despite the transformer-based progress seen in question-answering and scientific text processing, we find that current models cannot reason about or explain learned science concepts in novel contexts. For instance…

    Submitted 14 November, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: Accepted to EMNLP 2022

  14. arXiv:2112.08907  [pdf, other]

    cs.HC cs.AI cs.CL

    Inherently Explainable Reinforcement Learning in Natural Language

    Authors: Xiangyu Peng, Mark O. Riedl, Prithviraj Ammanabrolu

    Abstract: We focus on the task of creating a reinforcement learning agent that is inherently explainable -- with the ability to produce immediate local explanations by thinking out loud while performing a task and analyzing entire trajectories post-hoc to produce causal explanations. This Hierarchically Explainable Reinforcement Learning agent (HEX-RL) operates in Interactive Fictions, text-based game envi…

    Submitted 6 October, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

  15. arXiv:2110.03262  [pdf, other]

    cs.CL cs.AI

    Situated Dialogue Learning through Procedural Environment Generation

    Authors: Prithviraj Ammanabrolu, Renee Jia, Mark O. Riedl

    Abstract: We teach goal-driven agents to interactively act and speak in situated environments by training on generated curriculums. Our agents operate in LIGHT (Urbanek et al. 2019) -- a large-scale crowd-sourced fantasy text adventure game wherein an agent perceives and interacts with the world through textual natural language. Goals in this environment take the form of character-based quests, consisting o…

    Submitted 24 February, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: Camera ready. In proceedings of ACL 2022

  16. arXiv:2106.09608  [pdf, other]

    cs.LG cs.AI cs.CL

    Learning Knowledge Graph-based World Models of Textual Environments

    Authors: Prithviraj Ammanabrolu, Mark O. Riedl

    Abstract: World models improve a learning agent's ability to efficiently operate in interactive and situated environments. This work focuses on the task of building world models of text-based game environments. Text-based games, or interactive narratives, are reinforcement learning environments in which agents perceive and interact with the world using textual natural language. These environments contain lo…

    Submitted 20 October, 2021; v1 submitted 17 June, 2021; originally announced June 2021.

    Comments: Camera ready. In Proceedings of NeurIPS 2021 Main Conference

  17. arXiv:2106.09578  [pdf, other]

    cs.CL cs.AI

    Modeling Worlds in Text

    Authors: Prithviraj Ammanabrolu, Mark O. Riedl

    Abstract: We provide a dataset that enables the creation of learning agents that can build knowledge graph-based world models of interactive narratives. Interactive narratives -- or text-adventure games -- are partially observable environments structured as long puzzles or quests in which an agent perceives and interacts with the world purely through textual natural language. Each individual game typically…

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: Preprint. Under review. Benchmark can be found at https://github.com/JerichoWorld/JerichoWorld

  18. arXiv:2105.15054  [pdf, other]

    cs.CL cs.AI

    Telling Stories through Multi-User Dialogue by Modeling Character Relations

    Authors: Wai Man Si, Prithviraj Ammanabrolu, Mark O. Riedl

    Abstract: This paper explores character-driven story continuation, in which the story emerges through characters' first- and second-person narration as well as dialogue -- requiring models to select language that is consistent with a character's persona and their relationships with other characters while following and advancing the story. We hypothesize that a multi-task model that trains on character dialo…

    Submitted 31 May, 2021; originally announced May 2021.

    Comments: In Proceedings of SIGDIAL 2021

  19. arXiv:2103.09977  [pdf, other]

    cs.AI cs.CL

    Situated Language Learning via Interactive Narratives

    Authors: Prithviraj Ammanabrolu, Mark O. Riedl

    Abstract: This paper provides a roadmap that explores the question of how to imbue learning agents with the ability to understand and generate contextually relevant natural language in service of achieving a goal. We hypothesize that two key components in creating such agents are interactivity and environment grounding, shown to be vital parts of language learning in humans, and posit that interactive narra…

    Submitted 17 March, 2021; originally announced March 2021.

    Comments: Preprint. Under journal review

  20. arXiv:2012.02757  [pdf, other]

    cs.AI cs.CL

    Playing Text-Based Games with Common Sense

    Authors: Sahith Dambekodi, Spencer Frazier, Prithviraj Ammanabrolu, Mark O. Riedl

    Abstract: Text-based games are simulations in which an agent interacts with the world purely through natural language. They typically consist of a number of puzzles interspersed with interactions with common everyday objects and locations. Deep reinforcement learning agents can learn to solve these puzzles. However, the everyday interactions with the environment, while trivial for human players, present as…

    Submitted 4 December, 2020; originally announced December 2020.

  21. arXiv:2010.00685  [pdf, other]

    cs.CL cs.AI

    How to Motivate Your Dragon: Teaching Goal-Driven Agents to Speak and Act in Fantasy Worlds

    Authors: Prithviraj Ammanabrolu, Jack Urbanek, Margaret Li, Arthur Szlam, Tim Rocktäschel, Jason Weston

    Abstract: We seek to create agents that both act and communicate with other agents in pursuit of a goal. Towards this end, we extend LIGHT (Urbanek et al. 2019) -- a large-scale crowd-sourced fantasy text-game -- with a dataset of quests. These contain natural language motivations paired with in-game goals and human demonstrations; completing a quest might require dialogue or actions (or both). We introduce…

    Submitted 25 May, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

    Comments: In NAACL 2021

  22. arXiv:2009.00829  [pdf, other]

    cs.CL cs.AI

    Automated Storytelling via Causal, Commonsense Plot Ordering

    Authors: Prithviraj Ammanabrolu, Wesley Cheung, William Broniec, Mark O. Riedl

    Abstract: Automated story plot generation is the task of generating a coherent sequence of plot events. Causal relations between plot events are believed to increase the perception of story and plot coherence. In this work, we introduce the concept of soft causal relations as causal relations inferred from commonsense reasoning. We demonstrate C2PO, an approach to narrative generation that operationalizes t…

    Submitted 30 December, 2020; v1 submitted 2 September, 2020; originally announced September 2020.

    Comments: AAAI-21 Camera Ready Version

  23. arXiv:2006.07409  [pdf, other]

    cs.AI cs.CL cs.LG stat.ML

    How to Avoid Being Eaten by a Grue: Structured Exploration Strategies for Textual Worlds

    Authors: Prithviraj Ammanabrolu, Ethan Tien, Matthew Hausknecht, Mark O. Riedl

    Abstract: Text-based games are long puzzles or quests, characterized by a sequence of sparse and potentially deceptive rewards. They provide an ideal platform to develop agents that perceive and act upon the world using a combinatorially sized natural language state-action space. Standard Reinforcement Learning agents are poorly equipped to effectively explore such spaces and often struggle to overcome bott…

    Submitted 12 June, 2020; originally announced June 2020.

  24. arXiv:2002.08795  [pdf, other]

    cs.LG cs.AI cs.CL

    How To Avoid Being Eaten By a Grue: Exploration Strategies for Text-Adventure Agents

    Authors: Prithviraj Ammanabrolu, Ethan Tien, Zhaochen Luo, Mark O. Riedl

    Abstract: Text-based games -- in which an agent interacts with the world through textual natural language -- present us with the problem of combinatorially-sized action-spaces. Most current reinforcement learning algorithms are not capable of effectively handling such a large number of possible actions per turn. Poor sample efficiency, consequently, results in agents that are unable to pass bottleneck state…

    Submitted 19 February, 2020; originally announced February 2020.

  25. arXiv:2001.10161  [pdf, other]

    cs.AI cs.CL

    Bringing Stories Alive: Generating Interactive Fiction Worlds

    Authors: Prithviraj Ammanabrolu, Wesley Cheung, Dan Tu, William Broniec, Mark O. Riedl

    Abstract: World building forms the foundation of any task that requires narrative intelligence. In this work, we focus on procedurally generating interactive fiction worlds -- text-based worlds that players "see" and "talk to" using natural language. Generating these worlds requires referencing everyday and thematic commonsense priors in addition to being semantically consistent, interesting, and coherent th…

    Submitted 27 January, 2020; originally announced January 2020.

  26. arXiv:2001.08837  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Graph Constrained Reinforcement Learning for Natural Language Action Spaces

    Authors: Prithviraj Ammanabrolu, Matthew Hausknecht

    Abstract: Interactive Fiction games are text-based simulations in which an agent interacts with the world purely through natural language. They are ideal environments for studying how to extend reinforcement learning agents to meet the challenges of natural language understanding, partial observability, and action generation in combinatorially-large text-based action spaces. We present KG-A2C, an agent that…

    Submitted 23 January, 2020; originally announced January 2020.

    Comments: Accepted to ICLR 2020

  27. arXiv:1909.06283  [pdf, other]

    cs.CL cs.AI

    Toward Automated Quest Generation in Text-Adventure Games

    Authors: Prithviraj Ammanabrolu, William Broniec, Alex Mueller, Jeremy Paul, Mark O. Riedl

    Abstract: Interactive fictions, or text-adventures, are games in which a player interacts with a world entirely through textual descriptions and text actions. Text-adventure games are typically structured as puzzles or quests wherein the player must execute certain actions in a certain order to succeed. In this paper, we consider the problem of procedurally generating a quest, defined as a series of actions…

    Submitted 19 August, 2020; v1 submitted 13 September, 2019; originally announced September 2019.

    Comments: In Proceedings of the International Conference on Computational Creativity (ICCC-20)

  28. arXiv:1909.05398  [pdf, other]

    cs.AI cs.CL

    Interactive Fiction Games: A Colossal Adventure

    Authors: Matthew Hausknecht, Prithviraj Ammanabrolu, Marc-Alexandre Côté, Xingdi Yuan

    Abstract: A hallmark of human intelligence is the ability to understand and communicate with language. Interactive Fiction games are fully text-based simulation environments where a player issues text commands to effect change in the environment and progress through the story. We argue that IF games are an excellent testbed for studying language-based autonomous agents. In particular, IF games combine chall…

    Submitted 25 February, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

  29. arXiv:1909.03480  [pdf, other]

    cs.CL cs.AI cs.LG

    Story Realization: Expanding Plot Events into Sentences

    Authors: Prithviraj Ammanabrolu, Ethan Tien, Wesley Cheung, Zhaochen Luo, William Ma, Lara J. Martin, Mark O. Riedl

    Abstract: Neural network based approaches to automated story plot generation attempt to learn how to generate novel plots from a corpus of natural language plot summaries. Prior work has shown that a semantic abstraction of sentences called events improves neural plot generation and allows one to decompose the problem into: (1) the generation of a sequence of events (event-to-event) and (2) the transfor…

    Submitted 21 November, 2019; v1 submitted 8 September, 2019; originally announced September 2019.

    Comments: In proceedings of AAAI 2020

    Journal ref: AAAI Conference on Artificial Intelligence (AAAI), vol. 34, no. 5, pp. 7375-7382, Apr. 2020

  30. arXiv:1908.06556  [pdf, other]

    cs.CL cs.AI cs.LG

    Transfer in Deep Reinforcement Learning using Knowledge Graphs

    Authors: Prithviraj Ammanabrolu, Mark O. Riedl

    Abstract: Text adventure games, in which players must make sense of the world through text descriptions and declare actions through text descriptions, provide a stepping stone toward grounding action in language. Prior work has demonstrated that using a knowledge graph as a state representation and question-answering to pre-train a deep Q-network facilitates faster control policy transfer. In this paper, we…

    Submitted 18 August, 2019; originally announced August 2019.

  31. arXiv:1812.01628  [pdf, other]

    cs.CL cs.AI cs.LG

    Playing Text-Adventure Games with Graph-Based Deep Reinforcement Learning

    Authors: Prithviraj Ammanabrolu, Mark O. Riedl

    Abstract: Text-based adventure games provide a platform on which to explore reinforcement learning in the context of a combinatorial action space, such as natural language. We present a deep reinforcement learning architecture that represents the game state as a knowledge graph which is learned during exploration. This graph is used to prune the action space, enabling more efficient exploration. The questio…

    Submitted 25 March, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: Proceedings of NAACL-HLT 2019

  32. arXiv:1706.01331  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE

    Event Representations for Automated Story Generation with Deep Neural Nets

    Authors: Lara J. Martin, Prithviraj Ammanabrolu, Xinyu Wang, William Hancock, Shruti Singh, Brent Harrison, Mark O. Riedl

    Abstract: Automated story generation is the problem of automatically selecting a sequence of events, actions, or words that can be told as a story. We seek to develop a system that can generate stories by learning everything it needs to know from textual story corpora. To date, recurrent neural networks that learn language models at character, word, or sentence levels have had little success generating cohe…

    Submitted 12 September, 2017; v1 submitted 5 June, 2017; originally announced June 2017.

    Comments: Submitted to AAAI'18

    Journal ref: Thirty-Second AAAI Conference on Artificial Intelligence, vol. 32, no. 1, pp. 868-875, Apr. 2018