Showing 1–50 of 98 results for author: Andreas, J

  1. arXiv:2406.11830

    cs.CL cs.AI

    Language Modeling with Editable External Knowledge

    Authors: Belinda Z. Li, Emmy Liu, Alexis Ross, Abbas Zeitoun, Graham Neubig, Jacob Andreas

    Abstract: When the world changes, so does the text that humans write about it. How do we build language models that can be easily updated to reflect these changes? One popular approach is retrieval-augmented generation, in which new documents are inserted into a knowledge base and retrieved during prediction for downstream tasks. Most prior work on these systems has focused on improving behavior during pre…

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2406.06950

    cs.CL

    A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation

    Authors: Bairu Hou, Yang Zhang, Jacob Andreas, Shiyu Chang

    Abstract: This paper focuses on the task of hallucination detection, which aims to determine the truthfulness of LLM-generated statements. To address this problem, a popular class of methods utilizes the LLM's self-consistencies in its beliefs in a set of logically related augmented statements generated by the LLM, which does not require external knowledge databases and can work with both white-box and black…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 26 pages, 18 figures

  3. arXiv:2405.09605

    cs.CL cs.AI cs.LG

    Elements of World Knowledge (EWOK): A cognition-inspired framework for evaluating basic world knowledge in language models

    Authors: Anna A. Ivanova, Aalok Sathe, Benjamin Lipkin, Unnathi Kumar, Setayesh Radkani, Thomas H. Clark, Carina Kauf, Jennifer Hu, R. T. Pramod, Gabriel Grand, Vivian Paulun, Maria Ryskina, Ekin Akyürek, Ethan Wilcox, Nafisa Rashid, Leshem Choshen, Roger Levy, Evelina Fedorenko, Joshua Tenenbaum, Jacob Andreas

    Abstract: The ability to build and leverage world models is essential for a general-purpose AI agent. Testing such capabilities is hard, in part because the building blocks of world models are ill-defined. We present Elements of World Knowledge (EWOK), a framework for evaluating world modeling in language models by testing their ability to use knowledge of a concept to match a target text with a plausible/i…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 21 pages (11 main), 7 figures. Authors Anna Ivanova, Aalok Sathe, Benjamin Lipkin contributed equally

  4. arXiv:2405.04726

    cs.CL

    Learning Phonotactics from Linguistic Informants

    Authors: Canaan Breiss, Alexis Ross, Amani Maina-Kilaas, Roger Levy, Jacob Andreas

    Abstract: We propose an interactive approach to language learning that utilizes linguistic acceptability judgments from an informant (a competent language user) to learn a grammar. Given a grammar formalism and a framework for synthesizing data, our model iteratively selects or synthesizes a data-point according to one of a range of information-theoretic policies, asks the informant for a binary judgment, a…

    Submitted 7 May, 2024; originally announced May 2024.

  5. arXiv:2405.04495

    cs.CL cs.AI cs.LG

    Toward In-Context Teaching: Adapting Examples to Students' Misconceptions

    Authors: Alexis Ross, Jacob Andreas

    Abstract: When a teacher provides examples for a student to study, these examples must be informative, enabling a student to progress from their current state toward a target concept or skill. Good teachers must therefore simultaneously infer what students already know and adapt their teaching to students' changing state of knowledge. There is increasing interest in using computational models, particularly…

    Submitted 7 May, 2024; originally announced May 2024.

  6. arXiv:2405.04118

    cs.LG cs.AI cs.CL

    Policy Learning with a Language Bottleneck

    Authors: Megha Srivastava, Cedric Colas, Dorsa Sadigh, Jacob Andreas

    Abstract: Modern AI systems such as self-driving cars and game-playing agents achieve superhuman performance, but often lack human-like features such as generalization, interpretability and human inter-operability. Inspired by the rich interactions between language and decision-making in humans, we introduce Policy Learning with a Language Bottleneck (PLLB), a framework enabling AI agents to generate lingui…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 18 pages, 13 figures

  7. arXiv:2404.14394

    cs.AI cs.CL cs.CV

    A Multimodal Automated Interpretability Agent

    Authors: Tamar Rott Shaham, Sarah Schwettmann, Franklin Wang, Achyuta Rajaram, Evan Hernandez, Jacob Andreas, Antonio Torralba

    Abstract: This paper describes MAIA, a Multimodal Automated Interpretability Agent. MAIA is a system that uses neural models to automate neural model understanding tasks like feature interpretation and failure mode discovery. It equips a pre-trained vision-language model with a set of tools that support iterative experimentation on subcomponents of other models to explain their behavior. These include tools…

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 25 pages, 13 figures

  8. arXiv:2404.14349

    cs.CV cs.AI

    Automatic Discovery of Visual Circuits

    Authors: Achyuta Rajaram, Neil Chowdhury, Antonio Torralba, Jacob Andreas, Sarah Schwettmann

    Abstract: To date, most discoveries of network subcomponents that implement human-interpretable computations in deep vision models have involved close study of single units and large amounts of human labor. We explore scalable methods for extracting the subgraph of a vision model's computational graph that underlies recognition of a specific visual concept. We introduce a new method for identifying these su…

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 14 pages, 11 figures

  9. arXiv:2404.03028

    cs.CL

    An Incomplete Loop: Deductive, Inductive, and Abductive Learning in Large Language Models

    Authors: Emmy Liu, Graham Neubig, Jacob Andreas

    Abstract: Modern language models (LMs) can learn to perform new tasks in different ways: in instruction following, the target task is described explicitly in natural language; in few-shot prompting, the task is specified implicitly with a small number of examples; in instruction inference, LMs are presented with in-context examples and are then prompted to generate a natural language task description before…

    Submitted 10 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  10. arXiv:2403.14551

    cs.CL cs.AI cs.LG

    Lexicon-Level Contrastive Visual-Grounding Improves Language Modeling

    Authors: Chengxu Zhuang, Evelina Fedorenko, Jacob Andreas

    Abstract: Today's most accurate language models are trained on orders of magnitude more language data than human language learners receive - but with no supervision from other sensory modalities that play a crucial role in human learning. Can we make LMs' representations and predictions more accurate (and more human-like) with more ecologically plausible supervision? This paper describes LexiContrastive Gro…

    Submitted 21 March, 2024; originally announced March 2024.

  11. arXiv:2403.05534

    cs.CL

    Bayesian Preference Elicitation with Language Models

    Authors: Kunal Handa, Yarin Gal, Ellie Pavlick, Noah Goodman, Jacob Andreas, Alex Tamkin, Belinda Z. Li

    Abstract: Aligning AI systems to users' interests requires understanding and incorporating humans' complex values and preferences. Recently, language models (LMs) have been used to gather information about the preferences of human users. This preference data can be used to fine-tune or guide other LMs and/or AI systems. However, LMs have been shown to struggle with crucial aspects of preference learning: qu…

    Submitted 8 March, 2024; originally announced March 2024.

  12. arXiv:2402.19471

    cs.CL cs.AI

    Loose LIPS Sink Ships: Asking Questions in Battleship with Language-Informed Program Sampling

    Authors: Gabriel Grand, Valerio Pepe, Jacob Andreas, Joshua B. Tenenbaum

    Abstract: Questions combine our mastery of language with our remarkable facility for reasoning about uncertainty. How do people navigate vast hypothesis spaces to pose informative questions given limited cognitive resources? We study these tradeoffs in a classic grounded question-asking task based on the board game Battleship. Our language-informed program sampling (LIPS) model uses large language models (L…

    Submitted 1 May, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted to CogSci 2024

  13. arXiv:2402.18759

    cs.RO cs.AI cs.LG

    Learning with Language-Guided State Abstractions

    Authors: Andi Peng, Ilia Sucholutsky, Belinda Z. Li, Theodore R. Sumers, Thomas L. Griffiths, Jacob Andreas, Julie A. Shah

    Abstract: We describe a framework for using natural language to design state abstractions for imitation learning. Generalizable policy learning in high-dimensional observation spaces is facilitated by well-designed state representations, which can surface important features of an environment and hide irrelevant ones. These state representations are typically manually specified, or derived from other labor-i…

    Submitted 6 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: ICLR 2024

  14. arXiv:2401.12973

    cs.CL cs.LG

    In-Context Language Learning: Architectures and Algorithms

    Authors: Ekin Akyürek, Bailin Wang, Yoon Kim, Jacob Andreas

    Abstract: Large-scale neural language models exhibit a remarkable capacity for in-context learning (ICL): they can infer novel functions from datasets provided as input. Most of our current understanding of when and how ICL arises comes from LMs trained on extremely simple learning problems like linear regression and associative recall. There remains a significant gap between these model problems and the "r…

    Submitted 30 January, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: Fixes a typo in the title, and adds additional references

  15. arXiv:2401.08574

    cs.CL

    Deductive Closure Training of Language Models for Coherence, Accuracy, and Updatability

    Authors: Afra Feyza Akyürek, Ekin Akyürek, Leshem Choshen, Derry Wijaya, Jacob Andreas

    Abstract: While language models (LMs) can sometimes generate factually correct text and estimate truth values of individual claims, these generally do not reflect a globally coherent, manipulable model of the world. As a consequence, current LMs also generate incorrect or nonsensical content, and are difficult to edit and bring up to date. We present a method called Deductive Closure Training (DCT) that use…

    Submitted 26 June, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: ACL Findings

  16. arXiv:2312.08566

    cs.AI cs.CL cs.RO

    Learning adaptive planning representations with natural language guidance

    Authors: Lionel Wong, Jiayuan Mao, Pratyusha Sharma, Zachary S. Siegel, Jiahai Feng, Noa Korneev, Joshua B. Tenenbaum, Jacob Andreas

    Abstract: Effective planning in the real world requires not only world knowledge, but the ability to leverage that knowledge to build the right representation of the task at hand. Decades of hierarchical planning techniques have used domain-specific temporal action abstractions to support efficient and accurate planning, almost always relying on human priors and domain knowledge to decompose hard tasks into…

    Submitted 13 December, 2023; originally announced December 2023.

  17. arXiv:2312.06032

    cs.AI cs.HC

    Evaluating the Utility of Model Explanations for Model Development

    Authors: Shawn Im, Jacob Andreas, Yilun Zhou

    Abstract: One of the motivations for explainable AI is to allow humans to make better and more informed decisions regarding the use and deployment of AI models. But careful evaluations are needed to assess whether this expectation has been fulfilled. Current evaluations mainly focus on algorithmic properties of explanations, and those that involve human subjects often employ subjective questions to test hum…

    Submitted 10 December, 2023; originally announced December 2023.

  18. arXiv:2312.04030

    cs.AI cs.LG

    Modeling Boundedly Rational Agents with Latent Inference Budgets

    Authors: Athul Paul Jacob, Abhishek Gupta, Jacob Andreas

    Abstract: We study the problem of modeling a population of agents pursuing unknown goals subject to unknown computational constraints. In standard models of bounded rationality, sub-optimal decision-making is simulated by adding homoscedastic noise to optimal decisions rather than explicitly simulating constrained inference. In this work, we introduce a latent inference budget model (L-IBM) that models agen…

    Submitted 6 December, 2023; originally announced December 2023.

  19. arXiv:2312.03729

    cs.CL cs.AI

    Cognitive Dissonance: Why Do Language Model Outputs Disagree with Internal Representations of Truthfulness?

    Authors: Kevin Liu, Stephen Casper, Dylan Hadfield-Menell, Jacob Andreas

    Abstract: Neural language models (LMs) can be used to evaluate the truth of factual statements in two ways: they can be either queried for statement probabilities, or probed for internal representations of truthfulness. Past work has found that these two procedures sometimes disagree, and that probes tend to be more accurate than LM outputs. This has led some researchers to conclude that LMs "lie" or otherw…

    Submitted 27 November, 2023; originally announced December 2023.

    Comments: Accepted to EMNLP, 2024

  20. arXiv:2311.09796

    cs.CL cs.AI

    Interpreting User Requests in the Context of Natural Language Standing Instructions

    Authors: Nikita Moghe, Patrick Xia, Jacob Andreas, Jason Eisner, Benjamin Van Durme, Harsh Jhamtani

    Abstract: Users of natural language interfaces, generally powered by Large Language Models (LLMs), often must repeat their preferences each time they make a similar request. We describe an approach to LLM-based dialogue modeling in which persistent user constraints and preferences -- collectively termed standing instructions -- are provided as additional context for such interfaces. For example, when a user states "I'm h…

    Submitted 7 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Updated with results from LLaMA-2

  21. arXiv:2311.09712

    cs.CL

    Regularized Conventions: Equilibrium Computation as a Model of Pragmatic Reasoning

    Authors: Athul Paul Jacob, Gabriele Farina, Jacob Andreas

    Abstract: We present a model of pragmatic language understanding, where utterances are produced and understood by searching for regularized equilibria of signaling games. In this model (which we call ReCo, for Regularized Conventions), speakers and listeners search for contextually appropriate utterance--meaning mappings that are both close to game-theoretically optimal conventions and close to a shared, ''…

    Submitted 16 November, 2023; originally announced November 2023.

  22. arXiv:2311.08718

    cs.CL

    Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling

    Authors: Bairu Hou, Yujian Liu, Kaizhi Qian, Jacob Andreas, Shiyu Chang, Yang Zhang

    Abstract: Uncertainty decomposition refers to the task of decomposing the total uncertainty of a predictive model into aleatoric (data) uncertainty, resulting from inherent randomness in the data-generating process, and epistemic (model) uncertainty, resulting from missing information in the model's training data. In large language models (LLMs) specifically, identifying sources of uncertainty is an importa…

    Submitted 10 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: ICML 2024, 19 pages, 4 figures

  23. arXiv:2310.19791

    cs.CL cs.AI cs.LG cs.PL

    LILO: Learning Interpretable Libraries by Compressing and Documenting Code

    Authors: Gabriel Grand, Lionel Wong, Maddy Bowers, Theo X. Olausson, Muxin Liu, Joshua B. Tenenbaum, Jacob Andreas

    Abstract: While large language models (LLMs) now excel at code generation, a key aspect of software development is the art of refactoring: consolidating code into libraries of reusable and readable programs. In this paper, we introduce LILO, a neurosymbolic framework that iteratively synthesizes, compresses, and documents code to build libraries tailored to particular problem domains. LILO combines LLM-guid…

    Submitted 15 March, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 camera-ready

  24. arXiv:2310.19089

    cs.CL

    Pushdown Layers: Encoding Recursive Structure in Transformer Language Models

    Authors: Shikhar Murty, Pratyusha Sharma, Jacob Andreas, Christopher D. Manning

    Abstract: Recursion is a prominent feature of human language, and fundamentally challenging for self-attention due to the lack of an explicit recursive-state tracking mechanism. Consequently, Transformer language models poorly capture long-tail recursive structure and exhibit sample-inefficient syntactic generalization. This work introduces Pushdown Layers, a new self-attention layer that models recursive s…

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023 (Long Papers)

  25. arXiv:2310.13257

    cs.CL cs.AI

    Visual Grounding Helps Learn Word Meanings in Low-Data Regimes

    Authors: Chengxu Zhuang, Evelina Fedorenko, Jacob Andreas

    Abstract: Modern neural language models (LMs) are powerful tools for modeling human sentence production and comprehension, and their internal representations are remarkably well-aligned with representations of language in the human brain. But to achieve these results, LMs must be trained in distinctly un-human-like ways - requiring orders of magnitude more language data than children receive during developm…

    Submitted 25 March, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted by NAACL 2024

  26. arXiv:2310.11589

    cs.CL cs.AI cs.LG

    Eliciting Human Preferences with Language Models

    Authors: Belinda Z. Li, Alex Tamkin, Noah Goodman, Jacob Andreas

    Abstract: Language models (LMs) can be directed to perform target tasks by using labeled examples or natural language prompts. But selecting examples or writing prompts can be challenging--especially in tasks that involve unusual edge cases, demand precise articulation of nebulous preferences, or require an accurate mental model of LM behavior. We propose to use *LMs themselves* to guide the task specif…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: 26 pages, 15 figures

  27. arXiv:2310.09139

    cs.GT cs.AI cs.CL cs.LG

    The Consensus Game: Language Model Generation via Equilibrium Search

    Authors: Athul Paul Jacob, Yikang Shen, Gabriele Farina, Jacob Andreas

    Abstract: When applied to question answering and other text generation tasks, language models (LMs) may be queried generatively (by sampling answers from their output distribution) or discriminatively (by using them to score or rank a set of candidate outputs). These procedures sometimes yield very different predictions. How do we reconcile mutually incompatible scoring procedures to obtain coherent LM pred…

    Submitted 13 October, 2023; originally announced October 2023.

  28. arXiv:2309.03886

    cs.CL cs.AI cs.LG

    FIND: A Function Description Benchmark for Evaluating Interpretability Methods

    Authors: Sarah Schwettmann, Tamar Rott Shaham, Joanna Materzynska, Neil Chowdhury, Shuang Li, Jacob Andreas, David Bau, Antonio Torralba

    Abstract: Labeling neural network submodules with human-legible descriptions is useful for many downstream tasks: such descriptions can surface failures, guide interventions, and perhaps even explain important model behaviors. To date, most mechanistic descriptions of trained networks have involved small models, narrowly delimited phenomena, and large amounts of human labor. Labeling all human-interpretable…

    Submitted 8 December, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: 28 pages, 10 figures

    Journal ref: NeurIPS 2023

  29. arXiv:2308.09124

    cs.CL

    Linearity of Relation Decoding in Transformer Language Models

    Authors: Evan Hernandez, Arnab Sen Sharma, Tal Haklay, Kevin Meng, Martin Wattenberg, Jacob Andreas, Yonatan Belinkov, David Bau

    Abstract: Much of the knowledge encoded in transformer language models (LMs) may be expressed in terms of relations: relations between words and their synonyms, entities and their attributes, etc. We show that, for a subset of relations, this computation is well-approximated by a single linear transformation on the subject representation. Linear relation representations may be obtained by constructing a fir…

    Submitted 15 February, 2024; v1 submitted 17 August, 2023; originally announced August 2023.

  30. arXiv:2307.02477

    cs.CL cs.AI

    Reasoning or Reciting? Exploring the Capabilities and Limitations of Language Models Through Counterfactual Tasks

    Authors: Zhaofeng Wu, Linlu Qiu, Alexis Ross, Ekin Akyürek, Boyuan Chen, Bailin Wang, Najoung Kim, Jacob Andreas, Yoon Kim

    Abstract: The impressive performance of recent language models across a wide range of tasks suggests that they possess a degree of abstract reasoning skills. Are these skills general and transferable, or specialized to specific tasks seen during pretraining? To disentangle these effects, we propose an evaluation framework based on "counterfactual" task variants that deviate from the default assumptions unde…

    Submitted 28 March, 2024; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: NAACL 2024

  31. arXiv:2306.17844

    cs.LG

    The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks

    Authors: Ziqian Zhong, Ziming Liu, Max Tegmark, Jacob Andreas

    Abstract: Do neural networks, trained on well-understood algorithmic tasks, reliably rediscover known algorithms for solving those tasks? Several recent studies, on tasks ranging from group arithmetic to in-context linear regression, have suggested that the answer is yes. Using modular addition as a prototypical problem, we show that algorithm discovery in neural networks is sometimes more complex. Small ch…

    Submitted 21 November, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: Accepted by NeurIPS 2023

  32. arXiv:2306.12672

    cs.CL cs.AI cs.SC

    From Word Models to World Models: Translating from Natural Language to the Probabilistic Language of Thought

    Authors: Lionel Wong, Gabriel Grand, Alexander K. Lew, Noah D. Goodman, Vikash K. Mansinghka, Jacob Andreas, Joshua B. Tenenbaum

    Abstract: How does language inform our downstream thinking? In particular, how do humans make meaning from language--and how can we leverage a theory of linguistic meaning to build machines that think in more human-like ways? In this paper, we propose rational meaning construction, a computational framework for language-informed thinking that combines neural language models with probabilistic models for rat…

    Submitted 23 June, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

  33. arXiv:2305.20076

    cs.CL cs.AI

    Decision-Oriented Dialogue for Human-AI Collaboration

    Authors: Jessy Lin, Nicholas Tomlin, Jacob Andreas, Jason Eisner

    Abstract: We describe a class of tasks called decision-oriented dialogues, in which AI assistants such as large language models (LMs) must collaborate with one or more humans via natural language to help them make complex decisions. We formalize three domains in which users face everyday decisions: (1) choosing an assignment of reviewers to conference papers, (2) planning a multi-step itinerary in a city, a…

    Submitted 5 May, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: TACL 2024, pre-MIT Press publication version

  34. arXiv:2305.18741

    cs.CL

    Grokking of Hierarchical Structure in Vanilla Transformers

    Authors: Shikhar Murty, Pratyusha Sharma, Jacob Andreas, Christopher D. Manning

    Abstract: For humans, language production and comprehension are sensitive to the hierarchical structure of sentences. In natural language processing, past work has questioned how effectively neural sequence models like transformers capture this hierarchical structure when generalizing to structurally novel inputs. We show that transformer language models can learn to generalize hierarchically after training…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  35. arXiv:2305.08677

    cs.CL

    Natural Language Decomposition and Interpretation of Complex Utterances

    Authors: Harsh Jhamtani, Hao Fang, Patrick Xia, Eran Levy, Jacob Andreas, Ben Van Durme

    Abstract: Designing natural language interfaces has historically required collecting supervised data to translate user requests into carefully designed intent representations. This requires enumerating and labeling a long tail of user requests, which is challenging. At the same time, large language models (LLMs) encode knowledge about goals and plans that can help conversational assistants interpret user re…

    Submitted 8 January, 2024; v1 submitted 15 May, 2023; originally announced May 2023.

  36. arXiv:2304.00740

    cs.CL

    Inspecting and Editing Knowledge Representations in Language Models

    Authors: Evan Hernandez, Belinda Z. Li, Jacob Andreas

    Abstract: Neural language models (LMs) represent facts about the world described by text. Sometimes these facts derive from training data (in most LMs, a representation of the word "banana" encodes the fact that bananas are fruits). Sometimes facts derive from input text itself (a representation of the sentence "I poured out the bottle" encodes the fact that the bottle became empty). We describe REMEDI, a m…

    Submitted 22 May, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

  37. arXiv:2303.16779

    cs.CL cs.LG

    Language Models Trained on Media Diets Can Predict Public Opinion

    Authors: Eric Chu, Jacob Andreas, Stephen Ansolabehere, Deb Roy

    Abstract: Public opinion reflects and shapes societal behavior, but the traditional survey-based tools to measure it are limited. We introduce a novel approach to probe media diet models -- language models adapted to online news, TV broadcast, or radio show content -- that can emulate the opinions of subpopulations that have consumed a set of media. To validate this method, we use as ground truth the opinio…

    Submitted 28 March, 2023; originally announced March 2023.

  38. arXiv:2302.06692

    cs.LG cs.AI cs.CL

    Guiding Pretraining in Reinforcement Learning with Large Language Models

    Authors: Yuqing Du, Olivia Watkins, Zihan Wang, Cédric Colas, Trevor Darrell, Pieter Abbeel, Abhishek Gupta, Jacob Andreas

    Abstract: Reinforcement learning algorithms typically struggle in the absence of a dense, well-shaped reward function. Intrinsically motivated exploration methods address this limitation by rewarding agents for visiting novel states or transitions, but these methods offer limited benefits in large environments where most discovered novelty is irrelevant for downstream tasks. We describe a method that uses b…

    Submitted 14 September, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: ICML 2023

  39. arXiv:2302.02801

    cs.LG cs.CL

    LaMPP: Language Models as Probabilistic Priors for Perception and Action

    Authors: Belinda Z. Li, William Chen, Pratyusha Sharma, Jacob Andreas

    Abstract: Language models trained on large text corpora encode rich distributional information about real-world environments and action sequences. This information plays a crucial role in current approaches to language processing tasks like question answering and instruction generation. We describe how to leverage language models for *non-linguistic* perception and control tasks. Our approach casts labeling…

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: 12 pages, 4 tables, 4 figures

  40. arXiv:2212.10012

    cs.CL

    Language Modeling with Latent Situations

    Authors: Belinda Z. Li, Maxwell Nye, Jacob Andreas

    Abstract: Language models (LMs) often generate incoherent outputs: they refer to events and entity states that are incompatible with the state of the world described in their inputs. We introduce SituationSupervision, a family of approaches for improving coherence in LMs by training them to construct and condition on explicit representations of entities and their states. SituationSupervision has two compone…

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: 13 pages, 3 figures, 7 tables

  41. arXiv:2212.09257

    cs.CL

    PromptBoosting: Black-Box Text Classification with Ten Forward Passes

    Authors: Bairu Hou, Joe O'Connor, Jacob Andreas, Shiyu Chang, Yang Zhang

    Abstract: We describe PromptBoosting, a query-efficient procedure for building a text classifier from a neural language model (LM) without access to the LM's parameters, gradients, or hidden representations. This form of "black-box" classifier training has become increasingly important as the cost of training and inference in large-scale LMs grows. But existing black-box LM classifier learning approaches ar…

    Submitted 2 July, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: ICML 2023

  42. arXiv:2212.01681

    cs.CL cs.MA

    Language Models as Agent Models

    Authors: Jacob Andreas

    Abstract: Language models (LMs) are trained on collections of documents, written by individual human agents to achieve specific goals in an outside world. During training, LMs have access only to text of these documents, with no direct evidence of the internal states of the agents that produced them -- a fact often used to argue that LMs are incapable of modeling goal-directed aspects of human language prod…

    Submitted 3 December, 2022; originally announced December 2022.

  43. arXiv:2211.15661

    cs.LG cs.CL

    What learning algorithm is in-context learning? Investigations with linear models

    Authors: Ekin Akyürek, Dale Schuurmans, Jacob Andreas, Tengyu Ma, Denny Zhou

    Abstract: Neural sequence models, especially transformers, exhibit a remarkable capacity for in-context learning. They can construct new predictors from sequences of labeled examples $(x, f(x))$ presented in the input without further parameter updates. We investigate the hypothesis that transformer-based in-context learners implement standard learning algorithms implicitly, by encoding smaller models in the…

    Submitted 17 May, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: ICLR2023 Camera Ready

  44. arXiv:2211.07906  [pdf, other]

    cs.CL

    Hierarchical Phrase-based Sequence-to-Sequence Learning

    Authors: Bailin Wang, Ivan Titov, Jacob Andreas, Yoon Kim

    Abstract: We describe a neural transducer that maintains the flexibility of standard sequence-to-sequence (seq2seq) models while incorporating hierarchical phrases as a source of inductive bias during training and as explicit constraints during inference. Our approach trains two models: a discriminative parser based on a bracketing transduction grammar whose derivation tree hierarchically aligns source and…

    Submitted 15 November, 2022; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: EMNLP 2022

  45. arXiv:2211.01288  [pdf, other]

    cs.CL

    Characterizing Intrinsic Compositionality in Transformers with Tree Projections

    Authors: Shikhar Murty, Pratyusha Sharma, Jacob Andreas, Christopher D. Manning

    Abstract: When trained on language data, do transformers learn some arbitrary computation that utilizes the full capacity of the architecture or do they learn a simpler, tree-like computation, hypothesized to underlie compositional meaning systems like human languages? There is an apparent tension between compositional accounts of human language understanding, which are based on a restricted bottom-up compu…

    Submitted 3 November, 2022; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: Fixed title and metadata

  46. arXiv:2210.11468  [pdf, other]

    cs.SE cs.HC cs.LG

    ObSynth: An Interactive Synthesis System for Generating Object Models from Natural Language Specifications

    Authors: Alex Gu, Tamara Mitrovska, Daniela Velez, Jacob Andreas, Armando Solar-Lezama

    Abstract: We introduce ObSynth, an interactive system leveraging the domain knowledge embedded in large language models (LLMs) to help users design object models from high level natural language prompts. This is an example of specification reification, the process of taking a high-level, potentially vague specification and reifying it into a more concrete form. We evaluate ObSynth via a user study, leading…

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: 25 pages, 15 figures

  47. arXiv:2209.07800  [pdf, other]

    cs.CL

    The Whole Truth and Nothing But the Truth: Faithful and Controllable Dialogue Response Generation with Dataflow Transduction and Constrained Decoding

    Authors: Hao Fang, Anusha Balakrishnan, Harsh Jhamtani, John Bufe, Jean Crawford, Jayant Krishnamurthy, Adam Pauls, Jason Eisner, Jacob Andreas, Dan Klein

    Abstract: In a real-world dialogue system, generated text must be truthful and informative while remaining fluent and adhering to a prescribed style. Satisfying these constraints simultaneously is difficult for the two predominant paradigms in language generation: neural language modeling and rule-based generation. We describe a hybrid architecture for dialogue response generation that combines the strength…

    Submitted 26 May, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: Findings of ACL 2023

  48. arXiv:2205.11482  [pdf, other]

    cs.CL cs.IR

    Towards Tracing Factual Knowledge in Language Models Back to the Training Data

    Authors: Ekin Akyürek, Tolga Bolukbasi, Frederick Liu, Binbin Xiong, Ian Tenney, Jacob Andreas, Kelvin Guu

    Abstract: Language models (LMs) have been shown to memorize a great deal of factual knowledge contained in their training data. But when an LM generates an assertion, it is often difficult to determine where it learned this information and whether it is true. In this paper, we propose the problem of fact tracing: identifying which training examples taught an LM to generate a particular factual assertion. Pr…

    Submitted 25 October, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: Findings of EMNLP, 2022

  49. arXiv:2205.05666  [pdf, other]

    cs.CL cs.AI

    Identifying concept libraries from language about object structure

    Authors: Catherine Wong, William P. McCarthy, Gabriel Grand, Yoni Friedman, Joshua B. Tenenbaum, Jacob Andreas, Robert D. Hawkins, Judith E. Fan

    Abstract: Our understanding of the visual world goes beyond naming objects, encompassing our ability to parse objects into meaningful parts, attributes, and relations. In this work, we leverage natural language descriptions for a diverse set of 2K procedurally generated objects to identify the parts people use and the principles leading these parts to be favored over others. We formalize our problem as sear…

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: Appears in the conference proceedings of CogSci 2022

  50. arXiv:2204.05186  [pdf, other]

    cs.RO cs.AI cs.CL cs.CV cs.LG

    Correcting Robot Plans with Natural Language Feedback

    Authors: Pratyusha Sharma, Balakumar Sundaralingam, Valts Blukis, Chris Paxton, Tucker Hermans, Antonio Torralba, Jacob Andreas, Dieter Fox

    Abstract: When humans design cost or goal specifications for robots, they often produce specifications that are ambiguous, underspecified, or beyond planners' ability to solve. In these cases, corrections provide a valuable tool for human-in-the-loop robot control. Corrections might take the form of new goal specifications, new constraints (e.g. to avoid specific objects), or hints for planning algorithms (…

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: 10 pages, 13 figures