Showing 1–19 of 19 results for author: Li, B Z

  1. arXiv:2406.11830

    cs.CL cs.AI

    Language Modeling with Editable External Knowledge

    Authors: Belinda Z. Li, Emmy Liu, Alexis Ross, Abbas Zeitoun, Graham Neubig, Jacob Andreas

    Abstract: When the world changes, so does the text that humans write about it. How do we build language models that can be easily updated to reflect these changes? One popular approach is retrieval-augmented generation, in which new documents are inserted into a knowledge base and retrieved during prediction for downstream tasks. Most prior work on these systems has focused on improving behavior during pre…

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2403.05534

    cs.CL

    Bayesian Preference Elicitation with Language Models

    Authors: Kunal Handa, Yarin Gal, Ellie Pavlick, Noah Goodman, Jacob Andreas, Alex Tamkin, Belinda Z. Li

    Abstract: Aligning AI systems to users' interests requires understanding and incorporating humans' complex values and preferences. Recently, language models (LMs) have been used to gather information about the preferences of human users. This preference data can be used to fine-tune or guide other LMs and/or AI systems. However, LMs have been shown to struggle with crucial aspects of preference learning: qu…

    Submitted 8 March, 2024; originally announced March 2024.

  3. arXiv:2402.18759

    cs.RO cs.AI cs.LG

    Learning with Language-Guided State Abstractions

    Authors: Andi Peng, Ilia Sucholutsky, Belinda Z. Li, Theodore R. Sumers, Thomas L. Griffiths, Jacob Andreas, Julie A. Shah

    Abstract: We describe a framework for using natural language to design state abstractions for imitation learning. Generalizable policy learning in high-dimensional observation spaces is facilitated by well-designed state representations, which can surface important features of an environment and hide irrelevant ones. These state representations are typically manually specified, or derived from other labor-i…

    Submitted 6 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: ICLR 2024

  4. arXiv:2402.03081

    cs.RO cs.AI cs.LG

    Preference-Conditioned Language-Guided Abstraction

    Authors: Andi Peng, Andreea Bobu, Belinda Z. Li, Theodore R. Sumers, Ilia Sucholutsky, Nishanth Kumar, Thomas L. Griffiths, Julie A. Shah

    Abstract: Learning from demonstrations is a common way for users to teach robots, but it is prone to spurious feature correlations. Recent work constructs state abstractions, i.e. visual representations containing task-relevant features, from language as a way to perform more generalizable learning. However, these abstractions also depend on a user's preference for what matters in a task, which may be hard…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: HRI 2024

  5. arXiv:2311.09827

    cs.CL

    Cognitive Overload: Jailbreaking Large Language Models with Overloaded Logical Thinking

    Authors: Nan Xu, Fei Wang, Ben Zhou, Bang Zheng Li, Chaowei Xiao, Muhao Chen

    Abstract: While large language models (LLMs) have demonstrated increasing power, they have also given rise to a wide range of harmful behaviors. As representatives, jailbreak attacks can provoke harmful or unethical responses from LLMs, even after safety alignment. In this paper, we investigate a novel category of jailbreak attacks specifically designed to target the cognitive structure and processes of LLM…

    Submitted 29 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  6. arXiv:2310.11589

    cs.CL cs.AI cs.LG

    Eliciting Human Preferences with Language Models

    Authors: Belinda Z. Li, Alex Tamkin, Noah Goodman, Jacob Andreas

    Abstract: Language models (LMs) can be directed to perform target tasks by using labeled examples or natural language prompts. But selecting examples or writing prompts can be challenging, especially in tasks that involve unusual edge cases, demand precise articulation of nebulous preferences, or require an accurate mental model of LM behavior. We propose to use *LMs themselves* to guide the task specif…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: 26 pages, 15 figures

  7. arXiv:2307.04008

    cs.CL

    Toward Interactive Dictation

    Authors: Belinda Z. Li, Jason Eisner, Adam Pauls, Sam Thomson

    Abstract: Voice dictation is an increasingly important text input modality. Existing systems that allow both dictation and editing-by-voice restrict their command language to flat templates invoked by trigger words. In this work, we study the feasibility of allowing users to interrupt their dictation with spoken editing commands in open-ended natural language. We introduce a new task and dataset, TERTiUS, t…

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: 17 pages, 5 tables, 4 figures; ACL

  8. arXiv:2304.00740

    cs.CL

    Inspecting and Editing Knowledge Representations in Language Models

    Authors: Evan Hernandez, Belinda Z. Li, Jacob Andreas

    Abstract: Neural language models (LMs) represent facts about the world described by text. Sometimes these facts derive from training data (in most LMs, a representation of the word "banana" encodes the fact that bananas are fruits). Sometimes facts derive from input text itself (a representation of the sentence "I poured out the bottle" encodes the fact that the bottle became empty). We describe REMEDI, a m…

    Submitted 22 May, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

  9. arXiv:2302.02801

    cs.LG cs.CL

    LaMPP: Language Models as Probabilistic Priors for Perception and Action

    Authors: Belinda Z. Li, William Chen, Pratyusha Sharma, Jacob Andreas

    Abstract: Language models trained on large text corpora encode rich distributional information about real-world environments and action sequences. This information plays a crucial role in current approaches to language processing tasks like question answering and instruction generation. We describe how to leverage language models for *non-linguistic* perception and control tasks. Our approach casts labeling…

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: 12 pages, 4 tables, 4 figures

  10. arXiv:2212.10012

    cs.CL

    Language Modeling with Latent Situations

    Authors: Belinda Z. Li, Maxwell Nye, Jacob Andreas

    Abstract: Language models (LMs) often generate incoherent outputs: they refer to events and entity states that are incompatible with the state of the world described in their inputs. We introduce SituationSupervision, a family of approaches for improving coherence in LMs by training them to construct and condition on explicit representations of entities and their states. SituationSupervision has two compone…

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: 13 pages, 3 figures, 7 tables

  11. arXiv:2112.03204

    cs.CL cs.LG

    Quantifying Adaptability in Pre-trained Language Models with 500 Tasks

    Authors: Belinda Z. Li, Jane Yu, Madian Khabsa, Luke Zettlemoyer, Alon Halevy, Jacob Andreas

    Abstract: When a neural language model (LM) is adapted to perform a new task, what aspects of the task predict the eventual performance of the model? In NLP, systematic features of LM generalization to individual examples are well characterized, but systematic aspects of LM adaptability to new tasks are not nearly as well understood. We present a large-scale empirical study of the features and limits of LM…

    Submitted 4 May, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: NAACL 2022; 20 pages, 6 figures, 8 tables

  12. arXiv:2106.00737

    cs.CL

    Implicit Representations of Meaning in Neural Language Models

    Authors: Belinda Z. Li, Maxwell Nye, Jacob Andreas

    Abstract: Does the effectiveness of neural language models derive entirely from accurate modeling of surface word co-occurrence statistics, or do these models represent and reason about the world they describe? In BART and T5 transformer language models, we identify contextual word representations that function as models of entities and situations as they evolve throughout a discourse. These neural represen…

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: 15 pages, 6 figures; accepted to ACL 2021

  13. arXiv:2104.08840

    cs.CL cs.LG

    On the Influence of Masking Policies in Intermediate Pre-training

    Authors: Qinyuan Ye, Belinda Z. Li, Sinong Wang, Benjamin Bolte, Hao Ma, Wen-tau Yih, Xiang Ren, Madian Khabsa

    Abstract: Current NLP models are predominantly trained through a two-stage "pre-train then fine-tune" pipeline. Prior work has shown that inserting an intermediate pre-training stage, using heuristic masking policies for masked language modeling (MLM), can significantly improve final performance. However, it is still unclear (1) in what cases such intermediate pre-training is helpful, (2) whether hand-craft…

    Submitted 30 September, 2021; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: Accepted to EMNLP 2021. Camera-ready version

  14. arXiv:2104.05243

    cs.AI cs.CL

    On Unifying Misinformation Detection

    Authors: Nayeon Lee, Belinda Z. Li, Sinong Wang, Pascale Fung, Hao Ma, Wen-tau Yih, Madian Khabsa

    Abstract: In this paper, we introduce UnifiedM2, a general-purpose misinformation model that jointly models multiple domains of misinformation with a single, unified setup. The model is trained to handle four tasks: detecting news bias, clickbait, fake news, and verifying rumors. By grouping these tasks together, UnifiedM2 learns a richer representation of misinformation, which leads to state-of-the-art or c…

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: Accepted to NAACL 2021

  15. arXiv:2012.15856

    cs.CL cs.AI

    Studying Strategically: Learning to Mask for Closed-book QA

    Authors: Qinyuan Ye, Belinda Z. Li, Sinong Wang, Benjamin Bolte, Hao Ma, Wen-tau Yih, Xiang Ren, Madian Khabsa

    Abstract: Closed-book question-answering (QA) is a challenging task that requires a model to directly answer questions without access to external knowledge. It has been shown that directly fine-tuning pre-trained language models with (question, answer) examples yields surprisingly competitive performance, which is further improved upon through adding an intermediate pre-training stage between general pre-tr…

    Submitted 1 January, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

  16. arXiv:2010.02413

    cs.CL cs.AI

    Efficient One-Pass End-to-End Entity Linking for Questions

    Authors: Belinda Z. Li, Sewon Min, Srinivasan Iyer, Yashar Mehdad, Wen-tau Yih

    Abstract: We present ELQ, a fast end-to-end entity linking model for questions, which uses a biencoder to jointly perform mention detection and linking in one pass. Evaluated on WebQSP and GraphQuestions with extended annotations that cover multiple entities per question, ELQ outperforms the previous state of the art by a large margin of +12.7% and +19.6% F1, respectively. With a very fast inference time (1…

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: 9 pages, EMNLP 2020

  17. arXiv:2006.04768

    cs.LG stat.ML

    Linformer: Self-Attention with Linear Complexity

    Authors: Sinong Wang, Belinda Z. Li, Madian Khabsa, Han Fang, Hao Ma

    Abstract: Large transformer models have shown extraordinary success in achieving state-of-the-art results in many natural language processing applications. However, training and deploying these models can be prohibitively costly for long sequences, as the standard self-attention mechanism of the Transformer uses $O(n^2)$ time and space with respect to sequence length. In this paper, we demonstrate that the…

    Submitted 14 June, 2020; v1 submitted 8 June, 2020; originally announced June 2020.

  18. arXiv:2006.04102

    cs.CL cs.AI

    Language Models as Fact Checkers?

    Authors: Nayeon Lee, Belinda Z. Li, Sinong Wang, Wen-tau Yih, Hao Ma, Madian Khabsa

    Abstract: Recent work has suggested that language models (LMs) store both common-sense and factual knowledge learned from pre-training data. In this paper, we leverage this implicit knowledge to create an effective end-to-end fact checker using solely a language model, without any external knowledge or explicit retrieval components. While previous work on extracting knowledge from LMs has focused on the…

    Submitted 24 July, 2020; v1 submitted 7 June, 2020; originally announced June 2020.

    Comments: Accepted in FEVER Workshop (ACL 2020)

  19. arXiv:2004.13671

    cs.CL

    Active Learning for Coreference Resolution using Discrete Annotation

    Authors: Belinda Z. Li, Gabriel Stanovsky, Luke Zettlemoyer

    Abstract: We improve upon pairwise annotation for active learning in coreference resolution, by asking annotators to identify mention antecedents if a presented mention pair is deemed not coreferent. This simple modification, when combined with a novel mention clustering algorithm for selecting which examples to label, is much more efficient in terms of the performance obtained per annotation budget. In exp…

    Submitted 18 May, 2020; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: 12 pages, 7 figures, ACL 2020