Skip to main content

Showing 1–14 of 14 results for author: Grand, G

.
  1. arXiv:2405.09605  [pdf, other

    cs.CL cs.AI cs.LG

    Elements of World Knowledge (EWOK): A cognition-inspired framework for evaluating basic world knowledge in language models

    Authors: Anna A. Ivanova, Aalok Sathe, Benjamin Lipkin, Unnathi Kumar, Setayesh Radkani, Thomas H. Clark, Carina Kauf, Jennifer Hu, R. T. Pramod, Gabriel Grand, Vivian Paulun, Maria Ryskina, Ekin Akyürek, Ethan Wilcox, Nafisa Rashid, Leshem Choshen, Roger Levy, Evelina Fedorenko, Joshua Tenenbaum, Jacob Andreas

    Abstract: The ability to build and leverage world models is essential for a general-purpose AI agent. Testing such capabilities is hard, in part because the building blocks of world models are ill-defined. We present Elements of World Knowledge (EWOK), a framework for evaluating world modeling in language models by testing their ability to use knowledge of a concept to match a target text with a plausible/i… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 21 pages (11 main), 7 figures. Authors Anna Ivanova, Aalok Sathe, Benjamin Lipkin contributed equally

  2. arXiv:2404.03683  [pdf, other

    cs.LG cs.AI cs.CL

    Stream of Search (SoS): Learning to Search in Language

    Authors: Kanishk Gandhi, Denise Lee, Gabriel Grand, Muxin Liu, Winson Cheng, Archit Sharma, Noah D. Goodman

    Abstract: Language models are rarely shown fruitful mistakes while training. They then struggle to look beyond the next token, suffering from a snowballing of errors and struggling to predict the consequence of their actions several steps ahead. In this paper, we show how language models can be taught to search by representing the process of search in language, as a flattened string -- a stream of search (S… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  3. arXiv:2402.19471  [pdf, other

    cs.CL cs.AI

    Loose LIPS Sink Ships: Asking Questions in Battleship with Language-Informed Program Sampling

    Authors: Gabriel Grand, Valerio Pepe, Jacob Andreas, Joshua B. Tenenbaum

    Abstract: Questions combine our mastery of language with our remarkable facility for reasoning about uncertainty. How do people navigate vast hypothesis spaces to pose informative questions given limited cognitive resources? We study these tradeoffs in a classic grounded question-asking task based on the board game Battleship. Our language-informed program sampling (LIPS) model uses large language models (L… ▽ More

    Submitted 1 May, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted to CogSci 2024

  4. arXiv:2310.19791  [pdf, other

    cs.CL cs.AI cs.LG cs.PL

    LILO: Learning Interpretable Libraries by Compressing and Documenting Code

    Authors: Gabriel Grand, Lionel Wong, Maddy Bowers, Theo X. Olausson, Muxin Liu, Joshua B. Tenenbaum, Jacob Andreas

    Abstract: While large language models (LLMs) now excel at code generation, a key aspect of software development is the art of refactoring: consolidating code into libraries of reusable and readable programs. In this paper, we introduce LILO, a neurosymbolic framework that iteratively synthesizes, compresses, and documents code to build libraries tailored to particular problem domains. LILO combines LLM-guid… ▽ More

    Submitted 15 March, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 camera-ready

  5. arXiv:2306.12672  [pdf, other

    cs.CL cs.AI cs.SC

    From Word Models to World Models: Translating from Natural Language to the Probabilistic Language of Thought

    Authors: Lionel Wong, Gabriel Grand, Alexander K. Lew, Noah D. Goodman, Vikash K. Mansinghka, Jacob Andreas, Joshua B. Tenenbaum

    Abstract: How does language inform our downstream thinking? In particular, how do humans make meaning from language--and how can we leverage a theory of linguistic meaning to build machines that think in more human-like ways? In this paper, we propose rational meaning construction, a computational framework for language-informed thinking that combines neural language models with probabilistic models for rat… ▽ More

    Submitted 23 June, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

  6. arXiv:2306.03081  [pdf, other

    cs.AI cs.CL cs.PL stat.CO

    Sequential Monte Carlo Steering of Large Language Models using Probabilistic Programs

    Authors: Alexander K. Lew, Tan Zhi-Xuan, Gabriel Grand, Vikash K. Mansinghka

    Abstract: Even after fine-tuning and reinforcement learning, large language models (LLMs) can be difficult, if not impossible, to control reliably with prompts alone. We propose a new inference-time approach to enforcing syntactic and semantic constraints on the outputs of LLMs, called sequential Monte Carlo (SMC) steering. The key idea is to specify language generation tasks as posterior inference problems… ▽ More

    Submitted 26 November, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: Minor typo fixes

  7. arXiv:2305.01020  [pdf, other

    cs.CL cs.AI

    Evaluating statistical language models as pragmatic reasoners

    Authors: Benjamin Lipkin, Lionel Wong, Gabriel Grand, Joshua B Tenenbaum

    Abstract: The relationship between communicated language and intended meaning is often probabilistic and sensitive to context. Numerous strategies attempt to estimate such a map**, often leveraging recursive Bayesian models of communication. In parallel, large language models (LLMs) have been increasingly applied to semantic parsing applications, tasked with inferring logical representations from natural… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: 8 pages, 4 figures, to appear in the Proceedings of the Annual Meeting of the Cognitive Science Society 2023

  8. arXiv:2211.16605  [pdf, other

    cs.PL cs.AI

    Top-Down Synthesis for Library Learning

    Authors: Matthew Bowers, Theo X. Olausson, Lionel Wong, Gabriel Grand, Joshua B. Tenenbaum, Kevin Ellis, Armando Solar-Lezama

    Abstract: This paper introduces corpus-guided top-down synthesis as a mechanism for synthesizing library functions that capture common functionality from a corpus of programs in a domain specific language (DSL). The algorithm builds abstractions directly from initial DSL primitives, using syntactic pattern matching of intermediate abstractions to intelligently prune the search space and guide the algorithm… ▽ More

    Submitted 15 January, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: Published at POPL 2023

    Journal ref: Proc. ACM Program. Lang. 7, POPL, Article 41 (January 2023), pp 1182-1213

  9. arXiv:2209.01712  [pdf, other

    cs.LG cs.AI q-bio.BM

    ChemBERTa-2: Towards Chemical Foundation Models

    Authors: Walid Ahmad, Elana Simon, Seyone Chithrananda, Gabriel Grand, Bharath Ramsundar

    Abstract: Large pretrained models such as GPT-3 have had tremendous impact on modern natural language processing by leveraging self-supervised learning to learn salient representations that can be used to readily finetune on a wide variety of downstream tasks. We investigate the possibility of transferring such advances to molecular machine learning by building a chemical foundation model, ChemBERTa-2, usin… ▽ More

    Submitted 4 September, 2022; originally announced September 2022.

    Comments: ELLIS Machine Learning for Molecule Discovery Workshop

    ACM Class: I.2.7; I.2.1; J.2; J.3

  10. arXiv:2205.05666  [pdf, other

    cs.CL cs.AI

    Identifying concept libraries from language about object structure

    Authors: Catherine Wong, William P. McCarthy, Gabriel Grand, Yoni Friedman, Joshua B. Tenenbaum, Jacob Andreas, Robert D. Hawkins, Judith E. Fan

    Abstract: Our understanding of the visual world goes beyond naming objects, encompassing our ability to parse objects into meaningful parts, attributes, and relations. In this work, we leverage natural language descriptions for a diverse set of 2K procedurally generated objects to identify the parts people use and the principles leading these parts to be favored over others. We formalize our problem as sear… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: Appears in the conference proceedings of CogSci 2022

  11. arXiv:2010.09885  [pdf, other

    cs.LG cs.CL physics.chem-ph q-bio.BM

    ChemBERTa: Large-Scale Self-Supervised Pretraining for Molecular Property Prediction

    Authors: Seyone Chithrananda, Gabriel Grand, Bharath Ramsundar

    Abstract: GNNs and chemical fingerprints are the predominant approaches to representing molecules for property prediction. However, in NLP, transformers have become the de-facto standard for representation learning thanks to their strong downstream task transfer. In parallel, the software ecosystem around transformers is maturing rapidly, with libraries like HuggingFace and BertViz enabling streamlined trai… ▽ More

    Submitted 23 October, 2020; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: Submitted to NeurIPS 2020 ML for Molecules Workshop

    ACM Class: I.2.7; I.2.1; J.2; J.3

  12. arXiv:1906.08430  [pdf, other

    cs.LG cs.CL cs.CV stat.ML

    Adversarial Regularization for Visual Question Answering: Strengths, Shortcomings, and Side Effects

    Authors: Gabriel Grand, Yonatan Belinkov

    Abstract: Visual question answering (VQA) models have been shown to over-rely on linguistic biases in VQA datasets, answering questions "blindly" without considering visual context. Adversarial regularization (AdvReg) aims to address this issue via an adversary sub-network that encourages the main model to learn a bias-free representation of the question. In this work, we investigate the strengths and short… ▽ More

    Submitted 19 June, 2019; originally announced June 2019.

    Comments: In Proceedings of the 2nd Workshop on Shortcomings in Vision and Language (SiVL) at NAACL-HLT 2019

  13. arXiv:1806.00857  [pdf, other

    cs.CV

    On the Flip Side: Identifying Counterexamples in Visual Question Answering

    Authors: Gabriel Grand, Aron Szanto, Yoon Kim, Alexander Rush

    Abstract: Visual question answering (VQA) models respond to open-ended natural language questions about images. While VQA is an increasingly popular area of research, it is unclear to what extent current VQA architectures learn key semantic distinctions between visually-similar images. To investigate this question, we explore a reformulation of the VQA task that challenges models to identify counterexamples… ▽ More

    Submitted 24 July, 2018; v1 submitted 3 June, 2018; originally announced June 2018.

    Comments: KDD 2018 conference version

  14. arXiv:1802.01241  [pdf

    cs.CL

    Semantic projection: recovering human knowledge of multiple, distinct object features from word embeddings

    Authors: Gabriel Grand, Idan Asher Blank, Francisco Pereira, Evelina Fedorenko

    Abstract: The words of a language reflect the structure of the human mind, allowing us to transmit thoughts between individuals. However, language can represent only a subset of our rich and detailed cognitive architecture. Here, we ask what kinds of common knowledge (semantic memory) are captured by word meanings (lexical semantics). We examine a prominent computational model that represents words as vecto… ▽ More

    Submitted 6 March, 2018; v1 submitted 4 February, 2018; originally announced February 2018.