Skip to main content

Showing 1–12 of 12 results for author: Poesia, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00900  [pdf, other

    cs.AI cs.CL

    MathCAMPS: Fine-grained Synthesis of Mathematical Problems From Human Curricula

    Authors: Shubhra Mishra, Gabriel Poesia, Belinda Mo, Noah D. Goodman

    Abstract: Mathematical problem solving is an important skill for Large Language Models (LLMs), both as an important capability and a proxy for a range of reasoning abilities. Existing benchmarks probe a diverse set of skills, but they yield aggregate accuracy metrics, obscuring specific abilities or weaknesses. Furthermore, they are difficult to extend with new problems, risking data contamination over time… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: Dataset and code: https://github.com/gpoesia/mathcamps/

  2. arXiv:2407.00695  [pdf, other

    cs.AI cs.LO

    Learning Formal Mathematics From Intrinsic Motivation

    Authors: Gabriel Poesia, David Broman, Nick Haber, Noah D. Goodman

    Abstract: How did humanity coax mathematics from the aether? We explore the Platonic view that mathematics can be discovered from its axioms - a game of conjecture and proof. We describe Minimo (Mathematics from Intrinsic Motivation): an agent that jointly learns to pose challenging problems for itself (conjecturing) and solve them (theorem proving). Given a mathematical domain axiomatized in dependent type… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  3. arXiv:2406.07897  [pdf, other

    cs.LG cs.AI

    When Do Skills Help Reinforcement Learning? A Theoretical Analysis of Temporal Abstractions

    Authors: Zhening Li, Gabriel Poesia, Armando Solar-Lezama

    Abstract: Skills are temporal abstractions that are intended to improve reinforcement learning (RL) performance through hierarchical RL. Despite our intuition about the properties of an environment that make skills useful, a precise characterization has been absent. We provide the first such characterization, focusing on the utility of deterministic skills in deterministic sparse-reward environments with fi… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 29 pages, 1 figure. Accepted to ICML 2024

  4. arXiv:2309.05660  [pdf, other

    cs.LG cs.AI cs.CL

    Hypothesis Search: Inductive Reasoning with Language Models

    Authors: Ruocheng Wang, Eric Zelikman, Gabriel Poesia, Yewen Pu, Nick Haber, Noah D. Goodman

    Abstract: Inductive reasoning is a core problem-solving capacity: humans can identify underlying principles from a few examples, which robustly generalize to novel scenarios. Recent work evaluates large language models (LLMs) on inductive reasoning tasks by directly prompting them yielding "in context learning." This works well for straightforward inductive tasks but performs poorly on complex tasks such as… ▽ More

    Submitted 30 May, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: ICLR 2024. The first two authors contributed equally. Code: https://github.com/Relento/hypothesis_search

  5. arXiv:2306.04031  [pdf, other

    cs.AI

    Certified Deductive Reasoning with Language Models

    Authors: Gabriel Poesia, Kanishk Gandhi, Eric Zelikman, Noah D. Goodman

    Abstract: Language models often achieve higher accuracy when reasoning step-by-step in complex tasks. However, even when arriving at a correct final answer, their rationales are often logically unsound or inconsistent. This is a major issue when reliable reasoning traces are needed, such when fine-tuning on model-generated reasoning for self-improvement. To tackle these issues, we introduce a class of tools… ▽ More

    Submitted 7 November, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

  6. arXiv:2304.09102  [pdf, other

    cs.CL cs.AI

    Solving Math Word Problems by Combining Language Models With Symbolic Solvers

    Authors: Joy He-Yueya, Gabriel Poesia, Rose E. Wang, Noah D. Goodman

    Abstract: Automatically generating high-quality step-by-step solutions to math word problems has many applications in education. Recently, combining large language models (LLMs) with external tools to perform complex reasoning and calculation has emerged as a promising direction for solving math word problems, but prior approaches such as Program-Aided Language model (PAL) are biased towards simple procedur… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

  7. arXiv:2212.10561  [pdf, other

    cs.CL cs.AI cs.LG

    Parsel: Algorithmic Reasoning with Language Models by Composing Decompositions

    Authors: Eric Zelikman, Qian Huang, Gabriel Poesia, Noah D. Goodman, Nick Haber

    Abstract: Despite recent success in large language model (LLM) reasoning, LLMs struggle with hierarchical multi-step reasoning tasks like generating complex programs. For these tasks, humans often start with a high-level algorithmic design and implement each part gradually. We introduce Parsel, a framework enabling automatic implementation and validation of complex algorithms with code LLMs. With Parsel, we… ▽ More

    Submitted 28 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: humaneval results, clarity

  8. Peano: Learning Formal Mathematical Reasoning

    Authors: Gabriel Poesia, Noah D. Goodman

    Abstract: General mathematical reasoning is computationally undecidable, but humans routinely solve new problems. Moreover, discoveries developed over centuries are taught to subsequent generations quickly. What structure enables this, and how might that inform automated mathematical reasoning? We posit that central to both puzzles is the structure of procedural abstractions underlying mathematics. We explo… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  9. arXiv:2211.08671  [pdf, other

    cs.AI

    LEMMA: Bootstrap** High-Level Mathematical Reasoning with Learned Symbolic Abstractions

    Authors: Zhening Li, Gabriel Poesia, Omar Costilla-Reyes, Noah Goodman, Armando Solar-Lezama

    Abstract: Humans tame the complexity of mathematical reasoning by develo** hierarchies of abstractions. With proper abstractions, solutions to hard problems can be expressed concisely, thus making them more likely to be found. In this paper, we propose Learning Mathematical Abstractions (LEMMA): an algorithm that implements this idea for reinforcement learning agents in mathematical domains. LEMMA augment… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: 10 pages, 2 figures; to appear in 2nd MATH-AI Workshop at NeurIPS'22

  10. arXiv:2201.11227  [pdf, other

    cs.LG cs.PL

    Synchromesh: Reliable code generation from pre-trained language models

    Authors: Gabriel Poesia, Oleksandr Polozov, Vu Le, Ashish Tiwari, Gustavo Soares, Christopher Meek, Sumit Gulwani

    Abstract: Large pre-trained language models have been used to generate code,providing a flexible interface for synthesizing programs from natural language specifications. However, they often violate syntactic and semantic rules of their output language, limiting their practical usability. In this paper, we propose Synchromesh: a framework for substantially improving the reliability of pre-trained models for… ▽ More

    Submitted 26 January, 2022; originally announced January 2022.

    Comments: 10 pages, 9 additional pages of Appendix

  11. arXiv:2110.09779  [pdf, other

    cs.CL

    Open-domain clarification question generation without question examples

    Authors: Julia White, Gabriel Poesia, Robert Hawkins, Dorsa Sadigh, Noah Goodman

    Abstract: An overarching goal of natural language processing is to enable machines to communicate seamlessly with humans. However, natural language can be ambiguous or unclear. In cases of uncertainty, humans engage in an interactive process known as repair: asking questions and seeking clarification until their uncertainty is resolved. We propose a framework for building a visually grounded question-asking… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: EMNLP 2021

  12. arXiv:2106.09146  [pdf, other

    cs.AI cs.LG

    Contrastive Reinforcement Learning of Symbolic Reasoning Domains

    Authors: Gabriel Poesia, WenXin Dong, Noah Goodman

    Abstract: Abstract symbolic reasoning, as required in domains such as mathematics and logic, is a key component of human intelligence. Solvers for these domains have important applications, especially to computer-assisted education. But learning to solve symbolic problems is challenging for machine learning algorithms. Existing models either learn from human solutions or use hand-engineered features, making… ▽ More

    Submitted 8 November, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021