Skip to main content

Showing 1–4 of 4 results for author: Staats, C

.
  1. arXiv:2403.18120  [pdf, other

    cs.AI cs.CL cs.LG

    Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with Autoformalization

    Authors: ** Peng Zhou, Charles Staats, Wenda Li, Christian Szegedy, Kilian Q. Weinberger, Yuhuai Wu

    Abstract: Large language models (LLM), such as Google's Minerva and OpenAI's GPT families, are becoming increasingly capable of solving mathematical quantitative reasoning problems. However, they still make unjustified logical and computational errors in their reasoning steps and answers. In this paper, we leverage the fact that if the training corpus of LLMs contained sufficiently many examples of formal m… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: ICLR 2024

  2. arXiv:2205.12615  [pdf, ps, other

    cs.LG cs.AI cs.LO cs.SE

    Autoformalization with Large Language Models

    Authors: Yuhuai Wu, Albert Q. Jiang, Wenda Li, Markus N. Rabe, Charles Staats, Mateja Jamnik, Christian Szegedy

    Abstract: Autoformalization is the process of automatically translating from natural language mathematics to formal specifications and proofs. A successful autoformalization system could advance the fields of formal verification, program synthesis, and artificial intelligence. While the long-term goal of autoformalization seemed elusive for a long time, we show large language models provide new prospects to… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: 44 pages

  3. arXiv:2204.11985  [pdf, other

    cs.LG cs.CR

    When adversarial examples are excusable

    Authors: Pieter-Jan Kindermans, Charles Staats

    Abstract: Neural networks work remarkably well in practice and theoretically they can be universal approximators. However, they still make mistakes and a specific type of them called adversarial errors seem inexcusable to humans. In this work, we analyze both test errors and adversarial errors on a well controlled but highly non-linear visual classification problem. We find that, when approximating training… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

  4. arXiv:2112.05682  [pdf, ps, other

    cs.LG

    Self-attention Does Not Need $O(n^2)$ Memory

    Authors: Markus N. Rabe, Charles Staats

    Abstract: We present a very simple algorithm for attention that requires $O(1)$ memory with respect to sequence length and an extension to self-attention that requires $O(\log n)$ memory. This is in contrast with the frequently stated belief that self-attention requires $O(n^2)$ memory. While the time complexity is still $O(n^2)$, device memory rather than compute capability is often the limiting factor on… ▽ More

    Submitted 10 October, 2022; v1 submitted 10 December, 2021; originally announced December 2021.