Showing 1–1 of 1 results for author: Torre, O

  1. arXiv:2401.09074  [pdf, other]

    cs.LG cs.AI cs.CL cs.PL

    Code Simulation Challenges for Large Language Models

    Authors: Emanuele La Malfa, Christoph Weinhuber, Orazio Torre, Fangru Lin, Samuele Marro, Anthony Cohn, Nigel Shadbolt, Michael Wooldridge

    Abstract: Many reasoning, planning, and problem-solving tasks share an intrinsic algorithmic nature: correctly simulating each step is a sufficient condition to solve them correctly. This work studies to what extent Large Language Models (LLMs) can simulate coding and algorithmic tasks to provide insights into general capabilities in such algorithmic reasoning tasks. We introduce benchmarks for straight-lin…

    Submitted 12 June, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: Code: https://github.com/EmanueleLM/CodeSimulation