Skip to main content

Showing 1–5 of 5 results for author: Marro, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.11911  [pdf, other

    cs.AI cs.CL cs.LG

    A Notion of Complexity for Theory of Mind via Discrete World Models

    Authors: X. Angelo Huang, Emanuele La Malfa, Samuele Marro, Andrea Asperti, Anthony Cohn, Michael Wooldridge

    Abstract: Theory of Mind (ToM) can be used to assess the capabilities of Large Language Models (LLMs) in complex scenarios where social reasoning is required. While the research community has proposed many ToM benchmarks, their hardness varies greatly, and their complexity is not well defined. This work proposes a framework to measure the complexity of ToM tasks. We quantify a problem's complexity as the nu… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: https://flecart.github.com/complexity-tom-dwm

  2. arXiv:2401.09074  [pdf, other

    cs.LG cs.AI cs.CL cs.PL

    Code Simulation Challenges for Large Language Models

    Authors: Emanuele La Malfa, Christoph Weinhuber, Orazio Torre, Fangru Lin, Samuele Marro, Anthony Cohn, Nigel Shadbolt, Michael Wooldridge

    Abstract: Many reasoning, planning, and problem-solving tasks share an intrinsic algorithmic nature: correctly simulating each step is a sufficient condition to solve them correctly. This work studies to what extent Large Language Models (LLMs) can simulate coding and algorithmic tasks to provide insights into general capabilities in such algorithmic reasoning tasks. We introduce benchmarks for straight-lin… ▽ More

    Submitted 12 June, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: Code: https://github.com/EmanueleLM/CodeSimulation

  3. arXiv:2306.14326  [pdf, other

    cs.LG cs.CR

    Computational Asymmetries in Robust Classification

    Authors: Samuele Marro, Michele Lombardi

    Abstract: In the context of adversarial robustness, we make three strongly related contributions. First, we prove that while attacking ReLU classifiers is $\mathit{NP}$-hard, ensuring their robustness at training time is $Σ^2_P$-hard (even on a single example). This asymmetry provides a rationale for the fact that robust classifications approaches are frequently fooled in the literature. Second, we show tha… ▽ More

    Submitted 25 June, 2023; originally announced June 2023.

    MSC Class: 68T07

    Journal ref: 40th International Conference on Machine Learning (ICML 2023)

  4. arXiv:2301.07485  [pdf, other

    cs.CV cs.AI cs.LG

    Image Embedding for Denoising Generative Models

    Authors: Andrea Asperti, Davide Evangelista, Samuele Marro, Fabio Merizzi

    Abstract: Denoising Diffusion models are gaining increasing popularity in the field of generative modeling for several reasons, including the simple and stable training, the excellent generative quality, and the solid probabilistic foundation. In this article, we address the problem of {\em embedding} an image into the latent space of Denoising Diffusion Models, that is finding a suitable ``noisy'' image wh… ▽ More

    Submitted 30 December, 2022; originally announced January 2023.

    MSC Class: 68T07 ACM Class: I.3.3

  5. arXiv:2202.00003  [pdf, other

    cs.CR

    Green NFTs: A Study on the Environmental Impact of Cryptoart Technologies

    Authors: Samuele Marro, Luca Donno

    Abstract: We introduce a model of greenhouse gas emissions due to on-chain activity on Ethereum, focusing on cryptoart. We also estimate the impact of individual transactions on the environment, both before and after the London hard fork. We find that with the current fee mechanism, spending one dollar on transaction fees corresponds to emitting at least the equivalent of 1.305 kilograms of CO2. We also des… ▽ More

    Submitted 29 August, 2022; v1 submitted 29 January, 2022; originally announced February 2022.

    Comments: This draft was written in May 2021 and might be subject to modifications. August 29th 2022: removed references to old emission figure

    MSC Class: 68-11 ACM Class: J.5; J.4