Skip to main content

Showing 1–12 of 12 results for author: La Malfa, E

.
  1. arXiv:2406.11911  [pdf, other

    cs.AI cs.CL cs.LG

    A Notion of Complexity for Theory of Mind via Discrete World Models

    Authors: X. Angelo Huang, Emanuele La Malfa, Samuele Marro, Andrea Asperti, Anthony Cohn, Michael Wooldridge

    Abstract: Theory of Mind (ToM) can be used to assess the capabilities of Large Language Models (LLMs) in complex scenarios where social reasoning is required. While the research community has proposed many ToM benchmarks, their hardness varies greatly, and their complexity is not well defined. This work proposes a framework to measure the complexity of ToM tasks. We quantify a problem's complexity as the nu… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: https://flecart.github.com/complexity-tom-dwm

  2. arXiv:2404.11172  [pdf, other

    cs.LG cs.AI

    Deep Neural Networks via Complex Network Theory: a Perspective

    Authors: Emanuele La Malfa, Gabriele La Malfa, Giuseppe Nicosia, Vito Latora

    Abstract: Deep Neural Networks (DNNs) can be represented as graphs whose links and vertices iteratively process data and solve tasks sub-optimally. Complex Network Theory (CNT), merging statistical physics with graph theory, provides a method for interpreting neural networks by analysing their weights and neuron structures. However, classic works adapt CNT metrics that only permit a topological analysis as… ▽ More

    Submitted 18 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: IJCAI'24 (full paper, main track)

  3. arXiv:2402.02805  [pdf, other

    cs.AI cs.CL cs.LG

    Graph-enhanced Large Language Models in Asynchronous Plan Reasoning

    Authors: Fangru Lin, Emanuele La Malfa, Valentin Hofmann, Elle Michelle Yang, Anthony Cohn, Janet B. Pierrehumbert

    Abstract: Planning is a fundamental property of human intelligence. Reasoning about asynchronous plans is challenging since it requires sequential and parallel planning to optimize time costs. Can large language models (LLMs) succeed at this task? Here, we present the first large-scale study investigating this question. We find that a representative set of closed and open-source LLMs, including GPT-4 and LL… ▽ More

    Submitted 3 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted at ICML-2024

  4. arXiv:2401.09074  [pdf, other

    cs.LG cs.AI cs.CL cs.PL

    Code Simulation Challenges for Large Language Models

    Authors: Emanuele La Malfa, Christoph Weinhuber, Orazio Torre, Fangru Lin, Samuele Marro, Anthony Cohn, Nigel Shadbolt, Michael Wooldridge

    Abstract: Many reasoning, planning, and problem-solving tasks share an intrinsic algorithmic nature: correctly simulating each step is a sufficient condition to solve them correctly. This work studies to what extent Large Language Models (LLMs) can simulate coding and algorithmic tasks to provide insights into general capabilities in such algorithmic reasoning tasks. We introduce benchmarks for straight-lin… ▽ More

    Submitted 12 June, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: Code: https://github.com/EmanueleLM/CodeSimulation

  5. arXiv:2309.16573  [pdf, other

    cs.AI cs.CL cs.CY

    Language Models as a Service: Overview of a New Paradigm and its Challenges

    Authors: Emanuele La Malfa, Aleksandar Petrov, Simon Frieder, Christoph Weinhuber, Ryan Burnell, Raza Nazar, Anthony G. Cohn, Nigel Shadbolt, Michael Wooldridge

    Abstract: Some of the most powerful language models currently are proprietary systems, accessible only via (typically restrictive) web or software programming interfaces. This is the Language-Models-as-a-Service (LMaaS) paradigm. In contrast with scenarios where full model access is available, as in the case of open-source models, such closed-off language models present specific challenges for evaluating, b… ▽ More

    Submitted 30 November, 2023; v1 submitted 28 September, 2023; originally announced September 2023.

  6. arXiv:2305.15425  [pdf

    cs.CL cs.LG

    Language Model Tokenizers Introduce Unfairness Between Languages

    Authors: Aleksandar Petrov, Emanuele La Malfa, Philip H. S. Torr, Adel Bibi

    Abstract: Recent language models have shown impressive multilingual performance, even when not explicitly trained for it. Despite this, there are concerns about the quality of their outputs across different languages. In this paper, we show how disparity in the treatment of different languages arises at the tokenization stage, well before a model is even invoked. The same text translated into different lang… ▽ More

    Submitted 20 October, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: Published at NeurIPS 2023, Project webpage: https://aleksandarpetrov.github.io/tokenization-fairness, Code: https://github.com/AleksandarPetrov/tokenization-fairness

  7. arXiv:2210.17406  [pdf, other

    cs.LG cs.CL

    Emergent Linguistic Structures in Neural Networks are Fragile

    Authors: Emanuele La Malfa, Matthew Wicker, Marta Kwiatkowska

    Abstract: Large Language Models (LLMs) have been reported to have strong performance on natural language processing tasks. However, performance metrics such as accuracy do not measure the quality of the model in terms of its ability to robustly represent complex linguistic structures. In this paper, focusing on the ability of language models to represent syntax, we propose a framework to assess the consiste… ▽ More

    Submitted 31 May, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

  8. arXiv:2209.05488  [pdf, other

    cs.LG cs.AI

    Deep Neural Networks as Complex Networks

    Authors: Emanuele La Malfa, Gabriele La Malfa, Claudio Caprioli, Giuseppe Nicosia, Vito Latora

    Abstract: Deep Neural Networks are, from a physical perspective, graphs whose `links` and `vertices` iteratively process data and solve tasks sub-optimally. We use Complex Network Theory (CNT) to represents Deep Neural Networks (DNNs) as directed weighted graphs: within this framework, we introduce metrics to study DNNs as dynamical systems, with a granularity that spans from weights to layers, including ne… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

  9. arXiv:2112.07605  [pdf, ps, other

    cs.CL cs.AI

    The King is Naked: on the Notion of Robustness for Natural Language Processing

    Authors: Emanuele La Malfa, Marta Kwiatkowska

    Abstract: There is growing evidence that the classical notion of adversarial robustness originally introduced for images has been adopted as a de facto standard by a large part of the NLP research community. We show that this notion is problematic in the context of NLP as it considers a narrow spectrum of linguistic phenomena. In this paper, we argue for semantic robustness, which is better aligned with the… ▽ More

    Submitted 11 January, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: AAAI 2022 main-track (full-paper)

  10. arXiv:2110.02628  [pdf, ps, other

    cs.LG cs.SI

    Characterizing Learning Dynamics of Deep Neural Networks via Complex Networks

    Authors: Emanuele La Malfa, Gabriele La Malfa, Giuseppe Nicosia, Vito Latora

    Abstract: In this paper, we interpret Deep Neural Networks with Complex Network Theory. Complex Network Theory (CNT) represents Deep Neural Networks (DNNs) as directed weighted graphs to study them as dynamical systems. We efficiently adapt CNT measures to examine the evolution of the learning process of DNNs with different initializations and architectures: we introduce metrics for nodes/neurons and layers… ▽ More

    Submitted 18 October, 2021; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: IEEE/ICTAI2021 (full paper)

    Journal ref: IEEE-ICTAI2021

  11. arXiv:2105.03640  [pdf, other

    cs.AI cs.CL

    On Guaranteed Optimal Robust Explanations for NLP Models

    Authors: Emanuele La Malfa, Agnieszka Zbrzezny, Rhiannon Michelmore, Nicola Paoletti, Marta Kwiatkowska

    Abstract: We build on abduction-based explanations for ma-chine learning and develop a method for computing local explanations for neural network models in natural language processing (NLP). Our explanations comprise a subset of the words of the in-put text that satisfies two key features: optimality w.r.t. a user-defined cost function, such as the length of explanation, and robustness, in that they ensure… ▽ More

    Submitted 14 May, 2021; v1 submitted 8 May, 2021; originally announced May 2021.

    Comments: 13 pages (8+5 Appendix). Accepted as long-paper at IJCAI 2021

    Report number: Pages 2658-2665

    Journal ref: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence 2021

  12. Assessing Robustness of Text Classification through Maximal Safe Radius Computation

    Authors: Emanuele La Malfa, Min Wu, Luca Laurenti, Benjie Wang, Anthony Hartshorn, Marta Kwiatkowska

    Abstract: Neural network NLP models are vulnerable to small modifications of the input that maintain the original meaning but result in a different prediction. In this paper, we focus on robustness of text classification against word substitutions, aiming to provide guarantees that the model prediction does not change if a word is replaced with a plausible alternative, such as a synonym. As a measure of rob… ▽ More

    Submitted 7 October, 2020; v1 submitted 1 October, 2020; originally announced October 2020.

    Comments: 12 pages + appendix

    Journal ref: EMNLP-Findings2020