Skip to main content

Showing 1–18 of 18 results for author: Giulianelli, M

.
  1. arXiv:2406.18403  [pdf, other

    cs.CL

    LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks

    Authors: Anna Bavaresco, Raffaella Bernardi, Leonardo Bertolazzi, Desmond Elliott, Raquel Fernández, Albert Gatt, Esam Ghaleb, Mario Giulianelli, Michael Hanna, Alexander Koller, André F. T. Martins, Philipp Mondorf, Vera Neplenbroek, Sandro Pezzelle, Barbara Plank, David Schlangen, Alessandro Suglia, Aditya K Surikuchi, Ece Takmaz, Alberto Testoni

    Abstract: There is an increasing trend towards evaluating NLP models with LLM-generated judgments instead of human judgments. In the absence of a comparison against human data, this raises concerns about the validity of these evaluations; in case they are conducted with proprietary models, this also raises concerns over reproducibility. We provide JUDGE-BENCH, a collection of 20 NLP datasets with human anno… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2402.02896  [pdf, other

    cs.CL cs.AI cs.CY cs.MA

    LLM Agents in Interaction: Measuring Personality Consistency and Linguistic Alignment in Interacting Populations of Large Language Models

    Authors: Ivar Frisch, Mario Giulianelli

    Abstract: While both agent interaction and personalisation are vibrant topics in research on large language models (LLMs), there has been limited focus on the effect of language interaction on the behaviour of persona-conditioned LLM agents. Such an endeavour is important to ensure that agents remain consistent to their assigned traits yet are able to engage in open, naturalistic dialogues. In our experimen… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: To appear in Proceedings of the 1st Personalization of Generative AI Workshop, EACL 2024

  3. arXiv:2311.13061  [pdf, other

    cs.CL

    Attribution and Alignment: Effects of Local Context Repetition on Utterance Production and Comprehension in Dialogue

    Authors: Aron Molnar, Jaap Jumelet, Mario Giulianelli, Arabella Sinclair

    Abstract: Language models are often used as the backbone of modern dialogue systems. These models are pre-trained on large amounts of written fluent language. Repetition is typically penalised when evaluating language model generations. However, it is a key component of dialogue. Humans use local and partner specific repetitions; these are preferred by human users and lead to more successful communication i… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: CoNLL 2023

  4. arXiv:2310.13676  [pdf, other

    cs.CL

    Information Value: Measuring Utterance Predictability as Distance from Plausible Alternatives

    Authors: Mario Giulianelli, Sarenne Wallbridge, Raquel Fernández

    Abstract: We present information value, a measure which quantifies the predictability of an utterance relative to a set of plausible alternatives. We introduce a method to obtain interpretable estimates of information value using neural text generators, and exploit their psychometric predictive power to investigate the dimensions of predictability that drive human comprehension behaviour. Information value… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 (Main, Long paper)

  5. arXiv:2305.19933  [pdf, other

    cs.CL cs.AI cs.CV

    Speaking the Language of Your Listener: Audience-Aware Adaptation via Plug-and-Play Theory of Mind

    Authors: Ece Takmaz, Nicolo' Brandizzi, Mario Giulianelli, Sandro Pezzelle, Raquel Fernández

    Abstract: Dialogue participants may have varying levels of knowledge about the topic under discussion. In such cases, it is essential for speakers to adapt their utterances by taking their audience into account. Yet, it is an open question how such adaptation can be modelled in computational agents. In this paper, we model a visually grounded referential game between a knowledgeable speaker and a listener w… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: To appear in Findings of ACL 2023

  6. arXiv:2305.11993  [pdf, other

    cs.CL

    Interpretable Word Sense Representations via Definition Generation: The Case of Semantic Change Analysis

    Authors: Mario Giulianelli, Iris Luden, Raquel Fernandez, Andrey Kutuzov

    Abstract: We propose using automatically generated natural language definitions of contextualised word usages as interpretable word and word sense representations. Given a collection of usage examples for a target word, and the corresponding data-driven usage clusters (i.e., word senses), a definition is generated for each usage with a specialised Flan-T5 language model, and the most prototypical definition… ▽ More

    Submitted 25 July, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  7. arXiv:2305.11707  [pdf, other

    cs.CL cs.AI cs.LG

    What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability

    Authors: Mario Giulianelli, Joris Baan, Wilker Aziz, Raquel Fernández, Barbara Plank

    Abstract: In Natural Language Generation (NLG) tasks, for any input, multiple communicative goals are plausible, and any goal can be put into words, or produced, in multiple ways. We characterise the extent to which human production varies lexically, syntactically, and semantically across four NLG tasks, connecting human production variability to aleatoric or data uncertainty. We then inspect the space of o… ▽ More

    Submitted 20 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Camera ready version for EMNLP 2023

  8. arXiv:2210.12828  [pdf, other

    cs.CL cs.AI

    Towards Pragmatic Production Strategies for Natural Language Generation Tasks

    Authors: Mario Giulianelli

    Abstract: This position paper proposes a conceptual framework for the design of Natural Language Generation (NLG) systems that follow efficient and effective production strategies in order to achieve complex communicative goals. In this general framework, efficiency is characterised as the parsimonious regulation of production and comprehension costs while effectiveness is measured with respect to task-orie… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022)

  9. arXiv:2210.08321  [pdf, other

    cs.CL

    Construction Repetition Reduces Information Rate in Dialogue

    Authors: Mario Giulianelli, Arabella Sinclair, Raquel Fernández

    Abstract: Speakers repeat constructions frequently in dialogue. Due to their peculiar information-theoretic properties, repetitions can be thought of as a strategy for cost-effective communication. In this study, we focus on the repetition of lexicalised constructions -- i.e., recurring multi-word units -- in English open-domain spoken dialogues. We hypothesise that speakers use construction repetition to m… ▽ More

    Submitted 15 October, 2022; originally announced October 2022.

    Comments: In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (AACL-IJCNLP 2022)

  10. State-of-the-art generalisation research in NLP: A taxonomy and review

    Authors: Dieuwke Hupkes, Mario Giulianelli, Verna Dankers, Mikel Artetxe, Yanai Elazar, Tiago Pimentel, Christos Christodoulopoulos, Karim Lasri, Naomi Saphra, Arabella Sinclair, Dennis Ulmer, Florian Schottmann, Khuyagbaatar Batsuren, Kaiser Sun, Koustuv Sinha, Leila Khalatbari, Maria Ryskina, Rita Frieske, Ryan Cotterell, Zhi**g **

    Abstract: The ability to generalise well is one of the primary desiderata of natural language processing (NLP). Yet, what 'good generalisation' entails and how it should be evaluated is not well understood, nor are there any evaluation standards for generalisation. In this paper, we lay the groundwork to address both of these issues. We present a taxonomy for characterising and understanding generalisation… ▽ More

    Submitted 12 January, 2024; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: This preprint was published as an Analysis article in Nature Machine Intelligence. Please refer to the published version when citing this work. 28 pages of content + 6 pages of appendix + 52 pages of references

    Journal ref: Nat Mach Intell 5, 1161-1174 (2023)

  11. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  12. arXiv:2204.05717  [pdf, other

    cs.CL

    Do Not Fire the Linguist: Grammatical Profiles Help Language Models Detect Semantic Change

    Authors: Mario Giulianelli, Andrey Kutuzov, Lidia Pivovarova

    Abstract: Morphological and syntactic changes in word usage (as captured, e.g., by grammatical profiles) have been shown to be good predictors of a word's meaning change. In this work, we explore whether large pre-trained contextualised language models, a common tool for lexical semantic change detection, are sensitive to such morphosyntactic changes. To this end, we first compare the performance of grammat… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: 3rd International Workshop on Computational Approaches to Historical Language Change 2022 (LChange'22)

  13. arXiv:2109.10397  [pdf, other

    cs.CL

    Grammatical Profiling for Semantic Change Detection

    Authors: Mario Giulianelli, Andrey Kutuzov, Lidia Pivovarova

    Abstract: Semantics, morphology and syntax are strongly interdependent. However, the majority of computational methods for semantic change detection use distributional word representations which encode mostly semantics. We investigate an alternative method, grammatical profiling, based entirely on changes in the morphosyntactic behaviour of words. We demonstrate that it can be used for semantic change detec… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: CoNLL 2021

  14. arXiv:2011.04554  [pdf, other

    cs.CL cs.CV

    Refer, Reuse, Reduce: Generating Subsequent References in Visual and Conversational Contexts

    Authors: Ece Takmaz, Mario Giulianelli, Sandro Pezzelle, Arabella Sinclair, Raquel Fernández

    Abstract: Dialogue participants often refer to entities or situations repeatedly within a conversation, which contributes to its cohesiveness. Subsequent references exploit the common ground accumulated by the interlocutors and hence have several interesting properties, namely, they tend to be shorter and reuse expressions that were effective in previous mentions. In this paper, we tackle the generation of… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020)

  15. arXiv:2005.00050  [pdf, other

    cs.CL

    UiO-UvA at SemEval-2020 Task 1: Contextualised Embeddings for Lexical Semantic Change Detection

    Authors: Andrey Kutuzov, Mario Giulianelli

    Abstract: We apply contextualised word embeddings to lexical semantic change detection in the SemEval-2020 Shared Task 1. This paper focuses on Subtask 2, ranking words by the degree of their semantic drift over time. We analyse the performance of two contextualising architectures (BERT and ELMo) and three change detection algorithms. We find that the most effective algorithms rely on the cosine similarity… ▽ More

    Submitted 18 July, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

    Comments: To appear in Proceedings of the 14th International Workshop on Semantic Evaluation (SemEval-2020)

  16. Analysing Lexical Semantic Change with Contextualised Word Representations

    Authors: Mario Giulianelli, Marco Del Tredici, Raquel Fernández

    Abstract: This paper presents the first unsupervised approach to lexical semantic change that makes use of contextualised word representations. We propose a novel method that exploits the BERT neural language model to obtain representations of word usages, clusters these representations into usage types, and measures change along time with three proposed metrics. We create a new evaluation dataset and show… ▽ More

    Submitted 29 April, 2020; originally announced April 2020.

    Comments: To appear in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL-2020)

  17. arXiv:1808.08079  [pdf, other

    cs.CL cs.AI

    Under the Hood: Using Diagnostic Classifiers to Investigate and Improve how Language Models Track Agreement Information

    Authors: Mario Giulianelli, Jacqueline Harding, Florian Mohnert, Dieuwke Hupkes, Willem Zuidema

    Abstract: How do neural language models keep track of number agreement between subject and verb? We show that `diagnostic classifiers', trained to predict number from the internal states of a language model, provide a detailed understanding of how, when, and where this information is represented. Moreover, they give us insight into when and where number information is corrupted in cases where the language m… ▽ More

    Submitted 18 November, 2021; v1 submitted 24 August, 2018; originally announced August 2018.

    Comments: Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP

  18. arXiv:1708.03910  [pdf, other

    cs.CL cs.AI cs.NE

    Semi-supervised emotion lexicon expansion with label propagation and specialized word embeddings

    Authors: Mario Giulianelli

    Abstract: There exist two main approaches to automatically extract affective orientation: lexicon-based and corpus-based. In this work, we argue that these two methods are compatible and show that combining them can improve the accuracy of emotion classifiers. In particular, we introduce a novel variant of the Label Propagation algorithm that is tailored to distributed word representations, we apply batch g… ▽ More

    Submitted 13 August, 2017; originally announced August 2017.

    Journal ref: Computational Linguistics in the Netherlands Journal, 8, 99-121 (2018). Retrieved from https://clinjournal.org/clinj/article/view/82