Showing 1–20 of 20 results for author: Chersoni, E

  1. arXiv:2403.14859  [pdf, other]

    cs.CL cs.AI

    Comparing Plausibility Estimates in Base and Instruction-Tuned Large Language Models

    Authors: Carina Kauf, Emmanuele Chersoni, Alessandro Lenci, Evelina Fedorenko, Anna A. Ivanova

    Abstract: Instruction-tuned LLMs can respond to explicit queries formulated as prompts, which greatly facilitates interaction with human users. However, prompt-based approaches might not always be able to tap into the wealth of implicit knowledge acquired by LLMs during pre-training. This paper presents a comprehensive study of ways to evaluate semantic plausibility in LLMs. We compare base and instruction-…

    Submitted 21 March, 2024; originally announced March 2024.

  2. arXiv:2306.05276  [pdf, other]

    cs.CL cs.AI

    Extensive Evaluation of Transformer-based Architectures for Adverse Drug Events Extraction

    Authors: Simone Scaboro, Beatrice Portelli, Emmanuele Chersoni, Enrico Santus, Giuseppe Serra

    Abstract: Adverse Event (ADE) extraction is one of the core tasks in digital pharmacovigilance, especially when applied to informal texts. This task has been addressed by the Natural Language Processing community using large pre-trained language models, such as BERT. Despite the great number of Transformer-based architectures used in the literature, it is unclear which of them has better performances and wh…

    Submitted 8 June, 2023; originally announced June 2023.

  3. arXiv:2305.18915  [pdf, other]

    cs.CL

    Empirical Sufficiency Lower Bounds for Language Modeling with Locally-Bootstrapped Semantic Structures

    Authors: Jakob Prange, Emmanuele Chersoni

    Abstract: In this work we build upon negative results from an attempt at language modeling with predicted semantic structure, in order to establish empirical lower bounds on what could have made the attempt successful. More specifically, we design a concise binary vector representation of semantic structure at the lexical level and evaluate in-depth how good an incremental tagger needs to be in order to ach…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: To appear at *SEM 2023, Toronto

  4. arXiv:2212.01488  [pdf]

    cs.CL cs.AI

    Event knowledge in large language models: the gap between the impossible and the unlikely

    Authors: Carina Kauf, Anna A. Ivanova, Giulia Rambelli, Emmanuele Chersoni, Jingyuan Selena She, Zawad Chowdhury, Evelina Fedorenko, Alessandro Lenci

    Abstract: Word co-occurrence patterns in language corpora contain a surprising amount of conceptual knowledge. Large language models (LLMs), trained to predict words in context, leverage these patterns to achieve impressive performance on diverse semantic tasks requiring world knowledge. An important but understudied question about LLMs' semantic abilities is whether they acquire generalized knowledge of co…

    Submitted 26 October, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: The two lead authors have contributed equally to this work

  5. arXiv:2210.11947  [pdf, other]

    cs.CL cs.AI cs.IR

    Generalizing over Long Tail Concepts for Medical Term Normalization

    Authors: Beatrice Portelli, Simone Scaboro, Enrico Santus, Hooman Sedghamiz, Emmanuele Chersoni, Giuseppe Serra

    Abstract: Medical term normalization consists in mapping a piece of text to a large number of output classes. Given the small size of the annotated datasets and the extremely long tail distribution of the concepts, it is of utmost importance to develop models that are capable to generalize to scarce or unseen concepts. An important attribute of most target ontologies is their hierarchical structure. In this…

    Submitted 3 November, 2022; v1 submitted 21 October, 2022; originally announced October 2022.

  6. arXiv:2209.03452  [pdf, other]

    cs.CL cs.LG

    AILAB-Udine@SMM4H 22: Limits of Transformers and BERT Ensembles

    Authors: Beatrice Portelli, Simone Scaboro, Emmanuele Chersoni, Enrico Santus, Giuseppe Serra

    Abstract: This paper describes the models developed by the AILAB-Udine team for the SMM4H 22 Shared Task. We explored the limits of Transformer based models on text classification, entity extraction and entity normalization, tackling Tasks 1, 2, 5, 6 and 10. The main take-aways we got from participating in different tasks are: the overwhelming positive effects of combining different architectures when using…

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: Shared Task, SMM4H, Transformers

  7. arXiv:2209.02812  [pdf]

    cs.CL

    Increasing Adverse Drug Events extraction robustness on social media: case study on negation and speculation

    Authors: Simone Scaboro, Beatrice Portelli, Emmanuele Chersoni, Enrico Santus, Giuseppe Serra

    Abstract: In the last decade, an increasing number of users have started reporting Adverse Drug Events (ADE) on social media platforms, blogs, and health forums. Given the large volume of reports, pharmacovigilance has focused on ways to use Natural Language Processing (NLP) techniques to rapidly examine these large collections of text, detecting mentions of drug-related adverse reactions to trigger medical…

    Submitted 6 September, 2022; originally announced September 2022.

    Comments: Journal Paper, EBM

  8. arXiv:2204.09781  [pdf]

    cs.DL cs.CL cs.IR cs.LG

    Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations

    Authors: Qingyu Chen, Alexis Allot, Robert Leaman, Rezarta Islamaj Doğan, Jingcheng Du, Li Fang, Kai Wang, Shuo Xu, Yuefu Zhang, Parsa Bagherzadeh, Sabine Bergler, Aakash Bhatnagar, Nidhir Bhavsar, Yung-Chun Chang, Sheng-Jie Lin, Wentai Tang, Hongtong Zhang, Ilija Tavchioski, Senja Pollak, Shubo Tian, Jinfeng Zhang, Yulia Otmakhova, Antonio Jimeno Yepes, Hang Dong, Honghan Wu, et al. (14 additional authors not shown)

    Abstract: The COVID-19 pandemic has been severely impacting global society since December 2019. Massive research has been undertaken to understand the characteristics of the virus and design vaccines and drugs. The related findings have been reported in biomedical literature at a rate of about 10,000 articles on COVID-19 per month. Such rapid growth significantly challenges manual curation and interpretatio…

    Submitted 3 June, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

  9. arXiv:2109.10080  [pdf, other]

    cs.CL cs.LG

    NADE: A Benchmark for Robust Adverse Drug Events Extraction in Face of Negations

    Authors: Simone Scaboro, Beatrice Portelli, Emmanuele Chersoni, Enrico Santus, Giuseppe Serra

    Abstract: Adverse Drug Event (ADE) extraction models can rapidly examine large collections of social media texts, detecting mentions of drug-related adverse reactions and trigger medical investigations. However, despite the recent advances in NLP, it is currently unknown if such models are robust in face of negation, which is pervasive across language varieties. In this paper we evaluate three state-of-th…

    Submitted 24 September, 2021; v1 submitted 21 September, 2021; originally announced September 2021.

    Comments: W-NUT Workshop, EMNLP 2021

  10. arXiv:2109.05815  [pdf, other]

    cs.CL

    Exploring a Unified Sequence-To-Sequence Transformer for Medical Product Safety Monitoring in Social Media

    Authors: Shivam Raval, Hooman Sedghamiz, Enrico Santus, Tuka Alhanai, Mohammad Ghassemi, Emmanuele Chersoni

    Abstract: Adverse Events (AE) are harmful events resulting from the use of medical products. Although social media may be crucial for early AE detection, the sheer scale of this data makes it logistically intractable to analyze using human agents, with NLP representing the only low-cost and scalable alternative. In this paper, we frame AE Detection and Extraction as a sequence-to-sequence problem using the…

    Submitted 13 September, 2021; originally announced September 2021.

    Comments: Short paper, EMNLP 2021, Findings

  11. arXiv:2107.10922  [pdf, other]

    cs.CL

    Did the Cat Drink the Coffee? Challenging Transformers with Generalized Event Knowledge

    Authors: Paolo Pedinotti, Giulia Rambelli, Emmanuele Chersoni, Enrico Santus, Alessandro Lenci, Philippe Blache

    Abstract: Prior research has explored the ability of computational models to predict a word semantic fit with a given predicate. While much work has been devoted to modeling the typicality relation between verbs and arguments in isolation, in this paper we take a broader perspective by assessing whether and to what extent computational approaches have access to the information about the typicality of entire…

    Submitted 22 July, 2021; originally announced July 2021.

  12. arXiv:2105.08882  [pdf, ps, other]

    cs.CL cs.LG

    Improving Adverse Drug Event Extraction with SpanBERT on Different Text Typologies

    Authors: Beatrice Portelli, Daniele Passabì, Edoardo Lenzi, Giuseppe Serra, Enrico Santus, Emmanuele Chersoni

    Abstract: In recent years, Internet users are reporting Adverse Drug Events (ADE) on social media, blogs and health forums. Because of the large volume of reports, pharmacovigilance is seeking to resort to NLP to monitor these outlets. We propose for the first time the use of the SpanBERT architecture for the task of ADE extraction: this new version of the popular BERT transformer showed improved capabiliti…

    Submitted 18 May, 2021; originally announced May 2021.

    Comments: 11 pages, AAAI, conference

  13. arXiv:1906.07280  [pdf, other]

    cs.CL

    A Structured Distributional Model of Sentence Meaning and Processing

    Authors: Emmanuele Chersoni, Enrico Santus, Ludovica Pannitto, Alessandro Lenci, Philippe Blache, Chu-Ren Huang

    Abstract: Most compositional distributional semantic models represent sentence meaning with a single vector. In this paper, we propose a Structured Distributional Model (SDM) that combines word embeddings with formal semantics and is based on the assumption that sentences represent events and situations. The semantic representation of a sentence is a formal structure derived from Discourse Representation Th…

    Submitted 17 June, 2019; originally announced June 2019.

    Comments: accepted at JLNE; Journal of Natural Language Engineering; 26 pages, thematic fit, selectional preference, natural language processing, nlp, ai

  14. arXiv:1805.01923  [pdf, other]

    cs.CL

    A Rank-Based Similarity Metric for Word Embeddings

    Authors: Enrico Santus, Hongmin Wang, Emmanuele Chersoni, Yue Zhang

    Abstract: Word Embeddings have recently imposed themselves as a standard for representing word meaning in NLP. Semantic similarity between word pairs has become the most common evaluation benchmark for these representations, with vector cosine being typically used as the only similarity metric. In this paper, we report experiments with a rank-based metric for WE, which performs comparably to vector cosine i…

    Submitted 4 May, 2018; originally announced May 2018.

    Comments: 5 pages, 1 figure, 4 tables, ACL, ACL2018

  15. arXiv:1804.11251  [pdf, other]

    cs.CL

    BomJi at SemEval-2018 Task 10: Combining Vector-, Pattern- and Graph-based Information to Identify Discriminative Attributes

    Authors: Enrico Santus, Chris Biemann, Emmanuele Chersoni

    Abstract: This paper describes BomJi, a supervised system for capturing discriminative attributes in word pairs (e.g. yellow as discriminative for banana over watermelon). The system relies on an XGB classifier trained on carefully engineered graph-, pattern- and word embedding based features. It participated in the SemEval-2018 Task 10 on Capturing Discriminative Attributes, achieving an F1 score of 0.73…

    Submitted 30 April, 2018; originally announced April 2018.

    Comments: 3 tables, 4 pages, SemEval, NAACL, NLP, Task

  16. arXiv:1710.00998  [pdf, other]

    cs.CL

    Is Structure Necessary for Modeling Argument Expectations in Distributional Semantics?

    Authors: Emmanuele Chersoni, Enrico Santus, Philippe Blache, Alessandro Lenci

    Abstract: Despite the number of NLP studies dedicated to thematic fit estimation, little attention has been paid to the related task of composing and updating verb argument expectations. The few exceptions have mostly modeled this phenomenon with structured distributional models, implicitly assuming a similarly structured representation of events. Recent experimental evidence, however, suggests that human p…

    Submitted 3 October, 2017; originally announced October 2017.

    Comments: conference paper, IWCS

  17. arXiv:1707.05967  [pdf, other]

    cs.CL

    Measuring Thematic Fit with Distributional Feature Overlap

    Authors: Enrico Santus, Emmanuele Chersoni, Alessandro Lenci, Philippe Blache

    Abstract: In this paper, we introduce a new distributional method for modeling predicate-argument thematic fit judgments. We use a syntax-based DSM to build a prototypical representation of verb-specific roles: for every verb, we extract the most salient second order contexts for each of its roles (i.e. the most salient dimensions of typical role fillers), and then we compute thematic fit as a weighted over…

    Submitted 26 July, 2017; v1 submitted 19 July, 2017; originally announced July 2017.

    Comments: 9 pages, 2 figures, 5 tables, EMNLP, 2017, thematic fit, selectional preference, semantic role, DSMs, Distributional Semantic Models, Vector Space Models, VSMs, cosine, APSyn, similarity, prototype

  18. arXiv:1611.01101  [pdf, ps, other]

    cs.CL

    CogALex-V Shared Task: ROOT18

    Authors: Emmanuele Chersoni, Giulia Rambelli, Enrico Santus

    Abstract: In this paper, we describe ROOT 18, a classifier using the scores of several unsupervised distributional measures as features to discriminate between semantically related and unrelated words, and then to classify the related pairs according to their semantic relation (i.e. synonymy, antonymy, hypernymy, part-whole meronymy). Our classifier participated in the CogALex-V Shared Task, showing a solid…

    Submitted 3 November, 2016; originally announced November 2016.

  19. arXiv:1608.07738  [pdf, other]

    cs.CL

    Testing APSyn against Vector Cosine on Similarity Estimation

    Authors: Enrico Santus, Emmanuele Chersoni, Alessandro Lenci, Chu-Ren Huang, Philippe Blache

    Abstract: In Distributional Semantic Models (DSMs), Vector Cosine is widely used to estimate similarity between word vectors, although this measure was noticed to suffer from several shortcomings. The recent literature has proposed other methods which attempt to mitigate such biases. In this paper, we intend to investigate APSyn, a measure that computes the extent of the intersection between the most associ…

    Submitted 5 October, 2016; v1 submitted 27 August, 2016; originally announced August 2016.

    Comments: 8 pages, 1 figure, 4 tables, PACLIC, cosine, vectors, DSMs

  20. arXiv:1607.02061  [pdf, ps, other]

    cs.CL cs.AI

    Representing Verbs with Rich Contexts: an Evaluation on Verb Similarity

    Authors: Emmanuele Chersoni, Enrico Santus, Alessandro Lenci, Philippe Blache, Chu-Ren Huang

    Abstract: Several studies on sentence processing suggest that the mental lexicon keeps track of the mutual expectations between words. Current DSMs, however, represent context words as separate features, thereby losing important information for word expectations, such as word interrelations. In this paper, we present a DSM that addresses this issue by defining verb contexts as joint syntactic dependencies.…

    Submitted 5 October, 2016; v1 submitted 7 July, 2016; originally announced July 2016.

    Comments: 5 pages