Skip to main content

Showing 1–20 of 20 results for author: Sileo, D

.
  1. arXiv:2406.11035  [pdf, other

    cs.CL

    Scaling Synthetic Logical Reasoning Datasets with Context-Sensitive Declarative Grammars

    Authors: Damien Sileo

    Abstract: Logical reasoning remains a challenge for natural language processing, but it can be improved by training language models to mimic theorem provers on procedurally generated problems. Previous work used domain-specific proof generation algorithms, which biases reasoning toward specific proof traces and limits auditability and extensibility. We present a simpler and more general declarative framewor… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    ACM Class: I.2.7

  2. arXiv:2310.16787  [pdf, other

    cs.CL cs.AI cs.LG

    The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI

    Authors: Shayne Longpre, Robert Mahari, Anthony Chen, Naana Obeng-Marnu, Damien Sileo, William Brannon, Niklas Muennighoff, Nathan Khazam, Jad Kabbara, Kartik Perisetla, Xinyi Wu, Enrico Shippole, Kurt Bollacker, Tongshuang Wu, Luis Villa, Sandy Pentland, Sara Hooker

    Abstract: The race to train language models on vast, diverse, and inconsistently documented datasets has raised pressing concerns about the legal and ethical risks for practitioners. To remedy these practices threatening data transparency and understanding, we convene a multi-disciplinary effort between legal and machine learning experts to systematically audit and trace 1800+ text datasets. We develop tool… ▽ More

    Submitted 4 November, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: 30 pages (18 main), 6 figures, 5 tables

  3. arXiv:2310.01299  [pdf, other

    cs.CL cs.AI

    Generating Explanations in Medical Question-Answering by Expectation Maximization Inference over Evidence

    Authors: Wei Sun, Mingxiao Li, Damien Sileo, Jesse Davis, Marie-Francine Moens

    Abstract: Medical Question Answering~(medical QA) systems play an essential role in assisting healthcare workers in finding answers to their questions. However, it is not sufficient to merely provide answers by medical QA systems because users might want explanations, that is, more analytic statements in natural language that describe the elements and context that support the answer. To do so, we propose a… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  4. When Do Discourse Markers Affect Computational Sentence Understanding?

    Authors: Ruiqi Li, Liesbeth Allein, Damien Sileo, Marie-Francine Moens

    Abstract: The capabilities and use cases of automatic natural language processing (NLP) have grown significantly over the last few years. While much work has been devoted to understanding how humans deal with discourse connectives, this phenomenon is understudied in computational systems. Therefore, it is important to put NLP models under the microscope and examine whether they can adequately comprehend, pr… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: Chapter 7 of Discourse Markers in Interaction, published in Trends in Linguistics. Studies and Monographs

    Journal ref: Trends in Linguistics. Studies and Monographs, 2022

  5. arXiv:2305.03353  [pdf, other

    cs.CL cs.AI

    MindGames: Targeting Theory of Mind in Large Language Models with Dynamic Epistemic Modal Logic

    Authors: Damien Sileo, Antoine Lernould

    Abstract: Theory of Mind (ToM) is a critical component of intelligence but its assessment remains the subject of heated debates. Prior research applied human ToM assessments to natural language processing models using either human-created standardized tests or rule-based templates. However, these methods primarily focus on simplistic reasoning and require further validation. Here, we leverage dynamic episte… ▽ More

    Submitted 7 November, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: Accepted at EMNLP Findings 2023

    MSC Class: 68T01; 68T27; 68T50 ACM Class: I.2.7

  6. arXiv:2303.07069  [pdf, other

    cs.CL

    Generating multiple-choice questions for medical question answering with distractors and cue-masking

    Authors: Damien Sileo, Kanimozhi Uma, Marie-Francine Moens

    Abstract: Medical multiple-choice question answering (MCQA) is particularly difficult. Questions may describe patient symptoms and ask for the correct diagnosis, which requires domain knowledge and complex reasoning. Standard language modeling pretraining alone is not sufficient to achieve the best results. \citet{**2020disease} showed that focusing masked language modeling on disease name prediction when… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    ACM Class: H.4; H.5; I.2

  7. arXiv:2301.05948  [pdf, other

    cs.CL cs.AI

    tasksource: A Dataset Harmonization Framework for Streamlined NLP Multi-Task Learning and Evaluation

    Authors: Damien Sileo

    Abstract: The HuggingFace Datasets Hub hosts thousands of datasets, offering exciting opportunities for language model training and evaluation. However, datasets for a specific task type often have different schemas, making harmonization challenging. Multi-task training or evaluation necessitates manual work to fit data into task templates. Several initiatives independently tackle this issue by releasing ha… ▽ More

    Submitted 16 May, 2023; v1 submitted 14 January, 2023; originally announced January 2023.

    ACM Class: I.2.7

  8. arXiv:2211.03358  [pdf, other

    cs.CL cs.AI

    Probing neural language models for understanding of words of estimative probability

    Authors: Damien Sileo, Marie-Francine Moens

    Abstract: Words of estimative probability (WEP) are expressions of a statement's plausibility (probably, maybe, likely, doubt, likely, unlikely, impossible...). Multiple surveys demonstrate the agreement of human evaluators when assigning numerical probability levels to WEP. For example, highly likely corresponds to a median chance of 0.90+-0.08 in Fagen-Ulmschneider (2015)'s survey. In this work, we measur… ▽ More

    Submitted 25 June, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: Accepted at *SEM2023

  9. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, AdriĆ  Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  10. arXiv:2112.05647  [pdf, other

    cs.CL

    Analysis and Prediction of NLP Models Via Task Embeddings

    Authors: Damien Sileo, Marie-Francine Moens

    Abstract: Task embeddings are low-dimensional representations that are trained to capture task properties. In this paper, we propose MetaEval, a collection of $101$ NLP tasks. We fit a single transformer to all MetaEval tasks jointly while conditioning it on learned embeddings. The resulting task embeddings enable a novel analysis of the space of tasks. We then show that task aspects can be mapped to task e… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

    ACM Class: I.2.7; I.2.6

  11. arXiv:2112.04184  [pdf, other

    cs.CL cs.IR

    Zero-Shot Recommendation as Language Modeling

    Authors: Damien Sileo, Wout Vossen, Robbe Raymaekers

    Abstract: Recommendation is the task of ranking items (e.g. movies or products) according to individual user needs. Current systems rely on collaborative filtering and content-based techniques, which both require structured training data. We propose a framework for recommendation with off-the-shelf pretrained language models (LM) that only used unstructured text corpora as training data. If a user $u$ liked… ▽ More

    Submitted 8 December, 2021; originally announced December 2021.

    Comments: Accepted at ECIR 2022

    ACM Class: I.2.7; H.3.3; I.2.6

  12. arXiv:2112.02721  [pdf, other

    cs.CL cs.AI cs.LG

    NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

    Authors: Kaustubh D. Dhole, Varun Gangal, Sebastian Gehrmann, Aadesh Gupta, Zhenhao Li, Saad Mahamood, Abinaya Mahendiran, Simon Mille, Ashish Shrivastava, Samson Tan, Tongshuang Wu, Jascha Sohl-Dickstein, **ho D. Choi, Eduard Hovy, Ondrej Dusek, Sebastian Ruder, Sajant Anand, Nagender Aneja, Rabin Banjade, Lisa Barthe, Hanna Behnke, Ian Berlot-Attwell, Connor Boyle, Caroline Brun, Marco Antonio Sobrevilla Cabezudo , et al. (101 additional authors not shown)

    Abstract: Data augmentation is an important component in the robustness evaluation of models in natural language processing (NLP) and in enhancing the diversity of the data they are trained on. In this paper, we present NL-Augmenter, a new participatory Python-based natural language augmentation framework which supports the creation of both transformations (modifications to the data) and filters (data split… ▽ More

    Submitted 11 October, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: 39 pages, repository at https://github.com/GEM-benchmark/NL-Augmenter

  13. arXiv:2105.14774  [pdf, other

    cs.CL

    LIIR at SemEval-2021 task 6: Detection of Persuasion Techniques In Texts and Images using CLIP features

    Authors: Erfan Ghadery, Damien Sileo, Marie-Francine Moens

    Abstract: We describe our approach for SemEval-2021 task 6 on detection of persuasion techniques in multimodal content (memes). Our system combines pretrained multimodal models (CLIP) and chained classifiers. Also, we propose to enrich the data by a data augmentation technique. Our submission achieves a rank of 8/16 in terms of F1-micro and 9/16 with F1-macro on the test set.

    Submitted 31 May, 2021; originally announced May 2021.

  14. arXiv:2103.13942  [pdf, other

    cs.CL

    Visual Grounding Strategies for Text-Only Natural Language Processing

    Authors: Damien Sileo

    Abstract: Visual grounding is a promising path toward more robust and accurate Natural Language Processing (NLP) models. Many multimodal extensions of BERT (e.g., VideoBERT, LXMERT, VL-BERT) allow a joint modeling of texts and images that lead to state-of-the-art results on multimodal tasks such as Visual Question Answering. Here, we leverage multimodal modeling for purely textual tasks (language modeling a… ▽ More

    Submitted 25 March, 2021; originally announced March 2021.

    Comments: Accepted at LANTERN2021

  15. arXiv:2006.01603  [pdf, other

    cs.CL

    DiscSense: Automated Semantic Analysis of Discourse Markers

    Authors: Damien Sileo, Tim Van de Cruys, Camille Pradel, Philippe Muller

    Abstract: Discourse markers ({\it by contrast}, {\it happily}, etc.) are words or phrases that are used to signal semantic and/or pragmatic relationships between clauses or sentences. Recent work has fruitfully explored the prediction of discourse markers between sentence pairs in order to learn accurate sentence representations, that are useful in various classification tasks. In this work, we take another… ▽ More

    Submitted 2 June, 2020; originally announced June 2020.

    Comments: Accepted at LREC2020

  16. arXiv:1907.08672  [pdf, ps, other

    cs.CL

    A Pragmatics-Centered Evaluation Framework for Natural Language Understanding

    Authors: Damien Sileo, Tim Van-de-Cruys, Camille Pradel, Philippe Muller

    Abstract: New models for natural language understanding have recently made an unparalleled amount of progress, which has led some researchers to suggest that the models induce universal text representations. However, current benchmarks are predominantly targeting semantic phenomena; we make the case that pragmatics needs to take center stage in the evaluation of natural language understanding. We introduce… ▽ More

    Submitted 4 April, 2022; v1 submitted 19 July, 2019; originally announced July 2019.

    Comments: Accepted at LREC2022

    ACM Class: I.2.7; I.2.6

  17. arXiv:1904.02464  [pdf, other

    cs.CL

    Composition of Sentence Embeddings:Lessons from Statistical Relational Learning

    Authors: Damien Sileo, Tim Van-De-Cruys, Camille Pradel, Philippe Muller

    Abstract: Various NLP problems -- such as the prediction of sentence similarity, entailment, and discourse relations -- are all instances of the same general task: the modeling of semantic relations between a pair of textual elements. A popular model for such problems is to embed sentences into fixed size vectors, and use composition functions (e.g. concatenation or sum) of those vectors as features for the… ▽ More

    Submitted 4 April, 2019; originally announced April 2019.

    Comments: Camera-ready for *SEM 2019

  18. arXiv:1903.11850  [pdf, other

    cs.CL

    Mining Discourse Markers for Unsupervised Sentence Representation Learning

    Authors: Damien Sileo, Tim Van-De-Cruys, Camille Pradel, Philippe Muller

    Abstract: Current state of the art systems in NLP heavily rely on manually annotated datasets, which are expensive to construct. Very little work adequately exploits unannotated data -- such as discourse markers between sentences -- mainly because of data sparseness and ineffective extraction methods. In the present work, we propose a method to automatically discover sentence pairs with relevant discourse m… ▽ More

    Submitted 28 March, 2019; originally announced March 2019.

    Comments: Camera-ready for NAACL HLT 2019

  19. arXiv:1709.04820  [pdf, other

    cs.CL

    Synapse at CAp 2017 NER challenge: Fasttext CRF

    Authors: Damien Sileo, Camille Pradel, Philippe Muller, Tim Van de Cruys

    Abstract: We present our system for the CAp 2017 NER challenge which is about named entity recognition on French tweets. Our system leverages unsupervised learning on a larger dataset of French tweets to learn features feeding a CRF model. It was ranked first without using any gazetteer or structured external data, with an F-measure of 58.89\%. To the best of our knowledge, it is the first system to use fas… ▽ More

    Submitted 14 September, 2017; originally announced September 2017.

    Journal ref: CAP2017

  20. arXiv:1010.2148  [pdf, other

    cs.DB

    Ontological Matchmaking in Recommender Systems

    Authors: Angela Bonifati, Giansalvatore Mecca, Domenica Sileo, Gianvito Summa

    Abstract: The electronic marketplace offers great potential for the recommendation of supplies. In the so called recommender systems, it is crucial to apply matchmaking strategies that faithfully satisfy the predicates specified in the demand, and take into account as much as possible the user preferences. We focus on real-life ontology-driven matchmaking scenarios and identify a number of challenges, being… ▽ More

    Submitted 11 October, 2010; originally announced October 2010.

    Comments: 28 pages, 8 figures