Showing 1–10 of 10 results for author: Youssef, P

Searching in archive cs.
  1. arXiv:2405.02765

    cs.CL cs.AI

    Detecting Edited Knowledge in Language Models

    Authors: Paul Youssef, Zhixue Zhao, Jörg Schlötterer, Christin Seifert

    Abstract: Knowledge editing methods (KEs) can update language models' obsolete or inaccurate knowledge learned from pre-training. However, KEs can be used for malicious applications, e.g., inserting misinformation and toxic content. Knowing whether a generated output is based on edited knowledge or first-hand knowledge from pre-training can increase users' trust in generative models and provide more transpa…

    Submitted 1 July, 2024; v1 submitted 4 May, 2024; originally announced May 2024.

  2. arXiv:2405.00722

    cs.CL cs.AI

    LLMs for Generating and Evaluating Counterfactuals: A Comprehensive Study

    Authors: Van Bach Nguyen, Paul Youssef, Jörg Schlötterer, Christin Seifert

    Abstract: As NLP models become more complex, understanding their decisions becomes more crucial. Counterfactuals (CFs), where minimal changes to inputs flip a model's prediction, offer a way to explain these models. While Large Language Models (LLMs) have shown remarkable performance in NLP tasks, their efficacy in generating high-quality CFs remains uncertain. This work fills this gap by investigating how…

    Submitted 26 April, 2024; originally announced May 2024.

  3. arXiv:2404.05090

    cs.LG cs.AI cs.CL

    How Bad is Training on Synthetic Data? A Statistical Analysis of Language Model Collapse

    Authors: Mohamed El Amine Seddik, Suei-Wen Chen, Soufiane Hayou, Pierre Youssef, Merouane Debbah

    Abstract: The phenomenon of model collapse, introduced in (Shumailov et al., 2023), refers to the deterioration in performance that occurs when new models are trained on synthetic data generated from previously trained models. This recursive training loop makes the tails of the original distribution disappear, thereby making future-generation models forget about the initial (real) distribution. With the aim…

    Submitted 7 April, 2024; originally announced April 2024.

  4. arXiv:2402.01453

    cs.CL

    The Queen of England is not England's Queen: On the Lack of Factual Coherency in PLMs

    Authors: Paul Youssef, Jörg Schlötterer, Christin Seifert

    Abstract: Factual knowledge encoded in Pre-trained Language Models (PLMs) enriches their representations and justifies their use as knowledge bases. Previous work has focused on probing PLMs for factual knowledge by measuring how often they can correctly predict an object entity given a subject and a relation, and improving fact retrieval by optimizing the prompts used for querying PLMs. In this work, we co…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted to EACL Findings 2024

  5. arXiv:2310.16570

    cs.CL

    Give Me the Facts! A Survey on Factual Knowledge Probing in Pre-trained Language Models

    Authors: Paul Youssef, Osman Alperen Koraş, Meijie Li, Jörg Schlötterer, Christin Seifert

    Abstract: Pre-trained Language Models (PLMs) are trained on vast unlabeled data, rich in world knowledge. This fact has sparked the interest of the community in quantifying the amount of factual knowledge present in PLMs, as this explains their performance on downstream tasks, and potentially justifies their use as knowledge bases. In this work, we survey methods and datasets that are used to probe PLMs for…

    Submitted 4 December, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP Findings 2023

  6. arXiv:2307.12803

    cs.CL

    Guidance in Radiology Report Summarization: An Empirical Evaluation and Error Analysis

    Authors: Jan Trienes, Paul Youssef, Jörg Schlötterer, Christin Seifert

    Abstract: Automatically summarizing radiology reports into a concise impression can reduce the manual burden of clinicians and improve the consistency of reporting. Previous work aimed to enhance content selection and factuality through guided abstractive summarization. However, two key issues persist. First, current methods heavily rely on domain-specific resources to extract the guidance signal, limiting…

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: Accepted at INLG2023

  7. arXiv:2208.00522

    cs.LG cs.DS math.OC

    Online Decentralized Frank-Wolfe: From theoretical bound to applications in smart-building

    Authors: Angan Mitra, Nguyen Kim Thang, Tuan-Anh Nguyen, Denis Trystram, Paul Youssef

    Abstract: The design of decentralized learning algorithms is important in the fast-growing world in which data are distributed over participants with limited local computation resources and communication. In this direction, we propose an online algorithm minimizing non-convex loss functions aggregated from individual data/models distributed over a network. We provide the theoretical performance guarantee of…

    Submitted 31 July, 2022; originally announced August 2022.

  8. arXiv:2207.02057

    cs.DS cs.AI

    Online 2-stage Stable Matching

    Authors: Evripidis Bampis, Bruno Escoffier, Paul Youssef

    Abstract: We focus on an online 2-stage problem, motivated by the following situation: consider a system where students shall be assigned to universities. There is a first round where some students apply, and a first (stable) matching $M_1$ has to be computed. However, some students may decide to leave the system (change their plan, go to a foreign university, or to some institution not in the system). Then…

    Submitted 2 May, 2023; v1 submitted 5 July, 2022; originally announced July 2022.

  9. arXiv:1911.10392

    cs.CL

    When is ACL's Deadline? A Scientific Conversational Agent

    Authors: Mohsen Mesgar, Paul Youssef, Lin Li, Dominik Bierwirth, Yihao Li, Christian M. Meyer, Iryna Gurevych

    Abstract: Our conversational agent UKP-ATHENA assists NLP researchers in finding and exploring scientific literature, identifying relevant authors, planning or post-processing conference visits, and preparing paper submissions using a unified interface based on natural language inputs and responses. UKP-ATHENA enables new access paths to our swiftly evolving research area with its massive amounts of scienti…

    Submitted 23 November, 2019; originally announced November 2019.

  10. arXiv:1901.02671

    cs.CL cs.LG cs.NE

    Is it Time to Swish? Comparing Deep Learning Activation Functions Across NLP tasks

    Authors: Steffen Eger, Paul Youssef, Iryna Gurevych

    Abstract: Activation functions play a crucial role in neural networks because they are the nonlinearities which have been attributed to the success story of deep learning. One of the currently most popular activation functions is ReLU, but several competitors have recently been proposed or 'discovered', including LReLU functions and swish. While most works compare newly proposed activation functions on few…

    Submitted 9 January, 2019; originally announced January 2019.

    Comments: Published at EMNLP 2018