Skip to main content

Showing 1–8 of 8 results for author: D'Oosterlinck, K

.
  1. arXiv:2406.09458  [pdf, other

    cs.CV cs.AI cs.CL

    Updating CLIP to Prefer Descriptions Over Captions

    Authors: Amir Zur, Elisa Kreiss, Karel D'Oosterlinck, Christopher Potts, Atticus Geiger

    Abstract: Although CLIPScore is a powerful generic metric that captures the similarity between a text and an image, it fails to distinguish between a caption that is meant to complement the information in an image and a description that is meant to replace an image entirely, e.g., for accessibility. We address this shortcoming by updating the CLIP model with the Concadia dataset to assign higher scores to d… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2401.12178  [pdf, other

    cs.CL cs.AI

    In-Context Learning for Extreme Multi-Label Classification

    Authors: Karel D'Oosterlinck, Omar Khattab, François Remy, Thomas Demeester, Chris Develder, Christopher Potts

    Abstract: Multi-label classification problems with thousands of classes are hard to solve with in-context learning alone, as language models (LMs) might lack prior knowledge about the precise classes or how to assign them, and it is generally infeasible to demonstrate every class in a prompt. We propose a general program, $\texttt{Infer--Retrieve--Rank}$, that defines multi-step interactions between LMs and… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  3. arXiv:2311.10905  [pdf, other

    cs.CL cs.AI

    Flexible Model Interpretability through Natural Language Model Editing

    Authors: Karel D'Oosterlinck, Thomas Demeester, Chris Develder, Christopher Potts

    Abstract: Model interpretability and model editing are crucial goals in the age of large language models. Interestingly, there exists a link between these two goals: if a method is able to systematically edit model behavior with regard to a human concept of interest, this editor method can help make internal representations more interpretable by pointing towards relevant representations and systematically m… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: Extended Abstract -- work in progress. BlackboxNLP2023

  4. arXiv:2310.06165  [pdf, other

    cs.CL cs.AI

    CAW-coref: Conjunction-Aware Word-level Coreference Resolution

    Authors: Karel D'Oosterlinck, Semere Kiros Bitew, Brandon Papineau, Christopher Potts, Thomas Demeester, Chris Develder

    Abstract: State-of-the-art coreference resolutions systems depend on multiple LLM calls per document and are thus prohibitively expensive for many use cases (e.g., information extraction with large corpora). The leading word-level coreference system (WL-coref) attains 96.6% of these SOTA systems' performance while being much more efficient. In this work, we identify a routine yet important failure case of W… ▽ More

    Submitted 19 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted at CRAC 2023

  5. arXiv:2309.10312  [pdf, other

    cs.CL

    Rigorously Assessing Natural Language Explanations of Neurons

    Authors: **g Huang, Atticus Geiger, Karel D'Oosterlinck, Zhengxuan Wu, Christopher Potts

    Abstract: Natural language is an appealing medium for explaining how large language models process and store information, but evaluating the faithfulness of such explanations is challenging. To help address this, we develop two modes of evaluation for natural language explanations that claim individual neurons represent a concept in a text input. In the observational mode, we evaluate claims that a neuron… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

  6. arXiv:2305.13395  [pdf, other

    cs.CL

    BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance

    Authors: Karel D'Oosterlinck, François Remy, Johannes Deleu, Thomas Demeester, Chris Develder, Klim Zaporojets, Aneiss Ghodsi, Simon Ellershaw, Jack Collins, Christopher Potts

    Abstract: Timely and accurate extraction of Adverse Drug Events (ADE) from biomedical literature is paramount for public safety, but involves slow and costly manual labor. We set out to improve drug safety monitoring (pharmacovigilance, PV) through the use of Natural Language Processing (NLP). We introduce BioDEX, a large-scale resource for Biomedical adverse Drug Event Extraction, rooted in the historical… ▽ More

    Submitted 20 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 28 pages. EMNLP Findings 2023

  7. arXiv:2209.14279  [pdf, other

    cs.CL

    Causal Proxy Models for Concept-Based Model Explanations

    Authors: Zhengxuan Wu, Karel D'Oosterlinck, Atticus Geiger, Amir Zur, Christopher Potts

    Abstract: Explainability methods for NLP systems encounter a version of the fundamental problem of causal inference: for a given ground-truth input text, we never truly observe the counterfactual texts necessary for isolating the causal effects of model representations on outputs. In response, many explainability methods make no use of counterfactual texts, assuming they will be unavailable. In this paper,… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: 23 pages

  8. arXiv:2205.14140  [pdf, other

    cs.CL

    CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior

    Authors: Eldar David Abraham, Karel D'Oosterlinck, Amir Feder, Yair Ori Gat, Atticus Geiger, Christopher Potts, Roi Reichart, Zhengxuan Wu

    Abstract: The increasing size and complexity of modern ML systems has improved their predictive capabilities but made their behavior harder to explain. Many techniques for model explanation have been developed in response, but we lack clear criteria for assessing these techniques. In this paper, we cast model explanation as the causal inference problem of estimating causal effects of real-world concepts on… ▽ More

    Submitted 12 October, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Accepted to NeurIPS 2022