Skip to main content

Showing 1–9 of 9 results for author: Remy, F

.
  1. arXiv:2401.12178  [pdf, other

    cs.CL cs.AI

    In-Context Learning for Extreme Multi-Label Classification

    Authors: Karel D'Oosterlinck, Omar Khattab, François Remy, Thomas Demeester, Chris Develder, Christopher Potts

    Abstract: Multi-label classification problems with thousands of classes are hard to solve with in-context learning alone, as language models (LMs) might lack prior knowledge about the precise classes or how to assign them, and it is generally infeasible to demonstrate every class in a prompt. We propose a general program, $\texttt{Infer--Retrieve--Rank}$, that defines multi-step interactions between LMs and… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  2. arXiv:2311.16075  [pdf

    cs.CL cs.AI cs.IR

    BioLORD-2023: Semantic Textual Representations Fusing LLM and Clinical Knowledge Graph Insights

    Authors: François Remy, Kris Demuynck, Thomas Demeester

    Abstract: In this study, we investigate the potential of Large Language Models to complement biomedical knowledge graphs in the training of semantic models for the biomedical and clinical domains. Drawing on the wealth of the UMLS knowledge graph and harnessing cutting-edge Large Language Models, we propose a new state-of-the-art approach for obtaining high-fidelity representations of biomedical concepts an… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: Preprint of upcoming journal article

  3. arXiv:2310.03477  [pdf, other

    cs.CL cs.AI

    Tik-to-Tok: Translating Language Models One Token at a Time: An Embedding Initialization Strategy for Efficient Language Adaptation

    Authors: François Remy, Pieter Delobelle, Bettina Berendt, Kris Demuynck, Thomas Demeester

    Abstract: Training monolingual language models for low and mid-resource languages is made challenging by limited and often inadequate pretraining data. In this study, we propose a novel model conversion strategy to address this issue, adapting high-resources monolingual language models to a new target language. By generalizing over a word translation dictionary encompassing both the source and target langua… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: As first reviewed at TACL

  4. arXiv:2308.00157  [pdf, other

    cs.CL

    Boosting Adverse Drug Event Normalization on Social Media: General-Purpose Model Initialization and Biomedical Semantic Text Similarity Benefit Zero-Shot Linking in Informal Contexts

    Authors: François Remy, Simone Scaboro, Beatrice Portelli

    Abstract: Biomedical entity linking, also known as biomedical concept normalization, has recently witnessed the rise to prominence of zero-shot contrastive models. However, the pre-training material used for these models has, until now, largely consisted of specialist biomedical content such as MIMIC-III clinical notes (Johnson et al., 2016) and PubMed papers (Sayers et al., 2021; Gao et al., 2020). While t… ▽ More

    Submitted 31 July, 2023; originally announced August 2023.

  5. arXiv:2306.00665  [pdf, other

    cs.CL

    Automatic Glossary of Clinical Terminology: a Large-Scale Dictionary of Biomedical Definitions Generated from Ontological Knowledge

    Authors: François Remy, Thomas Demeester

    Abstract: Background: More than 400,000 biomedical concepts and some of their relationships are contained in SnomedCT, a comprehensive biomedical ontology. However, their concept names are not always readily interpretable by non-experts, or patients looking at their own electronic health records (EHR). Clear definitions or descriptions in understandable language are often not available. Therefore, generatin… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted at the BioNLP 2023 workshop

  6. arXiv:2305.13395  [pdf, other

    cs.CL

    BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance

    Authors: Karel D'Oosterlinck, François Remy, Johannes Deleu, Thomas Demeester, Chris Develder, Klim Zaporojets, Aneiss Ghodsi, Simon Ellershaw, Jack Collins, Christopher Potts

    Abstract: Timely and accurate extraction of Adverse Drug Events (ADE) from biomedical literature is paramount for public safety, but involves slow and costly manual labor. We set out to improve drug safety monitoring (pharmacovigilance, PV) through the use of Natural Language Processing (NLP). We introduce BioDEX, a large-scale resource for Biomedical adverse Drug Event Extraction, rooted in the historical… ▽ More

    Submitted 20 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 28 pages. EMNLP Findings 2023

  7. arXiv:2305.06801  [pdf, other

    cs.CL

    Detecting Idiomatic Multiword Expressions in Clinical Terminology using Definition-Based Representation Learning

    Authors: François Remy, Alfiya Khabibullina, Thomas Demeester

    Abstract: This paper shines a light on the potential of definition-based semantic models for detecting idiomatic and semi-idiomatic multiword expressions (MWEs) in clinical terminology. Our study focuses on biomedical entities defined in the UMLS ontology and aims to help prioritize the translation efforts of these entities. In particular, we develop an effective tool for scoring the idiomaticity of biomedi… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: Best Paper Award @ MWE 2023

  8. arXiv:2210.11892  [pdf, other

    cs.CL cs.IR

    BioLORD: Learning Ontological Representations from Definitions (for Biomedical Concepts and their Textual Descriptions)

    Authors: François Remy, Kris Demuynck, Thomas Demeester

    Abstract: This work introduces BioLORD, a new pre-training strategy for producing meaningful representations for clinical sentences and biomedical concepts. State-of-the-art methodologies operate by maximizing the similarity in representation of names referring to the same concept, and preventing collapse through contrastive learning. However, because biomedical names are not always self-explanatory, it som… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Accepted in Findings of EMNLP 2022

  9. Dykes for filtering ocean waves using c-shaped vertical cylinders

    Authors: Guillaume Dupont, Fabien Remy, Olivier Kimmoun, Bernard Molin, Sebastien Guenneau, Stefan Enoch

    Abstract: The present study investigates a way to design dykes which can filter the wavelengths of ocean surface waves. This offers the possibility to achieve a structure that can attenuate waves associated with storm swell, without affecting coastline in other conditions. Our approach is based on low frequency resonances in metamaterials combined with Bragg frequencies for which waves cannot propagate in p… ▽ More

    Submitted 5 May, 2017; originally announced May 2017.

    Journal ref: Phys. Rev. B 96, 180302 (2017)