Skip to main content

Showing 1–6 of 6 results for author: Poth, C

.
  1. arXiv:2407.01091  [pdf, other

    cs.CL

    M2QA: Multi-domain Multilingual Question Answering

    Authors: Leon Engländer, Hannah Sterz, Clifton Poth, Jonas Pfeiffer, Ilia Kuznetsov, Iryna Gurevych

    Abstract: Generalization and robustness to input variation are core desiderata of machine learning research. Language varies along several axes, most importantly, language instance (e.g. French) and domain (e.g. news). While adapting NLP models to new languages within a single domain, or to new domains within a single language, is widely studied, research in joint adaptation is hampered by the lack of evalu… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2311.11077  [pdf, other

    cs.CL cs.AI cs.LG

    Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning

    Authors: Clifton Poth, Hannah Sterz, Indraneil Paul, Sukannya Purkayastha, Leon Engländer, Timo Imhof, Ivan Vulić, Sebastian Ruder, Iryna Gurevych, Jonas Pfeiffer

    Abstract: We introduce Adapters, an open-source library that unifies parameter-efficient and modular transfer learning in large language models. By integrating 10 diverse adapter methods into a unified interface, Adapters offers ease of use and flexible configuration. Our library allows researchers and practitioners to leverage adapter modularity through composition blocks, enabling the design of complex ad… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023: Systems Demonstrations

  3. arXiv:2310.06372  [pdf, other

    cs.CR cs.CV cs.LG

    Leveraging Diffusion-Based Image Variations for Robust Training on Poisoned Data

    Authors: Lukas Struppek, Martin B. Hentschel, Clifton Poth, Dominik Hintersdorf, Kristian Kersting

    Abstract: Backdoor attacks pose a serious security threat for training neural networks as they surreptitiously introduce hidden functionalities into a model. Such backdoors remain silent during inference on clean inputs, evading detection due to inconspicuous behavior. However, once a specific trigger pattern appears in the input data, the backdoor activates, causing the model to execute its concealed funct… ▽ More

    Submitted 13 December, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Published at NeurIPS 2023 Workshop on Backdoors in Deep Learning: The Good, the Bad, and the Ugly

  4. arXiv:2203.13693  [pdf, other

    cs.CL cs.IR

    UKP-SQUARE: An Online Platform for Question Answering Research

    Authors: Tim Baumgärtner, Kexin Wang, Rachneet Sachdeva, Max Eichler, Gregor Geigle, Clifton Poth, Hannah Sterz, Haritz Puerto, Leonardo F. R. Ribeiro, Jonas Pfeiffer, Nils Reimers, Gözde Gül Şahin, Iryna Gurevych

    Abstract: Recent advances in NLP and information retrieval have given rise to a diverse set of question answering tasks that are of different formats (e.g., extractive, abstractive), require different model architectures (e.g., generative, discriminative), and setups (e.g., with or without retrieval). Despite having a large number of powerful, specialized QA pipelines (which we refer to as Skills) that cons… ▽ More

    Submitted 28 March, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: Accepted at ACL 2022 Demo Track

  5. arXiv:2104.08247  [pdf, other

    cs.CL

    What to Pre-Train on? Efficient Intermediate Task Selection

    Authors: Clifton Poth, Jonas Pfeiffer, Andreas Rücklé, Iryna Gurevych

    Abstract: Intermediate task fine-tuning has been shown to culminate in large transfer gains across many NLP tasks. With an abundance of candidate datasets as well as pre-trained language models, it has become infeasible to run the cross-product of all combinations to find the best transfer setting. In this work we first establish that similar sequential fine-tuning gains can be achieved in adapter settings,… ▽ More

    Submitted 10 September, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021

  6. arXiv:2007.07779  [pdf, other

    cs.CL

    AdapterHub: A Framework for Adapting Transformers

    Authors: Jonas Pfeiffer, Andreas Rücklé, Clifton Poth, Aishwarya Kamath, Ivan Vulić, Sebastian Ruder, Kyunghyun Cho, Iryna Gurevych

    Abstract: The current modus operandi in NLP involves downloading and fine-tuning pre-trained models consisting of millions or billions of parameters. Storing and sharing such large trained models is expensive, slow, and time-consuming, which impedes progress towards more general and versatile NLP methods that learn from and for many tasks. Adapters -- small learnt bottleneck layers inserted within each laye… ▽ More

    Submitted 6 October, 2020; v1 submitted 15 July, 2020; originally announced July 2020.

    Comments: EMNLP 2020: Systems Demonstrations