Skip to main content

Showing 1–15 of 15 results for author: Rücklé, A

.
  1. arXiv:2306.16900  [pdf, other

    cs.CL

    Surveying (Dis)Parities and Concerns of Compute Hungry NLP Research

    Authors: Ji-Ung Lee, Haritz Puerto, Betty van Aken, Yuki Arase, Jessica Zosa Forde, Leon Derczynski, Andreas Rücklé, Iryna Gurevych, Roy Schwartz, Emma Strubell, Jesse Dodge

    Abstract: Many recent improvements in NLP stem from the development and use of large pre-trained language models (PLMs) with billions of parameters. Large model sizes makes computational cost one of the main limiting factors for training and evaluating such models; and has raised severe concerns about the sustainability, reproducibility, and inclusiveness for researching PLMs. These concerns are often based… ▽ More

    Submitted 9 November, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

  2. arXiv:2104.08663  [pdf, other

    cs.IR cs.AI cs.CL

    BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models

    Authors: Nandan Thakur, Nils Reimers, Andreas Rücklé, Abhishek Srivastava, Iryna Gurevych

    Abstract: Existing neural information retrieval (IR) models have often been studied in homogeneous and narrow settings, which has considerably limited insights into their out-of-distribution (OOD) generalization capabilities. To address this, and to facilitate researchers to broadly evaluate the effectiveness of their models, we introduce Benchmarking-IR (BEIR), a robust and heterogeneous evaluation benchma… ▽ More

    Submitted 20 October, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

    Comments: Accepted at NeurIPS 2021 Dataset and Benchmark Track

  3. arXiv:2104.08296  [pdf, other

    cs.CL

    Learning to Reason for Text Generation from Scientific Tables

    Authors: Nafise Sadat Moosavi, Andreas Rücklé, Dan Roth, Iryna Gurevych

    Abstract: In this paper, we introduce SciGen, a new challenge dataset for the task of reasoning-aware data-to-text generation consisting of tables from scientific articles and their corresponding descriptions. Describing scientific tables goes beyond the surface realization of the table content and requires reasoning over table values. The unique properties of SciGen are that (1) tables mostly contain numer… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

  4. arXiv:2104.08247  [pdf, other

    cs.CL

    What to Pre-Train on? Efficient Intermediate Task Selection

    Authors: Clifton Poth, Jonas Pfeiffer, Andreas Rücklé, Iryna Gurevych

    Abstract: Intermediate task fine-tuning has been shown to culminate in large transfer gains across many NLP tasks. With an abundance of candidate datasets as well as pre-trained language models, it has become infeasible to run the cross-product of all combinations to find the best transfer setting. In this work we first establish that similar sequential fine-tuning gains can be achieved in adapter settings,… ▽ More

    Submitted 10 September, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021

  5. arXiv:2104.07081  [pdf, other

    cs.CL

    TWEAC: Transformer with Extendable QA Agent Classifiers

    Authors: Gregor Geigle, Nils Reimers, Andreas Rücklé, Iryna Gurevych

    Abstract: Question answering systems should help users to access knowledge on a broad range of topics and to answer a wide array of different questions. Most systems fall short of this expectation as they are only specialized in one particular setting, e.g., answering factual questions with Wikipedia data. To overcome this limitation, we propose composing multiple QA agents within a meta-QA system. We argue… ▽ More

    Submitted 16 September, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

  6. arXiv:2010.11918  [pdf, other

    cs.LG cs.CL

    AdapterDrop: On the Efficiency of Adapters in Transformers

    Authors: Andreas Rücklé, Gregor Geigle, Max Glockner, Tilman Beck, Jonas Pfeiffer, Nils Reimers, Iryna Gurevych

    Abstract: Massively pre-trained transformer models are computationally expensive to fine-tune, slow for inference, and have large storage requirements. Recent approaches tackle these shortcomings by training smaller models, dynamically reducing the model size, and by training light-weight adapters. In this paper, we propose AdapterDrop, removing adapters from lower transformer layers during training and inf… ▽ More

    Submitted 5 October, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: EMNLP 2021

  7. arXiv:2010.03338  [pdf, other

    cs.CL

    Improving QA Generalization by Concurrent Modeling of Multiple Biases

    Authors: Mingzhu Wu, Nafise Sadat Moosavi, Andreas Rücklé, Iryna Gurevych

    Abstract: Existing NLP datasets contain various biases that models can easily exploit to achieve high performances on the corresponding evaluation sets. However, focusing on dataset-specific biases limits their ability to learn more generalizable knowledge about the task from more general data patterns. In this paper, we investigate the impact of debiasing methods for improving generalization and propose a… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

  8. arXiv:2010.00980  [pdf, other

    cs.CL cs.IR

    MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale

    Authors: Andreas Rücklé, Jonas Pfeiffer, Iryna Gurevych

    Abstract: We study the zero-shot transfer capabilities of text matching models on a massive scale, by self-supervised training on 140 source domains from community question answering forums in English. We investigate the model performances on nine benchmarks of answer selection and question similarity tasks, and show that all 140 models transfer surprisingly well, where the large majority of models substant… ▽ More

    Submitted 2 October, 2020; originally announced October 2020.

    Comments: EMNLP-2020

  9. arXiv:2007.07779  [pdf, other

    cs.CL

    AdapterHub: A Framework for Adapting Transformers

    Authors: Jonas Pfeiffer, Andreas Rücklé, Clifton Poth, Aishwarya Kamath, Ivan Vulić, Sebastian Ruder, Kyunghyun Cho, Iryna Gurevych

    Abstract: The current modus operandi in NLP involves downloading and fine-tuning pre-trained models consisting of millions or billions of parameters. Storing and sharing such large trained models is expensive, slow, and time-consuming, which impedes progress towards more general and versatile NLP methods that learn from and for many tasks. Adapters -- small learnt bottleneck layers inserted within each laye… ▽ More

    Submitted 6 October, 2020; v1 submitted 15 July, 2020; originally announced July 2020.

    Comments: EMNLP 2020: Systems Demonstrations

  10. arXiv:2005.00247  [pdf, other

    cs.CL

    AdapterFusion: Non-Destructive Task Composition for Transfer Learning

    Authors: Jonas Pfeiffer, Aishwarya Kamath, Andreas Rücklé, Kyunghyun Cho, Iryna Gurevych

    Abstract: Sequential fine-tuning and multi-task learning are methods aiming to incorporate knowledge from multiple tasks; however, they suffer from catastrophic forgetting and difficulties in dataset balancing. To address these shortcomings, we propose AdapterFusion, a new two stage learning algorithm that leverages knowledge from multiple tasks. First, in the knowledge extraction stage we learn task specif… ▽ More

    Submitted 26 January, 2021; v1 submitted 1 May, 2020; originally announced May 2020.

    Journal ref: Proceedings of EACL 2021

  11. arXiv:1911.05594  [pdf, other

    cs.CL cs.LG

    Neural Duplicate Question Detection without Labeled Training Data

    Authors: Andreas Rücklé, Nafise Sadat Moosavi, Iryna Gurevych

    Abstract: Supervised training of neural models to duplicate question detection in community Question Answering (cQA) requires large amounts of labeled question pairs, which are costly to obtain. To minimize this cost, recent works thus often used alternative methods, e.g., adversarial domain adaptation. In this work, we propose two novel methods: (1) the automatic generation of duplicate questions, and (2)… ▽ More

    Submitted 19 September, 2020; v1 submitted 13 November, 2019; originally announced November 2019.

    Comments: Accepted as long paper at EMNLP-2019

  12. arXiv:1909.08940  [pdf, ps, other

    cs.CL

    Improving Generalization by Incorporating Coverage in Natural Language Inference

    Authors: Nafise Sadat Moosavi, Prasetya Ajie Utama, Andreas Rücklé, Iryna Gurevych

    Abstract: The task of natural language inference (NLI) is to identify the relation between the given premise and hypothesis. While recent NLI models achieve very high performance on individual datasets, they fail to generalize across similar datasets. This indicates that they are solving NLI datasets instead of the task itself. In order to improve generalization, we propose to extend the input representatio… ▽ More

    Submitted 19 September, 2019; originally announced September 2019.

  13. arXiv:1906.01575  [pdf, other

    cs.CL

    Pitfalls in the Evaluation of Sentence Embeddings

    Authors: Steffen Eger, Andreas Rücklé, Iryna Gurevych

    Abstract: Deep learning models continuously break new records across different NLP tasks. At the same time, their success exposes weaknesses of model evaluation. Here, we compile several key pitfalls of evaluation of sentence embeddings, a currently very popular NLP paradigm. These pitfalls include the comparison of embeddings of different sizes, normalization of embeddings, and the low (and diverging) corr… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: Accepted at Repl4NLP 2019

  14. arXiv:1903.11508  [pdf, other

    cs.CL cs.CR cs.CV cs.LG

    Text Processing Like Humans Do: Visually Attacking and Shielding NLP Systems

    Authors: Steffen Eger, Gözde Gül Şahin, Andreas Rücklé, Ji-Ung Lee, Claudia Schulz, Mohsen Mesgar, Krishnkant Swarnkar, Edwin Simpson, Iryna Gurevych

    Abstract: Visual modifications to text are often used to obfuscate offensive comments in social media (e.g., "!d10t") or as a writing style ("1337" in "leet speak"), among other scenarios. We consider this as a new type of adversarial attack in NLP, a setting to which humans are very robust, as our experiments with both simple and more difficult visual input perturbations demonstrate. We then investigate th… ▽ More

    Submitted 10 June, 2020; v1 submitted 27 March, 2019; originally announced March 2019.

    Comments: Accepted as long paper at NAACL-2019; fixed one ungrammatical sentence

  15. arXiv:1803.01400  [pdf, other

    cs.CL

    Concatenated Power Mean Word Embeddings as Universal Cross-Lingual Sentence Representations

    Authors: Andreas Rücklé, Steffen Eger, Maxime Peyrard, Iryna Gurevych

    Abstract: Average word embeddings are a common baseline for more sophisticated sentence embedding techniques. However, they typically fall short of the performances of more complex models such as InferSent. Here, we generalize the concept of average word embeddings to power mean word embeddings. We show that the concatenation of different types of power mean word embeddings considerably closes the gap to st… ▽ More

    Submitted 12 September, 2018; v1 submitted 4 March, 2018; originally announced March 2018.

    Comments: Experiments/plots added: Normalization + Figure 1 (dimensionality vs. performance)