Skip to main content

Showing 1–24 of 24 results for author: Reimers, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.12332  [pdf, other

    cs.CL

    Triple-Encoders: Representations That Fire Together, Wire Together

    Authors: Justus-Jonas Erker, Florian Mai, Nils Reimers, Gerasimos Spanakis, Iryna Gurevych

    Abstract: Search-based dialog models typically re-encode the dialog history at every turn, incurring high cost. Curved Contrastive Learning, a representation learning method that encodes relative distances between utterances into the embedding space via a bi-encoder, has recently shown promising results for dialog modeling at far superior efficiency. While high efficiency is achieved through independently e… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: in Review at ACL Rolling Review

  2. arXiv:2305.13915  [pdf, other

    cs.IR cs.CL

    DAPR: A Benchmark on Document-Aware Passage Retrieval

    Authors: Kexin Wang, Nils Reimers, Iryna Gurevych

    Abstract: The work of neural retrieval so far focuses on ranking short texts and is challenged with long documents. There are many cases where the users want to find a relevant passage within a long document from a huge corpus, e.g. Wikipedia articles, research papers, etc. We propose and name this task \emph{Document-Aware Passage Retrieval} (DAPR). While analyzing the errors of the State-of-The-Art (SoTA)… ▽ More

    Submitted 9 June, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2024 Main Conference

  3. arXiv:2210.10695  [pdf, other

    cs.IR cs.CL

    Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking

    Authors: Tim Baumgärtner, Leonardo F. R. Ribeiro, Nils Reimers, Iryna Gurevych

    Abstract: Pairing a lexical retriever with a neural re-ranking model has set state-of-the-art performance on large-scale information retrieval datasets. This pipeline covers scenarios like question answering or navigational queries, however, for information-seeking scenarios, users often provide information on whether a document is relevant to their query in form of clicks or explicit feedback. Therefore, i… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted at EMNLP 2022

  4. arXiv:2210.07316  [pdf, other

    cs.CL cs.IR cs.LG

    MTEB: Massive Text Embedding Benchmark

    Authors: Niklas Muennighoff, Nouamane Tazi, Loïc Magne, Nils Reimers

    Abstract: Text embeddings are commonly evaluated on a small set of datasets from a single task not covering their possible applications to other tasks. It is unclear whether state-of-the-art embeddings on semantic textual similarity (STS) can be equally well applied to other tasks like clustering or reranking. This makes progress in the field difficult to track, as various models are constantly being propos… ▽ More

    Submitted 19 March, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: 24 pages, 14 tables, 6 figures

  5. arXiv:2209.11055  [pdf, other

    cs.CL

    Efficient Few-Shot Learning Without Prompts

    Authors: Lewis Tunstall, Nils Reimers, Unso Eun Seo Jo, Luke Bates, Daniel Korat, Moshe Wasserblat, Oren Pereg

    Abstract: Recent few-shot methods, such as parameter-efficient fine-tuning (PEFT) and pattern exploiting training (PET), have achieved impressive results in label-scarce settings. However, they are difficult to employ since they are subject to high variability from manually crafted prompts, and typically require billion-parameter language models to achieve high accuracy. To address these shortcomings, we pr… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

  6. arXiv:2205.11498  [pdf, other

    cs.IR cs.AI cs.LG

    Injecting Domain Adaptation with Learning-to-hash for Effective and Efficient Zero-shot Dense Retrieval

    Authors: Nandan Thakur, Nils Reimers, Jimmy Lin

    Abstract: Dense retrieval overcome the lexical gap and has shown great success in ad-hoc information retrieval (IR). Despite their success, dense retrievers are expensive to serve across practical use cases. For use cases requiring to search from millions of documents, the dense index becomes bulky and requires high memory usage for storing the index. More recently, learning-to-hash (LTH) techniques, for e.… ▽ More

    Submitted 19 July, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: Accepted at ReNeuIR 2023 Workshop

  7. arXiv:2203.13693  [pdf, other

    cs.CL cs.IR

    UKP-SQUARE: An Online Platform for Question Answering Research

    Authors: Tim Baumgärtner, Kexin Wang, Rachneet Sachdeva, Max Eichler, Gregor Geigle, Clifton Poth, Hannah Sterz, Haritz Puerto, Leonardo F. R. Ribeiro, Jonas Pfeiffer, Nils Reimers, Gözde Gül Şahin, Iryna Gurevych

    Abstract: Recent advances in NLP and information retrieval have given rise to a diverse set of question answering tasks that are of different formats (e.g., extractive, abstractive), require different model architectures (e.g., generative, discriminative), and setups (e.g., with or without retrieval). Despite having a large number of powerful, specialized QA pipelines (which we refer to as Skills) that cons… ▽ More

    Submitted 28 March, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: Accepted at ACL 2022 Demo Track

  8. arXiv:2112.07577  [pdf, other

    cs.CL cs.IR

    GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval

    Authors: Kexin Wang, Nandan Thakur, Nils Reimers, Iryna Gurevych

    Abstract: Dense retrieval approaches can overcome the lexical gap and lead to significantly improved search results. However, they require large amounts of training data which is not available for most domains. As shown in previous work (Thakur et al., 2021b), the performance of dense retrievers severely degrades under a domain shift. This limits the usage of dense retrieval approaches to only a few domains… ▽ More

    Submitted 25 April, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: Accepted at NAACL 2022

  9. arXiv:2104.08663  [pdf, other

    cs.IR cs.AI cs.CL

    BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models

    Authors: Nandan Thakur, Nils Reimers, Andreas Rücklé, Abhishek Srivastava, Iryna Gurevych

    Abstract: Existing neural information retrieval (IR) models have often been studied in homogeneous and narrow settings, which has considerably limited insights into their out-of-distribution (OOD) generalization capabilities. To address this, and to facilitate researchers to broadly evaluate the effectiveness of their models, we introduce Benchmarking-IR (BEIR), a robust and heterogeneous evaluation benchma… ▽ More

    Submitted 20 October, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

    Comments: Accepted at NeurIPS 2021 Dataset and Benchmark Track

  10. arXiv:2104.07081  [pdf, other

    cs.CL

    TWEAC: Transformer with Extendable QA Agent Classifiers

    Authors: Gregor Geigle, Nils Reimers, Andreas Rücklé, Iryna Gurevych

    Abstract: Question answering systems should help users to access knowledge on a broad range of topics and to answer a wide array of different questions. Most systems fall short of this expectation as they are only specialized in one particular setting, e.g., answering factual questions with Wikipedia data. To overcome this limitation, we propose composing multiple QA agents within a meta-QA system. We argue… ▽ More

    Submitted 16 September, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

  11. arXiv:2104.06979  [pdf, other

    cs.CL

    TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding Learning

    Authors: Kexin Wang, Nils Reimers, Iryna Gurevych

    Abstract: Learning sentence embeddings often requires a large amount of labeled data. However, for most tasks and domains, labeled data is seldom available and creating it is expensive. In this work, we present a new state-of-the-art unsupervised method based on pre-trained Transformers and Sequential Denoising Auto-Encoder (TSDAE) which outperforms previous approaches by up to 6.4 points. It can achieve up… ▽ More

    Submitted 10 September, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: Accepted at EMNLP 2021 Findings

  12. arXiv:2103.11920  [pdf, other

    cs.CV cs.CL

    Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal Retrieval

    Authors: Gregor Geigle, Jonas Pfeiffer, Nils Reimers, Ivan Vulić, Iryna Gurevych

    Abstract: Current state-of-the-art approaches to cross-modal retrieval process text and visual input jointly, relying on Transformer-based architectures with cross-attention mechanisms that attend over all words and objects in an image. While offering unmatched retrieval performance, such models: 1) are typically pretrained from scratch and thus less scalable, 2) suffer from huge retrieval latency and ineff… ▽ More

    Submitted 18 February, 2022; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: TACL 2022

  13. arXiv:2012.14210  [pdf, other

    cs.IR cs.CL

    The Curse of Dense Low-Dimensional Information Retrieval for Large Index Sizes

    Authors: Nils Reimers, Iryna Gurevych

    Abstract: Information Retrieval using dense low-dimensional representations recently became popular and showed out-performance to traditional sparse-representations like BM25. However, no previous work investigated how dense representations perform with large index sizes. We show theoretically and empirically that the performance for dense representations decreases quicker than sparse representations for in… ▽ More

    Submitted 9 June, 2021; v1 submitted 28 December, 2020; originally announced December 2020.

    Comments: Published at ACL 2021

  14. arXiv:2011.12249  [pdf, other

    cs.CL

    Generalizing Cross-Document Event Coreference Resolution Across Multiple Corpora

    Authors: Michael Bugert, Nils Reimers, Iryna Gurevych

    Abstract: Cross-document event coreference resolution (CDCR) is an NLP task in which mentions of events need to be identified and clustered throughout a collection of documents. CDCR aims to benefit downstream multi-document applications, but despite recent progress on corpora and system development, downstream improvements from applying CDCR have not been shown yet. We make the observation that every CDCR… ▽ More

    Submitted 10 June, 2021; v1 submitted 24 November, 2020; originally announced November 2020.

    Comments: Accepted at CL Journal

  15. arXiv:2010.11918  [pdf, other

    cs.LG cs.CL

    AdapterDrop: On the Efficiency of Adapters in Transformers

    Authors: Andreas Rücklé, Gregor Geigle, Max Glockner, Tilman Beck, Jonas Pfeiffer, Nils Reimers, Iryna Gurevych

    Abstract: Massively pre-trained transformer models are computationally expensive to fine-tune, slow for inference, and have large storage requirements. Recent approaches tackle these shortcomings by training smaller models, dynamically reducing the model size, and by training light-weight adapters. In this paper, we propose AdapterDrop, removing adapters from lower transformer layers during training and inf… ▽ More

    Submitted 5 October, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: EMNLP 2021

  16. arXiv:2010.08240  [pdf, other

    cs.CL

    Augmented SBERT: Data Augmentation Method for Improving Bi-Encoders for Pairwise Sentence Scoring Tasks

    Authors: Nandan Thakur, Nils Reimers, Johannes Daxenberger, Iryna Gurevych

    Abstract: There are two approaches for pairwise sentence scoring: Cross-encoders, which perform full-attention over the input pair, and Bi-encoders, which map each input independently to a dense vector space. While cross-encoders often achieve higher performance, they are too slow for many practical use cases. Bi-encoders, on the other hand, require substantial training data and fine-tuning over the target… ▽ More

    Submitted 12 April, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

    Comments: Accepted at NAACL 2021

  17. arXiv:2004.09813  [pdf, other

    cs.CL

    Making Monolingual Sentence Embeddings Multilingual using Knowledge Distillation

    Authors: Nils Reimers, Iryna Gurevych

    Abstract: We present an easy and efficient method to extend existing sentence embedding models to new languages. This allows to create multilingual versions from previously monolingual models. The training is based on the idea that a translated sentence should be mapped to the same location in the vector space as the original sentence. We use the original (monolingual) model to generate sentence embeddings… ▽ More

    Submitted 5 October, 2020; v1 submitted 21 April, 2020; originally announced April 2020.

    Comments: Accepted at EMNLP 2020

  18. arXiv:1908.10084  [pdf, other

    cs.CL

    Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

    Authors: Nils Reimers, Iryna Gurevych

    Abstract: BERT (Devlin et al., 2018) and RoBERTa (Liu et al., 2019) has set a new state-of-the-art performance on sentence-pair regression tasks like semantic textual similarity (STS). However, it requires that both sentences are fed into the network, which causes a massive computational overhead: Finding the most similar pair in a collection of 10,000 sentences requires about 50 million inference computati… ▽ More

    Submitted 27 August, 2019; originally announced August 2019.

    Comments: Published at EMNLP 2019

  19. arXiv:1906.09821  [pdf, ps, other

    cs.CL

    Classification and Clustering of Arguments with Contextualized Word Embeddings

    Authors: Nils Reimers, Benjamin Schiller, Tilman Beck, Johannes Daxenberger, Christian Stab, Iryna Gurevych

    Abstract: We experiment with two recent contextualized word embedding methods (ELMo and BERT) in the context of open-domain argument search. For the first time, we show how to leverage the power of contextualized word embeddings to classify and cluster topic-dependent arguments, achieving impressive results on both tasks and across multiple datasets. For argument classification, we improve the state-of-the-… ▽ More

    Submitted 24 June, 2019; originally announced June 2019.

    Comments: Conference paper at ACL 2019

  20. arXiv:1906.01753  [pdf, other

    cs.CL

    Revisiting Joint Modeling of Cross-document Entity and Event Coreference Resolution

    Authors: Shany Barhom, Vered Shwartz, Alon Eirew, Michael Bugert, Nils Reimers, Ido Dagan

    Abstract: Recognizing coreferring events and entities across multiple texts is crucial for many NLP applications. Despite the task's importance, research focus was given mostly to within-document entity coreference, with rather little attention to the other variants. We propose a neural architecture for cross-document coreference resolution. Inspired by Lee et al (2012), we jointly model entity and event co… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: ACL 2019

  21. arXiv:1904.02954  [pdf, other

    cs.CL

    Alternative Weighting Schemes for ELMo Embeddings

    Authors: Nils Reimers, Iryna Gurevych

    Abstract: ELMo embeddings (Peters et. al, 2018) had a huge impact on the NLP community and may recent publications use these embeddings to boost the performance for downstream NLP tasks. However, integration of ELMo embeddings in existent NLP architectures is not straightforward. In contrast to traditional word embeddings, like GloVe or word2vec embeddings, the bi-directional language model of ELMo produces… ▽ More

    Submitted 5 April, 2019; originally announced April 2019.

  22. arXiv:1803.09578  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Why Comparing Single Performance Scores Does Not Allow to Draw Conclusions About Machine Learning Approaches

    Authors: Nils Reimers, Iryna Gurevych

    Abstract: Develo** state-of-the-art approaches for specific tasks is a major driving force in our research community. Depending on the prestige of the task, publishing it can come along with a lot of visibility. The question arises how reliable are our evaluation methodologies to compare approaches? One common methodology to identify the state-of-the-art is to partition data into a train, a development… ▽ More

    Submitted 26 March, 2018; originally announced March 2018.

  23. arXiv:1707.09861  [pdf, other

    cs.CL stat.ML

    Reporting Score Distributions Makes a Difference: Performance Study of LSTM-networks for Sequence Tagging

    Authors: Nils Reimers, Iryna Gurevych

    Abstract: In this paper we show that reporting a single performance score is insufficient to compare non-deterministic approaches. We demonstrate for common sequence tagging tasks that the seed value for the random number generator can result in statistically significant (p < 10^-4) differences for state-of-the-art systems. For two recent systems for NER, we observe an absolute difference of one percentage… ▽ More

    Submitted 31 July, 2017; originally announced July 2017.

    Comments: Accepted at EMNLP 2017

  24. arXiv:1707.06799  [pdf, other

    cs.CL

    Optimal Hyperparameters for Deep LSTM-Networks for Sequence Labeling Tasks

    Authors: Nils Reimers, Iryna Gurevych

    Abstract: Selecting optimal parameters for a neural network architecture can often make the difference between mediocre and state-of-the-art performance. However, little is published which parameters and design choices should be evaluated or selected making the correct hyperparameter optimization often a "black art that requires expert experiences" (Snoek et al., 2012). In this paper, we evaluate the import… ▽ More

    Submitted 16 August, 2017; v1 submitted 21 July, 2017; originally announced July 2017.

    Comments: 34 pages. 9 page version of this paper published at EMNLP 2017