Skip to main content

Showing 1–3 of 3 results for author: Schlatt, F

.
  1. arXiv:2405.07920  [pdf, other

    cs.IR

    A Systematic Investigation of Distilling Large Language Models into Cross-Encoders for Passage Re-ranking

    Authors: Ferdinand Schlatt, Maik Fröbe, Harrisen Scells, Shengyao Zhuang, Bevan Koopman, Guido Zuccon, Benno Stein, Martin Potthast, Matthias Hagen

    Abstract: Cross-encoders distilled from large language models (LLMs) are often more effective re-rankers than cross-encoders fine-tuned on manually labeled data. However, the distilled models usually do not reach their teacher LLM's effectiveness. To investigate whether best practices for fine-tuning cross-encoders on manually labeled data (e.g., hard-negative sampling, deep sampling, and listwise loss func… ▽ More

    Submitted 16 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  2. arXiv:2404.06912  [pdf, other

    cs.IR

    Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders

    Authors: Ferdinand Schlatt, Maik Fröbe, Harrisen Scells, Shengyao Zhuang, Bevan Koopman, Guido Zuccon, Benno Stein, Martin Potthast, Matthias Hagen

    Abstract: Existing cross-encoder re-rankers can be categorized as pointwise, pairwise, or listwise models. Pair- and listwise models allow passage interactions, which usually makes them more effective than pointwise models but also less efficient and less robust to input order permutations. To enable efficient permutation-invariant passage interactions during re-ranking, we propose a new cross-encoder archi… ▽ More

    Submitted 16 June, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  3. Investigating the Effects of Sparse Attention on Cross-Encoders

    Authors: Ferdinand Schlatt, Maik Fröbe, Matthias Hagen

    Abstract: Cross-encoders are effective passage and document re-rankers but less efficient than other neural or classic retrieval models. A few previous studies have applied windowed self-attention to make cross-encoders more efficient. However, these studies did not investigate the potential and limits of different attention patterns or window sizes. We close this gap and systematically analyze how token in… ▽ More

    Submitted 20 March, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

    Comments: Accepted at ECIR'24