Showing 1–14 of 14 results for author: Althammer, S

  1. arXiv:2404.18796  [pdf, other]

    cs.CL cs.AI

    Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

    Authors: Pat Verga, Sebastian Hofstatter, Sophia Althammer, Yixuan Su, Aleksandra Piktus, Arkady Arkhangorodsky, Minjie Xu, Naomi White, Patrick Lewis

    Abstract: As Large Language Models (LLMs) have become more advanced, they have outpaced our abilities to accurately evaluate their quality. Not only is finding data to adequately probe particular model properties difficult, but evaluating the correctness of a model's freeform generation alone is a challenge. To address this, many evaluations now rely on using LLMs themselves as judges to score the quality o…

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  2. arXiv:2309.06131  [pdf, other]

    cs.IR cs.CL

    Annotating Data for Fine-Tuning a Neural Ranker? Current Active Learning Strategies are not Better than Random Selection

    Authors: Sophia Althammer, Guido Zuccon, Sebastian Hofstätter, Suzan Verberne, Allan Hanbury

    Abstract: Search methods based on Pretrained Language Models (PLM) have demonstrated great effectiveness gains compared to statistical and early neural ranking models. However, fine-tuning PLM-based rankers requires a great amount of annotated training data. Annotating data involves a large manual effort and thus is expensive, especially in domain specific tasks. In this paper we investigate fine-tuning PLM…

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted at SIGIR-AP 2023

  3. arXiv:2305.15048  [pdf, other]

    cs.CL cs.IR

    Ranger: A Toolkit for Effect-Size Based Multi-Task Evaluation

    Authors: Mete Sertkan, Sophia Althammer, Sebastian Hofstätter

    Abstract: In this paper, we introduce Ranger - a toolkit to facilitate the easy use of effect-size-based meta-analysis for multi-task evaluation in NLP and IR. We observed that our communities often face the challenge of aggregating results over incomparable metrics and scenarios, which makes conclusions and take-away messages less reliable. With Ranger, we aim to address this issue by providing a task-agno…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023 (System Demonstrations)

  4. TripJudge: A Relevance Judgement Test Collection for TripClick Health Retrieval

    Authors: Sophia Althammer, Sebastian Hofstätter, Suzan Verberne, Allan Hanbury

    Abstract: Robust test collections are crucial for Information Retrieval research. Recently there is a growing interest in evaluating retrieval systems for domain-specific retrieval tasks, however these tasks often lack a reliable test collection with human-annotated relevance assessments following the Cranfield paradigm. In the medical domain, the TripClick collection was recently proposed, which contains c…

    Submitted 14 August, 2022; originally announced August 2022.

    Comments: To be published at CIKM 2022 as resource paper

  5. arXiv:2203.13088  [pdf, other]

    cs.IR cs.AI cs.CL cs.LG

    Introducing Neural Bag of Whole-Words with ColBERTer: Contextualized Late Interactions using Enhanced Reduction

    Authors: Sebastian Hofstätter, Omar Khattab, Sophia Althammer, Mete Sertkan, Allan Hanbury

    Abstract: Recent progress in neural information retrieval has demonstrated large gains in effectiveness, while often sacrificing the efficiency and interpretability of the neural model compared to classical approaches. This paper proposes ColBERTer, a neural retrieval model using contextualized late interaction (ColBERT) with enhanced reduction. Along the effectiveness Pareto frontier, ColBERTer's reduction…

    Submitted 24 March, 2022; originally announced March 2022.

  6. arXiv:2201.01614  [pdf, other]

    cs.IR

    PARM: A Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval

    Authors: Sophia Althammer, Sebastian Hofstätter, Mete Sertkan, Suzan Verberne, Allan Hanbury

    Abstract: Dense passage retrieval (DPR) models show great effectiveness gains in first stage retrieval for the web domain. However in the web domain we are in a setting with large amounts of training data and a query-to-passage or a query-to-document retrieval task. We investigate in this paper dense document-to-document retrieval with limited labelled target data for training, in particular legal case retr…

    Submitted 14 August, 2022; v1 submitted 5 January, 2022; originally announced January 2022.

    Comments: Accepted at ECIR 2022

  7. arXiv:2201.00365  [pdf, ps, other]

    cs.IR cs.CL

    Establishing Strong Baselines for TripClick Health Retrieval

    Authors: Sebastian Hofstätter, Sophia Althammer, Mete Sertkan, Allan Hanbury

    Abstract: We present strong Transformer-based re-ranking and dense retrieval baselines for the recently released TripClick health ad-hoc retrieval collection. We improve the - originally too noisy - training data with a simple negative sampling policy. We achieve large gains over BM25 in the re-ranking task of TripClick, which were not achieved with the original baselines. Furthermore, we study the impact o…

    Submitted 2 January, 2022; originally announced January 2022.

    Comments: Accepted at ECIR 2022

  8. arXiv:2110.05601  [pdf]

    cs.HC cs.IR

    A Time-Optimized Content Creation Workflow for Remote Teaching

    Authors: Sebastian Hofstätter, Sophia Althammer, Mete Sertkan, Allan Hanbury

    Abstract: We describe our workflow to create an engaging remote learning experience for a university course, while minimizing the post-production time of the educators. We make use of ubiquitous and commonly free services and platforms, so that our workflow is inclusive for all educators and provides polished experiences for students. Our learning materials provide for each lecture: 1) a recorded video, upl…

    Submitted 13 October, 2021; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: Accepted at SIGCSE-TS 2022

  9. arXiv:2109.12026  [pdf, other]

    cs.LG cs.IR

    Description-based Label Attention Classifier for Explainable ICD-9 Classification

    Authors: Malte Feucht, Zhiliang Wu, Sophia Althammer, Volker Tresp

    Abstract: ICD-9 coding is a relevant clinical billing task, where unstructured texts with information about a patient's diagnosis and treatments are annotated with multiple ICD-9 codes. Automated ICD-9 coding is an active research field, where CNN- and RNN-based model architectures represent the state-of-the-art approaches. In this work, we propose a description-based label attention classifier to improve t…

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: Accepted at the Workshop on Noisy User-generated Text (W-NUT) at EMNLP 2021

  10. arXiv:2108.03937  [pdf, other]

    cs.IR

    DoSSIER@COLIEE 2021: Leveraging dense retrieval and summarization-based re-ranking for case law retrieval

    Authors: Sophia Althammer, Arian Askari, Suzan Verberne, Allan Hanbury

    Abstract: In this paper, we present our approaches for the case law retrieval and the legal case entailment task in the Competition on Legal Information Extraction/Entailment (COLIEE) 2021. As first stage retrieval methods combined with neural re-ranking methods using contextualized language models like BERT achieved great performance improvements for information retrieval in the web and news domain, we eva…

    Submitted 9 August, 2021; originally announced August 2021.

    Comments: Published in COLIEE 2021

  11. arXiv:2106.05768  [pdf, other]

    cs.CL cs.IR

    Linguistically Informed Masking for Representation Learning in the Patent Domain

    Authors: Sophia Althammer, Mark Buckley, Sebastian Hofstätter, Allan Hanbury

    Abstract: Domain-specific contextualized language models have demonstrated substantial effectiveness gains for domain-specific downstream tasks, like similarity matching, entity recognition or information retrieval. However successfully applying such models in highly specific language domains requires domain adaptation of the pre-trained models. In this paper we propose the empirically motivated Linguistica…

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: Published at SIGIR 2021 PatentSemTech workshop

  12. arXiv:2101.06980  [pdf, other]

    cs.IR cs.CL

    Mitigating the Position Bias of Transformer Models in Passage Re-Ranking

    Authors: Sebastian Hofstätter, Aldo Lipani, Sophia Althammer, Markus Zlabinger, Allan Hanbury

    Abstract: Supervised machine learning models and their evaluation strongly depend on the quality of the underlying dataset. When we search for a relevant piece of information it may appear anywhere in a given passage. However, we observe a bias in the position of the correct answer in the text in two popular Question Answering datasets used for passage re-ranking. The excessive favoring of earlier position…

    Submitted 18 January, 2021; originally announced January 2021.

    Comments: Accepted at ECIR 2021 (Full paper track)

  13. arXiv:2012.11405  [pdf, other]

    cs.IR

    Cross-domain Retrieval in the Legal and Patent Domains: a Reproducibility Study

    Authors: Sophia Althammer, Sebastian Hofstätter, Allan Hanbury

    Abstract: Domain specific search has always been a challenging information retrieval task due to several challenges such as the domain specific language, the unique task setting, as well as the lack of accessible queries and corresponding relevance judgements. In the last years, pretrained language models, such as BERT, revolutionized web and news search. Naturally, the community aims to adapt these advance…

    Submitted 19 January, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: Accepted at ECIR 2021 (Reproducibility paper track)

  14. arXiv:2010.02666  [pdf, other]

    cs.IR

    Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation

    Authors: Sebastian Hofstätter, Sophia Althammer, Michael Schröder, Mete Sertkan, Allan Hanbury

    Abstract: Retrieval and ranking models are the backbone of many applications such as web search, open domain QA, or text-based recommender systems. The latency of neural ranking models at query time is largely dependent on the architecture and deliberate choices by their designers to trade-off effectiveness for higher efficiency. This focus on low query latency of a rising number of efficient ranking archit…

    Submitted 22 January, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: Updated paper with dense retrieval results and query-level analysis