Search | arXiv e-print repository

Large Language Models as Recommender Systems: A Study of Popularity Bias

Authors: Jan Malte Lichtenberg, Alexander Buchholz, Pola Schwöbel

Abstract: The issue of popularity bias -- where popular items are disproportionately recommended, overshadowing less popular but potentially relevant items -- remains a significant challenge in recommender systems. Recent advancements have seen the integration of general-purpose Large Language Models (LLMs) into the architecture of such systems. This integration raises concerns that it might exacerbate popu… ▽ More The issue of popularity bias -- where popular items are disproportionately recommended, overshadowing less popular but potentially relevant items -- remains a significant challenge in recommender systems. Recent advancements have seen the integration of general-purpose Large Language Models (LLMs) into the architecture of such systems. This integration raises concerns that it might exacerbate popularity bias, given that the LLM's training data is likely dominated by popular items. However, it simultaneously presents a novel opportunity to address the bias via prompt tuning. Our study explores this dichotomy, examining whether LLMs contribute to or can alleviate popularity bias in recommender systems. We introduce a principled way to measure popularity bias by discussing existing metrics and proposing a novel metric that fulfills a series of desiderata. Based on our new metric, we compare a simple LLM-based recommender to traditional recommender systems on a movie recommendation task. We find that the LLM recommender exhibits less popularity bias, even without any explicit mitigation. △ Less

Submitted 3 June, 2024; originally announced June 2024.

Comments: Accepted at Gen-IR@SIGIR24 workshop

arXiv:2310.14777 [pdf, other]

Geographical Erasure in Language Generation

Authors: Pola Schwöbel, Jacek Golebiowski, Michele Donini, Cédric Archambeau, Danish Pruthi

Abstract: Large language models (LLMs) encode vast amounts of world knowledge. However, since these models are trained on large swaths of internet data, they are at risk of inordinately capturing information about dominant groups. This imbalance can propagate into generated language. In this work, we study and operationalise a form of geographical erasure, wherein language models underpredict certain countr… ▽ More Large language models (LLMs) encode vast amounts of world knowledge. However, since these models are trained on large swaths of internet data, they are at risk of inordinately capturing information about dominant groups. This imbalance can propagate into generated language. In this work, we study and operationalise a form of geographical erasure, wherein language models underpredict certain countries. We demonstrate consistent instances of erasure across a range of LLMs. We discover that erasure strongly correlates with low frequencies of country mentions in the training corpus. Lastly, we mitigate erasure by finetuning using a custom objective. △ Less

Submitted 23 October, 2023; originally announced October 2023.

Comments: EMNLP 2023 Findings

arXiv:2203.06038 [pdf, ps, other]

The Long Arc of Fairness: Formalisations and Ethical Discourse

Authors: Pola Schwöbel, Peter Remmers

Abstract: In recent years, the idea of formalising and modelling fairness for algorithmic decision making (ADM) has advanced to a point of sophisticated specialisation. However, the relations between technical (formalised) and ethical discourse on fairness are not always clear and productive. Arguing for an alternative perspective, we review existing fairness metrics and discuss some common issues. For inst… ▽ More In recent years, the idea of formalising and modelling fairness for algorithmic decision making (ADM) has advanced to a point of sophisticated specialisation. However, the relations between technical (formalised) and ethical discourse on fairness are not always clear and productive. Arguing for an alternative perspective, we review existing fairness metrics and discuss some common issues. For instance, the fairness of procedures and distributions is often formalised and discussed statically, disregarding both structural preconditions of the status quo and downstream effects of a given intervention. We then introduce dynamic fairness modelling, a more comprehensive approach that realigns formal fairness metrics with arguments from the ethical discourse. A dynamic fairness model incorporates (1) ethical goals, (2) formal metrics to quantify decision procedures and outcomes and (3) mid-term or long-term downstream effects. By contextualising these elements of fairness-related processes, dynamic fairness modelling explicates formerly latent ethical aspects and thereby provides a helpful tool to navigate trade-offs between different fairness interventions. To illustrate the framework, we discuss an example application -- the current European efforts to increase the number of women on company boards, e.g. via quota solutions -- and present early technical work that fits within our framework. △ Less

Submitted 8 March, 2022; originally announced March 2022.

arXiv:2106.07512 [pdf, other]

Last Layer Marginal Likelihood for Invariance Learning

Authors: Pola Schwöbel, Martin Jørgensen, Sebastian W. Ober, Mark van der Wilk

Abstract: Data augmentation is often used to incorporate inductive biases into models. Traditionally, these are hand-crafted and tuned with cross validation. The Bayesian paradigm for model selection provides a path towards end-to-end learning of invariances using only the training data, by optimising the marginal likelihood. Computing the marginal likelihood is hard for neural networks, but success with tr… ▽ More Data augmentation is often used to incorporate inductive biases into models. Traditionally, these are hand-crafted and tuned with cross validation. The Bayesian paradigm for model selection provides a path towards end-to-end learning of invariances using only the training data, by optimising the marginal likelihood. Computing the marginal likelihood is hard for neural networks, but success with tractable approaches that compute the marginal likelihood for the last layer only raises the question of whether this convenient approach might be employed for learning invariances. We show partial success on standard benchmarks, in the low-data regime and on a medical imaging dataset by designing a custom optimisation routine. Introducing a new lower bound to the marginal likelihood allows us to perform inference for a larger class of likelihood functions than before. On the other hand, we demonstrate failure modes on the CIFAR10 dataset, where the last layer approximation is not sufficient due to the increased complexity of our neural network. Our results indicate that once more sophisticated approximations become available the marginal likelihood is a promising approach for invariance learning in neural networks. △ Less

Submitted 1 March, 2022; v1 submitted 14 June, 2021; originally announced June 2021.

Comments: AISTATS '22

arXiv:2004.03637 [pdf, other]

Probabilistic Spatial Transformer Networks

Authors: Pola Schwöbel, Frederik Warburg, Martin Jørgensen, Kristoffer H. Madsen, Søren Hauberg

Abstract: Spatial Transformer Networks (STNs) estimate image transformations that can improve downstream tasks by `zooming in' on relevant regions in an image. However, STNs are hard to train and sensitive to mis-predictions of transformations. To circumvent these limitations, we propose a probabilistic extension that estimates a stochastic transformation rather than a deterministic one. Marginalizing trans… ▽ More Spatial Transformer Networks (STNs) estimate image transformations that can improve downstream tasks by `zooming in' on relevant regions in an image. However, STNs are hard to train and sensitive to mis-predictions of transformations. To circumvent these limitations, we propose a probabilistic extension that estimates a stochastic transformation rather than a deterministic one. Marginalizing transformations allows us to consider each image at multiple poses, which makes the localization task easier and the training more robust. As an additional benefit, the stochastic transformations act as a localized, learned data augmentation that improves the downstream tasks. We show across standard imaging benchmarks and on a challenging real-world dataset that these two properties lead to improved classification performance, robustness and model calibration. We further demonstrate that the approach generalizes to non-visual domains by improving model performance on time-series data. △ Less

Submitted 15 June, 2022; v1 submitted 7 April, 2020; originally announced April 2020.

Comments: UAI 2022

arXiv:0812.0829 [pdf]

doi 10.1016/j.nima.2007.12.015

Field desorption ion source development for neutron generators

Authors: I. Solano, B. Reichenbach, P. R. Schwoebel, D. L. Chichester, C. E. Holland, K. L. Hertz, J. P. Brainard

Abstract: A new approach to deuterium ion sources for deuterium-tritium neutron generators is being developed. The source is based upon the field desorption of deuterium from the surfaces of metal tips. Field desorption studies of microfabricated field emitter tip arrays have been conducted for the first time. Maximum fields of 30 V/nm have been applied to the array tip surfaces to date, although achievin… ▽ More A new approach to deuterium ion sources for deuterium-tritium neutron generators is being developed. The source is based upon the field desorption of deuterium from the surfaces of metal tips. Field desorption studies of microfabricated field emitter tip arrays have been conducted for the first time. Maximum fields of 30 V/nm have been applied to the array tip surfaces to date, although achieving fields of 20 V/nm to possibly 25 V/nm is more typical. Both the desorption of atomic deuterium ions and the gas phase field ionization of molecular deuterium has been observed at fields of roughly 20 V/nm and 20-30 V/nm, respectively, at room temperature. The desorption of common surface adsorbates, such as hydrogen, carbon, water, and carbon monoxide is observed at fields exceeding ~10 V/nm. In vacuo heating of the arrays to temperatures of the order of 800 C can be effective in removing many of the surface contaminants observed. △ Less

Submitted 3 December, 2008; originally announced December 2008.

Journal ref: Nucl.Instrum.Meth.A587:76-81,2008

arXiv:0811.4193 [pdf]

doi 10.1063/1.2913331

A field evaporation deuterium ion source for neutron generators

Authors: Birk Reichenbach, I. Solano, P. R. Schwoebel

Abstract: Proof-of-principle experiments have demonstrated an electrostatic field evaporation based deuterium ion source for use in compact, high-output deuterium-tritium neutron generators. The ion source produces principally atomic deuterium and titanium ions. More than 100 monolayers of deuterated titanium thin film can be removed and ionized from a single tip in less than 20 ns. The measurements indic… ▽ More Proof-of-principle experiments have demonstrated an electrostatic field evaporation based deuterium ion source for use in compact, high-output deuterium-tritium neutron generators. The ion source produces principally atomic deuterium and titanium ions. More than 100 monolayers of deuterated titanium thin film can be removed and ionized from a single tip in less than 20 ns. The measurements indicate that with the use of microfabricated tip arrays the deuterium ion source could provide sufficient ion current to produce 10^9 to 10^10 n/cm^2 of tip array area. △ Less

Submitted 25 November, 2008; originally announced November 2008.

Journal ref: J.Appl.Phys.103:094912,2008

Showing 1–7 of 7 results for author: Schwöbel, P