Showing 1–14 of 14 results for author: Maystre, L

  1. arXiv:2404.02649

    cs.LG

    On the Importance of Uncertainty in Decision-Making with Large Language Models

    Authors: Nicolò Felicioni, Lucas Maystre, Sina Ghiassian, Kamil Ciosek

    Abstract: We investigate the role of uncertainty in decision-making problems with natural language as input. For such tasks, using Large Language Models as agents has become the norm. However, none of the recent approaches employ any additional phase for estimating the uncertainty the agent has about the world during the decision-making task. We focus on a fundamental decision-making framework with natural…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 12 pages of main content, 25 pages with references and appendix

  2. Impatient Bandits: Optimizing Recommendations for the Long-Term Without Delay

    Authors: Thomas M. McDonald, Lucas Maystre, Mounia Lalmas, Daniel Russo, Kamil Ciosek

    Abstract: Recommender systems are a ubiquitous feature of online platforms. Increasingly, they are explicitly tasked with increasing users' long-term satisfaction. In this context, we study a content exploration task, which we formalize as a multi-armed bandit problem with delayed rewards. We observe that there is an apparent trade-off in choosing the learning signal: Waiting for the full reward to become a…

    Submitted 20 July, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: Presented at the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '23)

  3. arXiv:2306.01814

    cs.IR cs.HC cs.LG

    Fast Interactive Search with a Scale-Free Comparison Oracle

    Authors: Daniyar Chumbalov, Lars Klein, Lucas Maystre, Matthias Grossglauser

    Abstract: A comparison-based search algorithm lets a user find a target item $t$ in a database by answering queries of the form, "Which of items $i$ and $j$ is closer to $t$?" Instead of formulating an explicit query (such as one or several keywords), the user navigates towards the target via a sequence of such (typically noisy) queries. We propose a scale-free probabilistic oracle model called $γ$-CKL…

    Submitted 2 June, 2023; originally announced June 2023.

  4. arXiv:2302.10625

    stat.ML cs.LG

    Estimating long-term causal effects from short-term experiments and long-term observational data with unobserved confounding

    Authors: Graham Van Goffrier, Lucas Maystre, Ciarán Gilligan-Lee

    Abstract: Understanding and quantifying cause and effect is an important problem in many domains. The generally agreed solution to this problem is to perform a randomised controlled trial. However, even when randomised controlled trials can be performed, they usually have relatively short durations due to cost considerations. This makes learning long-term causal effects a very challenging task in practice,…

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: 23 pages, 8 figures, 2nd Conference on Causal Learning and Reasoning

  5. arXiv:2302.03561

    cs.LG cs.AI cs.IR eess.SY stat.ML

    Optimizing Audio Recommendations for the Long-Term: A Reinforcement Learning Perspective

    Authors: Lucas Maystre, Daniel Russo, Yu Zhao

    Abstract: We study the problem of optimizing a recommender system for outcomes that occur over several weeks or months. We begin by drawing on reinforcement learning to formulate a comprehensive model of users' recurring relationships with a recommender system. Measurement, attribution, and coordination challenges complicate algorithm design. We describe careful modeling -- including a new representation of…

    Submitted 28 February, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

  6. arXiv:2302.02788

    cs.LG

    A Strong Baseline for Batch Imitation Learning

    Authors: Matthew Smith, Lucas Maystre, Zhenwen Dai, Kamil Ciosek

    Abstract: Imitation of expert behaviour is a highly desirable and safe approach to the problem of sequential decision making. We provide an easy-to-implement, novel algorithm for imitation learning under a strict data paradigm, in which the agent must learn solely from data collected a priori. This paradigm allows our algorithm to be used for environments in which safety or cost are of critical concern. Our…

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: 28 pages (10 main, 18 appendix), 4 figures

  7. arXiv:1911.11658

    stat.ML cs.CY cs.LG physics.soc-ph

    A User Study of Perceived Carbon Footprint

    Authors: Victor Kristof, Valentin Quelquejay-Leclère, Robin Zbinden, Lucas Maystre, Matthias Grossglauser, Patrick Thiran

    Abstract: We propose a statistical model to understand people's perception of their carbon footprint. Driven by the observation that few people think of CO2 impact in absolute terms, we design a system to probe people's perception from simple pairwise comparisons of the relative carbon footprint of their actions. The formulation of the model enables us to take an active-learning approach to selecting the pa…

    Submitted 4 December, 2019; v1 submitted 26 November, 2019; originally announced November 2019.

  8. arXiv:1905.05049

    stat.ML cs.LG

    Scalable and Efficient Comparison-based Search without Features

    Authors: Daniyar Chumbalov, Lucas Maystre, Matthias Grossglauser

    Abstract: We consider the problem of finding a target object $t$ using pairwise comparisons, by asking an oracle questions of the form "Which object from the pair $(i,j)$ is more similar to $t$?". Objects live in a space of latent features, from which the oracle generates noisy answers. First, we consider the non-blind setting where these features are accessible. We propose a new Bayesian compa…

    Submitted 3 September, 2020; v1 submitted 13 May, 2019; originally announced May 2019.

  9. arXiv:1903.07746

    stat.ML cs.LG

    Pairwise Comparisons with Flexible Time-Dynamics

    Authors: Lucas Maystre, Victor Kristof, Matthias Grossglauser

    Abstract: Inspired by applications in sports where the skill of players or teams competing against each other varies over time, we propose a probabilistic model of pairwise-comparison outcomes that can capture a wide range of time dynamics. We achieve this by replacing the static parameters of a class of popular pairwise-comparison models by continuous-time Gaussian processes; the covariance function of the…

    Submitted 17 May, 2019; v1 submitted 18 March, 2019; originally announced March 2019.

    Comments: Accepted at KDD 2019

  10. arXiv:1801.04159

    stat.AP cs.SI stat.ML

    Can Who-Edits-What Predict Edit Survival?

    Authors: Ali Batuhan Yardım, Victor Kristof, Lucas Maystre, Matthias Grossglauser

    Abstract: As the number of contributors to online peer-production systems grows, it becomes increasingly important to predict whether the edits that users make will eventually be beneficial to the project. Existing solutions either rely on a user reputation system or consist of a highly specialized predictor that is tailored to a specific peer-production system. In this work, we explore a different point in…

    Submitted 5 July, 2018; v1 submitted 12 January, 2018; originally announced January 2018.

    Comments: Accepted at KDD 2018

  11. arXiv:1610.06525

    stat.ML cs.LG cs.SI

    ChoiceRank: Identifying Preferences from Node Traffic in Networks

    Authors: Lucas Maystre, Matthias Grossglauser

    Abstract: Understanding how users navigate in a network is of high interest in many applications. We consider a setting where only aggregate node-level traffic is observed and tackle the task of learning edge transition probabilities. We cast it as a preference learning problem, and we study a model where choices follow Luce's axiom. In this case, the $O(n)$ marginal counts of node visits are a sufficient s…

    Submitted 15 June, 2017; v1 submitted 20 October, 2016; originally announced October 2016.

    Comments: Accepted at ICML 2017

  12. arXiv:1609.01176

    cs.LG stat.AP

    The Player Kernel: Learning Team Strengths Based on Implicit Player Contributions

    Authors: Lucas Maystre, Victor Kristof, Antonio J. González Ferrer, Matthias Grossglauser

    Abstract: In this work, we draw attention to a connection between skill-based models of game outcomes and Gaussian process classification models. The Gaussian process perspective enables a) a principled way of dealing with uncertainty and b) rich models, specified through kernel functions. Using this connection, we tackle the problem of predicting outcomes of football matches between national teams. We deve…

    Submitted 5 September, 2016; originally announced September 2016.

  13. arXiv:1502.05556

    stat.ML cs.LG

    Just Sort It! A Simple and Effective Approach to Active Preference Learning

    Authors: Lucas Maystre, Matthias Grossglauser

    Abstract: We address the problem of learning a ranking by using adaptively chosen pairwise comparisons. Our goal is to recover the ranking accurately but to sample the comparisons sparingly. If all comparison outcomes are consistent with the ranking, the optimal solution is to use an efficient sorting algorithm, such as Quicksort. But how do sorting algorithms behave if some comparison outcomes are inconsis…

    Submitted 15 June, 2017; v1 submitted 19 February, 2015; originally announced February 2015.

    Comments: Accepted at ICML 2017

  14. arXiv:1307.2084

    cs.SI cs.CY physics.soc-ph

    Mitigating Epidemics through Mobile Micro-measures

    Authors: Mohamed Kafsi, Ehsan Kazemi, Lucas Maystre, Lyudmila Yartseva, Matthias Grossglauser, Patrick Thiran

    Abstract: Epidemics of infectious diseases are among the largest threats to the quality of life and the economic and social well-being of developing countries. The arsenal of measures against such epidemics is well-established, but costly and insufficient to mitigate their impact. In this paper, we argue that mobile technology adds a powerful weapon to this arsenal, because (a) mobile devices endow us with…

    Submitted 8 July, 2013; originally announced July 2013.

    Comments: Presented at NetMob 2013, Boston