Skip to main content

Showing 1–8 of 8 results for author: Rahier, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.15171  [pdf, ps, other

    cs.LG math.ST stat.ML

    Towards Efficient and Optimal Covariance-Adaptive Algorithms for Combinatorial Semi-Bandits

    Authors: Julien Zhou, Pierre Gaillard, Thibaud Rahier, Houssam Zenati, Julyan Arbel

    Abstract: We address the problem of stochastic combinatorial semi-bandits, where a player selects among $P$ actions from the power set of a set containing $d$ base items. Adaptivity to the problem's structure is essential in order to obtain optimal regret upper bounds. As estimating the coefficients of a covariance matrix can be manageable in practice, leveraging them should improve the regret. We design ``… ▽ More

    Submitted 3 July, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  2. arXiv:2312.16267  [pdf, other

    cs.IR cs.GT cs.LG stat.ML

    Maximizing the Success Probability of Policy Allocations in Online Systems

    Authors: Artem Betlei, Mariia Vladimirova, Mehdi Sebbar, Nicolas Urien, Thibaud Rahier, Benjamin Heymann

    Abstract: The effectiveness of advertising in e-commerce largely depends on the ability of merchants to bid on and win impressions for their targeted users. The bidding procedure is highly complex due to various factors such as market competition, user behavior, and the diverse objectives of advertisers. In this paper we consider the problem at the level of user timelines instead of individual bid requests,… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: accepted to AAAI 2024, 7 pages main text, 9 pages references and appendix, 22 figures

  3. arXiv:2206.09348  [pdf, other

    cs.LG cs.GT math.OC

    Nested bandits

    Authors: Matthieu Martin, Panayotis Mertikopoulos, Thibaud Rahier, Houssam Zenati

    Abstract: In many online decision processes, the optimizing agent is called to choose between large numbers of alternatives with many inherent similarities; in turn, these similarities imply closely correlated losses that may confound standard discrete choice models and bandit algorithms. We study this question in the context of nested bandits, a class of adversarial multi-armed bandit problems where the le… ▽ More

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: 35 pages, 14 figures; to appear in ICML 2022

    MSC Class: Primary 68Q32; secondary 91B06

  4. arXiv:2205.09739  [pdf, other

    cs.CV cs.AI cs.LG

    Diverse Weight Averaging for Out-of-Distribution Generalization

    Authors: Alexandre Ramé, Matthieu Kirchmeyer, Thibaud Rahier, Alain Rakotomamonjy, Patrick Gallinari, Matthieu Cord

    Abstract: Standard neural networks struggle to generalize under distribution shifts in computer vision. Fortunately, combining multiple networks can consistently improve out-of-distribution generalization. In particular, weight averaging (WA) strategies were shown to perform best on the competitive DomainBed benchmark; they directly average the weights of multiple networks despite their nonlinearities. In t… ▽ More

    Submitted 27 January, 2023; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: 36 pages, 16 figures, 15 tables

  5. arXiv:2111.10106  [pdf, other

    stat.ML cs.AI cs.LG stat.AP

    A Large Scale Benchmark for Individual Treatment Effect Prediction and Uplift Modeling

    Authors: Eustache Diemert, Artem Betlei, Christophe Renaudin, Massih-Reza Amini, Théophane Gregoir, Thibaud Rahier

    Abstract: Individual Treatment Effect (ITE) prediction is an important area of research in machine learning which aims at explaining and estimating the causal impact of an action at the granular level. It represents a problem of growing interest in multiple sectors of application such as healthcare, online advertising or socioeconomics. To foster research on this topic we release a publicly available collec… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

  6. arXiv:2109.05829  [pdf, other

    cs.LG math.OC

    Zeroth-order non-convex learning via hierarchical dual averaging

    Authors: Amélie Héliou, Matthieu Martin, Panayotis Mertikopoulos, Thibaud Rahier

    Abstract: We propose a hierarchical version of dual averaging for zeroth-order online non-convex optimization - i.e., learning processes where, at each stage, the optimizer is facing an unknown non-convex loss function and only receives the incurred loss as feedback. The proposed class of policies relies on the construction of an online model that aggregates loss information as it arrives, and it consists o… ▽ More

    Submitted 13 September, 2021; originally announced September 2021.

    Comments: 40 pages, 14 figures

    MSC Class: Primary 68Q32; 90C56; secondary 90C15; 90C26

  7. arXiv:2010.08496  [pdf, other

    cs.LG math.OC

    Online non-convex optimization with imperfect feedback

    Authors: Amélie Héliou, Matthieu Martin, Panayotis Mertikopoulos, Thibaud Rahier

    Abstract: We consider the problem of online learning with non-convex losses. In terms of feedback, we assume that the learner observes - or otherwise constructs - an inexact model for the loss function encountered at each stage, and we propose a mixed-strategy learning policy based on dual averaging. In this general context, we derive a series of tight regret minimization guarantees, both for the learner's… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: 30 pages, 2 figures, 1 table

    MSC Class: Primary 68Q32; secondary 90C26; 91A26

  8. arXiv:2008.03235  [pdf, other

    stat.ML cs.LG stat.ME

    Individual Treatment Prescription Effect Estimation in a Low Compliance Setting

    Authors: Thibaud Rahier, Amélie Héliou, Matthieu Martin, Christophe Renaudin, Eustache Diemert

    Abstract: Individual Treatment Effect (ITE) estimation is an extensively researched problem, with applications in various domains. We model the case where there exists heterogeneous non-compliance to a randomly assigned treatment, a typical situation in health (because of non-compliance to prescription) or digital advertising (because of competition and ad blockers for instance). The lower the compliance, t… ▽ More

    Submitted 23 October, 2020; v1 submitted 7 August, 2020; originally announced August 2020.

    Comments: 28 pages, 10 figures