Showing 1–21 of 21 results for author: Carpentier, A

  1. arXiv:2406.18672

    math.OC cs.LG stat.ML

    A simple and improved algorithm for noisy, convex, zeroth-order optimisation

    Authors: Alexandra Carpentier

    Abstract: In this paper, we study the problem of noisy, convex, zeroth-order optimisation of a function $f$ over a bounded convex set $\bar{\mathcal X}\subset \mathbb{R}^d$. Given a budget $n$ of noisy queries to the function $f$ that can be allocated sequentially and adaptively, our aim is to construct an algorithm that returns a point $\hat x\in \bar{\mathcal X}$ such that $f(\hat x)$ is as small as possi…

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2406.11485

    stat.ML cs.LG

    Active clustering with bandit feedback

    Authors: Victor Thuot, Alexandra Carpentier, Christophe Giraud, Nicolas Verzelen

    Abstract: We investigate the Active Clustering Problem (ACP). A learner interacts with an $N$-armed stochastic bandit with $d$-dimensional subGaussian feedback. There exists a hidden partition of the arms into $K$ groups, such that arms within the same group share the same mean vector. The learner's task is to uncover this hidden partition with the smallest budget - i.e., the least number of observation -…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 50 pages

  3. arXiv:2306.02971

    cs.LG cs.IT math.ST

    Online Learning with Feedback Graphs: The True Shape of Regret

    Authors: Tomáš Kocák, Alexandra Carpentier

    Abstract: Sequential learning with feedback graphs is a natural extension of the multi-armed bandit problem where the problem is equipped with an underlying graph structure that provides additional information - playing an action reveals the losses of all the neighbors of the action. This problem was introduced by Mannor and Shamir (2011) and received considerable attention in recent years. It is generally stated…

    Submitted 5 June, 2023; originally announced June 2023.

  4. arXiv:2306.02628

    stat.ML cs.LG

    Active Ranking of Experts Based on their Performances in Many Tasks

    Authors: El Mehdi Saad, Nicolas Verzelen, Alexandra Carpentier

    Abstract: We consider the problem of ranking $n$ experts based on their performances on $d$ tasks. We make a monotonicity assumption stating that for each pair of experts, one outperforms the other on all tasks. We consider the sequential setting where in each round, the learner has access to noisy evaluations of an actively chosen expert-task pair, given the information available up to the current round. Given…

    Submitted 5 June, 2023; originally announced June 2023.

  5. arXiv:2109.04346

    math.ST cs.IT

    Goodness-of-Fit Testing for Hölder-Continuous Densities: Sharp Local Minimax Rates

    Authors: Julien Chhor, Alexandra Carpentier

    Abstract: We consider the goodness-of-fit testing problem for Hölder-smooth densities over $\mathbb{R}^d$: given $n$ iid observations with unknown density $p$ and given a known density $p_0$, we investigate how large $ρ$ should be to distinguish, with high probability, the case $p=p_0$ from the composite alternative of all Hölder-smooth densities $p$ such that $\|p-p_0\|_t \geq ρ$ where $t \in [1,2]$. The d…

    Submitted 17 March, 2023; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: 79 pages

    MSC Class: 62G10 (Primary); 62B10; 62C20 (Secondary)

  6. arXiv:2106.10166

    stat.ML cs.LG

    Problem Dependent View on Structured Thresholding Bandit Problems

    Authors: James Cheshire, Pierre Ménard, Alexandra Carpentier

    Abstract: We investigate the problem-dependent regime in the stochastic Thresholding Bandit problem (TBP) under several shape constraints. In the TBP, the objective of the learner is to output, at the end of a sequential game, the set of arms whose means are above a given threshold. The vanilla, unstructured case is already well studied in the literature. Taking $K$ as the number of arms, we consider the c…

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: 25 pages. arXiv admin note: text overlap with arXiv:2006.10006

  7. arXiv:2103.12452

    cs.LG stat.ML

    Bandits with many optimal arms

    Authors: Rianne de Heide, James Cheshire, Pierre Ménard, Alexandra Carpentier

    Abstract: We consider a stochastic bandit problem with a possibly infinite number of arms. We write $p^*$ for the proportion of optimal arms and $Δ$ for the minimal mean-gap between optimal and sub-optimal arms. We characterize the optimal learning rates both in the cumulative regret setting, and in the best-arm identification setting in terms of the problem parameters $T$ (the budget), $p^*$ and $Δ$. For t…

    Submitted 5 November, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: Substantial rewrite and added experiments. Accepted for NeurIPS 2021

  8. arXiv:2102.00725

    stat.ML cs.LG math.ST

    Generalized non-stationary bandits

    Authors: Anne Gael Manegueu, Alexandra Carpentier, Yi Yu

    Abstract: In this paper, we study a non-stationary stochastic bandit problem, which generalizes the switching bandit problem. On top of the switching bandit problem (Case a), we are interested in three concrete examples: (b) the means of the arms are local polynomials, (c) the means of the arms are locally smooth, and (d) the gaps of the arms have a bounded number of infl…

    Submitted 2 February, 2021; v1 submitted 1 February, 2021; originally announced February 2021.

  9. arXiv:2010.10182

    stat.ML cs.LG

    The Elliptical Potential Lemma Revisited

    Authors: Alexandra Carpentier, Claire Vernade, Yasin Abbasi-Yadkori

    Abstract: This note proposes a new proof and new perspectives on the so-called Elliptical Potential Lemma. This result is important in online learning, especially for linear stochastic bandits. The original proof of the result, however short and elegant, does not give much flexibility on the type of potentials considered and we believe that this new interpretation can be of interest for future research in t…

    Submitted 20 October, 2020; originally announced October 2020.

    Comments: 8 pages

  10. arXiv:2006.10459

    stat.ML cs.LG

    Stochastic bandits with arm-dependent delays

    Authors: Anne Gael Manegueu, Claire Vernade, Alexandra Carpentier, Michal Valko

    Abstract: Significant work has been recently dedicated to the stochastic delayed bandit setting because of its relevance in applications. The applicability of existing algorithms is however restricted by the fact that strong assumptions are often made on the delay distributions, such as full observability, restrictive shape constraints, or uniformity over arms. In this work, we weaken them significantly and…

    Submitted 18 June, 2020; originally announced June 2020.

    Comments: 19 Pages, 4 figures

    MSC Class: 62L10

  11. arXiv:2006.10006

    cs.LG stat.ML

    The Influence of Shape Constraints on the Thresholding Bandit Problem

    Authors: James Cheshire, Pierre Menard, Alexandra Carpentier

    Abstract: We investigate the stochastic Thresholding Bandit problem (TBP) under several shape constraints. On top of (i) the vanilla, unstructured TBP, we consider (ii) the case where the sequence of arm means $(μ_k)_k$ is monotonically increasing (MTBP), (iii) the case where $(μ_k)_k$ is unimodal (UTBP) and (iv) the case where $(μ_k)_k$ is concave (CTBP). In the TBP, the aim is to output, at the end of…

    Submitted 23 February, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

  12. arXiv:1906.10454

    stat.ML cs.LG

    Restless dependent bandits with fading memory

    Authors: Oleksandr Zadorozhnyi, Gilles Blanchard, Alexandra Carpentier

    Abstract: We study the stochastic multi-armed bandit problem in the case when the arm samples are dependent over time and generated from so-called weak $\mathcal{C}$-mixing processes. We establish a $\mathcal{C}$-Mix Improved UCB algorithm and provide both problem-dependent and problem-independent regret analysis in two different scenarios. In the first, so-called fast-mixing scenario, we show that pseudo-regret enjoys the same upp…

    Submitted 25 June, 2019; originally announced June 2019.

    Comments: 30 pages

  13. arXiv:1902.01219

    math.ST cs.IT cs.LG stat.ML

    Local minimax rates for closeness testing of discrete distributions

    Authors: Joseph Lam-Weil, Alexandra Carpentier, Bharath K. Sriperumbudur

    Abstract: We consider the closeness testing problem for discrete distributions. The goal is to distinguish whether two samples are drawn from the same unspecified distribution, or whether their respective distributions are separated in $L_1$-norm. In this paper, we focus on adapting the rate to the shape of the underlying distributions, i.e. we consider a local minimax setting. We provide, to the b…

    Submitted 19 January, 2021; v1 submitted 1 February, 2019; originally announced February 2019.

    MSC Class: 62F03; 62G10; 62F35

    ACM Class: G.3; I.2.6

  14. arXiv:1811.11043

    stat.ML cs.LG

    Rotting bandits are not harder than stochastic ones

    Authors: Julien Seznec, Andrea Locatelli, Alexandra Carpentier, Alessandro Lazaric, Michal Valko

    Abstract: In stochastic multi-armed bandits, the reward distribution of each arm is assumed to be stationary. This assumption is often violated in practice (e.g., in recommendation systems), where the reward of an arm may change whenever it is selected, i.e., the rested bandit setting. In this paper, we consider the non-parametric rotting bandit setting, where rewards can only decrease. We introduce the filtering…

    Submitted 9 May, 2020; v1 submitted 27 November, 2018; originally announced November 2018.

    Journal ref: International Conference on Artificial Intelligence and Statistics (AISTATS 2019)

  15. arXiv:1810.09390

    stat.ML cs.LG

    A minimax near-optimal algorithm for adaptive rejection sampling

    Authors: Juliette Achdou, Joseph C. Lam, Alexandra Carpentier, Gilles Blanchard

    Abstract: Rejection Sampling is a fundamental Monte-Carlo method. It is used to sample from distributions admitting a probability density function which can be evaluated exactly at any given point, albeit at a high computational cost. However, without proper tuning, this technique implies a high rejection rate. Several methods have been explored to cope with this problem, based on the principle of adaptivel…

    Submitted 22 October, 2018; originally announced October 2018.

    Comments: 32 pages, 4 figures. Submitted to ALT 2019

    MSC Class: 62D05; 62L12; 62G05 (Primary); 62L05; 62G07 (Secondary)

    ACM Class: G.3; I.2.6

  16. arXiv:1807.02089

    stat.ML cs.LG

    Linear Bandits with Stochastic Delayed Feedback

    Authors: Claire Vernade, Alexandra Carpentier, Tor Lattimore, Giovanni Zappella, Beyza Ermis, Michael Brueckner

    Abstract: Stochastic linear bandits are a natural and well-studied model for structured exploration/exploitation problems and are widely used in applications such as online marketing and recommendation. One of the main challenges faced by practitioners hoping to apply existing algorithms is that usually the feedback is randomly delayed and delays are only partially observable. For example, while a purchase…

    Submitted 2 March, 2020; v1 submitted 5 July, 2018; originally announced July 2018.

  17. arXiv:1711.09294

    stat.ML cs.LG

    An Adaptive Strategy for Active Learning with Smooth Decision Boundary

    Authors: Andrea Locatelli, Alexandra Carpentier, Samory Kpotufe

    Abstract: We present the first adaptive strategy for active learning in the setting of classification with smooth decision boundary. The problem of adaptivity (to unknown distributional parameters) has remained open since the seminal work of Castro and Nowak (2007), which first established (active learning) rates for this setting. While some recent advances on this problem establish adaptive rates in the…

    Submitted 25 November, 2017; originally announced November 2017.

  18. arXiv:1605.09004

    stat.ML cs.LG

    Tight (Lower) Bounds for the Fixed Budget Best Arm Identification Bandit Problem

    Authors: Alexandra Carpentier, Andrea Locatelli

    Abstract: We consider the problem of best arm identification with a fixed budget $T$, in the $K$-armed stochastic bandit setting, with arm distributions defined on $[0,1]$. We prove that any bandit strategy, for at least one bandit problem characterized by a complexity $H$, will misidentify the best arm with probability lower bounded by $\exp\Big(-\frac{T}{\log(K)H}\Big)$, where $H$ is t…

    Submitted 29 May, 2016; originally announced May 2016.

    Comments: COLT 2016

  19. arXiv:1605.08671

    stat.ML cs.LG

    An optimal algorithm for the Thresholding Bandit Problem

    Authors: Andrea Locatelli, Maurilio Gutzeit, Alexandra Carpentier

    Abstract: We study a specific combinatorial pure exploration stochastic bandit problem where the learner aims at finding the set of arms whose means are above a given threshold, up to a given precision, and for a fixed time horizon. We propose a parameter-free algorithm based on an original heuristic, and prove that it is optimal for this problem by deriving matching upper and lower bounds…

    Submitted 27 May, 2016; originally announced May 2016.

    Comments: ICML 2016

  20. arXiv:1507.04523

    cs.LG

    Upper-Confidence-Bound Algorithms for Active Learning in Multi-Armed Bandits

    Authors: Alexandra Carpentier, Alessandro Lazaric, Mohammad Ghavamzadeh, Rémi Munos, Peter Auer, András Antos

    Abstract: In this paper, we study the problem of estimating uniformly well the mean values of several distributions given a finite budget of samples. If the variance of the distributions were known, one could design an optimal sampling strategy by collecting a number of independent samples per distribution that is proportional to their variance. However, in the more realistic case where the distributions ar…

    Submitted 16 July, 2015; originally announced July 2015.

    Comments: 30 pages, 2 Postscript figures, uses elsarticle.cls, earlier, shorter version published in Proceedings of the 22nd International Conference, Algorithmic Learning Theory

    ACM Class: G.3

  21. arXiv:1505.04627

    cs.LG stat.ML

    Simple regret for infinitely many armed bandits

    Authors: Alexandra Carpentier, Michal Valko

    Abstract: We consider a stochastic bandit problem with infinitely many arms. In this setting, the learner has no chance of trying all the arms even once and has to dedicate its limited number of samples only to a certain number of arms. All previous algorithms for this setting were designed for minimizing the cumulative regret of the learner. In this paper, we propose an algorithm aiming at minimizing the s…

    Submitted 18 May, 2015; originally announced May 2015.

    Comments: In the 32nd International Conference on Machine Learning (ICML 2015)