Showing 1–19 of 19 results for author: Perchet, V

  1. arXiv:2407.03141  [pdf, other]

    math.PR math.CO

    Optimal Unimodular Matching

    Authors: Nathanaël Enriquez, Mike Liu, Laurent Ménard, Vianney Perchet

    Abstract: We consider sequences of finite weighted random graphs that converge locally to unimodular i.i.d. weighted random trees. When the weights are atomless, we prove that the matchings of maximal weight converge locally to a matching on the limiting tree. For this purpose, we introduce and study unimodular matchings on weighted unimodular random trees as well as a notion of optimality for these objects…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 54 pages, 19 figures

    MSC Class: 05C70; 05C82; 60C05; 60K35

  2. arXiv:2210.12882  [pdf, other]

    stat.ML cs.LG math.OC math.ST

    Stochastic Mirror Descent for Large-Scale Sparse Recovery

    Authors: Sasila Ilandarideva, Yannis Bekri, Anatoli Juditsky, Vianney Perchet

    Abstract: In this paper we discuss an application of Stochastic Approximation to statistical estimation of high-dimensional sparse parameters. The proposed solution reduces to resolving a penalized stochastic optimization problem on each stage of a multistage algorithm; each problem being solved to a prescribed accuracy by the non-Euclidean Composite Stochastic Mirror Descent (CSMD) algorithm. Assuming that…

    Submitted 23 October, 2022; originally announced October 2022.

  3. arXiv:2102.08087  [pdf, other]

    stat.ML cs.LG math.OC stat.OT

    Making the most of your day: online learning for optimal allocation of time

    Authors: Etienne Boursier, Tristan Garrec, Vianney Perchet, Marco Scarsini

    Abstract: We study online learning for optimal allocation when the resource to be allocated is time. Examples of possible applications include job scheduling for a computing server, a driver filling a day with rides, a landlord renting an estate, etc. An agent receives task proposals sequentially according to a Poisson process and can either accept or reject a proposed task. If she accepts the proposal, sh…

    Submitted 4 November, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: NeurIPS 2021 camera ready

  4. arXiv:2007.09996  [pdf, ps, other]

    math.OC cs.LG stat.OT

    Social Learning in Non-Stationary Environments

    Authors: Etienne Boursier, Vianney Perchet, Marco Scarsini

    Abstract: Potential buyers of a product or service, before making their decisions, tend to read reviews written by previous consumers. We consider Bayesian consumers with heterogeneous preferences, who sequentially decide whether to buy an item of unknown quality, based on previous buyers' reviews. The quality is multi-dimensional and may occasionally vary over time; the reviews are also multi-dimensional.…

    Submitted 23 February, 2022; v1 submitted 20 July, 2020; originally announced July 2020.

  5. arXiv:1906.08509  [pdf, other]

    stat.ML cs.LG math.OC

    Online A-Optimal Design and Active Linear Regression

    Authors: Xavier Fontaine, Pierre Perrault, Michal Valko, Vianney Perchet

    Abstract: We consider in this paper the problem of optimal experiment design where a decision maker can choose which points to sample to obtain an estimate $\hat{β}$ of the hidden parameter $β^{\star}$ of an underlying linear model. The key challenge of this work lies in the heteroscedasticity assumption that we make, meaning that each covariate has a different and unknown variance. The goal of the decision m…

    Submitted 30 December, 2020; v1 submitted 20 June, 2019; originally announced June 2019.

    Comments: 29 pages, 5 figures

  6. arXiv:1902.04376  [pdf, ps, other]

    stat.ML cs.LG math.OC

    An adaptive stochastic optimization algorithm for resource allocation

    Authors: Xavier Fontaine, Shie Mannor, Vianney Perchet

    Abstract: We consider the classical problem of sequential resource allocation where a decision maker must repeatedly divide a budget between several resources, each with diminishing returns. This can be recast as a specific stochastic optimization problem where the objective is to maximize the cumulative reward, or equivalently to minimize the regret. We construct an algorithm that is {\em adaptive} to the…

    Submitted 16 January, 2020; v1 submitted 12 February, 2019; originally announced February 2019.

    Comments: ALT2020, 45 pages, 9 figures

    Journal ref: Proceedings of Machine Learning Research (PMLR), volume 117, 2020

  7. arXiv:1811.04575  [pdf, ps, other]

    math.OC cs.GT

    A differential game on Wasserstein space. Application to weak approachability with partial monitoring

    Authors: Vianney Perchet, Marc Quincampoix

    Abstract: Studying the continuous-time counterpart of some discrete-time dynamics is now a standard and fruitful technique, as some properties hold in both setups. In game theory, this is usually done by considering differential games on Euclidean spaces. This allows one to infer properties of the convergence of values of a repeated game, to deal with the various concepts of approachability, etc. In this paper, we…

    Submitted 12 November, 2018; originally announced November 2018.

  8. arXiv:1810.05065  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Regularized Contextual Bandits

    Authors: Xavier Fontaine, Quentin Berthet, Vianney Perchet

    Abstract: We consider the stochastic contextual bandit problem with additional regularization. The motivation comes from problems where the policy of the agent must be close to some baseline policy which is known to perform well on the task. To tackle this problem we use a nonparametric model and propose an algorithm splitting the context space into bins, and solving simultaneously - and independently - reg…

    Submitted 5 June, 2019; v1 submitted 11 October, 2018; originally announced October 2018.

    Comments: AISTATS 2019, 23 pages, 2 figures

    Journal ref: Proceedings of Machine Learning Research, PMLR 89:2144-2153, 2019

  9. arXiv:1806.02282  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Finding the bandit in a graph: Sequential search-and-stop

    Authors: Pierre Perrault, Vianney Perchet, Michal Valko

    Abstract: We consider the problem where an agent wants to find a hidden object that is randomly located in some vertex of a directed acyclic graph (DAG) according to a fixed but possibly unknown distribution. The agent can only examine vertices whose in-neighbors have already been examined. In this paper, we address a learning setting where we allow the agent to stop before having found the object and resta…

    Submitted 22 April, 2019; v1 submitted 6 June, 2018; originally announced June 2018.

    Comments: in International Conference on Artificial Intelligence and Statistics (AISTATS 2019), April 2019, Naha, Okinawa, Japan

  10. arXiv:1702.06917  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Fast Rates for Bandit Optimization with Upper-Confidence Frank-Wolfe

    Authors: Quentin Berthet, Vianney Perchet

    Abstract: We consider the problem of bandit optimization, inspired by stochastic optimization and online learning problems with bandit feedback. In this problem, the objective is to minimize a global loss function of all the actions, not necessarily a cumulative loss. This framework allows us to study a very general class of problems, with applications in statistics, machine learning, and other fields. To s…

    Submitted 6 September, 2017; v1 submitted 22 February, 2017; originally announced February 2017.

  11. arXiv:1605.08165  [pdf, ps, other]

    cs.LG math.OC

    Highly-Smooth Zero-th Order Online Optimization

    Authors: Francis Bach, Vianney Perchet

    Abstract: The minimization of convex functions which are only available through partial and noisy information is a key methodological problem in many disciplines. In this paper we consider convex optimization with noisy zero-th order information, that is noisy function evaluations at any desired point. We focus on problems with high degrees of smoothness, such as logistic regression. We show that as opposed…

    Submitted 26 May, 2016; originally announced May 2016.

    Comments: Conference on Learning Theory (COLT), Jun 2016, New York, United States. 2016

  12. Batched bandit problems

    Authors: Vianney Perchet, Philippe Rigollet, Sylvain Chassang, Erik Snowberg

    Abstract: Motivated by practical applications, chiefly clinical trials, we study the regret achievable for stochastic bandits under the constraint that the employed policy must split trials into a small number of batches. We propose a simple policy, and show that a very small number of batches gives close to minimax optimal regret bounds. As a byproduct, we derive optimal policies with low switching cost fo…

    Submitted 29 March, 2016; v1 submitted 2 May, 2015; originally announced May 2015.

    Comments: Published at http://dx.doi.org/10.1214/15-AOS1381 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1381

    Journal ref: Annals of Statistics 2016, Vol. 44, No. 2, 660-681

  13. arXiv:1402.2043  [pdf, other]

    stat.ML cs.LG math.ST

    Approachability in unknown games: Online learning meets multi-objective optimization

    Authors: Shie Mannor, Vianney Perchet, Gilles Stoltz

    Abstract: In the standard setting of approachability there are two players and a target set. The players repeatedly play a known vector-valued game where the first player wants the average vector-valued payoff to converge to the target set, while the other player tries to exclude it from this set. We revisit this setting in the spirit of online learning and do not assume that the first player knows the…

    Submitted 17 June, 2016; v1 submitted 10 February, 2014; originally announced February 2014.

  14. arXiv:1305.5399  [pdf, other]

    math.OC cs.GT cs.LG stat.ML

    A Primal Condition for Approachability with Partial Monitoring

    Authors: Shie Mannor, Vianney Perchet, Gilles Stoltz

    Abstract: In approachability with full monitoring there are two types of conditions that are known to be equivalent for convex sets: a primal and a dual condition. The primal one is of the form: a set C is approachable if and only if all containing half-spaces are approachable in the one-shot game; while the dual one is of the form: a convex set C is approachable if and only if it intersects all payoff sets of…

    Submitted 23 May, 2013; originally announced May 2013.

  15. arXiv:1302.1611  [pdf, ps, other]

    math.ST cs.LG stat.ML

    Bounded regret in stochastic multi-armed bandits

    Authors: Sébastien Bubeck, Vianney Perchet, Philippe Rigollet

    Abstract: We study the stochastic multi-armed bandit problem when one knows the value $μ^{(\star)}$ of an optimal arm, as well as a positive lower bound on the smallest positive gap $Δ$. We propose a new randomized policy that attains a regret {\em uniformly bounded over time} in this setting. We also prove several lower bounds, which show in particular that bounded regret is not possible if one only know…

    Submitted 12 February, 2013; v1 submitted 6 February, 2013; originally announced February 2013.

    MSC Class: 62L05

  16. arXiv:1301.3609  [pdf, ps, other]

    cs.GT math.OC

    On an unified framework for approachability in games with or without signals

    Authors: Vianney Perchet, Marc Quincampoix

    Abstract: We unify standard frameworks for approachability in both full and partial monitoring by defining a new abstract game, called the "purely informative game", where the outcome at each stage is the maximal information players can obtain, represented as some probability measure. Objectives of players can be rewritten as the convergence (to some given set) of sequences of averages of these probability m…

    Submitted 16 January, 2013; originally announced January 2013.

  17. arXiv:1110.6084  [pdf, ps, other]

    math.ST cs.LG stat.ML

    The multi-armed bandit problem with covariates

    Authors: Vianney Perchet, Philippe Rigollet

    Abstract: We consider a multi-armed bandit problem in a setting where each arm produces a noisy reward realization which depends on an observable random covariate. As opposed to the traditional static multi-armed bandit problem, this setting allows for dynamically changing rewards that better describe applications where side information is available. We adopt a nonparametric model where the expected rewards…

    Submitted 24 May, 2013; v1 submitted 27 October, 2011; originally announced October 2011.

    Comments: Published at http://dx.doi.org/10.1214/13-AOS1101 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1101

    Journal ref: Annals of Statistics 2013, Vol. 41, No. 2, 693-721

  18. arXiv:1105.4995  [pdf, ps, other]

    math.ST cs.LG

    Robust approachability and regret minimization in games with partial monitoring

    Authors: Shie Mannor, Vianney Perchet, Gilles Stoltz

    Abstract: Approachability has become a standard tool in analyzing learning algorithms in the adversarial online learning setup. We develop a variant of approachability for games where there is ambiguity in the obtained reward that belongs to a set, rather than being a single vector. Using this variant we tackle the problem of approachability in games with partial monitoring and develop simple and efficient a…

    Submitted 15 February, 2012; v1 submitted 25 May, 2011; originally announced May 2011.

  19. arXiv:1102.4442  [pdf, ps, other]

    cs.LG cs.GT math.OC

    Internal Regret with Partial Monitoring. Calibration-Based Optimal Algorithms

    Authors: Vianney Perchet

    Abstract: We provide consistent random algorithms for sequential decision under partial monitoring, i.e. when the decision maker does not observe the outcomes but receives instead random feedback signals. Those algorithms have no internal regret in the sense that, on the set of stages where the decision maker chose his action according to a given law, the average payoff could not have been improved in avera…

    Submitted 22 February, 2011; originally announced February 2011.