Showing 1–19 of 19 results for author: Boursier, E

  1. arXiv:2406.19824

    cs.GT stat.ML

    Learning to Mitigate Externalities: the Coase Theorem with Hindsight Rationality

    Authors: Antoine Scheid, Aymeric Capitaine, Etienne Boursier, Eric Moulines, Michael I. Jordan, Alain Durmus

    Abstract: In economic theory, the concept of externality refers to any indirect effect resulting from an interaction between players that affects the social welfare. Most of the models within which externality has been studied assume that agents have perfect knowledge of their environment and preferences. This is a major hindrance to the practical implementation of many proposed solutions. To address this i…

    Submitted 3 July, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2403.03811

    stat.ML cs.GT cs.LG

    Incentivized Learning in Principal-Agent Bandit Games

    Authors: Antoine Scheid, Daniil Tiapkin, Etienne Boursier, Aymeric Capitaine, El Mahdi El Mhamdi, Eric Moulines, Michael I. Jordan, Alain Durmus

    Abstract: This work considers a repeated principal-agent bandit game, where the principal can only interact with her environment through the agent. The principal and the agent have misaligned objectives and the choice of action is only left to the agent. However, the principal can influence the agent's decisions by offering incentives which add up to his rewards. The principal aims to iteratively learn an i…

    Submitted 6 March, 2024; originally announced March 2024.

  3. arXiv:2401.10791

    cs.LG stat.ML

    Early alignment in two-layer networks training is a two-edged sword

    Authors: Etienne Boursier, Nicolas Flammarion

    Abstract: Training neural networks with first order optimisation methods is at the core of the empirical success of deep learning. The scale of initialisation is a crucial factor, as small initialisations are generally associated to a feature learning regime, for which gradient descent is implicitly biased towards simple solutions. This work provides a general and quantitative description of the early align…

    Submitted 19 January, 2024; originally announced January 2024.

  4. arXiv:2310.12563

    stat.ML cs.LG

    Approximate information maximization for bandit games

    Authors: Alex Barbier-Chebbah, Christian L. Vestergaard, Jean-Baptiste Masson, Etienne Boursier

    Abstract: Entropy maximization and free energy minimization are general physical principles for modeling the dynamics of various physical systems. Notable examples include modeling decision-making within the brain using the free-energy principle, optimizing the accuracy-complexity trade-off when accessing hidden variables with the information bottleneck principle (Tishby et al., 2000), and navigation in ran…

    Submitted 30 October, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

  5. arXiv:2305.19691

    cs.LG stat.ML

    Constant or logarithmic regret in asynchronous multiplayer bandits

    Authors: Hugo Richard, Etienne Boursier, Vianney Perchet

    Abstract: Multiplayer bandits have recently been extensively studied because of their application to cognitive radio networks. While the literature mostly considers synchronous players, radio networks (e.g. for IoT) tend to have asynchronous devices. This motivates the harder, asynchronous multiplayer bandits problem, which was first tackled with an explore-then-commit (ETC) algorithm (see Dakdouk, 2022),…

    Submitted 31 May, 2023; originally announced May 2023.

  6. arXiv:2303.01353

    stat.ML cs.LG

    Penalising the biases in norm regularisation enforces sparsity

    Authors: Etienne Boursier, Nicolas Flammarion

    Abstract: Controlling the parameters' norm often yields good generalisation when training neural networks. Beyond simple intuitions, the relation between regularising parameters' norm and obtained estimators remains theoretically misunderstood. For one hidden ReLU layer networks with unidimensional data, this work shows the parameters' norm required to represent a function is given by the total variation of…

    Submitted 9 November, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

  7. arXiv:2303.01335

    cs.LG stat.ML

    First-order ANIL learns linear representations despite misspecified latent dimension

    Authors: Oğuz Kaan Yuksel, Etienne Boursier, Nicolas Flammarion

    Abstract: Due to its empirical success in few-shot classification and reinforcement learning, meta-learning has recently received significant interest. Meta-learning methods leverage data from previous tasks to learn a new task in a sample-efficient manner. In particular, model-agnostic methods look for initialisation points from which gradient descent quickly adapts to any new task. Although it has been em…

    Submitted 30 June, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

  8. arXiv:2211.16275

    stat.ML cs.GT cs.LG

    A survey on multi-player bandits

    Authors: Etienne Boursier, Vianney Perchet

    Abstract: Due mostly to its application to cognitive radio networks, multiplayer bandits gained a lot of interest in the last decade. A considerable progress has been made on its theoretical aspect. However, the current algorithms are far from applicable and many obstacles remain between these theoretical results and a possible implementation of multiplayer bandits algorithms in real cognitive radio network…

    Submitted 3 June, 2024; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: final version, accepted at JMLR

  9. arXiv:2206.00939

    stat.ML cs.LG

    Gradient flow dynamics of shallow ReLU networks for square loss and orthogonal inputs

    Authors: Etienne Boursier, Loucas Pillaud-Vivien, Nicolas Flammarion

    Abstract: The training of neural networks by gradient descent methods is a cornerstone of the deep learning revolution. Yet, despite some recent progress, a complete theory explaining its success is still missing. This article presents, for orthogonal input vectors, a precise description of the gradient flow dynamics of training one-hidden layer ReLU neural networks for the mean squared error at small initi…

    Submitted 31 October, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

  10. arXiv:2202.06742

    stat.ML cs.LG

    Trace norm regularization for multi-task learning with scarce data

    Authors: Etienne Boursier, Mikhail Konobeev, Nicolas Flammarion

    Abstract: Multi-task learning leverages structural similarities between multiple tasks to learn despite very few samples. Motivated by the recent success of neural networks applied to data-scarce tasks, we consider a linear low-dimensional shared representation model. Despite an extensive literature, existing theoretical results either guarantee weak estimation rates or require a large number of samples per…

    Submitted 10 June, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: COLT 2022

  11. arXiv:2106.04228

    stat.ML cs.GT cs.LG cs.NI

    Decentralized Learning in Online Queuing Systems

    Authors: Flore Sentenac, Etienne Boursier, Vianney Perchet

    Abstract: Motivated by packet routing in computer networks, online queuing systems are composed of queues receiving packets at different rates. Repeatedly, they send packets to servers, each of them treating only at most one packet at a time. In the centralized case, the number of accumulated packets remains bounded (i.e., the system is stable) as long as the ratio between service rates and arrival…

    Submitted 4 November, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 camera ready

  12. arXiv:2102.08087

    stat.ML cs.LG math.OC stat.OT

    Making the most of your day: online learning for optimal allocation of time

    Authors: Etienne Boursier, Tristan Garrec, Vianney Perchet, Marco Scarsini

    Abstract: We study online learning for optimal allocation when the resource to be allocated is time. Examples of possible applications include job scheduling for a computing server, a driver filling a day with rides, a landlord renting an estate, etc. An agent receives task proposals sequentially according to a Poisson process and can either accept or reject a proposed task. If she accepts the proposal, sh…

    Submitted 4 November, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: NeurIPS 2021 camera ready

  13. arXiv:2007.09996

    math.OC cs.LG stat.OT

    Social Learning in Non-Stationary Environments

    Authors: Etienne Boursier, Vianney Perchet, Marco Scarsini

    Abstract: Potential buyers of a product or service, before making their decisions, tend to read reviews written by previous consumers. We consider Bayesian consumers with heterogeneous preferences, who sequentially decide whether to buy an item of unknown quality, based on previous buyers' reviews. The quality is multi-dimensional and may occasionally vary over time; the reviews are also multi-dimensional.…

    Submitted 23 February, 2022; v1 submitted 20 July, 2020; originally announced July 2020.

  14. arXiv:2006.06613

    stat.ML cs.LG

    Statistical Efficiency of Thompson Sampling for Combinatorial Semi-Bandits

    Authors: Pierre Perrault, Etienne Boursier, Vianney Perchet, Michal Valko

    Abstract: We investigate stochastic combinatorial multi-armed bandit with semi-bandit feedback (CMAB). In CMAB, the question of the existence of an efficient policy with an optimal asymptotic regret (up to a factor poly-logarithmic with the action size) is still open for many families of distributions, including mutually independent outcomes, and more generally the multivariate sub-Gaussian family. We propo…

    Submitted 3 January, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: accepted to NeurIPS 2020

  15. arXiv:2002.01197

    cs.LG stat.ML

    Selfish Robustness and Equilibria in Multi-Player Bandits

    Authors: Etienne Boursier, Vianney Perchet

    Abstract: Motivated by cognitive radios, stochastic multi-player multi-armed bandits gained a lot of interest recently. In this class of problems, several players simultaneously pull arms and encounter a collision - with 0 reward - if some of them pull the same arm at the same time. While the cooperative case where players maximize the collective reward (obediently following some fixed protocol) has been mo…

    Submitted 19 June, 2020; v1 submitted 4 February, 2020; originally announced February 2020.

  16. arXiv:1905.11148

    stat.ML cs.LG stat.AP

    Utility/Privacy Trade-off through the lens of Optimal Transport

    Authors: Etienne Boursier, Vianney Perchet

    Abstract: Strategic information is valuable either by remaining private (for instance if it is sensitive) or, on the other hand, by being used publicly to increase some utility. These two objectives are antagonistic and leaking this information might be more rewarding than concealing it. Unlike classical solutions that focus on the first point, we consider instead agents that optimize a natural trade-off be…

    Submitted 2 March, 2020; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: AISTATS 2020

  17. arXiv:1902.01239

    stat.ML cs.LG

    A Practical Algorithm for Multiplayer Bandits when Arm Means Vary Among Players

    Authors: Etienne Boursier, Emilie Kaufmann, Abbas Mehrabian, Vianney Perchet

    Abstract: We study a multiplayer stochastic multi-armed bandit problem in which players cannot communicate, and if two or more players pull the same arm, a collision occurs and the involved players receive zero reward. We consider the challenging heterogeneous setting, in which different arms may have different means for different players, and propose a new and efficient algorithm that combines the idea of…

    Submitted 20 March, 2020; v1 submitted 4 February, 2019; originally announced February 2019.

    Comments: AISTATS 2020

  18. arXiv:1809.08151

    cs.LG stat.ML

    SIC-MMAB: Synchronisation Involves Communication in Multiplayer Multi-Armed Bandits

    Authors: Etienne Boursier, Vianney Perchet

    Abstract: Motivated by cognitive radio networks, we consider the stochastic multiplayer multi-armed bandit problem, where several players pull arms simultaneously and collisions occur if one of them is pulled by several players at the same stage. We present a decentralized algorithm that achieves the same performance as a centralized one, contradicting the existing lower bounds for that problem. This is pos…

    Submitted 19 November, 2019; v1 submitted 21 September, 2018; originally announced September 2018.

    Journal ref: NeurIPS 2019

  19. arXiv:1808.04876

    cs.DB

    Plato: Approximate Analytics over Compressed Time Series with Tight Deterministic Error Guarantees

    Authors: Chunbin Lin, Etienne Boursier, Yannis Papakonstantinou

    Abstract: Plato provides fast approximate analytics on time series, by precomputing and storing compressed time series. Plato's key novelty is the delivery of tight deterministic error guarantees for time series analytics. Plato evaluates any time series expression composed by the linear algebra operators over vectors, along with arithmetic operators. This large scope of possible expressions includes common…

    Submitted 13 September, 2019; v1 submitted 14 August, 2018; originally announced August 2018.