Skip to main content

Showing 1–14 of 14 results for author: Alquier, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.14335  [pdf, other

    stat.ML cs.LG

    Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning

    Authors: Otmane Sakhi, Imad Aouali, Pierre Alquier, Nicolas Chopin

    Abstract: This work investigates the offline formulation of the contextual bandit problem, where the goal is to leverage past interactions collected under a behavior policy to evaluate, select, and learn new, potentially better-performing, policies. Motivated by critical applications, we move beyond point estimators. Instead, we adopt the principle of pessimism where we construct upper bounds that assess a… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  2. arXiv:2302.11709  [pdf, ps, other

    stat.ML cs.LG

    Bayes meets Bernstein at the Meta Level: an Analysis of Fast Rates in Meta-Learning with PAC-Bayes

    Authors: Charles Riou, Pierre Alquier, Badr-Eddine Chérief-Abdellatif

    Abstract: Bernstein's condition is a key assumption that guarantees fast rates in machine learning. For example, the Gibbs algorithm with prior $π$ has an excess risk in $O(d_π/n)$, as opposed to the standard $O(\sqrt{d_π/n})$, where $n$ denotes the number of observations and $d_π$ is a complexity parameter which depends on the prior $π$. In this paper, we examine the Gibbs algorithm in the context of meta-… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

  3. arXiv:2210.13132  [pdf, other

    stat.ML cs.LG

    PAC-Bayesian Offline Contextual Bandits With Guarantees

    Authors: Otmane Sakhi, Pierre Alquier, Nicolas Chopin

    Abstract: This paper introduces a new principled approach for off-policy learning in contextual bandits. Unlike previous work, our approach does not derive learning principles from intractable or loose bounds. We analyse the problem through the PAC-Bayesian lens, interpreting policies as mixtures of decision rules. This allows us to propose novel generalization bounds and provide tractable algorithms to opt… ▽ More

    Submitted 27 May, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted to ICML 2023

  4. arXiv:2210.06672  [pdf, other

    math.ST cs.LG stat.ML

    Variance-Aware Estimation of Kernel Mean Embedding

    Authors: Geoffrey Wolfer, Pierre Alquier

    Abstract: An important feature of kernel mean embeddings (KME) is that the rate of convergence of the empirical KME to the true distribution KME can be bounded independently of the dimension of the space, properties of the distribution and smoothness features of the kernel. We show how to speed-up convergence by leveraging variance information in the RKHS. Furthermore, we show that even when such informatio… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  5. arXiv:2110.11216  [pdf, ps, other

    stat.ML cs.LG math.ST

    User-friendly introduction to PAC-Bayes bounds

    Authors: Pierre Alquier

    Abstract: Aggregated predictors are obtained by making a set of basic predictors vote according to some weights, that is, to some probability distribution. Randomized predictors are obtained by sampling in a set of basic predictors, according to some prescribed probability distribution. Thus, aggregated and randomized predictors have in common that they are not defined by a minimization problem, but by… ▽ More

    Submitted 8 November, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

  6. arXiv:2102.02504  [pdf, other

    stat.ML cs.LG math.ST stat.CO

    Meta-strategy for Learning Tuning Parameters with Guarantees

    Authors: Dimitri Meunier, Pierre Alquier

    Abstract: Online learning methods, like the online gradient algorithm (OGA) and exponentially weighted aggregation (EWA), often depend on tuning parameters that are difficult to set in practice. We consider an online meta-learning scenario, and we propose a meta-strategy to learn these parameters from past tasks. Our strategy is based on the minimization of a regret bound. It allows to learn the initializat… ▽ More

    Submitted 6 August, 2021; v1 submitted 4 February, 2021; originally announced February 2021.

    Journal ref: Entropy, 2021, vol. 23, no. 10, 1257

  7. arXiv:2010.04003  [pdf, other

    cs.LG cs.AI stat.ML

    A Theoretical Analysis of Catastrophic Forgetting through the NTK Overlap Matrix

    Authors: Thang Doan, Mehdi Bennani, Bogdan Mazoure, Guillaume Rabusseau, Pierre Alquier

    Abstract: Continual learning (CL) is a setting in which an agent has to learn from an incoming stream of data during its entire lifetime. Although major advances have been made in the field, one recurring problem which remains unsolved is that of Catastrophic Forgetting (CF). While the issue has been extensively studied empirically, little attention has been paid from a theoretical angle. In this paper, we… ▽ More

    Submitted 25 February, 2021; v1 submitted 7 October, 2020; originally announced October 2020.

    Comments: Accepted to AISTATS 2021. Keywords: continual learning, catastrophic forgetting, NTK regime, orthgonal gradient descent

    Journal ref: Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS 2021)

  8. arXiv:2009.03017  [pdf, ps, other

    stat.ML cs.LG

    Non-exponentially weighted aggregation: regret bounds for unbounded loss functions

    Authors: Pierre Alquier

    Abstract: We tackle the problem of online optimization with a general, possibly unbounded, loss function. It is well known that when the loss is bounded, the exponentially weighted aggregation strategy (EWA) leads to a regret in $\sqrt{T}$ after $T$ steps. In this paper, we study a generalized aggregation strategy, where the weights no longer depend exponentially on the losses. Our strategy is based on Foll… ▽ More

    Submitted 17 June, 2021; v1 submitted 7 September, 2020; originally announced September 2020.

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, Proceedings of Machine Learning Research, 2021, vol. 139, pp. 207-218

  9. arXiv:1912.05737  [pdf, other

    math.ST cs.LG stat.CO stat.ME

    Finite sample properties of parametric MMD estimation: robustness to misspecification and dependence

    Authors: Badr-Eddine Chérief-Abdellatif, Pierre Alquier

    Abstract: Many works in statistics aim at designing a universal estimation procedure, that is, an estimator that would converge to the best approximation of the (unknown) data generating distribution in a model, without any assumption on this distribution. This question is of major interest, in particular because the universality property leads to the robustness of the estimator. In this paper, we tackle th… ▽ More

    Submitted 4 March, 2021; v1 submitted 11 December, 2019; originally announced December 2019.

    Journal ref: Bernoulli, 2022, vol. 28(1), no. 1, pp. 181-213

  10. arXiv:1909.13339  [pdf, other

    math.ST cs.LG stat.ML

    MMD-Bayes: Robust Bayesian Estimation via Maximum Mean Discrepancy

    Authors: Badr-Eddine Chérief-Abdellatif, Pierre Alquier

    Abstract: In some misspecified settings, the posterior distribution in Bayesian statistics may lead to inconsistent estimates. To fix this issue, it has been suggested to replace the likelihood by a pseudo-likelihood, that is the exponential of a loss function enjoying suitable robustness properties. In this paper, we build a pseudo-likelihood based on the Maximum Mean Discrepancy, defined via an embedding… ▽ More

    Submitted 11 December, 2019; v1 submitted 29 September, 2019; originally announced September 2019.

  11. arXiv:1904.03920  [pdf, other

    stat.ML cs.LG math.ST stat.CO

    A Generalization Bound for Online Variational Inference

    Authors: Badr-Eddine Chérief-Abdellatif, Pierre Alquier, Mohammad Emtiyaz Khan

    Abstract: Bayesian inference provides an attractive online-learning framework to analyze sequential data, and offers generalization guarantees which hold even with model mismatch and adversaries. Unfortunately, exact Bayesian inference is rarely feasible in practice and approximation methods are usually employed, but do such methods preserve the generalization properties of Bayesian inference ? In this pape… ▽ More

    Submitted 10 December, 2019; v1 submitted 8 April, 2019; originally announced April 2019.

    Comments: Published in the proceedings of ACML 2019

    Journal ref: Proceedings in Machine Learning Research, 2019, vol. 101, pp. 662-677

  12. Exponential inequalities for nonstationary Markov Chains

    Authors: Pierre Alquier, Paul Doukhan, Xiequan Fan

    Abstract: Exponential inequalities are main tools in machine learning theory. To prove exponential inequalities for non i.i.d random variables allows to extend many learning techniques to these variables. Indeed, much work has been done both on inequalities and learning theory for time series, in the past 15 years. However, for the non independent case, almost all the results concern stationary time series.… ▽ More

    Submitted 4 May, 2019; v1 submitted 27 August, 2018; originally announced August 2018.

    Journal ref: Dependence Modeling, 2019, vol. 7, pp. 150-168

  13. arXiv:1706.09293  [pdf, ps, other

    math.ST cs.LG

    Concentration of tempered posteriors and of their variational approximations

    Authors: Pierre Alquier, James Ridgway

    Abstract: While Bayesian methods are extremely popular in statistics and machine learning, their application to massive datasets is often challenging, when possible at all. Indeed, the classical MCMC algorithms are prohibitively slow when both the model dimension and the sample size are large. Variational Bayesian methods aim at approximating the posterior by a distribution in a tractable family. Thus, MCMC… ▽ More

    Submitted 22 April, 2019; v1 submitted 28 June, 2017; originally announced June 2017.

  14. arXiv:1610.08628  [pdf, other

    stat.ML cs.LG

    Regret Bounds for Lifelong Learning

    Authors: Pierre Alquier, The Tien Mai, Massimiliano Pontil

    Abstract: We consider the problem of transfer learning in an online setting. Different tasks are presented sequentially and processed by a within-task algorithm. We propose a lifelong learning strategy which refines the underlying data representation used by the within-task algorithm, thereby transferring information from one task to the next. We show that when the within-task algorithm comes with some regr… ▽ More

    Submitted 27 October, 2016; originally announced October 2016.

    Journal ref: Proceedings of Machine Learning Research, 2017, vol. 54 (AISTAT 2017), pp. 261-269