Skip to main content

Showing 1–13 of 13 results for author: Haddouche, M

.
  1. arXiv:2402.08508  [pdf, other

    stat.ML cs.LG

    A PAC-Bayesian Link Between Generalisation and Flat Minima

    Authors: Maxime Haddouche, Paul Viallard, Umut Simsekli, Benjamin Guedj

    Abstract: Modern machine learning usually involves predictors in the overparametrised setting (number of trained parameters greater than dataset size), and their training yield not only good performances on training data, but also good generalisation capacity. This phenomenon challenges many theoretical results, and remains an open problem. To reach a better understanding, we provide novel generalisation bo… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: We provide novel PAC-Bayesian generalisation bounds involving gradient norms and being interpretable under the lens of flat minima

  2. arXiv:2402.05101  [pdf, ps, other

    stat.ML cs.LG

    Tighter Generalisation Bounds via Interpolation

    Authors: Paul Viallard, Maxime Haddouche, Umut Şimşekli, Benjamin Guedj

    Abstract: This paper contains a recipe for deriving new PAC-Bayes generalisation bounds based on the $(f, Γ)$-divergence, and, in addition, presents PAC-Bayes generalisation bounds where we interpolate between a series of probability divergences (including but not limited to KL, Wasserstein, and total variation), making the best out of many worlds depending on the posterior distributions properties. We expl… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  3. arXiv:2310.11203  [pdf, other

    cs.LG stat.ML

    Federated Learning with Nonvacuous Generalisation Bounds

    Authors: Pierre Jobic, Maxime Haddouche, Benjamin Guedj

    Abstract: We introduce a novel strategy to train randomised predictors in federated learning, where each node of the network aims at preserving its privacy by releasing a local predictor but kee** secret its training dataset with respect to the other nodes. We then build a global randomised predictor which inherits the properties of the local private predictors in the sense of a PAC-Bayesian generalisatio… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  4. arXiv:2306.04375  [pdf, ps, other

    stat.ML cs.LG

    Learning via Wasserstein-Based High Probability Generalisation Bounds

    Authors: Paul Viallard, Maxime Haddouche, Umut Şimşekli, Benjamin Guedj

    Abstract: Minimising upper bounds on the population risk or the generalisation gap has been widely used in structural risk minimisation (SRM) -- this is in particular at the core of PAC-Bayesian learning. Despite its successes and unfailing surge of interest in recent years, a limitation of the PAC-Bayesian framework is that most bounds involve a Kullback-Leibler (KL) divergence term (or its variations), wh… ▽ More

    Submitted 27 October, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted to NeurIPS 2023

  5. arXiv:2304.07048  [pdf, other

    stat.ML cs.LG math.OC

    Wasserstein PAC-Bayes Learning: Exploiting Optimisation Guarantees to Explain Generalisation

    Authors: Maxime Haddouche, Benjamin Guedj

    Abstract: PAC-Bayes learning is an established framework to both assess the generalisation ability of learning algorithms, and design new learning algorithm by exploiting generalisation bounds as training objectives. Most of the exisiting bounds involve a \emph{Kullback-Leibler} (KL) divergence, which fails to capture the geometric properties of the loss function which are often useful in optimisation. We a… ▽ More

    Submitted 30 May, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

  6. arXiv:2301.07530  [pdf, other

    cs.LG math.OC stat.ML

    Optimistically Tempered Online Learning

    Authors: Maxime Haddouche, Olivier Wintenberger, Benjamin Guedj

    Abstract: Optimistic Online Learning algorithms have been developed to exploit expert advices, assumed optimistically to be always useful. However, it is legitimate to question the relevance of such advices \emph{w.r.t.} the learning information provided by gradient-based online algorithms. In this work, we challenge the confidence assumption on the expert and develop the \emph{optimistically tempered} (OT)… ▽ More

    Submitted 14 February, 2024; v1 submitted 18 January, 2023; originally announced January 2023.

  7. arXiv:2210.00928  [pdf, ps, other

    stat.ML cs.LG math.ST

    PAC-Bayes Generalisation Bounds for Heavy-Tailed Losses through Supermartingales

    Authors: Maxime Haddouche, Benjamin Guedj

    Abstract: While PAC-Bayes is now an established learning framework for light-tailed losses (\emph{e.g.}, subgaussian or subexponential), its extension to the case of heavy-tailed losses remains largely uncharted and has attracted a growing interest in recent years. We contribute PAC-Bayes generalisation bounds for heavy-tailed losses under the sole assumption of bounded variance of the loss function. Under… ▽ More

    Submitted 24 April, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: New Section 3 on Online PAC-Bayes

  8. arXiv:2206.00024  [pdf, other

    cs.LG math.ST stat.ML

    Online PAC-Bayes Learning

    Authors: Maxime Haddouche, Benjamin Guedj

    Abstract: Most PAC-Bayesian bounds hold in the batch learning setting where data is collected at once, prior to inference or prediction. This somewhat departs from many contemporary learning problems where data streams are collected and the algorithms must dynamically adjust. We prove new PAC-Bayesian bounds in this online learning framework, leveraging an updated definition of regret, and we revisit classi… ▽ More

    Submitted 13 October, 2022; v1 submitted 31 May, 2022; originally announced June 2022.

    Comments: 21 pages

    Journal ref: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  9. arXiv:2103.11147  [pdf, ps, other

    math.ST stat.OT

    A unified approach for covariance matrix estimation under Stein loss

    Authors: Anis M. Haddouche, Wei Lu

    Abstract: In this paper, we address the problem of estimating a covariance matrix of a multivariate Gaussian distribution, relative to a Stein loss function, from a decision theoretic point of view. We investigate the case where the covariance matrix is invertible and the case when it is non--invertible in a unified approach.

    Submitted 20 March, 2021; originally announced March 2021.

  10. arXiv:2012.11920  [pdf, other

    math.ST stat.AP

    Covariance matrix estimation under data-based loss

    Authors: Anis M. Haddouche, Dominique Fourdrinier, Fatiha Mezoued

    Abstract: In this paper, we consider the problem of estimating the $p\times p$ scale matrix $Σ$ of a multivariate linear regression model $Y=X\,β+ \mathcal{E}\,$ when the distribution of the observed matrix $Y$ belongs to a large class of elliptically symmetric distributions. After deriving the canonical form $(Z^T U^T)^T$ of this model, any estimator $\hat{ Σ}$ of $Σ$ is assessed through the data-based los… ▽ More

    Submitted 22 December, 2020; originally announced December 2020.

  11. arXiv:2012.10369  [pdf, ps, other

    cs.LG math.ST stat.ML

    Upper and Lower Bounds on the Performance of Kernel PCA

    Authors: Maxime Haddouche, Benjamin Guedj, John Shawe-Taylor

    Abstract: Principal Component Analysis (PCA) is a popular method for dimension reduction and has attracted an unfailing interest for decades. More recently, kernel PCA (KPCA) has emerged as an extension of PCA but, despite its use in practice, a sound theoretical understanding of KPCA is missing. We contribute several lower and upper bounds on the efficiency of KPCA, involving the empirical eigenvalues of t… ▽ More

    Submitted 23 January, 2023; v1 submitted 18 December, 2020; originally announced December 2020.

    Comments: 16 pages

  12. arXiv:2006.07279  [pdf, other

    stat.ML cs.LG math.ST

    PAC-Bayes unleashed: generalisation bounds with unbounded losses

    Authors: Maxime Haddouche, Benjamin Guedj, Omar Rivasplata, John Shawe-Taylor

    Abstract: We present new PAC-Bayesian generalisation bounds for learning problems with unbounded loss functions. This extends the relevance and applicability of the PAC-Bayes learning framework, where most of the existing literature focuses on supervised learning problems with a bounded loss function (typically assumed to take values in the interval [0;1]). In order to relax this assumption, we propose a ne… ▽ More

    Submitted 30 September, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: 24 pages

    Journal ref: Entropy 2021

  13. arXiv:2006.00243  [pdf, ps, other

    math.ST stat.AP

    Scale matrix estimation under data-based loss in high and low dimensions

    Authors: Mohamed Anis Haddouche, Dominique Fourdrinier, Fatiha Mezoued

    Abstract: We consider the problem of estimating the scale matrix $Σ$ of the additif model $Y_{p\times n} = M + \mathcal{E}$, under a theoretical decision point of view. Here, $ p $ is the number of variables, $ n$ is the number of observations, $ M $ is a matrix of unknown parameters with rank $q<p$ and $ \mathcal {E}$ is a random noise, whose distribution is elliptically symmetric with covariance matrix pr… ▽ More

    Submitted 30 May, 2020; originally announced June 2020.