Skip to main content

Showing 1–13 of 13 results for author: Rodríguez-Gálvez, B

.
  1. arXiv:2403.16681  [pdf, other

    stat.ML cs.LG

    A note on generalization bounds for losses with finite moments

    Authors: Borja Rodríguez-Gálvez, Omar Rivasplata, Ragnar Thobaben, Mikael Skoglund

    Abstract: This paper studies the truncation method from Alquier [1] to derive high-probability PAC-Bayes bounds for unbounded losses with heavy tails. Assuming that the $p$-th moment is bounded, the resulting bounds interpolate between a slow rate $1 / \sqrt{n}$ when $p=2$, and a fast rate $1 / n$ when $p \to \infty$ and the loss is essentially bounded. Moreover, the paper derives a high-probability PAC-Bay… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 9 pages: 5 of main text, 1 of references, and 3 of appendices

  2. arXiv:2403.03361  [pdf, ps, other

    stat.ML cs.LG

    Chained Information-Theoretic bounds and Tight Regret Rate for Linear Bandit Problems

    Authors: Amaury Gouverneur, Borja Rodríguez-Gálvez, Tobias J. Oechtering, Mikael Skoglund

    Abstract: This paper studies the Bayesian regret of a variant of the Thompson-Sampling algorithm for bandit problems. It builds upon the information-theoretic framework of [Russo and Van Roy, 2015] and, more specifically, on the rate-distortion analysis from [Dong and Van Roy, 2020], where they proved a bound with regret rate of $O(d\sqrt{T \log(T)})$ for the $d$-dimensional linear bandit setting. We focus… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 15 pages: 8 of main text and 7 of appendices

  3. arXiv:2307.10907  [pdf, other

    cs.LG

    The Role of Entropy and Reconstruction in Multi-View Self-Supervised Learning

    Authors: Borja Rodríguez-Gálvez, Arno Blaas, Pau Rodríguez, Adam Goliński, Xavier Suau, Jason Ramapuram, Dan Busbridge, Luca Zappella

    Abstract: The mechanisms behind the success of multi-view self-supervised learning (MVSSL) are not yet fully understood. Contrastive MVSSL methods have been studied through the lens of InfoNCE, a lower bound of the Mutual Information (MI). However, the relation between other MVSSL methods and MI remains unclear. We consider a different lower bound on the MI consisting of an entropy and a reconstruction term… ▽ More

    Submitted 9 December, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: 18 pages: 9 of main text, 2 of references, and 7 of supplementary material [Updated typo in page 6 (Section 3.2)]. Appears in the proceedings of ICML 2023

  4. arXiv:2306.12214  [pdf, other

    stat.ML cs.LG

    More PAC-Bayes bounds: From bounded losses, to losses with general tail behaviors, to anytime validity

    Authors: Borja Rodríguez-Gálvez, Ragnar Thobaben, Mikael Skoglund

    Abstract: In this paper, we present new high-probability PAC-Bayes bounds for different types of losses. Firstly, for losses with a bounded range, we recover a strengthened version of Catoni's bound that holds uniformly for all parameter values. This leads to new fast-rate and mixed-rate bounds that are interpretable and tighter than previous bounds in the literature. In particular, the fast-rate bound is e… ▽ More

    Submitted 4 June, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: 43 pages: ~20 of main text, ~6.5 of references, and ~17.5 of appendices. Published at JMLR

  5. arXiv:2304.13593  [pdf, ps, other

    stat.ML cs.LG

    Thompson Sampling Regret Bounds for Contextual Bandits with sub-Gaussian rewards

    Authors: Amaury Gouverneur, Borja Rodríguez-Gálvez, Tobias J. Oechtering, Mikael Skoglund

    Abstract: In this work, we study the performance of the Thompson Sampling algorithm for Contextual Bandit problems based on the framework introduced by Neu et al. and their concept of lifted information ratio. First, we prove a comprehensive bound on the Thompson Sampling expected cumulative regret that depends on the mutual information of the environment parameters and the history. Then, we introduce new b… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: 8 pages: 5 of the main text, 1 of references, and 2 of appendices. Accepted to ISIT 2023

  6. arXiv:2212.13556  [pdf, other

    cs.LG stat.ML

    Limitations of Information-Theoretic Generalization Bounds for Gradient Descent Methods in Stochastic Convex Optimization

    Authors: Mahdi Haghifam, Borja Rodríguez-Gálvez, Ragnar Thobaben, Mikael Skoglund, Daniel M. Roy, Gintare Karolina Dziugaite

    Abstract: To date, no "information-theoretic" frameworks for reasoning about generalization error have been shown to establish minimax rates for gradient descent in the setting of stochastic convex optimization. In this work, we consider the prospect of establishing such rates via several existing information-theoretic frameworks: input-output mutual information bounds, conditional mutual information bounds… ▽ More

    Submitted 13 July, 2023; v1 submitted 27 December, 2022; originally announced December 2022.

    Comments: 49 pages, 2 figures. This version corrects a mistake in the proof of Theorem 17. Proc. International Conference on Algorithmic Learning Theory (ALT), 2023

  7. arXiv:2207.08735  [pdf, ps, other

    cs.LG stat.ML

    An Information-Theoretic Analysis of Bayesian Reinforcement Learning

    Authors: Amaury Gouverneur, Borja Rodríguez-Gálvez, Tobias J. Oechtering, Mikael Skoglund

    Abstract: Building on the framework introduced by Xu and Raginksy [1] for supervised learning problems, we study the best achievable performance for model-based Bayesian reinforcement learning problems. With this purpose, we define minimum Bayesian regret (MBR) as the difference between the maximum expected cumulative reward obtainable either by learning from the collected data or by knowing the environment… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

    Comments: 10 pages: 6 of the main text, 1 of references, and 3 of appendices

  8. arXiv:2109.08604  [pdf, other

    cs.LG stat.ML

    Enforcing fairness in private federated learning via the modified method of differential multipliers

    Authors: Borja Rodríguez-Gálvez, Filip Granqvist, Rogier van Dalen, Matt Seigel

    Abstract: Federated learning with differential privacy, or private federated learning, provides a strategy to train machine learning models while respecting users' privacy. However, differential privacy can disproportionately degrade the performance of the models on under-represented groups, as these parts of the distribution are difficult to learn in the presence of noise. Existing approaches for enforcing… ▽ More

    Submitted 15 April, 2022; v1 submitted 17 September, 2021; originally announced September 2021.

    Comments: Presented at PriML workshop at NeurIPS 2021. 20 pages: 11 of main content, 3 of references, and 6 of supplementary material

  9. arXiv:2101.09315  [pdf, other

    stat.ML cs.IT cs.LG

    Tighter expected generalization error bounds via Wasserstein distance

    Authors: Borja Rodríguez-Gálvez, Germán Bassi, Ragnar Thobaben, Mikael Skoglund

    Abstract: This work presents several expected generalization error bounds based on the Wasserstein distance. More specifically, it introduces full-dataset, single-letter, and random-subset bounds, and their analogues in the randomized subsample setting from Steinke and Zakynthinou [1]. Moreover, when the loss function is bounded and the geometry of the space is ignored by the choice of the metric in the Was… ▽ More

    Submitted 25 March, 2022; v1 submitted 22 January, 2021; originally announced January 2021.

    Comments: 29 pages: 9 of the main text, 3 of references, and 17 of appendices. Presented at ITR3 at ICML 2021. Accepted at NeurIPS 2021

  10. On Random Subset Generalization Error Bounds and the Stochastic Gradient Langevin Dynamics Algorithm

    Authors: Borja Rodríguez-Gálvez, Germán Bassi, Ragnar Thobaben, Mikael Skoglund

    Abstract: In this work, we unify several expected generalization error bounds based on random subsets using the framework developed by Hellström and Durisi [1]. First, we recover the bounds based on the individual sample mutual information from Bu et al. [2] and on a random subset of the dataset from Negrea et al. [3]. Then, we introduce their new, analogous bounds in the randomized subsample setting from S… ▽ More

    Submitted 16 January, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: To appear in the Information Theory Workshop (ITW 2020) conference. 10 pages, 5 of the main text, and 5 of appendices

  11. arXiv:2006.06332  [pdf, other

    stat.ML cs.IT cs.LG

    A Variational Approach to Privacy and Fairness

    Authors: Borja Rodríguez-Gálvez, Ragnar Thobaben, Mikael Skoglund

    Abstract: In this article, we propose a new variational approach to learn private and/or fair representations. This approach is based on the Lagrangians of a new formulation of the privacy and fairness optimization problems that we propose. In this formulation, we aim to generate representations of the data that keep a prescribed level of the relevant information that is not shared by the private or sensiti… ▽ More

    Submitted 6 September, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Accepted at the ITW 2021 conference. Previously presented at the PPAI-21 workshop from the AAAI-21 conference. Content distribution: 5 pages of main content + 2 pages of references + 11 pages of supplementary material

  12. arXiv:2005.05889  [pdf, other

    cs.IT cs.LG stat.ML

    Upper Bounds on the Generalization Error of Private Algorithms for Discrete Data

    Authors: Borja Rodríguez-Gálvez, Germán Bassi, Mikael Skoglund

    Abstract: In this work, we study the generalization capability of algorithms from an information-theoretic perspective. It has been shown that the expected generalization error of an algorithm is bounded from above by a function of the relative entropy between the conditional probability distribution of the algorithm's output hypothesis, given the dataset with which it was trained, and its marginal probabil… ▽ More

    Submitted 13 September, 2021; v1 submitted 12 May, 2020; originally announced May 2020.

    Comments: 18 pages (double column), 4 figures, accepted at IEEE Transactions on Information Theory

    Journal ref: IEEE Trans. Inf. Theory, vol. 67, no. 11, pp. 7362-7379, Nov. 2021

  13. arXiv:1911.11000  [pdf, other

    stat.ML cs.IT cs.LG

    The Convex Information Bottleneck Lagrangian

    Authors: Borja Rodríguez-Gálvez, Ragnar Thobaben, Mikael Skoglund

    Abstract: The information bottleneck (IB) problem tackles the issue of obtaining relevant compressed representations $T$ of some random variable $X$ for the task of predicting $Y$. It is defined as a constrained optimization problem which maximizes the information the representation has about the task, $I(T;Y)$, while ensuring that a certain level of compression $r$ is achieved (i.e., $ I(X;T) \leq r$). For… ▽ More

    Submitted 10 January, 2020; v1 submitted 25 November, 2019; originally announced November 2019.

    Comments: 10 pages of main text, 2 page of references and 14 pages of appendices with the proofs, experimental details and caveats