Skip to main content

Showing 1–6 of 6 results for author: Ortega, L A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2401.01148  [pdf, ps, other

    stat.ML cs.LG

    PAC-Bayes-Chernoff bounds for unbounded losses

    Authors: Ioar Casado, Luis A. Ortega, Andrés R. Masegosa, Aritz Pérez

    Abstract: We introduce a new PAC-Bayes oracle bound for unbounded losses. This result can be understood as a PAC-Bayesian version of the Cramér-Chernoff bound. The proof technique relies on controlling the tails of certain random variables involving the Cramér transform of the loss. We highlight several applications of the main theorem. First, we show that our result naturally allows exact optimization of t… ▽ More

    Submitted 6 February, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Comments: Updated Section 5

  2. arXiv:2310.01189  [pdf, other

    stat.ML cs.LG

    If there is no underfitting, there is no Cold Posterior Effect

    Authors: Yijie Zhang, Yi-Shan Wu, Luis A. Ortega, Andrés R. Masegosa

    Abstract: The cold posterior effect (CPE) (Wenzel et al., 2020) in Bayesian deep learning shows that, for posteriors with a temperature $T<1$, the resulting posterior predictive could have better performances than the Bayesian posterior ($T=1$). As the Bayesian posterior is known to be optimal under perfect model specification, many recent works have studied the presence of CPE as a model misspecification p… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 9 pages, 3 figures, ICLR 2024

  3. arXiv:2306.10947  [pdf, other

    cs.LG math.ST stat.ML

    PAC-Chernoff Bounds: Understanding Generalization in the Interpolation Regime

    Authors: Andrés R. Masegosa, Luis A. Ortega

    Abstract: This paper introduces a distribution-dependent PAC-Chernoff bound that exhibits perfect tightness for interpolators, even within over-parameterized model classes. This bound, which relies on basic principles of Large Deviation Theory, defines a natural measure of the smoothness of a model, characterized by simple real-valued functions. Building upon this bound and the new concept of smoothness, we… ▽ More

    Submitted 29 April, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 56 pages, 11 figures, Pre-print

  4. arXiv:2302.12565  [pdf, other

    stat.ML cs.LG

    Variational Linearized Laplace Approximation for Bayesian Deep Learning

    Authors: Luis A. Ortega, Simón Rodríguez Santana, Daniel Hernández-Lobato

    Abstract: The Linearized Laplace Approximation (LLA) has been recently used to perform uncertainty estimation on the predictions of pre-trained deep neural networks (DNNs). However, its widespread application is hindered by significant computational costs, particularly in scenarios with a large number of training points or DNN parameters. Consequently, additional approximations of LLA, such as Kronecker-fac… ▽ More

    Submitted 22 May, 2024; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: 22 pages, 8 figures, ICML 2024

    Journal ref: PMLR 235 (2024)

  5. arXiv:2207.10673  [pdf, other

    stat.ML cs.LG stat.CO

    Correcting Model Bias with Sparse Implicit Processes

    Authors: Simón Rodríguez Santana, Luis A. Ortega, Daniel Hernández-Lobato, Bryan Zaldívar

    Abstract: Model selection in machine learning (ML) is a crucial part of the Bayesian learning procedure. Model choice may impose strong biases on the resulting predictions, which can hinder the performance of methods such as Bayesian neural networks and neural samplers. On the other hand, newly proposed approaches for Bayesian ML exploit features of approximate inference in function space with implicit stoc… ▽ More

    Submitted 8 August, 2022; v1 submitted 21 July, 2022; originally announced July 2022.

    Comments: 4 pages, 1 double figure. Included in ICML 2022 workshop "Beyond Bayes: Paths Towards Universal Reasoning Systems". Extension of previous work on Sparse Implicit Processes (arXiv:2110.07618)

  6. arXiv:2206.06720  [pdf, other

    stat.ML cs.LG

    Deep Variational Implicit Processes

    Authors: Luis A. Ortega, Simón Rodríguez Santana, Daniel Hernández-Lobato

    Abstract: Implicit processes (IPs) are a generalization of Gaussian processes (GPs). IPs may lack a closed-form expression but are easy to sample from. Examples include, among others, Bayesian neural networks or neural samplers. IPs can be used as priors over functions, resulting in flexible models with well-calibrated prediction uncertainty estimates. Methods based on IPs usually carry out function-space a… ▽ More

    Submitted 16 February, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: 19 pages, 6 figures, ICLR 2023