Skip to main content

Showing 1–10 of 10 results for author: Masegosa, A R

Searching in archive stat. Search in all archives.
.
  1. arXiv:2401.01148  [pdf, ps, other

    stat.ML cs.LG

    PAC-Bayes-Chernoff bounds for unbounded losses

    Authors: Ioar Casado, Luis A. Ortega, Andrés R. Masegosa, Aritz Pérez

    Abstract: We introduce a new PAC-Bayes oracle bound for unbounded losses. This result can be understood as a PAC-Bayesian version of the Cramér-Chernoff bound. The proof technique relies on controlling the tails of certain random variables involving the Cramér transform of the loss. We highlight several applications of the main theorem. First, we show that our result naturally allows exact optimization of t… ▽ More

    Submitted 6 February, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Comments: Updated Section 5

  2. arXiv:2310.01189  [pdf, other

    stat.ML cs.LG

    If there is no underfitting, there is no Cold Posterior Effect

    Authors: Yijie Zhang, Yi-Shan Wu, Luis A. Ortega, Andrés R. Masegosa

    Abstract: The cold posterior effect (CPE) (Wenzel et al., 2020) in Bayesian deep learning shows that, for posteriors with a temperature $T<1$, the resulting posterior predictive could have better performances than the Bayesian posterior ($T=1$). As the Bayesian posterior is known to be optimal under perfect model specification, many recent works have studied the presence of CPE as a model misspecification p… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 9 pages, 3 figures, ICLR 2024

  3. arXiv:2306.10947  [pdf, other

    cs.LG math.ST stat.ML

    PAC-Chernoff Bounds: Understanding Generalization in the Interpolation Regime

    Authors: Andrés R. Masegosa, Luis A. Ortega

    Abstract: This paper introduces a distribution-dependent PAC-Chernoff bound that exhibits perfect tightness for interpolators, even within over-parameterized model classes. This bound, which relies on basic principles of Large Deviation Theory, defines a natural measure of the smoothness of a model, characterized by simple real-valued functions. Building upon this bound and the new concept of smoothness, we… ▽ More

    Submitted 29 April, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 56 pages, 11 figures, Pre-print

  4. arXiv:2106.13624  [pdf, other

    cs.LG stat.ML

    Chebyshev-Cantelli PAC-Bayes-Bennett Inequality for the Weighted Majority Vote

    Authors: Yi-Shan Wu, Andrés R. Masegosa, Stephan S. Lorenzen, Christian Igel, Yevgeny Seldin

    Abstract: We present a new second-order oracle bound for the expected risk of a weighted majority vote. The bound is based on a novel parametric form of the Chebyshev- Cantelli inequality (a.k.a. one-sided Chebyshev's), which is amenable to efficient minimization. The new form resolves the optimization challenge faced by prior oracle bounds based on the Chebyshev-Cantelli inequality, the C-bounds [Germain e… ▽ More

    Submitted 17 January, 2023; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: aligned with the camera-ready version published at NeurIPS 2021

  5. arXiv:2007.13532  [pdf, other

    cs.LG stat.ML

    Second Order PAC-Bayesian Bounds for the Weighted Majority Vote

    Authors: Andrés R. Masegosa, Stephan S. Lorenzen, Christian Igel, Yevgeny Seldin

    Abstract: We present a novel analysis of the expected risk of weighted majority vote in multiclass classification. The analysis takes correlation of predictions by ensemble members into account and provides a bound that is amenable to efficient minimization, which yields improved weighting for the majority vote. We also provide a specialized version of our bound for binary classification, which allows to ex… ▽ More

    Submitted 17 December, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

  6. arXiv:1912.08335  [pdf, other

    cs.LG math.ST stat.ML

    Learning under Model Misspecification: Applications to Variational and Ensemble methods

    Authors: Andres R. Masegosa

    Abstract: Virtually any model we use in machine learning to make predictions does not perfectly represent reality. So, most of the learning happens under model misspecification. In this work, we present a novel analysis of the generalization performance of Bayesian model averaging under model misspecification and i.i.d. data using a new family of second-order PAC-Bayes bounds. This analysis shows, in simple… ▽ More

    Submitted 22 October, 2020; v1 submitted 17 December, 2019; originally announced December 2019.

    Comments: Camera-Ready Version. NeurIPS 2020. Minor changes

  7. arXiv:1908.11161  [pdf, other

    cs.LG stat.ML

    InferPy: Probabilistic Modeling with Deep Neural Networks Made Easy

    Authors: Javier Cózar, Rafael Cabañas, Antonio Salmerón, Andrés R. Masegosa

    Abstract: InferPy is a Python package for probabilistic modeling with deep neural networks. It defines a user-friendly API that trades-off model complexity with ease of use, unlike other libraries whose focus is on dealing with very general probabilistic models at the cost of having a more complex API. In particular, this package allows to define, learn and evaluate general hierarchical probabilistic models… ▽ More

    Submitted 12 February, 2020; v1 submitted 29 August, 2019; originally announced August 2019.

    Comments: 5 pages limit (paper submitted to an original software publication track). This paper briefly describes a scientific software

  8. arXiv:1908.03442  [pdf, other

    cs.LG math.ST stat.ML

    Probabilistic Models with Deep Neural Networks

    Authors: Andrés R. Masegosa, Rafael Cabañas, Helge Langseth, Thomas D. Nielsen, Antonio Salmerón

    Abstract: Recent advances in statistical inference have significantly expanded the toolbox of probabilistic modeling. Historically, probabilistic modeling has been constrained to (i) very restricted model classes where exact or approximate probabilistic inference were feasible, and (ii) small or medium-sized data sets which fit within the main memory of the computer. However, developments in variational inf… ▽ More

    Submitted 2 October, 2019; v1 submitted 9 August, 2019; originally announced August 2019.

  9. AMIDST: a Java Toolbox for Scalable Probabilistic Machine Learning

    Authors: Andrés R. Masegosa, Ana M. Martínez, Darío Ramos-López, Rafael Cabañas, Antonio Salmerón, Thomas D. Nielsen, Helge Langseth, Anders L. Madsen

    Abstract: The AMIDST Toolbox is a software for scalable probabilistic machine learning with a spe- cial focus on (massive) streaming data. The toolbox supports a flexible modeling language based on probabilistic graphical models with latent variables and temporal dependencies. The specified models can be learnt from large data sets using parallel or distributed implementa- tions of Bayesian learning algorit… ▽ More

    Submitted 4 April, 2017; originally announced April 2017.

    ACM Class: I.2.6

  10. arXiv:1604.07990  [pdf, other

    cs.AI cs.DC stat.ML

    Probabilistic Graphical Models on Multi-Core CPUs using Java 8

    Authors: Andres R. Masegosa, Ana M. Martinez, Hanen Borchani

    Abstract: In this paper, we discuss software design issues related to the development of parallel computational intelligence algorithms on multi-core CPUs, using the new Java 8 functional programming features. In particular, we focus on probabilistic graphical models (PGMs) and present the parallelisation of a collection of algorithms that deal with inference and learning of PGMs from data. Namely, maximum… ▽ More

    Submitted 27 April, 2016; originally announced April 2016.

    Comments: Pre-print version of the paper presented in the special issue on Computational Intelligence Software at IEEE Computational Intelligence Magazine journal

    Journal ref: IEEE Computational Intelligence Magazine, 11(2), 41-54. 2016