Showing 1–5 of 5 results for author: Federici, M

  1. arXiv:2310.01808

    stat.ML cs.LG stat.CO

    Simulation-based Inference with the Generalized Kullback-Leibler Divergence

    Authors: Benjamin Kurt Miller, Marco Federici, Christoph Weniger, Patrick Forré

    Abstract: In Simulation-based Inference, the goal is to solve the inverse problem when the likelihood is only known implicitly. Neural Posterior Estimation commonly fits a normalized density estimator as a surrogate model for the posterior. This formulation cannot easily fit unnormalized surrogates because it optimizes the Kullback-Leibler divergence. We propose to optimize a generalized Kullback-Leibler di…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted at Synergy of Scientific and Machine Learning Modeling ICML 2023 Workshop https://syns-ml.github.io/2023/contributions/

  2. arXiv:2306.00608

    stat.ML cs.IT cs.LG

    On the Effectiveness of Hybrid Mutual Information Estimation

    Authors: Marco Federici, David Ruhe, Patrick Forré

    Abstract: Estimating the mutual information from samples from a joint distribution is a challenging problem in both science and engineering. In this work, we realize a variational bound that generalizes both discriminative and generative approaches. Using this bound, we propose a hybrid method to mitigate their respective shortcomings. Further, we propose Predictive Quantization (PQ): a simple generative me…

    Submitted 2 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

  3. arXiv:2107.09301

    stat.ML cs.LG

    A Bayesian Approach to Invariant Deep Neural Networks

    Authors: Nikolaos Mourdoukoutas, Marco Federici, Georges Pantalos, Mark van der Wilk, Vincent Fortuin

    Abstract: We propose a novel Bayesian neural network architecture that can learn invariances from data alone by inferring a posterior distribution over different weight-sharing schemes. We show that our model outperforms other non-invariant architectures, when trained on datasets that contain specific invariances. The same holds true when no data augmentation is performed.

    Submitted 2 November, 2021; v1 submitted 20 July, 2021; originally announced July 2021.

    Comments: 8 pages, 3 figures, To be published in ICML UDL 2021

  4. arXiv:2002.07017

    cs.LG stat.ML

    Learning Robust Representations via Multi-View Information Bottleneck

    Authors: Marco Federici, Anjan Dutta, Patrick Forré, Nate Kushman, Zeynep Akata

    Abstract: The information bottleneck principle provides an information-theoretic method for representation learning, by training an encoder to retain all information which is relevant for predicting the label while minimizing the amount of other, excess information in the representation. The original formulation, however, requires labeled data to identify the superfluous information. In this work, we extend…

    Submitted 18 February, 2020; v1 submitted 17 February, 2020; originally announced February 2020.

  5. arXiv:1711.06494

    stat.ML

    Improved Bayesian Compression

    Authors: Marco Federici, Karen Ullrich, Max Welling

    Abstract: Compression of Neural Networks (NN) has become a highly studied topic in recent years. The main reason for this is the demand for industrial scale usage of NNs such as deploying them on mobile devices, storing them efficiently, transmitting them via band-limited channels and most importantly doing inference at scale. In this work, we propose to join the Soft-Weight Sharing and Variational Dropout…

    Submitted 7 December, 2017; v1 submitted 17 November, 2017; originally announced November 2017.