Showing 1–15 of 15 results for author: Vergari, A

  1. arXiv:2404.08458  [pdf, other]

    stat.ML cs.AI cs.LG

    On the Independence Assumption in Neurosymbolic Learning

    Authors: Emile van Krieken, Pasquale Minervini, Edoardo M. Ponti, Antonio Vergari

    Abstract: State-of-the-art neurosymbolic learning systems use probabilistic reasoning to guide neural networks towards predictions that conform to logical constraints over symbols. Many such systems assume that the probabilities of the considered symbols are conditionally independent given the input to simplify learning and reasoning. We study and criticise this assumption, highlighting how it can hinder op…

    Submitted 7 June, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted at ICML 2024

  2. arXiv:2305.19951  [pdf, other]

    cs.LG stat.ML

    Not All Neuro-Symbolic Concepts Are Created Equal: Analysis and Mitigation of Reasoning Shortcuts

    Authors: Emanuele Marconato, Stefano Teso, Antonio Vergari, Andrea Passerini

    Abstract: Neuro-Symbolic (NeSy) predictive models hold the promise of improved compliance with given constraints, systematic generalization, and interpretability, as they allow one to infer labels that are consistent with some prior knowledge by reasoning over high-level concepts extracted from sub-symbolic inputs. It was recently shown that NeSy predictors are affected by reasoning shortcuts: they can attain h…

    Submitted 18 December, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  3. arXiv:2202.08566  [pdf, ps, other]

    cs.LG stat.ML

    Efficient and Reliable Probabilistic Interactive Learning with Structured Outputs

    Authors: Stefano Teso, Antonio Vergari

    Abstract: In this position paper, we study interactive learning for structured output spaces, with a focus on active learning, in which labels are unknown and must be acquired, and on skeptical learning, in which the labels are noisy and may need relabeling. These scenarios require expressive models that guarantee reliable and efficient computation of probabilistic quantities to measure uncertainty. We iden…

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: Accepted at the AAAI-22 Workshop on Interactive Machine Learning

  4. arXiv:2102.06137  [pdf, other]

    stat.ML cs.AI cs.DS cs.LG

    A Compositional Atlas of Tractable Circuit Operations: From Simple Transformations to Complex Information-Theoretic Queries

    Authors: Antonio Vergari, YooJung Choi, Anji Liu, Stefano Teso, Guy Van den Broeck

    Abstract: Circuit representations are becoming the lingua franca to express and reason about tractable generative and discriminative models. In this paper, we show how complex inference scenarios for these models that commonly arise in machine learning -- from computing the expectations of decision tree ensembles to information-theoretic divergences of deep mixture models -- can be represented in terms of t…

    Submitted 11 February, 2021; originally announced February 2021.

    ACM Class: G.3; I.2.4; I.2.6

  5. arXiv:2006.16341  [pdf, other]

    cs.LG cs.AI stat.ML

    Handling Missing Data in Decision Trees: A Probabilistic Approach

    Authors: Pasha Khosravi, Antonio Vergari, YooJung Choi, Yitao Liang, Guy Van den Broeck

    Abstract: Decision trees are a popular family of models due to their attractive properties such as interpretability and ability to handle heterogeneous data. Concurrently, missing data is a prevalent occurrence that hinders performance of machine learning models. As such, handling missing data in decision trees is a well studied problem. In this paper, we tackle this problem by taking a probabilistic approa…

    Submitted 29 June, 2020; originally announced June 2020.

  6. arXiv:2004.06231  [pdf, other]

    cs.LG stat.ML

    Einsum Networks: Fast and Scalable Learning of Tractable Probabilistic Circuits

    Authors: Robert Peharz, Steven Lang, Antonio Vergari, Karl Stelzner, Alejandro Molina, Martin Trapp, Guy Van den Broeck, Kristian Kersting, Zoubin Ghahramani

    Abstract: Probabilistic circuits (PCs) are a promising avenue for probabilistic modeling, as they permit a wide range of exact and efficient inference routines. Recent "deep-learning-style" implementations of PCs strive for a better scalability, but are still difficult to train on real-world data, due to their sparsely connected computational graphs. In this paper, we propose Einsum Networks (EiNets), a n…

    Submitted 13 April, 2020; originally announced April 2020.

  7. arXiv:1910.02182  [pdf, other]

    cs.LG cs.AI stat.ML

    On Tractable Computation of Expected Predictions

    Authors: Pasha Khosravi, YooJung Choi, Yitao Liang, Antonio Vergari, Guy Van den Broeck

    Abstract: Computing expected predictions of discriminative models is a fundamental task in machine learning that appears in many interesting applications such as fairness, handling missing values, and data analysis. Unfortunately, computing expectations of a discriminative model with respect to a probability distribution defined by an arbitrary generative model has been proven to be hard in general. In fact…

    Submitted 31 October, 2019; v1 submitted 4 October, 2019; originally announced October 2019.

  8. arXiv:1905.08550  [pdf, other]

    cs.LG stat.ML

    Conditional Sum-Product Networks: Imposing Structure on Deep Probabilistic Architectures

    Authors: Xiaoting Shao, Alejandro Molina, Antonio Vergari, Karl Stelzner, Robert Peharz, Thomas Liebig, Kristian Kersting

    Abstract: Probabilistic graphical models are a central tool in AI; however, they are generally not as expressive as deep neural models, and inference is notoriously hard and slow. In contrast, deep probabilistic models such as sum-product networks (SPNs) capture joint distributions in a tractable fashion, but still lack the expressive power of intractable models based on deep neural networks. Therefore, we…

    Submitted 29 September, 2019; v1 submitted 21 May, 2019; originally announced May 2019.

    Comments: 13 pages, 6 figures

  9. arXiv:1903.12436  [pdf, other]

    cs.LG stat.ML

    From Variational to Deterministic Autoencoders

    Authors: Partha Ghosh, Mehdi S. M. Sajjadi, Antonio Vergari, Michael Black, Bernhard Schölkopf

    Abstract: Variational Autoencoders (VAEs) provide a theoretically-backed and popular framework for deep generative models. However, learning a VAE from data poses still unanswered theoretical questions and considerable practical challenges. In this work, we propose an alternative framework for generative modeling that is simpler, easier to train, and deterministic, yet has many of the advantages of VAEs. We…

    Submitted 29 May, 2020; v1 submitted 29 March, 2019; originally announced March 2019.

    Comments: Partha Ghosh and Mehdi S. M. Sajjadi contributed equally to this work

  10. arXiv:1901.03704  [pdf, other]

    cs.LG stat.ML

    SPFlow: An Easy and Extensible Library for Deep Probabilistic Learning using Sum-Product Networks

    Authors: Alejandro Molina, Antonio Vergari, Karl Stelzner, Robert Peharz, Pranav Subramani, Nicola Di Mauro, Pascal Poupart, Kristian Kersting

    Abstract: We introduce SPFlow, an open-source Python library providing a simple interface to inference, learning and manipulation routines for deep and tractable probabilistic models called Sum-Product Networks (SPNs). The library allows one to quickly create SPNs both from data and through a domain specific language (DSL). It efficiently implements several probabilistic inference routines like computing ma…

    Submitted 11 January, 2019; originally announced January 2019.

    Comments: 4 pages, 1 figure, code

  11. arXiv:1807.09306  [pdf, other]

    stat.ML cs.LG

    Automatic Bayesian Density Analysis

    Authors: Antonio Vergari, Alejandro Molina, Robert Peharz, Zoubin Ghahramani, Kristian Kersting, Isabel Valera

    Abstract: Making sense of a dataset in an automatic and unsupervised fashion is a challenging problem in statistics and AI. Classical approaches for exploratory data analysis are usually not flexible enough to deal with the uncertainty inherent to real-world data: they are often restricted to fixed latent interaction models and homogeneous likelihoods; they are sensitive to missing, corrupt and anomalous…

    Submitted 10 February, 2019; v1 submitted 24 July, 2018; originally announced July 2018.

    Comments: In proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19)

  12. arXiv:1806.01910  [pdf, other]

    cs.LG cs.AI stat.ML

    Probabilistic Deep Learning using Random Sum-Product Networks

    Authors: Robert Peharz, Antonio Vergari, Karl Stelzner, Alejandro Molina, Martin Trapp, Kristian Kersting, Zoubin Ghahramani

    Abstract: The need for consistent treatment of uncertainty has recently triggered increased interest in probabilistic deep learning methods. However, most current approaches have severe limitations when it comes to inference, since many of these models do not even permit to evaluate exact data likelihoods. Sum-product networks (SPNs), on the other hand, are an excellent architecture in that regard, as they…

    Submitted 22 June, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

  13. arXiv:1710.03297  [pdf, other]

    cs.LG stat.ML

    Sum-Product Networks for Hybrid Domains

    Authors: Alejandro Molina, Antonio Vergari, Nicola Di Mauro, Sriraam Natarajan, Floriana Esposito, Kristian Kersting

    Abstract: While all kinds of mixed data (from personal data, over panel and scientific data, to public and commercial data) are collected and stored, building probabilistic graphical models for these hybrid domains becomes more difficult. Users spend significant amounts of time in identifying the parametric form of the random variables (Gaussian, Poisson, Logit, etc.) involved and learning the mixed models.…

    Submitted 6 November, 2017; v1 submitted 9 October, 2017; originally announced October 2017.

    Comments: 16 Pages, 5 Figures

  14. Visualizing and Understanding Sum-Product Networks

    Authors: Antonio Vergari, Nicola Di Mauro, Floriana Esposito

    Abstract: Sum-Product Networks (SPNs) are recently introduced deep tractable probabilistic models by which several kinds of inference queries can be answered exactly and in a tractable time. Up to now, they have been largely used as black box density estimators, assessed only by comparing their likelihood scores. In this paper we explore and exploit the inner representations learned by SPNs. We do this…

    Submitted 24 August, 2018; v1 submitted 29 August, 2016; originally announced August 2016.

    Comments: Machine Learning Journal paper (First Online), 24 pages

  15. arXiv:1608.02341  [pdf, other]

    cs.LG cs.AI stat.ML

    Towards Representation Learning with Tractable Probabilistic Models

    Authors: Antonio Vergari, Nicola Di Mauro, Floriana Esposito

    Abstract: Probabilistic models learned as density estimators can be exploited in representation learning beside being toolboxes used to answer inference queries only. However, how to extract useful representations highly depends on the particular model involved. We argue that tractable inference, i.e. inference that can be computed in polynomial time, can enable general schemes to extract features from blac…

    Submitted 8 August, 2016; originally announced August 2016.

    Comments: 10 pages, submitted to ECML-PKDD 2016 Doctoral Consortium