Skip to main content

Showing 1–50 of 86 results for author: Broeck, G V d

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13892  [pdf, other

    cs.CL

    Adaptable Logical Control for Large Language Models

    Authors: Honghua Zhang, Po-Nien Kung, Masahiro Yoshida, Guy Van den Broeck, Nanyun Peng

    Abstract: Despite the success of Large Language Models (LLMs) on various tasks following human instructions, controlling model generation at inference time poses a persistent challenge. In this paper, we introduce Ctrl-G, an adaptable framework that facilitates tractable and flexible control of LLM generation to reliably follow logical constraints. Ctrl-G combines any production-ready LLM with a Hidden Mark… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2406.00766  [pdf, other

    cs.LG

    Scaling Tractable Probabilistic Circuits: A Systems Perspective

    Authors: Anji Liu, Kareem Ahmed, Guy Van den Broeck

    Abstract: Probabilistic Circuits (PCs) are a general framework for tractable deep generative models, which support exact and efficient probabilistic inference on their learned distributions. Recent modeling and training advancements have enabled their application to complex real-world tasks. However, the time and memory inefficiency of existing PC implementations hinders further scaling up. This paper propo… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  3. arXiv:2405.15506  [pdf, other

    cs.LG

    Learning to Discretize Denoising Diffusion ODEs

    Authors: Vinh Tong, Anji Liu, Trung-Dung Hoang, Guy Van den Broeck, Mathias Niepert

    Abstract: Diffusion Probabilistic Models (DPMs) are powerful generative models showing competitive performance in various domains, including image synthesis and 3D point cloud generation. However, sampling from pre-trained DPMs involves multiple neural function evaluations (NFE) to transform Gaussian noise samples into images, resulting in higher computational costs compared to single-step generative models… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  4. arXiv:2405.07387  [pdf, other

    cs.LG

    Semantic Loss Functions for Neuro-Symbolic Structured Prediction

    Authors: Kareem Ahmed, Stefano Teso, Paolo Morettin, Luca Di Liello, Pierfrancesco Ardino, Jacopo Gobbi, Yitao Liang, Eric Wang, Kai-Wei Chang, Andrea Passerini, Guy Van den Broeck

    Abstract: Structured output prediction problems are ubiquitous in machine learning. The prominent approach leverages neural networks as powerful feature extractors, otherwise assuming the independence of the outputs. These outputs, however, jointly encode an object, e.g. a path in a graph, and are therefore related through the structure underlying the output space. We discuss the semantic loss, which inject… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: Preprint of Ch. 22 "Semantic Loss Functions for Neuro-Symbolic Structured Prediction" in "Compendium of Neurosymbolic Artificial Intelligence", https://ebooks.iospress.nl/ISBN/978-1-64368-406-2. arXiv admin note: substantial text overlap with arXiv:2201.11250, arXiv:2007.13197

  5. arXiv:2404.09674  [pdf, ps, other

    cs.DS cs.DB cs.FL

    A Circus of Circuits: Connections Between Decision Diagrams, Circuits, and Automata

    Authors: Antoine Amarilli, Marcelo Arenas, YooJung Choi, Mikaël Monet, Guy Van den Broeck, Benjie Wang

    Abstract: This document is an introduction to two related formalisms to define Boolean functions: binary decision diagrams, and Boolean circuits. It presents these formalisms and several of their variants studied in the setting of knowledge compilation. Last, it explains how these formalisms can be connected to the notions of automata over words and trees.

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 26 pages

  6. arXiv:2404.09529  [pdf, other

    cs.LG cs.AI cs.CL

    Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models

    Authors: Siyan Zhao, Daniel Israel, Guy Van den Broeck, Aditya Grover

    Abstract: During inference for transformer-based large language models (LLM), prefilling is the computation of the key-value (KV) cache for input tokens in the prompt prior to autoregressive generation. For longer input prompt lengths, prefilling will incur a significant overhead on decoding time. In this work, we highlight the following pitfall of prefilling: for batches containing high-varying prompt leng… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 18 pages, code in https://github.com/siyan-zhao/prepacking

  7. arXiv:2403.00025  [pdf, ps, other

    cs.LG cs.AI

    On the Challenges and Opportunities in Generative AI

    Authors: Laura Manduchi, Kushagra Pandey, Robert Bamler, Ryan Cotterell, Sina Däubener, Sophie Fellenz, Asja Fischer, Thomas Gärtner, Matthias Kirchler, Marius Kloft, Yingzhen Li, Christoph Lippert, Gerard de Melo, Eric Nalisnick, Björn Ommer, Rajesh Ranganath, Maja Rudolph, Karen Ullrich, Guy Van den Broeck, Julia E Vogt, Yixin Wang, Florian Wenzel, Frank Wood, Stephan Mandt, Vincent Fortuin

    Abstract: The field of deep generative modeling has grown rapidly and consistently over the years. With the availability of massive amounts of training data coupled with advances in scalable unsupervised learning paradigms, recent large-scale generative models show tremendous promise in synthesizing high-resolution images and text, as well as structured data such as videos and molecules. However, we argue t… ▽ More

    Submitted 28 February, 2024; originally announced March 2024.

  8. arXiv:2402.09085  [pdf, other

    cs.AI

    Polynomial Semantics of Tractable Probabilistic Circuits

    Authors: Oliver Broadrick, Honghua Zhang, Guy Van den Broeck

    Abstract: Probabilistic circuits compute multilinear polynomials that represent multivariate probability distributions. They are tractable models that support efficient marginal inference. However, various polynomial semantics have been considered in the literature (e.g., network polynomials, likelihood polynomials, generating functions, and Fourier transforms). The relationships between circuit representat… ▽ More

    Submitted 28 April, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  9. arXiv:2401.03349  [pdf, other

    cs.CV cs.LG

    Image Inpainting via Tractable Steering of Diffusion Models

    Authors: Anji Liu, Mathias Niepert, Guy Van den Broeck

    Abstract: Diffusion models are the current state of the art for generating photorealistic images. Controlling the sampling process for constrained image generation tasks such as inpainting, however, remains challenging since exact conditioning on such constraints is intractable. While existing methods use various techniques to approximate the constrained posterior, this paper proposes to exploit the ability… ▽ More

    Submitted 28 November, 2023; originally announced January 2024.

  10. Bit Blasting Probabilistic Programs

    Authors: Poorva Garg, Steven Holtzen, Guy Van den Broeck, Todd Millstein

    Abstract: Probabilistic programming languages (PPLs) are expressive means for creating and reasoning about probabilistic models. Unfortunately hybrid probabilistic programs, involving both continuous and discrete structures, are not well supported by today's PPLs. In this paper we develop a new approximate inference algorithm for hybrid probabilistic programs that first discretizes the continuous distributi… ▽ More

    Submitted 29 April, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

    ACM Class: G.3

  11. arXiv:2312.03905  [pdf, other

    cs.LG cs.AI cs.CL

    A Pseudo-Semantic Loss for Autoregressive Models with Logical Constraints

    Authors: Kareem Ahmed, Kai-Wei Chang, Guy Van den Broeck

    Abstract: Neuro-symbolic AI bridges the gap between purely symbolic and neural approaches to learning. This often requires maximizing the likelihood of a symbolic constraint w.r.t the neural network's output distribution. Such output distributions are typically assumed to be fully-factorized. This limits the applicability of neuro-symbolic learning to the more expressive autoregressive distributions, e.g.,… ▽ More

    Submitted 26 January, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Updated detoxification experiments; moved example toxic generations to Github and added link

  12. arXiv:2311.13718  [pdf, other

    cs.LG cs.AI

    A Unified Approach to Count-Based Weakly-Supervised Learning

    Authors: Vinay Shukla, Zhe Zeng, Kareem Ahmed, Guy Van den Broeck

    Abstract: High-quality labels are often very scarce, whereas unlabeled data with inferred weak labels occurs more naturally. In many cases, these weak labels dictate the frequency of each respective class over a set of instances. In this paper, we develop a unified approach to learning from such weakly-labeled data, which we call count-based weakly-supervised learning. At the heart of our approach is the ab… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  13. arXiv:2311.00094  [pdf, other

    cs.LG cs.AI

    A Tractable Inference Perspective of Offline RL

    Authors: Xuejie Liu, Anji Liu, Guy Van den Broeck, Yitao Liang

    Abstract: A popular paradigm for offline Reinforcement Learning (RL) tasks is to first fit the offline trajectories to a sequence model, and then prompt the model for actions that lead to high expected return. In addition to obtaining accurate sequence models, this paper highlights that tractability, the ability to exactly and efficiently answer various probabilistic queries, plays an important role in offl… ▽ More

    Submitted 25 May, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

  14. arXiv:2310.06100  [pdf, other

    cs.AI cs.LG stat.ML

    High Dimensional Causal Inference with Variational Backdoor Adjustment

    Authors: Daniel Israel, Aditya Grover, Guy Van den Broeck

    Abstract: Backdoor adjustment is a technique in causal inference for estimating interventional quantities from purely observational data. For example, in medical settings, backdoor adjustment can be used to control for confounding and estimate the effectiveness of a treatment. However, high dimensional treatments and confounders pose a series of potential pitfalls: tractability, identifiability, optimizatio… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

  15. arXiv:2310.02156  [pdf, other

    cs.LG cs.NE

    Probabilistically Rewired Message-Passing Neural Networks

    Authors: Chendi Qian, Andrei Manolache, Kareem Ahmed, Zhe Zeng, Guy Van den Broeck, Mathias Niepert, Christopher Morris

    Abstract: Message-passing graph neural networks (MPNNs) emerged as powerful tools for processing graph-structured input. However, they operate on a fixed input graph structure, ignoring potential noise and missing information. Furthermore, their local aggregation mechanism can lead to problems such as over-squashing and limited expressive power in capturing relevant graph structures. Existing solutions to t… ▽ More

    Submitted 26 March, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  16. arXiv:2307.13837  [pdf, other

    cs.AI cs.PL

    Scaling Integer Arithmetic in Probabilistic Programs

    Authors: William X. Cao, Poorva Garg, Ryan Tjoa, Steven Holtzen, Todd Millstein, Guy Van den Broeck

    Abstract: Distributions on integers are ubiquitous in probabilistic modeling but remain challenging for many of today's probabilistic programming languages (PPLs). The core challenge comes from discrete structure: many of today's PPL inference strategies rely on enumeration, sampling, or differentiation in order to scale, which fail for high-dimensional complex discrete distributions involving integers. Our… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted to UAI 2023

  17. arXiv:2306.09686  [pdf, other

    cs.LG cs.AI stat.ML

    Collapsed Inference for Bayesian Deep Learning

    Authors: Zhe Zeng, Guy Van den Broeck

    Abstract: Bayesian neural networks (BNNs) provide a formalism to quantify and calibrate uncertainty in deep learning. Current inference approaches for BNNs often resort to few-sample estimation for scalability, which can harm predictive performance, while its alternatives tend to be computationally prohibitively expensive. We tackle this challenge by revealing a previously unseen connection between inferenc… ▽ More

    Submitted 12 February, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

  18. arXiv:2304.07438  [pdf, other

    cs.CL cs.AI

    Tractable Control for Autoregressive Language Generation

    Authors: Honghua Zhang, Meihua Dang, Nanyun Peng, Guy Van den Broeck

    Abstract: Despite the success of autoregressive large language models in text generation, it remains a major challenge to generate text that satisfies complex constraints: sampling from the conditional distribution ${\Pr}(\text{text} | α)$ is intractable for even the simplest lexical constraints $α$. To overcome this challenge, we propose to use tractable probabilistic models (TPMs) to impose lexical constr… ▽ More

    Submitted 15 November, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

  19. arXiv:2302.14207  [pdf, other

    cs.LG cs.AI

    Semantic Strengthening of Neuro-Symbolic Learning

    Authors: Kareem Ahmed, Kai-Wei Chang, Guy Van den Broeck

    Abstract: Numerous neuro-symbolic approaches have recently been proposed typically with the goal of adding symbolic knowledge to the output layer of a neural network. Ideally, such losses maximize the probability that the neural network's predictions satisfy the underlying domain. Unfortunately, this type of probabilistic inference is often computationally infeasible. Neuro-symbolic approaches therefore com… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: Accepted at AISTATS 2023

  20. arXiv:2302.14202  [pdf, other

    cs.LG

    Mixtures of All Trees

    Authors: Nikil Roashan Selvam, Honghua Zhang, Guy Van den Broeck

    Abstract: Tree-shaped graphical models are widely used for their tractability. However, they unfortunately lack expressive power as they require committing to a particular sparse dependency structure. We propose a novel class of generative models called mixtures of all trees: that is, a mixture over all possible ($n^{n-2}$) tree-shaped graphical models over $n$ variables. We show that it is possible to para… ▽ More

    Submitted 29 March, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: Accepted to AISTATS 2023

  21. arXiv:2302.08086  [pdf, other

    cs.LG cs.AI

    Understanding the Distillation Process from Deep Generative Models to Tractable Probabilistic Circuits

    Authors: Xuejie Liu, Anji Liu, Guy Van den Broeck, Yitao Liang

    Abstract: Probabilistic Circuits (PCs) are a general and unified computational framework for tractable probabilistic models that support efficient computation of various inference tasks (e.g., computing marginal probabilities). Towards enabling such reasoning capabilities in complex real-world tasks, Liu et al. (2022) propose to distill knowledge (through latent variable assignments) from less tractable but… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

  22. arXiv:2212.02474  [pdf, other

    cs.LG cs.AI cs.CY

    Certifying Fairness of Probabilistic Circuits

    Authors: Nikil Roashan Selvam, Guy Van den Broeck, YooJung Choi

    Abstract: With the increased use of machine learning systems for decision making, questions about the fairness properties of such systems start to take center stage. Most existing work on algorithmic fairness assume complete observation of features at prediction time, as is the case for popular notions like statistical parity and equal opportunity. However, this is not sufficient for models that can make pr… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: Accepted to AAAI23

  23. arXiv:2211.12551  [pdf, other

    cs.LG cs.AI

    Sparse Probabilistic Circuits via Pruning and Growing

    Authors: Meihua Dang, Anji Liu, Guy Van den Broeck

    Abstract: Probabilistic circuits (PCs) are a tractable representation of probability distributions allowing for exact and efficient computation of likelihoods and marginals. There has been significant recent progress on improving the scale and expressiveness of PCs. However, PC training performance plateaus as model size increases. We discover that most capacity in existing large PC structures is wasted: fu… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  24. arXiv:2210.04398  [pdf, other

    cs.LG cs.AI

    Scaling Up Probabilistic Circuits by Latent Variable Distillation

    Authors: Anji Liu, Honghua Zhang, Guy Van den Broeck

    Abstract: Probabilistic Circuits (PCs) are a unified framework for tractable probabilistic models that support efficient computation of various probabilistic queries (e.g., marginal probabilities). One key challenge is to scale PCs to model large and high-dimensional real-world datasets: we observe that as the number of parameters in PCs increases, their performance immediately plateaus. This phenomenon sug… ▽ More

    Submitted 9 October, 2022; originally announced October 2022.

  25. arXiv:2210.01941  [pdf, other

    cs.LG cs.AI

    SIMPLE: A Gradient Estimator for $k$-Subset Sampling

    Authors: Kareem Ahmed, Zhe Zeng, Mathias Niepert, Guy Van den Broeck

    Abstract: $k$-subset sampling is ubiquitous in machine learning, enabling regularization and interpretability through sparsity. The challenge lies in rendering $k$-subset sampling amenable to end-to-end learning. This has typically involved relaxing the reparameterized samples to allow for backpropagation, with the risk of introducing high bias and high variance. In this work, we fall back to discrete $k… ▽ More

    Submitted 6 June, 2024; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: ICLR 2023; fixed typo in Theorem 1

  26. arXiv:2206.00426  [pdf, other

    cs.LG cs.AI

    Semantic Probabilistic Layers for Neuro-Symbolic Learning

    Authors: Kareem Ahmed, Stefano Teso, Kai-Wei Chang, Guy Van den Broeck, Antonio Vergari

    Abstract: We design a predictive layer for structured-output prediction (SOP) that can be plugged into any neural network guaranteeing its predictions are consistent with a set of predefined symbolic constraints. Our Semantic Probabilistic Layer (SPL) can model intricate correlations, and hard constraints, over a structured output space all while being amenable to end-to-end learning via maximum likelihood.… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

  27. arXiv:2205.11502  [pdf, other

    cs.CL cs.AI

    On the Paradox of Learning to Reason from Data

    Authors: Honghua Zhang, Liunian Harold Li, Tao Meng, Kai-Wei Chang, Guy Van den Broeck

    Abstract: Logical reasoning is needed in a wide range of NLP tasks. Can a BERT model be trained end-to-end to solve logical reasoning problems presented in natural language? We attempt to answer this question in a confined problem space where there exists a set of parameters that perfectly simulates logical reasoning. We make observations that seem to contradict each other: BERT attains near-perfect accurac… ▽ More

    Submitted 24 May, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: Table 1 & 2 numbers were out-dated in v1; we have updated them; the observations and conclusions remain unchanged

  28. arXiv:2201.11250  [pdf, other

    cs.LG cs.AI cs.LO stat.ML

    Neuro-Symbolic Entropy Regularization

    Authors: Kareem Ahmed, Eric Wang, Kai-Wei Chang, Guy Van den Broeck

    Abstract: In structured prediction, the goal is to jointly predict many output variables that together encode a structured object -- a path in a graph, an entity-relation triple, or an ordering of objects. Such a large output space makes learning hard and requires vast amounts of labeled data. Different approaches leverage alternate sources of supervision. One approach -- entropy regularization -- posits th… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

  29. arXiv:2111.11632  [pdf, other

    cs.LG cs.IT

    Lossless Compression with Probabilistic Circuits

    Authors: Anji Liu, Stephan Mandt, Guy Van den Broeck

    Abstract: Despite extensive progress on image generation, common deep generative model architectures are not easily applied to lossless compression. For example, VAEs suffer from a compression cost overhead due to their latent variables. This overhead can only be partially eliminated with elaborate schemes such as bits-back coding, often resulting in poor single-sample compression rates. To overcome such pr… ▽ More

    Submitted 16 March, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

  30. arXiv:2111.04833  [pdf, other

    cs.AI cs.LG

    Solving Marginal MAP Exactly by Probabilistic Circuit Transformations

    Authors: YooJung Choi, Tal Friedman, Guy Van den Broeck

    Abstract: Probabilistic circuits (PCs) are a class of tractable probabilistic models that allow efficient, often linear-time, inference of queries such as marginals and most probable explanations (MPE). However, marginal MAP, which is central to many decision-making problems, remains a hard query for PCs unless they satisfy highly restrictive structural constraints. In this paper, we develop a pruning algor… ▽ More

    Submitted 3 March, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

  31. arXiv:2110.10284  [pdf, other

    cs.AI

    flip-hoisting: Exploiting Repeated Parameters in Discrete Probabilistic Programs

    Authors: Ellie Y. Cheng, Todd Millstein, Guy Van den Broeck, Steven Holtzen

    Abstract: Many of today's probabilistic programming languages (PPLs) have brittle inference performance: the performance of the underlying inference algorithm is very sensitive to the precise way in which the probabilistic program is written. A standard way of addressing this challenge in traditional programming languages is via program optimizations, which seek to unburden the programmer from writing low-l… ▽ More

    Submitted 20 February, 2023; v1 submitted 19 October, 2021; originally announced October 2021.

  32. arXiv:2107.07713  [pdf, other

    cs.LG

    Towards an Interpretable Latent Space in Structured Models for Video Prediction

    Authors: Rushil Gupta, Vishal Sharma, Yash Jain, Yitao Liang, Guy Van den Broeck, Parag Singla

    Abstract: We focus on the task of future frame prediction in video governed by underlying physical dynamics. We work with models which are object-centric, i.e., explicitly work with object representations, and propagate a loss in the latent space. Specifically, our research builds on recent work by Kipf et al. \cite{kipf&al20}, which predicts the next state via contrastive learning of object interactions in… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: Accepted at Weakly Supervised Representation Learning Workshop at IJCAI 2021

  33. arXiv:2106.02264  [pdf, other

    cs.LG

    Tractable Regularization of Probabilistic Circuits

    Authors: Anji Liu, Guy Van den Broeck

    Abstract: Probabilistic Circuits (PCs) are a promising avenue for probabilistic modeling. They combine advantages of probabilistic graphical models (PGMs) with those of neural networks (NNs). Crucially, however, they are tractable probabilistic models, supporting efficient and exact computation of many probabilistic inference queries, such as marginals and MAP. Further, since PCs are structured computation… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

  34. arXiv:2105.12326  [pdf, other

    cs.LO

    Model Checking Finite-Horizon Markov Chains with Probabilistic Inference

    Authors: Steven Holtzen, Sebastian Junges, Marcell Vazquez-Chanlatte, Todd Millstein, Sanjit A. Seshia, Guy Van Den Broeck

    Abstract: We revisit the symbolic verification of Markov chains with respect to finite horizon reachability properties. The prevalent approach iteratively computes step-bounded state reachability probabilities. By contrast, recent advances in probabilistic inference suggest symbolically representing all horizon-length paths through the Markov chain. We ask whether this perspective advances the state-of-the-… ▽ More

    Submitted 30 June, 2021; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: Technical Report. Accepted at CAV 2021

  35. arXiv:2105.10118  [pdf, other

    cs.LG cs.AI

    Probabilistic Sufficient Explanations

    Authors: Eric Wang, Pasha Khosravi, Guy Van den Broeck

    Abstract: Understanding the behavior of learned classifiers is an important task, and various black-box explanations, logical reasoning approaches, and model-specific methods have been proposed. In this paper, we introduce probabilistic sufficient explanations, which formulate explaining an instance of classification as choosing the "simplest" subset of features such that only observing those features is "s… ▽ More

    Submitted 21 May, 2021; originally announced May 2021.

  36. arXiv:2103.17226  [pdf, other

    quant-ph cs.ET

    Logical Abstractions for Noisy Variational Quantum Algorithm Simulation

    Authors: Yipeng Huang, Steven Holtzen, Todd Millstein, Guy Van den Broeck, Margaret Martonosi

    Abstract: Due to the unreliability and limited capacity of existing quantum computer prototypes, quantum circuit simulation continues to be a vital tool for validating next generation quantum computers and for studying variational quantum algorithms, which are among the leading candidates for useful quantum computation. Existing quantum circuit simulators do not address the common traits of variational algo… ▽ More

    Submitted 31 March, 2021; originally announced March 2021.

    Comments: ASPLOS '21, April 19-23, 2021, Virtual, USA

  37. arXiv:2103.11062  [pdf, other

    cs.LG cs.CL

    Leveraging Unlabeled Data for Entity-Relation Extraction through Probabilistic Constraint Satisfaction

    Authors: Kareem Ahmed, Eric Wang, Guy Van den Broeck, Kai-Wei Chang

    Abstract: We study the problem of entity-relation extraction in the presence of symbolic domain knowledge. Such knowledge takes the form of an ontology defining relations and their permissible arguments. Previous approaches set out to integrate such knowledge in their learning approaches either through self-training, or through approximations that lose the precise meaning of the logical expressions. By cont… ▽ More

    Submitted 19 March, 2021; originally announced March 2021.

    Comments: 8 pages, 2 figures

  38. arXiv:2102.10562  [pdf, other

    cs.LG cs.AI

    Tractable Computation of Expected Kernels

    Authors: Wenzhe Li, Zhe Zeng, Antonio Vergari, Guy Van den Broeck

    Abstract: Computing the expectation of kernel functions is a ubiquitous task in machine learning, with applications from classical support vector machines to exploiting kernel embeddings of distributions in probabilistic modeling, statistical inference, causal discovery, and deep learning. In all these scenarios, we tend to resort to Monte Carlo estimates as expectations of kernels are intractable in genera… ▽ More

    Submitted 22 July, 2021; v1 submitted 21 February, 2021; originally announced February 2021.

  39. arXiv:2102.09768  [pdf, other

    cs.AI cs.LG

    Probabilistic Generating Circuits

    Authors: Honghua Zhang, Brendan Juba, Guy Van den Broeck

    Abstract: Generating functions, which are widely used in combinatorics and probability theory, encode function values into the coefficients of a polynomial. In this paper, we explore their use as a tractable probabilistic model, and propose probabilistic generating circuits (PGCs) for their efficient representation. PGCs are strictly more expressive efficient than many existing tractable probabilistic model… ▽ More

    Submitted 11 June, 2021; v1 submitted 19 February, 2021; originally announced February 2021.

  40. arXiv:2102.06137  [pdf, other

    stat.ML cs.AI cs.DS cs.LG

    A Compositional Atlas of Tractable Circuit Operations: From Simple Transformations to Complex Information-Theoretic Queries

    Authors: Antonio Vergari, YooJung Choi, Anji Liu, Stefano Teso, Guy Van den Broeck

    Abstract: Circuit representations are becoming the lingua franca to express and reason about tractable generative and discriminative models. In this paper, we show how complex inference scenarios for these models that commonly arise in machine learning -- from computing the expectations of decision tree ensembles to information-theoretic divergences of deep mixture models -- can be represented in terms of t… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

    ACM Class: G.3; I.2.4; I.2.6

  41. arXiv:2009.09031  [pdf, other

    cs.LG cs.AI stat.ML

    Group Fairness by Probabilistic Modeling with Latent Fair Decisions

    Authors: YooJung Choi, Meihua Dang, Guy Van den Broeck

    Abstract: Machine learning systems are increasingly being used to make impactful decisions such as loan applications and criminal justice risk assessments, and as such, ensuring fairness of these systems is critical. This is often challenging as the labels in the data are biased. This paper studies learning fair probability distributions from biased data by explicitly modeling a latent variable that represe… ▽ More

    Submitted 16 December, 2020; v1 submitted 18 September, 2020; originally announced September 2020.

  42. arXiv:2009.08634  [pdf, ps, other

    cs.AI cs.CC cs.LG

    On the Tractability of SHAP Explanations

    Authors: Guy Van den Broeck, Anton Lykov, Maximilian Schleich, Dan Suciu

    Abstract: SHAP explanations are a popular feature-attribution mechanism for explainable AI. They use game-theoretic notions to measure the influence of individual features on the prediction of a machine learning model. Despite a lot of recent interest from both academia and industry, it is not known whether SHAP explanations of common machine learning models can be computed efficiently. In this paper, we es… ▽ More

    Submitted 30 January, 2021; v1 submitted 18 September, 2020; originally announced September 2020.

    Comments: Proceedings of the 35th AAAI Conference on Artificial Intelligence

  43. arXiv:2007.09331  [pdf, other

    cs.LG cs.AI

    Strudel: Learning Structured-Decomposable Probabilistic Circuits

    Authors: Meihua Dang, Antonio Vergari, Guy Van den Broeck

    Abstract: Probabilistic circuits (PCs) represent a probability distribution as a computational graph. Enforcing structural properties on these graphs guarantees that several inference scenarios become tractable. Among these properties, structured decomposability is a particularly appealing one: it enables the efficient and exact computations of the probability of complex logical formulas, and can be used to… ▽ More

    Submitted 2 September, 2020; v1 submitted 18 July, 2020; originally announced July 2020.

    Comments: 12 pages, 3 figures, to be published on PGM2020 (The 10th International Conference on Probabilistic Graphical Models)

    ACM Class: I.2.6

  44. arXiv:2006.16341  [pdf, other

    cs.LG cs.AI stat.ML

    Handling Missing Data in Decision Trees: A Probabilistic Approach

    Authors: Pasha Khosravi, Antonio Vergari, YooJung Choi, Yitao Liang, Guy Van den Broeck

    Abstract: Decision trees are a popular family of models due to their attractive properties such as interpretability and ability to handle heterogeneous data. Concurrently, missing data is a prevalent occurrence that hinders performance of machine learning models. As such, handling missing data in decision trees is a well studied problem. In this paper, we tackle this problem by taking a probabilistic approa… ▽ More

    Submitted 29 June, 2020; originally announced June 2020.

  45. arXiv:2006.15233  [pdf, other

    cs.AI cs.LG

    On the Relationship Between Probabilistic Circuits and Determinantal Point Processes

    Authors: Honghua Zhang, Steven Holtzen, Guy Van den Broeck

    Abstract: Scaling probabilistic models to large realistic problems and datasets is a key challenge in machine learning. Central to this effort is the development of tractable probabilistic models (TPMs): models whose structure guarantees efficient probabilistic inference algorithms. The current landscape of TPMs is fragmented: there exist various kinds of TPMs with different strengths and weaknesses. Two of… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

  46. arXiv:2006.08852  [pdf, ps, other

    cs.LG stat.ML

    Counterexample-Guided Learning of Monotonic Neural Networks

    Authors: Aishwarya Sivaraman, Golnoosh Farnadi, Todd Millstein, Guy Van den Broeck

    Abstract: The widespread adoption of deep learning is often attributed to its automatic feature construction with minimal inductive bias. However, in many real-world tasks, the learned function is intended to satisfy domain-specific constraints. We focus on monotonicity constraints, which are common and require that the function's output increases with increasing values of specific input features. We develo… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

  47. arXiv:2006.08785  [pdf, other

    cs.LG cs.AI stat.ML

    On Effective Parallelization of Monte Carlo Tree Search

    Authors: Anji Liu, Yitao Liang, Ji Liu, Guy Van den Broeck, Jianshu Chen

    Abstract: Despite its groundbreaking success in Go and computer games, Monte Carlo Tree Search (MCTS) is computationally expensive as it requires a substantial number of rollouts to construct the search tree, which calls for effective parallelization. However, how to design effective parallel MCTS algorithms has not been systematically studied and remains poorly understood. In this paper, we seek to lay its… ▽ More

    Submitted 4 October, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

  48. Scaling Exact Inference for Discrete Probabilistic Programs

    Authors: Steven Holtzen, Guy Van den Broeck, Todd Millstein

    Abstract: Probabilistic programming languages (PPLs) are an expressive means of representing and reasoning about probabilistic models. The computational challenge of probabilistic inference remains the primary roadblock for applying PPLs in practice. Inference is fundamentally hard, so there is no one-size-fits all solution. In this work, we target scalable inference for an important class of probabilistic… ▽ More

    Submitted 16 October, 2020; v1 submitted 18 May, 2020; originally announced May 2020.

    Journal ref: Proc. ACM Program. Lang. 4, OOPSLA (2020)

  49. arXiv:2004.06231  [pdf, other

    cs.LG stat.ML

    Einsum Networks: Fast and Scalable Learning of Tractable Probabilistic Circuits

    Authors: Robert Peharz, Steven Lang, Antonio Vergari, Karl Stelzner, Alejandro Molina, Martin Trapp, Guy Van den Broeck, Kristian Kersting, Zoubin Ghahramani

    Abstract: Probabilistic circuits (PCs) are a promising avenue for probabilistic modeling, as they permit a wide range of exact and efficient inference routines. Recent ``deep-learning-style'' implementations of PCs strive for a better scalability, but are still difficult to train on real-world data, due to their sparsely connected computational graphs. In this paper, we propose Einsum Networks (EiNets), a n… ▽ More

    Submitted 13 April, 2020; originally announced April 2020.

  50. arXiv:2003.00126  [pdf, other

    cs.AI

    Scaling up Hybrid Probabilistic Inference with Logical and Arithmetic Constraints via Message Passing

    Authors: Zhe Zeng, Paolo Morettin, Fanqi Yan, Antonio Vergari, Guy Van den Broeck

    Abstract: Weighted model integration (WMI) is a very appealing framework for probabilistic inference: it allows to express the complex dependencies of real-world problems where variables are both continuous and discrete, via the language of Satisfiability Modulo Theories (SMT), as well as to compute probabilistic queries with complex logical and arithmetic constraints. Yet, existing WMI solvers are not read… ▽ More

    Submitted 19 August, 2020; v1 submitted 28 February, 2020; originally announced March 2020.