Showing 1–50 of 51 results for author: Heskes, T

  1. arXiv:2406.02293  [pdf, other]

    stat.ML cs.LG

    Composite Quantile Regression With XGBoost Using the Novel Arctan Pinball Loss

    Authors: Laurens Sluijterman, Frank Kreuwel, Eric Cator, Tom Heskes

    Abstract: This paper explores the use of XGBoost for composite quantile regression. XGBoost is a highly popular model renowned for its flexibility, efficiency, and capability to deal with missing data. The optimization uses a second order approximation of the loss function, complicating the use of loss functions with a zero or vanishing second derivative. Quantile regression -- a popular approach to obtain…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 24 pages, 9 figures

  2. arXiv:2405.16164  [pdf, other]

    cs.LG cs.AI stat.AP stat.ML

    Acquiring Better Load Estimates by Combining Anomaly and Change-point Detection in Power Grid Time-series Measurements

    Authors: Roel Bouman, Linda Schmeitz, Luco Buise, Jacco Heres, Yuliya Shapovalova, Tom Heskes

    Abstract: In this paper we present a novel methodology for automatic anomaly and switch event filtering to improve load estimation in power grid systems. By leveraging unsupervised methods with supervised optimization, our approach prioritizes interpretability while ensuring robust and generalizable performance on unseen data. Through experimentation, a combination of binary segmentation for change point dete…

    Submitted 28 May, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

    Comments: All code can be found at: https://github.com/RoelBouman/StormPhase2

  3. arXiv:2403.02443  [pdf, other]

    astro-ph.IM

    Tuning neural posterior estimation for gravitational wave inference

    Authors: Alex Kolmus, Justin Janquart, Tomasz Baka, Twan van Laarhoven, Chris Van Den Broeck, Tom Heskes

    Abstract: Modern simulation-based inference techniques use neural networks to solve inverse problems efficiently. One notable strategy is neural posterior estimation (NPE), wherein a neural network parameterizes a distribution to approximate the posterior. This approach is particularly advantageous for tackling low-latency or high-volume inverse problems. However, the accuracy of NPE varies significantly wi…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 28 pages, 9 figures

  4. arXiv:2402.16073  [pdf, other]

    cs.IR cs.AI cs.LG

    Pfeed: Generating near real-time personalized feeds using precomputed embedding similarities

    Authors: Binyam Gebre, Karoliina Ranta, Stef van den Elzen, Ernst Kuiper, Thijs Baars, Tom Heskes

    Abstract: In personalized recommender systems, embeddings are often used to encode customer actions and items, and retrieval is then performed in the embedding space using approximate nearest neighbor search. However, this approach can lead to two challenges: 1) user embeddings can restrict the diversity of interests captured and 2) the need to keep them up-to-date requires an expensive, real-time infrastru…

    Submitted 6 March, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: 9 pages, 8 figures

    ACM Class: H.3.3

  5. Fault Trees, Decision Trees, And Binary Decision Diagrams: A Systematic Comparison

    Authors: L. A. Jimenez-Roa, T. Heskes, M. Stoelinga

    Abstract: In reliability engineering, we need to understand system dependencies, cause-effect relations, identify critical components, and analyze how they trigger failures. Three prominent graph models commonly used for these purposes are fault trees (FTs), decision trees (DTs), and binary decision diagrams (BDDs). These models are popular because they are easy to interpret, serve as a communication tool b…

    Submitted 3 October, 2023; originally announced October 2023.

  6. Deterioration modeling of sewer pipes via discrete-time Markov chains: A large-scale case study in the Netherlands

    Authors: L. A. Jimenez-Roa, T. Heskes, T. Tinga, H. Molegraaf, M. Stoelinga

    Abstract: Sewer pipe network systems are an important part of civil infrastructure, and in order to find a good trade-off between maintenance costs and system performance, reliable sewer pipe degradation models are essential. In this paper, we present a large-scale case study in the city of Breda in the Netherlands. Our dataset has information on sewer pipes built since the 1920s and contains information on…

    Submitted 3 October, 2023; originally announced October 2023.

  7. arXiv:2310.01181  [pdf, other]

    cs.LG cs.AI

    Graph Isomorphic Networks for Assessing Reliability of the Medium-Voltage Grid

    Authors: Charlotte Cambier van Nooten, Tom van de Poll, Sonja Füllhase, Jacco Heres, Tom Heskes, Yuliya Shapovalova

    Abstract: Ensuring electricity grid reliability becomes increasingly challenging with the shift towards renewable energy and declining conventional capacities. Distribution System Operators (DSOs) aim to achieve grid reliability by verifying the n-1 principle, ensuring continuous operation in case of component failure. Electricity networks' complex graph-based data holds crucial information for n-1 assessme…

    Submitted 3 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Under review

  8. arXiv:2308.02221  [pdf, other]

    stat.ML cs.LG

    Likelihood-ratio-based confidence intervals for neural networks

    Authors: Laurens Sluijterman, Eric Cator, Tom Heskes

    Abstract: This paper introduces a first implementation of a novel likelihood-ratio-based approach for constructing confidence intervals for neural networks. Our method, called DeepLR, offers several qualitative advantages: most notably, the ability to construct asymmetric intervals that expand in regions with a limited amount of data, and the inherent incorporation of factors such as the amount of training…

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: 28 pages, 9 figures

  9. arXiv:2305.00735  [pdf, other]

    cs.LG cs.AI

    Unsupervised anomaly detection algorithms on real-world data: how many do we need?

    Authors: Roel Bouman, Zaharah Bukhsh, Tom Heskes

    Abstract: In this study we evaluate 32 unsupervised anomaly detection algorithms on 52 real-world multivariate tabular datasets, performing the largest comparison of unsupervised anomaly detection algorithms to date. On this collection of datasets, the $k$-thNN (distance to the $k$-th nearest neighbor) algorithm significantly outperforms most other algorithms. Visualizing and then clustering the relative p…

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: The associated Git repository can be found at: https://github.com/RoelBouman/outlierdetection

    Journal ref: Journal of Machine Learning Research 25.105 (2024): 1-34

  10. arXiv:2302.08875  [pdf, other]

    stat.ML cs.LG

    Optimal Training of Mean Variance Estimation Neural Networks

    Authors: Laurens Sluijterman, Eric Cator, Tom Heskes

    Abstract: This paper focusses on the optimal implementation of a Mean Variance Estimation network (MVE network) (Nix and Weigend, 1994). This type of network is often used as a building block for uncertainty estimation methods in a regression setting, for instance Concrete dropout (Gal et al., 2017) and Deep Ensembles (Lakshminarayanan et al., 2017). Specifically, an MVE network assumes that the data is pro…

    Submitted 3 August, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: 11 pages, 7 figures

  11. arXiv:2210.03590  [pdf, other]

    cs.LG cs.AI cs.LO

    Machine Learning Meets The Herbrand Universe

    Authors: Jelle Piepenbrock, Josef Urban, Konstantin Korovin, Miroslav Olšák, Tom Heskes, Mikolaš Janota

    Abstract: The appearance of strong CDCL-based propositional (SAT) solvers has greatly advanced several areas of automated reasoning (AR). One of the directions in AR is thus to apply SAT solvers to expressive formalisms such as first-order logic, for which large corpora of general mathematical problems exist today. This is possible due to Herbrand's theorem, which allows reduction of first-order problems to…

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: 8 pages, 10 figures

  12. arXiv:2204.03743  [pdf, other]

    cs.NE

    Automatic inference of fault tree models via multi-objective evolutionary algorithms

    Authors: Lisandro A. Jimenez-Roa, Tom Heskes, Tiedo Tinga, Marielle Stoelinga

    Abstract: Fault tree analysis is a well-known technique in reliability engineering and risk assessment, which supports decision-making processes and the management of complex systems. Traditionally, fault tree (FT) models are built manually together with domain experts, considered a time-consuming process prone to human errors. With Industry 4.0, there is an increasing availability of inspection and monitor…

    Submitted 6 April, 2022; originally announced April 2022.

  13. arXiv:2202.10903  [pdf, other]

    stat.ML cs.LG

    Confident Neural Network Regression with Bootstrapped Deep Ensembles

    Authors: Laurens Sluijterman, Eric Cator, Tom Heskes

    Abstract: With the rise of the popularity and usage of neural networks, trustworthy uncertainty estimation is becoming increasingly essential. One of the most prominent uncertainty estimation methods is Deep Ensembles (Lakshminarayanan et al., 2017). A classical parametric model has uncertainty in the parameters due to the fact that the data on which the model is built is a random sample. A modern neural n…

    Submitted 3 August, 2023; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: 20 pages, 11 figures

    MSC Class: 62F40

  14. arXiv:2111.13244  [pdf, other]

    cs.CV cs.CR

    Going Grayscale: The Road to Understanding and Improving Unlearnable Examples

    Authors: Zhuoran Liu, Zhengyu Zhao, Alex Kolmus, Tijn Berns, Twan van Laarhoven, Tom Heskes, Martha Larson

    Abstract: Recent work has shown that imperceptible perturbations can be applied to craft unlearnable examples (ULEs), i.e. images whose content cannot be used to improve a classifier during training. In this paper, we reveal the road that researchers should follow for understanding ULEs and improving ULEs as they were originally formulated (ULEOs). The paper makes four contributions. First, we show that ULE…

    Submitted 25 November, 2021; originally announced November 2021.

  15. arXiv:2111.00833  [pdf, other]

    gr-qc astro-ph.IM cs.LG

    Swift sky localization of gravitational waves using deep learning seeded importance sampling

    Authors: Alex Kolmus, Grégory Baltus, Justin Janquart, Twan van Laarhoven, Sarah Caudill, Tom Heskes

    Abstract: Fast, highly accurate, and reliable inference of the sky origin of gravitational waves would enable real-time multi-messenger astronomy. Current Bayesian inference methodologies, although highly accurate and reliable, are slow. Deep learning models have shown themselves to be accurate and extremely fast for inference tasks on gravitational waves, but their output is inherently questionable due to…

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: 12 pages, 9 figures, 1 table

  16. How to Evaluate Uncertainty Estimates in Machine Learning for Regression?

    Authors: Laurens Sluijterman, Eric Cator, Tom Heskes

    Abstract: As neural networks become more popular, the need for accompanying uncertainty estimates increases. There are currently two main approaches to test the quality of these estimates. Most methods output a density. They can be compared by evaluating their loglikelihood on a test set. Other methods output a prediction interval directly. These methods are often tested by examining the fraction of test po…

    Submitted 3 August, 2023; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: 14 pages, 10 figures

  17. arXiv:2102.05547  [pdf, other]

    cs.LG cs.AI cs.LO

    Learning Equational Theorem Proving

    Authors: Jelle Piepenbrock, Tom Heskes, Mikoláš Janota, Josef Urban

    Abstract: We develop Stratified Shortest Solution Imitation Learning (3SIL) to learn equational theorem proving in a deep reinforcement learning (RL) setting. The self-trained models achieve state-of-the-art performance in proving problems generated by one of the top open conjectures in quasigroup theory, the Abelian Inner Mapping (AIM) Conjecture. To develop the methods, we first use two simpler arithmetic…

    Submitted 10 February, 2021; originally announced February 2021.

    Comments: 17 pages, 4 figures

  18. arXiv:2012.13195  [pdf, other]

    cs.IT nlin.AO physics.soc-ph

    Spectral Ranking of Causal Influence in Complex Systems

    Authors: Errol Zalmijn, Tom Heskes, Tom Claassen

    Abstract: Like natural complex systems such as the Earth's climate or a living cell, semiconductor lithography systems are characterized by nonlinear dynamics across more than a dozen orders of magnitude in space and time. Thousands of sensors measure relevant process variables at appropriate sampling rates, to provide time series as primary sources for system diagnostics. However, high-dimensionality, non-…

    Submitted 24 December, 2020; originally announced December 2020.

    Comments: 5 pages, 4 figures

  19. arXiv:2012.10167  [pdf, ps, other]

    stat.ME q-bio.GN q-bio.QM stat.AP stat.ML

    Inferring the Direction of a Causal Link and Estimating Its Effect via a Bayesian Mendelian Randomization Approach

    Authors: Ioan Gabriel Bucur, Tom Claassen, Tom Heskes

    Abstract: The use of genetic variants as instrumental variables - an approach known as Mendelian randomization - is a popular epidemiological method for estimating the causal effect of an exposure (phenotype, biomarker, risk factor) on a disease or health-related outcome from observational data. Instrumental variables must satisfy strong, often untestable assumptions, which means that finding good genetic i…

    Submitted 18 December, 2020; originally announced December 2020.

    Comments: 26 pages, 22 figures, published in Statistical Methods in Medical Research

    Journal ref: Statistical Methods in Medical Research, Vol 29, Issue 4, 2020

  20. arXiv:2012.10141  [pdf, other]

    stat.ML cs.AI cs.LG q-bio.GN

    MASSIVE: Tractable and Robust Bayesian Learning of Many-Dimensional Instrumental Variable Models

    Authors: Ioan Gabriel Bucur, Tom Claassen, Tom Heskes

    Abstract: The recent availability of huge, many-dimensional data sets, like those arising from genome-wide association studies (GWAS), provides many opportunities for strengthening causal inference. One popular approach is to utilize these many-dimensional measurements as instrumental variables (instruments) for improving the causal effect estimate between other pairs of variables. Unfortunately, searching…

    Submitted 18 December, 2020; originally announced December 2020.

    Comments: 14 pages, 7 figures, Published in the Proceedings of the 36th Conference on Uncertainty in Artificial Intelligence (UAI)

    Journal ref: PMLR 124:1049-1058, 2020

  21. arXiv:2011.01625  [pdf, other]

    cs.AI cs.LG

    Causal Shapley Values: Exploiting Causal Knowledge to Explain Individual Predictions of Complex Models

    Authors: Tom Heskes, Evi Sijben, Ioan Gabriel Bucur, Tom Claassen

    Abstract: Shapley values underlie one of the most popular model-agnostic methods within explainable artificial intelligence. These values are designed to attribute the difference between a model's prediction and an average baseline to the different features used as input to the model. Being based on solid game-theoretic principles, Shapley values uniquely satisfy several desirable properties, which is why t…

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: Accepted at 34th Conference on Neural Information Processing Systems (NeurIPS 2020)

    ACM Class: I.2.6

  22. arXiv:2004.03047  [pdf, other]

    cs.HC eess.SP

    Probabilistic modelling of gait for robust passive monitoring in daily life

    Authors: Yordan P. Raykov, Luc J. W. Evers, Reham Badawy, Bastiaan Bloem, Tom M. Heskes, Marjan Meinders, Kasper Claes, Max A. Little

    Abstract: Passive monitoring in daily life may provide invaluable insights about a person's health throughout the day. Wearable sensor devices are likely to play a key role in enabling such monitoring in a non-obtrusive fashion. However, sensor data collected in daily life reflects multiple health and behavior related factors together. This creates the need for structured principled analysis to produce reli…

    Submitted 6 April, 2020; originally announced April 2020.

  23. arXiv:1909.03818  [pdf, other]

    stat.ML cs.LG q-bio.GN q-bio.MN q-bio.QM

    Large-Scale Local Causal Inference of Gene Regulatory Relationships

    Authors: Ioan Gabriel Bucur, Tom Claassen, Tom Heskes

    Abstract: Gene regulatory networks play a crucial role in controlling an organism's biological processes, which is why there is significant interest in developing computational methods that are able to extract their structure from high-throughput genetic data. Many of these computational methods are designed to infer individual regulatory relationships among genes from data on gene expression. We propose a…

    Submitted 10 September, 2019; v1 submitted 3 September, 2019; originally announced September 2019.

    Comments: 32 pages, 9 figures, 2 tables. This manuscript version has been accepted for publication in the International Journal of Approximate Reasoning. It incorporates reviewer comments and has a new title. This manuscript constitutes an extended version of a previous paper shared on arXiv (arXiv:1809.06827) that has been published in the proceedings of the PGM 2018 conference

  24. arXiv:1905.08628  [pdf, other]

    cs.LG astro-ph.IM hep-ex hep-ph hep-th

    Constraining the Parameters of High-Dimensional Models with Active Learning

    Authors: Sascha Caron, Tom Heskes, Sydney Otten, Bob Stienen

    Abstract: Constraining the parameters of physical models with $>5-10$ parameters is a widespread problem in fields like particle physics and astronomy. The generation of data to explore this parameter space often requires large amounts of computational resources. The commonly used solution of reducing the number of relevant physical parameters hampers the generality of the results. In this paper we show tha…

    Submitted 28 October, 2019; v1 submitted 19 May, 2019; originally announced May 2019.

  25. arXiv:1809.06827  [pdf, other]

    stat.ML cs.LG q-bio.MN

    A Bayesian Approach for Inferring Local Causal Structure in Gene Regulatory Networks

    Authors: Ioan Gabriel Bucur, Tom van Bussel, Tom Claassen, Tom Heskes

    Abstract: Gene regulatory networks play a crucial role in controlling an organism's biological processes, which is why there is significant interest in developing computational methods that are able to extract their structure from high-throughput genetic data. A typical approach consists of a series of conditional independence tests on the covariance structure meant to progressively reduce the space of poss…

    Submitted 18 September, 2018; originally announced September 2018.

    Comments: 12 pages, 4 figures, 3 tables

    Journal ref: PMLR 72 (2018) 37-48

  26. arXiv:1806.04610  [pdf, ps, other]

    stat.ML cs.LG

    A Novel Bayesian Approach for Latent Variable Modeling from Mixed Data with Missing Values

    Authors: Ruifei Cui, Ioan Gabriel Bucur, Perry Groot, Tom Heskes

    Abstract: We consider the problem of learning parameters of latent variable models from mixed (continuous and ordinal) data with missing values. We propose a novel Bayesian Gaussian copula factor (BGCF) approach that is consistent under certain conditions and that is quite robust to the violations of these conditions. In simulations, BGCF substantially outperforms two state-of-the-art alternative approaches…

    Submitted 12 June, 2018; originally announced June 2018.

  27. arXiv:1805.09527  [pdf, other]

    stat.ML cs.LG

    Stable specification search in structural equation model with latent variables

    Authors: Ridho Rahmadi, Perry Groot, Tom Heskes

    Abstract: In our previous study, we introduced stable specification search for cross-sectional data (S3C). It is an exploratory causal method that combines the stability selection concept and multi-objective optimization to search for stable and parsimonious causal structures across the entire range of model complexities. In this study, we extended S3C to S3C-Latent, to model causal relations between latent var…

    Submitted 24 May, 2018; originally announced May 2018.

  28. Convolutional Neural Networks for Transient Candidate Vetting in Large-Scale Surveys

    Authors: Fabian Gieseke, Steven Bloemen, Cas van den Bogaard, Tom Heskes, Jonas Kindler, Richard A. Scalzo, Valério A. R. M. Ribeiro, Jan van Roestel, Paul J. Groot, Fang Yuan, Anais Möller, Brad E. Tucker

    Abstract: Current synoptic sky surveys monitor large areas of the sky to find variable and transient astronomical sources. As the number of detections per night at a single telescope easily exceeds several thousand, current detection pipelines make intensive use of machine learning algorithms to classify the detected objects and to filter out the most interesting candidates. A number of upcoming surveys wil…

    Submitted 29 August, 2017; originally announced August 2017.

    Comments: 15 pages, 15 figures, 3 tables

    Journal ref: Monthly Notices of the Royal Astronomical Society, 2017

  29. arXiv:1704.01864  [pdf, other]

    stat.ML cs.AI stat.ME

    Robust Causal Estimation in the Large-Sample Limit without Strict Faithfulness

    Authors: Ioan Gabriel Bucur, Tom Claassen, Tom Heskes

    Abstract: Causal effect estimation from observational data is an important and much studied research topic. The instrumental variable (IV) and local causal discovery (LCD) patterns are canonical examples of settings where a closed-form expression exists for the causal effect of one variable on another, given the presence of a third variable. Both rely on faithfulness to infer that the latter only influences…

    Submitted 6 April, 2017; originally announced April 2017.

    Comments: 10 pages, 12 figures, Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS) 2017

    Journal ref: PMLR 54:1523-1531, 2017

  30. Deep Multi-scale Location-aware 3D Convolutional Neural Networks for Automated Detection of Lacunes of Presumed Vascular Origin

    Authors: Mohsen Ghafoorian, Nico Karssemeijer, Tom Heskes, Mayra Bergkamp, Joost Wissink, Jiri Obels, Karlijn Keizer, Frank-Erik de Leeuw, Bram van Ginneken, Elena Marchiori, Bram Platel

    Abstract: Lacunes of presumed vascular origin (lacunes) are associated with an increased risk of stroke, gait impairment, and dementia and are a primary imaging feature of small vessel disease. Quantification of lacunes may be of great importance to elucidate the mechanisms behind neuro-degenerative disorders and is recommended as part of study standards for small vessel disease research. However, due t…

    Submitted 29 October, 2016; v1 submitted 24 October, 2016; originally announced October 2016.

    Comments: 11 pages, 7 figures

    Journal ref: Neuroimage Clin 14 (2017) 391-399

  31. arXiv:1610.04834  [pdf, other]

    cs.CV

    Location Sensitive Deep Convolutional Neural Networks for Segmentation of White Matter Hyperintensities

    Authors: Mohsen Ghafoorian, Nico Karssemeijer, Tom Heskes, Inge van Uden, Clara Sanchez, Geert Litjens, Frank-Erik de Leeuw, Bram van Ginneken, Elena Marchiori, Bram Platel

    Abstract: The anatomical location of imaging features is of crucial importance for accurate diagnosis in many medical tasks. Convolutional neural networks (CNN) have had huge successes in computer vision, but they lack the natural ability to incorporate the anatomical location in their decision making process, hindering success in some medical image analysis tasks. In this paper, to integrate the anatomic…

    Submitted 29 October, 2016; v1 submitted 16 October, 2016; originally announced October 2016.

    Comments: 13 pages, 8 figures

  32. arXiv:1605.06838  [pdf, other]

    stat.ML cs.AI

    Causality on Longitudinal Data: Stable Specification Search in Constrained Structural Equation Modeling

    Authors: Ridho Rahmadi, Perry Groot, Marieke HC van Rijn, Jan AJG van den Brand, Marianne Heins, Hans Knoop, Tom Heskes

    Abstract: A typical problem in causal modeling is the instability of model structure learning, i.e., small changes in finite data can result in completely different optimal models. The present work introduces a novel causal modeling algorithm for longitudinal data, that is robust for finite samples based on recent advances in stability selection using subsampling and selection algorithms. Our approach uses…

    Submitted 4 April, 2017; v1 submitted 22 May, 2016; originally announced May 2016.

  33. arXiv:1604.04931  [pdf, other]

    stat.AP stat.ML

    Regularizing Solutions to the MEG Inverse Problem Using Space-Time Separable Covariance Functions

    Authors: Arno Solin, Pasi Jylänki, Jaakko Kauramäki, Tom Heskes, Marcel A. J. van Gerven, Simo Särkkä

    Abstract: In magnetoencephalography (MEG) the conventional approach to source reconstruction is to solve the underdetermined inverse problem independently over time and space. Here we present how the conventional approach can be extended by regularizing the solution in space and time by a Gaussian process (Gaussian random field) model. Assuming a separable covariance function in space and time, the computat…

    Submitted 17 April, 2016; originally announced April 2016.

    Comments: 25 pages, 7 figures

  34. arXiv:1604.04428  [pdf, other]

    cs.LG cs.NE

    The Artificial Mind's Eye: Resisting Adversarials for Convolutional Neural Networks using Internal Projection

    Authors: Harm Berntsen, Wouter Kuijper, Tom Heskes

    Abstract: We introduce a novel artificial neural network architecture that integrates robustness to adversarial input in the network structure. The main idea of our approach is to force the network to make predictions on what the given instance of the class under consideration would look like and subsequently test those predictions. By forcing the network to redraw the relevant parts of the image and subseq…

    Submitted 14 July, 2016; v1 submitted 15 April, 2016; originally announced April 2016.

  35. arXiv:1512.02831  [pdf, other]

    cs.DC cs.DS cs.LG

    Bigger Buffer k-d Trees on Multi-Many-Core Systems

    Authors: Fabian Gieseke, Cosmin Eugen Oancea, Ashish Mahabal, Christian Igel, Tom Heskes

    Abstract: A buffer k-d tree is a k-d tree variant for massively-parallel nearest neighbor search. While providing valuable speed-ups on modern many-core devices in case both a large number of reference and query points are given, buffer k-d trees are limited by the amount of points that can fit on a single device. In this work, we show how to modify the original data structure and the associated workflow to…

    Submitted 9 December, 2015; originally announced December 2015.

  36. Causality on Cross-Sectional Data: Stable Specification Search in Constrained Structural Equation Modeling

    Authors: Ridho Rahmadi, Perry Groot, Marianne Heins, Hans Knoop, Tom Heskes

    Abstract: Causal modeling has long been an attractive topic for many researchers, and recent decades have seen a surge in theoretical development and discovery algorithms. Generally, discovery algorithms can be divided into two approaches: constraint-based and score-based. The constraint-based approach is able to detect common causes of the observed variables but the use of independence tests makes it…

    Submitted 14 July, 2016; v1 submitted 18 June, 2015; originally announced June 2015.

    Journal ref: Applied.Soft.Comp. 52 (2017) 687-698

  37. arXiv:1411.1557  [pdf, other]

    stat.ML

    Proof Supplement - Learning Sparse Causal Models is not NP-hard (UAI2013)

    Authors: Tom Claassen, Joris M. Mooij, Tom Heskes

    Abstract: This article contains detailed proofs and additional examples related to the UAI-2013 submission `Learning Sparse Causal Models is not NP-hard'. It describes the FCI+ algorithm: a method for sound and complete causal model discovery in the presence of latent confounders and/or selection bias, that has worst case polynomial complexity of order $N^{2(k+1)}$ in the number of independence tests, for s…

    Submitted 6 November, 2014; originally announced November 2014.

    Comments: 11 pages, supplement to `Learning Sparse Causal Models is not NP-hard' (UAI2013)

  38. arXiv:1409.2676  [pdf, other]

    q-bio.NC stat.ME

    Efficient sampling of Gaussian graphical models using conditional Bayes factors

    Authors: Max Hinne, Alex Lenkoski, Tom Heskes, Marcel van Gerven

    Abstract: Bayesian estimation of Gaussian graphical models has proven to be challenging because the conjugate prior distribution on the Gaussian precision matrix, the G-Wishart distribution, has a doubly intractable partition function. Recent developments provide a direct way to sample from the G-Wishart distribution, which allows for more efficient algorithms for model selection than previously possible. S…

    Submitted 9 September, 2014; originally announced September 2014.

    Comments: 9 pages, 1 figure

  39. arXiv:1401.3877  [pdf]

    cs.LG cs.AI stat.ML

    Properties of Bethe Free Energies and Message Passing in Gaussian Models

    Authors: Botond Cseke, Tom Heskes

    Abstract: We address the problem of computing approximate marginals in Gaussian probabilistic models by using mean field and fractional Bethe approximations. We define the Gaussian fractional Bethe free energy in terms of the moment parameters of the approximate marginals, derive a lower and an upper bound on the fractional Bethe free energy and establish a necessary condition for the lower bound to be boun…

    Submitted 16 January, 2014; originally announced January 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 41, pages 1-24, 2011

  40. arXiv:1309.6849  [pdf]

    cs.LG cs.AI stat.ML

    Cyclic Causal Discovery from Continuous Equilibrium Data

    Authors: Joris Mooij, Tom Heskes

    Abstract: We propose a method for learning cyclic causal models from a combination of observational and interventional equilibrium data. Novel aspects of the proposed method are its ability to work with continuous data (without assuming linearity) and to deal with feedback loops. Within the context of biochemical reactions, we also propose a novel way of modeling interventions that modify the activity of co…

    Submitted 26 September, 2013; originally announced September 2013.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-431-439

  41. arXiv:1309.6824  [pdf]

    cs.AI

    Learning Sparse Causal Models is not NP-hard

    Authors: Tom Claassen, Joris Mooij, Tom Heskes

    Abstract: This paper shows that causal model discovery is not an NP-hard problem, in the sense that for sparse graphs bounded by node degree k the sound and complete causal model can be obtained in worst case order N^{2(k+2)} independence tests, even when latent variables and selection bias may be present. We present a modification of the well-known FCI algorithm that implements the method for an independen…

    Submitted 26 September, 2013; originally announced September 2013.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-172-181

  42. arXiv:1307.0846  [pdf, ps, other

    stat.ML cs.IR cs.LG

    Semi-supervised Ranking Pursuit

    Authors: Evgeni Tsivtsivadze, Tom Heskes

    Abstract: We propose a novel sparse preference learning/ranking algorithm. Our algorithm approximates the true utility function by a weighted sum of basis functions using the squared loss on pairs of data points, and is a generalization of the kernel matching pursuit method. It can operate both in a supervised and a semi-supervised setting and allows efficient search for multiple, near-optimal solutions. Fu…

    Submitted 2 July, 2013; originally announced July 2013.

  43. arXiv:1305.4152  [pdf, other

    stat.ML

    Sparse Approximate Inference for Spatio-Temporal Point Process Models

    Authors: Botond Cseke, Andrew Zammit Mangion, Tom Heskes, Guido Sanguinetti

    Abstract: Spatio-temporal point process models play a central role in the analysis of spatially distributed systems in several disciplines. Yet, scalable inference remains computationally challenging both due to the high resolution modelling generally required and the analytically intractable likelihood function. Here, we exploit the sparsity structure typical of (spatially) discretised log-Gaussian Cox p…

    Submitted 6 July, 2015; v1 submitted 17 May, 2013; originally announced May 2013.

    MSC Class: 60G55; 60G15; 62F15; 62M30; 62H11

  44. arXiv:1301.0613  [pdf

    cs.LG cs.AI stat.ML

    IPF for Discrete Chain Factor Graphs

    Authors: Wim Wiegerinck, Tom Heskes

    Abstract: Iterative Proportional Fitting (IPF), combined with EM, is commonly used as an algorithm for likelihood maximization in undirected graphical models. In this paper, we present two iterative algorithms that generalize upon IPF. The first one is for likelihood maximization in discrete chain factor graphs, which we define as a wide class of discrete variable models including undirected graphical mod…

    Submitted 12 December, 2012; originally announced January 2013.

    Comments: Appears in Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI2002)

    Report number: UAI-P-2002-PG-560-567

  45. arXiv:1301.0572  [pdf

    cs.AI

    Expectation Propogation for approximate inference in dynamic Bayesian networks

    Authors: Tom Heskes, Onno Zoeter

    Abstract: We describe expectation propagation for approximate inference in dynamic Bayesian networks as a natural extension of Pearl's exact belief propagation. Expectation propagation is a greedy algorithm, converges in many practical cases, but not always. We derive a double-loop algorithm, guaranteed to converge to a local minimum of a Bethe free energy. Furthermore, we show that st…

    Submitted 12 December, 2012; originally announced January 2013.

    Comments: Appears in Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI2002)

    Report number: UAI-P-2002-PG-216-223

  46. arXiv:1212.2480  [pdf

    cs.LG cs.AI stat.ML

    Approximate Inference and Constrained Optimization

    Authors: Tom Heskes, Kees Albers, Hilbert Kappen

    Abstract: Loopy and generalized belief propagation are popular algorithms for approximate inference in Markov random fields and Bayesian networks. Fixed points of these algorithms correspond to extrema of the Bethe and Kikuchi free energy. However, belief propagation does not always converge, which explains the need for approaches that explicitly minimize the Kikuchi/Bethe free energy, such…

    Submitted 19 October, 2012; originally announced December 2012.

    Comments: Appears in Proceedings of the Nineteenth Conference on Uncertainty in Artificial Intelligence (UAI2003)

    Report number: UAI-P-2003-PG-313-320

  47. arXiv:1210.4866  [pdf

    cs.AI stat.ME

    A Bayesian Approach to Constraint Based Causal Inference

    Authors: Tom Claassen, Tom Heskes

    Abstract: We target the problem of accuracy and robustness in causal inference from finite data sets. Some state-of-the-art algorithms produce clear output complete with solid theoretical guarantees but are susceptible to propagating erroneous decisions, while others are very adept at handling and representing uncertainty, but need to rely on undesirable assumptions. Our aim is to combine the inherent robus…

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-207-216

  48. arXiv:1206.3243  [pdf

    cs.LG stat.ML

    Bounds on the Bethe Free Energy for Gaussian Networks

    Authors: Botond Cseke, Tom Heskes

    Abstract: We address the problem of computing approximate marginals in Gaussian probabilistic models by using mean field and fractional Bethe approximations. As an extension of Welling and Teh (2001), we define the Gaussian fractional Bethe free energy in terms of the moment parameters of the approximate marginals and derive an upper and lower bound for it. We give necessary conditions for the Gaussian frac…

    Submitted 13 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI2008)

    Report number: UAI-P-2008-PG-97-104

  49. arXiv:1202.3711  [pdf

    cs.AI

    A Logical Characterization of Constraint-Based Causal Discovery

    Authors: Tom Claassen, Tom Heskes

    Abstract: We present a novel approach to constraint-based causal discovery, that takes the form of straightforward logical inference, applied to a list of simple, logical statements about causal relations that are derived directly from observed (in)dependencies. It is both sound and complete, in the sense that all invariant features of the corresponding partial ancestral graph (PAG) are identified, even in…

    Submitted 14 February, 2012; originally announced February 2012.

    Report number: UAI-P-2011-PG-135-144

  50. arXiv:1202.1696  [pdf, ps, other

    q-bio.NC stat.AP

    Bayesian Inference of Whole-Brain Networks

    Authors: M. Hinne, T. Heskes, M. A. J. van Gerven

    Abstract: In structural brain networks the connections of interest consist of white-matter fibre bundles between spatially segregated brain regions. The presence, location and orientation of these white matter tracts can be derived using diffusion MRI in combination with probabilistic tractography. Unfortunately, as of yet no approaches have been suggested that provide an undisputed way of inferring brain n…

    Submitted 8 February, 2012; originally announced February 2012.

    Comments: 10 pages, 2 figures