Skip to main content

Showing 1–25 of 25 results for author: Schulte, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.04221  [pdf, other

    cs.LG cs.AI

    Why Online Reinforcement Learning is Causal

    Authors: Oliver Schulte, Pascal Poupart

    Abstract: Reinforcement learning (RL) and causal modelling naturally complement each other. The goal of causal modelling is to predict the effects of interventions in an environment, while the goal of reinforcement learning is to select interventions that maximize the rewards the agent receives from the environment. Reinforcement learning includes the two most powerful sources of information for estimating… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 27 pages

    ACM Class: I.2.6

  2. arXiv:2402.11124  [pdf, other

    cs.LG

    Implicit Causal Representation Learning via Switchable Mechanisms

    Authors: Shayan Shirahmad Gale Bagi, Zahra Gharaee, Oliver Schulte, Mark Crowley

    Abstract: Learning causal representations from observational and interventional data in the absence of known ground-truth graph structures necessitates implicit latent causal representation learning. Implicit learning of causal mechanisms typically involves two categories of interventional data: hard and soft interventions. In real-world scenarios, soft interventions are often more realistic than hard inter… ▽ More

    Submitted 28 May, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  3. arXiv:2305.01089  [pdf, other

    cs.LG cs.AI

    Computing Expected Motif Counts for Exchangeable Graph Generative Models

    Authors: Oliver Schulte

    Abstract: Estimating the expected value of a graph statistic is an important inference task for using and learning graph models. This note presents a scalable estimation procedure for expected motif counts, a widely used type of graph statistic. The procedure applies for generative mixture models of the type used in neural and Bayesian approaches to graph data.

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: 8 pages

    MSC Class: 60G09 ACM Class: I.2.6

  4. arXiv:2302.08635  [pdf, other

    cs.LG stat.ML

    Generative Causal Representation Learning for Out-of-Distribution Motion Forecasting

    Authors: Shayan Shirahmad Gale Bagi, Zahra Gharaee, Oliver Schulte, Mark Crowley

    Abstract: Conventional supervised learning methods typically assume i.i.d samples and are found to be sensitive to out-of-distribution (OOD) data. We propose Generative Causal Representation Learning (GCRL) which leverages causality to facilitate knowledge transfer under distribution shifts. While we evaluate the effectiveness of our proposed method in human trajectory prediction models, GCRL can be applied… ▽ More

    Submitted 25 April, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

  5. arXiv:2302.07989  [pdf, ps, other

    cs.LG stat.ML

    From Graph Generation to Graph Classification

    Authors: Oliver Schulte

    Abstract: This note describes a new approach to classifying graphs that leverages graph generative models (GGM). Assuming a GGM that defines a joint probability distribution over graphs and their class labels, I derive classification formulas for the probability of a class label given a graph. A new conditional ELBO can be used to train a generative graph auto-encoder model for discrimination. While leverag… ▽ More

    Submitted 23 July, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: I welcome suggestions, comments, and proposals for collaboration to develop further the ideas in this paper. Please email [email protected]. I am grateful to Renjie Liao for helpful comments

    ACM Class: I.2.6

  6. arXiv:2301.12930  [pdf, other

    cs.LG stat.ML

    Cause-Effect Inference in Location-Scale Noise Models: Maximum Likelihood vs. Independence Testing

    Authors: Xiangyu Sun, Oliver Schulte

    Abstract: A fundamental problem of causal discovery is cause-effect inference, learning the correct causal direction between two random variables. Significant progress has been made through modelling the effect as a function of its cause and a noise term, which allows us to leverage assumptions about the generating function class. The recently introduced heteroscedastic location-scale noise functional model… ▽ More

    Submitted 25 October, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: NeurIPS 2023

  7. arXiv:2210.16844  [pdf, other

    cs.LG

    Micro and Macro Level Graph Modeling for Graph Variational Auto-Encoders

    Authors: Kiarash Zahirnia, Oliver Schulte, Parmis Naddaf, Ke Li

    Abstract: Generative models for graph data are an important research topic in machine learning. Graph data comprise two levels that are typically analyzed separately: node-level properties such as the existence of a link between a pair of nodes, and global aggregate graph-level statistics, such as motif counts. This paper proposes a new multi-level framework that jointly models node-level properties and gra… ▽ More

    Submitted 13 January, 2023; v1 submitted 30 October, 2022; originally announced October 2022.

    Comments: Thirty-sixth Conference on Neural Information Processing Systems, 2022

  8. arXiv:2110.09767  [pdf, other

    cs.LG cs.AI cs.DB

    Pre and Post Counting for Scalable Statistical-Relational Model Discovery

    Authors: Richard Mar, Oliver Schulte

    Abstract: Statistical-Relational Model Discovery aims to find statistically relevant patterns in relational data. For example, a relational dependency pattern may stipulate that a user's gender is associated with the gender of their friends. As with propositional (non-relational) graphical models, the major scalability bottleneck for model discovery is computing instantiation counts: the number of times a r… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: Presented at the Tenth International Workshop on Statistical Relational AI at the 1st International Joint Conference on Learning & Reasoning

    MSC Class: 68T05 ACM Class: I.2.6

  9. arXiv:2109.04286  [pdf, other

    cs.LG cs.AI stat.ML

    NTS-NOTEARS: Learning Nonparametric DBNs With Prior Knowledge

    Authors: Xiangyu Sun, Oliver Schulte, Guiliang Liu, Pascal Poupart

    Abstract: We describe NTS-NOTEARS, a score-based structure learning method for time-series data to learn dynamic Bayesian networks (DBNs) that captures nonlinear, lagged (inter-slice) and instantaneous (intra-slice) relations among variables. NTS-NOTEARS utilizes 1D convolutional neural networks (CNNs) to model the dependence of child variables on their parents; 1D CNN is a neural function approximation mod… ▽ More

    Submitted 1 March, 2023; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: AISTATS 2023

  10. arXiv:2106.15239  [pdf, other

    cs.LG

    Generating the Graph Gestalt: Kernel-Regularized Graph Representation Learning

    Authors: Kiarash Zahirnia, Ankita Sakhuja, Oliver Schulte, Parmis Nadaf, Ke Li, Xia Hu

    Abstract: Recent work on graph generative models has made remarkable progress towards generating increasingly realistic graphs, as measured by global graph features such as degree distribution, density, and clustering coefficients. Deep generative models have also made significant advances through better modelling of the local correlations in the graph topology, which have been very useful for predicting un… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

  11. Cracking the Black Box: Distilling Deep Sports Analytics

    Authors: Xiangyu Sun, Jack Davis, Oliver Schulte, Guiliang Liu

    Abstract: This paper addresses the trade-off between Accuracy and Transparency for deep learning applied to sports analytics. Neural nets achieve great predictive accuracy through deep learning, and are popular in sports analytics. But it is hard to interpret a neural net model and harder still to extract actionable insights from the knowledge implicit in it. Therefore, we built a simple and transparent mod… ▽ More

    Submitted 29 June, 2020; v1 submitted 3 June, 2020; originally announced June 2020.

    Comments: Accepted by the 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2020); Added the tenth feature to Table 3 for soccer;

  12. arXiv:2004.10984  [pdf, other

    cs.LG cs.AI stat.ML

    A Complete Characterization of Projectivity for Statistical Relational Models

    Authors: Manfred Jaeger, Oliver Schulte

    Abstract: A generative probabilistic model for relational data consists of a family of probability distributions for relational structures over domains of different sizes. In most existing statistical relational learning (SRL) frameworks, these models are not projective in the sense that the marginal of the distribution for size-$n$ structures on induced sub-structures of size $k<n$ is equal to the given di… ▽ More

    Submitted 22 June, 2020; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: Extended version (with proof appendix) of paper that is too appear in Proceedings of IJCAI 2020

    MSC Class: 60G09 ACM Class: I.2.6

  13. arXiv:1902.09711  [pdf, ps, other

    cs.DB

    Detecting Data Errors with Statistical Constraints

    Authors: **g Nathan Yan, Oliver Schulte, Jiannan Wang, Reynold Cheng

    Abstract: A powerful approach to detecting erroneous data is to check which potentially dirty data records are incompatible with a user's domain knowledge. Previous approaches allow the user to specify domain knowledge in the form of logical constraints (e.g., functional dependency and denial constraints). We extend the constraint-based approach by introducing a novel class of statistical constraints (SCs).… ▽ More

    Submitted 25 February, 2019; originally announced February 2019.

  14. arXiv:1807.05887  [pdf, other

    cs.LG stat.ML

    Toward Interpretable Deep Reinforcement Learning with Linear Model U-Trees

    Authors: Guiliang Liu, Oliver Schulte, Wang Zhu, Qingcan Li

    Abstract: Deep Reinforcement Learning (DRL) has achieved impressive success in many applications. A key component of many DRL models is a neural network representing a Q function, to estimate the expected cumulative reward following a state-action pair. The Q function neural network contains a lot of implicit knowledge about the RL problems, but often remains unexamined and uninterpreted. To our knowledge,… ▽ More

    Submitted 16 July, 2018; originally announced July 2018.

    Comments: This paper is accepted by ECML-PKDD 2018

  15. arXiv:1807.00564  [pdf, ps, other

    cs.AI

    Inference, Learning, and Population Size: Projectivity for SRL Models

    Authors: Manfred Jaeger, Oliver Schulte

    Abstract: A subtle difference between propositional and relational data is that in many relational models, marginal probabilities depend on the population or domain size. This paper connects the dependence on population size to the classic notion of projectivity from statistical theory: Projectivity implies that relational predictions are robust with respect to changes in domain size. We discuss projectivit… ▽ More

    Submitted 2 July, 2018; originally announced July 2018.

  16. arXiv:1807.00381  [pdf, other

    cs.AI cs.LG stat.ML

    Model-based Exception Mining for Object-Relational Data

    Authors: Fatemeh Riahi, Oliver Schulte

    Abstract: This paper is based on a previous publication [29]. Our work extends exception mining and outlier detection to the case of object-relational data. Object-relational data represent a complex heterogeneous network [12], which comprises objects of different types, links among these objects, also of different types, and attributes of these links. This special structure prohibits a direct vectorial dat… ▽ More

    Submitted 1 July, 2018; originally announced July 2018.

    Comments: StarAI 2018

  17. arXiv:1805.11088  [pdf, other

    cs.LG cs.AI stat.ML

    Deep Reinforcement Learning in Ice Hockey for Context-Aware Player Evaluation

    Authors: Guiliang Liu, Oliver Schulte

    Abstract: A variety of machine learning models have been proposed to assess the performance of players in professional sports. However, they have only a limited ability to model how player performance depends on the game context. This paper proposes a new approach to capturing game context: we apply Deep Reinforcement Learning (DRL) to learn an action-value Q function from 3M play-by-play events in the Nati… ▽ More

    Submitted 16 July, 2018; v1 submitted 26 May, 2018; originally announced May 2018.

    Comments: This paper has been accepted by IJCAI 2018

  18. arXiv:1802.08765  [pdf, other

    cs.LG cs.SI

    Model Trees for Identifying Exceptional Players in the NHL Draft

    Authors: Oliver Schulte, Yejia Liu, Chao Li

    Abstract: Drafting strong players is crucial for the team success. We describe a new data-driven interpretable approach for assessing draft prospects in the National Hockey League. Successful previous approaches have built a predictive model based on player features, or derived performance predictions from the observed performance of comparable players in a cohort. This paper develops model tree learning, w… ▽ More

    Submitted 23 February, 2018; originally announced February 2018.

    Comments: 14 pages

  19. arXiv:1511.03086  [pdf, ps, other

    cs.LG cs.DB

    The CTU Prague Relational Learning Repository

    Authors: Jan Motl, Oliver Schulte

    Abstract: The aim of the Prague Relational Learning Repository is to support machine learning research with multi-relational data. The repository currently contains 148 SQL databases hosted on a public MySQL server located at \url{https://relational-data.org}. The server is provided by getML to support the relational machine learning community (\url{www.getml.com}). A searchable meta-database provides metad… ▽ More

    Submitted 11 March, 2024; v1 submitted 10 November, 2015; originally announced November 2015.

    Comments: 7 pages

    ACM Class: I.2.6; H.2.8

  20. arXiv:1508.02428  [pdf, other

    cs.DB cs.LG

    FactorBase: SQL for Learning A Multi-Relational Graphical Model

    Authors: Oliver Schulte, Zhensong Qian

    Abstract: We describe FactorBase, a new SQL-based framework that leverages a relational database management system to support multi-relational model discovery. A multi-relational statistical model provides an integrated analysis of the heterogeneous and interdependent data resources in the database. We adopt the BayesStore design philosophy: statistical models are stored and managed as first-class citizens… ▽ More

    Submitted 10 August, 2015; originally announced August 2015.

    Comments: 14 pages, 10 figures, 10 tables, Published on 2015 IEEE International Conference on Data Science and Advanced Analytics (IEEE DSAA'2015), Oct 19-21, 2015, Paris, France

    ACM Class: H.2.8; H.2.4

  21. arXiv:1507.00646  [pdf, other

    cs.LG cs.DB

    SQL for SRL: Structure Learning Inside a Database System

    Authors: Oliver Schulte, Zhensong Qian

    Abstract: The position we advocate in this paper is that relational algebra can provide a unified language for both representing and computing with statistical-relational objects, much as linear algebra does for traditional single-table machine learning. Relational algebra is implemented in the Structured Query Language (SQL), which is the basis of relational database management systems. To support our posi… ▽ More

    Submitted 2 July, 2015; originally announced July 2015.

    Comments: 3 pages, 1 figure, Position Paper of the Fifth International Workshop on Statistical Relational AI at UAI 2015

    ACM Class: H.2.8; H.2.4

  22. arXiv:1410.7835  [pdf, other

    cs.LG

    Fast Learning of Relational Dependency Networks

    Authors: Oliver Schulte, Zhensong Qian, Arthur E. Kirkpatrick, Xiaoqian Yin, Yan Sun

    Abstract: A Relational Dependency Network (RDN) is a directed graphical model widely used for multi-relational data. These networks allow cyclic dependencies, necessary to represent relational autocorrelations. We describe an approach for learning both the RDN's structure and its parameters, given an input relational database: First learn a Bayesian network (BN), then transform the Bayesian network to an RD… ▽ More

    Submitted 8 December, 2014; v1 submitted 28 October, 2014; originally announced October 2014.

    Comments: 17 pages, 2 figures, 3 tables, Accepted as long paper by ILP 2014, September 14- 16th, Nancy, France. Added the Appendix: Proof of Consistency Characterization

  23. Computing Multi-Relational Sufficient Statistics for Large Databases

    Authors: Zhensong Qian, Oliver Schulte, Yan Sun

    Abstract: Databases contain information about which relationships do and do not hold among entities. To make this information accessible for statistical analysis requires computing sufficient statistics that combine information from different database tables. Such statistics may involve any number of {\em positive and negative} relationships. With a naive enumeration approach, computing sufficient statistic… ▽ More

    Submitted 22 August, 2014; originally announced August 2014.

    Comments: 11pages, 8 figures, 8 tables, CIKM'14,November 3--7, 2014, Shanghai, China

    ACM Class: H.2.8; H.2.4

  24. arXiv:0811.4458  [pdf, other

    cs.LG cs.AI

    Learning Class-Level Bayes Nets for Relational Data

    Authors: Oliver Schulte, Hassan Khosravi, Flavia Moser, Martin Ester

    Abstract: Many databases store data in relational format, with different types of entities and information about links between the entities. The field of statistical-relational learning (SRL) has developed a number of new statistical models for such data. In this paper we focus on learning class-level or first-order dependencies, which model the general database statistics over attributes of linked object… ▽ More

    Submitted 20 October, 2009; v1 submitted 26 November, 2008; originally announced November 2008.

    Comments: 14 pages (2 column)

    Report number: TR 2008-17, School of Computing Science, Simon Fraser University ACM Class: I.2.6

  25. arXiv:0710.2083  [pdf, ps, other

    cs.DB cs.LG cs.LO

    Association Rules in the Relational Calculus

    Authors: Oliver Schulte, Flavia Moser, Martin Ester, Zhiyong Lu

    Abstract: One of the most utilized data mining tasks is the search for association rules. Association rules represent significant relationships between items in transactions. We extend the concept of association rule to represent a much broader class of associations, which we refer to as \emph{entity-relationship rules.} Semantically, entity-relationship rules express associations between properties of re… ▽ More

    Submitted 10 October, 2007; originally announced October 2007.

    Comments: 16 pages, 13 tables

    Report number: SFU School of Computing Science, TR 2007-23