Skip to main content

Showing 1–13 of 13 results for author: Schleich, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2210.17071  [pdf, other

    cs.LG cs.DB

    Computing Rule-Based Explanations by Leveraging Counterfactuals

    Authors: Zixuan Geng, Maximilian Schleich, Dan Suciu

    Abstract: Sophisticated machine models are increasingly used for high-stakes decisions in everyday life. There is an urgent need to develop effective explanation techniques for such automated decisions. Rule-Based Explanations have been proposed for high-stake decisions like loan applications, because they increase the users' trust in the decision. However, rule-based explanations are very inefficient to co… ▽ More

    Submitted 31 October, 2022; originally announced October 2022.

  2. arXiv:2210.06267  [pdf, other

    cs.DB cs.PL

    Optimizing Tensor Programs on Flexible Storage

    Authors: Maximilian Schleich, Amir Shaikhha, Dan Suciu

    Abstract: Tensor programs often need to process large tensors (vectors, matrices, or higher order tensors) that require a specialized storage format for their memory layout. Several such layouts have been proposed in the literature, such as the Coordinate Format, the Compressed Sparse Row format, and many others, that were especially designed to optimally store tensors with specific sparsity properties. How… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  3. arXiv:2101.01292  [pdf, other

    cs.LG cs.DB

    GeCo: Quality Counterfactual Explanations in Real Time

    Authors: Maximilian Schleich, Zixuan Geng, Yihong Zhang, Dan Suciu

    Abstract: Machine learning is increasingly applied in high-stakes decision making that directly affect people's lives, and this leads to an increased demand for systems to explain their decisions. Explanations often take the form of counterfactuals, which consists of conveying to the end user what she/he needs to change in order to improve the outcome. Computing counterfactual explanations is challenging, b… ▽ More

    Submitted 18 May, 2021; v1 submitted 4 January, 2021; originally announced January 2021.

    Comments: 16 pages, 12 figures, 3 tables, 3 algorithms

  4. arXiv:2009.08634  [pdf, ps, other

    cs.AI cs.CC cs.LG

    On the Tractability of SHAP Explanations

    Authors: Guy Van den Broeck, Anton Lykov, Maximilian Schleich, Dan Suciu

    Abstract: SHAP explanations are a popular feature-attribution mechanism for explainable AI. They use game-theoretic notions to measure the influence of individual features on the prediction of a machine learning model. Despite a lot of recent interest from both academia and industry, it is not known whether SHAP explanations of common machine learning models can be computed efficiently. In this paper, we es… ▽ More

    Submitted 30 January, 2021; v1 submitted 18 September, 2020; originally announced September 2020.

    Comments: Proceedings of the 35th AAAI Conference on Artificial Intelligence

  5. arXiv:2008.08657  [pdf, other

    cs.DB

    LMFAO: An Engine for Batches of Group-By Aggregates

    Authors: Maximilian Schleich, Dan Olteanu

    Abstract: LMFAO is an in-memory optimization and execution engine for large batches of group-by aggregates over joins. Such database workloads capture the data-intensive computation of a variety of data science applications. We demonstrate LMFAO for three popular models: ridge linear regression with batch gradient descent, decision trees with CART, and clustering with Rk-means.

    Submitted 19 August, 2020; originally announced August 2020.

    Comments: 4 pages, 4 figures

  6. arXiv:2003.06868  [pdf, other

    cs.LG cs.AI cs.DB stat.ML

    Causality-based Explanation of Classification Outcomes

    Authors: Leopoldo Bertossi, Jordan Li, Maximilian Schleich, Dan Suciu, Zografoula Vagena

    Abstract: We propose a simple definition of an explanation for the outcome of a classifier based on concepts from causality. We compare it with previously proposed notions of explanation, and study their complexity. We conduct an experimental evaluation with two real datasets from the financial domain.

    Submitted 25 May, 2020; v1 submitted 15 March, 2020; originally announced March 2020.

    Comments: 16 pages, 6 figures, 1 table

  7. arXiv:2001.03541  [pdf, other

    cs.PL cs.DB cs.LG

    Multi-layer Optimizations for End-to-End Data Analytics

    Authors: Amir Shaikhha, Maximilian Schleich, Alexandru Ghita, Dan Olteanu

    Abstract: We consider the problem of training machine learning models over multi-relational data. The mainstream approach is to first construct the training dataset using a feature extraction query over input database and then use a statistical software package of choice to train the model. In this paper we introduce Iterative Functional Aggregate Queries (IFAQ), a framework that realizes an alternative app… ▽ More

    Submitted 10 January, 2020; originally announced January 2020.

  8. arXiv:1911.06577  [pdf, ps, other

    cs.DB

    Learning Models over Relational Data: A Brief Tutorial

    Authors: Maximilian Schleich, Dan Olteanu, Mahmoud Abo-Khamis, Hung Q. Ngo, XuanLong Nguyen

    Abstract: This tutorial overviews the state of the art in learning models over relational databases and makes the case for a first-principles approach that exploits recent developments in database research. The input to learning classification and regression models is a training dataset defined by feature extraction queries over relational databases. The mainstream approach to learning over relational dat… ▽ More

    Submitted 15 November, 2019; originally announced November 2019.

    Comments: 10 pages, 1 figure

    ACM Class: H.2.4; I.2.6

  9. arXiv:1910.04939  [pdf, ps, other

    cs.LG cs.DB stat.ML

    Rk-means: Fast Clustering for Relational Data

    Authors: Ryan Curtin, Ben Moseley, Hung Q. Ngo, XuanLong Nguyen, Dan Olteanu, Maximilian Schleich

    Abstract: Conventional machine learning algorithms cannot be applied until a data matrix is available to process. When the data matrix needs to be obtained from a relational database via a feature extraction query, the computation cost can be prohibitive, as the data matrix may be (much) larger than the total input relation size. This paper introduces Rk-means, or relational k -means algorithm, for clusteri… ▽ More

    Submitted 10 October, 2019; originally announced October 2019.

  10. arXiv:1906.08687  [pdf, other

    cs.DB

    A Layered Aggregate Engine for Analytics Workloads

    Authors: Maximilian Schleich, Dan Olteanu, Mahmoud Abo Khamis, Hung Q. Ngo, XuanLong Nguyen

    Abstract: This paper introduces LMFAO (Layered Multiple Functional Aggregate Optimization), an in-memory optimization and execution engine for batches of aggregates over the input database. The primary motivation for this work stems from the observation that for a variety of analytics over databases, their data-intensive tasks can be decomposed into group-by aggregates over the join of the input database re… ▽ More

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: 18 pages, 7 figures, 4 tables

    ACM Class: H.2.4; I.2.6

  11. arXiv:1812.09526  [pdf, ps, other

    cs.DB cs.DS cs.IT cs.LG

    Functional Aggregate Queries with Additive Inequalities

    Authors: Mahmoud Abo Khamis, Ryan R. Curtin, Benjamin Moseley, Hung Q. Ngo, XuanLong Nguyen, Dan Olteanu, Maximilian Schleich

    Abstract: Motivated by fundamental applications in databases and relational machine learning, we formulate and study the problem of answering functional aggregate queries (FAQ) in which some of the input factors are defined by a collection of additive inequalities between variables. We refer to these queries as FAQ-AI for short. To answer FAQ-AI in the Boolean semiring, we define relaxed tree decompositio… ▽ More

    Submitted 15 September, 2020; v1 submitted 22 December, 2018; originally announced December 2018.

  12. arXiv:1803.07480  [pdf, other

    cs.DB

    AC/DC: In-Database Learning Thunderstruck

    Authors: Mahmoud Abo Khamis, Hung Q. Ngo, XuanLong Nguyen, Dan Olteanu, Maximilian Schleich

    Abstract: We report on the design and implementation of the AC/DC gradient descent solver for a class of optimization problems over normalized databases. AC/DC decomposes an optimization problem into a set of aggregates over the join of the database relations. It then uses the answers to these aggregates to iteratively improve the solution to the problem until it converges. The challenges faced by AC/DC a… ▽ More

    Submitted 15 June, 2018; v1 submitted 20 March, 2018; originally announced March 2018.

    Comments: 10 pages, 3 figures

    ACM Class: H.2.4; I.2.6

  13. arXiv:1703.04780  [pdf, other

    cs.DB

    Learning Models over Relational Data using Sparse Tensors and Functional Dependencies

    Authors: Mahmoud Abo Khamis, Hung Q. Ngo, XuanLong Nguyen, Dan Olteanu, Maximilian Schleich

    Abstract: Integrated solutions for analytics over relational databases are of great practical importance as they avoid the costly repeated loop data scientists have to deal with on a daily basis: select features from data residing in relational databases using feature extraction queries involving joins, projections, and aggregations; export the training dataset defined by such queries; convert this dataset… ▽ More

    Submitted 6 February, 2020; v1 submitted 14 March, 2017; originally announced March 2017.

    Comments: 61 pages, 9 figures, 2 tables

    ACM Class: H.2.4; I.2.6