Skip to main content

Showing 1–11 of 11 results for author: Mey, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.05218  [pdf, other

    cs.LG

    Invariant Causal Prediction with Locally Linear Models

    Authors: Alexander Mey, Rui Manuel Castro

    Abstract: We consider the task of identifying the causal parents of a target variable among a set of candidate variables from observational data. Our main assumption is that the candidate variables are observed in different environments which may, for example, correspond to different settings of a machine or different time intervals in a dynamical process. Under certain assumptions different environments ca… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

  2. arXiv:2011.01788  [pdf, other

    cs.AI

    Loss Bounds for Approximate Influence-Based Abstraction

    Authors: Elena Congeduti, Alexander Mey, Frans A. Oliehoek

    Abstract: Sequential decision making techniques hold great promise to improve the performance of many real-world systems, but computational complexity hampers their principled application. Influence-based abstraction aims to gain leverage by modeling local subproblems together with the 'influence' that the rest of the system exerts on them. While computing exact representations of such influence might be in… ▽ More

    Submitted 23 February, 2021; v1 submitted 3 November, 2020; originally announced November 2020.

    Comments: 13 pages, 9 figures

  3. arXiv:2010.02576  [pdf, ps, other

    cs.LG stat.ML

    A Note on High-Probability versus In-Expectation Guarantees of Generalization Bounds in Machine Learning

    Authors: Alexander Mey

    Abstract: Statistical machine learning theory often tries to give generalization guarantees of machine learning models. Those models naturally underlie some fluctuation, as they are based on a data sample. If we were unlucky, and gathered a sample that is not representative of the underlying distribution, one cannot expect to construct a reliable machine learning model. Following that, statements made about… ▽ More

    Submitted 18 November, 2022; v1 submitted 6 October, 2020; originally announced October 2020.

  4. A Brief Prehistory of Double Descent

    Authors: Marco Loog, Tom Viering, Alexander Mey, Jesse H. Krijthe, David M. J. Tax

    Abstract: In their thought-provoking paper [1], Belkin et al. illustrate and discuss the shape of risk curves in the context of modern high-complexity learners. Given a fixed training sample size $n$, such curves show the risk of a learner as a function of some (approximate) measure of its complexity $N$. With $N$ the number of features, these curves are also referred to as feature curves. A salient observa… ▽ More

    Submitted 7 April, 2020; originally announced April 2020.

  5. arXiv:1911.11030  [pdf, other

    cs.LG stat.ML

    Making Learners (More) Monotone

    Authors: Tom J. Viering, Alexander Mey, Marco Loog

    Abstract: Learning performance can show non-monotonic behavior. That is, more data does not necessarily lead to better models, even on average. We propose three algorithms that take a supervised learning model and make it perform more monotone. We prove consistency and monotonicity with high probability, and evaluate the algorithms on scenarios where non-monotone behaviour occurs. Our proposed algorithm… ▽ More

    Submitted 25 November, 2019; originally announced November 2019.

  6. arXiv:1908.11823  [pdf, other

    cs.LG stat.ML

    Consistency and Finite Sample Behavior of Binary Class Probability Estimation

    Authors: Alexander Mey, Marco Loog

    Abstract: In this work we investigate to which extent one can recover class probabilities within the empirical risk minimization (ERM) paradigm. The main aim of our paper is to extend existing results and emphasize the tight relations between empirical risk minimization and class probability estimation. Based on existing literature on excess risk bounds and proper scoring rules, we derive a class probabilit… ▽ More

    Submitted 21 July, 2020; v1 submitted 30 August, 2019; originally announced August 2019.

  7. arXiv:1908.09574  [pdf, other

    cs.LG stat.ML

    Improvability Through Semi-Supervised Learning: A Survey of Theoretical Results

    Authors: Alexander Mey, Marco Loog

    Abstract: Semi-supervised learning is a setting in which one has labeled and unlabeled data available. In this survey we explore different types of theoretical results when one uses unlabeled data in classification and regression tasks. Most methods that use unlabeled data rely on certain assumptions about the data distribution. When those assumptions are not met in reality, including unlabeled data may act… ▽ More

    Submitted 30 July, 2020; v1 submitted 26 August, 2019; originally announced August 2019.

  8. arXiv:1907.05476  [pdf, other

    cs.LG stat.ML

    Minimizers of the Empirical Risk and Risk Monotonicity

    Authors: Marco Loog, Tom Viering, Alexander Mey

    Abstract: Plotting a learner's average performance against the number of training samples results in a learning curve. Studying such curves on one or more data sets is a way to get to a better understanding of the generalization properties of this learner. The behavior of learning curves is, however, not very well understood and can display (for most researchers) quite unexpected behavior. Our work introduc… ▽ More

    Submitted 13 March, 2020; v1 submitted 11 July, 2019; originally announced July 2019.

    Comments: New version fixes some minor issues especially in the proof of Theorem 1

  9. A Distribution Dependent and Independent Complexity Analysis of Manifold Regularization

    Authors: Alexander Mey, Tom Viering, Marco Loog

    Abstract: Manifold regularization is a commonly used technique in semi-supervised learning. It enforces the classification rule to be smooth with respect to the data-manifold. Here, we derive sample complexity bounds based on pseudo-dimension for models that add a convex data dependent regularization term to a supervised learning process, as is in particular done in Manifold regularization. We then compare… ▽ More

    Submitted 27 November, 2019; v1 submitted 14 June, 2019; originally announced June 2019.

  10. arXiv:1905.12081  [pdf, other

    stat.ML cs.LG stat.OT

    Semi-Supervised Learning, Causality and the Conditional Cluster Assumption

    Authors: Julius von Kügelgen, Alexander Mey, Marco Loog, Bernhard Schölkopf

    Abstract: While the success of semi-supervised learning (SSL) is still not fully understood, Schölkopf et al. (2012) have established a link to the principle of independent causal mechanisms. They conclude that SSL should be impossible when predicting a target variable from its causes, but possible when predicting it from its effects. Since both these cases are somewhat restrictive, we extend their work by… ▽ More

    Submitted 24 June, 2020; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: 36th Conference on Uncertainty in Artificial Intelligence (2020) (Previously presented at the NeurIPS 2019 workshop "Do the right thing": machine learning and causal inference for improved decision making, Vancouver, Canada.)

  11. arXiv:1807.07879  [pdf, other

    stat.ML cs.LG

    Semi-Generative Modelling: Covariate-Shift Adaptation with Cause and Effect Features

    Authors: Julius von Kügelgen, Alexander Mey, Marco Loog

    Abstract: Current methods for covariate-shift adaptation use unlabelled data to compute importance weights or domain-invariant features, while the final model is trained on labelled data only. Here, we consider a particular case of covariate shift which allows us also to learn from unlabelled data, that is, combining adaptation with semi-supervised learning. Using ideas from causality, we argue that this re… ▽ More

    Submitted 26 February, 2019; v1 submitted 20 July, 2018; originally announced July 2018.

    Comments: Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS) 2019, Naha, Okinawa, Japan. (Camera-ready version)