
Showing 1–22 of 22 results for author: Van Erven, T

Searching in archive stat.
  1. arXiv:2307.02501  [pdf, ps, other]

    stat.ML cs.LG

    Generalization Guarantees via Algorithm-dependent Rademacher Complexity

    Authors: Sarah Sachs, Tim van Erven, Liam Hodgkinson, Rajiv Khanna, Umut Simsekli

    Abstract: Algorithm- and data-dependent generalization bounds are required to explain the generalization behavior of modern machine learning algorithms. In this context, there exist information-theoretic generalization bounds that involve (various forms of) mutual information, as well as bounds based on hypothesis set stability. We propose a conceptually related, but technically distinct complexity measure…

    Submitted 4 July, 2023; originally announced July 2023.

  2. arXiv:2306.00497  [pdf, other]

    cs.LG cs.CY stat.ML

    The Risks of Recourse in Binary Classification

    Authors: Hidde Fokkema, Damien Garreau, Tim van Erven

    Abstract: Algorithmic recourse provides explanations that help users overturn an unfavorable decision by a machine learning system. But so far very little attention has been paid to whether providing recourse is beneficial or not. We introduce an abstract learning-theoretic framework that compares the risks (i.e., expected losses) for classification with and without algorithmic recourse. This allows us to a…

    Submitted 1 March, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: 24 pages, 8 figures, 5 tables

  3. arXiv:2305.00832  [pdf, ps, other]

    cs.LG stat.ML

    First- and Second-Order Bounds for Adversarial Linear Contextual Bandits

    Authors: Julia Olkhovskaya, Jack Mayo, Tim van Erven, Gergely Neu, Chen-Yu Wei

    Abstract: We consider the adversarial linear contextual bandit setting, which allows for the loss functions associated with each of $K$ arms to change over time without restriction. Assuming the $d$-dimensional contexts are drawn from a fixed known distribution, the worst-case expected regret over the course of $T$ rounds is known to scale as $\tilde O(\sqrt{Kd T})$. Under the additional assumption that the…

    Submitted 24 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

  4. arXiv:2304.12768  [pdf, ps, other]

    cs.GT math.OC stat.ML

    Towards Characterizing the First-order Query Complexity of Learning (Approximate) Nash Equilibria in Zero-sum Matrix Games

    Authors: Hédi Hadiji, Sarah Sachs, Tim van Erven, Wouter M. Koolen

    Abstract: In the first-order query model for zero-sum $K\times K$ matrix games, players observe the expected pay-offs for all their possible actions under the randomized action played by their opponent. This classical model has received renewed interest after the discovery by Rakhlin and Sridharan that $\varepsilon$-approximate Nash equilibria can be computed efficiently from $O(\frac{\ln K}{\varepsilon})$ instead of…

    Submitted 2 November, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

  5. arXiv:2303.03272  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Accelerated Rates between Stochastic and Adversarial Online Convex Optimization

    Authors: Sarah Sachs, Hedi Hadiji, Tim van Erven, Cristobal Guzman

    Abstract: Stochastic and adversarial data are two widely studied settings in online learning. But many optimization tasks are neither i.i.d. nor fully adversarial, which makes it of fundamental interest to get a better theoretical understanding of the world between these extremes. In this work we establish novel regret bounds for online convex optimization in a setting that interpolates between stochastic i…

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Extended version of 'Between Stochastic and Adversarial Online Convex Optimization: Improved Regret Bounds via Smoothness' by the same authors. arXiv admin note: text overlap with arXiv:2202.07554

  6. arXiv:2302.08397  [pdf, ps, other]

    stat.ML cs.LG

    Adaptive Selective Sampling for Online Prediction with Experts

    Authors: Rui M. Castro, Fredrik Hellström, Tim van Erven

    Abstract: We consider online prediction of a binary sequence with expert advice. For this setting, we devise label-efficient forecasting algorithms, which use a selective sampling scheme that enables collecting much fewer labels than standard procedures, while still retaining optimal worst-case regret guarantees. These algorithms are based on exponentially weighted forecasters, suitable for settings with an…

    Submitted 20 October, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Journal ref: NeurIPS 2023

  7. arXiv:2205.15834  [pdf, other]

    stat.ML cs.LG

    Attribution-based Explanations that Provide Recourse Cannot be Robust

    Authors: Hidde Fokkema, Rianne de Heide, Tim van Erven

    Abstract: Different users of machine learning methods require different explanations, depending on their goals. To make machine learning accountable to society, one important goal is to get actionable options for recourse, which allow an affected user to change the decision $f(x)$ of a machine learning system by making limited changes to its input $x$. We formalize this by providing a general definition of…

    Submitted 20 December, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: 32 pages, 6 figures

  8. arXiv:2202.07554  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Between Stochastic and Adversarial Online Convex Optimization: Improved Regret Bounds via Smoothness

    Authors: Sarah Sachs, Hédi Hadiji, Tim van Erven, Cristóbal Guzmán

    Abstract: Stochastic and adversarial data are two widely studied settings in online learning. But many optimization tasks are neither i.i.d. nor fully adversarial, which makes it of fundamental interest to get a better theoretical understanding of the world between these extremes. In this work we establish novel regret bounds for online convex optimization in a setting that interpolates between stochastic i…

    Submitted 8 June, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

  9. arXiv:2102.07521  [pdf, ps, other]

    cs.LG stat.ML

    Distributed Online Learning for Joint Regret with Communication Constraints

    Authors: Dirk van der Hoeven, Hédi Hadiji, Tim van Erven

    Abstract: We consider distributed online learning for joint regret with communication constraints. In this setting, there are multiple agents that are connected in a graph. Each round, an adversary first activates one of the agents to issue a prediction and provides a corresponding gradient, and then the agents are allowed to send a $b$-bit message to their neighbors in the graph. All agents cooperate to co…

    Submitted 25 October, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

  10. arXiv:2102.06622  [pdf, other]

    cs.LG stat.ML

    MetaGrad: Adaptation using Multiple Learning Rates in Online Learning

    Authors: Tim van Erven, Wouter M. Koolen, Dirk van der Hoeven

    Abstract: We provide a new adaptive method for online convex optimization, MetaGrad, that is robust to general convex losses but achieves faster rates for a broad class of special functions, including exp-concave and strongly convex functions, but also various types of stochastic and non-stochastic functions without any curvature. We prove this by drawing a connection to the Bernstein condition, which is kn…

    Submitted 30 August, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

    Journal ref: Journal of Machine Learning Research 22(161):1-61, 2021

  11. arXiv:2006.07985  [pdf, other]

    cs.LG stat.ML

    Explaining Predictions by Approximating the Local Decision Boundary

    Authors: Georgios Vlassopoulos, Tim van Erven, Henry Brighton, Vlado Menkovski

    Abstract: Constructing accurate model-agnostic explanations for opaque machine learning models remains a challenging task. Classification models for high-dimensional data, like images, are often inherently complex. To reduce this complexity, individual predictions may be explained locally, either in terms of a simpler local surrogate model or by communicating how the predictions contrast with those of anoth…

    Submitted 22 October, 2020; v1 submitted 14 June, 2020; originally announced June 2020.

  12. arXiv:1902.10797  [pdf, ps, other]

    cs.LG stat.ML

    Lipschitz Adaptivity with Multiple Learning Rates in Online Learning

    Authors: Zakaria Mhammedi, Wouter M. Koolen, Tim van Erven

    Abstract: We aim to design adaptive online learning algorithms that take advantage of any special structure that might be present in the learning task at hand, with as little manual tuning by the user as possible. A fundamental obstacle that comes up in the design of such adaptive algorithms is to calibrate a so-called step-size or learning rate hyperparameter depending on variance, gradient norms, etc. A r…

    Submitted 30 May, 2019; v1 submitted 27 February, 2019; originally announced February 2019.

    Comments: 22 pages. To appear in COLT 2019

  13. arXiv:1810.10883  [pdf, other]

    stat.ME math.ST stat.CO

    Fast Exact Bayesian Inference for Sparse Signals in the Normal Sequence Model

    Authors: Tim van Erven, Botond Szabo

    Abstract: We consider exact algorithms for Bayesian inference with model selection priors (including spike-and-slab priors) in the sparse normal sequence model. Because the best existing exact algorithm becomes numerically unstable for sample sizes over n=500, there has been much attention for alternative approaches like approximate algorithms (Gibbs sampling, variational Bayes, etc.), shrinkage priors (e.g…

    Submitted 15 April, 2020; v1 submitted 25 October, 2018; originally announced October 2018.

  14. arXiv:1802.07543  [pdf, ps, other]

    stat.ML cs.LG

    The Many Faces of Exponential Weights in Online Learning

    Authors: Dirk van der Hoeven, Tim van Erven, Wojciech Kotłowski

    Abstract: A standard introduction to online learning might place Online Gradient Descent at its center and then proceed to develop generalizations and extensions like Online Mirror Descent and second-order methods. Here we explore the alternative approach of putting Exponential Weights (EW) first. We show that many standard methods and their regret bounds then follow as a special case by plugging in suitabl…

    Submitted 5 June, 2018; v1 submitted 21 February, 2018; originally announced February 2018.

    Journal ref: Proceedings of the 31st Conference On Learning Theory, PMLR 75:2067-2092, 2018

  15. arXiv:1507.02592  [pdf, other]

    cs.LG stat.ML

    Fast rates in statistical and online learning

    Authors: Tim van Erven, Peter D. Grünwald, Nishant A. Mehta, Mark D. Reid, Robert C. Williamson

    Abstract: The speed with which a learning algorithm converges as it is presented with more data is a central problem in machine learning: a fast rate of convergence means less data is needed for the same level of performance. The pursuit of fast rates in online and statistical learning has led to the discovery of many conditions in learning theory under which fast learning is possible. We show that most…

    Submitted 1 September, 2015; v1 submitted 9 July, 2015; originally announced July 2015.

    Comments: 69 pages, 3 figures

    Journal ref: Journal of Machine Learning Research 16(54):1793-1861, 2015

  16. arXiv:1502.08009  [pdf, ps, other]

    cs.LG stat.ML

    Second-order Quantile Methods for Experts and Combinatorial Games

    Authors: Wouter M. Koolen, Tim van Erven

    Abstract: We aim to design strategies for sequential decision making that adjust to the difficulty of the learning problem. We study this question both in the setting of prediction with expert advice, and for more general combinatorial decision tasks. We are not satisfied with just guaranteeing minimax regret rates, but we want our algorithms to perform significantly better on easy data. Two popular ways to…

    Submitted 27 February, 2015; originally announced February 2015.

  17. arXiv:1405.1580  [pdf, ps, other]

    stat.ML

    PAC-Bayes Mini-tutorial: A Continuous Union Bound

    Authors: Tim van Erven

    Abstract: When I first encountered PAC-Bayesian concentration inequalities they seemed to me to be rather disconnected from good old-fashioned results like Hoeffding's and Bernstein's inequalities. But, at least for one flavour of the PAC-Bayesian bounds, there is actually a very close relation, and the main innovation is a continuous version of the union bound, along with some ingenious applications. Here'…

    Submitted 7 May, 2014; originally announced May 2014.

  18. arXiv:1402.2044  [pdf, ps, other]

    stat.ML cs.LG math.ST

    A Second-order Bound with Excess Losses

    Authors: Pierre Gaillard, Gilles Stoltz, Tim Van Erven

    Abstract: We study online aggregation of the predictions of experts, and first show new second-order regret bounds in the standard setting, which are obtained via a version of the Prod algorithm (and also a version of the polynomially weighted average algorithm) with multiple learning rates. These bounds are in terms of excess losses, the differences between the instantaneous losses suffered by the algorith…

    Submitted 10 February, 2014; originally announced February 2014.

  19. arXiv:1301.0534  [pdf, ps, other]

    cs.LG stat.ML

    Follow the Leader If You Can, Hedge If You Must

    Authors: Steven de Rooij, Tim van Erven, Peter D. Grünwald, Wouter M. Koolen

    Abstract: Follow-the-Leader (FTL) is an intuitive sequential prediction strategy that guarantees constant regret in the stochastic setting, but has terrible performance for worst-case data. Other hedging strategies have better worst-case guarantees but may perform much worse than FTL if the data are not maximally adversarial. We introduce the FlipFlop algorithm, which is the first method that provably combi…

    Submitted 17 January, 2013; v1 submitted 3 January, 2013; originally announced January 2013.

    Comments: under submission

    Journal ref: Journal of Machine Learning Research 15(37):1281-1316, 2014

  20. arXiv:1206.2459  [pdf, other]

    cs.IT math.ST stat.ML

    Rényi Divergence and Kullback-Leibler Divergence

    Authors: Tim van Erven, Peter Harremoës

    Abstract: Rényi divergence is related to Rényi entropy much like Kullback-Leibler divergence is related to Shannon's entropy, and comes up in many settings. It was introduced by Rényi as a measure of information that satisfies almost the same axioms as Kullback-Leibler divergence, and depends on a parameter that is called its order. In particular, the Rényi divergence of order 1 equals the Kullback-Leibler…

    Submitted 24 April, 2014; v1 submitted 12 June, 2012; originally announced June 2012.

    Comments: To appear in IEEE Transactions on Information Theory

  21. arXiv:1110.6416  [pdf, other]

    stat.ML

    Adaptive Hedge

    Authors: Tim van Erven, Peter Grünwald, Wouter M. Koolen, Steven de Rooij

    Abstract: Most methods for decision-theoretic online learning are based on the Hedge algorithm, which takes a parameter called the learning rate. In most previous analyses the learning rate was carefully tuned to obtain optimal worst-case performance, leading to suboptimal performance on easy instances, for example when there exists an action that is significantly better than all others. We propose a new wa…

    Submitted 28 October, 2011; originally announced October 2011.

    Comments: This is the full version of the paper with the same name that will appear in Advances in Neural Information Processing Systems 24 (NIPS 2011), 2012. The two papers are identical, except that this version contains an extra section of Additional Material

    Journal ref: Advances in Neural Information Processing Systems 24, pages 1656-1664, December 2011

  22. arXiv:0807.1005  [pdf, ps, other]

    math.ST cs.IT cs.LG stat.ME stat.ML

    Catching Up Faster by Switching Sooner: A Prequential Solution to the AIC-BIC Dilemma

    Authors: Tim van Erven, Peter Grunwald, Steven de Rooij

    Abstract: Bayesian model averaging, model selection and its approximations such as BIC are generally statistically consistent, but sometimes achieve slower rates of convergence than other methods such as AIC and leave-one-out cross-validation. On the other hand, these other methods can be inconsistent. We identify the "catch-up phenomenon" as a novel explanation for the slow convergence of Bayesian method…

    Submitted 7 July, 2008; originally announced July 2008.

    Comments: A preliminary version of a part of this paper appeared at the NIPS 2007 conference

    MSC Class: 62G99; 94A99