Showing 1–13 of 13 results for author: Schmidt, D F

  1. arXiv:2407.00492

    cs.LG stat.CO stat.ML

    Fast Gibbs sampling for the local and global trend Bayesian exponential smoothing model

    Authors: Xueying Long, Daniel F. Schmidt, Christoph Bergmeir, Slawek Smyl

    Abstract: In Smyl et al. [Local and global trend Bayesian exponential smoothing models. International Journal of Forecasting, 2024.], a generalised exponential smoothing model was proposed that is able to capture strong trends and volatility in time series. This method achieved state-of-the-art performance in many forecasting tasks, but its fitting procedure, which is based on the NUTS sampler, is very comp…

    Submitted 29 June, 2024; originally announced July 2024.

  2. arXiv:2401.15610

    cs.LG stat.ML

    Prevalidated ridge regression is a highly-efficient drop-in replacement for logistic regression for high-dimensional data

    Authors: Angus Dempster, Geoffrey I. Webb, Daniel F. Schmidt

    Abstract: Logistic regression is a ubiquitous method for probabilistic classification. However, the effectiveness of logistic regression depends upon careful and relatively computationally expensive tuning, especially for the regularisation hyperparameter, and especially in the context of high-dimensional data. We present a prevalidated ridge regression model that closely matches logistic regression in term…

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: 13 pages, 11 figures

  3. arXiv:2311.15549

    cond-mat.mtrl-sci cs.AI cs.LG

    From Prediction to Action: Critical Role of Performance Estimation for Machine-Learning-Driven Materials Discovery

    Authors: Mario Boley, Felix Luong, Simon Teshuva, Daniel F. Schmidt, Lucas Foppa, Matthias Scheffler

    Abstract: Materials discovery driven by statistical property models is an iterative decision process, during which an initial data collection is extended with new data proposed by a model-informed acquisition function--with the goal to maximize a certain "reward" over time, such as the maximum property value discovered so far. While the materials science community achieved much progress in developing proper…

    Submitted 6 December, 2023; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Simplified notation

  4. arXiv:2311.00993

    cs.LG

    Scalable Probabilistic Forecasting in Retail with Gradient Boosted Trees: A Practitioner's Approach

    Authors: Xueying Long, Quang Bui, Grady Oktavian, Daniel F. Schmidt, Christoph Bergmeir, Rakshitha Godahewa, Seong Per Lee, Kaifeng Zhao, Paul Condylis

    Abstract: The recent M5 competition has advanced the state-of-the-art in retail forecasting. However, we notice important differences between the competition challenge and the challenges we face in a large e-commerce company. The datasets in our scenario are larger (hundreds of thousands of time series), and e-commerce can afford to have a larger assortment than brick-and-mortar retailers, leading to more i…

    Submitted 2 November, 2023; originally announced November 2023.

  5. arXiv:2310.18860

    stat.ML cs.LG

    Bayes beats Cross Validation: Efficient and Accurate Ridge Regression via Expectation Maximization

    Authors: Shu Yu Tew, Mario Boley, Daniel F. Schmidt

    Abstract: We present a novel method for tuning the regularization hyper-parameter, $λ$, of a ridge regression that is faster to compute than leave-one-out cross-validation (LOOCV) while yielding estimates of the regression parameters of equal, or particularly in the setting of sparse covariates, superior quality to those obtained by minimising the LOOCV risk. The LOOCV risk can suffer from multiple and bad…

    Submitted 2 November, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

  6. arXiv:2310.09129

    cs.LG stat.ML

    Computing Marginal and Conditional Divergences between Decomposable Models with Applications

    Authors: Loong Kuan Lee, Geoffrey I. Webb, Daniel F. Schmidt, Nico Piatkowski

    Abstract: The ability to compute the exact divergence between two high-dimensional distributions is useful in many applications but doing so naively is intractable. Computing the alpha-beta divergence -- a family of divergences that includes the Kullback-Leibler divergence and Hellinger distance -- between the joint distribution of two decomposable models, i.e., chordal Markov networks, can be done in time ex…

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 10 pages, 8 figures, Accepted at the IEEE International Conference on Data Mining (ICDM) 2023

  7. arXiv:2308.00928

    cs.LG

    QUANT: A Minimalist Interval Method for Time Series Classification

    Authors: Angus Dempster, Daniel F. Schmidt, Geoffrey I. Webb

    Abstract: We show that it is possible to achieve the same accuracy, on average, as the most accurate existing interval methods for time series classification on a standard set of benchmark datasets using a single type of feature (quantiles), fixed intervals, and an 'off the shelf' classifier. This distillation of interval-based approaches represents a fast and accurate method for time series classification,…

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: 26 pages, 20 figures

  8. arXiv:2305.11921

    stat.ME cs.AI cs.LG cs.PF

    An Approach to Multiple Comparison Benchmark Evaluations that is Stable Under Manipulation of the Comparate Set

    Authors: Ali Ismail-Fawaz, Angus Dempster, Chang Wei Tan, Matthieu Herrmann, Lynn Miller, Daniel F. Schmidt, Stefano Berretti, Jonathan Weber, Maxime Devanne, Germain Forestier, Geoffrey I. Webb

    Abstract: The measurement of progress using benchmark evaluations is ubiquitous in computer science and machine learning. However, common approaches to analyzing and presenting the results of benchmark comparisons of multiple algorithms over multiple datasets, such as the critical difference diagram introduced by Demšar (2006), have important shortcomings and, we show, are open to both inadvertent and inte…

    Submitted 19 May, 2023; originally announced May 2023.

  9. arXiv:2211.03248

    stat.ML cs.LG

    Sparse Horseshoe Estimation via Expectation-Maximisation

    Authors: Shu Yu Tew, Daniel F. Schmidt, Enes Makalic

    Abstract: The horseshoe prior is known to possess many desirable properties for Bayesian estimation of sparse parameter vectors, yet its density function lacks an analytic form. As such, it is challenging to find a closed-form solution for the posterior mode. Conventional horseshoe estimators use the posterior mean to estimate the parameters, but these estimates are not sparse. We propose a novel expectatio…

    Submitted 6 November, 2022; originally announced November 2022.

  10. arXiv:2203.13652

    cs.LG

    HYDRA: Competing convolutional kernels for fast and accurate time series classification

    Authors: Angus Dempster, Daniel F. Schmidt, Geoffrey I. Webb

    Abstract: We demonstrate a simple connection between dictionary methods for time series classification, which involve extracting and counting symbolic patterns in time series, and methods based on transforming input time series using convolutional kernels, namely ROCKET and its variants. We show that by adjusting a single hyperparameter it is possible to move by degrees between models resembling dictionary…

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: 27 pages, 18 figures

  11. MINIROCKET: A Very Fast (Almost) Deterministic Transform for Time Series Classification

    Authors: Angus Dempster, Daniel F. Schmidt, Geoffrey I. Webb

    Abstract: Until recently, the most accurate methods for time series classification were limited by high computational complexity. ROCKET achieves state-of-the-art accuracy with a fraction of the computational expense of most existing methods by transforming input time series using random convolutional kernels, and using the transformed features to train a linear classifier. We reformulate ROCKET into a new…

    Submitted 14 July, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

    Comments: 10 pages, 11 figures; Updated to accepted version

  12. InceptionTime: Finding AlexNet for Time Series Classification

    Authors: Hassan Ismail Fawaz, Benjamin Lucas, Germain Forestier, Charlotte Pelletier, Daniel F. Schmidt, Jonathan Weber, Geoffrey I. Webb, Lhassane Idoumghar, Pierre-Alain Muller, François Petitjean

    Abstract: This paper brings deep learning to the forefront of research into Time Series Classification (TSC). TSC is the area of machine learning tasked with the categorization (or labelling) of time series. The last few decades of work in this area have led to significant progress in the accuracy of classifiers, with the state of the art now represented by the HIVE-COTE algorithm. While extremely accurate,…

    Submitted 5 December, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

  13. arXiv:1801.02321

    math.ST cs.LG

    Log-Scale Shrinkage Priors and Adaptive Bayesian Global-Local Shrinkage Estimation

    Authors: Daniel F. Schmidt, Enes Makalic

    Abstract: Global-local shrinkage hierarchies are an important innovation in Bayesian estimation. We propose the use of log-scale distributions as a novel basis for generating families of prior distributions for local shrinkage hyperparameters. By varying the scale parameter one may vary the degree to which the prior distribution promotes sparsity in the coefficient estimates. By examining the class of distri…

    Submitted 30 January, 2020; v1 submitted 8 January, 2018; originally announced January 2018.

    Comments: 34 pages