Showing 1–18 of 18 results for author: Schmidt, D F

Searching in archive stat.
  1. arXiv:2407.00492

    cs.LG stat.CO stat.ML

    Fast Gibbs sampling for the local and global trend Bayesian exponential smoothing model

    Authors: Xueying Long, Daniel F. Schmidt, Christoph Bergmeir, Slawek Smyl

    Abstract: In Smyl et al. [Local and global trend Bayesian exponential smoothing models. International Journal of Forecasting, 2024.], a generalised exponential smoothing model was proposed that is able to capture strong trends and volatility in time series. This method achieved state-of-the-art performance in many forecasting tasks, but its fitting procedure, which is based on the NUTS sampler, is very comp…

    Submitted 29 June, 2024; originally announced July 2024.

  2. arXiv:2401.15610

    cs.LG stat.ML

    Prevalidated ridge regression is a highly-efficient drop-in replacement for logistic regression for high-dimensional data

    Authors: Angus Dempster, Geoffrey I. Webb, Daniel F. Schmidt

    Abstract: Logistic regression is a ubiquitous method for probabilistic classification. However, the effectiveness of logistic regression depends upon careful and relatively computationally expensive tuning, especially for the regularisation hyperparameter, and especially in the context of high-dimensional data. We present a prevalidated ridge regression model that closely matches logistic regression in term…

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: 13 pages, 11 figures

  3. arXiv:2310.18860

    stat.ML cs.LG

    Bayes beats Cross Validation: Efficient and Accurate Ridge Regression via Expectation Maximization

    Authors: Shu Yu Tew, Mario Boley, Daniel F. Schmidt

    Abstract: We present a novel method for tuning the regularization hyper-parameter, $λ$, of a ridge regression that is faster to compute than leave-one-out cross-validation (LOOCV) while yielding estimates of the regression parameters of equal or, particularly in the setting of sparse covariates, superior quality to those obtained by minimising the LOOCV risk. The LOOCV risk can suffer from multiple and bad…

    Submitted 2 November, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

  4. arXiv:2310.09129

    cs.LG stat.ML

    Computing Marginal and Conditional Divergences between Decomposable Models with Applications

    Authors: Loong Kuan Lee, Geoffrey I. Webb, Daniel F. Schmidt, Nico Piatkowski

    Abstract: The ability to compute the exact divergence between two high-dimensional distributions is useful in many applications, but doing so naively is intractable. Computing the alpha-beta divergence -- a family of divergences that includes the Kullback-Leibler divergence and Hellinger distance -- between the joint distribution of two decomposable models, i.e., chordal Markov networks, can be done in time ex…

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 10 pages, 8 figures, Accepted at the IEEE International Conference on Data Mining (ICDM) 2023

  5. arXiv:2305.11921

    stat.ME cs.AI cs.LG cs.PF

    An Approach to Multiple Comparison Benchmark Evaluations that is Stable Under Manipulation of the Comparate Set

    Authors: Ali Ismail-Fawaz, Angus Dempster, Chang Wei Tan, Matthieu Herrmann, Lynn Miller, Daniel F. Schmidt, Stefano Berretti, Jonathan Weber, Maxime Devanne, Germain Forestier, Geoffrey I. Webb

    Abstract: The measurement of progress using benchmark evaluations is ubiquitous in computer science and machine learning. However, common approaches to analyzing and presenting the results of benchmark comparisons of multiple algorithms over multiple datasets, such as the critical difference diagram introduced by Demšar (2006), have important shortcomings and, we show, are open to both inadvertent and inte…

    Submitted 19 May, 2023; originally announced May 2023.

  6. arXiv:2211.03248

    stat.ML cs.LG

    Sparse Horseshoe Estimation via Expectation-Maximisation

    Authors: Shu Yu Tew, Daniel F. Schmidt, Enes Makalic

    Abstract: The horseshoe prior is known to possess many desirable properties for Bayesian estimation of sparse parameter vectors, yet its density function lacks an analytic form. As such, it is challenging to find a closed-form solution for the posterior mode. Conventional horseshoe estimators use the posterior mean to estimate the parameters, but these estimates are not sparse. We propose a novel expectatio…

    Submitted 6 November, 2022; originally announced November 2022.

  7. arXiv:2209.14587

    stat.ME

    Minimum message length inference of the Weibull distribution with complete and censored data

    Authors: Enes Makalic, Daniel F. Schmidt

    Abstract: The Weibull distribution, with shape parameter $k>0$ and scale parameter $λ>0$, is one of the most popular parametric distributions in survival analysis with complete or censored data. Although inference of the parameters of the Weibull distribution is commonly done through maximum likelihood, it is well established that the maximum likelihood estimate of the shape parameter is inadequate due to t…

    Submitted 16 March, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

  8. arXiv:2209.14571

    stat.ME

    Introduction to minimum message length inference

    Authors: Enes Makalic, Daniel F. Schmidt

    Abstract: The aim of this manuscript is to introduce the Bayesian minimum message length principle of inductive inference to a general statistical audience that may not be familiar with information theoretic statistics. We describe two key minimum message length inference approaches and demonstrate how the principle can be used to develop a new Bayesian alternative to the frequentist $t$-test as well as new…

    Submitted 29 September, 2022; originally announced September 2022.

  9. arXiv:2209.14567

    stat.ME

    Maximum likelihood estimation of the Weibull distribution with reduced bias

    Authors: Enes Makalic, Daniel F. Schmidt

    Abstract: In this short note, we derive a new bias-adjusted maximum likelihood estimate for the shape parameter of the Weibull distribution with complete data and type I censored data. The proposed estimate of the shape parameter is significantly less biased and more efficient than the corresponding maximum likelihood estimate, while being simple to compute using existing maximum likelihood software procedu…

    Submitted 9 February, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

  10. arXiv:2209.14559

    stat.ME

    MML Probabilistic Principal Component Analysis

    Authors: Enes Makalic, Daniel F. Schmidt

    Abstract: Principal component analysis (PCA) is perhaps the most widely used method for data dimensionality reduction. A key question in PCA decomposition of data is deciding how many factors to retain. This manuscript describes a new approach to automatically selecting the number of principal components based on the Bayesian minimum message length method of inductive inference. We also derive a new estimate of…

    Submitted 16 February, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

  11. MINIROCKET: A Very Fast (Almost) Deterministic Transform for Time Series Classification

    Authors: Angus Dempster, Daniel F. Schmidt, Geoffrey I. Webb

    Abstract: Until recently, the most accurate methods for time series classification were limited by high computational complexity. ROCKET achieves state-of-the-art accuracy with a fraction of the computational expense of most existing methods by transforming input time series using random convolutional kernels, and using the transformed features to train a linear classifier. We reformulate ROCKET into a new…

    Submitted 14 July, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

    Comments: 10 pages, 11 figures; Updated to accepted version

  12. InceptionTime: Finding AlexNet for Time Series Classification

    Authors: Hassan Ismail Fawaz, Benjamin Lucas, Germain Forestier, Charlotte Pelletier, Daniel F. Schmidt, Jonathan Weber, Geoffrey I. Webb, Lhassane Idoumghar, Pierre-Alain Muller, François Petitjean

    Abstract: This paper brings deep learning to the forefront of research into Time Series Classification (TSC). TSC is the area of machine learning tasked with the categorization (or labelling) of time series. The last few decades of work in this area have led to significant progress in the accuracy of classifiers, with the state of the art now represented by the HIVE-COTE algorithm. While extremely accurate,…

    Submitted 5 December, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

  13. arXiv:1809.05212

    stat.CO

    An efficient algorithm for sampling from $\sin^k(x)$ for generating random correlation matrices

    Authors: Enes Makalic, Daniel F. Schmidt

    Abstract: In this note, we develop a novel algorithm for generating random numbers from a distribution with a probability density function proportional to $\sin^k(x)$, $x \in (0,π)$ and $k \geq 1$. Our algorithm is highly efficient and is based on rejection sampling where the envelope distribution is an appropriately chosen beta distribution. An example application illustrating how the new algorithm can be…

    Submitted 20 November, 2018; v1 submitted 13 September, 2018; originally announced September 2018.

  14. arXiv:1802.03141

    stat.ME

    A Minimum Message Length Criterion for Robust Linear Regression

    Authors: Chi Kuen Wong, Enes Makalic, Daniel F. Schmidt

    Abstract: This paper applies the minimum message length principle to inference of linear regression models with Student-t errors. A new criterion for variable selection and parameter estimation in Student-t regression is proposed. By exploiting properties of the regression model, we derive a suitable non-informative proper uniform prior distribution for the regression coefficients that leads to a simple and…

    Submitted 19 February, 2018; v1 submitted 9 February, 2018; originally announced February 2018.

  15. arXiv:1709.04333

    stat.ME

    Bayesian Sparse Global-Local Shrinkage Regression for Selection of Grouped Variables

    Authors: Zemei Xu, Daniel F. Schmidt, Enes Makalic, Guoqi Qian, John L. Hopper

    Abstract: Most estimates for penalised linear regression can be viewed as posterior modes for an appropriate choice of prior distribution. Bayesian shrinkage methods, particularly the horseshoe estimator, have recently attracted a great deal of attention in the problem of estimating sparse, high-dimensional linear models. This paper extends these ideas, and presents a Bayesian grouped model with continuous…

    Submitted 3 November, 2017; v1 submitted 13 September, 2017; originally announced September 2017.

  16. arXiv:1708.02742

    stat.ME

    Minimum message length inference of the Poisson and geometric models using heavy-tailed prior distributions

    Authors: Chi Kuen Wong, Enes Makalic, Daniel F. Schmidt

    Abstract: Minimum message length is a general Bayesian principle for model selection and parameter estimation that is based on information theory. This paper applies the minimum message length principle to a small-sample model selection problem involving Poisson and geometric data models. Since MML is a Bayesian principle, it requires prior distributions for all model parameters. We introduce three candidat…

    Submitted 11 February, 2018; v1 submitted 9 August, 2017; originally announced August 2017.

  17. arXiv:1611.06649

    stat.CO

    High-Dimensional Bayesian Regularised Regression with the BayesReg Package

    Authors: Enes Makalic, Daniel F. Schmidt

    Abstract: Bayesian penalized regression techniques, such as the Bayesian lasso and the Bayesian horseshoe estimator, have recently received a significant amount of attention in the statistics literature. However, software implementing state-of-the-art Bayesian penalized regression, outside of general purpose Markov chain Monte Carlo platforms such as STAN, is relatively rare. This paper introduces bayesreg,…

    Submitted 19 December, 2016; v1 submitted 21 November, 2016; originally announced November 2016.

    Comments: 17 pages, 1 figure

  18. A simple sampler for the horseshoe estimator

    Authors: Enes Makalic, Daniel F. Schmidt

    Abstract: In this note we derive a simple Bayesian sampler for linear regression with the horseshoe hierarchy. A new interpretation of the horseshoe model is presented, and extensions to logistic regression and alternative hierarchies, such as horseshoe$+$, are discussed. Due to the conjugacy of the proposed hierarchy, Chib's algorithm may be used to easily compute the marginal likelihood of the model.

    Submitted 28 September, 2015; v1 submitted 16 August, 2015; originally announced August 2015.

    Journal ref: IEEE Signal Processing Letters, Vol. 23(1), pp. 179-182, 2016