Showing 1–16 of 16 results for author: Reeve, H

  1. arXiv:2405.18091  [pdf, ps, other]

    math.ST cs.LG

    An adaptive transfer learning perspective on classification in non-stationary environments

    Authors: Henry W J Reeve

    Abstract: We consider a semi-supervised classification problem with non-stationary label-shift in which we observe a labelled data set followed by a sequence of unlabelled covariate vectors in which the marginal probabilities of the class labels may change over time. Our objective is to predict the corresponding class-label for each covariate vector, without ever observing the ground-truth labels, beyond th…

    Submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2302.10655  [pdf, other]

    stat.ML cs.LG stat.ME

    Density Ratio Estimation and Neyman Pearson Classification with Missing Data

    Authors: Josh Givens, Song Liu, Henry W J Reeve

    Abstract: Density Ratio Estimation (DRE) is an important machine learning technique with many downstream applications. We consider the challenge of DRE with missing not at random (MNAR) data. In this setting, we show that using standard DRE methods leads to biased results while our proposal (M-KLIEP), an adaptation of the popular DRE procedure KLIEP, restores consistency. Moreover, we provide finite sample…

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: 40 pages, 11 figures. To be published in the proceedings of AISTATS 2023

  3. arXiv:2301.03962  [pdf, other]

    cs.LG cs.AI stat.ML

    A Unified Theory of Diversity in Ensemble Learning

    Authors: Danny Wood, Tingting Mu, Andrew Webb, Henry Reeve, Mikel Luján, Gavin Brown

    Abstract: We present a theory of ensemble diversity, explaining the nature of diversity for a wide range of supervised learning scenarios. This challenge has been referred to as the holy grail of ensemble learning, an open research issue for over 30 years. Our framework reveals that diversity is in fact a hidden dimension in the bias-variance decomposition of the ensemble loss. We prove a family of exact bi…

    Submitted 7 February, 2024; v1 submitted 10 January, 2023; originally announced January 2023.

    Journal ref: Journal of Machine Learning Research, 24(359), 2023

  4. arXiv:2109.09427  [pdf, other]

    cs.LG

    Asymptotic Optimality for Decentralised Bandits

    Authors: Conor Newton, Ayalvadi Ganesh, Henry W. J. Reeve

    Abstract: We consider a large number of agents collaborating on a multi-armed bandit problem with a large number of arms. The goal is to minimise the regret of each agent in a communication-constrained setting. We present a decentralised algorithm which builds upon and improves the Gossip-Insert-Eliminate method of Chawla et al. arxiv:2001.05452. We provide a theoretical analysis of the regret incurred whic…

    Submitted 20 September, 2021; originally announced September 2021.

  5. arXiv:2109.01077  [pdf, ps, other]

    math.ST cs.LG stat.ME stat.ML

    Optimal subgroup selection

    Authors: Henry W. J. Reeve, Timothy I. Cannings, Richard J. Samworth

    Abstract: In clinical trials and other applications, we often see regions of the feature space that appear to exhibit interesting behaviour, but it is unclear whether these observed phenomena are reflected at the population level. Focusing on a regression setting, we consider the subgroup selection challenge of identifying a region of the feature space on which the regression function exceeds a pre-determin…

    Submitted 20 September, 2023; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: 65 pages, 2 figures, to appear in the Annals of Statistics

    MSC Class: 62-XX; 62G08; 62Gxx; 62C20

  6. arXiv:2106.04455  [pdf, other]

    stat.ML cs.LG math.ST stat.ME

    Adaptive transfer learning

    Authors: Henry W. J. Reeve, Timothy I. Cannings, Richard J. Samworth

    Abstract: In transfer learning, we wish to make inference about a target population when we have access to data both from the distribution itself, and from a different but related source distribution. We introduce a flexible framework for transfer learning in the context of binary classification, allowing for covariate-dependent relationships between the source and target distributions that are not required…

    Submitted 8 June, 2021; originally announced June 2021.

    MSC Class: 62G05

  7. arXiv:2106.01092  [pdf, ps, other]

    cs.LG math.ST

    Statistical optimality conditions for compressive ensembles

    Authors: Henry W. J. Reeve, Ata Kaban

    Abstract: We present a framework for the theoretical analysis of ensembles of low-complexity empirical risk minimisers trained on independent random compressions of high-dimensional data. First we introduce a general distribution-dependent upper-bound on the excess risk, framed in terms of a natural notion of compressibility. This bound is independent of the dimension of the original data representation, an…

    Submitted 2 June, 2021; originally announced June 2021.

    MSC Class: 62-08

  8. arXiv:2002.09769  [pdf, ps, other]

    stat.ML cs.LG

    Optimistic bounds for multi-output prediction

    Authors: Henry WJ Reeve, Ata Kaban

    Abstract: We investigate the challenge of multi-output learning, where the goal is to learn a vector-valued function based on a supervised data set. This includes a range of important problems in Machine Learning including multi-target regression, multi-class classification and multi-label classification. We begin our analysis by introducing the self-bounding Lipschitz condition for multi-output loss functi…

    Submitted 22 February, 2020; originally announced February 2020.

  9. arXiv:2001.10318  [pdf, other]

    cs.LG cs.IT stat.ML

    Margin Maximization as Lossless Maximal Compression

    Authors: Nikolaos Nikolaou, Henry Reeve, Gavin Brown

    Abstract: The ultimate goal of a supervised learning algorithm is to produce models constructed on the training data that can generalize well to new examples. In classification, functional margin maximization -- correctly classifying as many training examples as possible with maximal confidence -- has been known to construct models with good generalization guarantees. This work gives an information-theoretic…

    Submitted 28 January, 2020; originally announced January 2020.

    Comments: 19 pages Main Paper + 7 pages Supplementary Material, 7 Figures, Submitted to the Machine Learning journal (11/11/19)

  10. arXiv:1906.04542  [pdf, other]

    cs.LG stat.ML

    Fast Rates for a kNN Classifier Robust to Unknown Asymmetric Label Noise

    Authors: Henry W. J. Reeve, Ata Kaban

    Abstract: We consider classification in the presence of class-dependent asymmetric label noise with unknown noise probabilities. In this setting, identifiability conditions are known, but additional assumptions were shown to be required for finite sample rates, and so far only the parametric rate has been obtained. Assuming these identifiability conditions, together with a measure-smoothness condition on th…

    Submitted 11 June, 2019; originally announced June 2019.

    Comments: ICML 2019

  11. arXiv:1902.05627  [pdf, other]

    stat.ML cs.LG

    Classification with unknown class-conditional label noise on non-compact feature spaces

    Authors: Henry W J Reeve, Ata Kaban

    Abstract: We investigate the problem of classification in the presence of unknown class-conditional label noise in which the labels observed by the learner have been corrupted with some unknown class dependent probability. In order to obtain finite sample rates, previous approaches to classification with unknown class-conditional label noise have required that the regression function is close to its extrema…

    Submitted 9 June, 2019; v1 submitted 14 February, 2019; originally announced February 2019.

  12. arXiv:1902.04422  [pdf, other]

    stat.ML cs.CV cs.LG

    To Ensemble or Not Ensemble: When does End-To-End Training Fail?

    Authors: Andrew M. Webb, Charles Reynolds, Wenlin Chen, Henry Reeve, Dan-Andrei Iliescu, Mikel Lujan, Gavin Brown

    Abstract: End-to-End training (E2E) is becoming more and more popular to train complex Deep Network architectures. An interesting question is whether this trend will continue: are there any clear failure cases for E2E training? We study this question in depth, for the specific case of E2E training an ensemble of networks. Our strategy is to blend the gradient smoothly in between two extremes: from independen…

    Submitted 6 August, 2020; v1 submitted 12 February, 2019; originally announced February 2019.

    Comments: Code: https://github.com/grey-area/modular-loss-experiments. Preprint updated to reflect version accepted for publication at ECML

  13. arXiv:1803.00316  [pdf, other]

    cs.LG stat.ML

    The K-Nearest Neighbour UCB algorithm for multi-armed bandits with covariates

    Authors: Henry WJ Reeve, Joe Mellor, Gavin Brown

    Abstract: In this paper we propose and explore the k-Nearest Neighbour UCB algorithm for multi-armed bandits with covariates. We focus on a setting where the covariates are supported on a metric space of low intrinsic dimension, such as a manifold embedded within a high dimensional ambient feature space. The algorithm is conceptually simple and straightforward to implement. The k-Nearest Neighbour UCB algor…

    Submitted 1 March, 2018; originally announced March 2018.

    Comments: To be presented at ALT 2018

    Journal ref: Algorithmic Learning Theory 2018

  14. arXiv:1803.00314  [pdf, other]

    cs.LG

    Diversity and degrees of freedom in regression ensembles

    Authors: Henry WJ Reeve, Gavin Brown

    Abstract: Ensemble methods are a cornerstone of modern machine learning. The performance of an ensemble depends crucially upon the level of diversity between its constituent learners. This paper establishes a connection between diversity and degrees of freedom (i.e. the capacity of the model), showing that diversity may be viewed as a form of inverse regularisation. This is achieved by focusing on a previou…

    Submitted 1 March, 2018; originally announced March 2018.

    Comments: Neurocomputing 2018

    Journal ref: Neurocomputing 2018

  15. arXiv:1803.00310  [pdf, other]

    cs.LG stat.ML

    Minimax rates for cost-sensitive learning on manifolds with approximate nearest neighbours

    Authors: Henry WJ Reeve, Gavin Brown

    Abstract: We study the approximate nearest neighbour method for cost-sensitive classification on low-dimensional manifolds embedded within a high-dimensional feature space. We determine the minimax learning rates for distributions on a smooth manifold, in a cost-sensitive setting. This generalises a classic result of Audibert and Tsybakov. Building upon recent work of Chaudhuri and Dasgupta we prove that th…

    Submitted 1 March, 2018; originally announced March 2018.

    Comments: Published in ALT 2017

    Journal ref: Algorithmic Learning Theory 2017

  16. arXiv:1511.07340  [pdf, other]

    cs.LG

    Modular Autoencoders for Ensemble Feature Extraction

    Authors: Henry W J Reeve, Gavin Brown

    Abstract: We introduce the concept of a Modular Autoencoder (MAE), capable of learning a set of diverse but complementary representations from unlabelled data, that can later be used for supervised tasks. The learning of the representations is controlled by a trade off parameter, and we show on six benchmark datasets the optimum lies between two extremes: a set of smaller, independent autoencoders each with…

    Submitted 23 November, 2015; originally announced November 2015.

    Comments: 18 pages, 8 figures, to appear in a special issue of The Journal Of Machine Learning Research (vol.44, Dec 2015)