Skip to main content

Showing 1–20 of 20 results for author: Williams, C K I

Searching in archive stat. Search in all archives.
.
  1. arXiv:2404.18190  [pdf, other

    cs.LG stat.ML

    Naive Bayes Classifiers and One-hot Encoding of Categorical Variables

    Authors: Christopher K. I. Williams

    Abstract: This paper investigates the consequences of encoding a $K$-valued categorical variable incorrectly as $K$ bits via one-hot encoding, when using a Naïve Bayes classifier. This gives rise to a product-of-Bernoullis (PoB) assumption, rather than the correct categorical Naïve Bayes classifier. The differences between the two classifiers are analysed mathematically and experimentally. In our experiment… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 7 pages, 3 figures

  2. arXiv:2306.03066  [pdf, other

    cs.CV cs.LG stat.ML

    Of Mice and Mates: Automated Classification and Modelling of Mouse Behaviour in Groups using a Single Model across Cages

    Authors: Michael P. J. Camilleri, Rasneer S. Bains, Christopher K. I. Williams

    Abstract: Behavioural experiments often happen in specialised arenas, but this may confound the analysis. To address this issue, we provide tools to study mice in the home-cage environment, equip** biologists with the possibility to capture the temporal aspect of the individual's behaviour and model the interaction and interdependence between cage-mates with minimal human intervention. Our main contributi… ▽ More

    Submitted 24 June, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: International Journal of Computer Vision (2024)

  3. arXiv:2210.04221  [pdf, other

    stat.ME cs.IT math.ST

    The Elliptical Quartic Exponential Distribution: An Annular Distribution Obtained via Maximum Entropy

    Authors: Christopher K I Williams

    Abstract: This paper describes the Elliptical Quartic Exponential distribution in $\mathbb{R}^D$, obtained via a maximum entropy construction by imposing second and fourth moment constraints. I discuss relationships to related work, analytical expressions for the normalization constant and the entropy, and the conditional and marginal distributions.

    Submitted 9 October, 2022; originally announced October 2022.

    Comments: 6 pages, 1 figure

  4. arXiv:2203.04694  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Align-Deform-Subtract: An Interventional Framework for Explaining Object Differences

    Authors: Cian Eastwood, Li Nanbo, Christopher K. I. Williams

    Abstract: Given two object images, how can we explain their differences in terms of the underlying object properties? To address this question, we propose Align-Deform-Subtract (ADS) -- an interventional framework for explaining object differences. By leveraging semantic alignments in image-space as counterfactual interventions on the underlying object properties, ADS iteratively quantifies and removes diff… ▽ More

    Submitted 20 July, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: ICLR 2022 Workshop on Objects, Structure and Causality

  5. arXiv:2107.05446  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Source-Free Adaptation to Measurement Shift via Bottom-Up Feature Restoration

    Authors: Cian Eastwood, Ian Mason, Christopher K. I. Williams, Bernhard Schölkopf

    Abstract: Source-free domain adaptation (SFDA) aims to adapt a model trained on labelled data in a source domain to unlabelled data in a target domain without access to the source-domain data during adaptation. Existing methods for SFDA leverage entropy-minimization techniques which: (i) apply only to classification; (ii) destroy model calibration; and (iii) rely on the source model achieving a good level o… ▽ More

    Submitted 17 March, 2022; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: ICLR 2022 (Spotlight)

  6. arXiv:2106.03216  [pdf, other

    cs.LG stat.ML

    On Memorization in Probabilistic Deep Generative Models

    Authors: Gerrit J. J. van den Burg, Christopher K. I. Williams

    Abstract: Recent advances in deep generative models have led to impressive results in a variety of application domains. Motivated by the possibility that deep learning models might memorize part of the input data, there have been increased efforts to understand how memorization arises. In this work, we extend a recently proposed measure of memorization for supervised learning (Feldman, 2019) to the unsuperv… ▽ More

    Submitted 29 December, 2021; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: Accepted for publication at NeurIPS 2021

    MSC Class: 68T07

  7. The Effect of Class Imbalance on Precision-Recall Curves

    Authors: Christopher K I Williams

    Abstract: In this note I study how the precision of a classifier depends on the ratio $r$ of positive to negative cases in the test set, as well as the classifier's true and false positive rates. This relationship allows prediction of how the precision-recall curve will change with $r$, which seems not to be well known. It also allows prediction of how $F_β$ and the Precision Gain and Recall Gain measures o… ▽ More

    Submitted 27 April, 2021; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: 4 pages, 1 figure. Added ref to Siblini et al (2020) and last sentence. Final m/s version of paper published in Neural Computation

    Journal ref: Neural Computation 33(4) 853-857 (2021)

  8. arXiv:2006.05301  [pdf, other

    cs.LG stat.ML

    VAEs in the Presence of Missing Data

    Authors: Mark Collier, Alfredo Nazabal, Christopher K. I. Williams

    Abstract: Real world datasets often contain entries with missing elements e.g. in a medical dataset, a patient is unlikely to have taken all possible diagnostic tests. Variational Autoencoders (VAEs) are popular generative models often used for unsupervised learning. Despite their widespread use it is unclear how best to apply VAEs to datasets with missing data. We develop a novel latent variable model of a… ▽ More

    Submitted 21 March, 2021; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: Accepted to ICML Workshop on the Art of Learning with Missing Values (Artemiss), 17 July 2020

  9. arXiv:2003.06222  [pdf, other

    stat.ML cs.LG stat.ME

    An Evaluation of Change Point Detection Algorithms

    Authors: Gerrit J. J. van den Burg, Christopher K. I. Williams

    Abstract: Change point detection is an important part of time series analysis, as the presence of a change point indicates an abrupt and significant change in the data generating process. While many algorithms for change point detection have been proposed, comparatively little attention has been paid to evaluating their performance on real-world time series. Algorithms are typically evaluated on simulated d… ▽ More

    Submitted 12 February, 2022; v1 submitted 13 March, 2020; originally announced March 2020.

    Comments: For code and data, see https://github.com/alan-turing-institute/TCPDBench ; Changelog in pdf

    MSC Class: 62M10 ACM Class: G.3

  10. arXiv:1911.10081  [pdf, other

    cs.LG cs.PL stat.ML

    ptype: Probabilistic Type Inference

    Authors: Taha Ceritli, Christopher K. I. Williams, James Geddes

    Abstract: Type inference refers to the task of inferring the data type of a given column of data. Current approaches often fail when data contains missing data and anomalies, which are found commonly in real-world data sets. In this paper, we propose ptype, a probabilistic robust type inference method that allows us to detect such entries, and infer data types. We further show that the proposed method outpe… ▽ More

    Submitted 23 March, 2020; v1 submitted 22 November, 2019; originally announced November 2019.

    Journal ref: Data Mining and Knowledge Discovery (2020)

  11. arXiv:1910.05026  [pdf, other

    cs.LG stat.ML

    Customizing Sequence Generation with Multi-Task Dynamical Systems

    Authors: Alex Bird, Christopher K. I. Williams

    Abstract: Dynamical system models (including RNNs) often lack the ability to adapt the sequence generation or prediction to a given context, limiting their real-world application. In this paper we show that hierarchical multi-task dynamical systems (MTDSs) provide direct user control over sequence generation, via use of a latent code $\mathbf{z}$ that specifies the customization to the individual data seque… ▽ More

    Submitted 11 October, 2019; originally announced October 2019.

  12. arXiv:1907.06671  [pdf, other

    cs.LG stat.ML

    Robust Variational Autoencoders for Outlier Detection and Repair of Mixed-Type Data

    Authors: Simão Eduardo, Alfredo Nazábal, Christopher K. I. Williams, Charles Sutton

    Abstract: We focus on the problem of unsupervised cell outlier detection and repair in mixed-type tabular data. Traditional methods are concerned only with detecting which rows in the dataset are outliers. However, identifying which cells are corrupted in a specific row is an important problem in practice, and the very first step towards repairing them. We introduce the Robust Variational Autoencoder (RVAE)… ▽ More

    Submitted 3 March, 2020; v1 submitted 15 July, 2019; originally announced July 2019.

    Comments: Accepted for publication at AISTATS 2020

  13. The Extended Dawid-Skene Model: Fusing Information from Multiple Data Schemas

    Authors: Michael P. J. Camilleri, Christopher K. I. Williams

    Abstract: While label fusion from multiple noisy annotations is a well understood concept in data wrangling (tackled for example by the Dawid-Skene (DS) model), we consider the extended problem of carrying out learning when the labels themselves are not consistently annotated with the same schema. We show that even if annotators use disparate, albeit related, label-sets, we can still draw inferences for the… ▽ More

    Submitted 6 March, 2020; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: Updated with Author-Preprint version following Publication in P. Cellier and K. Driessens (Eds.): ECML PKDD 2019 Workshops, CCIS 1167, pp. 121 - 136, 2020

    Journal ref: in ECML PKDD 2019 Workshops, CCIS 1167, pp. 121 - 136, 2020

  14. arXiv:1903.08970  [pdf, other

    cs.LG stat.ML

    Multi-Task Time Series Analysis applied to Drug Response Modelling

    Authors: Alex Bird, Christopher K. I. Williams, Christopher Hawthorne

    Abstract: Time series models such as dynamical systems are frequently fitted to a cohort of data, ignoring variation between individual entities such as patients. In this paper we show how these models can be personalised to an individual level while retaining statistical power, via use of multi-task learning (MTL). To our knowledge this is a novel development of MTL which applies to time series both with a… ▽ More

    Submitted 21 March, 2019; originally announced March 2019.

    Comments: To appear in AISTATS 2019

  15. arXiv:1806.00400  [pdf, other

    stat.ML cs.LG

    Inverting Supervised Representations with Autoregressive Neural Density Models

    Authors: Charlie Nash, Nate Kushman, Christopher K. I. Williams

    Abstract: We present a method for feature interpretation that makes use of recent advances in autoregressive density estimation models to invert model representations. We train generative inversion models to express a distribution over input features conditioned on intermediate model representations. Insights into the invariances learned by supervised models can be gained by viewing samples from these inver… ▽ More

    Submitted 2 January, 2019; v1 submitted 1 June, 2018; originally announced June 2018.

    Comments: Accepted for publication by AISTATS 2019

  16. arXiv:1801.03851  [pdf, other

    cs.LG stat.ML

    Autoencoders and Probabilistic Inference with Missing Data: An Exact Solution for The Factor Analysis Case

    Authors: Christopher K. I. Williams, Charlie Nash, Alfredo Nazábal

    Abstract: Latent variable models can be used to probabilistically "fill-in" missing data entries. The variational autoencoder architecture (Kingma and Welling, 2014; Rezende et al., 2014) includes a "recognition" or "encoder" network that infers the latent variables given the data variables. However, it is not clear how to handle missing data variables in this network. The factor analysis (FA) model is a ba… ▽ More

    Submitted 19 February, 2019; v1 submitted 11 January, 2018; originally announced January 2018.

    Comments: 7 pages, 2 figures, Adding ref to Ilin and Raiko (2010)

  17. arXiv:1711.04674  [pdf, other

    stat.ML

    Model Criticism in Latent Space

    Authors: Sohan Seth, Iain Murray, Christopher K. I. Williams

    Abstract: Model criticism is usually carried out by assessing if replicated data generated under the fitted model looks similar to the observed data, see e.g. Gelman, Carlin, Stern, and Rubin [2004, p. 165]. This paper presents a method for latent variable models by pulling back the data into the space of latent variables, and carrying out model criticism in that space. Making use of a model's structure ena… ▽ More

    Submitted 2 July, 2018; v1 submitted 13 November, 2017; originally announced November 2017.

  18. arXiv:1612.00662  [pdf, other

    stat.ML cs.LG

    Predicting Patient State-of-Health using Sliding Window and Recurrent Classifiers

    Authors: Adam McCarthy, Christopher K. I. Williams

    Abstract: Bedside monitors in Intensive Care Units (ICUs) frequently sound incorrectly, slowing response times and desensitising nurses to alarms (Chambrin, 2001), causing true alarms to be missed (Hug et al., 2011). We compare sliding window predictors with recurrent predictors to classify patient state-of-health from ICU multivariate time series; we report slightly improved performance for the RNN for thr… ▽ More

    Submitted 2 December, 2016; originally announced December 2016.

    Comments: NIPS 2016 Workshop on Machine Learning for Health

  19. arXiv:1506.03852  [pdf, other

    stat.ML cs.CV

    Tree-Cut for Probabilistic Image Segmentation

    Authors: Shell X. Hu, Christopher K. I. Williams, Sinisa Todorovic

    Abstract: This paper presents a new probabilistic generative model for image segmentation, i.e. the task of partitioning an image into homogeneous regions. Our model is grounded on a mid-level image representation, called a region tree, in which regions are recursively split into subregions until superpixels are reached. Given the region tree, image segmentation is formalized as sampling cuts in the tree fr… ▽ More

    Submitted 11 June, 2015; originally announced June 2015.

  20. arXiv:1205.6326  [pdf, other

    stat.ML cs.LG stat.CO

    A Framework for Evaluating Approximation Methods for Gaussian Process Regression

    Authors: Krzysztof Chalupka, Christopher K. I. Williams, Iain Murray

    Abstract: Gaussian process (GP) predictors are an important component of many Bayesian approaches to machine learning. However, even a straightforward implementation of Gaussian process regression (GPR) requires O(n^2) space and O(n^3) time for a dataset of n examples. Several approximation methods have been proposed, but there is a lack of understanding of the relative merits of the different approximation… ▽ More

    Submitted 5 November, 2012; v1 submitted 29 May, 2012; originally announced May 2012.

    Comments: 19 pages, 4 figures