Skip to main content

Showing 1–12 of 12 results for author: Tsuchida, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.09608  [pdf, other

    cs.LG stat.ML

    Exact, Fast and Expressive Poisson Point Processes via Squared Neural Families

    Authors: Russell Tsuchida, Cheng Soon Ong, Dino Sejdinovic

    Abstract: We introduce squared neural Poisson point processes (SNEPPPs) by parameterising the intensity function by the squared norm of a two layer neural network. When the hidden layer is fixed and the second layer has a single neuron, our approach resembles previous uses of squared Gaussian process or kernel methods, but allowing the hidden layer to be learnt allows for additional flexibility. In many cas… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: AAAI 2024 camera ready submission

  2. arXiv:2402.08193  [pdf, other

    cs.LG stat.ML

    Gaussian Ensemble Belief Propagation for Efficient Inference in High-Dimensional Systems

    Authors: Dan MacKinlay, Russell Tsuchida, Dan Pagendam, Petra Kuhnert

    Abstract: Efficient inference in high-dimensional models remains a central challenge in machine learning. This paper introduces the Gaussian Ensemble Belief Propagation (GEnBP) algorithm, a fusion of the Ensemble Kalman filter and Gaussian Belief Propagation (GaBP) methods. GEnBP updates ensembles by passing low-rank local messages over a graphical model. This combination inherits favourable qualities from… ▽ More

    Submitted 22 May, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Under conference submission

    MSC Class: 62-07 (Primary) 62F15; 62M40; 68T05; 68W25 ACM Class: I.2.6; H.2.4; I.2.8; J.2

  3. arXiv:2305.13552  [pdf, other

    cs.LG cs.AI stat.ML

    Squared Neural Families: A New Class of Tractable Density Models

    Authors: Russell Tsuchida, Cheng Soon Ong, Dino Sejdinovic

    Abstract: Flexible models for probability distributions are an essential ingredient in many machine learning tasks. We develop and investigate a new class of probability distributions, which we call a Squared Neural Family (SNEFY), formed by squaring the 2-norm of a neural network and normalising it with respect to a base measure. Following the reasoning similar to the well established connections between i… ▽ More

    Submitted 25 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Spotlight award at NeurIPS 2023

  4. Scalable Optimal Transport Methods in Machine Learning: A Contemporary Survey

    Authors: Abdelwahed Khamis, Russell Tsuchida, Mohamed Tarek, Vivien Rolland, Lars Petersson

    Abstract: Optimal Transport (OT) is a mathematical framework that first emerged in the eighteenth century and has led to a plethora of methods for answering many theoretical and applied questions. The last decade has been a witness to the remarkable contributions of this classical optimization problem to machine learning. This paper is about where and how optimal transport is used in machine learning with a… ▽ More

    Submitted 21 March, 2024; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted @ TPAMI 24

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence 2024

  5. arXiv:2211.05943  [pdf, other

    cs.LG stat.ML

    Deep equilibrium models as estimators for continuous latent variables

    Authors: Russell Tsuchida, Cheng Soon Ong

    Abstract: Principal Component Analysis (PCA) and its exponential family extensions have three components: observations, latents and parameters of a linear transformation. We consider a generalised setting where the canonical parameters of the exponential family are a nonlinear transformation of the latents. We show explicit relationships between particular neural network architectures and the corresponding… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: 25 pages

  6. arXiv:2210.06120  [pdf, other

    cs.CV

    Efficient Gaussian Process Model on Class-Imbalanced Datasets for Generalized Zero-Shot Learning

    Authors: Changkun Ye, Nick Barnes, Lars Petersson, Russell Tsuchida

    Abstract: Zero-Shot Learning (ZSL) models aim to classify object classes that are not seen during the training process. However, the problem of class imbalance is rarely discussed, despite its presence in several ZSL datasets. In this paper, we propose a Neural Network model that learns a latent feature embedding and a Gaussian Process (GP) regression model that predicts latent feature prototypes of unseen… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: Paper Accepted in ICPR 2022

  7. arXiv:2112.13029  [pdf, other

    cs.LG stat.ML

    Gaussian Process Bandits with Aggregated Feedback

    Authors: Mengyan Zhang, Russell Tsuchida, Cheng Soon Ong

    Abstract: We consider the continuum-armed bandits problem, under a novel setting of recommending the best arms within a fixed budget under aggregated feedback. This is motivated by applications where the precise rewards are impossible or expensive to obtain, while an aggregated reward or feedback, such as the average over a subset, is available. We constrain the set of reward functions by assuming that they… ▽ More

    Submitted 24 December, 2021; originally announced December 2021.

    Comments: to be published in 36th AAAI Conference on Artificial Intelligence (2022)

  8. arXiv:2002.08517  [pdf, other

    cs.LG stat.ML

    Avoiding Kernel Fixed Points: Computing with ELU and GELU Infinite Networks

    Authors: Russell Tsuchida, Tim Pearce, Chris van der Heide, Fred Roosta, Marcus Gallagher

    Abstract: Analysing and computing with Gaussian processes arising from infinitely wide neural networks has recently seen a resurgence in popularity. Despite this, many explicit covariance functions of networks with activation functions used in modern networks remain unknown. Furthermore, while the kernels of deep networks can be computed iteratively, theoretical understanding of deep kernels is lacking, par… ▽ More

    Submitted 28 February, 2021; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: AAAI camera ready version. 18 pages, 9 figures, 2 tables. Corrected name particle capitalisation and formatting

  9. arXiv:1911.12927  [pdf, other

    cs.LG stat.ML

    Richer priors for infinitely wide multi-layer perceptrons

    Authors: Russell Tsuchida, Fred Roosta, Marcus Gallagher

    Abstract: It is well-known that the distribution over functions induced through a zero-mean iid prior distribution over the parameters of a multi-layer perceptron (MLP) converges to a Gaussian process (GP), under mild conditions. We extend this result firstly to independent priors with general zero or non-zero means, and secondly to a family of partially exchangeable priors which generalise iid priors. We d… ▽ More

    Submitted 28 November, 2019; originally announced November 2019.

    Comments: Pre-print

  10. arXiv:1905.06076  [pdf, other

    stat.ML cs.LG

    Expressive Priors in Bayesian Neural Networks: Kernel Combinations and Periodic Functions

    Authors: Tim Pearce, Russell Tsuchida, Mohamed Zaki, Alexandra Brintrup, Andy Neely

    Abstract: A simple, flexible approach to creating expressive priors in Gaussian process (GP) models makes new kernels from a combination of basic kernels, e.g. summing a periodic and linear kernel can capture seasonal variation with a long term trend. Despite a well-studied link between GPs and Bayesian neural networks (BNNs), the BNN analogue of this has not yet been explored. This paper derives BNN archit… ▽ More

    Submitted 28 June, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

    Journal ref: The 35th Conference on Uncertainty in Artificial Intelligence (UAI 2019)

  11. arXiv:1810.08351  [pdf, other

    cs.LG stat.ML

    Exchangeability and Kernel Invariance in Trained MLPs

    Authors: Russell Tsuchida, Fred Roosta, Marcus Gallagher

    Abstract: In the analysis of machine learning models, it is often convenient to assume that the parameters are IID. This assumption is not satisfied when the parameters are updated through training processes such as SGD. A relaxation of the IID condition is a probabilistic symmetry known as exchangeability. We show the sense in which the weights in MLPs are exchangeable. This yields the result that in certa… ▽ More

    Submitted 27 October, 2018; v1 submitted 19 October, 2018; originally announced October 2018.

    Comments: 26 pages, 16 Figures; Changed Fred (Farbod) Roosta to Fred Roosta in Metadata

  12. arXiv:1711.09090  [pdf, other

    cs.LG stat.ML

    Invariance of Weight Distributions in Rectified MLPs

    Authors: Russell Tsuchida, Farbod Roosta-Khorasani, Marcus Gallagher

    Abstract: An interesting approach to analyzing neural networks that has received renewed attention is to examine the equivalent kernel of the neural network. This is based on the fact that a fully connected feedforward network with one hidden layer, a certain weight distribution, an activation function, and an infinite number of neurons can be viewed as a map** into a Hilbert space. We derive the equivale… ▽ More

    Submitted 31 May, 2018; v1 submitted 24 November, 2017; originally announced November 2017.

    Comments: ICML 2018