Skip to main content

Showing 1–14 of 14 results for author: Knowles, D A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2402.04412  [pdf, other

    cs.LG cs.AI stat.ML

    The VampPrior Mixture Model

    Authors: Andrew Stirn, David A. Knowles

    Abstract: Current clustering priors for deep latent variable models (DLVMs) require defining the number of clusters a-priori and are susceptible to poor initializations. Addressing these deficiencies could greatly benefit deep learning-based scRNA-seq analysis by performing integration and clustering simultaneously. We adapt the VampPrior (Tomczak & Welling, 2018) into a Dirichlet process Gaussian mixture m… ▽ More

    Submitted 4 March, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  2. arXiv:2212.09184  [pdf, other

    cs.LG stat.ML

    Faithful Heteroscedastic Regression with Neural Networks

    Authors: Andrew Stirn, Hans-Hermann Wessels, Megan Schertzer, Laura Pereira, Neville E. Sanjana, David A. Knowles

    Abstract: Heteroscedastic regression models a Gaussian variable's mean and variance as a function of covariates. Parametric methods that employ neural networks for these parameter maps can capture complex relationships in the data. Yet, optimizing network parameters via log likelihood gradients can yield suboptimal mean and uncalibrated variance estimates. Current solutions side-step this optimization probl… ▽ More

    Submitted 18 December, 2022; originally announced December 2022.

  3. arXiv:2006.04910  [pdf, other

    cs.LG stat.ML

    Variational Variance: Simple, Reliable, Calibrated Heteroscedastic Noise Variance Parameterization

    Authors: Andrew Stirn, David A. Knowles

    Abstract: Brittle optimization has been observed to adversely impact model likelihoods for regression and VAEs when simultaneously fitting neural network map**s from a (random) variable onto the mean and variance of a dependent Gaussian variable. Previous works have bolstered optimization and improved likelihoods, but fail other basic posterior predictive checks (PPCs). Under the PPC framework, we propose… ▽ More

    Submitted 30 October, 2020; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: 17 pages, 6 figures, 10 tables

  4. arXiv:1905.12052  [pdf, other

    cs.LG stat.ML

    A New Distribution on the Simplex with Auto-Encoding Applications

    Authors: Andrew Stirn, Tony Jebara, David A Knowles

    Abstract: We construct a new distribution for the simplex using the Kumaraswamy distribution and an ordered stick-breaking process. We explore and develop the theoretical properties of this new distribution and prove that it exhibits symmetry under the same conditions as the well-known Dirichlet. Like the Dirichlet, the new distribution is adept at capturing sparsity but, unlike the Dirichlet, has an exact… ▽ More

    Submitted 14 December, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: 15 pages, 6 figures, 1 tables

  5. arXiv:1509.01631  [pdf, other

    stat.ML

    Stochastic gradient variational Bayes for gamma approximating distributions

    Authors: David A. Knowles

    Abstract: While stochastic variational inference is relatively well known for scaling inference in Bayesian probabilistic models, related methods also offer ways to circumnavigate the approximation of analytically intractable expectations. The key challenge in either setting is controlling the variance of gradient estimates: recent work has shown that for continuous latent variables, particularly multivaria… ▽ More

    Submitted 4 September, 2015; originally announced September 2015.

  6. arXiv:1506.08180  [pdf, other

    stat.ML cs.LG stat.AP stat.CO stat.ME

    An Empirical Study of Stochastic Variational Algorithms for the Beta Bernoulli Process

    Authors: Amar Shah, David A. Knowles, Zoubin Ghahramani

    Abstract: Stochastic variational inference (SVI) is emerging as the most promising candidate for scaling inference in Bayesian probabilistic models to large datasets. However, the performance of these methods has been assessed primarily in the context of Bayesian topic models, particularly latent Dirichlet allocation (LDA). Deriving several new algorithms, and using synthetic, image and genomic datasets, we… ▽ More

    Submitted 26 June, 2015; originally announced June 2015.

    Comments: ICML, 12 pages. Volume 37: Proceedings of The 32nd International Conference on Machine Learning, 2015

  7. arXiv:1408.3378  [pdf, other

    stat.ML

    Beta diffusion trees and hierarchical feature allocations

    Authors: Creighton Heaukulani, David A. Knowles, Zoubin Ghahramani

    Abstract: We define the beta diffusion tree, a random tree structure with a set of leaves that defines a collection of overlap** subsets of objects, known as a feature allocation. A generative process for the tree structure is defined in terms of particles (representing the objects) diffusing in some continuous space, analogously to the Dirichlet diffusion tree (Neal, 2003), which defines a tree structure… ▽ More

    Submitted 3 April, 2015; v1 submitted 14 August, 2014; originally announced August 2014.

    Comments: 43 pages, 13 figures. Major revision to the proof of Thm. 2. Large portions of Chs. 2 & 4 moved into the appendix. Added Fig. 4. Revisions throughout

  8. arXiv:1403.4206  [pdf, other

    stat.ML

    A reversible infinite HMM using normalised random measures

    Authors: Konstantina Palla, David A. Knowles, Zoubin Ghahramani

    Abstract: We present a nonparametric prior over reversible Markov chains. We use completely random measures, specifically gamma processes, to construct a countably infinite graph with weighted edges. By enforcing symmetry to make the edges undirected we define a prior over random walks on graphs that results in a reversible Markov chain. The resulting prior over infinite transition matrices is closely relat… ▽ More

    Submitted 17 March, 2014; originally announced March 2014.

    Comments: 9 pages, 6 figures

  9. arXiv:1401.1022  [pdf, ps, other

    stat.CO

    On Using Control Variates with Stochastic Approximation for Variational Bayes and its Connection to Stochastic Linear Regression

    Authors: Tim Salimans, David A. Knowles

    Abstract: Recently, we and several other authors have written about the possibilities of using stochastic approximation techniques for fitting variational approximations to intractable Bayesian posterior distributions. Naive implementations of stochastic approximation suffer from high variance in this setting. Several authors have therefore suggested using control variates to reduce this variance, while we… ▽ More

    Submitted 12 January, 2014; v1 submitted 6 January, 2014; originally announced January 2014.

  10. arXiv:1309.6858  [pdf

    cs.LG stat.ML

    The Supervised IBP: Neighbourhood Preserving Infinite Latent Feature Models

    Authors: Novi Quadrianto, Viktoriia Sharmanska, David A. Knowles, Zoubin Ghahramani

    Abstract: We propose a probabilistic model to infer supervised latent variables in the Hamming space from observed data. Our model allows simultaneous inference of the number of binary latent variables, and their values. The latent variables preserve neighbourhood structure of the data in a sense that objects in the same semantic concept have similar latent values, and objects in different concepts have dis… ▽ More

    Submitted 26 September, 2013; originally announced September 2013.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-527-536

  11. arXiv:1303.3265  [pdf, other

    stat.ML

    A dependent partition-valued process for multitask clustering and time evolving network modelling

    Authors: Konstantina Palla, David A. Knowles, Zoubin Ghahramani

    Abstract: The fundamental aim of clustering algorithms is to partition data points. We consider tasks where the discovered partition is allowed to vary with some covariate such as space or time. One approach would be to use fragmentation-coagulation processes, but these, being Markov processes, are restricted to linear or tree structured covariate spaces. We define a partition-valued process on an arbitrary… ▽ More

    Submitted 31 October, 2013; v1 submitted 13 March, 2013; originally announced March 2013.

    Comments: 9 pages, 7 figures, submitted for review

  12. arXiv:1206.6679  [pdf, other

    stat.CO cs.CV stat.ML

    Fixed-Form Variational Posterior Approximation through Stochastic Linear Regression

    Authors: Tim Salimans, David A. Knowles

    Abstract: We propose a general algorithm for approximating nonstandard Bayesian posterior distributions. The algorithm minimizes the Kullback-Leibler divergence of an approximating distribution to the intractable posterior distribution. Our method can be used to approximate any posterior distribution, provided that it is given in closed form up to the proportionality constant. The approximation can be any d… ▽ More

    Submitted 28 July, 2014; v1 submitted 28 June, 2012; originally announced June 2012.

    MSC Class: 62F15

    Journal ref: Bayesian Analysis, Volume 8, Number 4 (2013), 837-882

  13. arXiv:1110.4411  [pdf, other

    stat.ML q-fin.ST stat.ME

    Gaussian Process Regression Networks

    Authors: Andrew Gordon Wilson, David A. Knowles, Zoubin Ghahramani

    Abstract: We introduce a new regression framework, Gaussian process regression networks (GPRN), which combines the structural properties of Bayesian neural networks with the non-parametric flexibility of Gaussian processes. This model accommodates input dependent signal and noise correlations between multiple response variables, input dependent length-scales and amplitudes, and heavy-tailed predictive distr… ▽ More

    Submitted 19 October, 2011; originally announced October 2011.

    Comments: 17 pages, 3 figures, 1 table. Submitted for publication

  14. arXiv:1106.2494  [pdf, other

    stat.ML

    Pitman-Yor Diffusion Trees

    Authors: David A. Knowles, Zoubin Ghahramani

    Abstract: We introduce the Pitman Yor Diffusion Tree (PYDT) for hierarchical clustering, a generalization of the Dirichlet Diffusion Tree (Neal, 2001) which removes the restriction to binary branching structure. The generative process is described and shown to result in an exchangeable distribution over data points. We prove some theoretical properties of the model and then present two inference methods: a… ▽ More

    Submitted 16 June, 2011; v1 submitted 13 June, 2011; originally announced June 2011.

    Comments: 8 pages, to be presented at UAI 2011

    MSC Class: 62G07; 62H30 ACM Class: G.3.7