Skip to main content

Showing 51–66 of 66 results for author: Lakshminarayanan, B

.
  1. arXiv:1710.08446  [pdf, other

    stat.ML cs.LG

    Many Paths to Equilibrium: GANs Do Not Need to Decrease a Divergence At Every Step

    Authors: William Fedus, Mihaela Rosca, Balaji Lakshminarayanan, Andrew M. Dai, Shakir Mohamed, Ian Goodfellow

    Abstract: Generative adversarial networks (GANs) are a family of generative models that do not minimize a single training criterion. Unlike other generative models, the data distribution is learned via a game between a generator (the generative model) and a discriminator (a teacher providing training signal) that each minimize their own cost. GANs are designed to reach a Nash equilibrium at which each playe… ▽ More

    Submitted 20 February, 2018; v1 submitted 23 October, 2017; originally announced October 2017.

    Comments: 18 pages

  2. arXiv:1706.04987  [pdf, other

    stat.ML cs.LG

    Variational Approaches for Auto-Encoding Generative Adversarial Networks

    Authors: Mihaela Rosca, Balaji Lakshminarayanan, David Warde-Farley, Shakir Mohamed

    Abstract: Auto-encoding generative adversarial networks (GANs) combine the standard GAN algorithm, which discriminates between real and model-generated data, with a reconstruction loss given by an auto-encoder. Such models aim to prevent mode collapse in the learned generative model by ensuring that it is grounded in all the available training data. In this paper, we develop a principle upon which auto-enco… ▽ More

    Submitted 21 October, 2017; v1 submitted 15 June, 2017; originally announced June 2017.

  3. arXiv:1705.10743  [pdf, other

    cs.LG stat.ML

    The Cramer Distance as a Solution to Biased Wasserstein Gradients

    Authors: Marc G. Bellemare, Ivo Danihelka, Will Dabney, Shakir Mohamed, Balaji Lakshminarayanan, Stephan Hoyer, Rémi Munos

    Abstract: The Wasserstein probability metric has received much attention from the machine learning community. Unlike the Kullback-Leibler divergence, which strictly measures change in probability, the Wasserstein metric reflects the underlying geometry between outcomes. The value of being sensitive to this geometry has been demonstrated, among others, in ordinal regression and generative modelling. In this… ▽ More

    Submitted 30 May, 2017; originally announced May 2017.

  4. arXiv:1705.05263  [pdf, other

    cs.LG

    Comparison of Maximum Likelihood and GAN-based training of Real NVPs

    Authors: Ivo Danihelka, Balaji Lakshminarayanan, Benigno Uria, Daan Wierstra, Peter Dayan

    Abstract: We train a generator by maximum likelihood and we also train the same generator architecture by Wasserstein GAN. We then compare the generated samples, exact log-probability densities and approximate Wasserstein distances. We show that an independent critic trained to approximate Wasserstein distance between the validation set and the generator distribution helps detect overfitting. Finally, we us… ▽ More

    Submitted 15 May, 2017; originally announced May 2017.

  5. arXiv:1702.08833  [pdf, other

    cs.LG

    Learning Deep Nearest Neighbor Representations Using Differentiable Boundary Trees

    Authors: Daniel Zoran, Balaji Lakshminarayanan, Charles Blundell

    Abstract: Nearest neighbor (kNN) methods have been gaining popularity in recent years in light of advances in hardware and efficiency of algorithms. There is a plethora of methods to choose from today, each with their own advantages and disadvantages. One requirement shared between all kNN based methods is the need for a good representation and distance measure between samples. We introduce a new method c… ▽ More

    Submitted 28 February, 2017; originally announced February 2017.

  6. arXiv:1612.01474  [pdf, other

    stat.ML cs.LG

    Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles

    Authors: Balaji Lakshminarayanan, Alexander Pritzel, Charles Blundell

    Abstract: Deep neural networks (NNs) are powerful black box predictors that have recently achieved impressive performance on a wide spectrum of tasks. Quantifying predictive uncertainty in NNs is a challenging and yet unsolved problem. Bayesian NNs, which learn a distribution over weights, are currently the state-of-the-art for estimating predictive uncertainty; however these require significant modificatio… ▽ More

    Submitted 3 November, 2017; v1 submitted 5 December, 2016; originally announced December 2016.

    Comments: NIPS 2017

  7. arXiv:1610.03483  [pdf, other

    stat.ML cs.LG stat.CO

    Learning in Implicit Generative Models

    Authors: Shakir Mohamed, Balaji Lakshminarayanan

    Abstract: Generative adversarial networks (GANs) provide an algorithmic framework for constructing generative models with several appealing properties: they do not require a likelihood function to be specified, only a generating procedure; they provide samples that are sharp and compelling; and they allow us to harness our knowledge of building highly accurate neural network classifiers. Here, we develop ou… ▽ More

    Submitted 27 February, 2017; v1 submitted 11 October, 2016; originally announced October 2016.

  8. arXiv:1606.05241  [pdf, other

    stat.ML

    The Mondrian Kernel

    Authors: Matej Balog, Balaji Lakshminarayanan, Zoubin Ghahramani, Daniel M. Roy, Yee Whye Teh

    Abstract: We introduce the Mondrian kernel, a fast random feature approximation to the Laplace kernel. It is suitable for both batch and online learning, and admits a fast kernel-width-selection procedure as the random features can be re-used efficiently for all kernel widths. The features are constructed by sampling trees via a Mondrian process [Roy and Teh, 2009], and we highlight the connection to Mondri… ▽ More

    Submitted 16 June, 2016; originally announced June 2016.

    Comments: Accepted for presentation at the 32nd Conference on Uncertainty in Artificial Intelligence (UAI 2016)

  9. arXiv:1512.09327  [pdf, other

    cs.LG stat.ML

    Distributed Bayesian Learning with Stochastic Natural-gradient Expectation Propagation and the Posterior Server

    Authors: Leonard Hasenclever, Stefan Webb, Thibaut Lienart, Sebastian Vollmer, Balaji Lakshminarayanan, Charles Blundell, Yee Whye Teh

    Abstract: This paper makes two contributions to Bayesian machine learning algorithms. Firstly, we propose stochastic natural gradient expectation propagation (SNEP), a novel alternative to expectation propagation (EP), a popular variational inference algorithm. SNEP is a black box variational algorithm, in that it does not require any simplifying assumptions on the distribution of interest, beyond the exist… ▽ More

    Submitted 7 September, 2017; v1 submitted 31 December, 2015; originally announced December 2015.

    Comments: 37 pages, 7 figures

    Journal ref: Journal of Machine Learning Research 18 (2017) 1-37

  10. arXiv:1506.06100  [pdf, other

    stat.ML cs.LG math.FA

    Approximate Inference with the Variational Holder Bound

    Authors: Guillaume Bouchard, Balaji Lakshminarayanan

    Abstract: We introduce the Variational Holder (VH) bound as an alternative to Variational Bayes (VB) for approximate Bayesian inference. Unlike VB which typically involves maximization of a non-convex lower bound with respect to the variational parameters, the VH bound involves minimization of a convex upper bound to the intractable integral with respect to the variational parameters. Minimization of the VH… ▽ More

    Submitted 19 June, 2015; originally announced June 2015.

  11. arXiv:1506.03805  [pdf, other

    stat.ML cs.LG

    Mondrian Forests for Large-Scale Regression when Uncertainty Matters

    Authors: Balaji Lakshminarayanan, Daniel M. Roy, Yee Whye Teh

    Abstract: Many real-world regression problems demand a measure of the uncertainty associated with each prediction. Standard decision forests deliver efficient state-of-the-art predictive performance, but high-quality uncertainty estimates are lacking. Gaussian processes (GPs) deliver uncertainty estimates, but scaling GPs to large-scale data sets comes at the cost of approximating the uncertainty estimates.… ▽ More

    Submitted 27 May, 2016; v1 submitted 11 June, 2015; originally announced June 2015.

    Comments: Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (AISTATS) 2016, Cadiz, Spain. JMLR: W&CP volume 51

  12. arXiv:1503.02551  [pdf, other

    stat.ML cs.LG

    Kernel-Based Just-In-Time Learning for Passing Expectation Propagation Messages

    Authors: Wittawat Jitkrittum, Arthur Gretton, Nicolas Heess, S. M. Ali Eslami, Balaji Lakshminarayanan, Dino Sejdinovic, Zoltán Szabó

    Abstract: We propose an efficient nonparametric strategy for learning a message operator in expectation propagation (EP), which takes as input the set of incoming messages to a factor node, and produces an outgoing message as output. This learned operator replaces the multivariate integral required in classical EP, which may not have an analytic expression. We use kernel-based regression, which is trained o… ▽ More

    Submitted 9 June, 2015; v1 submitted 9 March, 2015; originally announced March 2015.

    Comments: accepted to UAI 2015. Correct typos. Add more content to the appendix. Main results unchanged

    MSC Class: 62F15; 46e22; 62-09; 62F30 ACM Class: G.3; I.2.6

  13. arXiv:1502.04622  [pdf, other

    stat.ML cs.LG stat.CO

    Particle Gibbs for Bayesian Additive Regression Trees

    Authors: Balaji Lakshminarayanan, Daniel M. Roy, Yee Whye Teh

    Abstract: Additive regression trees are flexible non-parametric models and popular off-the-shelf tools for real-world non-linear regression. In application domains, such as bioinformatics, where there is also demand for probabilistic predictions with measures of uncertainty, the Bayesian additive regression trees (BART) model, introduced by Chipman et al. (2010), is increasingly popular. As data sets have g… ▽ More

    Submitted 16 February, 2015; originally announced February 2015.

    Journal ref: Proceedings of the 18th International Conference on Artificial Intelligence and Statistics (AISTATS) 2015, San Diego, CA, USA. JMLR: W&CP volume 38

  14. arXiv:1406.2673  [pdf, other

    stat.ML cs.LG

    Mondrian Forests: Efficient Online Random Forests

    Authors: Balaji Lakshminarayanan, Daniel M. Roy, Yee Whye Teh

    Abstract: Ensembles of randomized decision trees, usually referred to as random forests, are widely used for classification and regression tasks in machine learning and statistics. Random forests achieve competitive predictive performance and are computationally efficient to train and test, making them excellent candidates for real-world prediction tasks. The most popular random forest variants (such as Bre… ▽ More

    Submitted 16 February, 2015; v1 submitted 10 June, 2014; originally announced June 2014.

    Journal ref: Advances in Neural Information Processing Systems 27 (NIPS), pages 3140-3148, 2014

  15. arXiv:1305.0015  [pdf, other

    stat.ML cs.LG

    Inferring ground truth from multi-annotator ordinal data: a probabilistic approach

    Authors: Balaji Lakshminarayanan, Yee Whye Teh

    Abstract: A popular approach for large scale data annotation tasks is crowdsourcing, wherein each data point is labeled by multiple noisy annotators. We consider the problem of inferring ground truth from noisy ordinal labels obtained from multiple annotators of varying and unknown expertise levels. Annotation models for ordinal data have been proposed mostly as extensions of their binary/categorical counte… ▽ More

    Submitted 30 April, 2013; originally announced May 2013.

  16. arXiv:1303.0561  [pdf, other

    stat.ML cs.LG

    Top-down particle filtering for Bayesian decision trees

    Authors: Balaji Lakshminarayanan, Daniel M. Roy, Yee Whye Teh

    Abstract: Decision tree learning is a popular approach for classification and regression in machine learning and statistics, and Bayesian formulations---which introduce a prior distribution over decision trees, and formulate learning as posterior inference given data---have been shown to produce competitive performance. Unlike classic decision tree learning algorithms like ID3, C4.5 and CART, which work in… ▽ More

    Submitted 22 August, 2013; v1 submitted 3 March, 2013; originally announced March 2013.

    Comments: ICML 2013

    Journal ref: JMLR W&CP 28(3):280-288, 2013