Skip to main content

Showing 51–100 of 102 results for author: de Freitas, N

.
  1. arXiv:1701.00980  [pdf, ps, other

    cond-mat.stat-mech

    Phase Transition and Monopoles Densities in a Nearest Neighbors Two-Dimensional Spin Ice Model

    Authors: C. W. Morais, D. N. de Freitas, A. L. Mota, E. C. Bastone

    Abstract: In this work, we show that, due to the alternating orientation of the spins in the ground state of the artificial square spin ice, the influence of a set of spins at a certain distance of a reference spin decreases faster than the expected result for the long range dipolar interaction, justifying the use of the nearest neighbor two dimensional square spin ice model as an effective model. Using an… ▽ More

    Submitted 4 January, 2017; originally announced January 2017.

    Comments: 11 pages, 8 figures

    Journal ref: International Journal of Modern Physics B Vol. 31, No. 31, 1750237 (2017)

  2. arXiv:1611.03824  [pdf, other

    stat.ML cs.LG

    Learning to Learn without Gradient Descent by Gradient Descent

    Authors: Yutian Chen, Matthew W. Hoffman, Sergio Gomez Colmenarejo, Misha Denil, Timothy P. Lillicrap, Matt Botvinick, Nando de Freitas

    Abstract: We learn recurrent neural network optimizers trained on simple synthetic functions by gradient descent. We show that these learned optimizers exhibit a remarkable degree of transfer in that they can be used to efficiently optimize a broad range of derivative-free black-box functions, including Gaussian process bandits, simple control objectives, global optimization benchmarks and hyper-parameter t… ▽ More

    Submitted 12 June, 2017; v1 submitted 11 November, 2016; originally announced November 2016.

    Comments: Accepted by ICML 2017. Previous version "Learning to Learn for Global Optimization of Black Box Functions" was published in the Deep Reinforcement Learning Workshop, NIPS 2016

  3. arXiv:1611.01843  [pdf, other

    stat.ML cs.AI cs.CV cs.LG cs.NE physics.soc-ph

    Learning to Perform Physics Experiments via Deep Reinforcement Learning

    Authors: Misha Denil, Pulkit Agrawal, Tejas D Kulkarni, Tom Erez, Peter Battaglia, Nando de Freitas

    Abstract: When encountering novel objects, humans are able to infer a wide range of physical properties such as mass, friction and deformability by interacting with them in a goal driven way. This process of active interaction is in the same spirit as a scientist performing experiments to discover hidden facts. Recent advances in artificial intelligence have yielded machines that can achieve superhuman perf… ▽ More

    Submitted 17 August, 2017; v1 submitted 6 November, 2016; originally announced November 2016.

  4. arXiv:1611.01599  [pdf, other

    cs.LG cs.CL cs.CV

    LipNet: End-to-End Sentence-level Lipreading

    Authors: Yannis M. Assael, Brendan Shillingford, Shimon Whiteson, Nando de Freitas

    Abstract: Lipreading is the task of decoding text from the movement of a speaker's mouth. Traditional approaches separated the problem into two stages: designing or learning visual features, and prediction. More recent deep lipreading approaches are end-to-end trainable (Wand et al., 2016; Chung & Zisserman, 2016a). However, existing work on models trained end-to-end perform only word classification, rather… ▽ More

    Submitted 16 December, 2016; v1 submitted 5 November, 2016; originally announced November 2016.

  5. arXiv:1611.01224  [pdf, other

    cs.LG

    Sample Efficient Actor-Critic with Experience Replay

    Authors: Ziyu Wang, Victor Bapst, Nicolas Heess, Volodymyr Mnih, Remi Munos, Koray Kavukcuoglu, Nando de Freitas

    Abstract: This paper presents an actor-critic deep reinforcement learning agent with experience replay that is stable, sample efficient, and performs remarkably well on challenging environments, including the discrete 57-game Atari domain and several continuous control problems. To achieve this, the paper introduces several innovations, including truncated importance sampling with bias correction, stochasti… ▽ More

    Submitted 10 July, 2017; v1 submitted 3 November, 2016; originally announced November 2016.

    Comments: 20 pages. Prepared for ICLR 2017

  6. arXiv:1606.04474  [pdf, other

    cs.NE cs.LG

    Learning to learn by gradient descent by gradient descent

    Authors: Marcin Andrychowicz, Misha Denil, Sergio Gomez, Matthew W. Hoffman, David Pfau, Tom Schaul, Brendan Shillingford, Nando de Freitas

    Abstract: The move from hand-designed features to learned features in machine learning has been wildly successful. In spite of this, optimization algorithms are still designed by hand. In this paper we show how the design of an optimization algorithm can be cast as a learning problem, allowing the algorithm to learn to exploit structure in the problems of interest in an automatic way. Our learned algorithms… ▽ More

    Submitted 30 November, 2016; v1 submitted 14 June, 2016; originally announced June 2016.

  7. arXiv:1605.06676  [pdf, other

    cs.AI cs.LG cs.MA

    Learning to Communicate with Deep Multi-Agent Reinforcement Learning

    Authors: Jakob N. Foerster, Yannis M. Assael, Nando de Freitas, Shimon Whiteson

    Abstract: We consider the problem of multiple agents sensing and acting in environments with the goal of maximising their shared utility. In these environments, agents must learn communication protocols in order to share information that is needed to solve the tasks. By embracing deep neural networks, we are able to demonstrate end-to-end learning of protocols in complex environments inspired by communicati… ▽ More

    Submitted 24 May, 2016; v1 submitted 21 May, 2016; originally announced May 2016.

  8. arXiv:1602.02672  [pdf, other

    cs.AI cs.LG

    Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks

    Authors: Jakob N. Foerster, Yannis M. Assael, Nando de Freitas, Shimon Whiteson

    Abstract: We propose deep distributed recurrent Q-networks (DDRQN), which enable teams of agents to learn to solve communication-based coordination tasks. In these tasks, the agents are not given any pre-designed communication protocol. Therefore, in order to successfully communicate, they must first automatically develop and agree upon their own communication protocol. We present empirical results on two m… ▽ More

    Submitted 8 February, 2016; originally announced February 2016.

  9. arXiv:1511.06581  [pdf, other

    cs.LG

    Dueling Network Architectures for Deep Reinforcement Learning

    Authors: Ziyu Wang, Tom Schaul, Matteo Hessel, Hado van Hasselt, Marc Lanctot, Nando de Freitas

    Abstract: In recent years there have been many successes of using deep representations in reinforcement learning. Still, many of these applications use conventional architectures, such as convolutional networks, LSTMs, or auto-encoders. In this paper, we present a new neural network architecture for model-free reinforcement learning. Our dueling network represents two separate estimators: one for the state… ▽ More

    Submitted 5 April, 2016; v1 submitted 20 November, 2015; originally announced November 2015.

    Comments: 15 pages, 5 figures, and 5 tables

  10. arXiv:1511.06279  [pdf, other

    cs.LG cs.NE

    Neural Programmer-Interpreters

    Authors: Scott Reed, Nando de Freitas

    Abstract: We propose the neural programmer-interpreter (NPI): a recurrent and compositional neural network that learns to represent and execute programs. NPI has three learnable components: a task-agnostic recurrent core, a persistent key-value program memory, and domain-specific encoders that enable a single NPI to operate in multiple perceptually diverse environments with distinct affordances. By learning… ▽ More

    Submitted 29 February, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: ICLR 2016 conference submission

  11. arXiv:1511.05946  [pdf, other

    cs.LG cs.NE

    ACDC: A Structured Efficient Linear Layer

    Authors: Marcin Moczulski, Misha Denil, Jeremy Appleyard, Nando de Freitas

    Abstract: The linear layer is one of the most pervasive modules in deep learning representations. However, it requires $O(N^2)$ parameters and $O(N^2)$ operations. These costs can be prohibitive in mobile applications or prevent scaling in many domains. Here, we introduce a deep, differentiable, fully-connected neural network module composed of diagonal matrices of parameters, $\mathbf{A}$ and $\mathbf{D}$,… ▽ More

    Submitted 19 March, 2016; v1 submitted 18 November, 2015; originally announced November 2015.

  12. arXiv:1508.03666  [pdf, other

    stat.ML

    Unbounded Bayesian Optimization via Regularization

    Authors: Bobak Shahriari, Alexandre Bouchard-Côté, Nando de Freitas

    Abstract: Bayesian optimization has recently emerged as a popular and efficient tool for global optimization and hyperparameter tuning. Currently, the established Bayesian optimization practice requires a user-defined bounding box which is assumed to contain the optimizer. However, when little is known about the probed objective function, it can be difficult to prescribe such bounds. In this work we modify… ▽ More

    Submitted 14 August, 2015; originally announced August 2015.

    Comments: 9 pages, 4 figures

  13. arXiv:1412.7149  [pdf, other

    cs.LG cs.NE stat.ML

    Deep Fried Convnets

    Authors: Zichao Yang, Marcin Moczulski, Misha Denil, Nando de Freitas, Alex Smola, Le Song, Ziyu Wang

    Abstract: The fully connected layers of a deep convolutional neural network typically contain over 90% of the network parameters, and consume the majority of the memory required to store the network parameters. Reducing the number of parameters while preserving essentially the same predictive performance is critically important for operating deep neural networks in memory constrained environments such as GP… ▽ More

    Submitted 17 July, 2015; v1 submitted 22 December, 2014; originally announced December 2014.

    Comments: svd experiments included

  14. arXiv:1412.6815  [pdf, other

    cs.CL cs.IR cs.LG

    Extraction of Salient Sentences from Labelled Documents

    Authors: Misha Denil, Alban Demiraj, Nando de Freitas

    Abstract: We present a hierarchical convolutional document model with an architecture designed to support introspection of the document structure. Using this model, we show how to use visualisation techniques from the computer vision literature to identify and extract topic-relevant sentences. We also introduce a new scalable evaluation technique for automatic sentence extraction systems that avoids the n… ▽ More

    Submitted 28 February, 2015; v1 submitted 21 December, 2014; originally announced December 2014.

    Comments: arXiv admin note: substantial text overlap with arXiv:1406.3830

  15. arXiv:1411.3128  [pdf, other

    cs.LG stat.ML

    Deep Multi-Instance Transfer Learning

    Authors: Dimitrios Kotzias, Misha Denil, Phil Blunsom, Nando de Freitas

    Abstract: We present a new approach for transferring knowledge from groups to individuals that comprise them. We evaluate our method in text, by inferring the ratings of individual sentences using full-review ratings. This approach, which combines ideas from transfer learning, deep learning and multi-instance learning, reduces the need for laborious human labelling of fine-grained data when abundant labels… ▽ More

    Submitted 10 December, 2014; v1 submitted 12 November, 2014; originally announced November 2014.

  16. arXiv:1410.7172  [pdf, other

    cs.LG math.OC stat.ML

    Heteroscedastic Treed Bayesian Optimisation

    Authors: John-Alexander M. Assael, Ziyu Wang, Bobak Shahriari, Nando de Freitas

    Abstract: Optimising black-box functions is important in many disciplines, such as tuning machine learning models, robotics, finance and mining exploration. Bayesian optimisation is a state-of-the-art technique for the global optimisation of black-box functions which are expensive to evaluate. At the core of this approach is a Gaussian process prior that captures our belief about the distribution over funct… ▽ More

    Submitted 4 March, 2015; v1 submitted 27 October, 2014; originally announced October 2014.

  17. arXiv:1406.7758  [pdf, other

    stat.ML cs.LG

    Theoretical Analysis of Bayesian Optimisation with Unknown Gaussian Process Hyper-Parameters

    Authors: Ziyu Wang, Nando de Freitas

    Abstract: Bayesian optimisation has gained great popularity as a tool for optimising the parameters of machine learning algorithms and models. Somewhat ironically, setting up the hyper-parameters of Bayesian optimisation methods is notoriously hard. While reasonable practical solutions have been advanced, they can often fail to find the best optima. Surprisingly, there is little theoretical analysis of this… ▽ More

    Submitted 30 June, 2014; originally announced June 2014.

    Comments: 16 pages, 1 figure

  18. arXiv:1406.4625  [pdf, other

    stat.ML cs.LG

    An Entropy Search Portfolio for Bayesian Optimization

    Authors: Bobak Shahriari, Ziyu Wang, Matthew W. Hoffman, Alexandre Bouchard-Côté, Nando de Freitas

    Abstract: Bayesian optimization is a sample-efficient method for black-box global optimization. How- ever, the performance of a Bayesian optimization method very much depends on its exploration strategy, i.e. the choice of acquisition function, and it is not clear a priori which choice will result in superior performance. While portfolio methods provide an effective, principled way of combining a collection… ▽ More

    Submitted 4 March, 2015; v1 submitted 18 June, 2014; originally announced June 2014.

    Comments: 10 pages, 5 figures

  19. arXiv:1406.3830  [pdf, other

    cs.CL cs.LG stat.ML

    Modelling, Visualising and Summarising Documents with a Single Convolutional Neural Network

    Authors: Misha Denil, Alban Demiraj, Nal Kalchbrenner, Phil Blunsom, Nando de Freitas

    Abstract: Capturing the compositional process which maps the meaning of words to that of documents is a central challenge for researchers in Natural Language Processing and Information Retrieval. We introduce a model that is able to represent the meaning of documents by embedding them in a low dimensional vector space, while preserving distinctions of word and sentence order crucial for capturing nuanced se… ▽ More

    Submitted 15 June, 2014; originally announced June 2014.

  20. arXiv:1406.3070  [pdf, other

    stat.ML

    Distributed Parameter Estimation in Probabilistic Graphical Models

    Authors: Yariv Dror Mizrahi, Misha Denil, Nando de Freitas

    Abstract: This paper presents foundational theoretical results on distributed parameter estimation for undirected probabilistic graphical models. It introduces a general condition on composite likelihood decompositions of these models which guarantees the global consistency of distributed estimators, provided the local estimators are consistent.

    Submitted 11 June, 2014; originally announced June 2014.

  21. arXiv:1404.7296  [pdf, other

    cs.CL

    A Deep Architecture for Semantic Parsing

    Authors: Edward Grefenstette, Phil Blunsom, Nando de Freitas, Karl Moritz Hermann

    Abstract: Many successful approaches to semantic parsing build on top of the syntactic analysis of text, and make use of distributional representations or statistical models to match parses to ontology-specific queries. This paper presents a novel deep learning architecture which provides a semantic parsing system through the union of two neural models of language semantics. It allows for the generation of… ▽ More

    Submitted 29 April, 2014; originally announced April 2014.

    Comments: In Proceedings of the Semantic Parsing Workshop at ACL 2014 (forthcoming)

  22. arXiv:1402.7005  [pdf, other

    stat.ML cs.LG

    Bayesian Multi-Scale Optimistic Optimization

    Authors: Ziyu Wang, Babak Shakibi, Lin **, Nando de Freitas

    Abstract: Bayesian optimization is a powerful global optimization technique for expensive black-box functions. One of its shortcomings is that it requires auxiliary optimization of an acquisition function at each iteration. This auxiliary optimization can be costly and very hard to carry out in practice. Moreover, it creates serious theoretical concerns, as most of the convergence results assume that the ex… ▽ More

    Submitted 27 February, 2014; originally announced February 2014.

    Comments: 15 pages

  23. arXiv:1310.1415  [pdf, other

    stat.ML cs.LG

    Narrowing the Gap: Random Forests In Theory and In Practice

    Authors: Misha Denil, David Matheson, Nando de Freitas

    Abstract: Despite widespread interest and practical use, the theoretical properties of random forests are still not well understood. In this paper we contribute to this understanding in two ways. We present a new theoretically tractable variant of random regression forests and prove that our algorithm is consistent. We also provide an empirical evaluation, comparing our algorithm and other theoretically tra… ▽ More

    Submitted 4 October, 2013; originally announced October 2013.

    Comments: Under review by the International Conference on Machine Learning (ICML) 2014

  24. arXiv:1308.6342  [pdf, other

    stat.ML cs.LG

    Linear and Parallel Learning of Markov Random Fields

    Authors: Yariv Dror Mizrahi, Misha Denil, Nando de Freitas

    Abstract: We introduce a new embarrassingly parallel parameter learning algorithm for Markov random fields with untied parameters which is efficient for a large class of practical models. Our algorithm parallelizes naturally over cliques and, for graphs of bounded degree, its complexity is linear in the number of cliques. Unlike its competitors, our algorithm is fully parallel and for log-linear models it i… ▽ More

    Submitted 5 February, 2014; v1 submitted 28 August, 2013; originally announced August 2013.

  25. arXiv:1306.0543  [pdf, other

    cs.LG cs.NE stat.ML

    Predicting Parameters in Deep Learning

    Authors: Misha Denil, Babak Shakibi, Laurent Dinh, Marc'Aurelio Ranzato, Nando de Freitas

    Abstract: We demonstrate that there is significant redundancy in the parameterization of several deep learning models. Given only a few weight values for each feature it is possible to accurately predict the remaining values. Moreover, we show that not only can the parameter values be predicted, but many of them need not be learned at all. We train several different architectures by learning only a small nu… ▽ More

    Submitted 27 October, 2014; v1 submitted 3 June, 2013; originally announced June 2013.

  26. arXiv:1303.6746  [pdf, other

    stat.ML cs.LG

    Exploiting correlation and budget constraints in Bayesian multi-armed bandit optimization

    Authors: Matthew W. Hoffman, Bobak Shahriari, Nando de Freitas

    Abstract: We address the problem of finding the maximizer of a nonlinear smooth function, that can only be evaluated point-wise, subject to constraints on the number of permitted function evaluations. This problem is also known as fixed-budget best arm identification in the multi-armed bandit literature. We introduce a Bayesian approach for this problem and show that it empirically outperforms both the exis… ▽ More

    Submitted 11 November, 2013; v1 submitted 27 March, 2013; originally announced March 2013.

  27. arXiv:1302.6182  [pdf, other

    stat.CO

    Adaptive Hamiltonian and Riemann Manifold Monte Carlo Samplers

    Authors: ziyu wang, Shakir Mohamed, Nando de Freitas

    Abstract: In this paper we address the widely-experienced difficulty in tuning Hamiltonian-based Monte Carlo samplers. We develop an algorithm that allows for the adaptation of Hamiltonian and Riemann manifold Hamiltonian Monte Carlo samplers using Bayesian optimization that allows for infinite adaptation of the parameters of these samplers. We show that the resulting sampling algorithms are ergodic, and th… ▽ More

    Submitted 25 February, 2013; originally announced February 2013.

    Comments: 10 pages, 4 figures

  28. arXiv:1302.4853  [pdf, other

    stat.ML

    Consistency of Online Random Forests

    Authors: Misha Denil, David Matheson, Nando de Freitas

    Abstract: As a testament to their success, the theory of random forests has long been outpaced by their application in practice. In this paper, we take a step towards narrowing this gap by providing a consistency result for online random forests.

    Submitted 8 May, 2013; v1 submitted 20 February, 2013; originally announced February 2013.

    Comments: To appear in Proceedings of the 30th International Conference on Machine Learning, 2013

  29. arXiv:1301.4604   

    cs.AI

    Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (2012)

    Authors: Nando de Freitas, Kevin Murphy

    Abstract: This is the Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, which was held on Catalina Island, CA August 14-18 2012.

    Submitted 28 August, 2014; v1 submitted 19 January, 2013; originally announced January 2013.

    Report number: UAI2012

  30. arXiv:1301.4168  [pdf, other

    cs.LG stat.CO stat.ML

    Herded Gibbs Sampling

    Authors: Luke Bornn, Yutian Chen, Nando de Freitas, Mareija Eskelin, **g Fang, Max Welling

    Abstract: The Gibbs sampler is one of the most popular algorithms for inference in statistical models. In this paper, we introduce a herding variant of this algorithm, called herded Gibbs, that is entirely deterministic. We prove that herded Gibbs has an $O(1/T)$ convergence rate for models with independent variables and for fully connected probabilistic graphical models. Herded Gibbs is shown to outperform… ▽ More

    Submitted 15 March, 2013; v1 submitted 17 January, 2013; originally announced January 2013.

    Comments: 19 pages, including the appendix. Submission for ICLR 2013

  31. arXiv:1301.3853  [pdf

    cs.LG cs.AI stat.CO

    Rao-Blackwellised Particle Filtering for Dynamic Bayesian Networks

    Authors: Arnaud Doucet, Nando de Freitas, Kevin Murphy, Stuart Russell

    Abstract: Particle filters (PFs) are powerful sampling-based inference/learning algorithms for dynamic Bayesian networks (DBNs). They allow us to treat, in a principled way, any type of probability distribution, nonlinearity and non-stationarity. They have appeared in several fields under such names as "condensation", "sequential Monte Carlo" and "survival of the fittest". In this paper, we show how we can… ▽ More

    Submitted 16 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

    Report number: UAI-P-2000-PG-176-183

  32. arXiv:1301.3833  [pdf

    cs.LG cs.NE stat.ML

    Reversible Jump MCMC Simulated Annealing for Neural Networks

    Authors: Christophe Andrieu, Nando de Freitas, Arnaud Doucet

    Abstract: We propose a novel reversible jump Markov chain Monte Carlo (MCMC) simulated annealing algorithm to optimize radial basis function (RBF) networks. This algorithm enables us to maximize the joint posterior distribution of the network parameters and the number of basis functions. It performs a global search in the joint space of the parameters and number of parameters, thereby surmounting the proble… ▽ More

    Submitted 16 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

    Report number: UAI-P-2000-PG-11-18

  33. arXiv:1301.2266  [pdf

    cs.LG stat.CO stat.ML

    Variational MCMC

    Authors: Nando de Freitas, Pedro Hojen-Sorensen, Michael I. Jordan, Stuart Russell

    Abstract: We propose a new class of learning algorithms that combines variational approximation and Markov chain Monte Carlo (MCMC) simulation. Naive algorithms that use the variational approximation as proposal distribution can perform poorly because this approximation tends to underestimate the true variance and other features of the data. We solve this problem by introducing more sophisticated MCMC algor… ▽ More

    Submitted 10 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI2001)

    Report number: UAI-P-2001-PG-120-127

  34. arXiv:1301.1942  [pdf, other

    stat.ML cs.LG

    Bayesian Optimization in a Billion Dimensions via Random Embeddings

    Authors: Ziyu Wang, Frank Hutter, Masrour Zoghi, David Matheson, Nando de Freitas

    Abstract: Bayesian optimization techniques have been successfully applied to robotics, planning, sensor placement, recommendation, advertising, intelligent user interfaces and automatic algorithm configuration. Despite these successes, the approach is restricted to problems of moderate dimension, and several workshops on Bayesian optimization have identified its scaling to high-dimensions as one of the holy… ▽ More

    Submitted 10 January, 2016; v1 submitted 9 January, 2013; originally announced January 2013.

    Comments: 33 pages

  35. arXiv:1208.0959  [pdf, other

    cs.LG cs.CV stat.ML

    Recklessly Approximate Sparse Coding

    Authors: Misha Denil, Nando de Freitas

    Abstract: It has recently been observed that certain extremely simple feature encoding techniques are able to achieve state of the art performance on several standard image classification benchmarks including deep belief networks, convolutional nets, factored RBMs, mcRBMs, convolutional RBMs, sparse autoencoders and several others. Moreover, these "triangle" or "soft threshold" encodings are ex- tremely eff… ▽ More

    Submitted 6 January, 2013; v1 submitted 4 August, 2012; originally announced August 2012.

  36. arXiv:1207.4149  [pdf

    stat.CO cs.LG

    From Fields to Trees

    Authors: Firas Hamze, Nando de Freitas

    Abstract: We present new MCMC algorithms for computing the posterior distributions and expectations of the unknown variables in undirected graphical models with regular structure. For demonstration purposes, we focus on Markov Random Fields (MRFs). By partitioning the MRFs into non-overlap** trees, it is possible to compute the posterior distribution of a particular tree exactly by conditioning on the rem… ▽ More

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-243-250

  37. arXiv:1207.1396  [pdf

    stat.CO cs.LG stat.ML

    Toward Practical N2 Monte Carlo: the Marginal Particle Filter

    Authors: Mike Klaas, Nando de Freitas, Arnaud Doucet

    Abstract: Sequential Monte Carlo techniques are useful for state estimation in non-linear, non-Gaussian dynamic models. These methods allow us to approximate the joint posterior distribution using sequential importance sampling. In this framework, the dimension of the target distribution grows with each time step, thus it is necessary to introduce some resampling steps to ensure that the estimates provided… ▽ More

    Submitted 4 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005)

    Report number: UAI-P-2005-PG-308-315

  38. arXiv:1207.1393  [pdf

    cs.LG stat.ML

    Learning about individuals from group statistics

    Authors: Hendrik Kuck, Nando de Freitas

    Abstract: We propose a new problem formulation which is similar to, but more informative than, the binary multiple-instance learning problem. In this setting, we are given groups of instances (described by feature vectors) along with estimates of the fraction of positively-labeled instances per group. The task is to learn an instance level classifier from this information. That is, we are trying to estimate… ▽ More

    Submitted 4 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005)

    Report number: UAI-P-2005-PG-332-339

  39. arXiv:1207.1375  [pdf

    cs.AI

    Nonparametric Bayesian Logic

    Authors: Peter Carbonetto, Jacek Kisynski, Nando de Freitas, David L Poole

    Abstract: The Bayesian Logic (BLOG) language was recently developed for defining first-order probability models over worlds with unknown numbers of objects. It handles important problems in AI, including data association and population estimation. This paper extends BLOG by adopting generative processes over function spaces - known as nonparametrics in the Bayesian literature. We introduce syntax for reason… ▽ More

    Submitted 4 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005)

    Report number: UAI-P-2005-PG-85-93

  40. arXiv:1206.6457  [pdf

    cs.LG stat.ML

    Exponential Regret Bounds for Gaussian Process Bandits with Deterministic Observations

    Authors: Nando de Freitas, Alex Smola, Masrour Zoghi

    Abstract: This paper analyzes the problem of Gaussian process (GP) bandits with deterministic observations. The analysis uses a branch and bound algorithm that is related to the UCB algorithm of (Srinivas et al, 2010). For GPs with Gaussian observation noise, with variance strictly greater than zero, Srinivas et al proved that the regret vanishes at the approximate rate of $O(1/\sqrt{t})$, where t is the nu… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012). arXiv admin note: substantial text overlap with arXiv:1203.2177

  41. arXiv:1206.5239  [pdf

    stat.CO cs.AI

    Large-Flip Importance Sampling

    Authors: Firas Hamze, Nando de Freitas

    Abstract: We propose a new Monte Carlo algorithm for complex discrete distributions. The algorithm is motivated by the N-Fold Way, which is an ingenious event-driven MCMC sampler that avoids rejection moves at any specific state. The N-Fold Way can however get "trapped" in cycles. We surmount this problem by modifying the sampling process. This correction does introduce bias, but the bias is subsequently co… ▽ More

    Submitted 20 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence (UAI2007)

    Report number: UAI-P-2007-PG-167-174

  42. arXiv:1205.2643  [pdf

    cs.LG eess.SY math.OC stat.CO stat.ML

    New inference strategies for solving Markov Decision Processes using reversible jump MCMC

    Authors: Matthias Hoffman, Hendrik Kueck, Nando de Freitas, Arnaud Doucet

    Abstract: In this paper we build on previous work which uses inferences techniques, in particular Markov Chain Monte Carlo (MCMC) methods, to solve parameterized control problems. We propose a number of modifications in order to make this approach more practical in general, higher-dimensional spaces. We first introduce a new target distribution which is able to incorporate more reward information from sampl… ▽ More

    Submitted 9 May, 2012; originally announced May 2012.

    Comments: Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)

    Report number: UAI-P-2009-PG-223-231

  43. arXiv:1203.3484  [pdf

    stat.CO cs.AI

    Intracluster Moves for Constrained Discrete-Space MCMC

    Authors: Firas Hamze, Nando de Freitas

    Abstract: This paper addresses the problem of sampling from binary distributions with constraints. In particular, it proposes an MCMC method to draw samples from a distribution of the set of all states at a specified distance from some reference state. For example, when the reference state is the vector of zeros, the algorithm can draw samples from a binary distribution with a constraint on the number of ac… ▽ More

    Submitted 15 March, 2012; originally announced March 2012.

    Comments: Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)

    Report number: UAI-P-2010-PG-236-243

  44. arXiv:1203.2394  [pdf, other

    stat.ML cs.LG stat.CO

    Decentralized, Adaptive, Look-Ahead Particle Filtering

    Authors: Mohamed Osama Ahmed, Pouyan T. Bibalan, Nando de Freitas, Simon Fauvel

    Abstract: The decentralized particle filter (DPF) was proposed recently to increase the level of parallelism of particle filtering. Given a decomposition of the state space into two nested sets of variables, the DPF uses a particle filter to sample the first set and then conditions on this sample to generate a set of samples for the second set of variables. The DPF can be understood as a variant of the popu… ▽ More

    Submitted 11 March, 2012; originally announced March 2012.

    Comments: 16 pages, 11 figures, Authorship in alphabetical order

  45. arXiv:1203.2177  [pdf, other

    cs.LG stat.ML

    Regret Bounds for Deterministic Gaussian Process Bandits

    Authors: Nando de Freitas, Alex Smola, Masrour Zoghi

    Abstract: This paper analyses the problem of Gaussian process (GP) bandits with deterministic observations. The analysis uses a branch and bound algorithm that is related to the UCB algorithm of (Srinivas et al., 2010). For GPs with Gaussian observation noise, with variance strictly greater than zero, (Srinivas et al., 2010) proved that the regret vanishes at the approximate rate of $O(\frac{1}{\sqrt{t}})$,… ▽ More

    Submitted 9 March, 2012; originally announced March 2012.

    Comments: 17 pages, 5 figures

  46. arXiv:1202.3746  [pdf

    cs.LG stat.ML

    Asymptotic Efficiency of Deterministic Estimators for Discrete Energy-Based Models: Ratio Matching and Pseudolikelihood

    Authors: Benjamin Marlin, Nando de Freitas

    Abstract: Standard maximum likelihood estimation cannot be applied to discrete energy-based models in the general case because the computation of exact model probabilities is intractable. Recent research has seen the proposal of several new estimators designed specifically to overcome this intractability, but virtually nothing is known about their theoretical properties. In this paper, we present a generali… ▽ More

    Submitted 14 February, 2012; originally announced February 2012.

    Report number: UAI-P-2011-PG-497-505

  47. arXiv:1111.5379  [pdf, other

    stat.CO cond-mat.dis-nn physics.comp-ph stat.ML

    Self-Avoiding Random Dynamics on Integer Complex Systems

    Authors: Firas Hamze, Ziyu Wang, Nando de Freitas

    Abstract: This paper introduces a new specialized algorithm for equilibrium Monte Carlo sampling of binary-valued systems, which allows for large moves in the state space. This is achieved by constructing self-avoiding walks (SAWs) in the state space. As a consequence, many bits are flipped in a single MCMC step. We name the algorithm SARDONICS, an acronym for Self-Avoiding Random Dynamics on Integer Comple… ▽ More

    Submitted 25 November, 2011; v1 submitted 22 November, 2011; originally announced November 2011.

    Comments: 22 pages. 9 figures

  48. arXiv:1110.6497  [pdf, other

    stat.CO stat.ML

    Bayesian Optimization for Adaptive MCMC

    Authors: Nimalan Mahendran, Ziyu Wang, Firas Hamze, Nando de Freitas

    Abstract: This paper proposes a new randomized strategy for adaptive MCMC using Bayesian optimization. This approach applies to non-differentiable objective functions and trades off exploration and exploitation to reduce the number of potentially costly objective function evaluations. We demonstrate the strategy in the complex setting of sampling from constrained, discrete and densely connected probabilisti… ▽ More

    Submitted 29 October, 2011; originally announced October 2011.

    Comments: This paper contains 12 pages and 6 figures. A similar version of this paper has been submitted to AISTATS 2012 and is currently under review

  49. arXiv:1109.3737  [pdf, other

    cs.AI

    Learning where to Attend with Deep Architectures for Image Tracking

    Authors: Misha Denil, Loris Bazzani, Hugo Larochelle, Nando de Freitas

    Abstract: We discuss an attentional model for simultaneous object tracking and recognition that is driven by gaze data. Motivated by theories of perception, the model consists of two interacting pathways: identity and control, intended to mirror the what and where pathways in neuroscience models. The identity pathway models object appearance and performs classification using deep (factored)-Restricted Boltz… ▽ More

    Submitted 16 September, 2011; originally announced September 2011.

  50. arXiv:1108.3298  [pdf, other

    cs.LG cs.AI cs.CV cs.IR stat.ML

    A Machine Learning Perspective on Predictive Coding with PAQ

    Authors: Byron Knoll, Nando de Freitas

    Abstract: PAQ8 is an open source lossless data compression algorithm that currently achieves the best compression rates on many benchmarks. This report presents a detailed description of PAQ8 from a statistical machine learning perspective. It shows that it is possible to understand some of the modules of PAQ8 and use this understanding to improve the method. However, intuitive statistical explanations of t… ▽ More

    Submitted 16 August, 2011; originally announced August 2011.