Showing 1–43 of 43 results for author: Nowozin, S

Searching in archive cs.
  1. arXiv:2302.01170

    stat.ML cond-mat.stat-mech cs.LG physics.chem-ph

    Timewarp: Transferable Acceleration of Molecular Dynamics by Learning Time-Coarsened Dynamics

    Authors: Leon Klein, Andrew Y. K. Foong, Tor Erlend Fjelde, Bruno Mlodozeniec, Marc Brockschmidt, Sebastian Nowozin, Frank Noé, Ryota Tomioka

    Abstract: Molecular dynamics (MD) simulation is a widely used technique to simulate molecular systems, most commonly at the all-atom resolution where equations of motion are integrated with timesteps on the order of femtoseconds ($1\textrm{fs}=10^{-15}\textrm{s}$). MD is often used to compute equilibrium properties, which requires sampling from an equilibrium distribution such as the Boltzmann distribution.…

    Submitted 1 December, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

  2. arXiv:2301.06496

    physics.optics cs.AI cs.CV

    Efficient data transport over multimode light-pipes with Megapixel images using differentiable ray tracing and Machine-learning

    Authors: Joowon Lim, Jannes Gladrow, Douglas Kelly, Greg O'Shea, Govert Verkes, Ioan Stefanovici, Sebastian Nowozin, Benn Thomsen

    Abstract: Retrieving images transmitted through multi-mode fibers is of growing interest, thanks to their ability to confine and transport light efficiently in a compact system. Here, we demonstrate machine-learning-based decoding of large-scale digital images (pages), maximizing page capacity for optical storage applications. Using a millimeter-sized square cross-section waveguide, we image an 8-bit spatia…

    Submitted 24 August, 2023; v1 submitted 16 January, 2023; originally announced January 2023.

    Comments: 21 pages, 5 figures

  3. arXiv:2206.09843

    cs.CV cs.LG

    Contextual Squeeze-and-Excitation for Efficient Few-Shot Image Classification

    Authors: Massimiliano Patacchiola, John Bronskill, Aliaksandra Shysheya, Katja Hofmann, Sebastian Nowozin, Richard E. Turner

    Abstract: Recent years have seen a growth in user-centric applications that require effective knowledge transfer across tasks in the low-data regime. An example is personalization, where a pretrained system is adapted by learning on small amounts of labeled data belonging to a specific user. This setting requires high accuracy under low computational complexity, therefore the Pareto frontier of accuracy vs.…

    Submitted 11 January, 2023; v1 submitted 20 June, 2022; originally announced June 2022.

    Comments: Advances in Neural Information Processing Systems (NeurIPS 2022)

  4. arXiv:2206.08671

    stat.ML cs.CV cs.LG

    FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image Classification

    Authors: Aliaksandra Shysheya, John Bronskill, Massimiliano Patacchiola, Sebastian Nowozin, Richard E Turner

    Abstract: Modern deep learning systems are increasingly deployed in situations such as personalization and federated learning where it is necessary to support i) learning on small amounts of data, and ii) communication efficient distributed training protocols. In this work, we develop FiLM Transfer (FiT) which fulfills these requirements in the image classification setting by combining ideas from transfer l…

    Submitted 2 February, 2023; v1 submitted 17 June, 2022; originally announced June 2022.

    Journal ref: The Eleventh International Conference on Learning Representations (ICLR 2023)

  5. arXiv:2107.01105

    stat.ML cs.LG

    Memory Efficient Meta-Learning with Large Images

    Authors: John Bronskill, Daniela Massiceti, Massimiliano Patacchiola, Katja Hofmann, Sebastian Nowozin, Richard E. Turner

    Abstract: Meta learning approaches to few-shot classification are computationally efficient at test time, requiring just a few optimization steps or single forward pass to learn a new task, but they remain highly memory-intensive to train. This limitation arises because a task's entire support set, which can contain up to 1000 images, must be processed before an optimization step can be taken. Harnessing th…

    Submitted 26 October, 2021; v1 submitted 2 July, 2021; originally announced July 2021.

    Journal ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  6. arXiv:2106.06615

    cs.LG

    Precise characterization of the prior predictive distribution of deep ReLU networks

    Authors: Lorenzo Noci, Gregor Bachmann, Kevin Roth, Sebastian Nowozin, Thomas Hofmann

    Abstract: Recent works on Bayesian neural networks (BNNs) have highlighted the need to better understand the implications of using Gaussian priors in combination with the compositional structure of the network architecture. Similar in spirit to the kind of analysis that has been developed to devise better initialization schemes for neural networks (cf. He- or Xavier initialization), we derive a precise char…

    Submitted 27 October, 2021; v1 submitted 11 June, 2021; originally announced June 2021.

    Journal ref: NeurIPS 2021

  7. arXiv:2106.06596

    cs.LG cs.AI

    Disentangling the Roles of Curation, Data-Augmentation and the Prior in the Cold Posterior Effect

    Authors: Lorenzo Noci, Kevin Roth, Gregor Bachmann, Sebastian Nowozin, Thomas Hofmann

    Abstract: The "cold posterior effect" (CPE) in Bayesian deep learning describes the uncomforting observation that the predictive performance of Bayesian neural networks can be significantly improved if the Bayes posterior is artificially sharpened using a temperature parameter T<1. The CPE is problematic in theory and practice and since the effect was identified many researchers have proposed hypotheses to…

    Submitted 27 October, 2021; v1 submitted 11 June, 2021; originally announced June 2021.

    Journal ref: NeurIPS 2021

  8. arXiv:2003.03284

    stat.ML cs.LG

    TaskNorm: Rethinking Batch Normalization for Meta-Learning

    Authors: John Bronskill, Jonathan Gordon, James Requeima, Sebastian Nowozin, Richard E. Turner

    Abstract: Modern meta-learning approaches for image classification rely on increasingly deep networks to achieve state-of-the-art performance, making batch normalization an essential component of meta-learning pipelines. However, the hierarchical nature of the meta-learning setting presents several challenges that can render conventional batch normalization ineffective, giving rise to the need to rethink no…

    Submitted 28 June, 2020; v1 submitted 6 March, 2020; originally announced March 2020.

    Journal ref: Proceedings of Machine Learning and Systems 2020, 4683-4694

  9. arXiv:2002.02655

    cs.LG stat.ML

    The k-tied Normal Distribution: A Compact Parameterization of Gaussian Mean Field Posteriors in Bayesian Neural Networks

    Authors: Jakub Swiatkowski, Kevin Roth, Bastiaan S. Veeling, Linh Tran, Joshua V. Dillon, Jasper Snoek, Stephan Mandt, Tim Salimans, Rodolphe Jenatton, Sebastian Nowozin

    Abstract: Variational Bayesian Inference is a popular methodology for approximating posterior distributions over Bayesian neural network weights. Recent work developing this class of methods has explored ever richer parameterizations of the approximate posterior in the hope of improving performance. In contrast, here we share a curious experimental finding that suggests instead restricting the variational d…

    Submitted 5 July, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

  10. arXiv:2002.02405

    stat.ML cs.LG stat.CO

    How Good is the Bayes Posterior in Deep Neural Networks Really?

    Authors: Florian Wenzel, Kevin Roth, Bastiaan S. Veeling, Jakub Świątkowski, Linh Tran, Stephan Mandt, Jasper Snoek, Tim Salimans, Rodolphe Jenatton, Sebastian Nowozin

    Abstract: During the past five years the Bayesian deep learning community has developed increasingly accurate and efficient approximate inference procedures that allow for Bayesian inference in deep neural networks. However, despite this algorithmic progress and the promise of improved uncertainty quantification and sample efficiency there are, as of early 2020, no publicized deployments of Bayesian neura…

    Submitted 2 July, 2020; v1 submitted 6 February, 2020; originally announced February 2020.

    Comments: Full version (main paper and appendix) of the ICML 2020 publication

  11. arXiv:2001.04694

    cs.LG stat.ML

    Hydra: Preserving Ensemble Diversity for Model Distillation

    Authors: Linh Tran, Bastiaan S. Veeling, Kevin Roth, Jakub Swiatkowski, Joshua V. Dillon, Jasper Snoek, Stephan Mandt, Tim Salimans, Sebastian Nowozin, Rodolphe Jenatton

    Abstract: Ensembles of models have been empirically shown to improve predictive performance and to yield robust measures of uncertainty. However, they are expensive in computation and memory. Therefore, recent research has focused on distilling ensembles into a single compact model, reducing the computational and memory burden of the ensemble while trying to preserve its predictive behavior. Most existing d…

    Submitted 19 March, 2021; v1 submitted 14 January, 2020; originally announced January 2020.

    Comments: Accepted to ICML 2020 Workshop on Uncertainty and Robustness in Deep Learning

  12. arXiv:1909.05063

    stat.ML cs.LG

    Independent Subspace Analysis for Unsupervised Learning of Disentangled Representations

    Authors: Jan Stühmer, Richard E. Turner, Sebastian Nowozin

    Abstract: Recently there has been an increased interest in unsupervised learning of disentangled representations using the Variational Autoencoder (VAE) framework. Most of the existing work has focused largely on modifying the variational cost function to achieve this goal. We first show that these modifications, e.g. beta-VAE, simplify the tendency of variational inference to underfit causing pathological…

    Submitted 5 September, 2019; originally announced September 2019.

  13. arXiv:1908.04537

    cs.LG cs.AI stat.ML

    Icebreaker: Element-wise Active Information Acquisition with Bayesian Deep Latent Gaussian Model

    Authors: Wenbo Gong, Sebastian Tschiatschek, Richard Turner, Sebastian Nowozin, José Miguel Hernández-Lobato, Cheng Zhang

    Abstract: In this paper we introduce the ice-start problem, i.e., the challenge of deploying machine learning models when little or no training data is initially available, and acquiring each feature element of data is associated with costs. This setting is representative of real-world machine learning applications. For instance, in the health-care domain, when training an AI system for predicting…

    Submitted 14 August, 2019; v1 submitted 13 August, 2019; originally announced August 2019.

  14. arXiv:1906.07697

    stat.ML cs.LG

    Fast and Flexible Multi-Task Classification Using Conditional Neural Adaptive Processes

    Authors: James Requeima, Jonathan Gordon, John Bronskill, Sebastian Nowozin, Richard E. Turner

    Abstract: The goal of this paper is to design image classification systems that, after an initial multi-task training phase, can automatically adapt to new tasks encountered at test time. We introduce a conditional neural process based approach to the multi-task classification setting for this purpose, and establish connections to the meta-learning and few-shot learning literature. The resulting approach, c…

    Submitted 7 January, 2020; v1 submitted 18 June, 2019; originally announced June 2019.

    Comments: Published in NeurIPS 2019

    Journal ref: Advances in Neural Information Processing Systems 32 (2019) 7957-7968

  15. arXiv:1906.02530

    stat.ML cs.LG

    Can You Trust Your Model's Uncertainty? Evaluating Predictive Uncertainty Under Dataset Shift

    Authors: Yaniv Ovadia, Emily Fertig, Jie Ren, Zachary Nado, D Sculley, Sebastian Nowozin, Joshua V. Dillon, Balaji Lakshminarayanan, Jasper Snoek

    Abstract: Modern machine learning methods including deep learning have achieved great success in predictive accuracy for supervised learning tasks, but may still fall short in giving useful estimates of their predictive uncertainty. Quantifying uncertainty is especially critical in real-world settings, which often involve input distributions that are shifted from the training distribution due to a var…

    Submitted 17 December, 2019; v1 submitted 6 June, 2019; originally announced June 2019.

    Comments: Advances in Neural Information Processing Systems, 2019

  16. arXiv:1812.03828

    cs.CV

    Occupancy Networks: Learning 3D Reconstruction in Function Space

    Authors: Lars Mescheder, Michael Oechsle, Michael Niemeyer, Sebastian Nowozin, Andreas Geiger

    Abstract: With the advent of deep neural networks, learning-based approaches for 3D reconstruction have gained popularity. However, unlike for images, in 3D there is no canonical representation which is both computationally and memory efficient yet allows for representing high-resolution geometry of arbitrary topology. Many of the state-of-the-art learning-based 3D reconstruction approaches can hence only r…

    Submitted 30 April, 2019; v1 submitted 10 December, 2018; originally announced December 2018.

    Comments: To be presented at CVPR 2019. Supplementary material and code is available at http://avg.is.tuebingen.mpg.de/publications/occupancy-networks

  17. arXiv:1811.07753

    cs.CV

    Contextual Face Recognition with a Nested-Hierarchical Nonparametric Identity Model

    Authors: Daniel C. Castro, Sebastian Nowozin

    Abstract: Current face recognition systems typically operate via classification into known identities obtained from supervised identity annotations. There are two problems with this paradigm: (1) current systems are unable to benefit from often abundant unlabelled data; and (2) they equate successful recognition with labelling a given input image. Humans, on the other hand, regularly perform identification…

    Submitted 19 November, 2018; originally announced November 2018.

    Comments: NeurIPS 2018 Workshop on All of Bayesian Nonparametrics (BNP@NeurIPS 2018). arXiv admin note: substantial text overlap with arXiv:1807.07872

  18. arXiv:1810.03958

    cs.LG stat.ML

    Deterministic Variational Inference for Robust Bayesian Neural Networks

    Authors: Anqi Wu, Sebastian Nowozin, Edward Meeds, Richard E. Turner, José Miguel Hernández-Lobato, Alexander L. Gaunt

    Abstract: Bayesian neural networks (BNNs) hold great promise as a flexible and principled solution to deal with uncertainty when learning from finite data. Among approaches to realize probabilistic inference in deep neural networks, variational Bayes (VB) is theoretically grounded, generally applicable, and computationally efficient. With wide recognition of potential advantages, why is it that variational…

    Submitted 7 March, 2019; v1 submitted 9 October, 2018; originally announced October 2018.

  19. arXiv:1809.11142

    cs.LG stat.ML

    EDDI: Efficient Dynamic Discovery of High-Value Information with Partial VAE

    Authors: Chao Ma, Sebastian Tschiatschek, Konstantina Palla, José Miguel Hernández-Lobato, Sebastian Nowozin, Cheng Zhang

    Abstract: Many real-life decision-making situations allow further relevant information to be acquired at a specific cost, for example, in assessing the health status of a patient we may decide to take additional measurements such as diagnostic tests or imaging scans before making a final assessment. Acquiring more relevant information enables better decision making, but may be costly. How can we trade off t…

    Submitted 15 May, 2019; v1 submitted 28 September, 2018; originally announced September 2018.

    Comments: ICML 2019 camera-ready version

  20. From Face Recognition to Models of Identity: A Bayesian Approach to Learning about Unknown Identities from Unsupervised Data

    Authors: Daniel C. Castro, Sebastian Nowozin

    Abstract: Current face recognition systems robustly recognize identities across a wide variety of imaging conditions. In these systems recognition is performed via classification into known identities obtained from supervised identity annotations. There are two problems with this current paradigm: (1) current systems are unable to benefit from unlabelled data which may be available in large quantities; and…

    Submitted 20 July, 2018; originally announced July 2018.

    Comments: Accepted for publication at ECCV 2018

  21. arXiv:1805.09921

    stat.ML cs.LG

    Meta-Learning Probabilistic Inference For Prediction

    Authors: Jonathan Gordon, John Bronskill, Matthias Bauer, Sebastian Nowozin, Richard E. Turner

    Abstract: This paper introduces a new framework for data efficient and versatile learning. Specifically: 1) We develop ML-PIP, a general framework for Meta-Learning approximate Probabilistic Inference for Prediction. ML-PIP extends existing probabilistic interpretations of meta-learning to cover a broad class of methods. 2) We introduce VERSA, an instance of the framework employing a flexible and versatile…

    Submitted 6 August, 2019; v1 submitted 24 May, 2018; originally announced May 2018.

    Comments: International Conference on Learning Representations (ICLR) 2019

    Journal ref: International Conference on Learning Representations (2019)

  22. arXiv:1805.08736

    stat.ML cs.LG

    Adversarially Robust Training through Structured Gradient Regularization

    Authors: Kevin Roth, Aurelien Lucchi, Sebastian Nowozin, Thomas Hofmann

    Abstract: We propose a novel data-dependent structured gradient regularizer to increase the robustness of neural networks vis-a-vis adversarial perturbations. Our regularizer can be derived as a controlled approximation from first principles, leveraging the fundamental link between training with noise and regularization. It adds very little computational overhead during learning and is simple to implement g…

    Submitted 22 May, 2018; originally announced May 2018.

  23. arXiv:1805.03430

    cs.CV

    Deep Directional Statistics: Pose Estimation with Uncertainty Quantification

    Authors: Sergey Prokudin, Peter Gehler, Sebastian Nowozin

    Abstract: Modern deep learning systems successfully solve many perception tasks such as object pose estimation when the input image is of high quality. However, in challenging imaging conditions such as on low-resolution images or when the image is corrupted by imaging artifacts, current systems degrade considerably in accuracy. While a loss in performance is unavoidable, we would like our models to quantif…

    Submitted 9 May, 2018; originally announced May 2018.

  24. arXiv:1801.04406

    cs.LG cs.AI cs.GT

    Which Training Methods for GANs do actually Converge?

    Authors: Lars Mescheder, Andreas Geiger, Sebastian Nowozin

    Abstract: Recent work has shown local convergence of GAN training for absolutely continuous data and generator distributions. In this paper, we show that the requirement of absolute continuity is necessary: we describe a simple yet prototypical counterexample showing that in the more realistic case of distributions that are not absolutely continuous, unregularized GAN training is not always convergent. Furt…

    Submitted 31 July, 2018; v1 submitted 13 January, 2018; originally announced January 2018.

    Comments: conference

    Journal ref: International Conference on Machine Learning 2018

  25. arXiv:1711.11566

    cs.LG cs.CV

    Hybrid VAE: Improving Deep Generative Models using Partial Observations

    Authors: Sergey Tulyakov, Andrew Fitzgibbon, Sebastian Nowozin

    Abstract: Deep neural network models trained on large labeled datasets are the state-of-the-art in a large variety of computer vision tasks. In many applications, however, labeled data is expensive to obtain or requires a time consuming manual annotation process. In contrast, unlabeled data is often abundant and available in large quantities. We present a principled framework to capitalize on unlabeled data…

    Submitted 30 November, 2017; originally announced November 2017.

  26. arXiv:1710.10766

    cs.LG

    PixelDefend: Leveraging Generative Models to Understand and Defend against Adversarial Examples

    Authors: Yang Song, Taesup Kim, Sebastian Nowozin, Stefano Ermon, Nate Kushman

    Abstract: Adversarial perturbations of normal images are usually imperceptible to humans, but they can seriously confuse state-of-the-art machine learning models. What makes them so special in the eyes of image classifiers? In this paper, we show empirically that adversarial examples mainly lie in the low probability regions of the training distribution, regardless of attack types and targeted models. Using…

    Submitted 21 May, 2018; v1 submitted 30 October, 2017; originally announced October 2017.

    Comments: ICLR 2018

  27. arXiv:1705.10998

    cs.AI

    The Atari Grand Challenge Dataset

    Authors: Vitaly Kurin, Sebastian Nowozin, Katja Hofmann, Lucas Beyer, Bastian Leibe

    Abstract: Recent progress in Reinforcement Learning (RL), fueled by its combination with Deep Learning, has enabled impressive results in learning to interact with complex virtual environments, yet real-world applications of RL are still scarce. A key limitation is data efficiency, with current state-of-the-art approaches requiring millions of training samples. A promising way to tackle this problem is to a…

    Submitted 31 May, 2017; originally announced May 2017.

  28. arXiv:1705.10461

    cs.LG

    The Numerics of GANs

    Authors: Lars Mescheder, Sebastian Nowozin, Andreas Geiger

    Abstract: In this paper, we analyze the numerics of common algorithms for training Generative Adversarial Networks (GANs). Using the formalism of smooth two-player games we analyze the associated gradient vector field of GAN training objectives. Our findings suggest that the convergence of current algorithms suffers due to two factors: i) presence of eigenvalues of the Jacobian of the gradient vector field…

    Submitted 11 June, 2018; v1 submitted 30 May, 2017; originally announced May 2017.

  29. arXiv:1705.09367

    cs.LG stat.ML

    Stabilizing Training of Generative Adversarial Networks through Regularization

    Authors: Kevin Roth, Aurelien Lucchi, Sebastian Nowozin, Thomas Hofmann

    Abstract: Deep generative models based on Generative Adversarial Networks (GANs) have demonstrated impressive sample quality but in order to work they require a careful choice of architecture, parameter initialization, and selection of hyper-parameters. This fragility is in part due to a dimensional mismatch or non-overlapping support between the model distribution and the data distribution, causing their d…

    Submitted 7 November, 2017; v1 submitted 25 May, 2017; originally announced May 2017.

  30. arXiv:1705.08841

    cs.LG stat.ML

    Multi-Level Variational Autoencoder: Learning Disentangled Representations from Grouped Observations

    Authors: Diane Bouchacourt, Ryota Tomioka, Sebastian Nowozin

    Abstract: We would like to learn a representation of the data which decomposes an observation into factors of variation which we can independently control. Specifically, we want to use minimal supervision to learn a latent representation that reflects the semantics behind a specific grouping of the data, where within a group the samples share a common factor of variation. For example, consider a collection…

    Submitted 24 May, 2017; originally announced May 2017.

  31. arXiv:1701.04722

    cs.LG

    Adversarial Variational Bayes: Unifying Variational Autoencoders and Generative Adversarial Networks

    Authors: Lars Mescheder, Sebastian Nowozin, Andreas Geiger

    Abstract: Variational Autoencoders (VAEs) are expressive latent variable models that can be used to learn complex probability distributions from training data. However, the quality of the resulting model crucially relies on the expressiveness of the inference model. We introduce Adversarial Variational Bayes (AVB), a technique for training Variational Autoencoders with arbitrarily expressive inference model…

    Submitted 11 June, 2018; v1 submitted 17 January, 2017; originally announced January 2017.

  32. arXiv:1612.03779

    cs.CV

    PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning

    Authors: Alexander Krull, Eric Brachmann, Sebastian Nowozin, Frank Michel, Jamie Shotton, Carsten Rother

    Abstract: State-of-the-art computer vision algorithms often achieve efficiency by making discrete choices about which hypotheses to explore next. This allows allocation of computational resources to promising candidates, however, such decisions are non-differentiable. As a result, these algorithms are hard to train in an end-to-end fashion. In this work we propose to learn an efficient algorithm for the tas…

    Submitted 11 April, 2017; v1 submitted 12 December, 2016; originally announced December 2016.

  33. arXiv:1611.06928

    cs.AI stat.ML

    Memory Lens: How Much Memory Does an Agent Use?

    Authors: Christoph Dann, Katja Hofmann, Sebastian Nowozin

    Abstract: We propose a new method to study the internal memory used by reinforcement learning policies. We estimate the amount of relevant past information by estimating mutual information between behavior histories and the current action of an agent. We perform this estimation in the passive setting, that is, we do not intervene but merely observe the natural behavior of the agent. Moreover, we provide a t…

    Submitted 21 November, 2016; originally announced November 2016.

    Comments: Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems

  34. arXiv:1611.06684

    cs.LG math.PR stat.ML

    Probabilistic Duality for Parallel Gibbs Sampling without Graph Coloring

    Authors: Lars Mescheder, Sebastian Nowozin, Andreas Geiger

    Abstract: We present a new notion of probabilistic duality for random variables involving mixture distributions. Using this notion, we show how to implement a highly-parallelizable Gibbs sampler for weakly coupled discrete pairwise graphical models with strictly positive factors that requires almost no preprocessing and is easy to implement. Moreover, we show how our method can be combined with blocking to…

    Submitted 21 November, 2016; originally announced November 2016.

  35. arXiv:1611.05705

    cs.CV

    DSAC - Differentiable RANSAC for Camera Localization

    Authors: Eric Brachmann, Alexander Krull, Sebastian Nowozin, Jamie Shotton, Frank Michel, Stefan Gumhold, Carsten Rother

    Abstract: RANSAC is an important algorithm in robust optimization and a central building block for many computer vision applications. In recent years, traditionally hand-crafted pipelines have been replaced by deep learning pipelines, which can be trained in an end-to-end fashion. However, RANSAC has so far not been used as part of such deep learning pipelines, because its hypothesis selection procedure is…

    Submitted 21 March, 2018; v1 submitted 17 November, 2016; originally announced November 2016.

    Comments: CVPR 2017

  36. arXiv:1611.01989

    cs.LG

    DeepCoder: Learning to Write Programs

    Authors: Matej Balog, Alexander L. Gaunt, Marc Brockschmidt, Sebastian Nowozin, Daniel Tarlow

    Abstract: We develop a first line of attack for solving programming competition-style problems from input-output examples using deep learning. The approach is to train a neural network to predict properties of the program that generated the outputs from the inputs. We use the neural network's predictions to augment search techniques from the programming languages community, including enumerative search and…

    Submitted 8 March, 2017; v1 submitted 7 November, 2016; originally announced November 2016.

    Comments: Submitted to ICLR 2017

  37. arXiv:1606.02556

    cs.CV cs.AI

    DISCO Nets: DISsimilarity COefficient Networks

    Authors: Diane Bouchacourt, M. Pawan Kumar, Sebastian Nowozin

    Abstract: We present a new type of probabilistic model which we call DISsimilarity COefficient Networks (DISCO Nets). DISCO Nets allow us to efficiently sample from a posterior distribution parametrised by a neural network. During training, DISCO Nets are learned by minimising the dissimilarity coefficient between the true distribution and the estimated distribution. This allows us to tailor the training to…

    Submitted 28 October, 2016; v1 submitted 8 June, 2016; originally announced June 2016.

  38. arXiv:1606.00709

    stat.ML cs.LG stat.ME

    f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization

    Authors: Sebastian Nowozin, Botond Cseke, Ryota Tomioka

    Abstract: Generative neural samplers are probabilistic models that implement sampling using feedforward neural networks: they take a random input vector and produce a sample from a probability distribution defined by the network weights. These models are expressive and allow efficient computation of samples and derivatives, but cannot be used for computing likelihoods or for marginalization. The generative-…

    Submitted 2 June, 2016; originally announced June 2016.

    Comments: 17 pages

  39. arXiv:1507.06173

    cs.CV

    Bayesian Time-of-Flight for Realtime Shape, Illumination and Albedo

    Authors: Amit Adam, Christoph Dann, Omer Yair, Shai Mazor, Sebastian Nowozin

    Abstract: We propose a computational model for shape, illumination and albedo inference in a pulsed time-of-flight (TOF) camera. In contrast to TOF cameras based on phase modulation, our camera enables general exposure profiles. This results in added flexibility and requires novel computational approaches. To address this challenge we propose a generative probabilistic model that accurately relates latent…

    Submitted 22 July, 2015; originally announced July 2015.

  40. Cascades of Regression Tree Fields for Image Restoration

    Authors: Uwe Schmidt, Jeremy Jancsary, Sebastian Nowozin, Stefan Roth, Carsten Rother

    Abstract: Conditional random fields (CRFs) are popular discriminative models for computer vision and have been successfully applied in the domain of image restoration, especially to image denoising. For image deblurring, however, discriminative approaches have been mostly lacking. We posit two reasons for this: First, the blur kernel is often only known at test time, requiring any discriminative approach to…

    Submitted 20 November, 2014; v1 submitted 8 April, 2014; originally announced April 2014.

    Comments: Submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence

  41. arXiv:1404.0533

    cs.CV

    A Comparative Study of Modern Inference Techniques for Structured Discrete Energy Minimization Problems

    Authors: Jörg H. Kappes, Bjoern Andres, Fred A. Hamprecht, Christoph Schnörr, Sebastian Nowozin, Dhruv Batra, Sungwoong Kim, Bernhard X. Kausler, Thorben Kröger, Jan Lellmann, Nikos Komodakis, Bogdan Savchynskyy, Carsten Rother

    Abstract: Szeliski et al. published an influential study in 2006 on energy minimization methods for Markov Random Fields (MRF). This study provided valuable insights in choosing the best optimization technique for certain classes of problems. While these insights remain generally useful today, the phenomenal success of random field models means that the kinds of inference problems that have to be solved cha…

    Submitted 2 April, 2014; originally announced April 2014.

  42. arXiv:1402.0859

    cs.CV cs.LG stat.ML

    The Informed Sampler: A Discriminative Approach to Bayesian Inference in Generative Computer Vision Models

    Authors: Varun Jampani, Sebastian Nowozin, Matthew Loper, Peter V. Gehler

    Abstract: Computer vision is hard because of a large variability in lighting, shape, and texture; in addition the image signal is non-additive due to occlusion. Generative models promised to account for this variability by accurately modelling the image formation process as a function of latent variables with prior beliefs. Bayesian posterior inference could then, in principle, explain the observation. Whil…

    Submitted 7 March, 2015; v1 submitted 4 February, 2014; originally announced February 2014.

    Comments: Appearing in Computer Vision and Image Understanding Journal (Special Issue on Generative Models in Computer Vision)

  43. arXiv:1206.4620

    cs.LG stat.ML

    Improved Information Gain Estimates for Decision Tree Induction

    Authors: Sebastian Nowozin

    Abstract: Ensembles of classification and regression trees remain popular machine learning methods because they define flexible non-parametric models that predict well and are computationally efficient both during training and testing. During induction of decision trees one aims to find predicates that are maximally informative about the prediction target. To select good predicates most approaches estimate…

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: ICML 2012