Skip to main content

Showing 1–14 of 14 results for author: Strathmann, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.10179  [pdf, other

    cs.RO cs.AI cs.HC cs.LG

    Scaling Instructable Agents Across Many Simulated Worlds

    Authors: SIMA Team, Maria Abi Raad, Arun Ahuja, Catarina Barros, Frederic Besse, Andrew Bolt, Adrian Bolton, Bethanie Brownfield, Gavin Buttimore, Max Cant, Sarah Chakera, Stephanie C. Y. Chan, Jeff Clune, Adrian Collister, Vikki Copeman, Alex Cullum, Ishita Dasgupta, Dario de Cesare, Julia Di Trapani, Yani Donchev, Emma Dunleavy, Martin Engelcke, Ryan Faulkner, Frankie Garcia, Charles Gbadamosi , et al. (68 additional authors not shown)

    Abstract: Building embodied AI systems that can follow arbitrary language instructions in any 3D environment is a key challenge for creating general AI. Accomplishing this goal requires learning to ground language in perception and embodied actions, in order to accomplish complex tasks. The Scalable, Instructable, Multiworld Agent (SIMA) project tackles this by training agents to follow free-form instructio… ▽ More

    Submitted 17 April, 2024; v1 submitted 13 March, 2024; originally announced April 2024.

  2. arXiv:2301.05747  [pdf, other

    cs.CV cs.AI

    Laser: Latent Set Representations for 3D Generative Modeling

    Authors: Pol Moreno, Adam R. Kosiorek, Heiko Strathmann, Daniel Zoran, Rosalia G. Schneider, Björn Winckler, Larisa Markeeva, Théophane Weber, Danilo J. Rezende

    Abstract: NeRF provides unparalleled fidelity of novel view synthesis: rendering a 3D scene from an arbitrary viewpoint. NeRF requires training on a large number of views that fully cover a scene, which limits its applicability. While these issues can be addressed by learning a prior over scenes in various forms, previous approaches have been either applied to overly simple scenes or struggling to render un… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: See https://laser-nv-paper.github.io/ for video results

  3. arXiv:2208.07698  [pdf, other

    stat.ML cs.LG

    Score-Based Diffusion meets Annealed Importance Sampling

    Authors: Arnaud Doucet, Will Grathwohl, Alexander G. D. G. Matthews, Heiko Strathmann

    Abstract: More than twenty years after its introduction, Annealed Importance Sampling (AIS) remains one of the most effective methods for marginal likelihood estimation. It relies on a sequence of distributions interpolating between a tractable initial distribution and the target distribution of interest which we simulate from approximately using a non-homogeneous Markov chain. To obtain an importance sampl… ▽ More

    Submitted 24 October, 2022; v1 submitted 16 August, 2022; originally announced August 2022.

    Comments: accepted at NeurIPS 2022

  4. arXiv:2107.10731  [pdf, other

    cs.LG stat.CO stat.ML

    Neural Variational Gradient Descent

    Authors: Lauro Langosco di Langosco, Vincent Fortuin, Heiko Strathmann

    Abstract: Particle-based approximate Bayesian inference approaches such as Stein Variational Gradient Descent (SVGD) combine the flexibility and convergence guarantees of sampling methods with the computational benefits of variational inference. In practice, SVGD relies on the choice of an appropriate kernel function, which impacts its ability to model the target distribution -- a challenging problem with o… ▽ More

    Submitted 29 July, 2021; v1 submitted 22 July, 2021; originally announced July 2021.

  5. arXiv:2104.00587  [pdf, other

    stat.ML cs.LG

    NeRF-VAE: A Geometry Aware 3D Scene Generative Model

    Authors: Adam R. Kosiorek, Heiko Strathmann, Daniel Zoran, Pol Moreno, Rosalia Schneider, Soňa Mokrá, Danilo J. Rezende

    Abstract: We propose NeRF-VAE, a 3D scene generative model that incorporates geometric structure via NeRF and differentiable volume rendering. In contrast to NeRF, our model takes into account shared structure across scenes, and is able to infer the structure of a novel scene -- without the need to re-train -- using amortized inference. NeRF-VAE's explicit 3D rendering process further contrasts previous gen… ▽ More

    Submitted 1 April, 2021; originally announced April 2021.

    Comments: 17 pages, 15 figures, under review

  6. arXiv:2103.01043  [pdf, other

    cs.LG cs.AI cs.SI stat.ML

    Persistent Message Passing

    Authors: Heiko Strathmann, Mohammadamin Barekatain, Charles Blundell, Petar Veličković

    Abstract: Graph neural networks (GNNs) are a powerful inductive bias for modelling algorithmic reasoning procedures and data structures. Their prowess was mainly demonstrated on tasks featuring Markovian dynamics, where querying any associated data structure depends only on its latest state. For many tasks of interest, however, it may be highly beneficial to support efficient data structure queries dependen… ▽ More

    Submitted 27 April, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: 7 pages, 2 figures. Published as a workshop paper at ICLR 2021 SimDL Workshop. Accepted at the ICLR 2021 Workshop on Geometrical and Topological Representation Learning

  7. arXiv:1901.08098  [pdf, other

    stat.ML cs.LG

    Meta-Learning Mean Functions for Gaussian Processes

    Authors: Vincent Fortuin, Heiko Strathmann, Gunnar Rätsch

    Abstract: When fitting Bayesian machine learning models on scarce data, the main challenge is to obtain suitable prior knowledge and encode it into the model. Recent advances in meta-learning offer powerful methods for extracting such prior knowledge from data acquired in related tasks. When it comes to meta-learning in Gaussian process models, approaches in this setting have mostly focused on learning the… ▽ More

    Submitted 14 February, 2020; v1 submitted 23 January, 2019; originally announced January 2019.

  8. arXiv:1811.08357  [pdf, other

    stat.ML cs.LG stat.ME

    Learning deep kernels for exponential family densities

    Authors: Li Wenliang, Danica J. Sutherland, Heiko Strathmann, Arthur Gretton

    Abstract: The kernel exponential family is a rich class of distributions, which can be fit efficiently and with statistical guarantees by score matching. Being required to choose a priori a simple kernel such as the Gaussian, however, limits its practical applicability. We provide a scheme for learning a kernel parameterized by a deep network, which can find complex location-dependent local features of the… ▽ More

    Submitted 14 January, 2021; v1 submitted 20 November, 2018; originally announced November 2018.

    Journal ref: Proceedings of the 36th International Conference on Machine Learning (ICML 2019), PMLR 97:6737-6746

  9. arXiv:1810.10368  [pdf, other

    stat.ML cs.AI cs.LG

    Scalable Gaussian Processes on Discrete Domains

    Authors: Vincent Fortuin, Gideon Dresdner, Heiko Strathmann, Gunnar Rätsch

    Abstract: Kernel methods on discrete domains have shown great promise for many challenging data types, for instance, biological sequence data and molecular structure data. Scalable kernel methods like Support Vector Machines may offer good predictive performances but do not intrinsically provide uncertainty estimates. In contrast, probabilistic kernel methods like Gaussian Processes offer uncertainty estima… ▽ More

    Submitted 26 May, 2021; v1 submitted 24 October, 2018; originally announced October 2018.

    Comments: Published at IEEE Access

  10. arXiv:1806.02199  [pdf, other

    cs.LG stat.ML

    SOM-VAE: Interpretable Discrete Representation Learning on Time Series

    Authors: Vincent Fortuin, Matthias Hüser, Francesco Locatello, Heiko Strathmann, Gunnar Rätsch

    Abstract: High-dimensional time series are common in many domains. Since human cognition is not optimized to work well in high-dimensional spaces, these areas could benefit from interpretable low-dimensional representations. However, most representation learning algorithms for time series data are difficult to interpret. This is due to non-intuitive map**s from data features to salient properties of the r… ▽ More

    Submitted 4 January, 2019; v1 submitted 6 June, 2018; originally announced June 2018.

    Comments: Accepted for publication at the Seventh International Conference on Learning Representations (ICLR 2019)

  11. arXiv:1705.08360  [pdf, other

    stat.ML cs.LG stat.ME

    Efficient and principled score estimation with Nyström kernel exponential families

    Authors: Danica J. Sutherland, Heiko Strathmann, Michael Arbel, Arthur Gretton

    Abstract: We propose a fast method with statistical guarantees for learning an exponential family density model where the natural parameter is in a reproducing kernel Hilbert space, and may be infinite-dimensional. The model is learned by fitting the derivative of the log density, the score, thus avoiding the need to compute a normalization constant. Our approach improves the computational efficiency of an… ▽ More

    Submitted 14 January, 2021; v1 submitted 23 May, 2017; originally announced May 2017.

    Journal ref: Proceedings of the Twenty-First International Conference on Artificial Intelligence and Statistics (AISTATS 2018), PMLR 84:652-660

  12. arXiv:1611.04488  [pdf, other

    stat.ML cs.AI cs.LG cs.NE stat.ME

    Generative Models and Model Criticism via Optimized Maximum Mean Discrepancy

    Authors: Danica J. Sutherland, Hsiao-Yu Tung, Heiko Strathmann, Soumyajit De, Aaditya Ramdas, Alex Smola, Arthur Gretton

    Abstract: We propose a method to optimize the representation and distinguishability of samples from two probability distributions, by maximizing the estimated power of a statistical test based on the maximum mean discrepancy (MMD). This optimized MMD is applied to the setting of unsupervised learning by generative adversarial networks (GAN), in which a model attempts to generate realistic samples, and a dis… ▽ More

    Submitted 14 January, 2021; v1 submitted 14 November, 2016; originally announced November 2016.

    Comments: Published at ICLR 2017 (public comments: http://openreview.net/forum?id=HJWHIKqgl )

  13. arXiv:1501.03326  [pdf, ps, other

    stat.ML cs.LG stat.ME

    Unbiased Bayes for Big Data: Paths of Partial Posteriors

    Authors: Heiko Strathmann, Dino Sejdinovic, Mark Girolami

    Abstract: A key quantity of interest in Bayesian inference are expectations of functions with respect to a posterior distribution. Markov Chain Monte Carlo is a fundamental tool to consistently compute these expectations via averaging samples drawn from an approximate posterior. However, its feasibility is being challenged in the era of so called Big Data as all data needs to be processed in every iteration… ▽ More

    Submitted 9 February, 2015; v1 submitted 14 January, 2015; originally announced January 2015.

    Comments: 18 pages, 10 figures

  14. arXiv:1307.5302  [pdf, ps, other

    stat.ML cs.LG

    Kernel Adaptive Metropolis-Hastings

    Authors: Dino Sejdinovic, Heiko Strathmann, Maria Lomeli Garcia, Christophe Andrieu, Arthur Gretton

    Abstract: A Kernel Adaptive Metropolis-Hastings algorithm is introduced, for the purpose of sampling from a target distribution with strongly nonlinear support. The algorithm embeds the trajectory of the Markov chain into a reproducing kernel Hilbert space (RKHS), such that the feature space covariance of the samples informs the choice of proposal. The procedure is computationally efficient and straightforw… ▽ More

    Submitted 12 June, 2014; v1 submitted 19 July, 2013; originally announced July 2013.

    Comments: Proceedings of the 31st International Conference on Machine Learning, Bei**g, China, 2014; JMLR: W&CP volume 32(2)