Skip to main content

Showing 1–50 of 50 results for author: Hauberg, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04739  [pdf, other

    cs.LG stat.ML

    A survey and benchmark of high-dimensional Bayesian optimization of discrete sequences

    Authors: Miguel González-Duque, Richard Michael, Simon Bartels, Yevgen Zainchkovskyy, Søren Hauberg, Wouter Boomsma

    Abstract: Optimizing discrete black-box functions is key in several domains, e.g. protein engineering and drug design. Due to the lack of gradient information and the need for sample efficiency, Bayesian optimization is an ideal candidate for these tasks. Several methods for high-dimensional continuous and categorical Bayesian optimization have been proposed recently. However, our survey of the field reveal… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  2. arXiv:2406.03334  [pdf, other

    cs.LG stat.ML

    Reparameterization invariance in approximate Bayesian inference

    Authors: Hrittik Roy, Marco Miani, Carl Henrik Ek, Philipp Hennig, Marvin Pförtner, Lukas Tatzel, Søren Hauberg

    Abstract: Current approximate posteriors in Bayesian neural networks (BNNs) exhibit a crucial limitation: they fail to maintain invariance under reparameterization, i.e. BNNs assign different posterior densities to different parametrizations of identical functions. This creates a fundamental flaw in the application of Bayesian principles as it breaks the correspondence between uncertainty over the parameter… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  3. arXiv:2405.17277  [pdf, other

    cs.LG math.NA stat.ML

    Gradients of Functions of Large Matrices

    Authors: Nicholas Krämer, Pablo Moreno-Muñoz, Hrittik Roy, Søren Hauberg

    Abstract: Tuning scientific and probabilistic machine learning models -- for example, partial differential equations, Gaussian processes, or Bayesian neural networks -- often relies on evaluating functions of matrices whose size grows with the data set or the number of parameters. While the state-of-the-art for evaluating these quantities is almost always based on Lanczos and Arnoldi iterations, the present… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  4. arXiv:2404.17452  [pdf, other

    cs.LG stat.ML

    A Continuous Relaxation for Discrete Bayesian Optimization

    Authors: Richard Michael, Simon Bartels, Miguel González-Duque, Yevgen Zainchkovskyy, Jes Frellsen, Søren Hauberg, Wouter Boomsma

    Abstract: To optimize efficiently over discrete data and with only few available target observations is a challenge in Bayesian optimization. We propose a continuous relaxation of the objective function and show that inference and optimization can be computationally tractable. We consider in particular the optimization domain where very few observations and strict budgets exist; motivated by optimizing prot… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  5. arXiv:2403.01666  [pdf, other

    cs.LG cs.CV

    Improving Adversarial Energy-Based Model via Diffusion Process

    Authors: Cong Geng, Tian Han, Peng-Tao Jiang, Hao Zhang, **wei Chen, Søren Hauberg, Bo Li

    Abstract: Generative models have shown strong generation ability while efficient likelihood estimation is less explored. Energy-based models~(EBMs) define a flexible energy function to parameterize unnormalized densities efficiently but are notorious for being difficult to train. Adversarial EBMs introduce a generator to form a minimax training game to avoid expensive MCMC sampling used in traditional EBMs,… ▽ More

    Submitted 8 June, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

  6. arXiv:2401.09352  [pdf, other

    cs.RO cs.AI cs.LG

    Neural Contractive Dynamical Systems

    Authors: Hadi Beik-Mohammadi, Søren Hauberg, Georgios Arvanitidis, Nadia Figueroa, Gerhard Neumann, Leonel Rozo

    Abstract: Stability guarantees are crucial when ensuring a fully autonomous robot does not take undesirable or potentially harmful actions. Unfortunately, global stability guarantees are hard to provide in dynamical systems learned from data, especially when the learned dynamics are governed by neural networks. We propose a novel methodology to learn neural contractive dynamical systems, where our neural ar… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  7. arXiv:2308.16900  [pdf, other

    cs.LG

    Learning to Taste: A Multimodal Wine Dataset

    Authors: Thoranna Bender, Simon Moe Sørensen, Alireza Kashani, K. Eldjarn Hjorleifsson, Grethe Hyldig, Søren Hauberg, Serge Belongie, Frederik Warburg

    Abstract: We present WineSensed, a large multimodal wine dataset for studying the relations between visual perception, language, and flavor. The dataset encompasses 897k images of wine labels and 824k reviews of wines curated from the Vivino platform. It has over 350k unique bottlings, annotated with year, region, rating, alcohol percentage, price, and grape composition. We obtained fine-grained flavor anno… ▽ More

    Submitted 15 January, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: Accepted to NeurIPS 2023. See project page: https://thoranna.github.io/learning_to_taste/

  8. arXiv:2307.10895  [pdf, other

    cs.CV cs.LG

    Variational Autoencoding of Dental Point Clouds

    Authors: Johan Ziruo Ye, Thomas Ørkild, Peter Lempel Søndergaard, Søren Hauberg

    Abstract: Digital dentistry has made significant advancements, yet numerous challenges remain. This paper introduces the FDI 16 dataset, an extensive collection of tooth meshes and point clouds. Additionally, we present a novel approach: Variational FoldingNet (VF-Net), a fully probabilistic variational autoencoder designed for point clouds. Notably, prior latent variable models for point clouds lack a one-… ▽ More

    Submitted 31 January, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

  9. arXiv:2306.07158  [pdf, other

    stat.ML cs.LG stat.ME

    Riemannian Laplace approximations for Bayesian neural networks

    Authors: Federico Bergamin, Pablo Moreno-Muñoz, Søren Hauberg, Georgios Arvanitidis

    Abstract: Bayesian neural networks often approximate the weight-posterior with a Gaussian distribution. However, practical posteriors are often, even locally, highly non-Gaussian, and empirical performance deteriorates. We propose a simple parametric approximate posterior that adapts to the shape of the true posterior through a Riemannian metric that is determined by the log-posterior gradient. We develop a… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: 28 pages, 12 figures. Under submission

  10. arXiv:2306.00520  [pdf, other

    stat.ML cs.LG

    On Masked Pre-training and the Marginal Likelihood

    Authors: Pablo Moreno-Muñoz, Pol G. Recasens, Søren Hauberg

    Abstract: Masked pre-training removes random input dimensions and learns a model that can predict the missing values. Empirical results indicate that this intuitive form of self-supervised learning yields models that generalize very well to new domains. A theoretical understanding is, however, lacking. This paper shows that masked pre-training with a suitable cumulative scoring function corresponds to maxim… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  11. arXiv:2303.13123  [pdf, other

    cs.CV cs.LG

    Laplacian Segmentation Networks: Improved Epistemic Uncertainty from Spatial Aleatoric Uncertainty

    Authors: Kilian Zepf, Selma Wanna, Marco Miani, Juston Moore, Jes Frellsen, Søren Hauberg, Aasa Feragen, Frederik Warburg

    Abstract: Out of distribution (OOD) medical images are frequently encountered, e.g. because of site- or scanner differences, or image corruption. OOD images come with a risk of incorrect image segmentation, potentially negatively affecting downstream diagnoses or treatment. To ensure robustness to such incorrect segmentations, we propose Laplacian Segmentation Networks (LSN) that jointly model epistemic (mo… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  12. arXiv:2302.01332  [pdf, other

    cs.LG cs.CV

    Bayesian Metric Learning for Uncertainty Quantification in Image Retrieval

    Authors: Frederik Warburg, Marco Miani, Silas Brack, Soren Hauberg

    Abstract: We propose the first Bayesian encoder for metric learning. Rather than relying on neural amortization as done in prior works, we learn a distribution over the network weights with the Laplace Approximation. We actualize this by first proving that the contrastive loss is a valid log-posterior. We then propose three methods that ensure a positive definite Hessian. Lastly, we present a novel decompos… ▽ More

    Submitted 4 February, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: Code: https://github.com/FrederikWarburg/bayesian-metric-learning

  13. arXiv:2212.10010  [pdf, other

    cs.LG

    Identifying latent distances with Finslerian geometry

    Authors: Alison Pouplin, David Eklund, Carl Henrik Ek, Søren Hauberg

    Abstract: Riemannian geometry provides us with powerful tools to explore the latent space of generative models while preserving the underlying structure of the data. The latent space can be equipped it with a Riemannian metric, pulled back from the data manifold. With this metric, we can systematically navigate the space relying on geodesics defined as the shortest curves between two points. Generative mode… ▽ More

    Submitted 11 October, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: 36 pages, 12 figures, accepted at TMLR (October 2023)

  14. arXiv:2211.05698  [pdf, other

    stat.ML cs.LG

    Probabilistic thermal stability prediction through sparsity promoting transformer representation

    Authors: Yevgen Zainchkovskyy, Jesper Ferkinghoff-Borg, Anja Bennett, Thomas Egebjerg, Nikolai Lorenzen, Per Jr. Greisen, Søren Hauberg, Carsten Stahlhut

    Abstract: Pre-trained protein language models have demonstrated significant applicability in different protein engineering task. A general usage of these pre-trained transformer models latent representation is to use a mean pool across residue positions to reduce the feature dimensions to further downstream tasks such as predicting bio-physics properties or other functional behaviours. In this paper we prov… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

  15. arXiv:2209.04636  [pdf, other

    stat.ML cs.LG

    Revisiting Active Sets for Gaussian Process Decoders

    Authors: Pablo Moreno-Muñoz, Cilie W Feldager, Søren Hauberg

    Abstract: Decoders built on Gaussian processes (GPs) are enticing due to the marginalisation over the non-linear function space. Such models (also known as GP-LVMs) are often expensive and notoriously difficult to train in practice, but can be scaled using variational inference and inducing points. In this paper, we revisit active set approximations. We develop a new stochastic estimate of the log-marginal… ▽ More

    Submitted 24 November, 2022; v1 submitted 10 September, 2022; originally announced September 2022.

    Comments: Accepted at Advances in Neural Information Processing Systems (NeurIPS) 2022

  16. arXiv:2206.15078  [pdf, other

    cs.LG

    Laplacian Autoencoders for Learning Stochastic Representations

    Authors: Marco Miani, Frederik Warburg, Pablo Moreno-Muñoz, Nicke Skafte Detlefsen, Søren Hauberg

    Abstract: Established methods for unsupervised representation learning such as variational autoencoders produce none or poorly calibrated uncertainty estimates making it difficult to evaluate if learned representations are stable and reliable. In this work, we present a Bayesian autoencoder for unsupervised representation learning, which is trained using a novel variational lower-bound of the autoencoder ev… ▽ More

    Submitted 23 August, 2022; v1 submitted 30 June, 2022; originally announced June 2022.

  17. ICLR 2022 Challenge for Computational Geometry and Topology: Design and Results

    Authors: Adele Myers, Saiteja Utpala, Shubham Talbar, Sophia Sanborn, Christian Shewmake, Claire Donnat, Johan Mathe, Umberto Lupo, Rishi Sonthalia, Xinyue Cui, Tom Szwagier, Arthur Pignet, Andri Bergsson, Soren Hauberg, Dmitriy Nielsen, Stefan Sommer, David Klindt, Erik Hermansen, Melvin Vaupel, Benjamin Dunn, Jeffrey Xiong, Noga Aharony, Itsik Pe'er, Felix Ambellan, Martin Hanik , et al. (3 additional authors not shown)

    Abstract: This paper presents the computational challenge on differential geometry and topology that was hosted within the ICLR 2022 workshop ``Geometric and Topological Representation Learning". The competition asked participants to provide implementations of machine learning algorithms on manifolds that would respect the API of the open-source software Geomstats (manifold part) and Scikit-Learn (machine l… ▽ More

    Submitted 26 June, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

  18. arXiv:2206.01552  [pdf, other

    cs.LG

    Is an encoder within reach?

    Authors: Helene Hauschultz, Rasmus Berg Palm. Pablo Moreno-Muños, Nicki Skafte Detlefsen, Andrew Allan du Plessis, Søren Hauberg

    Abstract: The encoder network of an autoencoder is an approximation of the nearest point projection onto the manifold spanned by the decoder. A concern with this approximation is that, while the output of the encoder is always unique, the projection can possibly have infinitely many values. This implies that the latent representations learned by the autoencoder can be misleading. Borrowing from geometric me… ▽ More

    Submitted 25 October, 2022; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: 11 pages, 10 figures

    MSC Class: 68T07 (Primary) 57Z25 (Secondary)

  19. arXiv:2206.00106  [pdf, other

    cs.LG

    Mario Plays on a Manifold: Generating Functional Content in Latent Space through Differential Geometry

    Authors: Miguel González-Duque, Rasmus Berg Palm, Søren Hauberg, Sebastian Risi

    Abstract: Deep generative models can automatically create content of diverse types. However, there are no guarantees that such content will satisfy the criteria necessary to present it to end-users and be functional, e.g. the generated levels could be unsolvable or incoherent. In this paper we study this problem from a geometric perspective, and provide a method for reliable interpolation and random walks i… ▽ More

    Submitted 31 May, 2022; originally announced June 2022.

    Comments: Accepted at CoG 2022

  20. arXiv:2203.09253  [pdf, other

    cs.LG cs.HC

    Visualizing Riemannian data with Rie-SNE

    Authors: Andri Bergsson, Søren Hauberg

    Abstract: Faithful visualizations of data residing on manifolds must take the underlying geometry into account when producing a flat planar view of the data. In this paper, we extend the classic stochastic neighbor embedding (SNE) algorithm to data on general Riemannian manifolds. We replace standard Gaussian assumptions with Riemannian diffusion counterparts and propose an efficient approximation that only… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: 7 pages, 4 figures

  21. arXiv:2203.07761  [pdf, other

    cs.RO cs.AI cs.LG

    Reactive Motion Generation on Learned Riemannian Manifolds

    Authors: Hadi Beik-Mohammadi, Søren Hauberg, Georgios Arvanitidis, Gerhard Neumann, Leonel Rozo

    Abstract: In recent decades, advancements in motion learning have enabled robots to acquire new skills and adapt to unseen conditions in both structured and unstructured environments. In practice, motion learning methods capture relevant patterns and adjust them to new conditions such as dynamic obstacle avoidance or variable targets. In this paper, we investigate the robot motion learning paradigm from a R… ▽ More

    Submitted 17 August, 2023; v1 submitted 15 March, 2022; originally announced March 2022.

  22. arXiv:2203.01097  [pdf, other

    stat.ML cs.LG

    Model-agnostic out-of-distribution detection using combined statistical tests

    Authors: Federico Bergamin, Pierre-Alexandre Mattei, Jakob D. Havtorn, Hugo Senetaire, Hugo Schmutz, Lars Maaløe, Søren Hauberg, Jes Frellsen

    Abstract: We present simple methods for out-of-distribution detection using a trained generative model. These techniques, based on classical statistical tests, are model-agnostic in the sense that they can be applied to any differentiable generative model. The idea is to combine a classical parametric test (Rao's score test) with the recently introduced typicality test. These two test statistics are both th… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: Accepted at the 25th International Conference on Artificial Intelligence and Statistics (AISTATS), 2022

  23. arXiv:2202.12707  [pdf, other

    eess.AS cs.AI cs.LG cs.SD stat.ML

    Benchmarking Generative Latent Variable Models for Speech

    Authors: Jakob D. Havtorn, Lasse Borgholt, Søren Hauberg, Jes Frellsen, Lars Maaløe

    Abstract: Stochastic latent variable models (LVMs) achieve state-of-the-art performance on natural image generation but are still inferior to deterministic models on speech. In this paper, we develop a speech benchmark of popular temporal LVMs and compare them against state-of-the-art deterministic models. We report the likelihood, which is a much used metric in the image domain, but rarely, or incomparably… ▽ More

    Submitted 5 April, 2022; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: Accepted at the 2022 ICLR workshop on Deep Generative Models for Highly Structured Data (https://deep-gen-struct.github.io)

  24. arXiv:2202.10769  [pdf, other

    cs.LG

    Adaptive Cholesky Gaussian Processes

    Authors: Simon Bartels, Kristoffer Stensbo-Smidt, Pablo Moreno-Muñoz, Wouter Boomsma, Jes Frellsen, Søren Hauberg

    Abstract: We present a method to approximate Gaussian process regression models for large datasets by considering only a subset of the data. Our approach is novel in that the size of the subset is selected on the fly during exact inference with little computational overhead. From an empirical observation that the log-marginal likelihood often exhibits a linear trend once a sufficient subset of a dataset has… ▽ More

    Submitted 23 February, 2023; v1 submitted 22 February, 2022; originally announced February 2022.

    Journal ref: Proceedings of The 26th International Conference on Artificial Intelligence and Statistics (2023)

  25. arXiv:2202.01821  [pdf, other

    cs.CV cs.RO

    Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization

    Authors: Andrea Vallone, Frederik Warburg, Hans Hansen, Søren Hauberg, Javier Civera

    Abstract: Place recognition and visual localization are particularly challenging in wide baseline configurations. In this paper, we contribute with the \emph{Danish Airs and Grounds} (DAG) dataset, a large collection of street-level and aerial images targeting such cases. Its main challenge lies in the extreme viewing-angle difference between query and reference images with consequent changes in illuminatio… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

    Comments: Submitted to RA-L (IROS)

  26. arXiv:2201.05890  [pdf, other

    cs.LG stat.ML

    Robust uncertainty estimates with out-of-distribution pseudo-inputs training

    Authors: Pierre Segonne, Yevgen Zainchkovskyy, Søren Hauberg

    Abstract: Probabilistic models often use neural networks to control their predictive uncertainty. However, when making out-of-distribution (OOD)} predictions, the often-uncontrollable extrapolation properties of neural networks yield poor uncertainty predictions. Such models then don't know what they don't know, which directly limits their robustness w.r.t unexpected inputs. To counter this, we propose to e… ▽ More

    Submitted 15 January, 2022; originally announced January 2022.

  27. arXiv:2111.00929  [pdf, other

    cs.LG stat.ML

    Bounds all around: training energy-based models with bidirectional bounds

    Authors: Cong Geng, Jia Wang, Zhiyong Gao, Jes Frellsen, Søren Hauberg

    Abstract: Energy-based models (EBMs) provide an elegant framework for density estimation, but they are notoriously difficult to train. Recent work has established links to generative adversarial networks, where the EBM is trained through a minimax game with a variational value function. We propose a bidirectional bound on the EBM log-likelihood, such that we maximize a lower bound and minimize an upper boun… ▽ More

    Submitted 2 November, 2021; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: This paper has been accepted by NeurIPS 2021

  28. arXiv:2106.05367  [pdf, other

    cs.LG stat.ML

    Pulling back information geometry

    Authors: Georgios Arvanitidis, Miguel González-Duque, Alison Pouplin, Dimitris Kalatzis, Søren Hauberg

    Abstract: Latent space geometry has shown itself to provide a rich and rigorous framework for interacting with the latent variables of deep generative models. The existing theory, however, relies on the decoder being a Gaussian distribution as its simple reparametrization allows us to interpret the generating process as a random projection of a deterministic manifold. Consequently, this approach breaks down… ▽ More

    Submitted 23 April, 2022; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: Presented at AISTATS 2022

  29. arXiv:2106.04315  [pdf, other

    cs.RO cs.AI cs.LG

    Learning Riemannian Manifolds for Geodesic Motion Skills

    Authors: Hadi Beik-Mohammadi, Søren Hauberg, Georgios Arvanitidis, Gerhard Neumann, Leonel Rozo

    Abstract: For robots to work alongside humans and perform in unstructured environments, they must learn new motion skills and adapt them to unseen situations on the fly. This demands learning models that capture relevant motion patterns, while offering enough flexibility to adapt the encoded skills to new requirements, such as dynamic obstacle avoidance. We introduce a Riemannian manifold perspective on thi… ▽ More

    Submitted 1 July, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

  30. arXiv:2106.03500  [pdf, other

    cs.LG stat.ML

    Density estimation on smooth manifolds with normalizing flows

    Authors: Dimitris Kalatzis, Johan Ziruo Ye, Alison Pouplin, Jesper Wohlert, Søren Hauberg

    Abstract: We present a framework for learning probability distributions on topologically non-trivial manifolds, utilizing normalizing flows. Current methods focus on manifolds that are homeomorphic to Euclidean space, enforce strong structural priors on the learned models or use operations that do not easily scale to high dimensions. In contrast, our method learns distributions on a data manifold by "gluing… ▽ More

    Submitted 9 July, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

  31. arXiv:2102.08248  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Hierarchical VAEs Know What They Don't Know

    Authors: Jakob D. Havtorn, Jes Frellsen, Søren Hauberg, Lars Maaløe

    Abstract: Deep generative models have been demonstrated as state-of-the-art density estimators. Yet, recent work has found that they often assign a higher likelihood to data from outside the training distribution. This seemingly paradoxical behavior has caused concerns over the quality of the attained density estimates. In the context of hierarchical variational autoencoders, we provide evidence to explain… ▽ More

    Submitted 18 January, 2022; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: Appeared in Proceedings of the 38th International Conference on Machine Learning (ICML 2021). 18 pages, source code available at https://github.com/JakobHavtorn/hvae-oodd, https://github.com/vlievin/biva-pytorch and https://github.com/larsmaaloee/BIVA

  32. arXiv:2012.02679  [pdf, other

    q-bio.BM cs.LG q-bio.QM

    What is a meaningful representation of protein sequences?

    Authors: Nicki Skafte Detlefsen, Søren Hauberg, Wouter Boomsma

    Abstract: How we choose to represent our data has a fundamental impact on our ability to subsequently extract information from them. Machine learning promises to automatically determine efficient representations from large unstructured datasets, such as those arising in biology. However, empirical evidence suggests that seemingly minor changes to these machine learning models yield drastically different dat… ▽ More

    Submitted 7 March, 2022; v1 submitted 28 November, 2020; originally announced December 2020.

    Comments: 17 pages, 8 figures, 2 tables

    Journal ref: Nature Communications 13, 1914 (2022)

  33. arXiv:2011.12663  [pdf, other

    cs.CV

    Bayesian Triplet Loss: Uncertainty Quantification in Image Retrieval

    Authors: Frederik Warburg, Martin Jørgensen, Javier Civera, Søren Hauberg

    Abstract: Uncertainty quantification in image retrieval is crucial for downstream decisions, yet it remains a challenging and largely unexplored problem. Current methods for estimating uncertainties are poorly calibrated, computationally expensive, or based on heuristics. We present a new method that views image embeddings as stochastic features rather than deterministic features. Our two main contributions… ▽ More

    Submitted 17 September, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

    Journal ref: 2021 ICCV

  34. arXiv:2008.05552  [pdf, other

    stat.ML cs.LG

    Reparametrization Invariance in non-parametric Causal Discovery

    Authors: Martin Jørgensen, Søren Hauberg

    Abstract: Causal discovery estimates the underlying physical process that generates the observed data: does X cause Y or does Y cause X? Current methodologies use structural conditions to turn the causal query into a statistical query, when only observational data is available. But what if these statistical queries are sensitive to causal invariants? This study investigates one such invariant: the causal re… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

  35. arXiv:2008.00565  [pdf, other

    stat.ML cs.LG

    Geometrically Enriched Latent Spaces

    Authors: Georgios Arvanitidis, Søren Hauberg, Bernhard Schölkopf

    Abstract: A common assumption in generative models is that the generator immerses the latent space into a Euclidean ambient space. Instead, we consider the ambient space to be a Riemannian manifold, which allows for encoding domain knowledge through the associated Riemannian metric. Shortest paths can then be defined accordingly in the latent space to both follow the learned manifold and respect the ambient… ▽ More

    Submitted 2 August, 2020; originally announced August 2020.

  36. arXiv:2006.11741  [pdf, other

    stat.ML cs.LG

    Isometric Gaussian Process Latent Variable Model for Dissimilarity Data

    Authors: Martin Jørgensen, Søren Hauberg

    Abstract: We present a probabilistic model where the latent variable respects both the distances and the topology of the modeled data. The model leverages the Riemannian geometry of the generated manifold to endow the latent space with a well-defined stochastic distance measure, which is modeled locally as Nakagami distributions. These stochastic distances are sought to be as similar as possible to observed… ▽ More

    Submitted 8 June, 2021; v1 submitted 21 June, 2020; originally announced June 2020.

    Comments: ICML 2021

  37. arXiv:2004.03637  [pdf, other

    cs.LG stat.ML

    Probabilistic Spatial Transformer Networks

    Authors: Pola Schwöbel, Frederik Warburg, Martin Jørgensen, Kristoffer H. Madsen, Søren Hauberg

    Abstract: Spatial Transformer Networks (STNs) estimate image transformations that can improve downstream tasks by `zooming in' on relevant regions in an image. However, STNs are hard to train and sensitive to mis-predictions of transformations. To circumvent these limitations, we propose a probabilistic extension that estimates a stochastic transformation rather than a deterministic one. Marginalizing trans… ▽ More

    Submitted 15 June, 2022; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: UAI 2022

  38. arXiv:2002.05227  [pdf, other

    cs.LG stat.ML

    Variational Autoencoders with Riemannian Brownian Motion Priors

    Authors: Dimitris Kalatzis, David Eklund, Georgios Arvanitidis, Søren Hauberg

    Abstract: Variational Autoencoders (VAEs) represent the given data in a low-dimensional latent space, which is generally assumed to be Euclidean. This assumption naturally leads to the common choice of a standard Gaussian prior over continuous latent variables. Recent work has, however, shown that this prior has a detrimental effect on model capacity, leading to subpar performance. We propose that the Eucli… ▽ More

    Submitted 7 August, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: Published in ICML 2020

    Journal ref: Proceedings of the 37th International Conference on Machine Learning, Vienna, Austria, PMLR 119, 2020

  39. arXiv:1908.07377  [pdf, other

    cs.LG stat.ML

    Expected path length on random manifolds

    Authors: David Eklund, Søren Hauberg

    Abstract: Manifold learning seeks a low dimensional representation that faithfully captures the essence of data. Current methods can successfully learn such representations, but do not provide a meaningful set of operations that are associated with the representation. Working towards operational representation learning, we endow the latent space of a large class of generative models with a random Riemannian… ▽ More

    Submitted 20 August, 2019; originally announced August 2019.

  40. arXiv:1906.11881  [pdf, other

    cs.CV cs.LG stat.ML

    Explicit Disentanglement of Appearance and Perspective in Generative Models

    Authors: Nicki Skafte Detlefsen, Søren Hauberg

    Abstract: Disentangled representation learning finds compact, independent and easy-to-interpret factors of the data. Learning such has been shown to require an inductive bias, which we explicitly encode in a generative model of images. Specifically, we propose a model with two latent spaces: one that represents spatial transformations of the input data, and another that represents the transformed data. We f… ▽ More

    Submitted 13 November, 2019; v1 submitted 11 June, 2019; originally announced June 2019.

    Comments: 9 main pages + 2 pages references + 8 pages of supplementary material

  41. arXiv:1906.03260  [pdf, other

    stat.ML cs.LG

    Reliable training and estimation of variance networks

    Authors: Nicki S. Detlefsen, Martin Jørgensen, Søren Hauberg

    Abstract: We propose and investigate new complementary methodologies for estimating predictive variance networks in regression neural networks. We derive a locally aware mini-batching scheme that result in sparse robust gradients, and show how to make unbiased weight updates to a variance network. Further, we formulate a heuristic for robustly fitting both the mean and variance networks post hoc. Finally, w… ▽ More

    Submitted 4 November, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: Appeared at NeurIPS 2019

  42. arXiv:1901.07229  [pdf, other

    stat.ML cs.LG

    Fast and Robust Shortest Paths on Manifolds Learned from Data

    Authors: Georgios Arvanitidis, Søren Hauberg, Philipp Hennig, Michael Schober

    Abstract: We propose a fast, simple and robust algorithm for computing shortest paths and distances on Riemannian manifolds learned from data. This amounts to solving a system of ordinary differential equations (ODEs) subject to boundary conditions. Here standard solvers perform poorly because they require well-behaved Jacobians of the ODE, and usually, manifolds learned from data imply unstable and ill-con… ▽ More

    Submitted 22 January, 2019; originally announced January 2019.

    Comments: Accepted at Artificial Intelligence and Statistics (AISTATS) 2019

  43. arXiv:1809.04747  [pdf, other

    cs.LG stat.ML

    Geodesic Clustering in Deep Generative Models

    Authors: Tao Yang, Georgios Arvanitidis, Dongmei Fu, Xiaogang Li, Søren Hauberg

    Abstract: Deep generative models are tremendously successful in learning low-dimensional latent representations that well-describe the data. These representations, however, tend to much distort relationships between points, i.e. pairwise distances tend to not reflect semantic similarities well. This renders unsupervised tasks, such as clustering, difficult when working with the latent representations. We de… ▽ More

    Submitted 12 September, 2018; originally announced September 2018.

  44. arXiv:1806.04994  [pdf, other

    stat.ML cs.LG

    Only Bayes should learn a manifold (on the estimation of differential geometric structure from data)

    Authors: Søren Hauberg

    Abstract: We investigate learning of the differential geometric structure of a data manifold embedded in a high-dimensional Euclidean space. We first analyze kernel-based algorithms and show that under the usual regularizations, non-probabilistic methods cannot recover the differential geometric structure, but instead find mostly linear manifolds or spaces equipped with teleports. To properly learn the diff… ▽ More

    Submitted 26 September, 2019; v1 submitted 13 June, 2018; originally announced June 2018.

  45. arXiv:1805.09122  [pdf, other

    stat.ML cs.LG

    Probabilistic Riemannian submanifold learning with wrapped Gaussian process latent variable models

    Authors: Anton Mallasto, Søren Hauberg, Aasa Feragen

    Abstract: Latent variable models (LVMs) learn probabilistic models of data manifolds lying in an \emph{ambient} Euclidean space. In a number of applications, a priori known spatial constraints can shrink the ambient space into a considerably smaller manifold. Additionally, in these applications the Euclidean geometry might induce a suboptimal similarity measure, which could be improved by choosing a differe… ▽ More

    Submitted 24 February, 2019; v1 submitted 23 May, 2018; originally announced May 2018.

  46. arXiv:1702.01005  [pdf, other

    cs.LG cs.CV

    Intrinsic Grassmann Averages for Online Linear, Robust and Nonlinear Subspace Learning

    Authors: Rudrasis Chakraborty, Søren Hauberg, Baba C. Vemuri

    Abstract: Principal Component Analysis (PCA) and Kernel Principal Component Analysis (KPCA) are fundamental methods in machine learning for dimensionality reduction. The former is a technique for finding this approximation in finite dimensions and the latter is often in an infinite dimensional Reproducing Kernel Hilbert-space (RKHS). In this paper, we present a geometric framework for computing the principa… ▽ More

    Submitted 9 July, 2018; v1 submitted 3 February, 2017; originally announced February 2017.

  47. arXiv:1510.02795  [pdf, other

    cs.CV

    Dreaming More Data: Class-dependent Distributions over Diffeomorphisms for Learned Data Augmentation

    Authors: Søren Hauberg, Oren Freifeld, Anders Boesen Lindbo Larsen, John W. Fisher III, Lars Kai Hansen

    Abstract: Data augmentation is a key element in training high-dimensional models. In this approach, one synthesizes new observations by applying pre-specified transformations to the original training data; e.g.~new images are formed by rotating old ones. Current augmentation schemes, however, rely on manual specification of the applied transformations, making data augmentation an implicit form of feature en… ▽ More

    Submitted 30 June, 2016; v1 submitted 9 October, 2015; originally announced October 2015.

    Journal ref: Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, pp. 342-350, 2016

  48. arXiv:1411.7432  [pdf, other

    stat.ML cs.LG

    Metrics for Probabilistic Geometries

    Authors: Alessandra Tosi, Søren Hauberg, Alfredo Vellido, Neil D. Lawrence

    Abstract: We investigate the geometrical structure of probabilistic generative dimensionality reduction models using the tools of Riemannian geometry. We explicitly define a distribution over the natural metric given by the models. We provide the necessary algorithms to compute expected metric tensors where the distribution over map**s is given by a Gaussian process. We treat the corresponding latent vari… ▽ More

    Submitted 26 November, 2014; originally announced November 2014.

    Comments: UAI 2014

  49. arXiv:1411.0296  [pdf, other

    cs.LG cs.CV

    Geodesic Exponential Kernels: When Curvature and Linearity Conflict

    Authors: Aasa Feragen, Francois Lauze, Søren Hauberg

    Abstract: We consider kernel methods on general geodesic metric spaces and provide both negative and positive results. First we show that the common Gaussian kernel can only be generalized to a positive definite kernel on a geodesic metric space if the space is flat. As a result, for data on a Riemannian manifold, the geodesic Gaussian kernel is only positive definite if the Riemannian manifold is Euclidean… ▽ More

    Submitted 17 November, 2014; v1 submitted 2 November, 2014; originally announced November 2014.

    Comments: 13 pages

  50. arXiv:1306.0308  [pdf, other

    stat.ML cs.LG math.NA

    Probabilistic Solutions to Differential Equations and their Application to Riemannian Statistics

    Authors: Philipp Hennig, Søren Hauberg

    Abstract: We study a probabilistic numerical method for the solution of both boundary and initial value problems that returns a joint Gaussian process posterior over the solution. Such methods have concrete value in the statistics on Riemannian manifolds, where non-analytic ordinary differential equations are involved in virtually all computations. The probabilistic formulation permits marginalising the unc… ▽ More

    Submitted 12 February, 2014; v1 submitted 3 June, 2013; originally announced June 2013.

    Comments: 11 page (9 page conference paper, plus supplements)

    MSC Class: 65L05; 65L10; 58D17

    Journal ref: Proceedings of the 17th International Conference on Artificial Intelligence and Statistics (AISTATS) 2014, Reykjavik, Iceland. Journal of Machine Learning Research: W&CP volume 33