Skip to main content

Showing 1–48 of 48 results for author: Köthe, U

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.15104  [pdf, other

    cs.CR cs.CV

    Deciphering the Definition of Adversarial Robustness for post-hoc OOD Detectors

    Authors: Peter Lorenz, Mario Fernandez, Jens Müller, Ullrich Köthe

    Abstract: Detecting out-of-distribution (OOD) inputs is critical for safely deploying deep learning models in real-world scenarios. In recent years, many OOD detectors have been developed, and even the benchmarking has been standardized, i.e. OpenOOD. The number of post-hoc detectors is growing fast and showing an option to protect a pre-trained classifier against natural distribution shifts, claiming to be… ▽ More

    Submitted 28 June, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2406.03154  [pdf, other

    cs.LG cs.AI

    Detecting Model Misspecification in Amortized Bayesian Inference with Neural Networks: An Extended Investigation

    Authors: Marvin Schmitt, Paul-Christian Bürkner, Ullrich Köthe, Stefan T. Radev

    Abstract: Recent advances in probabilistic deep learning enable efficient amortized Bayesian inference in settings where the likelihood function is only implicitly defined by a simulation program (simulation-based inference; SBI). But how faithful is such inference if the simulation represents reality somewhat inaccurately, that is, if the true system behavior at test time deviates from the one seen during… ▽ More

    Submitted 6 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: Extended version of the conference paper https://doi.org/10.1007/978-3-031-54605-1_35. arXiv admin note: text overlap with arXiv:2112.08866

  3. DALSA: Domain Adaptation for Supervised Learning From Sparsely Annotated MR Images

    Authors: Michael Götz, Christian Weber, Franciszek Binczyk, Joanna Polanska, Rafal Tarnawski, Barbara Bobek-Billewicz, Ullrich Köthe, Jens Kleesiek, Bram Stieltjes, Klaus H. Maier-Hein

    Abstract: We propose a new method that employs transfer learning techniques to effectively correct sampling selection errors introduced by sparse annotations during supervised learning for automated tumor segmentation. The practicality of current learning-based automated tissue classification approaches is severely impeded by their dependency on manually segmented training databases that need to be recreate… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Journal ref: IEEE Transactions on Medical Imaging ( Volume: 35, Issue: 1, January 2016)

  4. arXiv:2402.06578  [pdf, other

    cs.LG stat.ML

    On the Universality of Coupling-based Normalizing Flows

    Authors: Felix Draxler, Stefan Wahl, Christoph Schnörr, Ullrich Köthe

    Abstract: We present a novel theoretical framework for understanding the expressive power of normalizing flows. Despite their prevalence in scientific applications, a comprehensive understanding of flows remains elusive due to their restricted architectures. Existing theorems fall short as they require the use of arbitrarily ill-conditioned neural networks, limiting practical applicability. We propose a dis… ▽ More

    Submitted 5 June, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: Proceedings of the 41 st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  5. arXiv:2312.10107  [pdf, other

    cs.LG cs.AI

    Towards Context-Aware Domain Generalization: Understanding the Benefits and Limits of Marginal Transfer Learning

    Authors: Jens Müller, Lars Kühmichel, Martin Rohbeck, Stefan T. Radev, Ullrich Köthe

    Abstract: In this work, we analyze the conditions under which information about the context of an input $X$ can improve the predictions of deep learning models in new domains. Following work in marginal transfer learning in Domain Generalization (DG), we formalize the notion of context as a permutation-invariant representation of a set of data points that originate from the same domain as the input itself.… ▽ More

    Submitted 21 February, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

  6. arXiv:2312.09852  [pdf, other

    cs.LG stat.ML

    Learning Distributions on Manifolds with Free-form Flows

    Authors: Peter Sorrenson, Felix Draxler, Armand Rousselot, Sander Hummerich, Ullrich Köthe

    Abstract: Many real world data, particularly in the natural sciences and computer vision, lie on known Riemannian manifolds such as spheres, tori or the group of rotation matrices. The predominant approaches to learning a distribution on such a manifold require solving a differential equation in order to sample from the model and evaluate densities. The resulting sampling times are slowed down by a high num… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Preprint, under review

  7. arXiv:2312.05440  [pdf, other

    cs.LG cs.AI stat.ML

    Consistency Models for Scalable and Fast Simulation-Based Inference

    Authors: Marvin Schmitt, Valentin Pratz, Ullrich Köthe, Paul-Christian Bürkner, Stefan T Radev

    Abstract: Simulation-based inference (SBI) is constantly in search of more expressive algorithms for accurately inferring the parameters of complex models from noisy data. We present consistency models for neural posterior estimation (CMPE), a new free-form conditional sampler for scalable, fast, and amortized SBI with generative neural networks. CMPE combines the advantages of normalizing flows and flow ma… ▽ More

    Submitted 27 February, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

  8. arXiv:2310.16624  [pdf, other

    cs.LG stat.ML

    Free-form Flows: Make Any Architecture a Normalizing Flow

    Authors: Felix Draxler, Peter Sorrenson, Lea Zimmermann, Armand Rousselot, Ullrich Köthe

    Abstract: Normalizing Flows are generative models that directly maximize the likelihood. Previously, the design of normalizing flows was largely constrained by the need for analytical invertibility. We overcome this constraint by a training procedure that uses an efficient estimator for the gradient of the change of variables formula. This enables any dimension-preserving neural network to serve as a genera… ▽ More

    Submitted 24 April, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: Camera-ready version: accepted at AISTATS 2024

  9. arXiv:2310.11122  [pdf, other

    stat.ML cs.LG stat.ME

    Sensitivity-Aware Amortized Bayesian Inference

    Authors: Lasse Elsemüller, Hans Olischläger, Marvin Schmitt, Paul-Christian Bürkner, Ullrich Köthe, Stefan T. Radev

    Abstract: Sensitivity analyses reveal the influence of various modeling choices on the outcomes of statistical analyses. While theoretically appealing, they are overwhelmingly inefficient for complex Bayesian models. In this work, we propose sensitivity-aware amortized Bayesian inference (SA-ABI), a multifaceted approach to efficiently integrate sensitivity analyses into simulation-based inference with neur… ▽ More

    Submitted 8 May, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

  10. arXiv:2310.04395  [pdf, other

    cs.LG cs.AI

    Leveraging Self-Consistency for Data-Efficient Amortized Bayesian Inference

    Authors: Marvin Schmitt, Desi R. Ivanova, Daniel Habermann, Ullrich Köthe, Paul-Christian Bürkner, Stefan T. Radev

    Abstract: We propose a method to improve the efficiency and accuracy of amortized Bayesian inference by leveraging universal symmetries in the joint probabilistic model of parameters and data. In a nutshell, we invert Bayes' theorem and estimate the marginal likelihood based on approximate representations of the joint model. Upon perfect approximation, the marginal likelihood is constant across all paramete… ▽ More

    Submitted 26 February, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: previously published as an extended abstract at NeurIPS UniReps 2023

  11. arXiv:2309.09764  [pdf, other

    cs.CV cs.LG eess.IV

    Application-driven Validation of Posteriors in Inverse Problems

    Authors: Tim J. Adler, Jan-Hinrich Nölke, Annika Reinke, Minu Dietlinde Tizabi, Sebastian Gruber, Dasha Trofimova, Lynton Ardizzone, Paul F. Jaeger, Florian Buettner, Ullrich Köthe, Lena Maier-Hein

    Abstract: Current deep learning-based solutions for image analysis tasks are commonly incapable of handling problems to which multiple different plausible solutions exist. In response, posterior-based methods such as conditional Diffusion Models and Invertible Neural Networks have emerged; however, their translation is hampered by a lack of research on adequate validation. In other words, the way progress i… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: Shared first authors: Tim J. Adler and Jan-Hinrich Nölke. 16 pages, 8 figures, 1 table

  12. arXiv:2308.02652  [pdf, other

    cs.LG

    A Review of Change of Variable Formulas for Generative Modeling

    Authors: Ullrich Köthe

    Abstract: Change-of-variables (CoV) formulas allow to reduce complicated probability densities to simpler ones by a learned transformation with tractable Jacobian determinant. They are thus powerful tools for maximum-likelihood learning, Bayesian inference, outlier detection, model selection, etc. CoV formulas have been derived for a large variety of model types, but this information is scattered over many… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  13. arXiv:2306.16015  [pdf, other

    cs.LG cs.AI stat.ML

    BayesFlow: Amortized Bayesian Workflows With Neural Networks

    Authors: Stefan T Radev, Marvin Schmitt, Lukas Schumacher, Lasse Elsemüller, Valentin Pratz, Yannik Schälte, Ullrich Köthe, Paul-Christian Bürkner

    Abstract: Modern Bayesian inference involves a mixture of computational techniques for estimating, validating, and drawing conclusions from probabilistic models as part of principled workflows for data analysis. Typical problems in Bayesian workflows are the approximation of intractable posterior distributions for diverse model types and the comparison of competing models of the same process in terms of the… ▽ More

    Submitted 10 July, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

  14. arXiv:2306.13520  [pdf, other

    cs.LG stat.ML

    On the Convergence Rate of Gaussianization with Random Rotations

    Authors: Felix Draxler, Lars Kühmichel, Armand Rousselot, Jens Müller, Christoph Schnörr, Ullrich Köthe

    Abstract: Gaussianization is a simple generative model that can be trained without backpropagation. It has shown compelling performance on low dimensional data. As the dimension increases, however, it has been observed that the convergence speed slows down. We show analytically that the number of required layers scales linearly with the dimension for Gaussian input. We argue that this is because the model i… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

  15. arXiv:2306.01843  [pdf, other

    cs.LG

    Lifting Architectural Constraints of Injective Flows

    Authors: Peter Sorrenson, Felix Draxler, Armand Rousselot, Sander Hummerich, Lea Zimmermann, Ullrich Köthe

    Abstract: Normalizing Flows explicitly maximize a full-dimensional likelihood on the training data. However, real data is typically only supported on a lower-dimensional manifold leading the model to expend significant compute on modeling noise. Injective Flows fix this by jointly learning a manifold and the distribution on it. So far, they have been limited by restrictive architectures and/or high computat… ▽ More

    Submitted 27 June, 2024; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: Camera-ready version: accepted to ICLR 2024

  16. Training Invertible Neural Networks as Autoencoders

    Authors: The-Gia Leo Nguyen, Lynton Ardizzone, Ullrich Köthe

    Abstract: Autoencoders are able to learn useful data representations in an unsupervised matter and have been widely used in various machine learning and computer vision tasks. In this work, we present methods to train Invertible Neural Networks (INNs) as (variational) autoencoders which we call INN (variational) autoencoders. Our experiments on MNIST, CIFAR and CelebA show that for low bottleneck sizes our… ▽ More

    Submitted 21 March, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: Conference Paper at GCPR2019

    ACM Class: I.5.1; I.4.10; I.4.2; I.4.5

    Journal ref: In: Fink, G., Frintrop, S., Jiang, X. (eds) Pattern Recognition. DAGM GCPR 2019. Lecture Notes in Computer Science, vol 11824. Springer, Cham

  17. Unsupervised Domain Transfer with Conditional Invertible Neural Networks

    Authors: Kris K. Dreher, Leonardo Ayala, Melanie Schellenberg, Marco Hübner, Jan-Hinrich Nölke, Tim J. Adler, Silvia Seidlitz, Jan Sellner, Alexander Studier-Fischer, Janek Gröhl, Felix Nickel, Ullrich Köthe, Alexander Seitel, Lena Maier-Hein

    Abstract: Synthetic medical image generation has evolved as a key technique for neural network training and validation. A core challenge, however, remains in the domain gap between simulations and real data. While deep learning-based domain transfer using Cycle Generative Adversarial Networks and similar architectures has led to substantial progress in the field, there are use cases in which state-of-the-ar… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

  18. arXiv:2303.09989  [pdf, other

    cs.LG stat.ML

    Finding Competence Regions in Domain Generalization

    Authors: Jens Müller, Stefan T. Radev, Robert Schmier, Felix Draxler, Carsten Rother, Ullrich Köthe

    Abstract: We investigate a "learning to reject" framework to address the problem of silent failures in Domain Generalization (DG), where the test distribution differs from the training distribution. Assuming a mild distribution shift, we wish to accept out-of-distribution (OOD) data from a new domain whenever a model's estimated competence foresees trustworthy responses, instead of rejecting OOD data outrig… ▽ More

    Submitted 21 June, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: The paper has been published at TMLR (see https://openreview.net/forum?id=TSy0vuwQFN)

    Journal ref: Transactions on Machine Learning Research (06/2023)

  19. arXiv:2302.09125  [pdf, other

    cs.LG stat.ML

    JANA: Jointly Amortized Neural Approximation of Complex Bayesian Models

    Authors: Stefan T. Radev, Marvin Schmitt, Valentin Pratz, Umberto Picchini, Ullrich Köthe, Paul-Christian Bürkner

    Abstract: This work proposes ``jointly amortized neural approximation'' (JANA) of intractable likelihood functions and posterior densities arising in Bayesian surrogate modeling and simulation-based inference. We train three complementary networks in an end-to-end fashion: 1) a summary network to compress individual data points, sets, or time series into informative embedding vectors; 2) a posterior network… ▽ More

    Submitted 20 June, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

  20. arXiv:2301.13462  [pdf, other

    physics.ao-ph cs.LG

    Towards Learned Emulation of Interannual Water Isotopologue Variations in General Circulation Models

    Authors: Jonathan Wider, Jakob Kruse, Nils Weitzel, Janica C. Bühler, Ullrich Köthe, Kira Rehfeld

    Abstract: Simulating abundances of stable water isotopologues, i.e. molecules differing in their isotopic composition, within climate models allows for comparisons with proxy data and, thus, for testing hypotheses about past climate and validating climate models under varying climatic conditions. However, many models are run without explicitly simulating water isotopologues. We investigate the possibility t… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

    Journal ref: Environmental Data Science, Volume 2 (2023), e35

  21. arXiv:2210.14032  [pdf, other

    cs.LG stat.ML

    Whitening Convergence Rate of Coupling-based Normalizing Flows

    Authors: Felix Draxler, Christoph Schnörr, Ullrich Köthe

    Abstract: Coupling-based normalizing flows (e.g. RealNVP) are a popular family of normalizing flow architectures that work surprisingly well in practice. This calls for theoretical understanding. Existing work shows that such flows weakly converge to arbitrary data distributions. However, they make no statement about the stricter convergence criterion used in practice, the maximum likelihood loss. For the f… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: Proceedings of 36th Conference on Neural Information Processing System (NeurIPS 2022)

  22. arXiv:2208.14024  [pdf, other

    cs.LG

    Positive Difference Distribution for Image Outlier Detection using Normalizing Flows and Contrastive Data

    Authors: Robert Schmier, Ullrich Köthe, Christoph-Nikolas Straehle

    Abstract: Detecting test data deviating from training data is a central problem for safe and robust machine learning. Likelihoods learned by a generative model, e.g., a normalizing flow via standard log-likelihood training, perform poorly as an outlier score. We propose to use an unlabelled auxiliary dataset and a probabilistic outlier score for outlier detection. We use a self-supervised feature extractor… ▽ More

    Submitted 26 April, 2023; v1 submitted 30 August, 2022; originally announced August 2022.

    Journal ref: Transactions on Machine Learning Research (04/2023)

  23. arXiv:2207.14625  [pdf, other

    cs.CR cs.CV cs.LG

    Content-Aware Differential Privacy with Conditional Invertible Neural Networks

    Authors: Malte Tölle, Ullrich Köthe, Florian André, Benjamin Meder, Sandy Engelhardt

    Abstract: Differential privacy (DP) has arisen as the gold standard in protecting an individual's privacy in datasets by adding calibrated noise to each data sample. While the application to categorical data is straightforward, its usability in the context of images has been limited. Contrary to categorical data the meaning of an image is inherent in the spatial correlation of neighboring pixels making the… ▽ More

    Submitted 29 July, 2022; originally announced July 2022.

    Comments: Accepted at 3rd DeCaF Workshop (MICCAI22)

    MSC Class: J.3 I.4.0 J.3 I.2.6

  24. arXiv:2203.16542  [pdf, other

    cs.CV

    Towards Multimodal Depth Estimation from Light Fields

    Authors: Titus Leistner, Radek Mackowiak, Lynton Ardizzone, Ullrich Köthe, Carsten Rother

    Abstract: Light field applications, especially light field rendering and depth estimation, developed rapidly in recent years. While state-of-the-art light field rendering methods handle semi-transparent and reflective objects well, depth estimation methods either ignore these cases altogether or only deliver a weak performance. We argue that this is due current methods only considering a single "true" depth… ▽ More

    Submitted 1 April, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

  25. arXiv:2202.00027  [pdf, other

    astro-ph.EP astro-ph.IM cs.LG

    Exoplanet Characterization using Conditional Invertible Neural Networks

    Authors: Jonas Haldemann, Victor Ksoll, Daniel Walter, Yann Alibert, Ralf S. Klessen, Willy Benz, Ullrich Koethe, Lynton Ardizzone, Carsten Rother

    Abstract: The characterization of an exoplanet's interior is an inverse problem, which requires statistical methods such as Bayesian inference in order to be solved. Current methods employ Markov Chain Monte Carlo (MCMC) sampling to infer the posterior probability of planetary structure parameters for a given exoplanet. These methods are time consuming since they require the calculation of a large number of… ▽ More

    Submitted 31 January, 2022; originally announced February 2022.

    Comments: 15 pages, 13 figures, submitted to Astronomy & Astrophysics

    Journal ref: A&A 672, A180 (2023)

  26. arXiv:2112.08866  [pdf, other

    stat.ME cs.LG stat.ML

    Detecting Model Misspecification in Amortized Bayesian Inference with Neural Networks

    Authors: Marvin Schmitt, Paul-Christian Bürkner, Ullrich Köthe, Stefan T. Radev

    Abstract: Neural density estimators have proven remarkably powerful in performing efficient simulation-based Bayesian inference in various research domains. In particular, the BayesFlow framework uses a two-step approach to enable amortized parameter estimation in settings where the likelihood function is implicitly defined by a simulation program. But how faithful is such inference when simulations are poo… ▽ More

    Submitted 8 November, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

  27. arXiv:2105.02104  [pdf, other

    cs.CV cs.AI

    Conditional Invertible Neural Networks for Diverse Image-to-Image Translation

    Authors: Lynton Ardizzone, Jakob Kruse, Carsten Lüth, Niels Bracher, Carsten Rother, Ullrich Köthe

    Abstract: We introduce a new architecture called a conditional invertible neural network (cINN), and use it to address the task of diverse image-to-image translation for natural images. This is not easily possible with existing INN models due to some fundamental limitations. The cINN combines the purely generative INN model with an unconstrained feed-forward network, which efficiently preprocesses the condi… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: text overlap with arXiv:1907.02392

    MSC Class: 68T01

  28. arXiv:2101.10763  [pdf, other

    cs.LG

    Benchmarking Invertible Architectures on Inverse Problems

    Authors: Jakob Kruse, Lynton Ardizzone, Carsten Rother, Ullrich Köthe

    Abstract: Recent work demonstrated that flow-based invertible neural networks are promising tools for solving ambiguous inverse problems. Following up on this, we investigate how ten invertible architectures and related models fare on two intuitive, low-dimensional benchmark problems, obtaining the best results with coupling layers and simple autoencoders. We hope that our initial efforts inspire other rese… ▽ More

    Submitted 22 June, 2021; v1 submitted 26 January, 2021; originally announced January 2021.

    MSC Class: 68T01

    Journal ref: Workshop on Invertible Neural Networks and Normalizing Flows (ICML 2019)

  29. arXiv:2012.08195  [pdf, other

    cs.CV cs.LG

    Representing Ambiguity in Registration Problems with Conditional Invertible Neural Networks

    Authors: Darya Trofimova, Tim Adler, Lisa Kausch, Lynton Ardizzone, Klaus Maier-Hein, Ulrich Köthe, Carsten Rother, Lena Maier-Hein

    Abstract: Image registration is the basis for many applications in the fields of medical image computing and computer assisted interventions. One example is the registration of 2D X-ray images with preoperative three-dimensional computed tomography (CT) images in intraoperative surgical guidance systems. Due to the high safety requirements in medical applications, estimating registration uncertainty is of a… ▽ More

    Submitted 15 December, 2020; originally announced December 2020.

    Comments: The paper got accepted at Medical Imaging Meets NeurIPS Workshop at Neural Information Processing Systems 2020

  30. arXiv:2011.05110  [pdf, other

    physics.med-ph cs.AI cs.LG

    Invertible Neural Networks for Uncertainty Quantification in Photoacoustic Imaging

    Authors: Jan-Hinrich Nölke, Tim Adler, Janek Gröhl, Thomas Kirchner, Lynton Ardizzone, Carsten Rother, Ullrich Köthe, Lena Maier-Hein

    Abstract: Multispectral photoacoustic imaging (PAI) is an emerging imaging modality which enables the recovery of functional tissue parameters such as blood oxygenation. However, the underlying inverse problems are potentially ill-posed, meaning that radically different tissue properties may - in theory - yield comparable measurements. In this work, we present a new approach for handling this specific type… ▽ More

    Submitted 23 November, 2020; v1 submitted 10 November, 2020; originally announced November 2020.

    Comments: 7 pages, 4 figures, submitted to "Bildverarbeitung für die Medizin (BVM) 2021"

  31. arXiv:2010.07167  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Robust Models Using The Principle of Independent Causal Mechanisms

    Authors: Jens Müller, Robert Schmier, Lynton Ardizzone, Carsten Rother, Ullrich Köthe

    Abstract: Standard supervised learning breaks down under data distribution shift. However, the principle of independent causal mechanisms (ICM, Peters et al. (2017)) can turn this weakness into an opportunity: one can take advantage of distribution shift between different environments during training in order to obtain more robust models. We propose a new gradient-based learning framework whose objective fu… ▽ More

    Submitted 8 February, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

  32. arXiv:2010.00300  [pdf, other

    stat.AP cs.LG q-bio.PE

    OutbreakFlow: Model-based Bayesian inference of disease outbreak dynamics with invertible neural networks and its application to the COVID-19 pandemics in Germany

    Authors: Stefan T. Radev, Frederik Graw, Simiao Chen, Nico T. Mutters, Vanessa M. Eichel, Till Bärnighausen, Ullrich Köthe

    Abstract: Mathematical models in epidemiology are an indispensable tool to determine the dynamics and important characteristics of infectious diseases. Apart from their scientific merit, these models are often used to inform political decisions and intervention measures during an ongoing outbreak. However, reliably inferring the dynamics of ongoing outbreaks by connecting complex models to real data is stil… ▽ More

    Submitted 2 November, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

  33. arXiv:2007.15036  [pdf, other

    cs.CV cs.LG

    Generative Classifiers as a Basis for Trustworthy Image Classification

    Authors: Radek Mackowiak, Lynton Ardizzone, Ullrich Köthe, Carsten Rother

    Abstract: With the maturing of deep learning systems, trustworthiness is becoming increasingly important for model assessment. We understand trustworthiness as the combination of explainability and robustness. Generative classifiers (GCs) are a promising class of models that are said to naturally accomplish these qualities. However, this has mostly been demonstrated on simple datasets such as MNIST and CIFA… ▽ More

    Submitted 2 December, 2020; v1 submitted 29 July, 2020; originally announced July 2020.

  34. arXiv:2004.10629  [pdf, other

    stat.ML cs.LG

    Amortized Bayesian model comparison with evidential deep learning

    Authors: Stefan T. Radev, Marco D'Alessandro, Ulf K. Mertens, Andreas Voss, Ullrich Köthe, Paul-Christian Bürkner

    Abstract: Comparing competing mathematical models of complex natural processes is a shared goal among many branches of science. The Bayesian probabilistic framework offers a principled way to perform model comparison and extract useful metrics for guiding decisions. However, many interesting models are intractable with standard Bayesian methods, as they lack a closed-form likelihood function or the likeliho… ▽ More

    Submitted 2 March, 2021; v1 submitted 22 April, 2020; originally announced April 2020.

  35. arXiv:2003.06281  [pdf, other

    stat.ML cs.LG

    BayesFlow: Learning complex stochastic models with invertible neural networks

    Authors: Stefan T. Radev, Ulf K. Mertens, Andreas Voss, Lynton Ardizzone, Ullrich Köthe

    Abstract: Estimating the parameters of mathematical models is a common problem in almost all branches of science. However, this problem can prove notably difficult when processes and model descriptions become increasingly complex and an explicit likelihood function is not available. With this work, we propose a novel method for globally amortized Bayesian inference based on invertible neural networks which… ▽ More

    Submitted 1 December, 2020; v1 submitted 13 March, 2020; originally announced March 2020.

  36. arXiv:2001.06448  [pdf, other

    cs.LG stat.ML

    Training Normalizing Flows with the Information Bottleneck for Competitive Generative Classification

    Authors: Lynton Ardizzone, Radek Mackowiak, Carsten Rother, Ullrich Köthe

    Abstract: The Information Bottleneck (IB) objective uses information theory to formulate a task-performance versus robustness trade-off. It has been successfully applied in the standard discriminative classification setting. We pose the question whether the IB can also be used to train generative likelihood models such as normalizing flows. Since normalizing flows use invertible network architectures (INNs)… ▽ More

    Submitted 12 January, 2021; v1 submitted 17 January, 2020; originally announced January 2020.

    MSC Class: 68T01

    Journal ref: Advances in Neural Information Processing Systems 33 Proceedings (NeurIPS 2020)

  37. arXiv:2001.04872  [pdf, other

    cs.LG stat.ML

    Disentanglement by Nonlinear ICA with General Incompressible-flow Networks (GIN)

    Authors: Peter Sorrenson, Carsten Rother, Ullrich Köthe

    Abstract: A central question of representation learning asks under which conditions it is possible to reconstruct the true latent variables of an arbitrarily complex generative process. Recent breakthrough work by Khemakhem et al. (2019) on nonlinear ICA has answered this question for a broad class of conditional generative processes. We extend this important result in a direction relevant for application t… ▽ More

    Submitted 14 January, 2020; originally announced January 2020.

    Comments: 23 pages, 15 figures, ICLR 2020

  38. arXiv:1911.01877  [pdf, other

    eess.IV cs.LG physics.med-ph stat.ML

    Out of distribution detection for intra-operative functional imaging

    Authors: Tim J. Adler, Leonardo Ayala, Lynton Ardizzone, Hannes G. Kenngott, Anant Vemuri, Beat P. Müller-Stich, Carsten Rother, Ullrich Köthe, Lena Maier-Hein

    Abstract: Multispectral optical imaging is becoming a key tool in the operating room. Recent research has shown that machine learning algorithms can be used to convert pixel-wise reflectance measurements to tissue parameters, such as oxygenation. However, the accuracy of these algorithms can only be guaranteed if the spectra acquired during surgery match the ones seen during training. It is therefore of gre… ▽ More

    Submitted 5 November, 2019; originally announced November 2019.

    Comments: The final authenticated version is available online at https://doi.org/10.1007/978-3-030-32689-0_8

    Journal ref: Proceedings of the First International Workshop on Uncertainty for Safe Utilization of Machine Learning in Medical Imaging, UNSURE 2019, and the 8th International Workshop on Clinical Image-Based Procedures, CLIP 2019

  39. arXiv:1909.10341  [pdf, other

    cs.CV eess.IV

    Object Segmentation using Pixel-wise Adversarial Loss

    Authors: Ricard Durall, Franz-Josef Pfreundt, Ullrich Köthe, Janis Keuper

    Abstract: Recent deep learning based approaches have shown remarkable success on object segmentation tasks. However, there is still room for further improvement. Inspired by generative adversarial networks, we present a generic end-to-end adversarial approach, which can be combined with a wide range of existing semantic segmentation networks to improve their segmentation performance. The key element of our… ▽ More

    Submitted 23 September, 2019; originally announced September 2019.

  40. arXiv:1907.02392  [pdf, other

    cs.CV cs.LG

    Guided Image Generation with Conditional Invertible Neural Networks

    Authors: Lynton Ardizzone, Carsten Lüth, Jakob Kruse, Carsten Rother, Ullrich Köthe

    Abstract: In this work, we address the task of natural image generation guided by a conditioning input. We introduce a new architecture called conditional invertible neural network (cINN). The cINN combines the purely generative INN model with an unconstrained feed-forward network, which efficiently preprocesses the conditioning input into useful features. All parameters of the cINN are jointly optimized wi… ▽ More

    Submitted 10 July, 2019; v1 submitted 4 July, 2019; originally announced July 2019.

    MSC Class: 68T01

  41. arXiv:1905.10687  [pdf, other

    stat.ML cs.AI cs.LG

    HINT: Hierarchical Invertible Neural Transport for Density Estimation and Bayesian Inference

    Authors: Jakob Kruse, Gianluca Detommaso, Ullrich Köthe, Robert Scheichl

    Abstract: Many recent invertible neural architectures are based on coupling block designs where variables are divided in two subsets which serve as inputs of an easily invertible (usually affine) triangular transformation. While such a transformation is invertible, its Jacobian is very sparse and thus may lack expressiveness. This work presents a simple remedy by noting that subdivision and (affine) couplin… ▽ More

    Submitted 25 May, 2021; v1 submitted 25 May, 2019; originally announced May 2019.

    Comments: Published at AAAI 2021

  42. arXiv:1904.12654  [pdf, other

    cs.CV cs.LG stat.ML

    The Mutex Watershed and its Objective: Efficient, Parameter-Free Graph Partitioning

    Authors: Steffen Wolf, Alberto Bailoni, Constantin Pape, Nasim Rahaman, Anna Kreshuk, Ullrich Köthe, Fred A. Hamprecht

    Abstract: Image partitioning, or segmentation without semantics, is the task of decomposing an image into distinct segments, or equivalently to detect closed contours. Most prior work either requires seeds, one per segment; or a threshold; or formulates the task as multicut / correlation clustering, an NP-hard problem. Here, we propose an efficient algorithm for graph partitioning, the "Mutex Watershed''. U… ▽ More

    Submitted 19 April, 2021; v1 submitted 25 April, 2019; originally announced April 2019.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (2020) 1-1

  43. arXiv:1903.03441  [pdf, other

    physics.med-ph cs.LG stat.ML

    Uncertainty-aware performance assessment of optical imaging modalities with invertible neural networks

    Authors: Tim J. Adler, Lynton Ardizzone, Anant Vemuri, Leonardo Ayala, Janek Gröhl, Thomas Kirchner, Sebastian Wirkert, Jakob Kruse, Carsten Rother, Ullrich Köthe, Lena Maier-Hein

    Abstract: Purpose: Optical imaging is evolving as a key technique for advanced sensing in the operating room. Recent research has shown that machine learning algorithms can be used to address the inverse problem of converting pixel-wise multispectral reflectance measurements to underlying tissue parameters, such as oxygenation. Assessment of the specific hardware used in conjunction with such algorithms, ho… ▽ More

    Submitted 8 March, 2019; originally announced March 2019.

    Comments: Accepted at IPCAI 2019

  44. arXiv:1808.04730  [pdf, other

    cs.LG stat.ML

    Analyzing Inverse Problems with Invertible Neural Networks

    Authors: Lynton Ardizzone, Jakob Kruse, Sebastian Wirkert, Daniel Rahner, Eric W. Pellegrini, Ralf S. Klessen, Lena Maier-Hein, Carsten Rother, Ullrich Köthe

    Abstract: In many tasks, in particular in natural science, the goal is to determine hidden system parameters from a set of measurements. Often, the forward process from parameter- to measurement-space is a well-defined function, whereas the inverse problem is ambiguous: one measurement may map to multiple different sets of parameters. In this setting, the posterior parameter distribution, conditioned on an… ▽ More

    Submitted 6 February, 2019; v1 submitted 14 August, 2018; originally announced August 2018.

    MSC Class: 68T01

  45. arXiv:1704.02249  [pdf, other

    cs.CV

    Learned Watershed: End-to-End Learning of Seeded Segmentation

    Authors: Steffen Wolf, Lukas Schott, Ullrich Köthe, Fred Hamprecht

    Abstract: Learned boundary maps are known to outperform hand- crafted ones as a basis for the watershed algorithm. We show, for the first time, how to train watershed computation jointly with boundary map prediction. The estimator for the merging priorities is cast as a neural network that is con- volutional (over space) and recurrent (over iterations). The latter allows learning of complex shape priors. Th… ▽ More

    Submitted 4 September, 2017; v1 submitted 7 April, 2017; originally announced April 2017.

    Comments: The first two authors contributed equally

  46. arXiv:1009.6215  [pdf, other

    cs.CG cs.CV cs.DS

    How to Extract the Geometry and Topology from Very Large 3D Segmentations

    Authors: Bjoern Andres, Ullrich Koethe, Thorben Kroeger, Fred A. Hamprecht

    Abstract: Segmentation is often an essential intermediate step in image analysis. A volume segmentation characterizes the underlying volume image in terms of geometric information--segments, faces between segments, curves in which several faces meet--as well as a topology on these objects. Existing algorithms encode this information in designated data structures, but require that these data structures fit e… ▽ More

    Submitted 30 September, 2010; originally announced September 2010.

    Comments: C++ source code, free command line tools and MATLAB mex files are avilable from http://hci.iwr.uni-heidelberg.de/software.php

  47. arXiv:1009.4102  [pdf, other

    cs.DS cs.CC

    The Lazy Flipper: MAP Inference in Higher-Order Graphical Models by Depth-limited Exhaustive Search

    Authors: Bjoern Andres, Joerg H. Kappes, Ullrich Koethe, Fred A. Hamprecht

    Abstract: This article presents a new search algorithm for the NP-hard problem of optimizing functions of binary variables that decompose according to a graphical model. It can be applied to models of any order and structure. The main novelty is a technique to constrain the search space based on the topology of the model. When pursued to the full search depth, the algorithm is guaranteed to converge to a gl… ▽ More

    Submitted 21 September, 2010; originally announced September 2010.

    Comments: C++ Source Code available from http://hci.iwr.uni-heidelberg.de/software.php

  48. arXiv:1008.2909  [pdf, other

    cs.DS cs.MS cs.PL cs.SE

    Runtime-Flexible Multi-dimensional Arrays and Views for C++98 and C++0x

    Authors: Bjoern Andres, Ullrich Koethe, Thorben Kroeger, Fred A. Hamprecht

    Abstract: Multi-dimensional arrays are among the most fundamental and most useful data structures of all. In C++, excellent template libraries exist for arrays whose dimension is fixed at runtime. Arrays whose dimension can change at runtime have been implemented in C. However, a generic object-oriented C++ implementation of runtime-flexible arrays has so far been missing. In this article, we discuss our ne… ▽ More

    Submitted 17 August, 2010; originally announced August 2010.

    Comments: Free source code available