Skip to main content

Showing 1–50 of 72 results for author: Rother, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.09799  [pdf, other

    cs.CV cs.RO

    BOP Challenge 2023 on Detection, Segmentation and Pose Estimation of Seen and Unseen Rigid Objects

    Authors: Tomas Hodan, Martin Sundermeyer, Yann Labbe, Van Nguyen Nguyen, Gu Wang, Eric Brachmann, Bertram Drost, Vincent Lepetit, Carsten Rother, Jiri Matas

    Abstract: We present the evaluation methodology, datasets and results of the BOP Challenge 2023, the fifth in a series of public competitions organized to capture the state of the art in model-based 6D object pose estimation from an RGB/RGB-D image and related tasks. Besides the three tasks from 2022 (model-based 2D detection, 2D segmentation, and 6D localization of objects seen during training), the 2023 c… ▽ More

    Submitted 16 April, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2302.13075

  2. arXiv:2312.06573  [pdf, other

    cs.CV

    ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models

    Authors: Denis Zavadski, Johann-Friedrich Feiden, Carsten Rother

    Abstract: The field of image synthesis has made tremendous strides forward in the last years. Besides defining the desired output image with text-prompts, an intuitive approach is to additionally use spatial guidance in form of an image, such as a depth map. For this, a recent and highly popular approach is to use a controlling network, such as ControlNet, in combination with a pre-trained image generation… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  3. arXiv:2307.08930  [pdf, other

    cs.CV cs.AI

    Unsupervised Deep Graph Matching Based on Cycle Consistency

    Authors: Siddharth Tourani, Carsten Rother, Muhammad Haris Khan, Bogdan Savchynskyy

    Abstract: We contribute to the sparsely populated area of unsupervised deep graph matching with application to keypoint matching in images. Contrary to the standard \emph{supervised} approach, our method does not require ground truth correspondences between keypoint pairs. Instead, it is self-supervised by enforcing consistency of matchings between images of the same object category. As the matching and the… ▽ More

    Submitted 11 February, 2024; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: 12 pages, 5 figures, 3 papers

  4. arXiv:2303.09989  [pdf, other

    cs.LG stat.ML

    Finding Competence Regions in Domain Generalization

    Authors: Jens Müller, Stefan T. Radev, Robert Schmier, Felix Draxler, Carsten Rother, Ullrich Köthe

    Abstract: We investigate a "learning to reject" framework to address the problem of silent failures in Domain Generalization (DG), where the test distribution differs from the training distribution. Assuming a mild distribution shift, we wish to accept out-of-distribution (OOD) data from a new domain whenever a model's estimated competence foresees trustworthy responses, instead of rejecting OOD data outrig… ▽ More

    Submitted 21 June, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: The paper has been published at TMLR (see https://openreview.net/forum?id=TSy0vuwQFN)

    Journal ref: Transactions on Machine Learning Research (06/2023)

  5. arXiv:2302.13075  [pdf, other

    cs.CV

    BOP Challenge 2022 on Detection, Segmentation and Pose Estimation of Specific Rigid Objects

    Authors: Martin Sundermeyer, Tomas Hodan, Yann Labbe, Gu Wang, Eric Brachmann, Bertram Drost, Carsten Rother, Jiri Matas

    Abstract: We present the evaluation methodology, datasets and results of the BOP Challenge 2022, the fourth in a series of public competitions organized with the goal to capture the status quo in the field of 6D object pose estimation from an RGB/RGB-D image. In 2022, we witnessed another significant improvement in the pose estimation accuracy -- the state of the art, which was 56.9 AR$_C$ in 2019 (Vidal et… ▽ More

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2009.07378

  6. arXiv:2207.00291  [pdf, other

    cs.CV math.OC

    A Comparative Study of Graph Matching Algorithms in Computer Vision

    Authors: Stefan Haller, Lorenz Feineis, Lisa Hutschenreiter, Florian Bernard, Carsten Rother, Dagmar Kainmüller, Paul Swoboda, Bogdan Savchynskyy

    Abstract: The graph matching optimization problem is an essential component for many tasks in computer vision, such as bringing two deformable objects in correspondence. Naturally, a wide range of applicable algorithms have been proposed in the last decades. Since a common standard benchmark has not been developed, their performance claims are often hard to verify as evaluation on differing problem instance… ▽ More

    Submitted 29 July, 2022; v1 submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted In: European Conference on Computer Vision (ECCV) 2022

  7. arXiv:2203.16542  [pdf, other

    cs.CV

    Towards Multimodal Depth Estimation from Light Fields

    Authors: Titus Leistner, Radek Mackowiak, Lynton Ardizzone, Ullrich Köthe, Carsten Rother

    Abstract: Light field applications, especially light field rendering and depth estimation, developed rapidly in recent years. While state-of-the-art light field rendering methods handle semi-transparent and reflective objects well, depth estimation methods either ignore these cases altogether or only deliver a weak performance. We argue that this is due current methods only considering a single "true" depth… ▽ More

    Submitted 1 April, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

  8. arXiv:2202.09206  [pdf, other

    cs.CV

    Spatio-Temporal Outdoor Lighting Aggregation on Image Sequences using Transformer Networks

    Authors: Haebom Lee, Christian Homeyer, Robert Herzog, Jan Rexilius, Carsten Rother

    Abstract: In this work, we focus on outdoor lighting estimation by aggregating individual noisy estimates from images, exploiting the rich image information from wide-angle cameras and/or temporal image sequences. Photographs inherently encode information about the scene's lighting in the form of shading and shadows. Recovering the lighting is an inverse rendering problem and as that ill-posed. Recent work… ▽ More

    Submitted 18 February, 2022; originally announced February 2022.

    Comments: 11 pages, 7 figures, 1 table, currently under a review process

  9. arXiv:2202.00027  [pdf, other

    astro-ph.EP astro-ph.IM cs.LG

    Exoplanet Characterization using Conditional Invertible Neural Networks

    Authors: Jonas Haldemann, Victor Ksoll, Daniel Walter, Yann Alibert, Ralf S. Klessen, Willy Benz, Ullrich Koethe, Lynton Ardizzone, Carsten Rother

    Abstract: The characterization of an exoplanet's interior is an inverse problem, which requires statistical methods such as Bayesian inference in order to be solved. Current methods employ Markov Chain Monte Carlo (MCMC) sampling to infer the posterior probability of planetary structure parameters for a given exoplanet. These methods are time consuming since they require the calculation of a large number of… ▽ More

    Submitted 31 January, 2022; originally announced February 2022.

    Comments: 15 pages, 13 figures, submitted to Astronomy & Astrophysics

    Journal ref: A&A 672, A180 (2023)

  10. arXiv:2112.01554  [pdf, other

    cs.CV cs.GR

    Neural Head Avatars from Monocular RGB Videos

    Authors: Philip-William Grassal, Malte Prinzler, Titus Leistner, Carsten Rother, Matthias Nießner, Justus Thies

    Abstract: We present Neural Head Avatars, a novel neural representation that explicitly models the surface geometry and appearance of an animatable human avatar that can be used for teleconferencing in AR/VR or other applications in the movie or games industry that rely on a digital human. Our representation can be learned from a monocular RGB portrait video that features a range of different expressions an… ▽ More

    Submitted 28 March, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: Camera-ready revision - Video: https://youtu.be/I17GbCCoytk Project page: https://philgras.github.io/neural_head_avatars/neural_head_avatars.html

  11. arXiv:2110.09848  [pdf, other

    cs.CV

    Self-Supervised Object Detection via Generative Image Synthesis

    Authors: Siva Karthik Mustikovela, Shalini De Mello, Aayush Prakash, Umar Iqbal, Sifei Liu, Thu Nguyen-Phuoc, Carsten Rother, Jan Kautz

    Abstract: We present SSOD, the first end-to-end analysis-by synthesis framework with controllable GANs for the task of self-supervised object detection. We use collections of real world images without bounding box annotations to learn to synthesize and detect objects. We leverage controllable GANs to synthesize images with pre-defined object properties and use them to train object detectors. We propose a ti… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

  12. arXiv:2109.00524  [pdf, other

    cs.CV cs.LG

    On the Limits of Pseudo Ground Truth in Visual Camera Re-localisation

    Authors: Eric Brachmann, Martin Humenberger, Carsten Rother, Torsten Sattler

    Abstract: Benchmark datasets that measure camera pose accuracy have driven progress in visual re-localisation research. To obtain poses for thousands of images, it is common to use a reference algorithm to generate pseudo ground truth. Popular choices include Structure-from-Motion (SfM) and Simultaneous-Localisation-and-Map** (SLAM) using additional sensors like depth cameras if available. Re-localisation… ▽ More

    Submitted 1 September, 2021; originally announced September 2021.

    Comments: ICCV 2021

  13. arXiv:2105.02104  [pdf, other

    cs.CV cs.AI

    Conditional Invertible Neural Networks for Diverse Image-to-Image Translation

    Authors: Lynton Ardizzone, Jakob Kruse, Carsten Lüth, Niels Bracher, Carsten Rother, Ullrich Köthe

    Abstract: We introduce a new architecture called a conditional invertible neural network (cINN), and use it to address the task of diverse image-to-image translation for natural images. This is not easily possible with existing INN models due to some fundamental limitations. The cINN combines the purely generative INN model with an unconstrained feed-forward network, which efficiently preprocesses the condi… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: text overlap with arXiv:1907.02392

    MSC Class: 68T01

  14. arXiv:2101.12085  [pdf, other

    cs.CV math.OC

    Fusion Moves for Graph Matching

    Authors: Lisa Hutschenreiter, Stefan Haller, Lorenz Feineis, Carsten Rother, Dagmar Kainmüller, Bogdan Savchynskyy

    Abstract: We contribute to approximate algorithms for the quadratic assignment problem also known as graph matching. Inspired by the success of the fusion moves technique developed for multilabel discrete Markov random fields, we investigate its applicability to graph matching. In particular, we show how fusion moves can be efficiently combined with the dedicated state-of-the-art dual methods that have rece… ▽ More

    Submitted 20 August, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

    Comments: 180 pages (including appendix), accepted in: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2021

  15. arXiv:2101.10763  [pdf, other

    cs.LG

    Benchmarking Invertible Architectures on Inverse Problems

    Authors: Jakob Kruse, Lynton Ardizzone, Carsten Rother, Ullrich Köthe

    Abstract: Recent work demonstrated that flow-based invertible neural networks are promising tools for solving ambiguous inverse problems. Following up on this, we investigate how ten invertible architectures and related models fare on two intuitive, low-dimensional benchmark problems, obtaining the best results with coupling layers and simple autoencoders. We hope that our initial efforts inspire other rese… ▽ More

    Submitted 22 June, 2021; v1 submitted 26 January, 2021; originally announced January 2021.

    MSC Class: 68T01

    Journal ref: Workshop on Invertible Neural Networks and Normalizing Flows (ICML 2019)

  16. arXiv:2012.08195  [pdf, other

    cs.CV cs.LG

    Representing Ambiguity in Registration Problems with Conditional Invertible Neural Networks

    Authors: Darya Trofimova, Tim Adler, Lisa Kausch, Lynton Ardizzone, Klaus Maier-Hein, Ulrich Köthe, Carsten Rother, Lena Maier-Hein

    Abstract: Image registration is the basis for many applications in the fields of medical image computing and computer assisted interventions. One example is the registration of 2D X-ray images with preoperative three-dimensional computed tomography (CT) images in intraoperative surgical guidance systems. Due to the high safety requirements in medical applications, estimating registration uncertainty is of a… ▽ More

    Submitted 15 December, 2020; originally announced December 2020.

    Comments: The paper got accepted at Medical Imaging Meets NeurIPS Workshop at Neural Information Processing Systems 2020

  17. arXiv:2011.05110  [pdf, other

    physics.med-ph cs.AI cs.LG

    Invertible Neural Networks for Uncertainty Quantification in Photoacoustic Imaging

    Authors: Jan-Hinrich Nölke, Tim Adler, Janek Gröhl, Thomas Kirchner, Lynton Ardizzone, Carsten Rother, Ullrich Köthe, Lena Maier-Hein

    Abstract: Multispectral photoacoustic imaging (PAI) is an emerging imaging modality which enables the recovery of functional tissue parameters such as blood oxygenation. However, the underlying inverse problems are potentially ill-posed, meaning that radically different tissue properties may - in theory - yield comparable measurements. In this work, we present a new approach for handling this specific type… ▽ More

    Submitted 23 November, 2020; v1 submitted 10 November, 2020; originally announced November 2020.

    Comments: 7 pages, 4 figures, submitted to "Bildverarbeitung für die Medizin (BVM) 2021"

  18. arXiv:2010.07167  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Robust Models Using The Principle of Independent Causal Mechanisms

    Authors: Jens Müller, Robert Schmier, Lynton Ardizzone, Carsten Rother, Ullrich Köthe

    Abstract: Standard supervised learning breaks down under data distribution shift. However, the principle of independent causal mechanisms (ICM, Peters et al. (2017)) can turn this weakness into an opportunity: one can take advantage of distribution shift between different environments during training in order to obtain more robust models. We propose a new gradient-based learning framework whose objective fu… ▽ More

    Submitted 8 February, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

  19. Increasing the Robustness of Semantic Segmentation Models with Painting-by-Numbers

    Authors: Christoph Kamann, Burkhard Güssefeld, Robin Hutmacher, Jan Hendrik Metzen, Carsten Rother

    Abstract: For safety-critical applications such as autonomous driving, CNNs have to be robust with respect to unavoidable image corruptions, such as image noise. While previous works addressed the task of robust prediction in the context of full-image classification, we consider it for dense semantic segmentation. We build upon an insight from image classification that output robustness can be improved by i… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.

  20. arXiv:2009.07378  [pdf, other

    cs.CV cs.GR cs.LG cs.RO

    BOP Challenge 2020 on 6D Object Localization

    Authors: Tomas Hodan, Martin Sundermeyer, Bertram Drost, Yann Labbe, Eric Brachmann, Frank Michel, Carsten Rother, Jiri Matas

    Abstract: This paper presents the evaluation methodology, datasets, and results of the BOP Challenge 2020, the third in a series of public competitions organized with the goal to capture the status quo in the field of 6D object pose estimation from an RGB-D image. In 2020, to reduce the domain gap between synthetic training and real test RGB images, the participants were provided 350K photorealistic trainin… ▽ More

    Submitted 13 October, 2020; v1 submitted 15 September, 2020; originally announced September 2020.

    Comments: In ECCV 2020 Workshops Proceedings

  21. arXiv:2007.15036  [pdf, other

    cs.CV cs.LG

    Generative Classifiers as a Basis for Trustworthy Image Classification

    Authors: Radek Mackowiak, Lynton Ardizzone, Ullrich Köthe, Carsten Rother

    Abstract: With the maturing of deep learning systems, trustworthiness is becoming increasingly important for model assessment. We understand trustworthiness as the combination of explainability and robustness. Generative classifiers (GCs) are a promising class of models that are said to naturally accomplish these qualities. However, this has mostly been demonstrated on simple datasets such as MNIST and CIFA… ▽ More

    Submitted 2 December, 2020; v1 submitted 29 July, 2020; originally announced July 2020.

  22. arXiv:2006.16011  [pdf, other

    cs.CV cs.GR

    Intrinsic Autoencoders for Joint Neural Rendering and Intrinsic Image Decomposition

    Authors: Hassan Abu Alhaija, Siva Karthik Mustikovela, Justus Thies, Varun Jampani, Matthias Nießner, Andreas Geiger, Carsten Rother

    Abstract: Neural rendering techniques promise efficient photo-realistic image synthesis while at the same time providing rich control over scene parameters by learning the physical image formation process. While several supervised methods have been proposed for this task, acquiring a dataset of images with accurately aligned 3D models is very difficult. The main contribution of this work is to lift this res… ▽ More

    Submitted 29 March, 2021; v1 submitted 29 June, 2020; originally announced June 2020.

  23. arXiv:2006.07742  [pdf, other

    cs.CV

    Split-Merge Pooling

    Authors: Omid Hosseini Jafari, Carsten Rother

    Abstract: There are a variety of approaches to obtain a vast receptive field with convolutional neural networks (CNNs), such as pooling or striding convolutions. Most of these approaches were initially designed for image classification and later adapted to dense prediction tasks, such as semantic segmentation. However, the major drawback of this adaptation is the loss of spatial information. Even the popula… ▽ More

    Submitted 13 June, 2020; originally announced June 2020.

  24. arXiv:2004.08227  [pdf, other

    cs.LG cs.CV stat.ML

    MPLP++: Fast, Parallel Dual Block-Coordinate Ascent for Dense Graphical Models

    Authors: Siddharth Tourani, Alexander Shekhovtsov, Carsten Rother, Bogdan Savchynskyy

    Abstract: Dense, discrete Graphical Models with pairwise potentials are a powerful class of models which are employed in state-of-the-art computer vision and bio-imaging applications. This work introduces a new MAP-solver, based on the popular Dual Block-Coordinate Ascent principle. Surprisingly, by making a small change to the low-performing solver, the Max Product Linear Programming (MPLP) algorithm, we d… ▽ More

    Submitted 16 April, 2020; originally announced April 2020.

    Comments: Accepted in ECCV-2018

  25. arXiv:2004.07715  [pdf, other

    cs.LG stat.ML

    Taxonomy of Dual Block-Coordinate Ascent Methods for Discrete Energy Minimization

    Authors: Siddharth Tourani, Alexander Shekhovtsov, Carsten Rother, Bogdan Savchynskyy

    Abstract: We consider the maximum-a-posteriori inference problem in discrete graphical models and study solvers based on the dual block-coordinate ascent rule. We map all existing solvers in a single framework, allowing for a better understanding of their design principles. We theoretically show that some block-optimizing updates are sub-optimal and how to strictly improve them. On a wide range of problem i… ▽ More

    Submitted 16 April, 2020; originally announced April 2020.

    Comments: Accepted in AISTATS 2020

  26. arXiv:2004.06375  [pdf, other

    cs.CV math.OC

    A Primal-Dual Solver for Large-Scale Tracking-by-Assignment

    Authors: Stefan Haller, Mangal Prakash, Lisa Hutschenreiter, Tobias Pietzsch, Carsten Rother, Florian Jug, Paul Swoboda, Bogdan Savchynskyy

    Abstract: We propose a fast approximate solver for the combinatorial problem known as tracking-by-assignment, which we apply to cell tracking. The latter plays a key role in discovery in many life sciences, especially in cell and developmental biology. So far, in the most general setting this problem was addressed by off-the-shelf solvers like Gurobi, whose run time and memory requirements rapidly grow with… ▽ More

    Submitted 14 April, 2020; originally announced April 2020.

    Comments: 23rd International Conference on Artificial Intelligence and Statistics (AISTATS), 2020

  27. arXiv:2004.01793  [pdf, other

    cs.CV

    Self-Supervised Viewpoint Learning From Image Collections

    Authors: Siva Karthik Mustikovela, Varun Jampani, Shalini De Mello, Sifei Liu, Umar Iqbal, Carsten Rother, Jan Kautz

    Abstract: Training deep neural networks to estimate the viewpoint of objects requires large labeled training datasets. However, manually labeling viewpoints is notoriously hard, error-prone, and time-consuming. On the other hand, it is relatively easy to mine many unlabelled images of an object category from the internet, e.g., of cars or faces. We seek to answer the research question of whether such unlabe… ▽ More

    Submitted 3 April, 2020; originally announced April 2020.

    Comments: Accepted at CVPR 20

  28. arXiv:2002.12324  [pdf, other

    cs.CV cs.LG

    Visual Camera Re-Localization from RGB and RGB-D Images Using DSAC

    Authors: Eric Brachmann, Carsten Rother

    Abstract: We describe a learning-based system that estimates the camera position and orientation from a single input image relative to a known environment. The system is flexible w.r.t. the amount of information available at test and at training time, catering to different applications. Input images can be RGB-D or RGB, and a 3D model of the environment can be utilized for training but is not necessary. In… ▽ More

    Submitted 9 October, 2020; v1 submitted 27 February, 2020; originally announced February 2020.

  29. arXiv:2001.06448  [pdf, other

    cs.LG stat.ML

    Training Normalizing Flows with the Information Bottleneck for Competitive Generative Classification

    Authors: Lynton Ardizzone, Radek Mackowiak, Carsten Rother, Ullrich Köthe

    Abstract: The Information Bottleneck (IB) objective uses information theory to formulate a task-performance versus robustness trade-off. It has been successfully applied in the standard discriminative classification setting. We pose the question whether the IB can also be used to train generative likelihood models such as normalizing flows. Since normalizing flows use invertible network architectures (INNs)… ▽ More

    Submitted 12 January, 2021; v1 submitted 17 January, 2020; originally announced January 2020.

    MSC Class: 68T01

    Journal ref: Advances in Neural Information Processing Systems 33 Proceedings (NeurIPS 2020)

  30. arXiv:2001.04872  [pdf, other

    cs.LG stat.ML

    Disentanglement by Nonlinear ICA with General Incompressible-flow Networks (GIN)

    Authors: Peter Sorrenson, Carsten Rother, Ullrich Köthe

    Abstract: A central question of representation learning asks under which conditions it is possible to reconstruct the true latent variables of an arbitrarily complex generative process. Recent breakthrough work by Khemakhem et al. (2019) on nonlinear ICA has answered this question for a broad class of conditional generative processes. We extend this important result in a direction relevant for application t… ▽ More

    Submitted 14 January, 2020; originally announced January 2020.

    Comments: 23 pages, 15 figures, ICLR 2020

  31. arXiv:2001.02643  [pdf, other

    cs.CV

    CONSAC: Robust Multi-Model Fitting by Conditional Sample Consensus

    Authors: Florian Kluger, Eric Brachmann, Hanno Ackermann, Carsten Rother, Michael Ying Yang, Bodo Rosenhahn

    Abstract: We present a robust estimator for fitting multiple parametric models of the same form to noisy measurements. Applications include finding multiple vanishing points in man-made scenes, fitting planes to architectural imagery, or estimating multiple rigid motions within the same sequence. In contrast to previous works, which resorted to hand-crafted search strategies for multiple model detection, we… ▽ More

    Submitted 25 March, 2020; v1 submitted 8 January, 2020; originally announced January 2020.

    Comments: CVPR 2020

  32. arXiv:1912.00623  [pdf, other

    cs.CV cs.LG

    Reinforced Feature Points: Optimizing Feature Detection and Description for a High-Level Task

    Authors: Aritra Bhowmik, Stefan Gumhold, Carsten Rother, Eric Brachmann

    Abstract: We address a core problem of computer vision: Detection and description of 2D feature points for image matching. For a long time, hand-crafted designs, like the seminal SIFT algorithm, were unsurpassed in accuracy and efficiency. Recently, learned feature detectors emerged that implement detection and description using neural networks. Training these networks usually resorts to optimizing low-leve… ▽ More

    Submitted 20 March, 2020; v1 submitted 2 December, 2019; originally announced December 2019.

    Comments: CVPR 2020 (oral)

  33. arXiv:1911.01877  [pdf, other

    eess.IV cs.LG physics.med-ph stat.ML

    Out of distribution detection for intra-operative functional imaging

    Authors: Tim J. Adler, Leonardo Ayala, Lynton Ardizzone, Hannes G. Kenngott, Anant Vemuri, Beat P. Müller-Stich, Carsten Rother, Ullrich Köthe, Lena Maier-Hein

    Abstract: Multispectral optical imaging is becoming a key tool in the operating room. Recent research has shown that machine learning algorithms can be used to convert pixel-wise reflectance measurements to tissue parameters, such as oxygenation. However, the accuracy of these algorithms can only be guaranteed if the spectra acquired during surgery match the ones seen during training. It is therefore of gre… ▽ More

    Submitted 5 November, 2019; originally announced November 2019.

    Comments: The final authenticated version is available online at https://doi.org/10.1007/978-3-030-32689-0_8

    Journal ref: Proceedings of the First International Workshop on Uncertainty for Safe Utilization of Machine Learning in Medical Imaging, UNSURE 2019, and the 8th International Workshop on Clinical Image-Based Procedures, CLIP 2019

  34. Learning to Think Outside the Box: Wide-Baseline Light Field Depth Estimation with EPI-Shift

    Authors: Titus Leistner, Hendrik Schilling, Radek Mackowiak, Stefan Gumhold, Carsten Rother

    Abstract: We propose a method for depth estimation from light field data, based on a fully convolutional neural network architecture. Our goal is to design a pipeline which achieves highly accurate results for small- and wide-baseline light fields. Since light field training data is scarce, all learning-based approaches use a small receptive field and operate on small disparity ranges. In order to work with… ▽ More

    Submitted 19 September, 2019; originally announced September 2019.

    Comments: Published at International Conference on 3D Vision (3DV) 2019

  35. Benchmarking the Robustness of Semantic Segmentation Models

    Authors: Christoph Kamann, Carsten Rother

    Abstract: When designing a semantic segmentation module for a practical application, such as autonomous driving, it is crucial to understand the robustness of the module with respect to a wide range of image corruptions. While there are recent robustness studies for full-image classification, we are the first to present an exhaustive study for semantic segmentation, based on the state-of-the-art model DeepL… ▽ More

    Submitted 10 August, 2020; v1 submitted 14 August, 2019; originally announced August 2019.

    Comments: CVPR 2020 camera ready

  36. arXiv:1908.02484  [pdf, other

    cs.CV

    Expert Sample Consensus Applied to Camera Re-Localization

    Authors: Eric Brachmann, Carsten Rother

    Abstract: Fitting model parameters to a set of noisy data points is a common problem in computer vision. In this work, we fit the 6D camera pose to a set of noisy correspondences between the 2D input image and a known 3D environment. We estimate these correspondences from the image using a neural network. Since the correspondences often contain outliers, we utilize a robust estimator such as Random Sample C… ▽ More

    Submitted 7 August, 2019; originally announced August 2019.

    Comments: ICCV 2019. Supplementary materials included

  37. arXiv:1907.02392  [pdf, other

    cs.CV cs.LG

    Guided Image Generation with Conditional Invertible Neural Networks

    Authors: Lynton Ardizzone, Carsten Lüth, Jakob Kruse, Carsten Rother, Ullrich Köthe

    Abstract: In this work, we address the task of natural image generation guided by a conditioning input. We introduce a new architecture called conditional invertible neural network (cINN). The cINN combines the purely generative INN model with an unconstrained feed-forward network, which efficiently preprocesses the conditioning input into useful features. All parameters of the cINN are jointly optimized wi… ▽ More

    Submitted 10 July, 2019; v1 submitted 4 July, 2019; originally announced July 2019.

    MSC Class: 68T01

  38. arXiv:1905.04132  [pdf, other

    cs.CV

    Neural-Guided RANSAC: Learning Where to Sample Model Hypotheses

    Authors: Eric Brachmann, Carsten Rother

    Abstract: We present Neural-Guided RANSAC (NG-RANSAC), an extension to the classic RANSAC algorithm from robust optimization. NG-RANSAC uses prior information to improve model hypothesis search, increasing the chance of finding outlier-free minimal sets. Previous works use heuristic side-information like hand-crafted descriptor distance to guide hypothesis search. In contrast, we learn hypothesis search in… ▽ More

    Submitted 31 July, 2019; v1 submitted 10 May, 2019; originally announced May 2019.

    Comments: ICCV 2019

  39. arXiv:1903.03441  [pdf, other

    physics.med-ph cs.LG stat.ML

    Uncertainty-aware performance assessment of optical imaging modalities with invertible neural networks

    Authors: Tim J. Adler, Lynton Ardizzone, Anant Vemuri, Leonardo Ayala, Janek Gröhl, Thomas Kirchner, Sebastian Wirkert, Jakob Kruse, Carsten Rother, Ullrich Köthe, Lena Maier-Hein

    Abstract: Purpose: Optical imaging is evolving as a key technique for advanced sensing in the operating room. Recent research has shown that machine learning algorithms can be used to address the inverse problem of converting pixel-wise multispectral reflectance measurements to underlying tissue parameters, such as oxygenation. Assessment of the specific hardware used in conjunction with such algorithms, ho… ▽ More

    Submitted 8 March, 2019; originally announced March 2019.

    Comments: Accepted at IPCAI 2019

  40. arXiv:1810.09726  [pdf, other

    cs.CV

    CEREALS - Cost-Effective REgion-based Active Learning for Semantic Segmentation

    Authors: Radek Mackowiak, Philip Lenz, Omair Ghori, Ferran Diego, Oliver Lange, Carsten Rother

    Abstract: State of the art methods for semantic image segmentation are trained in a supervised fashion using a large corpus of fully labeled training images. However, gathering such a corpus is expensive, due to human annotation effort, in contrast to gathering unlabeled data. We propose an active learning-based strategy, called CEREALS, in which a human only has to hand-label a few, automatically selected,… ▽ More

    Submitted 23 October, 2018; originally announced October 2018.

    Comments: Published at British Machine Vision Conference 2018 (BMVC)

  41. A Summary of the 4th International Workshop on Recovering 6D Object Pose

    Authors: Tomas Hodan, Rigas Kouskouridas, Tae-Kyun Kim, Federico Tombari, Kostas Bekris, Bertram Drost, Thibault Groueix, Krzysztof Walas, Vincent Lepetit, Ales Leonardis, Carsten Steger, Frank Michel, Caner Sahin, Carsten Rother, Jiri Matas

    Abstract: This document summarizes the 4th International Workshop on Recovering 6D Object Pose which was organized in conjunction with ECCV 2018 in Munich. The workshop featured four invited talks, oral and poster presentations of accepted workshop papers, and an introduction of the BOP benchmark for 6D object pose estimation. The workshop was attended by 100+ people working on relevant topics in both acade… ▽ More

    Submitted 8 October, 2018; originally announced October 2018.

    Comments: In: Computer Vision - ECCV 2018 Workshops - Munich, Germany, September 8-9 and 14, 2018, Proceedings

  42. arXiv:1809.04696  [pdf, other

    cs.CV

    Geometric Image Synthesis

    Authors: Hassan Abu Alhaija, Siva Karthik Mustikovela, Andreas Geiger, Carsten Rother

    Abstract: The task of generating natural images from 3D scenes has been a long standing goal in computer graphics. On the other hand, recent developments in deep neural networks allow for trainable models that can produce natural-looking images with little or no knowledge about the scene structure. While the generated images often consist of realistic looking local patterns, the overall structure of the gen… ▽ More

    Submitted 1 December, 2018; v1 submitted 12 September, 2018; originally announced September 2018.

  43. arXiv:1808.08319  [pdf, other

    cs.CV cs.AI cs.RO

    BOP: Benchmark for 6D Object Pose Estimation

    Authors: Tomas Hodan, Frank Michel, Eric Brachmann, Wadim Kehl, Anders Glent Buch, Dirk Kraft, Bertram Drost, Joel Vidal, Stephan Ihrke, Xenophon Zabulis, Caner Sahin, Fabian Manhardt, Federico Tombari, Tae-Kyun Kim, Jiri Matas, Carsten Rother

    Abstract: We propose a benchmark for 6D pose estimation of a rigid object from a single RGB-D input image. The training data consists of a texture-mapped 3D object model or images of the object in known 6D poses. The benchmark comprises of: i) eight datasets in a unified format that cover different practical scenarios, including two new datasets focusing on varying lighting conditions, ii) an evaluation met… ▽ More

    Submitted 24 August, 2018; originally announced August 2018.

    Comments: ECCV 2018

  44. arXiv:1808.04730  [pdf, other

    cs.LG stat.ML

    Analyzing Inverse Problems with Invertible Neural Networks

    Authors: Lynton Ardizzone, Jakob Kruse, Sebastian Wirkert, Daniel Rahner, Eric W. Pellegrini, Ralf S. Klessen, Lena Maier-Hein, Carsten Rother, Ullrich Köthe

    Abstract: In many tasks, in particular in natural science, the goal is to determine hidden system parameters from a set of measurements. Often, the forward process from parameter- to measurement-space is a well-defined function, whereas the inverse problem is ambiguous: one measurement may map to multiple different sets of parameters. In this setting, the posterior parameter distribution, conditioned on an… ▽ More

    Submitted 6 February, 2019; v1 submitted 14 August, 2018; originally announced August 2018.

    MSC Class: 68T01

  45. arXiv:1804.06423  [pdf, other

    cs.CV

    Deep Object Co-Segmentation

    Authors: Weihao Li, Omid Hosseini Jafari, Carsten Rother

    Abstract: This work presents a deep object co-segmentation (DOCS) approach for segmenting common objects of the same class within a pair of images. This means that the method learns to ignore common, or uncommon, background stuff and focuses on objects. If multiple object classes are presented in the image pair, they are jointly extracted as foreground. To address this task, we propose a CNN-based Siamese e… ▽ More

    Submitted 28 May, 2019; v1 submitted 17 April, 2018; originally announced April 2018.

    Comments: Accepted at ACCV 2018

  46. arXiv:1801.00868  [pdf, other

    cs.CV

    Panoptic Segmentation

    Authors: Alexander Kirillov, Kaiming He, Ross Girshick, Carsten Rother, Piotr Dollár

    Abstract: We propose and study a task we name panoptic segmentation (PS). Panoptic segmentation unifies the typically distinct tasks of semantic segmentation (assign a class label to each pixel) and instance segmentation (detect and segment each object instance). The proposed task requires generating a coherent scene segmentation that is rich and complete, an important step toward real-world vision systems.… ▽ More

    Submitted 10 April, 2019; v1 submitted 2 January, 2018; originally announced January 2018.

    Comments: accepted to CVPR 2019

  47. arXiv:1712.01924  [pdf, other

    cs.CV

    iPose: Instance-Aware 6D Pose Estimation of Partly Occluded Objects

    Authors: Omid Hosseini Jafari, Siva Karthik Mustikovela, Karl Pertsch, Eric Brachmann, Carsten Rother

    Abstract: We address the task of 6D pose estimation of known rigid objects from single input images in scenarios where the objects are partly occluded. Recent RGB-D-based methods are robust to moderate degrees of occlusion. For RGB inputs, no previous method works well for partly occluded objects. Our main contribution is to present the first deep learning-based system that estimates accurate poses for part… ▽ More

    Submitted 18 June, 2018; v1 submitted 5 December, 2017; originally announced December 2017.

  48. arXiv:1711.10228  [pdf, other

    cs.CV

    Learning Less is More - 6D Camera Localization via 3D Surface Regression

    Authors: Eric Brachmann, Carsten Rother

    Abstract: Popular research areas like autonomous driving and augmented reality have renewed the interest in image-based camera localization. In this work, we address the task of predicting the 6D camera pose from a single RGB image in a given 3D environment. With the advent of neural networks, previous works have either learned the entire camera localization process, or multiple components of a camera local… ▽ More

    Submitted 27 March, 2018; v1 submitted 28 November, 2017; originally announced November 2017.

    Comments: CVPR 2018

  49. arXiv:1708.01566  [pdf, other

    cs.CV

    Augmented Reality Meets Computer Vision : Efficient Data Generation for Urban Driving Scenes

    Authors: Hassan Abu Alhaija, Siva Karthik Mustikovela, Lars Mescheder, Andreas Geiger, Carsten Rother

    Abstract: The success of deep learning in computer vision is based on availability of large annotated datasets. To lower the need for hand labeled images, virtually rendered 3D worlds have recently gained popularity. Creating realistic 3D content is challenging on its own and requires significant human effort. In this work, we propose an alternative paradigm which combines real and synthetic data for learni… ▽ More

    Submitted 4 August, 2017; originally announced August 2017.

  50. Analyzing Modular CNN Architectures for Joint Depth Prediction and Semantic Segmentation

    Authors: Omid Hosseini Jafari, Oliver Groth, Alexander Kirillov, Michael Ying Yang, Carsten Rother

    Abstract: This paper addresses the task of designing a modular neural network architecture that jointly solves different tasks. As an example we use the tasks of depth estimation and semantic segmentation given a single RGB image. The main focus of this work is to analyze the cross-modality influence between depth and semantic prediction maps on their joint refinement. While most previous works solely focus… ▽ More

    Submitted 26 February, 2017; originally announced February 2017.

    Comments: Accepted to ICRA 2017