Showing 51–100 of 226 results for author: Cremers, D

  1. arXiv:2212.10368  [pdf, other]

    cs.CV

    Masked Event Modeling: Self-Supervised Pretraining for Event Cameras

    Authors: Simon Klenk, David Bonello, Lukas Koestler, Nikita Araslanov, Daniel Cremers

    Abstract: Event cameras asynchronously capture brightness changes with low latency, high temporal resolution, and high dynamic range. However, annotation of event data is a costly and laborious process, which limits the use of deep learning methods for classification and other semantic tasks with the event modality. To reduce the dependency on labeled event data, we introduce Masked Event Modeling (MEM), a…

    Submitted 23 December, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: To appear at WACV 2024. Code: https://github.com/tum-vision/mem

  2. arXiv:2212.04968  [pdf, other]

    cs.CV

    SupeRVol: Super-Resolution Shape and Reflectance Estimation in Inverse Volume Rendering

    Authors: Mohammed Brahimi, Bjoern Haefner, Tarun Yenamandra, Bastian Goldluecke, Daniel Cremers

    Abstract: We propose an end-to-end inverse rendering pipeline called SupeRVol that allows us to recover 3D shape and material parameters from a set of color images in a super-resolution manner. To this end, we represent both the bidirectional reflectance distribution function (BRDF) and the signed distance function (SDF) by multi-layer perceptrons. In order to obtain both the surface shape and its reflectan…

    Submitted 16 March, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

  3. arXiv:2212.02988  [pdf, other]

    cs.LG cs.CV cs.RO stat.ML

    PRISM: Probabilistic Real-Time Inference in Spatial World Models

    Authors: Atanas Mirchev, Baris Kayalibay, Ahmed Agha, Patrick van der Smagt, Daniel Cremers, Justin Bayer

    Abstract: We introduce PRISM, a method for real-time filtering in a probabilistic generative model of agent motion and visual perception. Previous approaches either lack uncertainty estimates for the map and agent state, do not run in real time, do not have a dense scene representation, or do not model agent dynamics. Our solution reconciles all of these aspects. We start from a predefined state-space model…

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: Will appear in PMLR, CoRL 2022

  4. arXiv:2212.02910  [pdf, other]

    cs.CV

    G-MSM: Unsupervised Multi-Shape Matching with Graph-based Affinity Priors

    Authors: Marvin Eisenberger, Aysim Toker, Laura Leal-Taixé, Daniel Cremers

    Abstract: We present G-MSM (Graph-based Multi-Shape Matching), a novel unsupervised learning approach for non-rigid shape correspondence. Rather than treating a collection of input poses as an unordered set of samples, we explicitly model the underlying shape data manifold. To this end, we propose an adaptive multi-shape matching architecture that constructs an affinity graph on a given set of training shap…

    Submitted 6 December, 2022; originally announced December 2022.

  5. arXiv:2211.12542  [pdf, other]

    cs.CV

    CASSPR: Cross Attention Single Scan Place Recognition

    Authors: Yan Xia, Mariia Gladkova, Rui Wang, Qianyun Li, Uwe Stilla, João F. Henriques, Daniel Cremers

    Abstract: Place recognition based on point clouds (LiDAR) is an important component for autonomous robots or self-driving vehicles. Current SOTA performance is achieved on accumulated LiDAR submaps using either point-based or voxel-based structures. While voxel-based approaches nicely integrate spatial context across multiple scales, they do not exhibit the local precision of point-based methods. As a resul…

    Submitted 29 August, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: Accepted by ICCV2023

  6. arXiv:2210.15575  [pdf, other]

    cs.LG cs.AI stat.ML

    A Graph Is More Than Its Nodes: Towards Structured Uncertainty-Aware Learning on Graphs

    Authors: Hans Hao-Hsun Hsu, Yuesong Shen, Daniel Cremers

    Abstract: Current graph neural networks (GNNs) that tackle node classification on graphs tend to only focus on nodewise scores and are solely evaluated by nodewise metrics. This limits uncertainty estimation on graphs since nodewise marginals do not fully characterize the joint distribution given the graph structure. In this work, we propose novel edgewise metrics, namely the edgewise expected calibration e…

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: Presented at NeurIPS 2022 New Frontiers in Graph Learning Workshop (NeurIPS GLFrontiers 2022)

  7. arXiv:2210.12202  [pdf, other]

    cs.CV

    High-Quality RGB-D Reconstruction via Multi-View Uncalibrated Photometric Stereo and Gradient-SDF

    Authors: Lu Sang, Bjoern Haefner, Xingxing Zuo, Daniel Cremers

    Abstract: Fine-detailed reconstructions are in high demand in many applications. However, most of the existing RGB-D reconstruction methods rely on pre-calculated accurate camera poses to recover the detailed surface geometry, where the representation of a surface needs to be adapted when optimizing different quantities. In this paper, we present a novel multi-view RGB-D based reconstruction method that tac…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: WACV 2023

  8. arXiv:2210.06436  [pdf, other]

    cs.LG

    Deep Combinatorial Aggregation

    Authors: Yuesong Shen, Daniel Cremers

    Abstract: Neural networks are known to produce poor uncertainty estimations, and a variety of approaches have been proposed to remedy this issue. This includes deep ensemble, a simple and effective method that achieves state-of-the-art results for uncertainty-aware learning tasks. In this work, we explore a combinatorial generalization of deep ensemble called deep combinatorial aggregation (DCA). DCA create…

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  9. arXiv:2210.06391  [pdf, other]

    cs.LG cs.AI

    What Makes Graph Neural Networks Miscalibrated?

    Authors: Hans Hao-Hsun Hsu, Yuesong Shen, Christian Tomani, Daniel Cremers

    Abstract: Given the importance of getting calibrated predictions and reliable uncertainty estimations, various post-hoc calibration methods have been developed for neural networks on standard multi-class classification tasks. However, these methods are not well suited for calibrating graph neural networks (GNNs), which presents unique challenges such as accounting for the graph structure and the graph-induc…

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS 2022

  10. arXiv:2209.14965  [pdf, other]

    cs.CV

    DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and Photometric Bundle Adjustment

    Authors: Mariia Gladkova, Nikita Korobov, Nikolaus Demmel, Aljoša Ošep, Laura Leal-Taixé, Daniel Cremers

    Abstract: Direct methods have shown excellent performance in the applications of visual odometry and SLAM. In this work we propose to leverage their effectiveness for the task of 3D multi-object tracking. To this end, we propose DirectTracker, a framework that effectively combines direct image alignment for the short-term tracking and sliding-window photometric bundle adjustment for 3D object detection. Obj…

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: In Proceedings of the IEEE International Conference on Intelligent Robots and Systems (IROS), 2022

  11. arXiv:2208.11300  [pdf, other]

    cs.CV cs.RO

    E-NeRF: Neural Radiance Fields from a Moving Event Camera

    Authors: Simon Klenk, Lukas Koestler, Davide Scaramuzza, Daniel Cremers

    Abstract: Estimating neural radiance fields (NeRFs) from "ideal" images has been extensively studied in the computer vision community. Most approaches assume optimal illumination and slow camera motion. These assumptions are often violated in robotic applications, where images may contain motion blur, and the scene may not have suitable illumination. This can cause significant problems for downstream tasks…

    Submitted 24 January, 2023; v1 submitted 24 August, 2022; originally announced August 2022.

    Comments: revised RAL version + added suppl. material

  12. arXiv:2208.05788  [pdf, other]

    cs.CV

    Semantic Self-adaptation: Enhancing Generalization with a Single Sample

    Authors: Sherwin Bahmani, Oliver Hahn, Eduard Zamfir, Nikita Araslanov, Daniel Cremers, Stefan Roth

    Abstract: The lack of out-of-domain generalization is a critical weakness of deep networks for semantic segmentation. Previous studies relied on the assumption of a static model, i.e., once the training process is complete, model parameters remain fixed at test time. In this work, we challenge this premise with a self-adaptive approach for semantic segmentation that adjusts the inference process to each in…

    Submitted 13 December, 2023; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: Published in TMLR (July 2023) | OpenReview: https://openreview.net/forum?id=ILNqQhGbLx | Code: https://github.com/visinf/self-adaptive | Video: https://youtu.be/s4DG65ic0EA

  13. arXiv:2206.09596  [pdf, other]

    cs.CV math.OC

    Efficient and Flexible Sublabel-Accurate Energy Minimization

    Authors: Zhakshylyk Nurlanov, Daniel Cremers, Florian Bernard

    Abstract: We address the problem of minimizing a class of energy functions consisting of data and smoothness terms that commonly occur in machine learning, computer vision, and pattern recognition. While discrete optimization methods are able to give theoretical optimality guarantees, they can only handle a finite number of labels and therefore suffer from label discretization bias. Existing continuous opti…

    Submitted 20 June, 2022; originally announced June 2022.

    Comments: To be published at ICPR 2022, Copyright 2022 IEEE

  14. arXiv:2206.05971  [pdf, other]

    cs.LG cs.AI

    Biologically Inspired Neural Path Finding

    Authors: Hang Li, Qadeer Khan, Volker Tresp, Daniel Cremers

    Abstract: The human brain can be considered to be a graphical structure comprising tens of billions of biological neurons connected by synapses. It has the remarkable ability to automatically re-route information flow through alternate paths in case some neurons are damaged. Moreover, the brain is capable of retaining information and applying it to similar but completely unseen scenarios. In this paper,…

    Submitted 25 August, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

  15. arXiv:2205.15094  [pdf, other]

    cs.LG cs.AI

    CHALLENGER: Training with Attribution Maps

    Authors: Christian Tomani, Daniel Cremers

    Abstract: We show that utilizing attribution maps for training neural networks can improve regularization of models and thus increase performance. Regularization is key in deep learning, especially when training complex models on relatively small datasets. In order to understand the inner workings of neural networks, attribution methods such as Layer-wise Relevance Propagation (LRP) have been extensively studie…

    Submitted 30 May, 2022; originally announced May 2022.

    Comments: Technical report

  16. arXiv:2205.11567  [pdf, other]

    cs.CV

    VPAIR -- Aerial Visual Place Recognition and Localization in Large-scale Outdoor Environments

    Authors: Michael Schleiss, Fahmi Rouatbi, Daniel Cremers

    Abstract: Visual Place Recognition and Visual Localization are essential components in navigation and mapping for autonomous vehicles, especially in GNSS-denied navigation scenarios. Recent work has focused on ground or close-to-ground applications such as self-driving cars or indoor scenarios and low-altitude drone flights. However, applications such as Urban Air Mobility require operations in large-scale o…

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: ICRA 2022 AERIAL ROBOTICS WORKSHOP

  17. arXiv:2205.06688  [pdf, other]

    cs.CV

    A Unified Framework for Implicit Sinkhorn Differentiation

    Authors: Marvin Eisenberger, Aysim Toker, Laura Leal-Taixé, Florian Bernard, Daniel Cremers

    Abstract: The Sinkhorn operator has recently experienced a surge of popularity in computer vision and related fields. One major reason is its ease of integration into deep learning frameworks. To allow for an efficient training of respective neural networks, we propose an algorithm that obtains analytical gradients of a Sinkhorn layer via implicit differentiation. In comparison to prior work, our framework…

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: To appear at CVPR 2022

  18. arXiv:2204.14030  [pdf, other]

    cs.CV

    Neural Implicit Representations for Physical Parameter Inference from a Single Video

    Authors: Florian Hofherr, Lukas Koestler, Florian Bernard, Daniel Cremers

    Abstract: Neural networks have recently been used to analyze diverse physical systems and to identify the underlying dynamics. While existing methods achieve impressive results, they are limited by their strong demand for training data and their weak generalization abilities to out-of-distribution data. To overcome these limitations, in this work we propose to combine neural implicit representations for app…

    Submitted 2 April, 2024; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: Published in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023

  19. arXiv:2204.12834  [pdf, other]

    cs.CV

    Power Bundle Adjustment for Large-Scale 3D Reconstruction

    Authors: Simon Weber, Nikolaus Demmel, Tin Chon Chan, Daniel Cremers

    Abstract: We introduce Power Bundle Adjustment as an expansion-type algorithm for solving large-scale bundle adjustment problems. It is based on the power series expansion of the inverse Schur complement and constitutes a new family of solvers that we call inverse expansion methods. We theoretically justify the use of power series and we prove the convergence of our approach. Using the real-world BAL datase…

    Submitted 17 April, 2023; v1 submitted 27 April, 2022; originally announced April 2022.

  20. arXiv:2204.12805  [pdf, other]

    cs.CV cs.GR math.OC

    A Scalable Combinatorial Solver for Elastic Geometrically Consistent 3D Shape Matching

    Authors: Paul Roetzer, Paul Swoboda, Daniel Cremers, Florian Bernard

    Abstract: We present a scalable combinatorial algorithm for globally optimizing over the space of geometrically consistent mappings between 3D shapes. We use the mathematically elegant formalism proposed by Windheuser et al. (ICCV 2011) where 3D shape matching was formulated as an integer linear program over the space of orientation-preserving diffeomorphisms. Until now, the resulting formulation had limite…

    Submitted 27 April, 2022; originally announced April 2022.

    Comments: CVPR 2022

  21. arXiv:2204.10060  [pdf, other]

    cs.CV

    Implicit Shape Completion via Adversarial Shape Priors

    Authors: Abhishek Saroha, Marvin Eisenberger, Tarun Yenamandra, Daniel Cremers

    Abstract: We present a novel neural implicit shape method for partial point cloud completion. To that end, we combine a conditional Deep-SDF architecture with learned, adversarial shape priors. More specifically, our network converts partial inputs into a global latent code and then recovers the full geometry via an implicit, signed distance generator. Additionally, we train a PointNet++ discriminator that…

    Submitted 21 April, 2022; originally announced April 2022.

  22. arXiv:2204.02256  [pdf, other]

    cs.CV cs.RO

    The Probabilistic Normal Epipolar Constraint for Frame-To-Frame Rotation Optimization under Uncertain Feature Positions

    Authors: Dominik Muhle, Lukas Koestler, Nikolaus Demmel, Florian Bernard, Daniel Cremers

    Abstract: The estimation of the relative pose of two camera views is a fundamental problem in computer vision. Kneip et al. proposed to solve this problem by introducing the normal epipolar constraint (NEC). However, their approach does not take into account uncertainties, so that the accuracy of the estimated relative pose is highly dependent on accurate feature positions in the target frame. In this work,…

    Submitted 5 April, 2022; originally announced April 2022.

  23. arXiv:2203.16284  [pdf, other]

    cs.CV cs.GR

    FIRe: Fast Inverse Rendering using Directional and Signed Distance Functions

    Authors: Tarun Yenamandra, Ayush Tewari, Nan Yang, Florian Bernard, Christian Theobalt, Daniel Cremers

    Abstract: Neural 3D implicit representations learn priors that are useful for diverse applications, such as single- or multiple-view 3D reconstruction. A major downside of existing approaches while rendering an image is that they require evaluating the network multiple times per camera ray so that the high computational time forms a bottleneck for downstream applications. We address this problem by introduc…

    Submitted 19 December, 2023; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: News: Accepted to WACV'24. Project page: https://vision.in.tum.de/research/geometry/fire

  24. arXiv:2203.12560  [pdf, other]

    cs.CV

    DynamicEarthNet: Daily Multi-Spectral Satellite Dataset for Semantic Change Segmentation

    Authors: Aysim Toker, Lukas Kondmann, Mark Weber, Marvin Eisenberger, Andrés Camero, Jingliang Hu, Ariadna Pregel Hoderlein, Çağlar Şenaras, Timothy Davis, Daniel Cremers, Giovanni Marchisio, Xiao Xiang Zhu, Laura Leal-Taixé

    Abstract: Earth observation is a fundamental tool for monitoring the evolution of land use in specific areas of interest. Observing and precisely defining change, in this context, requires both time-series data and pixel-wise segmentations. To that end, we propose the DynamicEarthNet dataset that consists of daily, multi-spectral satellite observations of 75 selected areas of interest distributed over the g…

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022, evaluation webpage: https://codalab.lisn.upsaclay.fr/competitions/2882

  25. arXiv:2203.10662  [pdf, other]

    cs.CV

    Lateral Ego-Vehicle Control without Supervision using Point Clouds

    Authors: Florian Müller, Qadeer Khan, Daniel Cremers

    Abstract: Existing vision-based supervised approaches to lateral vehicle control are capable of directly mapping RGB images to the appropriate steering commands. However, they are prone to suffering from inadequate robustness in real-world scenarios due to a lack of failure cases in the training data. In this paper, a framework for training a more robust and scalable model for lateral vehicle control is pro…

    Submitted 20 March, 2022; originally announced March 2022.

  26. arXiv:2203.07967  [pdf, other]

    cs.CV cs.LG

    Intrinsic Neural Fields: Learning Functions on Manifolds

    Authors: Lukas Koestler, Daniel Grittner, Michael Moeller, Daniel Cremers, Zorah Lähner

    Abstract: Neural fields have gained significant attention in the computer vision community due to their excellent performance in novel view synthesis, geometry reconstruction, and generative modeling. Some of their advantages are a sound theoretic foundation and an easy implementation in current deep learning frameworks. While neural fields have been applied to signals on manifolds, e.g., for texture recons…

    Submitted 23 March, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

  27. arXiv:2203.01087  [pdf, other]

    cs.CV cs.AI

    Vision-based Large-scale 3D Semantic Mapping for Autonomous Driving Applications

    Authors: Qing Cheng, Niclas Zeller, Daniel Cremers

    Abstract: In this paper, we present a complete pipeline for 3D semantic mapping solely based on a stereo camera system. The pipeline comprises a direct sparse visual odometry front-end as well as a back-end for global optimization including GNSS integration, and semantic 3D point cloud labeling. We propose a simple but effective temporal voting scheme which improves the quality and consistency of the 3D poi…

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: ICRA 2022 Contributed paper

    ACM Class: I.2.10; I.4.5; I.4.6; I.4.8; I.5.4

  28. DM-VIO: Delayed Marginalization Visual-Inertial Odometry

    Authors: Lukas von Stumberg, Daniel Cremers

    Abstract: We present DM-VIO, a monocular visual-inertial odometry system based on two novel techniques called delayed marginalization and pose graph bundle adjustment. DM-VIO performs photometric bundle adjustment with a dynamic weight for visual residuals. We adopt marginalization, which is a popular strategy to keep the update time constrained, but it cannot easily be reversed, and linearization points of…

    Submitted 11 January, 2022; originally announced January 2022.

  29. arXiv:2112.04165  [pdf, other]

    cs.CV cs.GR cs.LG math.OC

    Shortest Paths in Graphs with Matrix-Valued Edges: Concepts, Algorithm and Application to 3D Multi-Shape Analysis

    Authors: Viktoria Ehm, Daniel Cremers, Florian Bernard

    Abstract: Finding shortest paths in a graph is relevant for numerous problems in computer vision and graphics, including image segmentation, shape matching, or the computation of geodesic distances on discrete surfaces. Traditionally, the concept of a shortest path is considered for graphs with scalar edge weights, which makes it possible to compute the length of a path by adding up the individual edge weig…

    Submitted 8 December, 2021; originally announced December 2021.

    Comments: published at 3DV

  30. arXiv:2111.13652  [pdf, other]

    cs.CV

    Gradient-SDF: A Semi-Implicit Surface Representation for 3D Reconstruction

    Authors: Christiane Sommer, Lu Sang, David Schubert, Daniel Cremers

    Abstract: We present Gradient-SDF, a novel representation for 3D geometry that combines the advantages of implicit and explicit representations. By storing at every voxel both the signed distance field and its gradient vector field, we enhance the capability of implicit representations with approaches originally formulated for explicit surfaces. As concrete examples, we show that (1) the Gradient-SDF…

    Submitted 26 November, 2021; originally announced November 2021.

    Comments: First two authors contributed equally

  31. arXiv:2111.07418  [pdf, other]

    cs.CV cs.RO

    TANDEM: Tracking and Dense Mapping in Real-time using Deep Multi-view Stereo

    Authors: Lukas Koestler, Nan Yang, Niclas Zeller, Daniel Cremers

    Abstract: In this paper, we present TANDEM, a real-time monocular tracking and dense mapping framework. For pose estimation, TANDEM performs photometric bundle adjustment based on a sliding window of keyframes. To increase the robustness, we propose a novel tracking front-end that performs dense direct image alignment using depth maps rendered from a global model that is built incrementally from dense depth…

    Submitted 14 November, 2021; originally announced November 2021.

    Comments: CoRL 2021. The manuscript contains the main paper and the supplementary materials. Project page: https://go.vision.in.tum.de/tandem

  32. arXiv:2110.04015  [pdf, other]

    cs.CV

    Multidirectional Conjugate Gradients for Scalable Bundle Adjustment

    Authors: Simon Weber, Nikolaus Demmel, Daniel Cremers

    Abstract: We revisit the problem of large-scale bundle adjustment and propose a technique called Multidirectional Conjugate Gradients that accelerates the solution of the normal equation by up to 61%. The key idea is that we enlarge the search space of classical preconditioned conjugate gradients to include multiple search directions. As a consequence, the resulting algorithm requires fewer iterations, lead…

    Submitted 8 October, 2021; originally announced October 2021.

  33. arXiv:2110.00053  [pdf, other]

    math.OC cs.CV cs.LG stat.ML

    Sparse Quadratic Optimisation over the Stiefel Manifold with Application to Permutation Synchronisation

    Authors: Florian Bernard, Daniel Cremers, Johan Thunberg

    Abstract: We address the non-convex optimisation problem of finding a sparse matrix on the Stiefel manifold (matrices with mutually orthogonal columns of unit length) that maximises (or minimises) a quadratic objective function. Optimisation problems on the Stiefel manifold occur, for example, in spectral relaxations of various combinatorial problems, such as graph matching, clustering, or permutation synchro…

    Submitted 30 September, 2021; originally announced October 2021.

    Comments: To appear at NeurIPS 2021

  34. arXiv:2109.11398  [pdf, other]

    cs.CV

    Scene Graph Generation for Better Image Captioning?

    Authors: Maximilian Mozes, Martin Schmitt, Vladimir Golkov, Hinrich Schütze, Daniel Cremers

    Abstract: We investigate the incorporation of visual relationships into the task of supervised image caption generation by proposing a model that leverages detected objects and auto-generated visual relationships to describe images in natural language. To do so, we first generate a scene graph from raw image pixels by identifying individual objects and visual relationships between them. This scene graph the…

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: Technical report. This work was done and the paper was written in 2019

  35. arXiv:2109.05509  [pdf, other]

    cs.RO cs.CV

    Towards Robust Monocular Visual Odometry for Flying Robots on Planetary Missions

    Authors: Martin Wudenka, Marcus G. Müller, Nikolaus Demmel, Armin Wedler, Rudolph Triebel, Daniel Cremers, Wolfgang Stürzl

    Abstract: In the future, extraterrestrial expeditions will not only be conducted by rovers but also by flying robots. The technical demonstration drone Ingenuity, which just landed on Mars, will mark the beginning of a new era of exploration unhindered by terrain traversability. Robust self-localization is crucial for that. Cameras that are lightweight, cheap and information-rich sensors are already used to…

    Submitted 12 September, 2021; originally announced September 2021.

    Comments: Accepted to IROS 2021. Updated version corresponding to IROS camera-ready. The source code is publicly available at: https://github.com/DLR-RM/granite

  36. Square Root Marginalization for Sliding-Window Bundle Adjustment

    Authors: Nikolaus Demmel, David Schubert, Christiane Sommer, Daniel Cremers, Vladyslav Usenko

    Abstract: In this paper we propose a novel square root sliding-window bundle adjustment suitable for real-time odometry applications. The square root formulation pervades three major aspects of our optimization-based sliding-window estimator: for bundle adjustment we eliminate landmark variables with nullspace projection; to store the marginalization prior we employ a matrix square root of the Hessian; and…

    Submitted 5 September, 2021; originally announced September 2021.

    Comments: to be published at ICCV 2021; camera ready version

  37. arXiv:2109.01461  [pdf, other]

    cs.LG cs.NE

    Dive into Layers: Neural Network Capacity Bounding using Algebraic Geometry

    Authors: Ji Yang, Lu Sang, Daniel Cremers

    Abstract: Empirical results suggest that the learnability of a neural network is directly related to its size. To mathematically prove this, we borrow a tool from topological algebra, Betti numbers, to measure the topological geometric complexity of input data and the neural network. By characterizing the expressive capacity of a neural network with its topological complexity, we conduct a thorough analysi…

    Submitted 4 November, 2021; v1 submitted 3 September, 2021; originally announced September 2021.

  38. arXiv:2108.07329  [pdf, other]

    cs.CV

    TUM-VIE: The TUM Stereo Visual-Inertial Event Dataset

    Authors: Simon Klenk, Jason Chui, Nikolaus Demmel, Daniel Cremers

    Abstract: Event cameras are bio-inspired vision sensors which measure per-pixel brightness changes. They offer numerous benefits over traditional, frame-based cameras, including low latency, high dynamic range, high temporal resolution and low power consumption. Thus, these sensors are suited for robotics and virtual reality applications. To foster the development of 3D perception and navigation algorithms…

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: Accepted to IROS 2021. First two authors contributed equally

  39. arXiv:2107.13059  [pdf, other]

    cs.LG stat.ML

    Explicit Pairwise Factorized Graph Neural Network for Semi-Supervised Node Classification

    Authors: Yu Wang, Yuesong Shen, Daniel Cremers

    Abstract: Node features and structural information of a graph are both crucial for semi-supervised node classification problems. A variety of graph neural network (GNN) based approaches have been proposed to tackle these problems, which typically determine output labels through feature aggregation. This can be problematic, as it implies conditional independence of output nodes given hidden representations,…

    Submitted 27 July, 2021; originally announced July 2021.

  40. arXiv:2107.06028  [pdf, other]

    math.OC cs.CV

    Lifting the Convex Conjugate in Lagrangian Relaxations: A Tractable Approach for Continuous Markov Random Fields

    Authors: Hartmut Bauermeister, Emanuel Laude, Thomas Möllenhoff, Michael Moeller, Daniel Cremers

    Abstract: Dual decomposition approaches in nonconvex optimization may suffer from a duality gap. This poses a challenge when applying them directly to nonconvex problems such as MAP-inference in a Markov random field (MRF) with continuous state spaces. To eliminate such gaps, this paper considers a reformulation of the original nonconvex task in the space of measures. This infinite-dimensional reformulation…

    Submitted 16 May, 2022; v1 submitted 13 July, 2021; originally announced July 2021.

  41. arXiv:2107.04536  [pdf, other]

    cs.CV

    Event-Based Feature Tracking in Continuous Time with Sliding Window Optimization

    Authors: Jason Chui, Simon Klenk, Daniel Cremers

    Abstract: We propose a novel method for continuous-time feature tracking in event cameras. To this end, we track features by aligning events along an estimated trajectory in space-time such that the projection on the image plane results in maximally sharp event patch images. The trajectory is parameterized by $n^{th}$ order B-splines, which are continuous up to the $(n-2)^{th}$ derivative. In contrast to previo…

    Submitted 9 July, 2021; originally announced July 2021.

    Comments: 9 pages, 4 figures, 1 table

  42. arXiv:2106.09748  [pdf, other]

    cs.CV

    DeepLab2: A TensorFlow Library for Deep Labeling

    Authors: Mark Weber, Huiyu Wang, Siyuan Qiao, Jun Xie, Maxwell D. Collins, Yukun Zhu, Liangzhe Yuan, Dahun Kim, Qihang Yu, Daniel Cremers, Laura Leal-Taixe, Alan L. Yuille, Florian Schroff, Hartwig Adam, Liang-Chieh Chen

    Abstract: DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a state-of-the-art and easy-to-use TensorFlow codebase for general dense pixel prediction problems in computer vision. DeepLab2 includes all our recently developed DeepLab model variants with pretrained checkpoints as well as model training and evaluation code, allowing the community to reproduce and further improve upon the sta…

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 4-page technical report. The first three authors contributed equally to this work

  43. arXiv:2106.09431  [pdf, other]

    cs.CV

    NeuroMorph: Unsupervised Shape Interpolation and Correspondence in One Go

    Authors: Marvin Eisenberger, David Novotny, Gael Kerchenbaum, Patrick Labatut, Natalia Neverova, Daniel Cremers, Andrea Vedaldi

    Abstract: We present NeuroMorph, a new neural network architecture that takes as input two 3D shapes and produces in one go, i.e. in a single feed forward pass, a smooth interpolation and point-to-point correspondences between them. The interpolation, expressed as a deformation field, changes the pose of the source shape to resemble the target, but leaves the object identity unchanged. NeuroMorph uses an el…

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: Published at the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2021

  44. arXiv:2103.17229  [pdf, other]

    cs.CV cs.LG

    Joint Deep Multi-Graph Matching and 3D Geometry Learning from Inhomogeneous 2D Image Collections

    Authors: Zhenzhang Ye, Tarun Yenamandra, Florian Bernard, Daniel Cremers

    Abstract: Graph matching aims to establish correspondences between vertices of graphs such that both the node and edge attributes agree. Various learning-based methods were recently proposed for finding correspondences between image key points based on deep graph matching formulations. While these approaches mainly focus on learning node and edge attributes, they completely ignore the 3D geometry of the und…

    Submitted 7 May, 2022; v1 submitted 31 March, 2021; originally announced March 2021.

    Comments: Presented at the 36th AAAI conference on artificial intelligence, 2022 (AAAI22)

  45. arXiv:2103.11204  [pdf, other]

    cs.CV

    Self-Supervised Steering Angle Prediction for Vehicle Control Using Visual Odometry

    Authors: Qadeer Khan, Patrick Wenzel, Daniel Cremers

    Abstract: Vision-based learning methods for self-driving cars have primarily used supervised approaches that require a large number of labels for training. However, those labels are usually difficult and expensive to obtain. In this paper, we demonstrate how a model can be trained to control a vehicle's trajectory using camera poses estimated through visual odometry methods in an entirely self-supervised fa…

    Submitted 20 March, 2021; originally announced March 2021.

    Comments: Accepted at International Conference on Artificial Intelligence and Statistics (AISTATS), 2021

  46. arXiv:2103.04727  [pdf, other]

    cs.LG cs.CV cs.RO

    Vision-Based Mobile Robotics Obstacle Avoidance With Deep Reinforcement Learning

    Authors: Patrick Wenzel, Torsten Schön, Laura Leal-Taixé, Daniel Cremers

    Abstract: Obstacle avoidance is a fundamental and challenging problem for autonomous navigation of mobile robots. In this paper, we consider the problem of obstacle avoidance in simple 3D environments where the robot has to solely rely on a single monocular camera. In particular, we are interested in solving this problem without relying on localization, mapping, or planning techniques. Most of the existing…

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: Accepted at 2021 IEEE International Conference on Robotics and Automation (ICRA)

  47. Square Root Bundle Adjustment for Large-Scale Reconstruction

    Authors: Nikolaus Demmel, Christiane Sommer, Daniel Cremers, Vladyslav Usenko

    Abstract: We propose a new formulation for the bundle adjustment problem which relies on nullspace marginalization of landmark variables by QR decomposition. Our approach, which we call square root bundle adjustment, is algebraically equivalent to the commonly used Schur complement trick, improves the numeric stability of computations, and allows for solving large-scale bundle adjustment problems with singl…

    Submitted 30 March, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: Accepted to CVPR 2021. Updated version corresponding to CVPR camera-ready. Formatting changes and minor tweaks to fit page requirements

  48. arXiv:2102.12182  [pdf, other]

    cs.LG cs.AI

    Parameterized Temperature Scaling for Boosting the Expressive Power in Post-Hoc Uncertainty Calibration

    Authors: Christian Tomani, Daniel Cremers, Florian Buettner

    Abstract: We address the problem of uncertainty calibration and introduce a novel calibration method, Parametrized Temperature Scaling (PTS). Standard deep neural networks typically yield uncalibrated predictions, which can be transformed into calibrated confidence scores using post-hoc calibration methods. In this contribution, we demonstrate that the performance of accuracy-preserving state-of-the-art pos…

    Submitted 17 September, 2022; v1 submitted 24 February, 2021; originally announced February 2021.

    Comments: In Proceedings of the European Conference on Computer Vision (ECCV), 2022. Code available at https://github.com/tochris/pts-uncertainty

  49. arXiv:2102.11859  [pdf, other]

    cs.CV

    STEP: Segmenting and Tracking Every Pixel

    Authors: Mark Weber, Jun Xie, Maxwell Collins, Yukun Zhu, Paul Voigtlaender, Hartwig Adam, Bradley Green, Andreas Geiger, Bastian Leibe, Daniel Cremers, Aljoša Ošep, Laura Leal-Taixé, Liang-Chieh Chen

    Abstract: The task of assigning semantic classes and track identities to every pixel in a video is called video panoptic segmentation. Our work is the first that targets this task in a real-world setting requiring dense interpretation in both spatial and temporal domains. As the ground-truth for this task is difficult and expensive to obtain, existing datasets are either constructed synthetically or only sp…

    Submitted 7 December, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

    Comments: Accepted to NeurIPS 2021 Track on Datasets and Benchmarks. Code: https://github.com/google-research/deeplab2

  50. arXiv:2102.11192  [pdf, other]

    cs.LG physics.ao-ph

    Variational Data Assimilation with a Learned Inverse Observation Operator

    Authors: Thomas Frerix, Dmitrii Kochkov, Jamie A. Smith, Daniel Cremers, Michael P. Brenner, Stephan Hoyer

    Abstract: Variational data assimilation optimizes for an initial state of a dynamical system such that its evolution fits observational data. The physical model can subsequently be evolved into the future to make predictions. This principle is a cornerstone of large scale forecasting applications such as numerical weather prediction. As such, it is implemented in current operational systems of weather forec…

    Submitted 20 May, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

    Comments: Published at the International Conference on Machine Learning (ICML) 2021