Skip to main content

Showing 101–150 of 226 results for author: Cremers, D

.
  1. arXiv:2102.06942  [pdf, other

    cs.CV cs.LG cs.NE

    Rotation-Equivariant Deep Learning for Diffusion MRI

    Authors: Philip Müller, Vladimir Golkov, Valentina Tomassini, Daniel Cremers

    Abstract: Convolutional networks are successful, but they have recently been outperformed by new neural networks that are equivariant under rotations and translations. These new networks work better because they do not struggle with learning each possible orientation of each image feature separately. So far, they have been proposed for 2D and 3D data. Here we generalize them to 6D diffusion MRI data, ensuri… ▽ More

    Submitted 13 February, 2021; originally announced February 2021.

    Comments: 24 pages, 8 figures

  2. arXiv:2102.01191  [pdf, other

    cs.CV

    Tight Integration of Feature-based Relocalization in Monocular Direct Visual Odometry

    Authors: Mariia Gladkova, Rui Wang, Niclas Zeller, Daniel Cremers

    Abstract: In this paper we propose a framework for integrating map-based relocalization into online direct visual odometry. To achieve map-based relocalization for direct methods, we integrate image features into Direct Sparse Odometry (DSO) and rely on feature matching to associate online visual odometry (VO) with a previously built map. The integration of the relocalization poses is threefold. Firstly, th… ▽ More

    Submitted 29 March, 2021; v1 submitted 1 February, 2021; originally announced February 2021.

    Comments: ICRA 2021 camera-ready submission; 7 pages, 5 figures and 3 tables

  3. arXiv:2012.10988  [pdf, other

    cs.LG cs.AI stat.ML

    Post-hoc Uncertainty Calibration for Domain Drift Scenarios

    Authors: Christian Tomani, Sebastian Gruber, Muhammed Ebrar Erdem, Daniel Cremers, Florian Buettner

    Abstract: We address the problem of uncertainty calibration. While standard deep neural networks typically yield uncalibrated predictions, calibrated confidence scores that are representative of the true likelihood of a prediction can be achieved using post-hoc calibration methods. However, to date the focus of these approaches has been on in-domain calibration. Our contribution is two-fold. First, we show… ▽ More

    Submitted 23 June, 2021; v1 submitted 20 December, 2020; originally announced December 2020.

    Comments: In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021. Code available at https://github.com/tochris/calibration-domain-drift

  4. arXiv:2012.03345  [pdf, other

    cs.LG cs.AI

    Neural Online Graph Exploration

    Authors: Ioannis Chiotellis, Daniel Cremers

    Abstract: Can we learn how to explore unknown spaces efficiently? To answer this question, we study the problem of Online Graph Exploration, the online version of the Traveling Salesperson Problem. We reformulate graph exploration as a reinforcement learning problem and apply Direct Future Prediction (Dosovitskiy and Koltun, 2017) to solve it. As the graph is discovered online, the corresponding Markov Deci… ▽ More

    Submitted 6 April, 2021; v1 submitted 6 December, 2020; originally announced December 2020.

  5. arXiv:2012.02689  [pdf, other

    cs.CV

    Isometric Multi-Shape Matching

    Authors: Maolin Gao, Zorah Lähner, Johan Thunberg, Daniel Cremers, Florian Bernard

    Abstract: Finding correspondences between shapes is a fundamental problem in computer vision and graphics, which is relevant for many applications, including 3D reconstruction, object tracking, and style transfer. The vast majority of correspondence methods aim to find a solution between pairs of shapes, even if multiple instances of the same class are available. While isometries are often studied in shape… ▽ More

    Submitted 3 April, 2024; v1 submitted 4 December, 2020; originally announced December 2020.

  6. arXiv:2011.14143  [pdf, other

    cs.CV cs.GR cs.LG

    i3DMM: Deep Implicit 3D Morphable Model of Human Heads

    Authors: Tarun Yenamandra, Ayush Tewari, Florian Bernard, Hans-Peter Seidel, Mohamed Elgharib, Daniel Cremers, Christian Theobalt

    Abstract: We present the first deep implicit 3D morphable model (i3DMM) of full heads. Unlike earlier morphable face models it not only captures identity-specific geometry, texture, and expressions of the frontal face, but also models the entire head, including hair. We collect a new dataset consisting of 64 people with different expressions and hairstyles to train i3DMM. Our approach has the following favo… ▽ More

    Submitted 28 November, 2020; originally announced November 2020.

    Comments: Project page: http://gvv.mpi-inf.mpg.de/projects/i3DMM/

  7. Non-Rigid Puzzles

    Authors: Or Litany, Emanuele Rodolà, Alex Bronstein, Michael Bronstein, Daniel Cremers

    Abstract: Shape correspondence is a fundamental problem in computer graphics and vision, with applications in various problems including animation, texture map**, robotic vision, medical imaging, archaeology and many more. In settings where the shapes are allowed to undergo non-rigid deformations and only partial views are available, the problem becomes very challenging. To this end, we present a non-rigi… ▽ More

    Submitted 25 November, 2020; originally announced November 2020.

    Journal ref: Computer Graphics Forum, Volume 35, Issue 5, August 2016

  8. arXiv:2011.12430  [pdf, other

    cs.CV

    SOE-Net: A Self-Attention and Orientation Encoding Network for Point Cloud based Place Recognition

    Authors: Yan Xia, Yusheng Xu, Shuang Li, Rui Wang, Juan Du, Daniel Cremers, Uwe Stilla

    Abstract: We tackle the problem of place recognition from point cloud data and introduce a self-attention and orientation encoding network (SOE-Net) that fully explores the relationship between points and incorporates long-range context into point-wise local descriptors. Local information of each point from eight orientations is captured in a PointOE module, whereas long-range feature dependencies among loc… ▽ More

    Submitted 23 May, 2021; v1 submitted 24 November, 2020; originally announced November 2020.

    Comments: Accepted by CVPR2021 (Oral)

  9. arXiv:2011.11814  [pdf, other

    cs.CV

    MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera

    Authors: Felix Wimbauer, Nan Yang, Lukas von Stumberg, Niclas Zeller, Daniel Cremers

    Abstract: In this paper, we propose MonoRec, a semi-supervised monocular dense reconstruction architecture that predicts depth maps from a single moving camera in dynamic environments. MonoRec is based on a multi-view stereo setting which encodes the information of multiple consecutive images in a cost volume. To deal with dynamic objects in the scene, we introduce a MaskModule that predicts moving object m… ▽ More

    Submitted 6 May, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: CVPR 2021, Project page with video can be found under https://vision.in.tum.de/research/monorec. 14 pages, 10 figures, 5 tables

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 6112-6122

  10. arXiv:2010.15261  [pdf, other

    cs.CV

    Deep Shells: Unsupervised Shape Correspondence with Optimal Transport

    Authors: Marvin Eisenberger, Aysim Toker, Laura Leal-Taixé, Daniel Cremers

    Abstract: We propose a novel unsupervised learning approach to 3D shape correspondence that builds a multiscale matching pipeline into a deep neural network. This approach is based on smooth shells, the current state-of-the-art axiomatic correspondence method, which requires an a priori stochastic search over the space of initial poses. Our goal is to replace this costly preprocessing step by directly learn… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

  11. arXiv:2010.15084  [pdf, other

    eess.AS cs.SD

    Speech Synthesis and Control Using Differentiable DSP

    Authors: Giorgio Fabbro, Vladimir Golkov, Thomas Kemp, Daniel Cremers

    Abstract: Modern text-to-speech systems are able to produce natural and high-quality speech, but speech contains factors of variation (e.g. pitch, rhythm, loudness, timbre)\ that text alone cannot contain. In this work we move towards a speech synthesis system that can produce diverse speech renditions of a text by allowing (but not requiring) explicit control over the various factors of variation. We propo… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: 6 pages, 3 figures, for associated audio files, see https://thesmith1.github.io/DDSPeech/

  12. arXiv:2010.12682  [pdf, other

    cs.CV

    Unsupervised Dense Shape Correspondence using Heat Kernels

    Authors: Mehmet Aygün, Zorah Lähner, Daniel Cremers

    Abstract: In this work, we propose an unsupervised method for learning dense correspondences between shapes using a recent deep functional map framework. Instead of depending on ground-truth correspondences or the computationally expensive geodesic distances, we use heat kernels. These can be computed quickly during training as the supervisor signal. Moreover, we propose a curriculum learning strategy using… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: In International Conference on 3D Vision (3DV), 2020

  13. arXiv:2010.07548  [pdf, other

    cs.CV

    MOTChallenge: A Benchmark for Single-Camera Multiple Target Tracking

    Authors: Patrick Dendorfer, Aljoša Ošep, Anton Milan, Konrad Schindler, Daniel Cremers, Ian Reid, Stefan Roth, Laura Leal-Taixé

    Abstract: Standardized benchmarks have been crucial in pushing the performance of computer vision algorithms, especially since the advent of deep learning. Although leaderboards should not be over-claimed, they often provide the most objective measure of performance and are therefore important guides for research. We present MOTChallenge, a benchmark for single-camera Multiple Object Tracking (MOT) launched… ▽ More

    Submitted 8 December, 2020; v1 submitted 15 October, 2020; originally announced October 2020.

    Comments: Accepted at IJCV

  14. arXiv:2010.06323  [pdf, other

    cs.CV

    LM-Reloc: Levenberg-Marquardt Based Direct Visual Relocalization

    Authors: Lukas von Stumberg, Patrick Wenzel, Nan Yang, Daniel Cremers

    Abstract: We present LM-Reloc -- a novel approach for visual relocalization based on direct image alignment. In contrast to prior works that tackle the problem with a feature-based formulation, the proposed method does not rely on feature matching and RANSAC. Hence, the method can utilize not only corners but any region of the image with gradients. In particular, we propose a loss formulation inspired by th… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: International Conference on 3D Vision (3DV), 2020

  15. arXiv:2010.03506  [pdf, other

    cs.CV

    Learning Monocular 3D Vehicle Detection without 3D Bounding Box Labels

    Authors: L. Koestler, N. Yang, R. Wang, D. Cremers

    Abstract: The training of deep-learning-based 3D object detectors requires large datasets with 3D bounding box labels for supervision that have to be generated by hand-labeling. We propose a network architecture and training procedure for learning monocular 3D object detection without 3D bounding box labels. By representing the objects as triangular meshes and employing differentiable shape rendering, we de… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

  16. arXiv:2009.06364  [pdf, other

    cs.CV

    4Seasons: A Cross-Season Dataset for Multi-Weather SLAM in Autonomous Driving

    Authors: Patrick Wenzel, Rui Wang, Nan Yang, Qing Cheng, Qadeer Khan, Lukas von Stumberg, Niclas Zeller, Daniel Cremers

    Abstract: We present a novel dataset covering seasonal and challenging perceptual conditions for autonomous driving. Among others, it enables research on visual odometry, global place recognition, and map-based re-localization tracking. The data was collected in different scenarios and under a wide variety of weather conditions and illuminations, including day and night. This resulted in more than 350 km of… ▽ More

    Submitted 14 October, 2020; v1 submitted 14 September, 2020; originally announced September 2020.

    Comments: German Conference on Pattern Recognition (GCPR 2020)

  17. arXiv:2007.09217  [pdf, other

    cs.CV

    DH3D: Deep Hierarchical 3D Descriptors for Robust Large-Scale 6DoF Relocalization

    Authors: Juan Du, Rui Wang, Daniel Cremers

    Abstract: For relocalization in large-scale point clouds, we propose the first approach that unifies global place recognition and local 6DoF pose refinement. To this end, we design a Siamese network that jointly learns 3D local feature detection and description directly from raw 3D points. It integrates FlexConv and Squeeze-and-Excitation (SE) to assure that the learned local descriptor captures multi-level… ▽ More

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: ECCV 2020, sportlight

  18. arXiv:2007.07029  [pdf, ps, other

    q-bio.BM cs.LG q-bio.QM stat.ML

    Deep Learning for Virtual Screening: Five Reasons to Use ROC Cost Functions

    Authors: Vladimir Golkov, Alexander Becker, Daniel T. Plop, Daniel Čuturilo, Neda Davoudi, Jeffrey Mendenhall, Rocco Moretti, Jens Meiler, Daniel Cremers

    Abstract: Computer-aided drug discovery is an essential component of modern drug development. Therein, deep learning has become an important tool for rapid screening of billions of molecules in silico for potential hits containing desired chemical features. Despite its importance, substantial challenges persist in training these models, such as severe class imbalance, high decision thresholds, and lack of g… ▽ More

    Submitted 25 June, 2020; originally announced July 2020.

    Comments: 10 pages

    MSC Class: 68T07 (Primary) 62H30; 92E99; 68T10; 62F07 (Secondary) ACM Class: G.3; I.2.1; I.2.6; I.5.1; J.3

  19. arXiv:2006.16856  [pdf, other

    cs.LG stat.ML

    A Chain Graph Interpretation of Real-World Neural Networks

    Authors: Yuesong Shen, Daniel Cremers

    Abstract: The last decade has witnessed a boom of deep learning research and applications achieving state-of-the-art results in various domains. However, most advances have been established empirically, and their theoretical analysis remains lacking. One major issue is that our current interpretation of neural networks (NNs) as function approximators is too generic to support in-depth analysis. In this pape… ▽ More

    Submitted 6 October, 2020; v1 submitted 30 June, 2020; originally announced June 2020.

  20. arXiv:2006.12456  [pdf, other

    cs.LG cs.CV stat.ML

    Effective Version Space Reduction for Convolutional Neural Networks

    Authors: Jiayu Liu, Ioannis Chiotellis, Rudolph Triebel, Daniel Cremers

    Abstract: In active learning, sampling bias could pose a serious inconsistency problem and hinder the algorithm from finding the optimal hypothesis. However, many methods for neural networks are hypothesis space agnostic and do not address this problem. We examine active learning with convolutional neural networks through the principled lens of version space reduction. We identify the connection between two… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.

    Comments: 22 pages, 8 figures, to be published in the Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) 2020

    ACM Class: I.2.6; G.3; I.5.1

  21. PrimiTect: Fast Continuous Hough Voting for Primitive Detection

    Authors: Christiane Sommer, Yumin Sun, Erik Bylow, Daniel Cremers

    Abstract: This paper tackles the problem of data abstraction in the context of 3D point sets. Our method classifies points into different geometric primitives, such as planes and cones, leading to a compact representation of the data. Being based on a semi-global Hough voting scheme, the method does not need initialization and is robust, accurate, and efficient. We use a local, low-dimensional parameterizat… ▽ More

    Submitted 15 May, 2020; originally announced May 2020.

    Comments: Accepted to IEEE International Conference on Robotics and Automation (ICRA), 2020 | Code: https://github.com/c-sommer/primitect

  22. arXiv:2004.05199  [pdf, other

    cs.CV

    Hamiltonian Dynamics for Real-World Shape Interpolation

    Authors: Marvin Eisenberger, Daniel Cremers

    Abstract: We revisit the classical problem of 3D shape interpolation and propose a novel, physically plausible approach based on Hamiltonian dynamics. While most prior work focuses on synthetic input shapes, our formulation is designed to be applicable to real-world scans with imperfect input correspondences and various types of noise. To that end, we use recent progress on dynamic thin shell simulation and… ▽ More

    Submitted 10 April, 2020; originally announced April 2020.

  23. arXiv:2003.09003  [pdf, other

    cs.CV

    MOT20: A benchmark for multi object tracking in crowded scenes

    Authors: Patrick Dendorfer, Hamid Rezatofighi, Anton Milan, Javen Shi, Daniel Cremers, Ian Reid, Stefan Roth, Konrad Schindler, Laura Leal-Taixé

    Abstract: Standardized benchmarks are crucial for the majority of computer vision applications. Although leaderboards and ranking tables should not be over-claimed, benchmarks often provide the most objective measure of performance and are therefore important guides for research. The benchmark for Multiple Object Tracking, MOTChallenge, was launched with the goal to establish a standardized evaluation of mu… ▽ More

    Submitted 19 March, 2020; originally announced March 2020.

    Comments: The sequences of the new MOT20 benchmark were previously presented in the CVPR 2019 tracking challenge ( arXiv:1906.04567 ). The differences between the two challenges are: - New and corrected annotations - New sequences, as we had to crop and transform some old sequences to achieve higher quality in the annotations. - New baselines evaluations and different sets of public detections

  24. arXiv:2003.01060  [pdf, other

    cs.CV cs.AI

    D3VO: Deep Depth, Deep Pose and Deep Uncertainty for Monocular Visual Odometry

    Authors: Nan Yang, Lukas von Stumberg, Rui Wang, Daniel Cremers

    Abstract: We propose D3VO as a novel framework for monocular visual odometry that exploits deep networks on three levels -- deep depth, pose and uncertainty estimation. We first propose a novel self-supervised monocular depth estimation network trained on stereo videos without any external supervision. In particular, it aligns the training image pairs into similar lighting condition with predictive brightne… ▽ More

    Submitted 28 March, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: CVPR 2020

  25. arXiv:2002.12236  [pdf, ps, other

    math.OC cs.CV

    Optimization of Graph Total Variation via Active-Set-based Combinatorial Reconditioning

    Authors: Zhenzhang Ye, Thomas Möllenhoff, Tao Wu, Daniel Cremers

    Abstract: Structured convex optimization on weighted graphs finds numerous applications in machine learning and computer vision. In this work, we propose a novel adaptive preconditioning strategy for proximal algorithms on this problem class. Our preconditioner is driven by a sharp analysis of the local linear convergence rate depending on the "active set" at the current iterate. We show that nested-forest… ▽ More

    Submitted 27 February, 2020; originally announced February 2020.

    Comments: Presented at the 23 rd International Conference on Artificial Intelligence and Statistics (AISTATS) 2020. Code: https://github.com/zhenzhangye/graph_TV_recond

  26. arXiv:2001.11845  [pdf, other

    cs.CV cs.LG

    Learn to Predict Sets Using Feed-Forward Neural Networks

    Authors: Hamid Rezatofighi, Tianyu Zhu, Roman Kaskman, Farbod T. Motlagh, Qinfeng Shi, Anton Milan, Daniel Cremers, Laura Leal-Taixé, Ian Reid

    Abstract: This paper addresses the task of set prediction using deep feed-forward neural networks. A set is a collection of elements which is invariant under permutation and the size of a set is not fixed in advance. Many real-world problems, such as image tagging and object detection, have outputs that are naturally expressed as sets of entities. This creates a challenge for traditional deep neural network… ▽ More

    Submitted 25 October, 2021; v1 submitted 29 January, 2020; originally announced January 2020.

    Comments: Accepted in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2022. arXiv admin note: substantial text overlap with arXiv:1805.00613

  27. From Planes to Corners: Multi-Purpose Primitive Detection in Unorganized 3D Point Clouds

    Authors: Christiane Sommer, Yumin Sun, Leonidas Guibas, Daniel Cremers, Tolga Birdal

    Abstract: We propose a new method for segmentation-free joint estimation of orthogonal planes, their intersection lines, relationship graph and corners lying at the intersection of three orthogonal planes. Such unified scene exploration under orthogonality allows for multitudes of applications such as semantic plane detection or local and global scan alignment, which in turn can aid robot localization or gr… ▽ More

    Submitted 24 April, 2020; v1 submitted 21 January, 2020; originally announced January 2020.

    Comments: Accepted to IEEE Robotics and Automation Letters 2020 | Video: https://youtu.be/nHWJrA6RcB0 | Code: https://github.com/c-sommer/orthogonal-planes

    Journal ref: IEEE Robotics and Automation Letters 5(2) 2020, 1764-1771

  28. arXiv:1912.06501  [pdf, other

    cs.CV

    Inferring Super-Resolution Depth from a Moving Light-Source Enhanced RGB-D Sensor: A Variational Approach

    Authors: Lu Sang, Bjoern Haefner, Daniel Cremers

    Abstract: A novel approach towards depth map super-resolution using multi-view uncalibrated photometric stereo is presented. Practically, an LED light source is attached to a commodity RGB-D sensor and is used to capture objects from multiple viewpoints with unknown motion. This non-static camera-to-object setup is described with a nonconvex variational approach such that no calibration on lighting or camer… ▽ More

    Submitted 13 December, 2019; originally announced December 2019.

    Comments: WACV2020 conference paper

  29. arXiv:1912.02160  [pdf, other

    cs.LG stat.ML

    Informative GANs via Structured Regularization of Optimal Transport

    Authors: Pierre Bréchet, Tao Wu, Thomas Möllenhoff, Daniel Cremers

    Abstract: We tackle the challenge of disentangled representation learning in generative adversarial networks (GANs) from the perspective of regularized optimal transport (OT). Specifically, a smoothed OT loss gives rise to an implicit transportation plan between the latent space and the data space. Based on this theoretical observation, we exploit a structured regularization on the transportation plan to en… ▽ More

    Submitted 4 December, 2019; originally announced December 2019.

    Comments: Presented at the Optimal Transport and Machine Learning Workshop, NeurIPS 2019

  30. Efficient Derivative Computation for Cumulative B-Splines on Lie Groups

    Authors: Christiane Sommer, Vladyslav Usenko, David Schubert, Nikolaus Demmel, Daniel Cremers

    Abstract: Continuous-time trajectory representation has recently gained popularity for tasks where the fusion of high-frame-rate sensors and multiple unsynchronized devices is required. Lie group cumulative B-splines are a popular way of representing continuous trajectories without singularities. They have been used in near real-time SLAM and odometry systems with IMU, LiDAR, regular, RGB-D and event camera… ▽ More

    Submitted 30 May, 2020; v1 submitted 20 November, 2019; originally announced November 2019.

    Comments: First two authors contributed equally

  31. arXiv:1911.07268  [pdf, other

    cs.CV

    On the well-posedness of uncalibrated photometric stereo under general lighting

    Authors: Mohammed Brahimi, Yvain Quéau, Bjoern Haefner, Daniel Cremers

    Abstract: Uncalibrated photometric stereo aims at estimating the 3D-shape of a surface, given a set of images captured from the same viewing angle, but under unknown, varying illumination. While the theoretical foundations of this inverse problem under directional lighting are well-established, there is a lack of mathematical evidence for the uniqueness of a solution under general lighting. On the other han… ▽ More

    Submitted 16 September, 2020; v1 submitted 17 November, 2019; originally announced November 2019.

  32. Rolling-Shutter Modelling for Direct Visual-Inertial Odometry

    Authors: David Schubert, Nikolaus Demmel, Lukas von Stumberg, Vladyslav Usenko, Daniel Cremers

    Abstract: We present a direct visual-inertial odometry (VIO) method which estimates the motion of the sensor setup and sparse 3D geometry of the environment based on measurements from a rolling-shutter camera and an inertial measurement unit (IMU). The visual part of the system performs a photometric bundle adjustment on a sparse set of points. This direct approach does not extract feature points and is a… ▽ More

    Submitted 3 November, 2019; originally announced November 2019.

  33. arXiv:1910.14594  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Deep Learning for 2D and 3D Rotatable Data: An Overview of Methods

    Authors: Luca Della Libera, Vladimir Golkov, Yue Zhu, Arman Mielke, Daniel Cremers

    Abstract: Convolutional networks are successful due to their equivariance/invariance under translations. However, rotatable data such as images, volumes, shapes, or point clouds require processing with equivariance/invariance under rotations in cases where the rotational orientation of the coordinate system does not affect the meaning of the data (e.g. object classification). On the other hand, estimation/p… ▽ More

    Submitted 22 November, 2021; v1 submitted 31 October, 2019; originally announced October 2019.

    Comments: Improved Definition 1, improved and merged Sections 3.3-3.4, minor additional changes

    MSC Class: 62M45; 68T45; 62H35; 65D18; 68U10 ACM Class: I.2.6; I.5.1; G.3

  34. arXiv:1910.06632  [pdf, other

    cs.RO cs.CV

    Multi-Frame GAN: Image Enhancement for Stereo Visual Odometry in Low Light

    Authors: Eunah Jung, Nan Yang, Daniel Cremers

    Abstract: We propose the concept of a multi-frame GAN (MFGAN) and demonstrate its potential as an image sequence enhancement for stereo visual odometry in low light conditions. We base our method on an invertible adversarial network to transfer the beneficial features of brightly illuminated scenes to the sequence in poor illumination without costly paired datasets. In order to preserve the coherent geometr… ▽ More

    Submitted 15 October, 2019; originally announced October 2019.

    Comments: Accepted by the 3rd Conference on Robot Learning, Osaka, Japan (CoRL 2019). The first two authors contributed equally to this paper

  35. arXiv:1910.03638  [pdf, other

    math.OC cs.CV cs.IR cs.LG

    Bregman Proximal Framework for Deep Linear Neural Networks

    Authors: Mahesh Chandra Mukkamala, Felix Westerkamp, Emanuel Laude, Daniel Cremers, Peter Ochs

    Abstract: A typical assumption for the analysis of first order optimization methods is the Lipschitz continuity of the gradient of the objective function. However, for many practical applications this assumption is violated, including loss functions in deep learning. To overcome this issue, certain extensions based on generalized proximity measures known as Bregman distances were introduced. This initiated… ▽ More

    Submitted 8 October, 2019; originally announced October 2019.

    Comments: 34 pages, 54 images

    MSC Class: 90C26; 26B25; 90C30; 49M27; 47J25; 65K05; 65F22

  36. Sparse Surface Constraints for Combining Physics-based Elasticity Simulation and Correspondence-Free Object Reconstruction

    Authors: Sebastian Weiss, Robert Maier, Rüdiger Westermann, Daniel Cremers, Nils Thuerey

    Abstract: We address the problem to infer physical material parameters and boundary conditions from the observed motion of a homogeneous deformable object via the solution of an inverse problem. Parameters are estimated from potentially unreliable real-world data sources such as sparse observations without correspondences. We introduce a novel Lagrangian-Eulerian optimization formulation, including a cost f… ▽ More

    Submitted 4 October, 2019; originally announced October 2019.

    ACM Class: I.6

  37. arXiv:1908.03776  [pdf, other

    math.NA

    Lifting methods for manifold-valued variational problems

    Authors: Thomas Vogt, Evgeny Strekalovskiy, Daniel Cremers, Jan Lellmann

    Abstract: Lifting methods allow to transform hard variational problems such as segmentation and optical flow estimation into convex problems in a suitable higher-dimensional space. The lifted models can then be efficiently solved to a global optimum, which allows to find approximate global minimizers of the original problem. Recently, these techniques have also been applied to problems with values in a mani… ▽ More

    Submitted 10 August, 2019; originally announced August 2019.

    Comments: In press as part of a Springer Handbook

  38. arXiv:1907.11025  [pdf, other

    cs.LG cs.CV cs.RO stat.ML

    Towards Generalizing Sensorimotor Control Across Weather Conditions

    Authors: Qadeer Khan, Patrick Wenzel, Daniel Cremers, Laura Leal-Taixé

    Abstract: The ability of deep learning models to generalize well across different scenarios depends primarily on the quality and quantity of annotated data. Labeling large amounts of data for all possible scenarios that a model may encounter would not be feasible; if even possible. We propose a framework to deal with limited labeled training data and demonstrate it on the application of vision-based vehicle… ▽ More

    Submitted 25 July, 2019; originally announced July 2019.

    Comments: Accepted for publication in 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  39. arXiv:1907.04306  [pdf, other

    math.OC

    Bregman Proximal Map**s and Bregman-Moreau Envelopes under Relative Prox-Regularity

    Authors: Emanuel Laude, Peter Ochs, Daniel Cremers

    Abstract: We systematically study the local single-valuedness of the Bregman proximal map** and local smoothness of the Bregman--Moreau envelope of a nonconvex function under relative prox-regularity - an extension of prox-regularity - which was originally introduced by Poliquin and Rockafellar. As Bregman distances are asymmetric in general, in accordance with Bauschke et al., it is natural to consider t… ▽ More

    Submitted 31 January, 2020; v1 submitted 9 July, 2019; originally announced July 2019.

    Comments: This article is published in Journal of Optimization Theory and Applications

  40. arXiv:1906.04567  [pdf, other

    cs.CV cs.LG

    CVPR19 Tracking and Detection Challenge: How crowded can it get?

    Authors: Patrick Dendorfer, Hamid Rezatofighi, Anton Milan, Javen Shi, Daniel Cremers, Ian Reid, Stefan Roth, Konrad Schindler, Laura Leal-Taixe

    Abstract: Standardized benchmarks are crucial for the majority of computer vision applications. Although leaderboards and ranking tables should not be over-claimed, benchmarks often provide the most objective measure of performance and are therefore important guides for research. The benchmark for Multiple Object Tracking, MOTChallenge, was launched with the goal to establish a standardized evaluation of… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1603.00831, arXiv:1504.01942

  41. arXiv:1905.12512  [pdf, other

    cs.CV cs.GR

    Smooth Shells: Multi-Scale Shape Registration with Functional Maps

    Authors: Marvin Eisenberger, Zorah Lähner, Daniel Cremers

    Abstract: We propose a novel 3D shape correspondence method based on the iterative alignment of so-called smooth shells. Smooth shells define a series of coarse-to-fine shape approximations designed to work well with multiscale algorithms. The main idea is to first align rough approximations of the geometry and then add more and more details to refine the correspondence. We fuse classical shape registration… ▽ More

    Submitted 2 December, 2019; v1 submitted 29 May, 2019; originally announced May 2019.

  42. arXiv:1905.04730  [pdf, other

    cs.LG cs.CV stat.ML

    Flat Metric Minimization with Applications in Generative Modeling

    Authors: Thomas Möllenhoff, Daniel Cremers

    Abstract: We take the novel perspective to view data not as a probability distribution but rather as a current. Primarily studied in the field of geometric measure theory, $k$-currents are continuous linear functionals acting on compactly supported smooth differential forms and can be understood as a generalized notion of oriented $k$-dimensional manifold. By moving from distributions (which are $0$-current… ▽ More

    Submitted 12 May, 2019; originally announced May 2019.

  43. arXiv:1905.03389  [pdf, other

    cs.NE cs.AI cs.CV cs.LG stat.ML

    Learning to Evolve

    Authors: Jan Schuchardt, Vladimir Golkov, Daniel Cremers

    Abstract: Evolution and learning are two of the fundamental mechanisms by which life adapts in order to survive and to transcend limitations. These biological phenomena inspired successful computational methods such as evolutionary algorithms and deep learning. Evolution relies on random mutations and on random genetic recombination. Here we show that learning to evolve, i.e. learning to mutate and recombin… ▽ More

    Submitted 8 May, 2019; originally announced May 2019.

    MSC Class: 62M45; 68T05; 68W25; 68T20; 90C40; 91A22; 92D15; 92D25 ACM Class: G.1.6; I.2.6; I.2.8; G.3; I.5.1

  44. arXiv:1905.00851  [pdf, other

    cs.CV eess.IV

    Lifting Vectorial Variational Problems: A Natural Formulation based on Geometric Measure Theory and Discrete Exterior Calculus

    Authors: Thomas Möllenhoff, Daniel Cremers

    Abstract: Numerous tasks in imaging and vision can be formulated as variational problems over vector-valued maps. We approach the relaxation and convexification of such vectorial variational problems via a lifting to the space of currents. To that end, we recall that functionals with polyconvex Lagrangians can be reparametrized as convex one-homogeneous functionals on the graph of the function. This leads t… ▽ More

    Submitted 2 May, 2019; originally announced May 2019.

    Comments: Oral presentation at CVPR 2019

  45. arXiv:1904.11932  [pdf, other

    cs.CV

    GN-Net: The Gauss-Newton Loss for Multi-Weather Relocalization

    Authors: Lukas von Stumberg, Patrick Wenzel, Qadeer Khan, Daniel Cremers

    Abstract: Direct SLAM methods have shown exceptional performance on odometry tasks. However, they are susceptible to dynamic lighting and weather changes while also suffering from a bad initialization on large baselines. To overcome this, we propose GN-Net: a network optimized with the novel Gauss-Newton loss for training weather invariant deep features, tailored for direct image alignment. Our network can… ▽ More

    Submitted 27 November, 2019; v1 submitted 26 April, 2019; originally announced April 2019.

  46. arXiv:1904.10097  [pdf, other

    cs.CV

    DirectShape: Direct Photometric Alignment of Shape Priors for Visual Vehicle Pose and Shape Estimation

    Authors: Rui Wang, Nan Yang, Joerg Stueckler, Daniel Cremers

    Abstract: Scene understanding from images is a challenging problem encountered in autonomous driving. On the object level, while 2D methods have gradually evolved from computing simple bounding boxes to delivering finer grained results like instance segmentations, the 3D family is still dominated by estimating 3D bounding boxes. In this paper, we propose a novel approach to jointly infer the 3D rigid-body p… ▽ More

    Submitted 9 March, 2020; v1 submitted 22 April, 2019; originally announced April 2019.

    Comments: Accepted by IEEE International Conference on Robotics and Automation (ICRA) 2020

  47. Visual-Inertial Map** with Non-Linear Factor Recovery

    Authors: Vladyslav Usenko, Nikolaus Demmel, David Schubert, Jörg Stückler, Daniel Cremers

    Abstract: Cameras and inertial measurement units are complementary sensors for ego-motion estimation and environment map**. Their combination makes visual-inertial odometry (VIO) systems more accurate and robust. For globally consistent map**, however, combining visual and inertial information is not straightforward. To estimate the motion and geometry with a set of images large baselines are required.… ▽ More

    Submitted 30 May, 2020; v1 submitted 13 April, 2019; originally announced April 2019.

  48. arXiv:1904.03942  [pdf, other

    cs.CV

    Variational Uncalibrated Photometric Stereo under General Lighting

    Authors: Bjoern Haefner, Zhenzhang Ye, Maolin Gao, Tao Wu, Yvain Quéau, Daniel Cremers

    Abstract: Photometric stereo (PS) techniques nowadays remain constrained to an ideal laboratory setup where modeling and calibration of lighting is amenable. To eliminate such restrictions, we propose an efficient principled variational approach to uncalibrated PS under general illumination. To this end, the Lambertian reflectance model is approximated through a spherical harmonic expansion, which preserves… ▽ More

    Submitted 27 August, 2019; v1 submitted 8 April, 2019; originally announced April 2019.

    Comments: Haefner and Ye contributed equally

    Journal ref: The IEEE International Conference on Computer Vision (ICCV), 2019

  49. arXiv:1904.03081  [pdf, other

    cs.LG cs.CV stat.ML

    Controlling Neural Networks via Energy Dissipation

    Authors: Michael Moeller, Thomas Möllenhoff, Daniel Cremers

    Abstract: The last decade has shown a tremendous success in solving various computer vision problems with the help of deep learning techniques. Lately, many works have demonstrated that learning-based approaches with suitable network architectures even exhibit superior performance for the solution of (ill-posed) image reconstruction problems such as deblurring, super-resolution, or medical image reconstruct… ▽ More

    Submitted 20 August, 2019; v1 submitted 5 April, 2019; originally announced April 2019.

    Comments: Published as a conference paper at ICCV 2019, Seoul

  50. arXiv:1903.11690  [pdf, other

    math.OC cs.AI cs.LG

    Optimization of Inf-Convolution Regularized Nonconvex Composite Problems

    Authors: Emanuel Laude, Tao Wu, Daniel Cremers

    Abstract: In this work, we consider nonconvex composite problems that involve inf-convolution with a Legendre function, which gives rise to an anisotropic generalization of the proximal map** and Moreau-envelope. In a convex setting such problems can be solved via alternating minimization of a splitting formulation, where the consensus constraint is penalized with a Legendre function. In contrast, for non… ▽ More

    Submitted 27 March, 2019; originally announced March 2019.

    Comments: Accepted as a Conference Paper to International Conference on Artificial Intelligence and Statistics (AISTATS) 2019, Naha