Skip to main content

Showing 1–8 of 8 results for author: Enzweiler, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.06264  [pdf, other

    cs.CV

    DualAD: Disentangling the Dynamic and Static World for End-to-End Driving

    Authors: Simon Doll, Niklas Hanselmann, Lukas Schneider, Richard Schulz, Marius Cordts, Markus Enzweiler, Hendrik P. A. Lensch

    Abstract: State-of-the-art approaches for autonomous driving integrate multiple sub-tasks of the overall driving task into a single pipeline that can be trained in an end-to-end fashion by passing latent representations between the different modules. In contrast to previous approaches that rely on a unified grid to represent the belief state of the scene, we propose dedicated representations to disentangle… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted at CVPR 2024; Copyright 2024 IEEE; Project Website: https://simondoll.github.io/publications/dualad

  2. arXiv:2306.17602  [pdf, other

    cs.CV cs.AI cs.RO

    S.T.A.R.-Track: Latent Motion Models for End-to-End 3D Object Tracking with Adaptive Spatio-Temporal Appearance Representations

    Authors: Simon Doll, Niklas Hanselmann, Lukas Schneider, Richard Schulz, Markus Enzweiler, Hendrik P. A. Lensch

    Abstract: Following the tracking-by-attention paradigm, this paper introduces an object-centric, transformer-based framework for tracking in 3D. Traditional model-based tracking approaches incorporate the geometric effect of object- and ego motion between frames with a geometric motion model. Inspired by this, we propose S.T.A.R.-Track, which uses a novel latent motion model (LMM) to additionally adjust obj… ▽ More

    Submitted 22 December, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: \c{opyright} 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

    Journal ref: IEEE Robotics and Automation Letters, Vol. 9, No. 2 (2024), PP 1326-1333

  3. arXiv:2011.09141  [pdf, other

    cs.CV cs.LG

    Semantic Scene Completion using Local Deep Implicit Functions on LiDAR Data

    Authors: Christoph B. Rist, David Emmerichs, Markus Enzweiler, Dariu M. Gavrila

    Abstract: Semantic scene completion is the task of jointly estimating 3D geometry and semantics of objects and surfaces within a given extent. This is a particularly challenging task on real-world data that is sparse and occluded. We propose a scene segmentation network based on local Deep Implicit Functions as a novel learning-based method for scene completion. Unlike previous work on scene completion, our… ▽ More

    Submitted 12 April, 2021; v1 submitted 18 November, 2020; originally announced November 2020.

    ACM Class: I.4.8

  4. arXiv:1907.00787  [pdf, other

    eess.IV cs.LG stat.ML

    CNN-based synthesis of realistic high-resolution LiDAR data

    Authors: Larissa T. Triess, David Peter, Christoph B. Rist, Markus Enzweiler, J. Marius Zöllner

    Abstract: This paper presents a novel CNN-based approach for synthesizing high-resolution LiDAR point cloud data. Our approach generates semantically and perceptually realistic results with guidance from specialized loss-functions. First, we utilize a modified per-point loss that addresses missing LiDAR point measurements. Second, we align the quality of our generated output with real-world sensor data by a… ▽ More

    Submitted 24 September, 2021; v1 submitted 28 June, 2019; originally announced July 2019.

    Comments: Project Page: http://ltriess.github.io/pc-upsampling

    Journal ref: IEEE Intelligent Vehicles Symposium (IV), 2019, pp. 1512-1519

  5. arXiv:1809.08993  [pdf, other

    cs.CV

    Improved Semantic Stixels via Multimodal Sensor Fusion

    Authors: Florian Piewak, Peter **gera, Markus Enzweiler, David Pfeiffer, Marius Zöllner

    Abstract: This paper presents a compact and accurate representation of 3D scenes that are observed by a LiDAR sensor and a monocular camera. The proposed method is based on the well-established Stixel model originally developed for stereo vision applications. We extend this Stixel concept to incorporate data from multiple sensor modalities. The resulting mid-level fusion scheme takes full advantage of the g… ▽ More

    Submitted 27 September, 2018; v1 submitted 24 September, 2018; originally announced September 2018.

  6. arXiv:1804.09915  [pdf, other

    cs.CV

    Boosting LiDAR-based Semantic Labeling by Cross-Modal Training Data Generation

    Authors: Florian Piewak, Peter **gera, Manuel Schäfer, David Peter, Beate Schwarz, Nick Schneider, David Pfeiffer, Markus Enzweiler, Marius Zöllner

    Abstract: Mobile robots and autonomous vehicles rely on multi-modal sensor setups to perceive and understand their surroundings. Aside from cameras, LiDAR sensors represent a central component of state-of-the-art perception systems. In addition to accurate spatial perception, a comprehensive semantic understanding of the environment is essential for efficient and safe operation. In this paper we present a n… ▽ More

    Submitted 26 April, 2018; originally announced April 2018.

  7. The Stixel world: A medium-level representation of traffic scenes

    Authors: Marius Cordts, Timo Rehfeld, Lukas Schneider, David Pfeiffer, Markus Enzweiler, Stefan Roth, Marc Pollefeys, Uwe Franke

    Abstract: Recent progress in advanced driver assistance systems and the race towards autonomous vehicles is mainly driven by two factors: (1) increasingly sophisticated algorithms that interpret the environment around the vehicle and react accordingly, and (2) the continuous improvements of sensor technology itself. In terms of cameras, these improvements typically include higher spatial resolution, which a… ▽ More

    Submitted 2 April, 2017; originally announced April 2017.

    Comments: Accepted for publication in Image and Vision Computing

  8. arXiv:1604.01685  [pdf, other

    cs.CV

    The Cityscapes Dataset for Semantic Urban Scene Understanding

    Authors: Marius Cordts, Mohamed Omran, Sebastian Ramos, Timo Rehfeld, Markus Enzweiler, Rodrigo Benenson, Uwe Franke, Stefan Roth, Bernt Schiele

    Abstract: Visual understanding of complex urban street scenes is an enabling factor for a wide range of applications. Object detection has benefited enormously from large-scale datasets, especially in the context of deep learning. For semantic urban scene understanding, however, no current dataset adequately captures the complexity of real-world urban scenes. To address this, we introduce Cityscapes, a be… ▽ More

    Submitted 7 April, 2016; v1 submitted 6 April, 2016; originally announced April 2016.

    Comments: Includes supplemental material