DualAD: Disentangling the Dynamic and Static World for End-to-End Driving

Doll, Simon; Hanselmann, Niklas; Schneider, Lukas; Schulz, Richard; Cordts, Marius; Enzweiler, Markus; Lensch, Hendrik P. A.

Computer Science > Computer Vision and Pattern Recognition

arXiv:2406.06264 (cs)

[Submitted on 10 Jun 2024]



Abstract: State-of-the-art approaches for autonomous driving integrate multiple sub-tasks of the overall driving task into a single pipeline that can be trained in an end-to-end fashion by passing latent representations between the different modules. In contrast to previous approaches that rely on a unified grid to represent the belief state of the scene, we propose dedicated representations to disentangle dynamic agents and static scene elements. This allows us to explicitly compensate for the effect of both ego and object motion between consecutive time steps and to flexibly propagate the belief state through time. Furthermore, dynamic objects can not only attend to the input camera images, but also directly benefit from the inferred static scene structure via a novel dynamic-static cross-attention. Extensive experiments on the challenging nuScenes benchmark demonstrate the benefits of the proposed dual-stream design, especially for modelling highly dynamic agents in the scene, and highlight the improved temporal consistency of our approach. Our method, titled DualAD, not only outperforms independently trained single-task networks, but also improves over previous state-of-the-art end-to-end models by a large margin on all tasks along the functional chain of driving.
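
The abstract's idea of compensating for both ego and object motion between consecutive time steps can be illustrated geometrically. The following is a minimal sketch under stated assumptions, not the authors' implementation: it works on explicit 2D positions rather than the latent representations the paper uses, it assumes a constant-velocity object model, and the function name `propagate_object` and all parameter names are hypothetical.

```python
import math


def propagate_object(pos, vel, dt, ego_translation, ego_yaw):
    """Move a 2D object position from the ego frame at t-1 to the ego frame at t.

    Hypothetical helper for illustration only.
    pos, vel:         object position (m) and velocity (m/s) in the old ego frame
    dt:               time between the two steps (s)
    ego_translation:  ego displacement between the steps, in the old ego frame (m)
    ego_yaw:          ego heading change between the steps (rad, counter-clockwise)
    """
    # Object motion: constant-velocity assumption, still in the old ego frame.
    x = pos[0] + vel[0] * dt
    y = pos[1] + vel[1] * dt

    # Ego motion: shift to the new ego origin, then rotate by -ego_yaw
    # so the coordinates are expressed in the new ego frame.
    dx = x - ego_translation[0]
    dy = y - ego_translation[1]
    c, s = math.cos(ego_yaw), math.sin(ego_yaw)
    return (c * dx + s * dy, -s * dx + c * dy)


# A static object 10 m ahead appears 8 m ahead after the ego drives 2 m forward.
static_obj = propagate_object((10.0, 0.0), (0.0, 0.0), 0.5, (2.0, 0.0), 0.0)

# An object moving at 4 m/s in the same direction keeps its relative distance.
moving_obj = propagate_object((10.0, 0.0), (4.0, 0.0), 0.5, (2.0, 0.0), 0.0)
```

A static scene element only needs the ego-motion step, whereas a dynamic agent needs both steps, which is one way to read the motivation for separate dynamic and static representations.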

Comments:	Accepted at CVPR 2024; Copyright 2024 IEEE; Project Website: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2406.06264 [cs.CV]
	(or arXiv:2406.06264v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.06264

Submission history

From: Simon Doll
[v1] Mon, 10 Jun 2024 13:46:07 UTC (10,958 KB)
