Skip to main content

Showing 1–12 of 12 results for author: Kretzschmar, H

.
  1. arXiv:2309.16889  [pdf, other

    cs.CV

    Superpixel Transformers for Efficient Semantic Segmentation

    Authors: Alex Zihao Zhu, Jieru Mei, Siyuan Qiao, Hang Yan, Yukun Zhu, Liang-Chieh Chen, Henrik Kretzschmar

    Abstract: Semantic segmentation, which aims to classify every pixel in an image, is a key task in machine perception, with many applications across robotics and autonomous driving. Due to the high dimensionality of this task, most existing approaches use local operations, such as convolutions, to generate per-pixel features. However, these methods are typically unable to effectively leverage global context… ▽ More

    Submitted 2 October, 2023; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: 8 pages, 5 figures, 4 tables. Presented at IROS 2023. Equal contribution by A. Zhu and J. Mei

  2. arXiv:2210.09267  [pdf, other

    cs.CV cs.LG cs.RO

    CramNet: Camera-Radar Fusion with Ray-Constrained Cross-Attention for Robust 3D Object Detection

    Authors: Jyh-**g Hwang, Henrik Kretzschmar, Joshua Manela, Sean Rafferty, Nicholas Armstrong-Crews, Tiffany Chen, Dragomir Anguelov

    Abstract: Robust 3D object detection is critical for safe autonomous driving. Camera and radar sensors are synergistic as they capture complementary information and work well under different environmental conditions. Fusing camera and radar data is challenging, however, as each of the sensors lacks information along a perpendicular axis, that is, depth is unknown to camera and elevation is unknown to radar.… ▽ More

    Submitted 17 October, 2022; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: ECCV 2022

  3. arXiv:2210.08113  [pdf, other

    cs.CV

    Instance Segmentation with Cross-Modal Consistency

    Authors: Alex Zihao Zhu, Vincent Casser, Reza Mahjourian, Henrik Kretzschmar, Sören Pirk

    Abstract: Segmenting object instances is a key task in machine perception, with safety-critical applications in robotics and autonomous driving. We introduce a novel approach to instance segmentation that jointly leverages measurements from multiple sensor modalities, such as cameras and LiDAR. Our method learns to predict embeddings for each pixel or point that give rise to a dense segmentation of the scen… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: 8 pages, 9 figures, 5 tables. Presented at IROS 2022

  4. arXiv:2206.07705  [pdf, other

    cs.CV

    LET-3D-AP: Longitudinal Error Tolerant 3D Average Precision for Camera-Only 3D Detection

    Authors: Wei-Chih Hung, Vincent Casser, Henrik Kretzschmar, Jyh-**g Hwang, Dragomir Anguelov

    Abstract: The 3D Average Precision (3D AP) relies on the intersection over union between predictions and ground truth objects. However, camera-only detectors have limited depth accuracy, which may cause otherwise reasonable predictions that suffer from such longitudinal localization errors to be treated as false positives. We therefore propose variants of the 3D AP metric to be more permissive with respect… ▽ More

    Submitted 3 May, 2024; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: Find the primary metrics for the 2022 Waymo Open Dataset 3D Camera-Only Detection Challenge at https://waymo.com/open/challenges/2022/3d-camera-only-detection/ . Find the code at https://github.com/waymo-research/waymo-open-dataset

  5. arXiv:2206.07704  [pdf, other

    cs.CV

    Waymo Open Dataset: Panoramic Video Panoptic Segmentation

    Authors: Jieru Mei, Alex Zihao Zhu, Xinchen Yan, Hang Yan, Siyuan Qiao, Yukun Zhu, Liang-Chieh Chen, Henrik Kretzschmar, Dragomir Anguelov

    Abstract: Panoptic image segmentation is the computer vision task of finding groups of pixels in an image and assigning semantic classes and object instance identifiers to them. Research in image segmentation has become increasingly popular due to its critical applications in robotics and autonomous driving. The research community thereby relies on publicly available benchmark dataset to advance the state-o… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: Our dataset can be found at https://waymo.com/open

  6. arXiv:2206.03666  [pdf, other

    cs.CV

    Depth Estimation Matters Most: Improving Per-Object Depth Estimation for Monocular 3D Detection and Tracking

    Authors: Longlong **g, Ruichi Yu, Henrik Kretzschmar, Kang Li, Charles R. Qi, Hang Zhao, Alper Ayvaci, Xu Chen, Dillon Cower, Yingwei Li, Yurong You, Han Deng, Congcong Li, Dragomir Anguelov

    Abstract: Monocular image-based 3D perception has become an active research area in recent years owing to its applications in autonomous driving. Approaches to monocular 3D perception including detection and tracking, however, often yield inferior performance when compared to LiDAR-based techniques. Through systematic analysis, we identified that per-object depth estimation accuracy is a major factor boundi… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

    Journal ref: ICRA2022

  7. arXiv:2202.05263  [pdf, other

    cs.CV cs.GR

    Block-NeRF: Scalable Large Scene Neural View Synthesis

    Authors: Matthew Tancik, Vincent Casser, Xinchen Yan, Sabeek Pradhan, Ben Mildenhall, Pratul P. Srinivasan, Jonathan T. Barron, Henrik Kretzschmar

    Abstract: We present Block-NeRF, a variant of Neural Radiance Fields that can represent large-scale environments. Specifically, we demonstrate that when scaling NeRF to render city-scale scenes spanning multiple blocks, it is vital to decompose the scene into individually trained NeRFs. This decomposition decouples rendering time from scene size, enables rendering to scale to arbitrarily large environments,… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: Project page: https://waymo.com/research/block-nerf/

  8. arXiv:2201.05938  [pdf, other

    cs.LG cs.CV

    GradTail: Learning Long-Tailed Data Using Gradient-based Sample Weighting

    Authors: Zhao Chen, Vincent Casser, Henrik Kretzschmar, Dragomir Anguelov

    Abstract: We propose GradTail, an algorithm that uses gradients to improve model performance on the fly in the face of long-tailed training data distributions. Unlike conventional long-tail classifiers which operate on converged - and possibly overfit - models, we demonstrate that an approach based on gradient dot product agreement can isolate long-tailed data early on during model training and improve perf… ▽ More

    Submitted 18 January, 2022; v1 submitted 15 January, 2022; originally announced January 2022.

    Comments: 15 pages (including Appendix), 8 figures

  9. arXiv:2010.06808  [pdf, other

    cs.LG cs.CV

    Just Pick a Sign: Optimizing Deep Multitask Models with Gradient Sign Dropout

    Authors: Zhao Chen, Jiquan Ngiam, Yan** Huang, Thang Luong, Henrik Kretzschmar, Yuning Chai, Dragomir Anguelov

    Abstract: The vast majority of deep models use multiple gradient signals, typically corresponding to a sum of multiple loss terms, to update a shared set of trainable weights. However, these multiple updates can impede optimal training by pulling the model in conflicting directions. We present Gradient Sign Dropout (GradDrop), a probabilistic masking procedure which samples gradients at an activation layer… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

    Comments: Conference on Neural Information Processing Systems (NeurIPS) 2020

  10. arXiv:2008.07725  [pdf, other

    cs.CV

    SoDA: Multi-Object Tracking with Soft Data Association

    Authors: Wei-Chih Hung, Henrik Kretzschmar, Tsung-Yi Lin, Yuning Chai, Ruichi Yu, Ming-Hsuan Yang, Dragomir Anguelov

    Abstract: Robust multi-object tracking (MOT) is a prerequisite fora safe deployment of self-driving cars. Tracking objects, however, remains a highly challenging problem, especially in cluttered autonomous driving scenes in which objects tend to interact with each other in complex ways and frequently get occluded. We propose a novel approach to MOT that uses attention to compute track embeddings that encode… ▽ More

    Submitted 19 August, 2020; v1 submitted 17 August, 2020; originally announced August 2020.

  11. arXiv:2005.03844  [pdf, other

    cs.CV

    SurfelGAN: Synthesizing Realistic Sensor Data for Autonomous Driving

    Authors: Zhenpei Yang, Yuning Chai, Dragomir Anguelov, Yin Zhou, Pei Sun, Dumitru Erhan, Sean Rafferty, Henrik Kretzschmar

    Abstract: Autonomous driving system development is critically dependent on the ability to replay complex and diverse traffic scenarios in simulation. In such scenarios, the ability to accurately simulate the vehicle sensors such as cameras, lidar or radar is essential. However, current sensor simulators leverage gaming engines such as Unreal or Unity, requiring manual creation of environments, objects and m… ▽ More

    Submitted 25 June, 2020; v1 submitted 8 May, 2020; originally announced May 2020.

    Journal ref: CVPR 2020

  12. arXiv:1912.04838  [pdf, other

    cs.CV cs.LG stat.ML

    Scalability in Perception for Autonomous Driving: Waymo Open Dataset

    Authors: Pei Sun, Henrik Kretzschmar, Xerxes Dotiwalla, Aurelien Chouard, Vijaysai Patnaik, Paul Tsui, James Guo, Yin Zhou, Yuning Chai, Benjamin Caine, Vijay Vasudevan, Wei Han, Jiquan Ngiam, Hang Zhao, Aleksei Timofeev, Scott Ettinger, Maxim Krivokon, Amy Gao, Aditya Joshi, Sheng Zhao, Shuyang Cheng, Yu Zhang, Jonathon Shlens, Zhifeng Chen, Dragomir Anguelov

    Abstract: The research community has increasing interest in autonomous driving research, despite the resource intensity of obtaining representative real world data. Existing self-driving datasets are limited in the scale and variation of the environments they capture, even though generalization within and between operating regions is crucial to the overall viability of the technology. In an effort to help a… ▽ More

    Submitted 12 May, 2020; v1 submitted 10 December, 2019; originally announced December 2019.

    Comments: CVPR 2020