Showing 1–9 of 9 results for author: Luo, K Z

Searching in archive cs.
  1. arXiv:2405.16034  [pdf, other]

    cs.CV

    DiffuBox: Refining 3D Object Detection with Point Diffusion

    Authors: Xiangyu Chen, Zhenzhen Liu, Katie Z Luo, Siddhartha Datta, Adhitya Polavaram, Yan Wang, Yurong You, Boyi Li, Marco Pavone, Wei-Lun Chao, Mark Campbell, Bharath Hariharan, Kilian Q. Weinberger

    Abstract: Ensuring robust 3D object detection and localization is crucial for many applications in robotics and autonomous driving. Recent models, however, face difficulties in maintaining high performance when applied to domains with differing sensor setups or geographic locations, often resulting in poor localization accuracy due to domain shift. To overcome this challenge, we introduce a novel diffusion-…

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2404.05139  [pdf, other]

    cs.CV cs.RO

    Better Monocular 3D Detectors with LiDAR from the Past

    Authors: Yurong You, Cheng Perng Phoo, Carlos Andres Diaz-Ruiz, Katie Z Luo, Wei-Lun Chao, Mark Campbell, Bharath Hariharan, Kilian Q Weinberger

    Abstract: Accurate 3D object detection is crucial to autonomous driving. Though LiDAR-based detectors have achieved impressive performance, the high cost of LiDAR sensors precludes their widespread adoption in affordable vehicles. Camera-based detectors are cheaper alternatives but often suffer inferior performance compared to their LiDAR-based counterparts due to inherent depth ambiguities in images. In th…

    Submitted 9 April, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

    Comments: Accepted by ICRA 2024. The code can be found at https://github.com/YurongYou/AsyncDepth

  3. arXiv:2401.02957  [pdf, other]

    cs.CV

    Denoising Vision Transformers

    Authors: Jiawei Yang, Katie Z Luo, Jiefeng Li, Kilian Q Weinberger, Yonglong Tian, Yue Wang

    Abstract: We delve into a nuanced but significant challenge inherent to Vision Transformers (ViTs): feature maps of these models exhibit grid-like artifacts, which detrimentally hurt the performance of ViTs in downstream tasks. Our investigations trace this fundamental issue down to the positional embeddings at the input stage. To address this, we propose a novel noise model, which is universally applicable…

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: Project website: https://jiawei-yang.github.io/DenoisingViT/

  4. arXiv:2311.04079  [pdf, other]

    cs.CV

    Augmenting Lane Perception and Topology Understanding with Standard Definition Navigation Maps

    Authors: Katie Z Luo, Xinshuo Weng, Yan Wang, Shuang Wu, Jie Li, Kilian Q Weinberger, Yue Wang, Marco Pavone

    Abstract: Autonomous driving has traditionally relied heavily on costly and labor-intensive High Definition (HD) maps, hindering scalability. In contrast, Standard Definition (SD) maps are more affordable and have worldwide coverage, offering a scalable alternative. In this work, we systematically explore the effect of SD maps for real-time lane-topology understanding. We propose a novel framework to integr…

    Submitted 7 November, 2023; originally announced November 2023.

  5. arXiv:2310.19080  [pdf, other]

    cs.CV

    Reward Finetuning for Faster and More Accurate Unsupervised Object Discovery

    Authors: Katie Z Luo, Zhenzhen Liu, Xiangyu Chen, Yurong You, Sagie Benaim, Cheng Perng Phoo, Mark Campbell, Wen Sun, Bharath Hariharan, Kilian Q. Weinberger

    Abstract: Recent advances in machine learning have shown that Reinforcement Learning from Human Feedback (RLHF) can improve machine learning models and align them with human preferences. Although very successful for Large Language Models (LLMs), these advancements have not had a comparable impact in research for autonomous vehicles -- where alignment with human expectations can be imperative. In this paper,…

    Submitted 5 November, 2023; v1 submitted 29 October, 2023; originally announced October 2023.

  6. arXiv:2310.14592  [pdf, other]

    cs.CV cs.LG

    Pre-Training LiDAR-Based 3D Object Detectors Through Colorization

    Authors: Tai-Yu Pan, Chenyang Ma, Tianle Chen, Cheng Perng Phoo, Katie Z Luo, Yurong You, Mark Campbell, Kilian Q. Weinberger, Bharath Hariharan, Wei-Lun Chao

    Abstract: Accurate 3D object detection and understanding for self-driving cars heavily relies on LiDAR point clouds, necessitating large amounts of labeled data to train. In this work, we introduce an innovative pre-training approach, Grounded Point Colorization (GPC), to bridge the gap between data and labels by teaching the model to colorize LiDAR point clouds, equipping it with valuable semantic cues. To…

    Submitted 25 February, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024

  7. arXiv:2303.15286  [pdf, other]

    cs.CV cs.LG

    Unsupervised Adaptation from Repeated Traversals for Autonomous Driving

    Authors: Yurong You, Cheng Perng Phoo, Katie Z Luo, Travis Zhang, Wei-Lun Chao, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

    Abstract: For a self-driving car to operate reliably, its perceptual system must generalize to the end-user's environment -- ideally without additional annotation efforts. One potential solution is to leverage unlabeled data (e.g., unlabeled LiDAR point clouds) collected from the end-users' environments (i.e. target domain) to adapt the system to the difference between training and testing environments. Whi…

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: Accepted by NeurIPS 2022. Code is available at https://github.com/YurongYou/Rote-DA

  8. arXiv:2203.15882  [pdf, other]

    cs.CV

    Learning to Detect Mobile Objects from LiDAR Scans Without Labels

    Authors: Yurong You, Katie Z Luo, Cheng Perng Phoo, Wei-Lun Chao, Wen Sun, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

    Abstract: Current 3D object detectors for autonomous driving are almost entirely trained on human-annotated data. Although of high quality, the generation of such data is laborious and costly, restricting them to a few specific locations and object types. This paper proposes an alternative approach entirely based on unlabeled data, which can be collected cheaply and in abundance almost everywhere on earth.…

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted by CVPR 2022. Code is available at https://github.com/YurongYou/MODEST

  9. arXiv:2203.11405  [pdf, other]

    cs.CV

    Hindsight is 20/20: Leveraging Past Traversals to Aid 3D Perception

    Authors: Yurong You, Katie Z Luo, Xiangyu Chen, Junan Chen, Wei-Lun Chao, Wen Sun, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger

    Abstract: Self-driving cars must detect vehicles, pedestrians, and other traffic participants accurately to operate safely. Small, far-away, or highly occluded objects are particularly challenging because there is limited information in the LiDAR point clouds for detecting them. To address this challenge, we leverage valuable information from the past: in particular, data collected in past traversals of the…

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: Accepted by ICLR 2022. Code is available at https://github.com/YurongYou/Hindsight