
Showing 1–7 of 7 results for author: Huang, T E

  1. arXiv:2309.04422  [pdf, other]

    cs.CV

    Video Task Decathlon: Unifying Image and Video Tasks in Autonomous Driving

    Authors: Thomas E. Huang, Yifan Liu, Luc Van Gool, Fisher Yu

    Abstract: Performing multiple heterogeneous visual tasks in dynamic scenes is a hallmark of human perception capability. Despite remarkable progress in image and video recognition via representation learning, current research still focuses on designing specialized networks for singular, homogeneous, or simple combination of tasks. We instead explore the construction of a unified model for major image and vi…

    Submitted 26 November, 2023; v1 submitted 8 September, 2023; originally announced September 2023.

    Comments: ICCV 2023, project page at https://www.vis.xyz/pub/vtd

  2. arXiv:2210.07239  [pdf, other]

    cs.CV

    Composite Learning for Robust and Effective Dense Predictions

    Authors: Menelaos Kanakis, Thomas E. Huang, David Bruggemann, Fisher Yu, Luc Van Gool

    Abstract: Multi-task learning promises better model generalization on a target task by jointly optimizing it with an auxiliary task. However, the current practice requires additional labeling efforts for the auxiliary task, while not guaranteeing better model performance. In this paper, we find that jointly training a dense prediction (target) task with a self-supervised (auxiliary) task can consistently im…

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: Winter Conference on Applications of Computer Vision (WACV), 2023

  3. arXiv:2210.06984  [pdf, other]

    cs.CV

    QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object Tracking

    Authors: Tobias Fischer, Thomas E. Huang, Jiangmiao Pang, Linlu Qiu, Haofeng Chen, Trevor Darrell, Fisher Yu

    Abstract: Similarity learning has been recognized as a crucial step for object tracking. However, existing multiple object tracking methods only use sparse ground truth matching as the training objective, while ignoring the majority of the informative regions in images. In this paper, we present Quasi-Dense Similarity Learning, which densely samples hundreds of object regions on a pair of images for contras…

    Submitted 27 September, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

  4. arXiv:2207.12978  [pdf, other]

    cs.CV

    Tracking Every Thing in the Wild

    Authors: Siyuan Li, Martin Danelljan, Henghui Ding, Thomas E. Huang, Fisher Yu

    Abstract: Current multi-category Multiple Object Tracking (MOT) metrics use class labels to group tracking results for per-class evaluation. Similarly, MOT methods typically only associate objects with the same class predictions. These two prevalent strategies in MOT implicitly assume that the classification performance is near-perfect. However, this is far from the case in recent large-scale MOT datasets,…

    Submitted 26 July, 2022; originally announced July 2022.

    Comments: ECCV 2022

  5. arXiv:2111.00770  [pdf, other]

    cs.CV

    Dense Prediction with Attentive Feature Aggregation

    Authors: Yung-Hsu Yang, Thomas E. Huang, Min Sun, Samuel Rota Bulò, Peter Kontschieder, Fisher Yu

    Abstract: Aggregating information from features across different layers is an essential operation for dense prediction models. Despite its limited expressiveness, feature concatenation dominates the choice of aggregation operations. In this paper, we introduce Attentive Feature Aggregation (AFA) to fuse different network layers with more expressive non-linear operations. AFA exploits both spatial and channe…

    Submitted 19 January, 2023; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: 20 pages, 14 figures, WACV 2023

  6. arXiv:2104.08381  [pdf, other]

    cs.CV

    Robust Object Detection via Instance-Level Temporal Cycle Confusion

    Authors: Xin Wang, Thomas E. Huang, Benlin Liu, Fisher Yu, Xiaolong Wang, Joseph E. Gonzalez, Trevor Darrell

    Abstract: Building reliable object detectors that are robust to domain shifts, such as various changes in context, viewpoint, and object appearances, is critical for real-world applications. In this work, we study the effectiveness of auxiliary self-supervised tasks to improve the out-of-distribution generalization of object detectors. Inspired by the principle of maximum entropy, we introduce a novel self-…

    Submitted 23 August, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: ICCV 2021

  7. arXiv:2003.06957  [pdf, other]

    cs.CV

    Frustratingly Simple Few-Shot Object Detection

    Authors: Xin Wang, Thomas E. Huang, Trevor Darrell, Joseph E. Gonzalez, Fisher Yu

    Abstract: Detecting rare objects from a few examples is an emerging problem. Prior works show meta-learning is a promising approach. But, fine-tuning techniques have drawn scant attention. We find that fine-tuning only the last layer of existing detectors on rare classes is crucial to the few-shot object detection task. Such a simple approach outperforms the meta-learning methods by roughly 2~20 points on c…

    Submitted 15 March, 2020; originally announced March 2020.

    Comments: 12 pages, 8 figures