Skip to main content

Showing 1–10 of 10 results for author: Meinhardt, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.11426  [pdf, other

    cs.CV

    SPAMming Labels: Efficient Annotations for the Trackers of Tomorrow

    Authors: Orcun Cetintas, Tim Meinhardt, Guillem Brasó, Laura Leal-Taixé

    Abstract: Increasing the annotation efficiency of trajectory annotations from videos has the potential to enable the next generation of data-hungry tracking algorithms to thrive on large-scale datasets. Despite the importance of this task, there are currently very few works exploring how to efficiently label tracking datasets comprehensively. In this work, we introduce SPAM, a tracking data engine that prov… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  2. arXiv:2403.13129  [pdf, other

    cs.CV cs.RO

    Better Call SAL: Towards Learning to Segment Anything in Lidar

    Authors: Aljoša Ošep, Tim Meinhardt, Francesco Ferroni, Neehar Peri, Deva Ramanan, Laura Leal-Taixé

    Abstract: We propose $\texttt{SAL}$ ($\texttt{S}$egment $\texttt{A}$nything in $\texttt{L}$idar) method consisting of a text-promptable zero-shot model for segmenting and classifying any object in Lidar, and a pseudo-labeling engine that facilitates model training without manual supervision. While the established paradigm for $\textit{Lidar Panoptic Segmentation}$ (LPS) relies on manual supervision for a ha… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  3. arXiv:2308.15266  [pdf, other

    cs.CV

    NOVIS: A Case for End-to-End Near-Online Video Instance Segmentation

    Authors: Tim Meinhardt, Matt Feiszli, Yuchen Fan, Laura Leal-Taixe, Rakesh Ranjan

    Abstract: Until recently, the Video Instance Segmentation (VIS) community operated under the common belief that offline methods are generally superior to a frame by frame online processing. However, the recent success of online methods questions this belief, in particular, for challenging and long video sequences. We understand this work as a rebuttal of those recent observations and an appeal to the commun… ▽ More

    Submitted 18 September, 2023; v1 submitted 29 August, 2023; originally announced August 2023.

  4. arXiv:2306.11710  [pdf, other

    cs.CV

    Data-Driven but Privacy-Conscious: Pedestrian Dataset De-identification via Full-Body Person Synthesis

    Authors: Maxim Maximov, Tim Meinhardt, Ismail Elezi, Zoe Papakipos, Caner Hazirbas, Cristian Canton Ferrer, Laura Leal-Taixé

    Abstract: The advent of data-driven technology solutions is accompanied by an increasing concern with data privacy. This is of particular importance for human-centered image recognition tasks, such as pedestrian detection, re-identification, and tracking. To highlight the importance of privacy issues and motivate future research, we motivate and introduce the Pedestrian Dataset De-Identification (PDI) task.… ▽ More

    Submitted 22 June, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

  5. arXiv:2207.11103  [pdf, other

    cs.CV cs.LG cs.RO

    DeVIS: Making Deformable Transformers Work for Video Instance Segmentation

    Authors: Adrià Caelles, Tim Meinhardt, Guillem Brasó, Laura Leal-Taixé

    Abstract: Video Instance Segmentation (VIS) jointly tackles multi-object detection, tracking, and segmentation in video sequences. In the past, VIS methods mirrored the fragmentation of these subtasks in their architectural design, hence missing out on a joint solution. Transformers recently allowed to cast the entire VIS task as a single set-prediction problem. Nevertheless, the quadratic complexity of exi… ▽ More

    Submitted 22 July, 2022; originally announced July 2022.

  6. arXiv:2101.02702  [pdf, other

    cs.CV

    TrackFormer: Multi-Object Tracking with Transformers

    Authors: Tim Meinhardt, Alexander Kirillov, Laura Leal-Taixe, Christoph Feichtenhofer

    Abstract: The challenging task of multi-object tracking (MOT) requires simultaneous reasoning about track initialization, identity, and spatio-temporal trajectories. We formulate this task as a frame-to-frame set prediction problem and introduce TrackFormer, an end-to-end trainable MOT approach based on an encoder-decoder Transformer architecture. Our model achieves data association between frames via atten… ▽ More

    Submitted 29 April, 2022; v1 submitted 7 January, 2021; originally announced January 2021.

  7. arXiv:2012.01866  [pdf, other

    cs.CV cs.LG cs.RO

    Make One-Shot Video Object Segmentation Efficient Again

    Authors: Tim Meinhardt, Laura Leal-Taixe

    Abstract: Video object segmentation (VOS) describes the task of segmenting a set of objects in each frame of a video. In the semi-supervised setting, the first mask of each object is provided at test time. Following the one-shot principle, fine-tuning VOS methods train a segmentation model separately on each given object mask. However, recently the VOS community has deemed such a test time optimization and… ▽ More

    Submitted 3 December, 2020; originally announced December 2020.

  8. Tracking without bells and whistles

    Authors: Philipp Bergmann, Tim Meinhardt, Laura Leal-Taixe

    Abstract: The problem of tracking multiple objects in a video sequence poses several challenging tasks. For tracking-by-detection, these include object re-identification, motion prediction and dealing with occlusions. We present a tracker (without bells and whistles) that accomplishes tracking without specifically targeting any of these tasks, in particular, we perform no training or optimization on trackin… ▽ More

    Submitted 17 August, 2019; v1 submitted 13 March, 2019; originally announced March 2019.

  9. arXiv:1803.08660  [pdf, other

    cs.CV cs.NE

    Lifting Layers: Analysis and Applications

    Authors: Peter Ochs, Tim Meinhardt, Laura Leal-Taixe, Michael Moeller

    Abstract: The great advances of learning-based approaches in image processing and computer vision are largely based on deeply nested networks that compose linear transfer functions with suitable non-linearities. Interestingly, the most frequently used non-linearities in imaging applications (variants of the rectified linear unit) are uncommon in low dimensional approximation problems. In this paper we propo… ▽ More

    Submitted 23 March, 2018; originally announced March 2018.

  10. Learning Proximal Operators: Using Denoising Networks for Regularizing Inverse Imaging Problems

    Authors: Tim Meinhardt, Michael Moeller, Caner Hazirbas, Daniel Cremers

    Abstract: While variational methods have been among the most powerful tools for solving linear inverse problems in imaging, deep (convolutional) neural networks have recently taken the lead in many challenging benchmarks. A remaining drawback of deep learning approaches is their requirement for an expensive retraining whenever the specific problem, the noise level, noise type, or desired measure of fidelity… ▽ More

    Submitted 30 August, 2017; v1 submitted 11 April, 2017; originally announced April 2017.