Showing 51–92 of 92 results for author: Leal-Taixe, L

  1. arXiv:2010.15261  [pdf, other]

    cs.CV

    Deep Shells: Unsupervised Shape Correspondence with Optimal Transport

    Authors: Marvin Eisenberger, Aysim Toker, Laura Leal-Taixé, Daniel Cremers

    Abstract: We propose a novel unsupervised learning approach to 3D shape correspondence that builds a multiscale matching pipeline into a deep neural network. This approach is based on smooth shells, the current state-of-the-art axiomatic correspondence method, which requires an a priori stochastic search over the space of initial poses. Our goal is to replace this costly preprocessing step by directly learn…

    Submitted 28 October, 2020; originally announced October 2020.

  2. arXiv:2010.14300  [pdf, other]

    cs.CV cs.AI

    Ice Monitoring in Swiss Lakes from Optical Satellites and Webcams using Machine Learning

    Authors: Manu Tom, Rajanie Prabha, Tianyu Wu, Emmanuel Baltsavias, Laura Leal-Taixe, Konrad Schindler

    Abstract: Continuous observation of climate indicators, such as trends in lake freezing, is important to understand the dynamics of the local and global climate system. Consequently, lake ice has been included among the Essential Climate Variables (ECVs) of the Global Climate Observing System (GCOS), and there is a need to set up operational monitoring capabilities. Multi-temporal satellite images and publi…

    Submitted 27 October, 2020; originally announced October 2020.

    Comments: Accepted for publication in MDPI Remote Sensing Journal

  3. arXiv:2010.07548  [pdf, other]

    cs.CV

    MOTChallenge: A Benchmark for Single-Camera Multiple Target Tracking

    Authors: Patrick Dendorfer, Aljoša Ošep, Anton Milan, Konrad Schindler, Daniel Cremers, Ian Reid, Stefan Roth, Laura Leal-Taixé

    Abstract: Standardized benchmarks have been crucial in pushing the performance of computer vision algorithms, especially since the advent of deep learning. Although leaderboards should not be over-claimed, they often provide the most objective measure of performance and are therefore important guides for research. We present MOTChallenge, a benchmark for single-camera Multiple Object Tracking (MOT) launched…

    Submitted 8 December, 2020; v1 submitted 15 October, 2020; originally announced October 2020.

    Comments: Accepted at IJCV

  4. arXiv:2010.01114  [pdf, other]

    cs.CV

    Goal-GAN: Multimodal Trajectory Prediction Based on Goal Position Estimation

    Authors: Patrick Dendorfer, Aljoša Ošep, Laura Leal-Taixé

    Abstract: In this paper, we present Goal-GAN, an interpretable and end-to-end trainable model for human trajectory prediction. Inspired by human navigation, we model the task of trajectory prediction as an intuitive two-stage process: (i) goal estimation, which predicts the most likely target positions of the agent, followed by a (ii) routing module which estimates a set of plausible trajectories that route…

    Submitted 2 October, 2020; originally announced October 2020.

    Comments: Oral presentation at ACCV 2020

  5. HOLISMOKES -- IV. Efficient mass modeling of strong lenses through deep learning

    Authors: S. Schuldt, S. H. Suyu, T. Meinhardt, L. Leal-Taixé, R. Cañameras, S. Taubenberger, A. Halkola

    Abstract: Modelling the mass distributions of strong gravitational lenses is often necessary to use them as astrophysical and cosmological probes. With the high number of lens systems ($>10^5$) expected from upcoming surveys, it is timely to explore efficient modeling approaches beyond traditional MCMC techniques that are time consuming. We train a CNN on images of galaxy-scale lenses to predict the paramet…

    Submitted 18 December, 2020; v1 submitted 1 October, 2020; originally announced October 2020.

    Comments: 17 pages, 14 Figures

  6. HOTA: A Higher Order Metric for Evaluating Multi-Object Tracking

    Authors: Jonathon Luiten, Aljosa Osep, Patrick Dendorfer, Philip Torr, Andreas Geiger, Laura Leal-Taixe, Bastian Leibe

    Abstract: Multi-Object Tracking (MOT) has been notoriously difficult to evaluate. Previous metrics overemphasize the importance of either detection or association. To address this, we present a novel MOT evaluation metric, HOTA (Higher Order Tracking Accuracy), which explicitly balances the effect of performing accurate detection, association and localization into a single unified metric for comparing track…

    Submitted 29 September, 2020; v1 submitted 16 September, 2020; originally announced September 2020.

    Comments: Pre-print. Accepted for Publication in the International Journal of Computer Vision, 19 August 2020. Code is available at https://github.com/JonathonLuiten/HOTA-metrics

    Journal ref: International Journal of Computer Vision (2020)

  7. arXiv:2008.11516  [pdf, other]

    cs.CV

    Making a Case for 3D Convolutions for Object Segmentation in Videos

    Authors: Sabarinath Mahadevan, Ali Athar, Aljoša Ošep, Sebastian Hennen, Laura Leal-Taixé, Bastian Leibe

    Abstract: The task of object segmentation in videos is usually accomplished by processing appearance and motion information separately using standard 2D convolutional networks, followed by a learned fusion of the two sources of information. On the other hand, 3D convolutional networks have been successfully applied for video classification tasks, but have not been leveraged as effectively to problems involv…

    Submitted 1 September, 2023; v1 submitted 26 August, 2020; originally announced August 2020.

    Comments: BMVC '20

  8. arXiv:2005.09623  [pdf, other]

    cs.CV

    Focus on defocus: bridging the synthetic to real domain gap for depth estimation

    Authors: Maxim Maximov, Kevin Galim, Laura Leal-Taixé

    Abstract: Data-driven depth estimation methods struggle with the generalization outside their training scenes due to the immense variability of the real-world scenes. This problem can be partially addressed by utilising synthetically generated images, but closing the synthetic-real domain gap is far from trivial. In this paper, we tackle this issue by using domain invariant defocus blur as direct supervisio…

    Submitted 19 May, 2020; originally announced May 2020.

    Comments: CVPR 2020

  9. CIAGAN: Conditional Identity Anonymization Generative Adversarial Networks

    Authors: Maxim Maximov, Ismail Elezi, Laura Leal-Taixé

    Abstract: The unprecedented increase in the usage of computer vision technology in society goes hand in hand with an increased concern in data privacy. In many real-world scenarios like people tracking or action recognition, it is important to be able to process the data while taking careful consideration in protecting people's identity. We propose and develop CIAGAN, a model for image and video anonymizati…

    Submitted 30 November, 2020; v1 submitted 19 May, 2020; originally announced May 2020.

    Comments: CVPR 2020

  10. arXiv:2005.03770  [pdf, other]

    cs.LG cs.CV stat.ML

    Planning from Images with Deep Latent Gaussian Process Dynamics

    Authors: Nathanael Bosch, Jan Achterhold, Laura Leal-Taixé, Jörg Stückler

    Abstract: Planning is a powerful approach to control problems with known environment dynamics. In unknown environments the agent needs to learn a model of the system dynamics to make planning applicable. This is particularly challenging when the underlying states are only indirectly observable through images. We propose to learn a deep latent Gaussian process dynamics (DLGPD) model that learns low-dimension…

    Submitted 7 May, 2020; originally announced May 2020.

    Comments: Accepted for publication at the 2nd Annual Conference on Learning for Dynamics and Control (L4DC) 2020, with supplementary material. First two authors contributed equally

  11. arXiv:2004.13048  [pdf, other]

    astro-ph.GA astro-ph.CO

    HOLISMOKES -- II. Identifying galaxy-scale strong gravitational lenses in Pan-STARRS using convolutional neural networks

    Authors: R. Canameras, S. Schuldt, S. H. Suyu, S. Taubenberger, T. Meinhardt, L. Leal-Taixe, C. Lemon, K. Rojas, E. Savary

    Abstract: We present a systematic search for wide-separation (Einstein radius >1.5"), galaxy-scale strong lenses in the 30 000 sq.deg of the Pan-STARRS 3pi survey on the Northern sky. With long time delays of a few days to weeks, such systems are particularly well suited for catching strongly lensed supernovae with spatially-resolved multiple images and open new perspectives on early-phase supernova spectro…

    Submitted 7 April, 2021; v1 submitted 27 April, 2020; originally announced April 2020.

    Comments: 18 pages and 11 figures (plus appendix), version published in A&A

    Journal ref: A&A 644, A163 (2020)

  12. arXiv:2003.09003  [pdf, other]

    cs.CV

    MOT20: A benchmark for multi object tracking in crowded scenes

    Authors: Patrick Dendorfer, Hamid Rezatofighi, Anton Milan, Javen Shi, Daniel Cremers, Ian Reid, Stefan Roth, Konrad Schindler, Laura Leal-Taixé

    Abstract: Standardized benchmarks are crucial for the majority of computer vision applications. Although leaderboards and ranking tables should not be over-claimed, benchmarks often provide the most objective measure of performance and are therefore important guides for research. The benchmark for Multiple Object Tracking, MOTChallenge, was launched with the goal to establish a standardized evaluation of mu…

    Submitted 19 March, 2020; originally announced March 2020.

    Comments: The sequences of the new MOT20 benchmark were previously presented in the CVPR 2019 tracking challenge (arXiv:1906.04567). The differences between the two challenges are:
      - New and corrected annotations
      - New sequences, as we had to crop and transform some old sequences to achieve higher quality in the annotations
      - New baseline evaluations and different sets of public detections

  13. STEm-Seg: Spatio-temporal Embeddings for Instance Segmentation in Videos

    Authors: Ali Athar, Sabarinath Mahadevan, Aljoša Ošep, Laura Leal-Taixé, Bastian Leibe

    Abstract: Existing methods for instance segmentation in videos typically involve multi-stage pipelines that follow the tracking-by-detection paradigm and model a video clip as a sequence of images. Multiple networks are used to detect objects in individual frames, and then associate these detections over time. Hence, these methods are often non-end-to-end trainable and highly tailored to specific tasks. In…

    Submitted 1 September, 2023; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: ECCV 2020; 28 pages, 6 figures

    MSC Class: 68T45; 68T10; 62H30

    ACM Class: I.2.10; I.4.6; I.4.8; I.5.3

  14. arXiv:2002.07875  [pdf, other]

    cs.CV eess.IV

    Lake Ice Monitoring with Webcams and Crowd-Sourced Images

    Authors: Rajanie Prabha, Manu Tom, Mathias Rothermel, Emmanuel Baltsavias, Laura Leal-Taixe, Konrad Schindler

    Abstract: Lake ice is a strong climate indicator and has been recognised as part of the Essential Climate Variables (ECV) by the Global Climate Observing System (GCOS). The dynamics of freezing and thawing, and possible shifts of freezing patterns over time, can help in understanding the local and global climate systems. One way to acquire the spatio-temporal information about lake ice formation, independen…

    Submitted 7 May, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: Accepted for ISPRS Congress 2020, Nice, France

  15. arXiv:2001.11845  [pdf, other]

    cs.CV cs.LG

    Learn to Predict Sets Using Feed-Forward Neural Networks

    Authors: Hamid Rezatofighi, Tianyu Zhu, Roman Kaskman, Farbod T. Motlagh, Qinfeng Shi, Anton Milan, Daniel Cremers, Laura Leal-Taixé, Ian Reid

    Abstract: This paper addresses the task of set prediction using deep feed-forward neural networks. A set is a collection of elements which is invariant under permutation and the size of a set is not fixed in advance. Many real-world problems, such as image tagging and object detection, have outputs that are naturally expressed as sets of entities. This creates a challenge for traditional deep neural network…

    Submitted 25 October, 2021; v1 submitted 29 January, 2020; originally announced January 2020.

    Comments: Accepted in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2022. arXiv admin note: substantial text overlap with arXiv:1805.00613

  16. arXiv:1912.07515  [pdf, other]

    cs.CV

    Learning a Neural Solver for Multiple Object Tracking

    Authors: Guillem Brasó, Laura Leal-Taixé

    Abstract: Graphs offer a natural way to formulate Multiple Object Tracking (MOT) within the tracking-by-detection paradigm. However, they also introduce a major challenge for learning methods, as defining a model that can operate on such a structured domain is not trivial. As a consequence, most learning-based work has been devoted to learning better features for MOT, and then using these with well-e…

    Submitted 18 April, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: Accepted to CVPR 2020 (oral)

  17. arXiv:1912.05227  [pdf, other]

    cs.CV

    HistoNet: Predicting size histograms of object instances

    Authors: Kishan Sharma, Moritz Gold, Christian Zurbruegg, Laura Leal-Taixé, Jan Dirk Wegner

    Abstract: We propose to predict histograms of object sizes in crowded scenes directly without any explicit object instance segmentation. What makes this task challenging is the high density of objects (of the same category), which makes instance identification hard. Instead of explicitly segmenting object instances, we show that directly learning histograms of object sizes improves accuracy while using dras…

    Submitted 4 June, 2020; v1 submitted 11 December, 2019; originally announced December 2019.

  18. arXiv:1912.00385  [pdf, other]

    cs.CV cs.LG stat.ML

    The Group Loss for Deep Metric Learning

    Authors: Ismail Elezi, Sebastiano Vascon, Alessandro Torcinovich, Marcello Pelillo, Laura Leal-Taixe

    Abstract: Deep metric learning has yielded impressive results in tasks such as clustering and image retrieval by leveraging neural networks to obtain highly discriminative feature embeddings, which can be used to group samples into different classes. Much research has been devoted to the design of smart loss functions or data mining strategies for training such networks. Most methods consider only pairs or…

    Submitted 20 July, 2020; v1 submitted 1 December, 2019; originally announced December 2019.

    Comments: Accepted to European Conference on Computer Vision (ECCV) 2020, includes non-archival supplementary material

  19. arXiv:1908.01293  [pdf, other]

    cs.CV

    To Learn or Not to Learn: Visual Localization from Essential Matrices

    Authors: Qunjie Zhou, Torsten Sattler, Marc Pollefeys, Laura Leal-Taixe

    Abstract: Visual localization is the problem of estimating a camera within a scene and a key component in computer vision applications such as self-driving cars and Mixed Reality. State-of-the-art approaches for accurate visual localization use scene-specific representations, resulting in the overhead of constructing these models when applying the techniques to new scenes. Recently, deep learning-based appr…

    Submitted 9 March, 2020; v1 submitted 4 August, 2019; originally announced August 2019.

    Comments: Accepted to ICRA 2020

  20. arXiv:1907.11025  [pdf, other]

    cs.LG cs.CV cs.RO stat.ML

    Towards Generalizing Sensorimotor Control Across Weather Conditions

    Authors: Qadeer Khan, Patrick Wenzel, Daniel Cremers, Laura Leal-Taixé

    Abstract: The ability of deep learning models to generalize well across different scenarios depends primarily on the quality and quantity of annotated data. Labeling large amounts of data for all possible scenarios that a model may encounter would not be feasible, if even possible. We propose a framework to deal with limited labeled training data and demonstrate it on the application of vision-based vehicle…

    Submitted 25 July, 2019; originally announced July 2019.

    Comments: Accepted for publication in 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  21. arXiv:1906.06618  [pdf, other]

    cs.CV

    How To Train Your Deep Multi-Object Tracker

    Authors: Yihong Xu, Aljosa Osep, Yutong Ban, Radu Horaud, Laura Leal-Taixe, Xavier Alameda-Pineda

    Abstract: The recent trend in vision-based multi-object tracking (MOT) is heading towards leveraging the representational power of deep learning to jointly learn to detect and track objects. However, existing methods train only certain sub-modules using loss functions that often do not correlate with established tracking evaluation measures such as Multi-Object Tracking Accuracy (MOTA) and Precision (MOTP).…

    Submitted 23 April, 2020; v1 submitted 15 June, 2019; originally announced June 2019.

    Comments: 14 pages, 9 figures, 6 tables

  22. arXiv:1906.04567  [pdf, other]

    cs.CV cs.LG

    CVPR19 Tracking and Detection Challenge: How crowded can it get?

    Authors: Patrick Dendorfer, Hamid Rezatofighi, Anton Milan, Javen Shi, Daniel Cremers, Ian Reid, Stefan Roth, Konrad Schindler, Laura Leal-Taixe

    Abstract: Standardized benchmarks are crucial for the majority of computer vision applications. Although leaderboards and ranking tables should not be over-claimed, benchmarks often provide the most objective measure of performance and are therefore important guides for research. The benchmark for Multiple Object Tracking, MOTChallenge, was launched with the goal to establish a standardized evaluation of…

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1603.00831, arXiv:1504.01942

  23. arXiv:1903.07504  [pdf, other]

    cs.CV

    Understanding the Limitations of CNN-based Absolute Camera Pose Regression

    Authors: Torsten Sattler, Qunjie Zhou, Marc Pollefeys, Laura Leal-Taixe

    Abstract: Visual localization is the task of accurate camera pose estimation in a known scene. It is a key problem in computer vision and robotics, with applications including self-driving cars, Structure-from-Motion, SLAM, and Mixed Reality. Traditionally, the localization problem has been tackled using 3D geometry. Recently, end-to-end approaches based on convolutional neural networks have become popular.…

    Submitted 18 March, 2019; originally announced March 2019.

    Comments: Initial version of a paper accepted to CVPR 2019

  24. Tracking without bells and whistles

    Authors: Philipp Bergmann, Tim Meinhardt, Laura Leal-Taixe

    Abstract: The problem of tracking multiple objects in a video sequence poses several challenging tasks. For tracking-by-detection, these include object re-identification, motion prediction and dealing with occlusions. We present a tracker (without bells and whistles) that accomplishes tracking without specifically targeting any of these tasks; in particular, we perform no training or optimization on trackin…

    Submitted 17 August, 2019; v1 submitted 13 March, 2019; originally announced March 2019.

  25. Learning Temporal Coherence via Self-Supervision for GAN-based Video Generation

    Authors: Mengyu Chu, You Xie, Jonas Mayer, Laura Leal-Taixé, Nils Thuerey

    Abstract: Our work explores temporal self-supervision for GAN-based video generation tasks. While adversarial training successfully yields generative models for a variety of areas, temporal relationships in the generated data are much less explored. Natural temporal changes are crucial for sequential generation tasks, e.g. video super-resolution and unpaired video translation. For the former, state-of-the-a…

    Submitted 21 May, 2020; v1 submitted 23 November, 2018; originally announced November 2018.

    Comments: Project page: https://ge.in.tum.de/publications/2019-tecogan-chu/, code link: https://github.com/thunil/TecoGAN

  26. arXiv:1807.01001  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Modular Vehicle Control for Transferring Semantic Information Between Weather Conditions Using GANs

    Authors: Patrick Wenzel, Qadeer Khan, Daniel Cremers, Laura Leal-Taixé

    Abstract: Even though end-to-end supervised learning has shown promising results for sensorimotor control of self-driving cars, its performance is greatly affected by the weather conditions under which it was trained, showing poor generalization to unseen conditions. In this paper, we show how knowledge can be transferred using semantic maps to new weather conditions without the need to obtain new ground tr…

    Submitted 1 October, 2018; v1 submitted 3 July, 2018; originally announced July 2018.

    Comments: 2nd Conference on Robot Learning (CoRL 2018), Zürich, Switzerland

  27. arXiv:1805.00613  [pdf, other]

    cs.CV

    Deep Perm-Set Net: Learn to predict sets with unknown permutation and cardinality using deep neural networks

    Authors: S. Hamid Rezatofighi, Roman Kaskman, Farbod T. Motlagh, Qinfeng Shi, Daniel Cremers, Laura Leal-Taixé, Ian Reid

    Abstract: Many real-world problems, e.g. object detection, have outputs that are naturally expressed as sets of entities. This creates a challenge for traditional deep neural networks which naturally deal with structured outputs such as vectors, matrices or tensors. We present a novel approach for learning to predict sets with unknown permutation and cardinality using deep neural networks. Specifically, in…

    Submitted 2 October, 2018; v1 submitted 1 May, 2018; originally announced May 2018.

  28. arXiv:1804.00863  [pdf, other]

    cs.CV cs.GR

    Deep Appearance Maps

    Authors: Maxim Maximov, Laura Leal-Taixé, Mario Fritz, Tobias Ritschel

    Abstract: We propose a deep representation of appearance, i.e., the relation of color, surface orientation, viewer position, material and illumination. Previous approaches have used deep learning to extract classic appearance representations relating to reflectance model parameters (e.g., Phong) or illumination (e.g., HDR environment maps). We suggest to directly represent appearance itself as a network we c…

    Submitted 29 October, 2019; v1 submitted 3 April, 2018; originally announced April 2018.

    Journal ref: ICCV 2019

  29. arXiv:1803.08660  [pdf, other]

    cs.CV cs.NE

    Lifting Layers: Analysis and Applications

    Authors: Peter Ochs, Tim Meinhardt, Laura Leal-Taixe, Michael Moeller

    Abstract: The great advances of learning-based approaches in image processing and computer vision are largely based on deeply nested networks that compose linear transfer functions with suitable non-linearities. Interestingly, the most frequently used non-linearities in imaging applications (variants of the rectified linear unit) are uncommon in low dimensional approximation problems. In this paper we propo…

    Submitted 23 March, 2018; originally announced March 2018.

  30. arXiv:1709.06031  [pdf, other]

    cs.CV

    Video Object Segmentation Without Temporal Information

    Authors: Kevis-Kokitsi Maninis, Sergi Caelles, Yuhua Chen, Jordi Pont-Tuset, Laura Leal-Taixé, Daniel Cremers, Luc Van Gool

    Abstract: Video Object Segmentation, and video processing in general, has been historically dominated by methods that rely on the temporal consistency and redundancy in consecutive video frames. When the temporal smoothness is suddenly broken, such as when an object is occluded, or some frames are missing in a sequence, the result of these methods can deteriorate significantly or they may not even produce a…

    Submitted 16 May, 2018; v1 submitted 18 September, 2017; originally announced September 2017.

    Comments: Accepted to T-PAMI. Extended version of "One-Shot Video Object Segmentation", CVPR 2017 (arXiv:1611.05198). Project page: http://www.vision.ee.ethz.ch/~cvlsegmentation/osvos/

  31. Automatic tracking of vessel-like structures from a single starting point

    Authors: Dario Augusto Borges Oliveira, Laura Leal-Taixe, Raul Queiroz Feitosa, Bodo Rosenhahn

    Abstract: The identification of vascular networks is an important topic in the medical image analysis community. While most methods focus on single vessel tracking, the few solutions that exist for tracking complete vascular networks are usually computationally intensive and require a lot of user interaction. In this paper we present a method to track full vascular networks iteratively using a single starti…

    Submitted 7 June, 2017; originally announced June 2017.

  32. arXiv:1705.08314  [pdf, other]

    cs.CV

    Fusion of Head and Full-Body Detectors for Multi-Object Tracking

    Authors: Roberto Henschel, Laura Leal-Taixé, Daniel Cremers, Bodo Rosenhahn

    Abstract: In order to track all persons in a scene, the tracking-by-detection paradigm has proven to be a very effective approach. Yet, relying solely on a single detector is also a major limitation, as useful image information might be ignored. Consequently, this work demonstrates how to fuse two detectors into a tracking system. To obtain the trajectories, we propose to formulate tracking as a weighted gr…

    Submitted 24 April, 2018; v1 submitted 23 May, 2017; originally announced May 2017.

    Comments: 10 pages, 4 figures; Winner of the MOT17 challenge; CVPRW 2018

  33. arXiv:1705.05020  [pdf, other]

    cs.LG

    Discrete-Continuous ADMM for Transductive Inference in Higher-Order MRFs

    Authors: Emanuel Laude, Jan-Hendrik Lange, Jonas Schüpfer, Csaba Domokos, Laura Leal-Taixé, Frank R. Schmidt, Bjoern Andres, Daniel Cremers

    Abstract: This paper introduces a novel algorithm for transductive inference in higher-order MRFs, where the unary energies are parameterized by a variable classifier. The considered task is posed as a joint optimization problem in the continuous classifier parameters and the discrete label variables. In contrast to prior approaches such as convex relaxations, we propose an advantageous decoupling of the ob…

    Submitted 28 April, 2018; v1 submitted 14 May, 2017; originally announced May 2017.

  34. arXiv:1704.02781  [pdf, other]

    cs.CV

    Tracking the Trackers: An Analysis of the State of the Art in Multiple Object Tracking

    Authors: Laura Leal-Taixé, Anton Milan, Konrad Schindler, Daniel Cremers, Ian Reid, Stefan Roth

    Abstract: Standardized benchmarks are crucial for the majority of computer vision applications. Although leaderboards and ranking tables should not be over-claimed, benchmarks often provide the most objective measure of performance and are therefore important guides for research. We present a benchmark for Multiple Object Tracking launched in late 2014, with the goal of creating a framework for the stan…

    Submitted 10 April, 2017; originally announced April 2017.

  35. arXiv:1704.01085  [pdf, other]

    cs.CV

    Deep Depth From Focus

    Authors: Caner Hazirbas, Sebastian Georg Soyer, Maximilian Christian Staab, Laura Leal-Taixé, Daniel Cremers

    Abstract: Depth from focus (DFF) is one of the classical ill-posed inverse problems in computer vision. Most approaches recover the depth at each pixel based on the focal setting which exhibits maximal sharpness. Yet, it is not obvious how to reliably estimate the sharpness level, particularly in low-textured areas. In this paper, we propose 'Deep Depth From Focus (DDFF)' as the first end-to-end learning ap…

    Submitted 28 October, 2018; v1 submitted 4 April, 2017; originally announced April 2017.

    Comments: accepted to Asian Conference on Computer Vision (ACCV) 2018

  36. arXiv:1611.07890  [pdf, other]

    cs.CV

    Image-based localization using LSTMs for structured feature correlation

    Authors: Florian Walch, Caner Hazirbas, Laura Leal-Taixé, Torsten Sattler, Sebastian Hilsenbeck, Daniel Cremers

    Abstract: In this work we propose a new CNN+LSTM architecture for camera pose regression for indoor and outdoor scenes. CNNs allow us to learn suitable feature representations for localization that are robust against motion blur and illumination changes. We make use of LSTM units on the CNN output, which play the role of a structured dimensionality reduction on the feature vector, leading to drastic improve…

    Submitted 20 August, 2017; v1 submitted 23 November, 2016; originally announced November 2016.

  37. arXiv:1611.05198  [pdf, other]

    cs.CV

    One-Shot Video Object Segmentation

    Authors: Sergi Caelles, Kevis-Kokitsi Maninis, Jordi Pont-Tuset, Laura Leal-Taixé, Daniel Cremers, Luc Van Gool

    Abstract: This paper tackles the task of semi-supervised video object segmentation, i.e., the separation of an object from the background in a video, given the mask of the first frame. We present One-Shot Video Object Segmentation (OSVOS), based on a fully-convolutional neural network architecture that is able to successively transfer generic semantic information, learned on ImageNet, to the task of foregro…

    Submitted 13 April, 2017; v1 submitted 16 November, 2016; originally announced November 2016.

    Comments: CVPR 2017 camera ready. Code: http://www.vision.ee.ethz.ch/~cvlsegmentation/osvos/

  38. arXiv:1607.07304  [pdf, other]

    cs.CV

    Tracking with multi-level features

    Authors: Roberto Henschel, Laura Leal-Taixé, Bodo Rosenhahn, Konrad Schindler

    Abstract: We present a novel formulation of the multiple object tracking problem which integrates low and mid-level features. In particular, we formulate the tracking problem as a quadratic program coupling detections and dense point trajectories. Due to the computational complexity of the initial QP, we propose an approximation by two auxiliary problems, a temporal and spatial association, where the tempor…

    Submitted 25 July, 2016; originally announced July 2016.

    Comments: Submitted as an IEEE PAMI short article

  39. arXiv:1604.07866  [pdf, other]

    cs.LG cs.CV

    Learning by tracking: Siamese CNN for robust target association

    Authors: Laura Leal-Taixé, Cristian Canton Ferrer, Konrad Schindler

    Abstract: This paper introduces a novel approach to the task of data association within the context of pedestrian tracking, by introducing a two-stage learning scheme to match pairs of detections. First, a Siamese convolutional neural network (CNN) is trained to learn descriptors encoding local spatio-temporal structures between the two input image patches, aggregating pixel values and optical flow informat…

    Submitted 4 August, 2016; v1 submitted 26 April, 2016; originally announced April 2016.

    Journal ref: Computer Vision and Pattern Recognition Conference Workshops (CVPRW). DeepVision: Deep Learning for Computer Vision. 2016

  40. arXiv:1603.00831  [pdf, other]

    cs.CV

    MOT16: A Benchmark for Multi-Object Tracking

    Authors: Anton Milan, Laura Leal-Taixe, Ian Reid, Stefan Roth, Konrad Schindler

    Abstract: Standardized benchmarks are crucial for the majority of computer vision applications. Although leaderboards and ranking tables should not be over-claimed, benchmarks often provide the most objective measure of performance and are therefore important guides for research. Recently, a new benchmark for Multiple Object Tracking, MOTChallenge, was launched with the goal of collecting existing and new…

    Submitted 3 May, 2016; v1 submitted 2 March, 2016; originally announced March 2016.

    Comments: arXiv admin note: substantial text overlap with arXiv:1504.01942

  41. arXiv:1504.01942  [pdf, other]

    cs.CV

    MOTChallenge 2015: Towards a Benchmark for Multi-Target Tracking

    Authors: Laura Leal-Taixé, Anton Milan, Ian Reid, Stefan Roth, Konrad Schindler

    Abstract: In the recent past, the computer vision community has developed centralized benchmarks for the performance evaluation of a variety of tasks, including generic object and pedestrian detection, 3D reconstruction, optical flow, single-object short-term tracking, and stereo estimation. Despite potential pitfalls of such benchmarks, they have proved to be extremely helpful to advance the state of the a…

    Submitted 8 April, 2015; originally announced April 2015.

  42. arXiv:1411.7935  [pdf, other]

    cs.CV

    Multiple object tracking with context awareness

    Authors: Laura Leal-Taixé

    Abstract: Multiple people tracking is a key problem for many applications such as surveillance, animation or car navigation, and a key input for tasks such as activity recognition. In crowded environments occlusions and false detections are common, and although there have been substantial advances in recent years, tracking is still a challenging task. Tracking is typically divided into two steps: detection,…

    Submitted 30 November, 2016; v1 submitted 24 November, 2014; originally announced November 2014.

    Comments: PhD thesis, Leibniz University Hannover, Germany