Showing 1–10 of 10 results for author: Booij, O

Searching in archive cs.
  1. arXiv:2308.04770  [pdf, other]

    cs.CV

    Objects do not disappear: Video object detection by single-frame object location anticipation

    Authors: Xin Liu, Fatemeh Karimi Nejadasl, Jan C. van Gemert, Olaf Booij, Silvia L. Pintea

    Abstract: Objects in videos are typically characterized by continuous smooth motion. We exploit continuous smooth motion in three ways. 1) Improved accuracy by using object motion as an additional source of supervision, which we obtain by anticipating object locations from a static keyframe. 2) Improved efficiency by only doing the expensive feature computations on a small subset of all frames. Because neig…

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV 2023

  2. arXiv:2303.05915  [pdf, other]

    cs.CV

    Convolutional Cross-View Pose Estimation

    Authors: Zimin Xia, Olaf Booij, Julian F. P. Kooij

    Abstract: We propose a novel end-to-end method for cross-view pose estimation. Given a ground-level query image and an aerial image that covers the query's local neighborhood, the 3 Degrees-of-Freedom camera pose of the query is estimated by matching its image descriptor to descriptors of local regions within the aerial image. The orientation-aware descriptors are obtained by using a translationally equivar…

    Submitted 22 December, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

  3. arXiv:2208.08519  [pdf, other]

    cs.CV

    Visual Cross-View Metric Localization with Dense Uncertainty Estimates

    Authors: Zimin Xia, Olaf Booij, Marco Manfredi, Julian F. P. Kooij

    Abstract: This work addresses visual cross-view metric localization for outdoor robotics. Given a ground-level color image and a satellite patch that contains the local surroundings, the task is to identify the location of the ground camera within the satellite patch. Related work addressed this task for range-sensors (LiDAR, Radar), but for vision, only as a secondary regression step after an initial cross…

    Submitted 17 August, 2022; originally announced August 2022.

    Comments: ECCV 2022

  4. arXiv:2111.13087  [pdf, other]

    cs.CV

    BoxeR: Box-Attention for 2D and 3D Transformers

    Authors: Duy-Kien Nguyen, Jihong Ju, Olaf Booij, Martin R. Oswald, Cees G. M. Snoek

    Abstract: In this paper, we propose a simple attention mechanism, we call box-attention. It enables spatial interaction between grid features, as sampled from boxes of interest, and improves the learning capability of transformers for several vision tasks. Specifically, we present BoxeR, short for Box Transformer, which attends to a set of boxes by predicting their transformation from a reference window on…

    Submitted 25 March, 2022; v1 submitted 25 November, 2021; originally announced November 2021.

    Comments: In Proceeding of CVPR'2022

  5. arXiv:2103.15395  [pdf, other]

    cs.CV

    No frame left behind: Full Video Action Recognition

    Authors: Xin Liu, Silvia L. Pintea, Fatemeh Karimi Nejadasl, Olaf Booij, Jan C. van Gemert

    Abstract: Not all video frames are equally informative for recognizing an action. It is computationally infeasible to train deep networks on all video frames when actions develop over hundreds of frames. A common heuristic is uniformly sampling a small number of video frames and using these to recognize the action. Instead, here we propose full video action recognition and consider all video frames. To make…

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: Accepted to CVPR 2021

  6. arXiv:2011.00551  [pdf, other]

    cs.CV cs.AI

    Adversarial Self-Supervised Scene Flow Estimation

    Authors: Victor Zuanazzi, Joris van Vugt, Olaf Booij, Pascal Mettes

    Abstract: This work proposes a metric learning approach for self-supervised scene flow estimation. Scene flow estimation is the task of estimating 3D flow vectors for consecutive 3D point clouds. Such flow vectors are fruitful, e.g., for recognizing actions, or avoiding collisions. Training a neural network via supervised learning for scene flow is impractical, as this requires manual annotations for each 3D…

    Submitted 1 November, 2020; originally announced November 2020.

    Comments: Published at 3DV 2020

  7. arXiv:2007.12668  [pdf, other]

    cs.CV cs.LG

    KPRNet: Improving projection-based LiDAR semantic segmentation

    Authors: Deyvid Kochanov, Fatemeh Karimi Nejadasl, Olaf Booij

    Abstract: Semantic segmentation is an important component in the perception systems of autonomous vehicles. In this work, we adopt recent advances in both image and point cloud segmentation to achieve a better accuracy in the task of segmenting LiDAR scans. KPRNet improves the convolutional neural network architecture of 2D projection methods and utilizes KPConv to replace the commonly used post-processing…

    Submitted 21 August, 2020; v1 submitted 24 July, 2020; originally announced July 2020.

    Comments: ECCV 2020. Code and pre-trained models at https://github.com/DeyvidKochanov-TomTom/kprnet

  8. arXiv:1908.11309  [pdf, other]

    cs.CV cs.LG eess.IV

    Exploiting Temporality for Semi-Supervised Video Segmentation

    Authors: Radu Sibechi, Olaf Booij, Nora Baka, Peter Bloem

    Abstract: In recent years, there has been remarkable progress in supervised image segmentation. Video segmentation is less explored, despite the temporal dimension being highly informative. Semantic labels, e.g. that cannot be accurately detected in the current frame, may be inferred by incorporating information from previous frames. However, video segmentation is challenging due to the amount of data that…

    Submitted 29 August, 2019; originally announced August 2019.

    Comments: Accepted as workshop paper at ICCV 2019

  9. arXiv:1908.02711  [pdf, other]

    cs.CV

    I Bet You Are Wrong: Gambling Adversarial Networks for Structured Semantic Segmentation

    Authors: Laurens Samson, Nanne van Noord, Olaf Booij, Michael Hofmann, Efstratios Gavves, Mohsen Ghafoorian

    Abstract: Adversarial training has been recently employed for realizing structured semantic segmentation, in which the aim is to preserve higher-level scene structural consistencies in dense predictions. However, as we show, value-based discrimination between the predictions from the segmentation network and ground-truth annotations can hinder the training process from learning to improve structural qualiti…

    Submitted 7 August, 2019; originally announced August 2019.

    Comments: 13 pages, 8 figures

  10. arXiv:1806.05525  [pdf, other]

    cs.CV

    EL-GAN: Embedding Loss Driven Generative Adversarial Networks for Lane Detection

    Authors: Mohsen Ghafoorian, Cedric Nugteren, Nóra Baka, Olaf Booij, Michael Hofmann

    Abstract: Convolutional neural networks have been successfully applied to semantic segmentation problems. However, there are many problems that are inherently not pixel-wise classification problems but are nevertheless frequently formulated as semantic segmentation. This ill-posed formulation consequently necessitates hand-crafted scenario-specific and computationally expensive post-processing methods to co…

    Submitted 5 July, 2018; v1 submitted 14 June, 2018; originally announced June 2018.

    Comments: 14 pages, 7 figures