Skip to main content

Showing 1–22 of 22 results for author: Gehrig, M

.
  1. arXiv:2402.15584  [pdf, other

    cs.CV cs.LG

    State Space Models for Event Cameras

    Authors: Nikola Zubić, Mathias Gehrig, Davide Scaramuzza

    Abstract: Today, state-of-the-art deep neural networks that process event-camera data first convert a temporal window of events into dense, grid-like input representations. As such, they exhibit poor generalizability when deployed at higher inference frequencies (i.e., smaller temporal windows) than the ones they were trained on. We address this challenge by introducing state-space models (SSMs) with learna… ▽ More

    Submitted 18 April, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: 18 pages, 5 figures, 6 tables, CVPR 2024 Camera Ready paper

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 2024

  2. arXiv:2311.17286  [pdf, other

    cs.CV

    LEOD: Label-Efficient Object Detection for Event Cameras

    Authors: Ziyi Wu, Mathias Gehrig, Qing Lyu, Xudong Liu, Igor Gilitschenski

    Abstract: Object detection with event cameras benefits from the sensor's low latency and high dynamic range. However, it is costly to fully label event streams for supervised training due to their high temporal resolution. To reduce this cost, we present LEOD, the first method for label-efficient event-based detection. Our approach unifies weakly- and semi-supervised object detection with a self-training me… ▽ More

    Submitted 25 March, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: CVPR 2024. Code: https://github.com/Wuziyi616/LEOD

  3. arXiv:2306.07050  [pdf, other

    cs.CV

    Revisiting Token Pruning for Object Detection and Instance Segmentation

    Authors: Yifei Liu, Mathias Gehrig, Nico Messikommer, Marco Cannici, Davide Scaramuzza

    Abstract: Vision Transformers (ViTs) have shown impressive performance in computer vision, but their high computational cost, quadratic in the number of tokens, limits their adoption in computation-constrained applications. However, this large number of tokens may not be necessary, as not all tokens are equally important. In this paper, we investigate token pruning to accelerate inference for object detecti… ▽ More

    Submitted 12 December, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Journal ref: IEEE Winter Conference on Applications of Computer Vision (WACV 2024)

  4. arXiv:2304.13455  [pdf, other

    cs.CV cs.LG

    From Chaos Comes Order: Ordering Event Representations for Object Recognition and Detection

    Authors: Nikola Zubić, Daniel Gehrig, Mathias Gehrig, Davide Scaramuzza

    Abstract: Today, state-of-the-art deep neural networks that process events first convert them into dense, grid-like input representations before using an off-the-shelf network. However, selecting the appropriate representation for the task traditionally requires training a neural network for each representation and selecting the best one based on the validation score, which is very time-consuming. This work… ▽ More

    Submitted 30 August, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: 15 pages, 11 figures, 2 tables, ICCV 2023 Camera Ready paper

  5. arXiv:2304.07139  [pdf, other

    cs.CV cs.NE

    Neuromorphic Optical Flow and Real-time Implementation with Event Cameras

    Authors: Yannick Schnider, Stanislaw Wozniak, Mathias Gehrig, Jules Lecomte, Axel von Arnim, Luca Benini, Davide Scaramuzza, Angeliki Pantazi

    Abstract: Optical flow provides information on relative motion that is an important component in many computer vision pipelines. Neural networks provide high accuracy optical flow, yet their complexity is often prohibitive for application at the edge or in robots, where efficiency and latency play crucial role. To address this challenge, we build on the latest developments in event-based vision and spiking… ▽ More

    Submitted 12 July, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: Accepted for IEEE CVPRW, Vancouver 2023. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media. Copyright 2023 IEEE

  6. arXiv:2303.14176  [pdf, other

    cs.CV cs.AI

    A Hybrid ANN-SNN Architecture for Low-Power and Low-Latency Visual Perception

    Authors: Asude Aydin, Mathias Gehrig, Daniel Gehrig, Davide Scaramuzza

    Abstract: Spiking Neural Networks (SNN) are a class of bio-inspired neural networks that promise to bring low-power and low-latency inference to edge devices through asynchronous and sparse processing. However, being temporal models, SNNs depend heavily on expressive states to generate predictions on par with classical artificial neural networks (ANNs). These states converge only after long transient period… ▽ More

    Submitted 17 April, 2024; v1 submitted 24 March, 2023; originally announced March 2023.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, 2024

  7. arXiv:2212.05598  [pdf, other

    cs.CV

    Recurrent Vision Transformers for Object Detection with Event Cameras

    Authors: Mathias Gehrig, Davide Scaramuzza

    Abstract: We present Recurrent Vision Transformers (RVTs), a novel backbone for object detection with event cameras. Event cameras provide visual information with sub-millisecond latency at a high-dynamic range and with strong robustness against motion blur. These unique properties offer great potential for low-latency object detection and tracking in time-critical scenarios. Prior work in event-based visio… ▽ More

    Submitted 25 May, 2023; v1 submitted 11 December, 2022; originally announced December 2022.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 2023

  8. arXiv:2211.12826  [pdf, other

    cs.CV

    Data-driven Feature Tracking for Event Cameras

    Authors: Nico Messikommer, Carter Fang, Mathias Gehrig, Davide Scaramuzza

    Abstract: Because of their high temporal resolution, increased resilience to motion blur, and very sparse output, event cameras have been shown to be ideal for low-latency and low-bandwidth feature tracking, even in challenging scenarios. Existing feature tracking methods for event cameras are either handcrafted or derived from first principles but require extensive parameter tuning, are sensitive to noise,… ▽ More

    Submitted 25 April, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 2023

  9. Dense Continuous-Time Optical Flow from Events and Frames

    Authors: Mathias Gehrig, Manasi Muglikar, Davide Scaramuzza

    Abstract: We present a method for estimating dense continuous-time optical flow from event data. Traditional dense optical flow methods compute the pixel displacement between two images. Due to missing information, these approaches cannot recover the pixel trajectories in the blind time between two images. In this work, we show that it is possible to compute per-pixel, continuous-time optical flow using eve… ▽ More

    Submitted 11 February, 2024; v1 submitted 25 March, 2022; originally announced March 2022.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

  10. Bridging the Gap between Events and Frames through Unsupervised Domain Adaptation

    Authors: Nico Messikommer, Daniel Gehrig, Mathias Gehrig, Davide Scaramuzza

    Abstract: Reliable perception during fast motion maneuvers or in high dynamic range environments is crucial for robotic systems. Since event cameras are robust to these challenging conditions, they have great potential to increase the reliability of robot vision. However, event-based vision has been held back by the shortage of labeled datasets due to the novelty of event cameras. To overcome this drawback,… ▽ More

    Submitted 3 February, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

    Journal ref: IEEE Robotics and Automation Letters (RA-L), 2022

  11. arXiv:2108.10552  [pdf, other

    cs.CV

    E-RAFT: Dense Optical Flow from Event Cameras

    Authors: Mathias Gehrig, Mario Millhäusler, Daniel Gehrig, Davide Scaramuzza

    Abstract: We propose to incorporate feature correlation and sequential processing into dense optical flow estimation from event cameras. Modern frame-based optical flow methods heavily rely on matching costs computed from feature correlation. In contrast, there exists no optical flow method for event cameras that explicitly computes matching costs. Instead, learning-based approaches using events usually res… ▽ More

    Submitted 21 October, 2021; v1 submitted 24 August, 2021; originally announced August 2021.

    Comments: International Conference on 3D Vision (3DV)

  12. arXiv:2106.07286  [pdf, other

    cs.CV

    TimeLens: Event-based Video Frame Interpolation

    Authors: Stepan Tulyakov, Daniel Gehrig, Stamatios Georgoulis, Julius Erbach, Mathias Gehrig, Yuanyou Li, Davide Scaramuzza

    Abstract: State-of-the-art frame interpolation methods generate intermediate frames by inferring object motions in the image from consecutive key-frames. In the absence of additional information, first-order approximations, i.e. optical flow, must be used, but this choice restricts the types of motions that can be modeled, leading to errors in highly dynamic scenarios. Event cameras are novel sensors that a… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021

  13. arXiv:2105.12362  [pdf, other

    cs.CV

    How to Calibrate Your Event Camera

    Authors: Manasi Muglikar, Mathias Gehrig, Daniel Gehrig, Davide Scaramuzza

    Abstract: We propose a generic event camera calibration framework using image reconstruction. Instead of relying on blinking LED patterns or external screens, we show that neural-network-based image reconstruction is well suited for the task of intrinsic and extrinsic calibration of event cameras. The advantage of our proposed approach is that we can use standard calibration patterns that do not rely on act… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

    Comments: IEEE Conference on Computer Vision and Pattern Recognition Workshops

  14. arXiv:2103.06011  [pdf, other

    cs.CV cs.RO

    DSEC: A Stereo Event Camera Dataset for Driving Scenarios

    Authors: Mathias Gehrig, Willem Aarents, Daniel Gehrig, Davide Scaramuzza

    Abstract: Once an academic venture, autonomous driving has received unparalleled corporate funding in the last decade. Still, the operating conditions of current autonomous cars are mostly restricted to ideal scenarios. This means that driving in challenging illumination conditions such as night, sunrise, and sunset remains an open problem. In these cases, standard cameras are being pushed to their limits i… ▽ More

    Submitted 10 March, 2021; originally announced March 2021.

    Comments: IEEE Robotics and Automation Letters

  15. arXiv:2102.09320  [pdf, other

    cs.CV

    Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth Prediction

    Authors: Daniel Gehrig, Michelle Rüegg, Mathias Gehrig, Javier Hidalgo Carrio, Davide Scaramuzza

    Abstract: Event cameras are novel vision sensors that report per-pixel brightness changes as a stream of asynchronous "events". They offer significant advantages compared to standard cameras due to their high temporal resolution, high dynamic range and lack of motion blur. However, events only measure the varying component of the visual signal, which limits their ability to encode scene context. By contrast… ▽ More

    Submitted 18 February, 2021; originally announced February 2021.

    Journal ref: IEEE Robotics and Automation Letters (RA-L), 2021

  16. arXiv:2005.12813  [pdf, other

    cs.RO cs.CV eess.SY

    AlphaPilot: Autonomous Drone Racing

    Authors: Philipp Foehn, Dario Brescianini, Elia Kaufmann, Titus Cieslewski, Mathias Gehrig, Manasi Muglikar, Davide Scaramuzza

    Abstract: This paper presents a novel system for autonomous, vision-based drone racing combining learned data abstraction, nonlinear filtering, and time-optimal trajectory planning. The system has successfully been deployed at the first autonomous drone racing world championship: the 2019 AlphaPilot Challenge. Contrary to traditional drone racing systems, which only detect the next gate, our approach makes… ▽ More

    Submitted 20 August, 2021; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: This paper is an extended version of an accepted publication from Robotics: Science and Systems, 2020. This version has been accepted for publication in Autonomous Robots (Springer). Please cite as "AlphaPilot: Autonomous Drone Racing", P. Foehn, Autonomous Robots 2021. Associated video at https://youtu.be/DGjwm5PZQT8

  17. arXiv:2003.02790  [pdf, other

    cs.NE cs.CV cs.RO

    Event-Based Angular Velocity Regression with Spiking Networks

    Authors: Mathias Gehrig, Sumit Bam Shrestha, Daniel Mouritzen, Davide Scaramuzza

    Abstract: Spiking Neural Networks (SNNs) are bio-inspired networks that process information conveyed as temporal spikes rather than numeric values. A spiking neuron of an SNN only produces a spike whenever a significant number of spikes occur within a short period of time. Due to their spike-based computational model, SNNs can process output from event-based, asynchronous sensors without any pre-processing… ▽ More

    Submitted 5 March, 2020; originally announced March 2020.

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), Paris, 2020

  18. arXiv:1912.03095  [pdf, other

    cs.CV

    Video to Events: Recycling Video Datasets for Event Cameras

    Authors: Daniel Gehrig, Mathias Gehrig, Javier Hidalgo-Carrió, Davide Scaramuzza

    Abstract: Event cameras are novel sensors that output brightness changes in the form of a stream of asynchronous "events" instead of intensity frames. They offer significant advantages with respect to conventional cameras: high dynamic range (HDR), high temporal resolution, and no motion blur. Recently, novel learning approaches operating on event data have achieved impressive results. Yet, these methods re… ▽ More

    Submitted 1 April, 2020; v1 submitted 6 December, 2019; originally announced December 2019.

  19. arXiv:1911.04553  [pdf, other

    cs.RO eess.SY

    Towards Low-Latency High-Bandwidth Control of Quadrotors using Event Cameras

    Authors: Rika Sugimoto Dimitrova, Mathias Gehrig, Dario Brescianini, Davide Scaramuzza

    Abstract: Event cameras are a promising candidate to enable high speed vision-based control due to their low sensor latency and high temporal resolution. However, purely event-based feedback has yet to be used in the control of drones. In this work, a first step towards implementing low-latency high-bandwidth control of quadrotors using event cameras is taken. In particular, this paper addresses the problem… ▽ More

    Submitted 28 March, 2020; v1 submitted 11 November, 2019; originally announced November 2019.

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), Paris, 2020

  20. arXiv:1904.07235  [pdf, other

    cs.CV cs.LG cs.RO

    Focus Is All You Need: Loss Functions For Event-based Vision

    Authors: Guillermo Gallego, Mathias Gehrig, Davide Scaramuzza

    Abstract: Event cameras are novel vision sensors that output pixel-level brightness changes ("events") instead of traditional video frames. These asynchronous sensors offer several advantages over traditional cameras, such as, high temporal resolution, very high dynamic range, and no motion blur. To unlock the potential of such sensors, motion compensation methods have been recently proposed. We present a c… ▽ More

    Submitted 15 April, 2019; originally announced April 2019.

    Comments: 29 pages, 19 figures, 4 tables

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, 2019

  21. arXiv:1810.06224  [pdf, other

    cs.RO

    Beauty and the Beast: Optimal Methods Meet Learning for Drone Racing

    Authors: Elia Kaufmann, Mathias Gehrig, Philipp Foehn, René Ranftl, Alexey Dosovitskiy, Vladlen Koltun, Davide Scaramuzza

    Abstract: Autonomous micro aerial vehicles still struggle with fast and agile maneuvers, dynamic environments, imperfect sensing, and state estimation drift. Autonomous drone racing brings these challenges to the fore. Human pilots can fly a previously unseen track after a handful of practice runs. In contrast, state-of-the-art autonomous navigation algorithms require either a precise metric map of the envi… ▽ More

    Submitted 1 March, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: 6 pages (+1 references)

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2019

  22. Visual Place Recognition with Probabilistic Vertex Voting

    Authors: Mathias Gehrig, Elena Stumm, Timo Hinzmann, Roland Siegwart

    Abstract: We propose a novel scoring concept for visual place recognition based on nearest neighbor descriptor voting and demonstrate how the algorithm naturally emerges from the problem formulation. Based on the observation that the number of votes for matching places can be evaluated using a binomial distribution model, loop closures can be detected with high precision. By casting the problem into a proba… ▽ More

    Submitted 7 June, 2018; v1 submitted 11 October, 2016; originally announced October 2016.

    Comments: 8 pages

    Journal ref: 2017 IEEE International Conference on Robotics and Automation (ICRA)