Skip to main content

Showing 101–150 of 164 results for author: Scaramuzza, D

.
  1. arXiv:2008.07971  [pdf, other

    cs.AI cs.LG cs.RO

    Super-Human Performance in Gran Turismo Sport Using Deep Reinforcement Learning

    Authors: Florian Fuchs, Yunlong Song, Elia Kaufmann, Davide Scaramuzza, Peter Duerr

    Abstract: Autonomous car racing is a major challenge in robotics. It raises fundamental problems for classical approaches such as planning minimum-time trajectories under uncertain dynamics and controlling the car at the limits of its handling. Besides, the requirement of minimizing the lap time, which is a sparse objective, and the difficulty of collecting training data from human experts have also hindere… ▽ More

    Submitted 9 May, 2021; v1 submitted 18 August, 2020; originally announced August 2020.

    Comments: Accepted for Publication at the IEEE Robotics and Automation Letters (RA-L) 2021, and International Conference on Robots and Automation (ICRA) 2021

    Journal ref: IEEE Robotics and Automation Letters (RAL) 2021

  2. arXiv:2008.03324  [pdf, other

    cs.RO

    Fisher Information Field: an Efficient and Differentiable Map for Perception-aware Planning

    Authors: Zichao Zhang, Davide Scaramuzza

    Abstract: Considering visual localization accuracy at the planning time gives preference to robot motion that can be better localized and thus has the potential of improving vision-based navigation, especially in visually degraded environments. To integrate the knowledge about localization accuracy in motion planning algorithms, a central task is to quantify the amount of information that an image taken at… ▽ More

    Submitted 7 August, 2020; originally announced August 2020.

    Comments: 18 pages, 15 figures

  3. arXiv:2008.02532  [pdf, other

    math.OC cs.RO

    Online Weight-adaptive Nonlinear Model Predictive Control

    Authors: Dimche Kostadinov, Davide Scaramuzza

    Abstract: Nonlinear Model Predictive Control (NMPC) is a powerful and widely used technique for nonlinear dynamic process control under constraints. In NMPC, the state and control weights of the corresponding state and control costs are commonly selected based on human-expert knowledge, which usually reflects the acceptable stability in practice. Although broadly used, this approach might not be optimal for… ▽ More

    Submitted 6 August, 2020; originally announced August 2020.

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, 2020

  4. Learning High-Level Policies for Model Predictive Control

    Authors: Yunlong Song, Davide Scaramuzza

    Abstract: The combination of policy search and deep neural networks holds the promise of automating a variety of decision-making tasks. Model Predictive Control (MPC) provides robust solutions to robot control tasks by making use of a dynamical model of the system and solving an optimization problem online over a short planning horizon. In this work, we leverage probabilistic decision-making approaches and… ▽ More

    Submitted 9 May, 2021; v1 submitted 20 July, 2020; originally announced July 2020.

    Comments: Accepted for Publication at the 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, 2020

  5. arXiv:2007.06255  [pdf, other

    cs.RO

    CPC: Complementary Progress Constraints for Time-Optimal Quadrotor Trajectories

    Authors: Philipp Foehn, Davide Scaramuzza

    Abstract: In many mobile robotics scenarios, such as drone racing, the goal is to generate a trajectory that passes through multiple waypoints in minimal time. This problem is referred to as time-optimal planning. State-of-the-art approaches either use polynomial trajectory formulations, which are suboptimal due to their smoothness, or numerical optimization, which requires waypoints to be allocated as cost… ▽ More

    Submitted 3 August, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

  6. arXiv:2006.05768  [pdf, other

    cs.RO

    Deep Drone Acrobatics

    Authors: Elia Kaufmann, Antonio Loquercio, René Ranftl, Matthias Müller, Vladlen Koltun, Davide Scaramuzza

    Abstract: Performing acrobatic maneuvers with quadrotors is extremely challenging. Acrobatic flight requires high thrust and extreme angular accelerations that push the platform to its physical limits. Professional drone pilots often measure their level of mastery by flying such maneuvers in competitions. In this paper, we propose to learn a sensorimotor policy that enables an autonomous quadrotor to fly ex… ▽ More

    Submitted 11 June, 2020; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: 8 pages + 2 pages references. Video: https://youtu.be/2N_wKXQ6MXA. Code: https://github.com/uzh-rpg/deep_drone_acrobatics

    Journal ref: Robotics, Science, and Systems (RSS), 2020

  7. arXiv:2005.12813  [pdf, other

    cs.RO cs.CV eess.SY

    AlphaPilot: Autonomous Drone Racing

    Authors: Philipp Foehn, Dario Brescianini, Elia Kaufmann, Titus Cieslewski, Mathias Gehrig, Manasi Muglikar, Davide Scaramuzza

    Abstract: This paper presents a novel system for autonomous, vision-based drone racing combining learned data abstraction, nonlinear filtering, and time-optimal trajectory planning. The system has successfully been deployed at the first autonomous drone racing world championship: the 2019 AlphaPilot Challenge. Contrary to traditional drone racing systems, which only detect the next gate, our approach makes… ▽ More

    Submitted 20 August, 2021; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: This paper is an extended version of an accepted publication from Robotics: Science and Systems, 2020. This version has been accepted for publication in Autonomous Robots (Springer). Please cite as "AlphaPilot: Autonomous Drone Racing", P. Foehn, Autonomous Robots 2021. Associated video at https://youtu.be/DGjwm5PZQT8

  8. Reference Pose Generation for Long-term Visual Localization via Learned Features and View Synthesis

    Authors: Zichao Zhang, Torsten Sattler, Davide Scaramuzza

    Abstract: Visual Localization is one of the key enabling technologies for autonomous driving and augmented reality. High quality datasets with accurate 6 Degree-of-Freedom (DoF) reference poses are the foundation for benchmarking and improving existing methods. Traditionally, reference poses have been obtained via Structure-from-Motion (SfM). However, SfM itself relies on local features which are prone to f… ▽ More

    Submitted 30 December, 2020; v1 submitted 11 May, 2020; originally announced May 2020.

    Comments: 25 pages, 16 figures. Int J Comput Vis (2020)

  9. arXiv:2003.13493  [pdf, other

    cs.CV

    Faster than FAST: GPU-Accelerated Frontend for High-Speed VIO

    Authors: Balazs Nagy, Philipp Foehn, Davide Scaramuzza

    Abstract: The recent introduction of powerful embedded graphics processing units (GPUs) has allowed for unforeseen improvements in real-time computer vision applications. It has enabled algorithms to run onboard, well above the standard video rates, yielding not only higher information processing capability, but also reduced latency. This work focuses on the applicability of efficient low-level, GPU hardwar… ▽ More

    Submitted 3 August, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: IEEE International Conference on Intelligent Robots and Systems (IROS), 2020. Open-source implementation available at https://github.com/uzh-rpg/vilib

  10. arXiv:2003.09148  [pdf, other

    cs.CV cs.LG eess.SP

    Event-based Asynchronous Sparse Convolutional Networks

    Authors: Nico Messikommer, Daniel Gehrig, Antonio Loquercio, Davide Scaramuzza

    Abstract: Event cameras are bio-inspired sensors that respond to per-pixel brightness changes in the form of asynchronous and sparse "events". Recently, pattern recognition algorithms, such as learning-based methods, have made significant progress with event cameras by converting events into synchronous dense, image-like representations and applying traditional machine learning methods developed for standar… ▽ More

    Submitted 17 July, 2020; v1 submitted 20 March, 2020; originally announced March 2020.

    Journal ref: European Conference on Computer Vision (ECCV), 2020

  11. arXiv:2003.09078  [pdf, other

    cs.CV

    Reducing the Sim-to-Real Gap for Event Cameras

    Authors: Timo Stoffregen, Cedric Scheerlinck, Davide Scaramuzza, Tom Drummond, Nick Barnes, Lindsay Kleeman, Robert Mahony

    Abstract: Event cameras are paradigm-shifting novel sensors that report asynchronous, per-pixel brightness changes called 'events' with unparalleled low latency. This makes them ideal for high speed, high dynamic range scenes where conventional cameras would fail. Recent work has demonstrated impressive results using Convolutional Neural Networks (CNNs) for video reconstruction and optic flow with events. W… ▽ More

    Submitted 22 August, 2020; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: Figure 5 fixed (had a glitch)

    Journal ref: European Conference on Computer Vision, 2020

  12. arXiv:2003.05654  [pdf, other

    cs.RO cs.CV

    AirSim Drone Racing Lab

    Authors: Ratnesh Madaan, Nicholas Gyde, Sai Vemprala, Matthew Brown, Keiko Nagami, Tim Taubner, Eric Cristofalo, Davide Scaramuzza, Mac Schwager, Ashish Kapoor

    Abstract: Autonomous drone racing is a challenging research problem at the intersection of computer vision, planning, state estimation, and control. We introduce AirSim Drone Racing Lab, a simulation framework for enabling fast prototy** of algorithms for autonomy and enabling machine learning research in this domain, with the goal of reducing the time, money, and risks associated with field robotics. Our… ▽ More

    Submitted 12 March, 2020; originally announced March 2020.

    Comments: 14 pages, 6 figures

  13. arXiv:2003.04159  [pdf, other

    cs.RO

    Tightly-coupled Fusion of Global Positional Measurements in Optimization-based Visual-Inertial Odometry

    Authors: Giovanni Cioffi, Davide Scaramuzza

    Abstract: Motivated by the goal of achieving robust, drift-free pose estimation in long-term autonomous navigation, in this work we propose a methodology to fuse global positional information with visual and inertial measurements in a tightly-coupled nonlinear-optimization-based estimator. Differently from previous works, which are loosely-coupled, the use of a tightly-coupled approach allows exploiting the… ▽ More

    Submitted 10 July, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, 2020

  14. Geometry-aware Compensation Scheme for Morphing Drones

    Authors: Amedeo Fabris, Kevin Kleber, Davide Falanga, Davide Scaramuzza

    Abstract: Morphing multirotors, such as the Foldable Drone , can increase the versatility of drones employing in-flight-adaptive-morphology. To further increase precision in their tasks, recent works have investigated stable flight in asymmetric morphologies mainly leveraging the low-level controller. However, the aerodynamic effects embedded in multirotors are only analyzed in fixed shape aerial vehicles a… ▽ More

    Submitted 29 December, 2021; v1 submitted 9 March, 2020; originally announced March 2020.

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), Xi'an, 2021

  15. arXiv:2003.02790  [pdf, other

    cs.NE cs.CV cs.RO

    Event-Based Angular Velocity Regression with Spiking Networks

    Authors: Mathias Gehrig, Sumit Bam Shrestha, Daniel Mouritzen, Davide Scaramuzza

    Abstract: Spiking Neural Networks (SNNs) are bio-inspired networks that process information conveyed as temporal spikes rather than numeric values. A spiking neuron of an SNN only produces a spike whenever a significant number of spikes occur within a short period of time. Due to their spike-based computational model, SNNs can process output from event-based, asynchronous sensors without any pre-processing… ▽ More

    Submitted 5 March, 2020; originally announced March 2020.

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), Paris, 2020

  16. arXiv:2003.02247  [pdf, other

    cs.RO cs.CV

    Voxel Map for Visual SLAM

    Authors: Manasi Muglikar, Zichao Zhang, Davide Scaramuzza

    Abstract: In modern visual SLAM systems, it is a standard practice to retrieve potential candidate map points from overlap** keyframes for further feature matching or direct tracking. In this work, we argue that keyframes are not the optimal choice for this task, due to several inherent limitations, such as weak geometric reasoning and poor scalability. We propose a voxel-map representation to efficiently… ▽ More

    Submitted 4 March, 2020; originally announced March 2020.

    Journal ref: IEEE Conference on Robotics and Automation(ICRA), Paris, 2020

  17. Redesigning SLAM for Arbitrary Multi-Camera Systems

    Authors: Juichung Kuo, Manasi Muglikar, Zichao Zhang, Davide Scaramuzza

    Abstract: Adding more cameras to SLAM systems improves robustness and accuracy but complicates the design of the visual front-end significantly. Thus, most systems in the literature are tailored for specific camera configurations. In this work, we aim at an adaptive SLAM system that works for arbitrary multi-camera setups. To this end, we revisit several common building blocks in visual SLAM. In particular,… ▽ More

    Submitted 4 March, 2020; originally announced March 2020.

    Journal ref: IEEE Conference on Robotics and Automation (ICRA), Paris, 2020

  18. Learning Depth With Very Sparse Supervision

    Authors: Antonio Loquercio, Alexey Dosovitskiy, Davide Scaramuzza

    Abstract: Motivated by the astonishing capabilities of natural intelligent agents and inspired by theories from psychology, this paper explores the idea that perception gets coupled to 3D properties of the world via interaction with the environment. Existing works for depth estimation require either massive amounts of annotated training data or some form of hard-coded geometrical constraint. This paper expl… ▽ More

    Submitted 16 July, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: Accepted for Publication at the IEEE Robotics and Automation Letters (RA-L) 2020, and International Conference on Intelligent Robots and Systems (IROS) 2020

    Journal ref: IEEE Robotics and Automation Letters (RA-L) 2020

  19. Augmenting Visual Place Recognition with Structural Cues

    Authors: Amadeus Oertel, Titus Cieslewski, Davide Scaramuzza

    Abstract: In this paper, we propose to augment image-based place recognition with structural cues. Specifically, these structural cues are obtained using structure-from-motion, such that no additional sensors are needed for place recognition. This is achieved by augmenting the 2D convolutional neural network (CNN) typically used for image-based place recognition with a 3D CNN that takes as input a voxel gri… ▽ More

    Submitted 16 July, 2020; v1 submitted 29 February, 2020; originally announced March 2020.

    Comments: 8 pages, published in RA-L & IROS 2020

    Journal ref: IEEE Robotics and Automation Letters, 2020

  20. arXiv:2001.06209  [pdf, other

    cs.CV

    Registration made easy -- standalone orthopedic navigation with HoloLens

    Authors: Florentin Liebmann, Simon Roner, Marco von Atzigen, Florian Wanivenhaus, Caroline Neuhaus, José Spirig, Davide Scaramuzza, Reto Sutter, Jess Snedeker, Mazda Farshad, Philipp Fürnstahl

    Abstract: In surgical navigation, finding correspondence between preoperative plan and intraoperative anatomy, the so-called registration task, is imperative. One promising approach is to intraoperatively digitize anatomy and register it with the preoperative plan. State-of-the-art commercial navigation systems implement such approaches for pedicle screw placement in spinal fusion surgery. Although these sy… ▽ More

    Submitted 17 January, 2020; originally announced January 2020.

    Comments: 6 pages, 5 figures, accepted at CVPR 2019 workshop on Computer Vision Applications for Mixed Reality Headsets (https://docs.microsoft.com/en-us/windows/mixed-reality/cvpr-2019)

    ACM Class: I.4.1

  21. arXiv:1912.03095  [pdf, other

    cs.CV

    Video to Events: Recycling Video Datasets for Event Cameras

    Authors: Daniel Gehrig, Mathias Gehrig, Javier Hidalgo-Carrió, Davide Scaramuzza

    Abstract: Event cameras are novel sensors that output brightness changes in the form of a stream of asynchronous "events" instead of intensity frames. They offer significant advantages with respect to conventional cameras: high dynamic range (HDR), high temporal resolution, and no motion blur. Recently, novel learning approaches operating on event data have achieved impressive results. Yet, these methods re… ▽ More

    Submitted 1 April, 2020; v1 submitted 6 December, 2019; originally announced December 2019.

  22. arXiv:1911.04553  [pdf, other

    cs.RO eess.SY

    Towards Low-Latency High-Bandwidth Control of Quadrotors using Event Cameras

    Authors: Rika Sugimoto Dimitrova, Mathias Gehrig, Dario Brescianini, Davide Scaramuzza

    Abstract: Event cameras are a promising candidate to enable high speed vision-based control due to their low sensor latency and high temporal resolution. However, purely event-based feedback has yet to be used in the control of drones. In this work, a first step towards implementing low-latency high-bandwidth control of quadrotors using event cameras is taken. In particular, this paper addresses the problem… ▽ More

    Submitted 28 March, 2020; v1 submitted 11 November, 2019; originally announced November 2019.

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), Paris, 2020

  23. arXiv:1909.01423  [pdf, other

    cs.RO

    Exploration Without Global Consistency Using Local Volume Consolidation

    Authors: Titus Cieslewski, Andreas Ziegler, Davide Scaramuzza

    Abstract: In exploration, the goal is to build a map of an unknown environment. Most state-of-the-art approaches use map representations that require drift-free state estimates to function properly. Real-world state estimators, however, exhibit drift. In this paper, we present a 2D map representation for exploration that is robust to drift. Rather than a global map, it uses local metric volumes connected by… ▽ More

    Submitted 3 September, 2019; originally announced September 2019.

    Comments: 16 pages with large margins, accepted for publication at the International Symposium on Robotics Research (ISRR), Hanoi, 2019

    Journal ref: International Symposium on Robotics Research (ISRR), Hanoi, 2019

  24. A General Framework for Uncertainty Estimation in Deep Learning

    Authors: Antonio Loquercio, Mattia Segù, Davide Scaramuzza

    Abstract: Neural networks predictions are unreliable when the input sample is out of the training distribution or corrupted by noise. Being able to detect such failures automatically is fundamental to integrate deep learning algorithms into robotics. Current approaches for uncertainty estimation of neural networks require changes to the network and optimization process, typically ignore prior knowledge abou… ▽ More

    Submitted 7 February, 2020; v1 submitted 16 July, 2019; originally announced July 2019.

    Comments: Accepted for publication in the Robotics and Automation Letters 2020, and for presentation at the International Conference on Robotics and Automation (ICRA) 2020

    Journal ref: IEEE Robotics and Automation Letters 2020

  25. arXiv:1906.07165  [pdf, other

    cs.CV

    High Speed and High Dynamic Range Video with an Event Camera

    Authors: Henri Rebecq, René Ranftl, Vladlen Koltun, Davide Scaramuzza

    Abstract: Event cameras are novel sensors that report brightness changes in the form of a stream of asynchronous "events" instead of intensity frames. They offer significant advantages with respect to conventional cameras: high temporal resolution, high dynamic range, and no motion blur. While the stream of events encodes in principle the complete visual signal, the reconstruction of an intensity image from… ▽ More

    Submitted 15 June, 2019; originally announced June 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1904.08298

  26. arXiv:1906.03996  [pdf, other

    cs.RO

    Rethinking Trajectory Evaluation for SLAM: a Probabilistic, Continuous-Time Approach

    Authors: Zichao Zhang, Davide Scaramuzza

    Abstract: Despite the existence of different error metrics for trajectory evaluation in SLAM, their theoretical justifications and connections are rarely studied, and few methods handle temporal association properly. In this work, we propose to formulate the trajectory evaluation problem in a probabilistic, continuous-time framework. By modeling the groundtruth as random variables, the concepts of absolute… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: Accepted at ICRA19 Workshop on Dataset Generation and Benchmarking of SLAM Algorithms for Robotics and VR/AR. Best paper award

  27. arXiv:1906.03289  [pdf, other

    cs.RO

    Visual-Inertial Odometry of Aerial Robots

    Authors: Davide Scaramuzza, Zichao Zhang

    Abstract: Visual-Inertial odometry (VIO) is the process of estimating the state (pose and velocity) of an agent (e.g., an aerial robot) by using only the input of one or more cameras plus one or more Inertial Measurement Units (IMUs) attached to it. VIO is the only viable alternative to GPS and lidar-based odometry to achieve accurate state estimation. Since both cameras and IMUs are very cheap, these senso… ▽ More

    Submitted 14 June, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

    Comments: Accepted in the Encyclopedia of Robotics, Springer

  28. arXiv:1906.02919  [pdf, other

    cs.RO cs.CV

    EVDodgeNet: Deep Dynamic Obstacle Dodging with Event Cameras

    Authors: Nitin J. Sanket, Chethan M. Parameshwara, Chahat Deep Singh, Ashwin V. Kuruttukulam, Cornelia Fermüller, Davide Scaramuzza, Yiannis Aloimonos

    Abstract: Dynamic obstacle avoidance on quadrotors requires low latency. A class of sensors that are particularly suitable for such scenarios are event cameras. In this paper, we present a deep learning -- based solution for dodging multiple dynamic obstacles on a quadrotor with a single event camera and on-board computation. Our approach uses a series of shallow neural networks for estimating both the ego-… ▽ More

    Submitted 1 March, 2020; v1 submitted 7 June, 2019; originally announced June 2019.

    Comments: 15 pages, 16 figures, Code and Video can be found at: https://prg.cs.umd.edu/EVDodgeNet

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2020

  29. Deep Drone Racing: From Simulation to Reality with Domain Randomization

    Authors: Antonio Loquercio, Elia Kaufmann, René Ranftl, Alexey Dosovitskiy, Vladlen Koltun, Davide Scaramuzza

    Abstract: Dynamically changing environments, unreliable state estimation, and operation under severe resource constraints are fundamental challenges that limit the deployment of small autonomous drones. We address these challenges in the context of autonomous, vision-based drone racing in dynamic environments. A racing drone must traverse a track with possibly moving gates at high speed. We enable this func… ▽ More

    Submitted 25 November, 2019; v1 submitted 20 May, 2019; originally announced May 2019.

    Comments: Accepted as a Regular Paper to the IEEE Transactions on Robotics Journal. arXiv admin note: substantial text overlap with arXiv:1806.08548

    Journal ref: IEEE Transactions on Robotics 2019

  30. arXiv:1904.10772  [pdf, other

    cs.CV

    CED: Color Event Camera Dataset

    Authors: Cedric Scheerlinck, Henri Rebecq, Timo Stoffregen, Nick Barnes, Robert Mahony, Davide Scaramuzza

    Abstract: Event cameras are novel, bio-inspired visual sensors, whose pixels output asynchronous and independent timestamped spikes at local intensity changes, called 'events'. Event cameras offer advantages over conventional frame-based cameras in terms of latency, high dynamic range (HDR) and temporal resolution. Until recently, event cameras have been limited to outputting events in the intensity channel… ▽ More

    Submitted 24 April, 2019; originally announced April 2019.

    Comments: Conference on Computer Vision and Pattern Recognition Workshops

  31. arXiv:1904.08405  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Event-based Vision: A Survey

    Authors: Guillermo Gallego, Tobi Delbruck, Garrick Orchard, Chiara Bartolozzi, Brian Taba, Andrea Censi, Stefan Leutenegger, Andrew Davison, Joerg Conradt, Kostas Daniilidis, Davide Scaramuzza

    Abstract: Event cameras are bio-inspired sensors that differ from conventional frame cameras: Instead of capturing images at a fixed rate, they asynchronously measure per-pixel brightness changes, and output a stream of events that encode the time, location and sign of the brightness changes. Event cameras offer attractive properties compared to traditional cameras: high temporal resolution (in the order of… ▽ More

    Submitted 8 August, 2020; v1 submitted 17 April, 2019; originally announced April 2019.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020

  32. arXiv:1904.08298  [pdf, other

    cs.CV

    Events-to-Video: Bringing Modern Computer Vision to Event Cameras

    Authors: Henri Rebecq, René Ranftl, Vladlen Koltun, Davide Scaramuzza

    Abstract: Event cameras are novel sensors that report brightness changes in the form of asynchronous "events" instead of intensity frames. They have significant advantages over conventional cameras: high temporal resolution, high dynamic range, and no motion blur. Since the output of event cameras is fundamentally different from conventional cameras, it is commonly accepted that they require the development… ▽ More

    Submitted 17 April, 2019; originally announced April 2019.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, 2019

  33. arXiv:1904.08245  [pdf, other

    cs.CV

    End-to-End Learning of Representations for Asynchronous Event-Based Data

    Authors: Daniel Gehrig, Antonio Loquercio, Konstantinos G. Derpanis, Davide Scaramuzza

    Abstract: Event cameras are vision sensors that record asynchronous streams of per-pixel brightness changes, referred to as "events". They have appealing advantages over frame-based cameras for computer vision, including high temporal resolution, high dynamic range, and no motion blur. Due to the sparse, non-uniform spatiotemporal layout of the event signal, pattern recognition algorithms typically aggregat… ▽ More

    Submitted 20 August, 2019; v1 submitted 17 April, 2019; originally announced April 2019.

    Comments: To appear at ICCV 2019

  34. arXiv:1904.07235  [pdf, other

    cs.CV cs.LG cs.RO

    Focus Is All You Need: Loss Functions For Event-based Vision

    Authors: Guillermo Gallego, Mathias Gehrig, Davide Scaramuzza

    Abstract: Event cameras are novel vision sensors that output pixel-level brightness changes ("events") instead of traditional video frames. These asynchronous sensors offer several advantages over traditional cameras, such as, high temporal resolution, very high dynamic range, and no motion blur. To unlock the potential of such sensors, motion compensation methods have been recently proposed. We present a c… ▽ More

    Submitted 15 April, 2019; originally announced April 2019.

    Comments: 29 pages, 19 figures, 4 tables

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, 2019

  35. Event-Based Motion Segmentation by Motion Compensation

    Authors: Timo Stoffregen, Guillermo Gallego, Tom Drummond, Lindsay Kleeman, Davide Scaramuzza

    Abstract: In contrast to traditional cameras, whose pixels have a common exposure time, event-based cameras are novel bio-inspired sensors whose pixels work independently and asynchronously output intensity changes (called "events"), with microsecond resolution. Since events are caused by the apparent motion of objects, event-based cameras sample visual information based on the scene dynamics and are, there… ▽ More

    Submitted 22 August, 2019; v1 submitted 2 April, 2019; originally announced April 2019.

    Comments: When viewed in Acrobat Reader, several of the figures animate. Video: https://youtu.be/0q6ap_OSBAk

    Journal ref: IEEE International Conference on Computer Vision 2019

  36. arXiv:1901.03360  [pdf, other

    cs.CV

    Unsupervised Moving Object Detection via Contextual Information Separation

    Authors: Yanchao Yang, Antonio Loquercio, Davide Scaramuzza, Stefano Soatto

    Abstract: We propose an adversarial contextual model for detecting moving objects in images. A deep neural network is trained to predict the optical flow in a region using information from everywhere else but that region (context), while another network attempts to make such context as uninformative as possible. The result is a model where hypotheses naturally compete with no need for explicit regularizatio… ▽ More

    Submitted 14 April, 2019; v1 submitted 10 January, 2019; originally announced January 2019.

  37. arXiv:1811.10681  [pdf, other

    cs.CV

    Matching Features without Descriptors: Implicitly Matched Interest Points

    Authors: Titus Cieslewski, Michael Bloesch, Davide Scaramuzza

    Abstract: The extraction and matching of interest points is a prerequisite for many geometric computer vision problems. Traditionally, matching has been achieved by assigning descriptors to interest points and matching points that have similar descriptors. In this paper, we propose a method by which interest points are instead already implicitly matched at detection time. With this, descriptors do not need… ▽ More

    Submitted 5 August, 2019; v1 submitted 26 November, 2018; originally announced November 2018.

    Comments: 10 pages without references, accepted for publication at the British Machine Vision Conference (BMVC), Cardiff, 2019. v2 contains additional results, and a bug in the evaluation of LF-NET has been fixed

    Journal ref: British Machine Vision Conference (BMVC), Cardiff, 2019

  38. arXiv:1810.06224  [pdf, other

    cs.RO

    Beauty and the Beast: Optimal Methods Meet Learning for Drone Racing

    Authors: Elia Kaufmann, Mathias Gehrig, Philipp Foehn, René Ranftl, Alexey Dosovitskiy, Vladlen Koltun, Davide Scaramuzza

    Abstract: Autonomous micro aerial vehicles still struggle with fast and agile maneuvers, dynamic environments, imperfect sensing, and state estimation drift. Autonomous drone racing brings these challenges to the fore. Human pilots can fly a previously unseen track after a handful of practice runs. In contrast, state-of-the-art autonomous navigation algorithms require either a precise metric map of the envi… ▽ More

    Submitted 1 March, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: 6 pages (+1 references)

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2019

  39. Asynchronous, Photometric Feature Tracking using Events and Frames

    Authors: Daniel Gehrig, Henri Rebecq, Guillermo Gallego, Davide Scaramuzza

    Abstract: We present a method that leverages the complementarity of event cameras and standard cameras to track visual features with low-latency. Event cameras are novel sensors that output pixel-level brightness changes, called "events". They offer significant advantages over standard cameras, namely a very high dynamic range, no motion blur, and a latency in the order of microseconds. However, because the… ▽ More

    Submitted 25 July, 2018; originally announced July 2018.

    Comments: 22 pages, 15 figures, Video: https://youtu.be/A7UfeUnG6c4

    Journal ref: European Conference on Computer Vision (ECCV), Munich, 2018

  40. Semi-Dense 3D Reconstruction with a Stereo Event Camera

    Authors: Yi Zhou, Guillermo Gallego, Henri Rebecq, Laurent Kneip, Hongdong Li, Davide Scaramuzza

    Abstract: Event cameras are bio-inspired sensors that offer several advantages, such as low latency, high-speed and high dynamic range, to tackle challenging scenarios in computer vision. This paper presents a solution to the problem of 3D reconstruction from data captured by a stereo event-camera rig moving in a static scene, such as in the context of stereo Simultaneous Localization and Map**. The propo… ▽ More

    Submitted 19 July, 2018; originally announced July 2018.

    Comments: 19 pages, 8 figures, Video: https://youtu.be/Qrnpj2FD1e4

    Journal ref: European Conference on Computer Vision (ECCV), Munich, 2018

  41. arXiv:1806.08548  [pdf, other

    cs.RO

    Deep Drone Racing: Learning Agile Flight in Dynamic Environments

    Authors: Elia Kaufmann, Antonio Loquercio, Rene Ranftl, Alexey Dosovitskiy, Vladlen Koltun, Davide Scaramuzza

    Abstract: Autonomous agile flight brings up fundamental challenges in robotics, such as co** with unreliable state estimation, reacting optimally to dynamically changing environments, and coupling perception and action in real time under severe resource constraints. In this paper, we consider these challenges in the context of autonomous, vision-based drone racing in dynamic environments. Our approach com… ▽ More

    Submitted 9 October, 2018; v1 submitted 22 June, 2018; originally announced June 2018.

    Comments: Accepted for publication in the Conference on Robotic Learning (CoRL) 2018, Zurich. 10 pages (+3 supplementary)

    Journal ref: Conference on Robotic Learning (CoRL), 2018

  42. arXiv:1805.01831  [pdf, other

    cs.RO cs.AI cs.NE eess.SP

    A 64mW DNN-based Visual Navigation Engine for Autonomous Nano-Drones

    Authors: Daniele Palossi, Antonio Loquercio, Francesco Conti, Eric Flamand, Davide Scaramuzza, Luca Benini

    Abstract: Fully-autonomous miniaturized robots (e.g., drones), with artificial intelligence (AI) based visual navigation capabilities are extremely challenging drivers of Internet-of-Things edge intelligence capabilities. Visual navigation based on AI approaches, such as deep neural networks (DNNs) are becoming pervasive for standard-size drones, but are considered out of reach for nanodrones with size of a… ▽ More

    Submitted 14 May, 2019; v1 submitted 4 May, 2018; originally announced May 2018.

    Comments: 15 pages, 13 figures, 5 tables, 2 listings, accepted for publication in the IEEE Internet of Things Journal (IEEE IOTJ)

  43. arXiv:1805.01358  [pdf, other

    cs.CV

    SIPs: Succinct Interest Points from Unsupervised Inlierness Probability Learning

    Authors: Titus Cieslewski, Konstantinos G. Derpanis, Davide Scaramuzza

    Abstract: A wide range of computer vision algorithms rely on identifying sparse interest points in images and establishing correspondences between them. However, only a subset of the initially identified interest points results in true correspondences (inliers). In this paper, we seek a detector that finds the minimum number of points that are likely to result in an application-dependent "sufficient" number… ▽ More

    Submitted 19 August, 2019; v1 submitted 3 May, 2018; originally announced May 2018.

    Comments: 8 pages, 2p references, 1p supplementary material. Accepted for publication at the IEEE International Conference on 3D Vision (3DV), Québec City, 2019. v2 contains significant changes VS v1

    Journal ref: IEEE International Conference on 3D Vision (3DV), Québec City, 2019

  44. arXiv:1804.04811  [pdf, other

    cs.RO

    PAMPC: Perception-Aware Model Predictive Control for Quadrotors

    Authors: Davide Falanga, Philipp Foehn, Peng Lu, Davide Scaramuzza

    Abstract: We present the first perception-aware model predictive control framework for quadrotors that unifies control and planning with respect to action and perception objectives. Our framework leverages numerical optimization to compute trajectories that satisfy the system dynamics and require control inputs within the limits of the platform. Simultaneously, it optimizes perception objectives for robust… ▽ More

    Submitted 10 July, 2018; v1 submitted 13 April, 2018; originally announced April 2018.

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, 2018

  45. arXiv:1804.01310  [pdf, other

    cs.CV cs.LG cs.RO

    Event-based Vision meets Deep Learning on Steering Prediction for Self-driving Cars

    Authors: Ana I. Maqueda, Antonio Loquercio, Guillermo Gallego, Narciso Garcia, Davide Scaramuzza

    Abstract: Event cameras are bio-inspired vision sensors that naturally capture the dynamics of a scene, filtering out redundant information. This paper presents a deep neural network approach that unlocks the potential of event cameras on a challenging motion-estimation task: prediction of a vehicle's steering angle. To make the best out of this sensor-algorithm combination, we adapt state-of-the-art convol… ▽ More

    Submitted 4 April, 2018; originally announced April 2018.

    Comments: 9 pages, 8 figures, 6 tables. Video: https://youtu.be/_r_bsjkJTHA

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, 2018

  46. A Unifying Contrast Maximization Framework for Event Cameras, with Applications to Motion, Depth, and Optical Flow Estimation

    Authors: Guillermo Gallego, Henri Rebecq, Davide Scaramuzza

    Abstract: We present a unifying framework to solve several computer vision problems with event cameras: motion, depth and optical flow estimation. The main idea of our framework is to find the point trajectories on the image plane that are best aligned with the event data by maximizing an objective function: the contrast of an image of warped events. Our method implicitly handles data association between th… ▽ More

    Submitted 4 April, 2018; originally announced April 2018.

    Comments: 16 pages, 16 figures. Video: https://youtu.be/KFMZFhi-9Aw

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, 2018

  47. arXiv:1801.02302  [pdf, other

    cs.RO

    A Real-Time Game Theoretic Planner for Autonomous Two-Player Drone Racing

    Authors: Riccardo Spica, Davide Falanga, Eric Cristofalo, Eduardo Montijano, Davide Scaramuzza, Mac Schwager

    Abstract: To be successful in multi-player drone racing, a player must not only follow the race track in an optimal way, but also compete with other drones through strategic blocking, faking, and opportunistic passing while avoiding collisions. Since unveiling one's own strategy to the adversaries is not desirable, this requires each player to independently predict the other players' future actions. Nash eq… ▽ More

    Submitted 26 January, 2018; v1 submitted 7 January, 2018; originally announced January 2018.

  48. Differential Flatness of Quadrotor Dynamics Subject to Rotor Drag for Accurate Tracking of High-Speed Trajectories

    Authors: Matthias Faessler, Antonio Franchi, Davide Scaramuzza

    Abstract: In this paper, we prove that the dynamical model of a quadrotor subject to linear rotor drag effects is differentially flat in its position and heading. We use this property to compute feed-forward control terms directly from a reference trajectory to be tracked. The obtained feed-forward terms are then used in a cascaded, nonlinear feedback control law that enables accurate agile flight with quad… ▽ More

    Submitted 28 March, 2018; v1 submitted 6 December, 2017; originally announced December 2017.

    Journal ref: Robot.Autom.Lett. 3 (2018) 620-626

  49. Fast, Autonomous Flight in GPS-Denied and Cluttered Environments

    Authors: Kartik Mohta, Michael Watterson, Yash Mulgaonkar, Sikang Liu, Chao Qu, Anurag Makineni, Kelsey Saulnier, Ke Sun, Alex Zhu, Jeffrey Delmerico, Konstantinos Karydis, Nikolay Atanasov, Giuseppe Loianno, Davide Scaramuzza, Kostas Daniilidis, Camillo Jose Taylor, Vijay Kumar

    Abstract: One of the most challenging tasks for a flying robot is to autonomously navigate between target locations quickly and reliably while avoiding obstacles in its path, and with little to no a-priori knowledge of the operating environment. This challenge is addressed in the present paper. We describe the system design and software architecture of our proposed solution, and showcase how all the distinc… ▽ More

    Submitted 6 December, 2017; originally announced December 2017.

    Comments: Pre-peer reviewed version of the article accepted in Journal of Field Robotics

  50. arXiv:1710.05772  [pdf, other

    cs.RO

    Data-Efficient Decentralized Visual SLAM

    Authors: Titus Cieslewski, Siddharth Choudhary, Davide Scaramuzza

    Abstract: Decentralized visual simultaneous localization and map** (SLAM) is a powerful tool for multi-robot applications in environments where absolute positioning systems are not available. Being visual, it relies on cameras, cheap, lightweight and versatile sensors, and being decentralized, it does not rely on communication to a central ground station. In this work, we integrate state-of-the-art decent… ▽ More

    Submitted 16 October, 2017; originally announced October 2017.

    Comments: 8 pages, submitted to ICRA 2018

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2018