Showing 1–50 of 164 results for author: Scaramuzza, D

  1. arXiv:2407.01568  [pdf, other]

    cs.RO

    Agile Robotics: Optimal Control, Reinforcement Learning, and Differentiable Simulation

    Authors: Yunlong Song, Davide Scaramuzza

    Abstract: Control systems are at the core of every real-world robot. They are deployed in an ever-increasing number of applications, ranging from autonomous racing and search-and-rescue missions to industrial inspections and space exploration. To achieve peak performance, certain tasks require pushing the robot to its maximum agility. How can we design control algorithms that enhance the agility of autonomo… ▽ More

    Submitted 25 May, 2024; originally announced July 2024.

    Comments: This abstract has been accepted for the Robotics: Science and Systems (RSS) Pioneers Workshop, 2024

  2. arXiv:2406.12505  [pdf, other]

    cs.RO

    Demonstrating Agile Flight from Pixels without State Estimation

    Authors: Ismail Geles, Leonard Bauersfeld, Angel Romero, Jiaxu Xing, Davide Scaramuzza

    Abstract: Quadrotors are among the most agile flying robots. Despite recent advances in learning-based control and computer vision, autonomous drones still rely on explicit state estimation. On the other hand, human pilots only rely on a first-person-view video stream from the drone onboard camera to push the platform to its limits and fly robustly in unseen environments. To the best of our knowledge, we pr… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Journal ref: Robotics: Science and Systems (RSS), 2024

  3. arXiv:2405.16674  [pdf, other]

    cs.LG cs.CC cs.LO

    Limits of Deep Learning: Sequence Modeling through the Lens of Complexity Theory

    Authors: Nikola Zubić, Federico Soldá, Aurelio Sulser, Davide Scaramuzza

    Abstract: Deep learning models have achieved significant success across various applications but continue to struggle with tasks requiring complex reasoning over sequences, such as function composition and compositional tasks. Despite advancements, models like Structured State Space Models (SSMs) and Transformers underperform in deep compositionality tasks due to inherent architectural and training limitati… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 23 pages, 17 figures, 4 tables

  4. arXiv:2404.11511  [pdf, other]

    eess.IV cs.CV

    Event Cameras Meet SPADs for High-Speed, Low-Bandwidth Imaging

    Authors: Manasi Muglikar, Siddharth Somasundaram, Akshat Dave, Edoardo Charbon, Ramesh Raskar, Davide Scaramuzza

    Abstract: Traditional cameras face a trade-off between low-light performance and high-speed imaging: longer exposure times to capture sufficient light results in motion blur, whereas shorter exposures result in Poisson-corrupted noisy images. While burst photography techniques help mitigate this tradeoff, conventional cameras are fundamentally limited in their sensor noise characteristics. Event cameras and… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  5. arXiv:2404.09765  [pdf, other]

    cs.RO eess.IV

    Hilti SLAM Challenge 2023: Benchmarking Single + Multi-session SLAM across Sensor Constellations in Construction

    Authors: Ashish Devadas Nair, Julien Kindle, Plamen Levchev, Davide Scaramuzza

    Abstract: Simultaneous Localization and Mapping systems are a key enabler for positioning in both handheld and robotic applications. The Hilti SLAM Challenges organized over the past years have been successful at benchmarking some of the world's best SLAM Systems with high accuracy. However, more capabilities of these systems are yet to be explored, such as platform agnosticism across varying sensor suites… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  6. arXiv:2404.01887   

    cs.CV

    3D scene generation from scene graphs and self-attention

    Authors: Pietro Bonazzi, Mengqi Wang, Diego Martin Arroyo, Fabian Manhardt, Nico Messikomer, Federico Tombari, Davide Scaramuzza

    Abstract: Synthesizing realistic and diverse indoor 3D scene layouts in a controllable fashion opens up applications in simulated navigation and virtual reality. As concise and robust representations of a scene, scene graphs have proven to be well-suited as the semantic control on the generated layout. We present a variant of the conditional variational autoencoder (cVAE) model to synthesize 3D scenes from… ▽ More

    Submitted 23 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Some authors were not timely informed of the submission

  7. arXiv:2404.01112   

    cs.CV cs.CG

    Few-shot point cloud reconstruction and denoising via learned Guassian splats renderings and fine-tuned diffusion features

    Authors: Pietro Bonazzi, Marie-Julie Rakatosaona, Marco Cannici, Federico Tombari, Davide Scaramuzza

    Abstract: Existing deep learning methods for the reconstruction and denoising of point clouds rely on small datasets of 3D shapes. We circumvent the problem by leveraging deep learning methods trained on billions of images. We propose a method to reconstruct point clouds from few images and to denoise point clouds from their rendering by exploiting prior knowledge distilled from image-based deep learning mo… ▽ More

    Submitted 23 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: An author was not timely informed before the released submission

  8. arXiv:2404.00842  [pdf, other]

    cs.CV

    An N-Point Linear Solver for Line and Motion Estimation with Event Cameras

    Authors: Ling Gao, Daniel Gehrig, Hang Su, Davide Scaramuzza, Laurent Kneip

    Abstract: Event cameras respond primarily to edges--formed by strong gradients--and are thus particularly well-suited for line-based motion estimation. Recent work has shown that events generated by a single line each satisfy a polynomial constraint which describes a manifold in the space-time volume. Multiple such constraints can be solved simultaneously to recover the partial linear velocity and line para… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Journal ref: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  9. arXiv:2403.19780  [pdf, other]

    cs.CV

    Mitigating Motion Blur in Neural Radiance Fields with Events and Frames

    Authors: Marco Cannici, Davide Scaramuzza

    Abstract: Neural Radiance Fields (NeRFs) have shown great potential in novel view synthesis. However, they struggle to render sharp images when the data used for training is affected by motion blur. On the other hand, event cameras excel in dynamic scenes as they measure brightness changes with microsecond resolution and are thus only marginally affected by blur. Recent methods attempt to enhance NeRF recon… ▽ More

    Submitted 3 June, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  10. arXiv:2403.17551  [pdf, other]

    cs.RO

    MPCC++: Model Predictive Contouring Control for Time-Optimal Flight with Safety Constraints

    Authors: Maria Krinner, Angel Romero, Leonard Bauersfeld, Melanie Zeilinger, Andrea Carron, Davide Scaramuzza

    Abstract: Quadrotor flight is an extremely challenging problem due to the limited control authority encountered at the limit of handling. Model Predictive Contouring Control (MPCC) has emerged as a promising model-based approach for time optimization problems such as drone racing. However, the standard MPCC formulation used in quadrotor racing introduces the notion of the gates directly in the cost function… ▽ More

    Submitted 14 June, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: 12 pages, 6 figures

    Journal ref: Robotics: Science and Systems (RSS), 2024

  11. arXiv:2403.14864  [pdf, other]

    cs.RO cs.AI

    Learning Quadruped Locomotion Using Differentiable Simulation

    Authors: Yunlong Song, Sangbae Kim, Davide Scaramuzza

    Abstract: While most recent advancements in legged robot control have been driven by model-free reinforcement learning, we explore the potential of differentiable simulation. Differentiable simulation promises faster convergence and more stable training by computing low-variant first-order gradients using the robot model, but so far, its use for legged robot control has remained limited to simulation. The m… ▽ More

    Submitted 27 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

  12. arXiv:2403.13321  [pdf, other]

    cs.RO

    Robotics meets Fluid Dynamics: A Characterization of the Induced Airflow around a Quadrotor

    Authors: Leonard Bauersfeld, Koen Muller, Dominic Ziegler, Filippo Coletti, Davide Scaramuzza

    Abstract: The widespread adoption of quadrotors for diverse applications, from agriculture to public safety, necessitates an understanding of the aerodynamic disturbances they create. This paper introduces a computationally lightweight model for estimating the time-averaged magnitude of the induced flow below quadrotors in hover. Unlike related approaches that rely on expensive computational fluid dynamics… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 7+1 pages

  13. arXiv:2403.12203  [pdf, other]

    cs.RO cs.CV cs.LG

    Bootstrapping Reinforcement Learning with Imitation for Vision-Based Agile Flight

    Authors: Jiaxu Xing, Angel Romero, Leonard Bauersfeld, Davide Scaramuzza

    Abstract: We combine the effectiveness of Reinforcement Learning (RL) and the efficiency of Imitation Learning (IL) in the context of vision-based, autonomous drone racing. We focus on directly processing visual input without explicit state estimation. While RL offers a general framework for learning complex controllers through trial and error, it faces challenges regarding sample efficiency and computation… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  14. arXiv:2402.15584  [pdf, other]

    cs.CV cs.LG

    State Space Models for Event Cameras

    Authors: Nikola Zubić, Mathias Gehrig, Davide Scaramuzza

    Abstract: Today, state-of-the-art deep neural networks that process event-camera data first convert a temporal window of events into dense, grid-like input representations. As such, they exhibit poor generalizability when deployed at higher inference frequencies (i.e., smaller temporal windows) than the ones they were trained on. We address this challenge by introducing state-space models (SSMs) with learna… ▽ More

    Submitted 18 April, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: 18 pages, 5 figures, 6 tables, CVPR 2024 Camera Ready paper

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 2024

  15. arXiv:2401.02343  [pdf, other]

    cs.RO

    AERIAL-CORE: AI-Powered Aerial Robots for Inspection and Maintenance of Electrical Power Infrastructures

    Authors: Anibal Ollero, Alejandro Suarez, Christos Papaioannidis, Ioannis Pitas, Juan M. Marredo, Viet Duong, Emad Ebeid, Vit Kratky, Martin Saska, Chloe Hanoune, Amr Afifi, Antonio Franchi, Charalampos Vourtsis, Dario Floreano, Goran Vasiljevic, Stjepan Bogdan, Alvaro Caballero, Fabio Ruggiero, Vincenzo Lippiello, Carlos Matilla, Giovanni Cioffi, Davide Scaramuzza, Jose R. Martinez-de-Dios, Begona C. Arrue, Carlos Martin , et al. (5 additional authors not shown)

    Abstract: Large-scale infrastructures are prone to deterioration due to age, environmental influences, and heavy usage. Ensuring their safety through regular inspections and maintenance is crucial to prevent incidents that can significantly affect public safety and the environment. This is especially pertinent in the context of electrical power networks, which, while essential for energy provision, can also… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  16. arXiv:2310.11659  [pdf, other]

    cs.RO

    Flymation: Interactive Animation for Flying Robots

    Authors: Yunlong Song, Davide Scaramuzza

    Abstract: Trajectory visualization and animation play critical roles in robotics research. However, existing data visualization and animation tools often lack flexibility, scalability, and versatility, resulting in limited capability to fully explore and analyze flight data. To address this limitation, we introduce Flymation, a new flight trajectory visualization and animation tool. Built on the Unity3D eng… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: This work was presented at Workshop at ICRA 2023 (The Role of Robotics Simulators for Unmanned Aerial Vehicles)

  17. Reaching the Limit in Autonomous Racing: Optimal Control versus Reinforcement Learning

    Authors: Yunlong Song, Angel Romero, Matthias Mueller, Vladlen Koltun, Davide Scaramuzza

    Abstract: A central question in robotics is how to design a control system for an agile mobile robot. This paper studies this question systematically, focusing on a challenging setting: autonomous drone racing. We show that a neural network controller trained with reinforcement learning (RL) outperformed optimal control (OC) methods in this setting. We then investigated which fundamental factors have contri… ▽ More

    Submitted 18 October, 2023; v1 submitted 16 October, 2023; originally announced October 2023.

    Journal ref: Science Robotics, 2023

  18. A 5-Point Minimal Solver for Event Camera Relative Motion Estimation

    Authors: Ling Gao, Hang Su, Daniel Gehrig, Marco Cannici, Davide Scaramuzza, Laurent Kneip

    Abstract: Event-based cameras are ideal for line-based motion estimation, since they predominantly respond to edges in the scene. However, accurately determining the camera displacement based on events continues to be an open problem. This is because line feature extraction and dynamics estimation are tightly coupled when using event cameras, and no precise model is currently available for describing the co… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

    Journal ref: IEEE/CVF International Conference on Computer Vision (ICCV), 2023

  19. arXiv:2309.12784  [pdf, other]

    cs.RO

    Learning to Walk and Fly with Adversarial Motion Priors

    Authors: Giuseppe L'Erario, Drew Hanover, Angel Romero, Yunlong Song, Gabriele Nava, Paolo Maria Viceconte, Daniele Pucci, Davide Scaramuzza

    Abstract: Robot multimodal locomotion encompasses the ability to transition between walking and flying, representing a significant challenge in robotics. This work presents an approach that enables automatic smooth transitions between legged and aerial locomotion. Leveraging the concept of Adversarial Motion Priors, our method allows the robot to imitate motion datasets and accomplish the desired task witho… ▽ More

    Submitted 29 March, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: 8 pages, 8 figures, submitted to IROS 2024

  20. arXiv:2309.09947  [pdf, other]

    cs.CV

    End-to-end Learned Visual Odometry with Events and Frames

    Authors: Roberto Pellerito, Marco Cannici, Daniel Gehrig, Joris Belhadj, Olivier Dubois-Matra, Massimo Casasco, Davide Scaramuzza

    Abstract: Visual Odometry (VO) is crucial for autonomous robotic navigation, especially in GPS-denied environments like planetary terrains. To improve robustness, recent model-based VO systems have begun combining standard and event-based cameras. Event cameras excel in low-light and high-speed motion, while standard cameras provide dense and easier-to-track features, even in low-textured areas. However, th… ▽ More

    Submitted 20 March, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 8 pages, 5 figures, 4 tables

  21. arXiv:2309.09865  [pdf, other]

    cs.RO cs.CV

    Contrastive Learning for Enhancing Robust Scene Transfer in Vision-based Agile Flight

    Authors: Jiaxu Xing, Leonard Bauersfeld, Yunlong Song, Chunwei Xing, Davide Scaramuzza

    Abstract: Scene transfer for vision-based mobile robotics applications is a highly relevant and challenging problem. The utility of a robot greatly depends on its ability to perform a task in the real world, outside of a well-controlled lab environment. Existing scene transfer end-to-end policy learning approaches often suffer from poor sample efficiency or limited generalization capabilities, making them u… ▽ More

    Submitted 29 February, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2024

  22. arXiv:2309.09752  [pdf, other]

    cs.LG

    Contrastive Initial State Buffer for Reinforcement Learning

    Authors: Nico Messikommer, Yunlong Song, Davide Scaramuzza

    Abstract: In Reinforcement Learning, the trade-off between exploration and exploitation poses a complex challenge for achieving efficient learning from limited samples. While recent works have been effective in leveraging past experiences for policy updates, they often overlook the potential of reusing past experiences for data collection. Independent of the underlying RL algorithm, we introduce the concept… ▽ More

    Submitted 26 February, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Journal ref: IEEE Conference on Robotics and Automation (ICRA 2024)

  23. arXiv:2307.15829  [pdf, other]

    cs.CV

    Seeing Behind Dynamic Occlusions with Event Cameras

    Authors: Rong Zou, Manasi Muglikar, Nico Messikommer, Davide Scaramuzza

    Abstract: Unwanted camera occlusions, such as debris, dust, rain-drops, and snow, can severely degrade the performance of computer-vision systems. Dynamic occlusions are particularly challenging because of the continuously changing pattern. Existing occlusion-removal methods currently use synthetic aperture imaging or image inpainting. However, they face issues with dynamic occlusions as these require multi… ▽ More

    Submitted 1 August, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

  24. Agilicious: Open-Source and Open-Hardware Agile Quadrotor for Vision-Based Flight

    Authors: Philipp Foehn, Elia Kaufmann, Angel Romero, Robert Penicka, Sihao Sun, Leonard Bauersfeld, Thomas Laengle, Giovanni Cioffi, Yunlong Song, Antonio Loquercio, Davide Scaramuzza

    Abstract: Autonomous, agile quadrotor flight raises fundamental challenges for robotics research in terms of perception, planning, learning, and control. A versatile and standardized platform is needed to accelerate research and let practitioners focus on the core problems. To this end, we present Agilicious, a co-designed hardware and software framework tailored to autonomous, agile quadrotor flight. It is… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: 14 pages, 5 figures, 2 tables

    Journal ref: Science Robotics Vol. 7, Issue 67, 2022

  25. arXiv:2306.11429  [pdf, other]

    cs.RO

    HDVIO: Improving Localization and Disturbance Estimation with Hybrid Dynamics VIO

    Authors: Giovanni Cioffi, Leonard Bauersfeld, Davide Scaramuzza

    Abstract: Visual-inertial odometry (VIO) is the most common approach for estimating the state of autonomous micro aerial vehicles using only onboard sensors. Existing methods improve VIO performance by including a dynamics model in the estimation pipeline. However, such methods degrade in the presence of low-fidelity vehicle models and continuous external disturbances, such as wind. Our proposed method, HDV… ▽ More

    Submitted 28 June, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Journal ref: Robotics: Science and Systems (RSS) 2023

  26. arXiv:2306.09852  [pdf, other]

    cs.RO

    Actor-Critic Model Predictive Control

    Authors: Angel Romero, Yunlong Song, Davide Scaramuzza

    Abstract: An open research question in robotics is how to combine the benefits of model-free reinforcement learning (RL) - known for its strong task performance and flexibility in optimizing general reward formulations - with the robustness and online replanning capabilities of model predictive control (MPC). This paper provides an answer by introducing a new framework called Actor-Critic Model Predictive C… ▽ More

    Submitted 12 April, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: 6 pages, 5 figures

    Journal ref: IEEE Conference on Robotics and Automation (ICRA 2024)

  27. arXiv:2306.09078  [pdf, other]

    cs.CV

    E-Calib: A Fast, Robust and Accurate Calibration Toolbox for Event Cameras

    Authors: Mohammed Salah, Abdulla Ayyad, Muhammad Humais, Daniel Gehrig, Abdelqader Abusafieh, Lakmal Seneviratne, Davide Scaramuzza, Yahya Zweiri

    Abstract: Event cameras triggered a paradigm shift in the computer vision community delineated by their asynchronous nature, low latency, and high dynamic range. Calibration of event cameras is always essential to account for the sensor intrinsic parameters and for 3D perception. However, conventional image-based calibration techniques are not applicable due to the asynchronous, binary output of the sensor.… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 13 pages, 6 tables, 15 figures

  28. arXiv:2306.07050  [pdf, other]

    cs.CV

    Revisiting Token Pruning for Object Detection and Instance Segmentation

    Authors: Yifei Liu, Mathias Gehrig, Nico Messikommer, Marco Cannici, Davide Scaramuzza

    Abstract: Vision Transformers (ViTs) have shown impressive performance in computer vision, but their high computational cost, quadratic in the number of tokens, limits their adoption in computation-constrained applications. However, this large number of tokens may not be necessary, as not all tokens are equally important. In this paper, we investigate token pruning to accelerate inference for object detecti… ▽ More

    Submitted 12 December, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Journal ref: IEEE Winter Conference on Applications of Computer Vision (WACV 2024)

  29. arXiv:2304.13455  [pdf, other]

    cs.CV cs.LG

    From Chaos Comes Order: Ordering Event Representations for Object Recognition and Detection

    Authors: Nikola Zubić, Daniel Gehrig, Mathias Gehrig, Davide Scaramuzza

    Abstract: Today, state-of-the-art deep neural networks that process events first convert them into dense, grid-like input representations before using an off-the-shelf network. However, selecting the appropriate representation for the task traditionally requires training a neural network for each representation and selecting the best one based on the validation score, which is very time-consuming. This work… ▽ More

    Submitted 30 August, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: 15 pages, 11 figures, 2 tables, ICCV 2023 Camera Ready paper

  30. Microgravity induces overconfidence in perceptual decision-making

    Authors: Leyla Loued-Khenissi, Christian Pfeiffer, Rupal Saxena, Shivam Adarsh, Davide Scaramuzza

    Abstract: Does gravity affect decision-making? This question comes into sharp focus as plans for interplanetary human space missions solidify. In the framework of Bayesian brain theories, gravity encapsulates a strong prior, anchoring agents to a reference frame via the vestibular system, informing their decisions and possibly their integration of uncertainty. What happens when such a strong prior is altere… ▽ More

    Submitted 22 June, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: 12 pages, 10 figures

    Journal ref: Nature Scientific Reports 13, 9727 (2023)

  31. arXiv:2304.07139  [pdf, other]

    cs.CV cs.NE

    Neuromorphic Optical Flow and Real-time Implementation with Event Cameras

    Authors: Yannick Schnider, Stanislaw Wozniak, Mathias Gehrig, Jules Lecomte, Axel von Arnim, Luca Benini, Davide Scaramuzza, Angeliki Pantazi

    Abstract: Optical flow provides information on relative motion that is an important component in many computer vision pipelines. Neural networks provide high accuracy optical flow, yet their complexity is often prohibitive for application at the edge or in robots, where efficiency and latency play crucial role. To address this challenge, we build on the latest developments in event-based vision and spiking… ▽ More

    Submitted 12 July, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: Accepted for IEEE CVPRW, Vancouver 2023. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media. Copyright 2023 IEEE

  32. arXiv:2304.04128  [pdf, other]

    cs.RO

    Learning Agile, Vision-based Drone Flight: from Simulation to Reality

    Authors: Davide Scaramuzza, Elia Kaufmann

    Abstract: We present our latest research in learning deep sensorimotor policies for agile, vision-based quadrotor flight. We show methodologies for the successful transfer of such policies from simulation to the real world. In addition, we discuss the open research questions that still need to be answered to improve the agility and robustness of autonomous drones toward human-pilot performance.

    Submitted 8 April, 2023; originally announced April 2023.

  33. arXiv:2304.00959  [pdf, other]

    cs.RO

    Autonomous Power Line Inspection with Drones via Perception-Aware MPC

    Authors: Jiaxu Xing, Giovanni Cioffi, Javier Hidalgo-Carrió, Davide Scaramuzza

    Abstract: Drones have the potential to revolutionize power line inspection by increasing productivity, reducing inspection time, improving data quality, and eliminating the risks for human operators. Current state-of-the-art systems for power line inspection have two shortcomings: (i) control is decoupled from perception and needs accurate information about the location of the power lines and masts; (ii) ob… ▽ More

    Submitted 9 August, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots (IROS), Detroit, 2023

  34. arXiv:2303.17479  [pdf, other]

    cs.RO

    Event-based Agile Object Catching with a Quadrupedal Robot

    Authors: Benedek Forrai, Takahiro Miki, Daniel Gehrig, Marco Hutter, Davide Scaramuzza

    Abstract: Quadrupedal robots are conquering various indoor and outdoor applications due to their ability to navigate challenging uneven terrains. Exteroceptive information greatly enhances this capability since perceiving their surroundings allows them to adapt their controller and thus achieve higher levels of robustness. However, sensors such as LiDARs and RGB cameras do not provide sufficient information… ▽ More

    Submitted 6 April, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA) 2023, London

  35. arXiv:2303.14176  [pdf, other]

    cs.CV cs.AI

    A Hybrid ANN-SNN Architecture for Low-Power and Low-Latency Visual Perception

    Authors: Asude Aydin, Mathias Gehrig, Daniel Gehrig, Davide Scaramuzza

    Abstract: Spiking Neural Networks (SNN) are a class of bio-inspired neural networks that promise to bring low-power and low-latency inference to edge devices through asynchronous and sparse processing. However, being temporal models, SNNs depend heavily on expressive states to generate predictions on par with classical artificial neural networks (ANNs). These states converge only after long transient period… ▽ More

    Submitted 17 April, 2024; v1 submitted 24 March, 2023; originally announced March 2023.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, 2024

  36. COVERED, CollabOratiVE Robot Environment Dataset for 3D Semantic segmentation

    Authors: Charith Munasinghe, Fatemeh Mohammadi Amin, Davide Scaramuzza, Hans Wernher van de Venn

    Abstract: Safe human-robot collaboration (HRC) has recently gained a lot of interest with the emerging Industry 5.0 paradigm. Conventional robots are being replaced with more intelligent and flexible collaborative robots (cobots). Safe and efficient collaboration between cobots and humans largely relies on the cobot's comprehensive semantic understanding of the dynamic surrounding of industrial environments… ▽ More

    Submitted 4 April, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Journal ref: IEEE Conference on Emerging Technologies and Factory Automation (ETFA 2022)

  37. Improving safety in physical human-robot collaboration via deep metric learning

    Authors: Maryam Rezayati, Grammatiki Zanni, Ying Zaoshi, Davide Scaramuzza, Hans Wernher van de Venn

    Abstract: Direct physical interaction with robots is becoming increasingly important in flexible production scenarios, but robots without protective fences also pose a greater risk to the operator. In order to keep the risk potential low, relatively simple measures are prescribed for operation, such as stopping the robot if there is physical contact or if a safety distance is violated. Although human injuri… ▽ More

    Submitted 13 April, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Journal ref: 2022 IEEE 27th International Conference on Emerging Technologies and Factory Automation (ETFA)

  38. arXiv:2301.06855  [pdf, other]

    cs.CV

    Event-based Shape from Polarization

    Authors: Manasi Muglikar, Leonard Bauersfeld, Diederik Paul Moeys, Davide Scaramuzza

    Abstract: State-of-the-art solutions for Shape-from-Polarization (SfP) suffer from a speed-resolution tradeoff: they either sacrifice the number of polarization angles measured or necessitate lengthy acquisition times due to framerate constraints, thus compromising either accuracy or latency. We tackle this tradeoff using event cameras. Event cameras operate at microseconds resolution with negligible motion… ▽ More

    Submitted 11 April, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

    Comments: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 2023

  39. Autonomous Drone Racing: A Survey

    Authors: Drew Hanover, Antonio Loquercio, Leonard Bauersfeld, Angel Romero, Robert Penicka, Yunlong Song, Giovanni Cioffi, Elia Kaufmann, Davide Scaramuzza

    Abstract: Over the last decade, the use of autonomous drone systems for surveying, search and rescue, or last-mile delivery has increased exponentially. With the rise of these applications comes the need for highly robust, safety-critical algorithms which can operate drones in complex and uncertain environments. Additionally, flying fast enables drones to cover more ground which in turn increases productivi… ▽ More

    Submitted 16 May, 2024; v1 submitted 4 January, 2023; originally announced January 2023.

    Comments: 26 pages, submitted to T-RO January 3rd, 2022; accepted to T-RO May 8th, 2024

    Journal ref: IEEE Transactions on Robotics (T-RO), 2024

  40. arXiv:2212.05598  [pdf, other]

    cs.CV

    Recurrent Vision Transformers for Object Detection with Event Cameras

    Authors: Mathias Gehrig, Davide Scaramuzza

    Abstract: We present Recurrent Vision Transformers (RVTs), a novel backbone for object detection with event cameras. Event cameras provide visual information with sub-millisecond latency at a high-dynamic range and with strong robustness against motion blur. These unique properties offer great potential for low-latency object detection and tracking in time-critical scenarios. Prior work in event-based visio… ▽ More

    Submitted 25 May, 2023; v1 submitted 11 December, 2022; originally announced December 2022.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 2023

  41. arXiv:2212.04745  [pdf, other]

    cs.CV

    SLAM for Visually Impaired People: a Survey

    Authors: Marziyeh Bamdad, Davide Scaramuzza, Alireza Darvishy

    Abstract: In recent decades, several assistive technologies have been developed to improve the ability of blind and visually impaired individuals to navigate independently and safely. At the same time, simultaneous localization and mapping (SLAM) techniques have become sufficiently robust and efficient to be adopted in developing these assistive technologies. We present the first systematic literature revie…

    Submitted 24 May, 2024; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: 45 pages, 38 tables, 6 figures

  42. arXiv:2211.12826  [pdf, other]

    cs.CV

    Data-driven Feature Tracking for Event Cameras

    Authors: Nico Messikommer, Carter Fang, Mathias Gehrig, Davide Scaramuzza

    Abstract: Because of their high temporal resolution, increased resilience to motion blur, and very sparse output, event cameras have been shown to be ideal for low-latency and low-bandwidth feature tracking, even in challenging scenarios. Existing feature tracking methods for event cameras are either handcrafted or derived from first principles but require extensive parameter tuning, are sensitive to noise,…

    Submitted 25 April, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, 2023

  43. arXiv:2211.12324  [pdf, other]

    cs.CV

    Pushing the Limits of Asynchronous Graph-based Object Detection with Event Cameras

    Authors: Daniel Gehrig, Davide Scaramuzza

    Abstract: State-of-the-art machine-learning methods for event cameras treat events as dense representations and process them with conventional deep neural networks. Thus, they fail to maintain the sparsity and asynchronous nature of event data, thereby imposing significant computation and latency constraints on downstream systems. A recent line of work tackles this issue by modeling events as spatiotemporal…

    Submitted 22 November, 2022; originally announced November 2022.

  44. arXiv:2211.12181  [pdf, ps, other]

    cs.RO

    User-Conditioned Neural Control Policies for Mobile Robotics

    Authors: Leonard Bauersfeld, Elia Kaufmann, Davide Scaramuzza

    Abstract: Recently, learning-based controllers have been shown to push mobile robotic systems to their limits and provide the robustness needed for many real-world applications. However, only classical optimization-based control frameworks offer the inherent flexibility to be dynamically adjusted during execution by, for example, setting target speeds or actuator limits. We present a framework to overcome t…

    Submitted 2 April, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: 6 pages + 1 pages references

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), London, 2023

  45. Cracking Double-Blind Review: Authorship Attribution with Deep Learning

    Authors: Leonard Bauersfeld, Angel Romero, Manasi Muglikar, Davide Scaramuzza

    Abstract: Double-blind peer review is considered a pillar of academic research because it is perceived to ensure a fair, unbiased, and fact-centered scientific discussion. Yet, experienced researchers can often correctly guess from which research group an anonymous submission originates, biasing the peer-review process. In this work, we present a transformer-based, neural-network architecture that only uses…

    Submitted 3 July, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: 13 pages + 3 pages references

    Journal ref: PLOS ONE 18(6): e0287611 (2023)

  46. arXiv:2210.15287  [pdf, other]

    cs.RO

    Learned Inertial Odometry for Autonomous Drone Racing

    Authors: Giovanni Cioffi, Leonard Bauersfeld, Elia Kaufmann, Davide Scaramuzza

    Abstract: Inertial odometry is an attractive solution to the problem of state estimation for agile quadrotor flight. It is inexpensive, lightweight, and it is not affected by perceptual degradation. However, only relying on the integration of the inertial measurements for state estimation is infeasible. The errors and time-varying biases present in such measurements cause the accumulation of large drift in…

    Submitted 28 February, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

    Journal ref: Robotics and Automation Letters (RA-L), 2023

  47. arXiv:2210.14985  [pdf, other]

    cs.RO cs.AI

    Learning Deep Sensorimotor Policies for Vision-based Autonomous Drone Racing

    Authors: Jiawei Fu, Yunlong Song, Yan Wu, Fisher Yu, Davide Scaramuzza

    Abstract: Autonomous drones can operate in remote and unstructured environments, enabling various real-world applications. However, the lack of effective vision-based algorithms has been a stumbling block to achieving this goal. Existing systems often require hand-engineered components for state estimation, planning, and control. Such a sequential design involves laborious tuning, human heuristics, and comp…

    Submitted 26 October, 2022; originally announced October 2022.

  48. arXiv:2210.11087  [pdf, other]

    cs.RO

    Weighted Maximum Likelihood for Controller Tuning

    Authors: Angel Romero, Shreedhar Govil, Gonca Yilmaz, Yunlong Song, Davide Scaramuzza

    Abstract: Recently, Model Predictive Contouring Control (MPCC) has arisen as the state-of-the-art approach for model-based agile flight. MPCC benefits from great flexibility in trading-off between progress maximization and path following at runtime without relying on globally optimized trajectories. However, finding the optimal set of tuning parameters for MPCC is challenging because (i) the full quadrotor…

    Submitted 2 March, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: 8 pages

    Journal ref: IEEE Conference on Robotics and Automation (ICRA 2023)

  49. arXiv:2210.01841  [pdf, other]

    cs.RO cs.AI

    Learning Perception-Aware Agile Flight in Cluttered Environments

    Authors: Yunlong Song, Kexin Shi, Robert Penicka, Davide Scaramuzza

    Abstract: Recently, neural control policies have outperformed existing model-based planning-and-control methods for autonomously navigating quadrotors through cluttered environments in minimum time. However, they are not perception aware, a crucial requirement in vision-based navigation due to the camera's limited field of view and the underactuated nature of a quadrotor. We propose a learning-based system…

    Submitted 3 March, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Journal ref: 2023 IEEE International Conference on Robotics and Automation (ICRA)

  50. arXiv:2209.13052  [pdf, other]

    cs.RO cs.AI

    Training Efficient Controllers via Analytic Policy Gradient

    Authors: Nina Wiedemann, Valentin Wüest, Antonio Loquercio, Matthias Müller, Dario Floreano, Davide Scaramuzza

    Abstract: Control design for robotic systems is complex and often requires solving an optimization to follow a trajectory accurately. Online optimization approaches like Model Predictive Control (MPC) have been shown to achieve great tracking performance, but require high computing power. Conversely, learning-based offline optimization approaches, such as Reinforcement Learning (RL), allow fast and efficien…

    Submitted 2 May, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

    Journal ref: IEEE Conference on Robotics and Automation (ICRA 2023)