-
EV-Catcher: High-Speed Object Catching Using Low-latency Event-based Neural Networks
Authors:
Ziyun Wang,
Fernando Cladera Ojeda,
Anthony Bisulco,
Daewon Lee,
Camillo J. Taylor,
Kostas Daniilidis,
M. Ani Hsieh,
Daniel D. Lee,
Volkan Isler
Abstract:
Event-based sensors have recently drawn increasing interest in robotic perception due to their lower latency, higher dynamic range, and lower bandwidth requirements compared to standard CMOS-based imagers. These properties make them ideal tools for real-time perception tasks in highly dynamic environments. In this work, we demonstrate an application where event cameras excel: accurately estimating the impact location of fast-moving objects. We introduce a lightweight event representation called Binary Event History Image (BEHI) to encode event data at low latency, as well as a learning-based approach that allows real-time inference of a confidence-enabled control signal to the robot. To validate our approach, we present an experimental catching system in which we catch fast-flying ping-pong balls. We show that the system is capable of achieving a success rate of 81% in catching balls targeted at different locations, with a velocity of up to 13 m/s even on compute-constrained embedded platforms such as the Nvidia Jetson NX.
Submitted 14 April, 2023;
originally announced April 2023.
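The abstract above names the Binary Event History Image (BEHI) but does not define it. The following is a minimal sketch under a stated assumption: a pixel is set to 1 if at least one event fired there during a time window, and polarity is ignored. The function name, the `(x, y, t)` event tuple, and the window parameters are hypothetical and are not taken from the paper.

```python
from typing import Iterable, List, Tuple

# Assumed event layout: (x, y, timestamp in seconds). Polarity is ignored here.
Event = Tuple[int, int, float]


def binary_event_history_image(
    events: Iterable[Event],
    width: int,
    height: int,
    t_start: float,
    t_end: float,
) -> List[List[int]]:
    """Return a height x width binary image with 1 where any event fired
    inside the closed time window [t_start, t_end]."""
    image = [[0] * width for _ in range(height)]
    for x, y, t in events:
        inside_window = t_start <= t <= t_end
        inside_sensor = 0 <= x < width and 0 <= y < height
        if inside_window and inside_sensor:
            image[y][x] = 1  # repeated events at one pixel do not accumulate
    return image


# Toy stream: two pixels fire early, one pixel fires after the window closes.
events = [(0, 0, 0.001), (1, 0, 0.002), (1, 0, 0.003), (3, 2, 0.010)]
behi = binary_event_history_image(events, width=4, height=3, t_start=0.0, t_end=0.005)
```

Because each pixel stores a single bit, the image size is independent of the event rate, which is one plausible reading of "lightweight" in the abstract.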
-
Fast Motion Understanding with Spatiotemporal Neural Networks and Dynamic Vision Sensors
Authors:
Anthony Bisulco,
Fernando Cladera Ojeda,
Volkan Isler,
Daniel D. Lee
Abstract:
This paper presents a Dynamic Vision Sensor (DVS) based system for reasoning about high-speed motion. As a representative scenario, we consider the case of a robot at rest reacting to a small, fast-approaching object at speeds higher than 15 m/s. Since conventional image sensors at typical frame rates observe such an object for only a few frames, estimating the underlying motion presents a considerable challenge for standard computer vision systems and algorithms. In this paper we present a method motivated by how animals such as insects solve this problem with their relatively simple vision systems.
Our solution takes the event stream from a DVS and first encodes the temporal events with a set of causal exponential filters across multiple time scales. We couple these filters with a Convolutional Neural Network (CNN) to efficiently extract relevant spatiotemporal features. The combined network learns to output both the expected time to collision of the object and the predicted collision point on a discretized polar grid. These critical estimates are computed with minimal delay by the network in order to react appropriately to the incoming object. We highlight the results of our system on a toy dart moving at 23.4 m/s, with a 24.73° error in $\theta$, an 18.4 mm average discretized radius prediction error, and a 25.03% median time-to-collision prediction error.
Submitted 18 November, 2020;
originally announced November 2020.
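The encoding step described in the abstract above, a set of causal exponential filters across multiple time scales, can be illustrated for a single pixel. This is a sketch under assumptions rather than the authors' implementation: each filter holds a trace that decays as exp(-dt / tau) between events and is incremented by 1 when an event arrives. The class name and the time constants are hypothetical.

```python
import math
from typing import List, Optional, Sequence


class ExponentialFilterBank:
    """Causal exponential traces for one pixel, one trace per time constant."""

    def __init__(self, time_constants: Sequence[float]) -> None:
        self.taus: List[float] = list(time_constants)
        self.state: List[float] = [0.0] * len(self.taus)
        self.last_t: Optional[float] = None

    def update(self, t: float) -> List[float]:
        """Register an event at time t (seconds) and return the traces."""
        if self.last_t is not None:
            dt = t - self.last_t
            if dt < 0:
                raise ValueError("events must arrive in time order")
            # Causal decay: only past events contribute, weighted by their age.
            self.state = [s * math.exp(-dt / tau) for s, tau in zip(self.state, self.taus)]
        self.state = [s + 1.0 for s in self.state]
        self.last_t = t
        return list(self.state)


# A fast (1 ms) and a slow (100 ms) trace responding to two events 1 ms apart.
bank = ExponentialFilterBank([0.001, 0.1])
first = bank.update(0.0)
second = bank.update(0.001)
```

The fast trace has largely forgotten the first event by the time the second arrives, while the slow trace retains it, so the pair of values carries information about event timing that a downstream network could use.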
-
Near-chip Dynamic Vision Filtering for Low-Bandwidth Pedestrian Detection
Authors:
Anthony Bisulco,
Fernando Cladera Ojeda,
Volkan Isler,
Daniel D. Lee
Abstract:
This paper presents a novel end-to-end system for pedestrian detection using Dynamic Vision Sensors (DVSs). We target applications where multiple sensors transmit data to a local processing unit, which executes a detection algorithm. Our system is composed of (i) a near-chip event filter that compresses and denoises the event stream from the DVS, and (ii) a Binary Neural Network (BNN) detection module that runs on a low-computation edge computing device (in our case an STM32F4 microcontroller). We present the system architecture and provide an end-to-end implementation for pedestrian detection in an office environment. Our implementation reduces transmission size by up to 99.6% compared to transmitting the raw event stream. The average packet size in our system is only 1397 bits, while 307.2 kb are required to send an uncompressed DVS time window. Our detector is able to perform a detection every 450 ms, with an overall testing F1 score of 83%. The low bandwidth and energy properties of our system make it ideal for IoT applications.
Submitted 3 April, 2020;
originally announced April 2020.
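The bandwidth figures quoted in the abstract above can be cross-checked with a short calculation. The sketch assumes that "kb" means 1000 bits, so 307.2 kb is 307,200 bits; the constant and function names are hypothetical.

```python
# Figures quoted in the abstract; "kb" is assumed to mean 1000 bits.
RAW_WINDOW_BITS = 307_200   # uncompressed DVS time window
AVG_PACKET_BITS = 1_397     # average compressed packet


def reduction_percent(compressed_bits: float, raw_bits: float) -> float:
    """Percentage by which the compressed size undercuts the raw size."""
    return 100.0 * (1.0 - compressed_bits / raw_bits)


avg_reduction = reduction_percent(AVG_PACKET_BITS, RAW_WINDOW_BITS)
print(f"average reduction: {avg_reduction:.2f}%")
```

Under that assumption the average packet corresponds to a reduction of roughly 99.5%, which is consistent with the stated best case of "up to 99.6%".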
-
Reward Prediction Error as an Exploration Objective in Deep RL
Authors:
Riley Simmons-Edler,
Ben Eisner,
Daniel Yang,
Anthony Bisulco,
Eric Mitchell,
Sebastian Seung,
Daniel Lee
Abstract:
A major challenge in reinforcement learning is exploration, when local dithering methods such as epsilon-greedy sampling are insufficient to solve a given task. Many recent methods have proposed to intrinsically motivate an agent to seek novel states, driving the agent to discover improved reward. However, while state-novelty exploration methods are suitable for tasks where novel observations correlate well with improved reward, they may not explore more efficiently than epsilon-greedy approaches in environments where the two are not well-correlated. In this paper, we distinguish between exploration tasks in which seeking novel states aids in finding new reward, and those where it does not, such as goal-conditioned tasks and escaping local reward maxima. We propose a new exploration objective, maximizing the reward prediction error (RPE) of a value function trained to predict extrinsic reward. We then propose a deep reinforcement learning method, QXplore, which exploits the temporal difference error of a Q-function to solve hard exploration tasks in high-dimensional MDPs. We demonstrate the exploration behavior of QXplore on several OpenAI Gym MuJoCo tasks and Atari games and observe that QXplore is comparable to or better than a baseline state-novelty method in all cases, outperforming the baseline on tasks where state novelty is not well-correlated with improved reward.
Submitted 13 January, 2021; v1 submitted 19 June, 2019;
originally announced June 2019.
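The exploration objective described in the abstract above, using the temporal difference error of a Q-function trained on extrinsic reward as an exploration signal, can be illustrated in a tabular setting. This is a simplified sketch and not the QXplore algorithm itself, which the abstract describes as a deep RL method; the dictionary-based Q-table, the function names, and the learning rate are all assumptions made for illustration.

```python
from typing import Dict, Hashable, Sequence, Tuple

QTable = Dict[Tuple[Hashable, Hashable], float]


def td_error(
    q: QTable,
    state: Hashable,
    action: Hashable,
    reward: float,
    next_state: Hashable,
    actions: Sequence[Hashable],
    gamma: float = 0.99,
    done: bool = False,
) -> float:
    """One-step TD error of a Q-table that predicts extrinsic reward."""
    bootstrap = 0.0 if done else gamma * max(q.get((next_state, a), 0.0) for a in actions)
    return reward + bootstrap - q.get((state, action), 0.0)


def exploration_signal(delta: float) -> float:
    """Unsigned prediction error: large where reward is poorly predicted."""
    return abs(delta)


# A surprising reward gives a large signal; learning from it shrinks the signal.
q: QTable = {}
actions = ["left", "right"]
delta_before = td_error(q, "s0", "right", 1.0, "s1", actions)
q[("s0", "right")] = q.get(("s0", "right"), 0.0) + 0.5 * delta_before  # lr = 0.5 (assumed)
delta_after = td_error(q, "s0", "right", 1.0, "s1", actions)
```

The signal decays as the value function learns to predict the reward, so transitions with mispredicted reward are favored rather than transitions that are merely novel, which is the distinction the abstract draws against state-novelty methods.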
-
Experimental Results of a 3D Millimeter-Wave Compressive-Reflector-Antenna Imaging System
Authors:
Weite Zhang,
Ali Molaei,
Juan Heredia-Juesas,
Luis Tirado,
Katherine Graham,
A. Bisulco,
Hipolito Gomez-Sousa,
Jose A. Martinez-Lorenzo
Abstract:
This letter presents the first experimental results of our three-dimensional (3D) millimeter-wave (mm-wave) Compressive-Reflector-Antenna (CRA) imaging system. In this prototype, the CRA is 3D-printed and coated with a metallic spray to easily introduce pseudo-random scatterers on the surface of a traditional reflector antenna (TRA). The CRA performs a pseudo-random coding of the incident wavefront, thus adding spatial diversity in the imaging region and enabling the effective use of compressive sensing (CS) and imaging techniques. The CRA is fed with a multiple-input-multiple-output (MIMO) radar, which consists of four transmitting and four receiving ports. Consequently, the mechanical scanning parts and phase shifters, which are necessary in conventional physical or synthetic aperture arrays, are not needed in this system. Experimental results show the effectiveness of the prototype in performing a successful 3D reconstruction of a T-shaped metallic target.
Submitted 28 July, 2018;
originally announced August 2018.