-
Reinforcement Learning for Blind Stair Climbing with Legged and Wheeled-Legged Robots
Authors:
Simon Chamorro,
Victor Klemm,
Miguel de la Iglesia Valls,
Christopher Pal,
Roland Siegwart
Abstract:
In recent years, legged and wheeled-legged robots have gained prominence for tasks in environments predominantly created for humans across various domains. One significant challenge faced by many of these robots is their limited capability to navigate stairs, which hampers their functionality in multi-story environments. This study proposes a method aimed at addressing this limitation, employing r…
▽ More
In recent years, legged and wheeled-legged robots have gained prominence for tasks in environments predominantly created for humans across various domains. One significant challenge faced by many of these robots is their limited capability to navigate stairs, which hampers their functionality in multi-story environments. This study proposes a method aimed at addressing this limitation, employing reinforcement learning to develop a versatile controller applicable to a wide range of robots. In contrast to the conventional velocity-based controllers, our approach builds upon a position-based formulation of the RL task, which we show to be vital for stair climbing. Furthermore, the methodology leverages an asymmetric actor-critic structure, enabling the utilization of privileged information from simulated environments during training while eliminating the reliance on exteroceptive sensors during real-world deployment. Another key feature of the proposed approach is the incorporation of a boolean observation within the controller, enabling the activation or deactivation of a stair-climbing mode. We present our results on different quadrupeds and bipedal robots in simulation and showcase how our method allows the balancing robot Ascento to climb 15cm stairs in the real world, a task that was previously impossible for this robot.
△ Less
Submitted 8 February, 2024;
originally announced February 2024.
-
Vital Signs Estimation Using a 26 GHz Multi-Beam Communication Testbed
Authors:
Miquel Sellés Valls,
Sofie Pollin,
Ying Wang,
Rizqi Hersyandika,
Andre Kokkeler,
Yang Miao
Abstract:
This paper presents a novel pipeline for vital sign monitoring using a 26 GHz multi-beam communication testbed. In context of Joint Communication and Sensing (JCAS), the advanced communication capability at millimeter-wave bands is comparable to the radio resource of radars and is promising to sense the surrounding environment. Being able to communicate and sense the vital sign of humans present i…
▽ More
This paper presents a novel pipeline for vital sign monitoring using a 26 GHz multi-beam communication testbed. In context of Joint Communication and Sensing (JCAS), the advanced communication capability at millimeter-wave bands is comparable to the radio resource of radars and is promising to sense the surrounding environment. Being able to communicate and sense the vital sign of humans present in the environment will enable new vertical services of telecommunication, i.e., remote health monitoring. The proposed processing pipeline leverages spatially orthogonal beams to estimate the vital sign - breath rate and heart rate - of single and multiple persons in static scenarios from the raw Channel State Information samples. We consider both monostatic and bistatic sensing scenarios. For monostatic scenario, we employ the phase time-frequency calibration and Discrete Wavelet Transform to improve the performance compared to the conventional Fast Fourier Transform based methods. For bistatic scenario, we use K-means clustering algorithm to extract multi-person vital signs due to the distinct frequency-domain signal feature between single and multi-person scenarios. The results show that the estimated breath rate and heart rate reach below 2 beats per minute (bpm) error compared to the reference captured by on-body sensor for the single-person monostatic sensing scenario with body-transceiver distance up to 2 m, and the two-person bistatic sensing scenario with BS-UE distance up to 4 m. The presented work does not optimize the OFDM waveform parameters for sensing; it demonstrates a promising JCAS proof-of-concept in contact-free vital sign monitoring using mmWave multi-beam communication systems.
△ Less
Submitted 13 December, 2023; v1 submitted 19 November, 2023;
originally announced November 2023.
-
Heat transfer correlations for buoyant liquid metal MHD flows in blanket poloidal channels
Authors:
Daniel Suarez,
Elisabet Mas de les Valls,
Lluis Batet
Abstract:
In recent years, several simulation codes for reproducing liquid metal magnetohydrodynamic (MHD) phenomena have been validated and benchmarked. Accurate simulation codes are crucial to enhance our understanding of how flow behavior affects heat transport in liquid metal-based breeding blankets. Using heat transfer correlations, that model the influence of flow characteristics on the transport of h…
▽ More
In recent years, several simulation codes for reproducing liquid metal magnetohydrodynamic (MHD) phenomena have been validated and benchmarked. Accurate simulation codes are crucial to enhance our understanding of how flow behavior affects heat transport in liquid metal-based breeding blankets. Using heat transfer correlations, that model the influence of flow characteristics on the transport of heat, is especially interesting for system designers because it saves them the effort and time in completely simulating every design proposal. Our group has studied the buoyant MHD flow in poloidal channels on the EU Dual Coolant Lead Lithium (DCLL) blanket geometry. Two different codes were used for this study: a 2D fully-developed code and a Q2D-fully-developed code. In this work, we explored the influence of different flow conditions in the heat transport phenomena parametrically. This article presents the results of the calculations performed using the two codes and provides heat transfer correlations for poloidal EU DCLL channels.
△ Less
Submitted 20 March, 2023;
originally announced March 2023.
-
End-to-End Velocity Estimation For Autonomous Racing
Authors:
Sirish Srinivasan,
Inkyu Sa,
Alex Zyner,
Victor Reijgwart,
Miguel I. Valls,
Roland Siegwart
Abstract:
Velocity estimation plays a central role in driverless vehicles, but standard and affordable methods struggle to cope with extreme scenarios like aggressive maneuvers due to the presence of high sideslip. To solve this, autonomous race cars are usually equipped with expensive external velocity sensors. In this paper, we present an end-to-end recurrent neural network that takes available raw sensor…
▽ More
Velocity estimation plays a central role in driverless vehicles, but standard and affordable methods struggle to cope with extreme scenarios like aggressive maneuvers due to the presence of high sideslip. To solve this, autonomous race cars are usually equipped with expensive external velocity sensors. In this paper, we present an end-to-end recurrent neural network that takes available raw sensors as input (IMU, wheel odometry, and motor currents) and outputs velocity estimates. The results are compared to two state-of-the-art Kalman filters, which respectively include and exclude expensive velocity sensors. All methods have been extensively tested on a formula student driverless race car with very high sideslip (10° at the rear axle) and slip ratio (~20%), operating close to the limits of handling. The proposed network is able to estimate lateral velocity up to 15x better than the Kalman filter with the equivalent sensor input and matches (0.06 m/s RMSE) the Kalman filter with the expensive velocity sensor setup.
△ Less
Submitted 16 August, 2020; v1 submitted 15 March, 2020;
originally announced March 2020.
-
Practical Reinforcement Learning For MPC: Learning from sparse objectives in under an hour on a real robot
Authors:
Napat Karnchanachari,
Miguel I. Valls,
David Hoeller,
Marco Hutter
Abstract:
Model Predictive Control (MPC) is a powerful control technique that handles constraints, takes the system's dynamics into account, and optimizes for a given cost function. In practice, however, it often requires an expert to craft and tune this cost function and find trade-offs between different state penalties to satisfy simple high level objectives. In this paper, we use Reinforcement Learning a…
▽ More
Model Predictive Control (MPC) is a powerful control technique that handles constraints, takes the system's dynamics into account, and optimizes for a given cost function. In practice, however, it often requires an expert to craft and tune this cost function and find trade-offs between different state penalties to satisfy simple high level objectives. In this paper, we use Reinforcement Learning and in particular value learning to approximate the value function given only high level objectives, which can be sparse and binary. Building upon previous works, we present improvements that allowed us to successfully deploy the method on a real world unmanned ground vehicle. Our experiments show that our method can learn the cost function from scratch and without human intervention, while reaching a performance level similar to that of an expert-tuned MPC. We perform a quantitative comparison of these methods with standard MPC approaches both in simulation and on the real robot.
△ Less
Submitted 20 April, 2020; v1 submitted 6 March, 2020;
originally announced March 2020.
-
Knowledge graph based methods for record linkage
Authors:
B. Gautam,
O. Ramos Terrades,
J. M. Pujades,
M. Valls
Abstract:
Nowadays, it is common in Historical Demography the use of individual-level data as a consequence of a predominant life-course approach for the understanding of the demographic behaviour, family transition, mobility, etc. Record linkage advance is key in these disciplines since it allows to increase the volume and the data complexity to be analyzed. However, current methods are constrained to link…
▽ More
Nowadays, it is common in Historical Demography the use of individual-level data as a consequence of a predominant life-course approach for the understanding of the demographic behaviour, family transition, mobility, etc. Record linkage advance is key in these disciplines since it allows to increase the volume and the data complexity to be analyzed. However, current methods are constrained to link data coming from the same kind of sources. Knowledge graph are flexible semantic representations, which allow to encode data variability and semantic relations in a structured manner.
In this paper we propose the knowledge graph use to tackle record linkage task. The proposed method, named {\bf WERL}, takes advantage of the main knowledge graph properties and learns embedding vectors to encode census information. These embeddings are properly weighted to maximize the record linkage performance. We have evaluated this method on benchmark data sets and we have compared it to related methods with stimulating and satisfactory results.
△ Less
Submitted 6 March, 2020;
originally announced March 2020.
-
AMZ Driverless: The Full Autonomous Racing System
Authors:
Juraj Kabzan,
Miguel de la Iglesia Valls,
Victor Reijgwart,
Hubertus Franciscus Cornelis Hendrikx,
Claas Ehmke,
Manish Prajapat,
Andreas Bühler,
Nikhil Gosala,
Mehak Gupta,
Ramya Sivanesan,
Ankit Dhall,
Eugenio Chisari,
Napat Karnchanachari,
Sonja Brits,
Manuel Dangel,
Inkyu Sa,
Renaud Dubé,
Abel Gawel,
Mark Pfeiffer,
Alexander Liniger,
John Lygeros,
Roland Siegwart
Abstract:
This paper presents the algorithms and system architecture of an autonomous racecar. The introduced vehicle is powered by a software stack designed for robustness, reliability, and extensibility. In order to autonomously race around a previously unknown track, the proposed solution combines state of the art techniques from different fields of robotics. Specifically, perception, estimation, and con…
▽ More
This paper presents the algorithms and system architecture of an autonomous racecar. The introduced vehicle is powered by a software stack designed for robustness, reliability, and extensibility. In order to autonomously race around a previously unknown track, the proposed solution combines state of the art techniques from different fields of robotics. Specifically, perception, estimation, and control are incorporated into one high-performance autonomous racecar. This complex robotic system, developed by AMZ Driverless and ETH Zurich, finished 1st overall at each competition we attended: Formula Student Germany 2017, Formula Student Italy 2018 and Formula Student Germany 2018. We discuss the findings and learnings from these competitions and present an experimental evaluation of each module of our solution.
△ Less
Submitted 13 May, 2019;
originally announced May 2019.
-
Design of an Autonomous Racecar: Perception, State Estimation and System Integration
Authors:
Miguel de la Iglesia Valls,
Hubertus Franciscus Cornelis Hendrikx,
Victor Reijgwart,
Fabio Vito Meier,
Inkyu Sa,
Renaud Dubé,
Abel Roman Gawel,
Mathias Bürki,
Roland Siegwart
Abstract:
This paper introduces flüela driverless: the first autonomous racecar to win a Formula Student Driverless competition. In this competition, among other challenges, an autonomous racecar is tasked to complete 10 laps of a previously unknown racetrack as fast as possible and using only onboard sensing and computing. The key components of flüela's design are its modular redundant sub-systems that all…
▽ More
This paper introduces flüela driverless: the first autonomous racecar to win a Formula Student Driverless competition. In this competition, among other challenges, an autonomous racecar is tasked to complete 10 laps of a previously unknown racetrack as fast as possible and using only onboard sensing and computing. The key components of flüela's design are its modular redundant sub-systems that allow robust performance despite challenging perceptual conditions or partial system failures. The paper presents the integration of key components of our autonomous racecar, i.e., system design, EKF-based state estimation, LiDAR-based perception, and particle filter-based SLAM. We perform an extensive experimental evaluation on real-world data, demonstrating the system's effectiveness by outperforming the next-best ranking team by almost half the time required to finish a lap. The autonomous racecar reaches lateral and longitudinal accelerations comparable to those achieved by experienced human drivers.
△ Less
Submitted 9 April, 2018;
originally announced April 2018.