Search | arXiv e-print repository

Reinforcement Learning for Blind Stair Climbing with Legged and Wheeled-Legged Robots

Authors: Simon Chamorro, Victor Klemm, Miguel de la Iglesia Valls, Christopher Pal, Roland Siegwart

Abstract: In recent years, legged and wheeled-legged robots have gained prominence for tasks in environments predominantly created for humans across various domains. One significant challenge faced by many of these robots is their limited capability to navigate stairs, which hampers their functionality in multi-story environments. This study proposes a method aimed at addressing this limitation, employing r… ▽ More In recent years, legged and wheeled-legged robots have gained prominence for tasks in environments predominantly created for humans across various domains. One significant challenge faced by many of these robots is their limited capability to navigate stairs, which hampers their functionality in multi-story environments. This study proposes a method aimed at addressing this limitation, employing reinforcement learning to develop a versatile controller applicable to a wide range of robots. In contrast to the conventional velocity-based controllers, our approach builds upon a position-based formulation of the RL task, which we show to be vital for stair climbing. Furthermore, the methodology leverages an asymmetric actor-critic structure, enabling the utilization of privileged information from simulated environments during training while eliminating the reliance on exteroceptive sensors during real-world deployment. Another key feature of the proposed approach is the incorporation of a boolean observation within the controller, enabling the activation or deactivation of a stair-climbing mode. We present our results on different quadrupeds and bipedal robots in simulation and showcase how our method allows the balancing robot Ascento to climb 15cm stairs in the real world, a task that was previously impossible for this robot. △ Less

Submitted 8 February, 2024; originally announced February 2024.

Comments: Video: https://youtu.be/Ec6ar8BVJh4

arXiv:2311.11275 [pdf, other]

Vital Signs Estimation Using a 26 GHz Multi-Beam Communication Testbed

Authors: Miquel Sellés Valls, Sofie Pollin, Ying Wang, Rizqi Hersyandika, Andre Kokkeler, Yang Miao

Abstract: This paper presents a novel pipeline for vital sign monitoring using a 26 GHz multi-beam communication testbed. In context of Joint Communication and Sensing (JCAS), the advanced communication capability at millimeter-wave bands is comparable to the radio resource of radars and is promising to sense the surrounding environment. Being able to communicate and sense the vital sign of humans present i… ▽ More This paper presents a novel pipeline for vital sign monitoring using a 26 GHz multi-beam communication testbed. In context of Joint Communication and Sensing (JCAS), the advanced communication capability at millimeter-wave bands is comparable to the radio resource of radars and is promising to sense the surrounding environment. Being able to communicate and sense the vital sign of humans present in the environment will enable new vertical services of telecommunication, i.e., remote health monitoring. The proposed processing pipeline leverages spatially orthogonal beams to estimate the vital sign - breath rate and heart rate - of single and multiple persons in static scenarios from the raw Channel State Information samples. We consider both monostatic and bistatic sensing scenarios. For monostatic scenario, we employ the phase time-frequency calibration and Discrete Wavelet Transform to improve the performance compared to the conventional Fast Fourier Transform based methods. For bistatic scenario, we use K-means clustering algorithm to extract multi-person vital signs due to the distinct frequency-domain signal feature between single and multi-person scenarios. The results show that the estimated breath rate and heart rate reach below 2 beats per minute (bpm) error compared to the reference captured by on-body sensor for the single-person monostatic sensing scenario with body-transceiver distance up to 2 m, and the two-person bistatic sensing scenario with BS-UE distance up to 4 m. The presented work does not optimize the OFDM waveform parameters for sensing; it demonstrates a promising JCAS proof-of-concept in contact-free vital sign monitoring using mmWave multi-beam communication systems. △ Less

Submitted 13 December, 2023; v1 submitted 19 November, 2023; originally announced November 2023.

arXiv:2303.11478 [pdf, other]

Heat transfer correlations for buoyant liquid metal MHD flows in blanket poloidal channels

Authors: Daniel Suarez, Elisabet Mas de les Valls, Lluis Batet

Abstract: In recent years, several simulation codes for reproducing liquid metal magnetohydrodynamic (MHD) phenomena have been validated and benchmarked. Accurate simulation codes are crucial to enhance our understanding of how flow behavior affects heat transport in liquid metal-based breeding blankets. Using heat transfer correlations, that model the influence of flow characteristics on the transport of h… ▽ More In recent years, several simulation codes for reproducing liquid metal magnetohydrodynamic (MHD) phenomena have been validated and benchmarked. Accurate simulation codes are crucial to enhance our understanding of how flow behavior affects heat transport in liquid metal-based breeding blankets. Using heat transfer correlations, that model the influence of flow characteristics on the transport of heat, is especially interesting for system designers because it saves them the effort and time in completely simulating every design proposal. Our group has studied the buoyant MHD flow in poloidal channels on the EU Dual Coolant Lead Lithium (DCLL) blanket geometry. Two different codes were used for this study: a 2D fully-developed code and a Q2D-fully-developed code. In this work, we explored the influence of different flow conditions in the heat transport phenomena parametrically. This article presents the results of the calculations performed using the two codes and provides heat transfer correlations for poloidal EU DCLL channels. △ Less

Submitted 20 March, 2023; originally announced March 2023.

Comments: 14 pages

arXiv:2003.06917 [pdf, other]

End-to-End Velocity Estimation For Autonomous Racing

Authors: Sirish Srinivasan, Inkyu Sa, Alex Zyner, Victor Reijgwart, Miguel I. Valls, Roland Siegwart

Abstract: Velocity estimation plays a central role in driverless vehicles, but standard and affordable methods struggle to cope with extreme scenarios like aggressive maneuvers due to the presence of high sideslip. To solve this, autonomous race cars are usually equipped with expensive external velocity sensors. In this paper, we present an end-to-end recurrent neural network that takes available raw sensor… ▽ More Velocity estimation plays a central role in driverless vehicles, but standard and affordable methods struggle to cope with extreme scenarios like aggressive maneuvers due to the presence of high sideslip. To solve this, autonomous race cars are usually equipped with expensive external velocity sensors. In this paper, we present an end-to-end recurrent neural network that takes available raw sensors as input (IMU, wheel odometry, and motor currents) and outputs velocity estimates. The results are compared to two state-of-the-art Kalman filters, which respectively include and exclude expensive velocity sensors. All methods have been extensively tested on a formula student driverless race car with very high sideslip (10° at the rear axle) and slip ratio (~20%), operating close to the limits of handling. The proposed network is able to estimate lateral velocity up to 15x better than the Kalman filter with the equivalent sensor input and matches (0.06 m/s RMSE) the Kalman filter with the expensive velocity sensor setup. △ Less

Submitted 16 August, 2020; v1 submitted 15 March, 2020; originally announced March 2020.

Comments: RA-L + IROS 2020

arXiv:2003.03200 [pdf, other]

Practical Reinforcement Learning For MPC: Learning from sparse objectives in under an hour on a real robot

Authors: Napat Karnchanachari, Miguel I. Valls, David Hoeller, Marco Hutter

Abstract: Model Predictive Control (MPC) is a powerful control technique that handles constraints, takes the system's dynamics into account, and optimizes for a given cost function. In practice, however, it often requires an expert to craft and tune this cost function and find trade-offs between different state penalties to satisfy simple high level objectives. In this paper, we use Reinforcement Learning a… ▽ More Model Predictive Control (MPC) is a powerful control technique that handles constraints, takes the system's dynamics into account, and optimizes for a given cost function. In practice, however, it often requires an expert to craft and tune this cost function and find trade-offs between different state penalties to satisfy simple high level objectives. In this paper, we use Reinforcement Learning and in particular value learning to approximate the value function given only high level objectives, which can be sparse and binary. Building upon previous works, we present improvements that allowed us to successfully deploy the method on a real world unmanned ground vehicle. Our experiments show that our method can learn the cost function from scratch and without human intervention, while reaching a performance level similar to that of an expert-tuned MPC. We perform a quantitative comparison of these methods with standard MPC approaches both in simulation and on the real robot. △ Less

Submitted 20 April, 2020; v1 submitted 6 March, 2020; originally announced March 2020.

Comments: 14 pages, 6 figures, submitted to L4DC 2020

MSC Class: 49M37 ACM Class: I.2.6; I.2.8; I.2.9

arXiv:2003.03136 [pdf, other]

Knowledge graph based methods for record linkage

Authors: B. Gautam, O. Ramos Terrades, J. M. Pujades, M. Valls

Abstract: Nowadays, it is common in Historical Demography the use of individual-level data as a consequence of a predominant life-course approach for the understanding of the demographic behaviour, family transition, mobility, etc. Record linkage advance is key in these disciplines since it allows to increase the volume and the data complexity to be analyzed. However, current methods are constrained to link… ▽ More Nowadays, it is common in Historical Demography the use of individual-level data as a consequence of a predominant life-course approach for the understanding of the demographic behaviour, family transition, mobility, etc. Record linkage advance is key in these disciplines since it allows to increase the volume and the data complexity to be analyzed. However, current methods are constrained to link data coming from the same kind of sources. Knowledge graph are flexible semantic representations, which allow to encode data variability and semantic relations in a structured manner. In this paper we propose the knowledge graph use to tackle record linkage task. The proposed method, named {\bf WERL}, takes advantage of the main knowledge graph properties and learns embedding vectors to encode census information. These embeddings are properly weighted to maximize the record linkage performance. We have evaluated this method on benchmark data sets and we have compared it to related methods with stimulating and satisfactory results. △ Less

Submitted 6 March, 2020; originally announced March 2020.

Comments: the paper is under consideration at Pattern Recognition Letters

arXiv:1905.05150 [pdf, other]

AMZ Driverless: The Full Autonomous Racing System

Authors: Juraj Kabzan, Miguel de la Iglesia Valls, Victor Reijgwart, Hubertus Franciscus Cornelis Hendrikx, Claas Ehmke, Manish Prajapat, Andreas Bühler, Nikhil Gosala, Mehak Gupta, Ramya Sivanesan, Ankit Dhall, Eugenio Chisari, Napat Karnchanachari, Sonja Brits, Manuel Dangel, Inkyu Sa, Renaud Dubé, Abel Gawel, Mark Pfeiffer, Alexander Liniger, John Lygeros, Roland Siegwart

Abstract: This paper presents the algorithms and system architecture of an autonomous racecar. The introduced vehicle is powered by a software stack designed for robustness, reliability, and extensibility. In order to autonomously race around a previously unknown track, the proposed solution combines state of the art techniques from different fields of robotics. Specifically, perception, estimation, and con… ▽ More This paper presents the algorithms and system architecture of an autonomous racecar. The introduced vehicle is powered by a software stack designed for robustness, reliability, and extensibility. In order to autonomously race around a previously unknown track, the proposed solution combines state of the art techniques from different fields of robotics. Specifically, perception, estimation, and control are incorporated into one high-performance autonomous racecar. This complex robotic system, developed by AMZ Driverless and ETH Zurich, finished 1st overall at each competition we attended: Formula Student Germany 2017, Formula Student Italy 2018 and Formula Student Germany 2018. We discuss the findings and learnings from these competitions and present an experimental evaluation of each module of our solution. △ Less

Submitted 13 May, 2019; originally announced May 2019.

Comments: 40 pages, 32 figures, submitted to Journal of Field Robotics

arXiv:1804.03252 [pdf, other]

Design of an Autonomous Racecar: Perception, State Estimation and System Integration

Authors: Miguel de la Iglesia Valls, Hubertus Franciscus Cornelis Hendrikx, Victor Reijgwart, Fabio Vito Meier, Inkyu Sa, Renaud Dubé, Abel Roman Gawel, Mathias Bürki, Roland Siegwart

Abstract: This paper introduces flüela driverless: the first autonomous racecar to win a Formula Student Driverless competition. In this competition, among other challenges, an autonomous racecar is tasked to complete 10 laps of a previously unknown racetrack as fast as possible and using only onboard sensing and computing. The key components of flüela's design are its modular redundant sub-systems that all… ▽ More This paper introduces flüela driverless: the first autonomous racecar to win a Formula Student Driverless competition. In this competition, among other challenges, an autonomous racecar is tasked to complete 10 laps of a previously unknown racetrack as fast as possible and using only onboard sensing and computing. The key components of flüela's design are its modular redundant sub-systems that allow robust performance despite challenging perceptual conditions or partial system failures. The paper presents the integration of key components of our autonomous racecar, i.e., system design, EKF-based state estimation, LiDAR-based perception, and particle filter-based SLAM. We perform an extensive experimental evaluation on real-world data, demonstrating the system's effectiveness by outperforming the next-best ranking team by almost half the time required to finish a lap. The autonomous racecar reaches lateral and longitudinal accelerations comparable to those achieved by experienced human drivers. △ Less

Submitted 9 April, 2018; originally announced April 2018.

Comments: 8 pages, 10 figures, accepted to International Conference on Robotics and Automation | 21-25 May 2018 | Brisbane

Showing 1–8 of 8 results for author: Valls, M