Search | arXiv e-print repository

doi 10.1109/TIV.2024.3366153

Enhancing State Estimator for Autonomous Racing : Leveraging Multi-modal System and Managing Computing Resources

Authors: Daegyu Lee, Hyunwoo Nam, Chanhoe Ryu, Sungwon Nah, Seongwoo Moon, D. Hyunchul Shim

Abstract: This paper introduces an approach that enhances the state estimator for high-speed autonomous race cars, addressing challenges from unreliable measurements, localization failures, and computing resource management. The proposed robust localization system utilizes a Bayesian-based probabilistic approach to evaluate multimodal measurements, ensuring the use of credible data for accurate and reliable… ▽ More This paper introduces an approach that enhances the state estimator for high-speed autonomous race cars, addressing challenges from unreliable measurements, localization failures, and computing resource management. The proposed robust localization system utilizes a Bayesian-based probabilistic approach to evaluate multimodal measurements, ensuring the use of credible data for accurate and reliable localization, even in harsh racing conditions. To tackle potential localization failures, we present a resilient navigation system which enables the race car to continue track-following by leveraging direct perception information in planning and execution, ensuring continuous performance despite localization disruptions. In addition, efficient computing is critical to avoid overload and system failure. Hence, we optimize computing resources using an efficient LiDAR-based state estimation method. Leveraging CUDA programming and GPU acceleration, we perform nearest points search and covariance computation efficiently, overcoming CPU bottlenecks. Simulation and real-world tests validate the system's performance and resilience. The proposed approach successfully recovers from failures, effectively preventing accidents and ensuring safety of the car. △ Less

Submitted 12 February, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

Comments: arXiv admin note: text overlap with arXiv:2207.12232

Journal ref: IEEE Transactions on Intelligent Vehicles(2024)

arXiv:2306.01789 [pdf, other]

Edit Distance based RL for RNNT decoding

Authors: Dongseong Hwang, Changwan Ryu, Khe Chai Sim

Abstract: RNN-T is currently considered the industry standard in ASR due to its exceptional WERs in various benchmark tests and its ability to support seamless streaming and longform transcription. However, its biggest drawback lies in the significant discrepancy between its training and inference objectives. During training, RNN-T maximizes all alignment probabilities by teacher forcing, while during infer… ▽ More RNN-T is currently considered the industry standard in ASR due to its exceptional WERs in various benchmark tests and its ability to support seamless streaming and longform transcription. However, its biggest drawback lies in the significant discrepancy between its training and inference objectives. During training, RNN-T maximizes all alignment probabilities by teacher forcing, while during inference, it uses beam search which may not necessarily find the maximum probable alignment. Additionally, RNN-T's inability to experience mistakes during teacher forcing training makes it more problematic when a mistake occurs in inference. To address this issue, this paper proposes a Reinforcement Learning method that minimizes the gap between training and inference time. Our Edit Distance based RL (EDRL) approach computes rewards based on the edit distance, and trains the network at every action level. The proposed approach yielded SoTA WERs on LibriSpeech for the 600M Conformer RNN-T model. △ Less

Submitted 14 July, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

Comments: 5 pages, 2 figures

arXiv:2110.03634 [pdf, other]

Enabling On-Device Training of Speech Recognition Models with Federated Dropout

Authors: Dhruv Guliani, Lillian Zhou, Changwan Ryu, Tien-Ju Yang, Harry Zhang, Yonghui Xiao, Francoise Beaufays, Giovanni Motta

Abstract: Federated learning can be used to train machine learning models on the edge on local data that never leave devices, providing privacy by default. This presents a challenge pertaining to the communication and computation costs associated with clients' devices. These costs are strongly correlated with the size of the model being trained, and are significant for state-of-the-art automatic speech reco… ▽ More Federated learning can be used to train machine learning models on the edge on local data that never leave devices, providing privacy by default. This presents a challenge pertaining to the communication and computation costs associated with clients' devices. These costs are strongly correlated with the size of the model being trained, and are significant for state-of-the-art automatic speech recognition models. We propose using federated dropout to reduce the size of client models while training a full-size model server-side. We provide empirical evidence of the effectiveness of federated dropout, and propose a novel approach to vary the dropout rate applied at each layer. Furthermore, we find that federated dropout enables a set of smaller sub-models within the larger model to independently have low word error rates, making it easier to dynamically adjust the size of the model deployed for inference. △ Less

Submitted 7 October, 2021; originally announced October 2021.

Comments: \c{opyright} 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses

MSC Class: 68T10 ACM Class: I.2.7

arXiv:2009.11543 [pdf, other]

Compressed Key Sort and Fast Index Reconstruction

Authors: Yongsik Kwon, Cheol Ryu, Sang Kyun Cha, Arthur H. Lee, Kunsoo Park, Bongki Moon

Abstract: In this paper we propose an index key compression scheme based on the notion of distinction bits by proving that the distinction bits of index keys are sufficient information to determine the sorted order of the index keys correctly. While the actual compression ratio may vary depending on the characteristics of datasets (an average of 2.76 to one compression ratio was observed in our experiments)… ▽ More In this paper we propose an index key compression scheme based on the notion of distinction bits by proving that the distinction bits of index keys are sufficient information to determine the sorted order of the index keys correctly. While the actual compression ratio may vary depending on the characteristics of datasets (an average of 2.76 to one compression ratio was observed in our experiments), the index key compression scheme leads to significant performance improvements during the reconstruction of large-scale indexes. Our index key compression can be effectively used in database replication and index recovery of modern main-memory database systems. △ Less

Submitted 24 September, 2020; originally announced September 2020.

Comments: 26 pages and 13 figures

arXiv:1908.04937 [pdf, other]

Fast Cartesian Tree Matching

Authors: Siwoo Song, Cheol Ryu, Simone Faro, Thierry Lecroq, Kunsoo Park

Abstract: Cartesian tree matching is the problem of finding all substrings of a given text which have the same Cartesian trees as that of a given pattern. So far there is one linear-time solution for Cartesian tree matching, which is based on the KMP algorithm. We improve the running time of the previous solution by introducing new representations. We present the framework of a binary filtration method and… ▽ More Cartesian tree matching is the problem of finding all substrings of a given text which have the same Cartesian trees as that of a given pattern. So far there is one linear-time solution for Cartesian tree matching, which is based on the KMP algorithm. We improve the running time of the previous solution by introducing new representations. We present the framework of a binary filtration method and an efficient verification technique for Cartesian tree matching. Any exact string matching algorithm can be used as a filtration for Cartesian tree matching on our framework. We also present a SIMD solution for Cartesian tree matching suitable for short patterns. By experiments we show that known string matching algorithms combined on our framework of binary filtration and efficient verification produce algorithms of good performances for Cartesian tree matching. △ Less

Submitted 13 August, 2019; originally announced August 2019.

Comments: 14 pages, 3 figures, Submitted to SPIRE 2019

Showing 1–5 of 5 results for author: Ryu, C