Search | arXiv e-print repository

Automatic Speech Recognition (ASR) for the Diagnosis of pronunciation of Speech Sound Disorders in Korean children

Authors: Taekyung Ahn, Yeonjung Hong, Younggon Im, Do Hyung Kim, Dayoung Kang, Joo Won Jeong, Jae Won Kim, Min Jung Kim, Ah-ra Cho, Dae-Hyun Jang, Hosung Nam

Abstract: This study presents a model of automatic speech recognition (ASR) designed to diagnose pronunciation issues in children with speech sound disorders (SSDs) to replace manual transcriptions in clinical procedures. Since ASR models trained for general purposes primarily predict input speech into real words, employing a well-known high-performance ASR model for evaluating pronunciation in children wit… ▽ More This study presents a model of automatic speech recognition (ASR) designed to diagnose pronunciation issues in children with speech sound disorders (SSDs) to replace manual transcriptions in clinical procedures. Since ASR models trained for general purposes primarily predict input speech into real words, employing a well-known high-performance ASR model for evaluating pronunciation in children with SSDs is impractical. We fine-tuned the wav2vec 2.0 XLS-R model to recognize speech as pronounced rather than as existing words. The model was fine-tuned with a speech dataset from 137 children with inadequate speech production pronouncing 73 Korean words selected for actual clinical diagnosis. The model's predictions of the pronunciations of the words matched the human annotations with about 90% accuracy. While the model still requires improvement in recognizing unclear pronunciation, this study demonstrates that ASR models can streamline complex pronunciation error diagnostic procedures in clinical fields. △ Less

Submitted 12 March, 2024; originally announced March 2024.

Comments: 12 pages, 2 figures

ACM Class: I.2.7

arXiv:2312.10289 [pdf, other]

Active Reinforcement Learning for Robust Building Control

Authors: Doseok Jang, Larry Yan, Lucas Spangher, Costas Spanos

Abstract: Reinforcement learning (RL) is a powerful tool for optimal control that has found great success in Atari games, the game of Go, robotic control, and building optimization. RL is also very brittle; agents often overfit to their training environment and fail to generalize to new settings. Unsupervised environment design (UED) has been proposed as a solution to this problem, in which the agent trains… ▽ More Reinforcement learning (RL) is a powerful tool for optimal control that has found great success in Atari games, the game of Go, robotic control, and building optimization. RL is also very brittle; agents often overfit to their training environment and fail to generalize to new settings. Unsupervised environment design (UED) has been proposed as a solution to this problem, in which the agent trains in environments that have been specially selected to help it learn. Previous UED algorithms focus on trying to train an RL agent that generalizes across a large distribution of environments. This is not necessarily desirable when we wish to prioritize performance in one environment over others. In this work, we will be examining the setting of robust RL building control, where we wish to train an RL agent that prioritizes performing well in normal weather while still being robust to extreme weather conditions. We demonstrate a novel UED algorithm, ActivePLR, that uses uncertainty-aware neural network architectures to generate new training environments at the limit of the RL agent's ability while being able to prioritize performance in a desired base environment. We show that ActivePLR is able to outperform state-of-the-art UED algorithms in minimizing energy usage while maximizing occupant comfort in the setting of building control. △ Less

Submitted 15 December, 2023; originally announced December 2023.

arXiv:2210.06820 [pdf, other]

Personalized Federated Hypernetworks for Privacy Preservation in Multi-Task Reinforcement Learning

Authors: Doseok Jang, Larry Yan, Lucas Spangher, Costas J. Spanos

Abstract: Multi-Agent Reinforcement Learning currently focuses on implementations where all data and training can be centralized to one machine. But what if local agents are split across multiple tasks, and need to keep data private between each? We develop the first application of Personalized Federated Hypernetworks (PFH) to Reinforcement Learning (RL). We then present a novel application of PFH to few-sh… ▽ More Multi-Agent Reinforcement Learning currently focuses on implementations where all data and training can be centralized to one machine. But what if local agents are split across multiple tasks, and need to keep data private between each? We develop the first application of Personalized Federated Hypernetworks (PFH) to Reinforcement Learning (RL). We then present a novel application of PFH to few-shot transfer, and demonstrate significant initial increases in learning. PFH has never been demonstrated beyond supervised learning benchmarks, so we apply PFH to an important domain: RL price-setting for energy demand response. We consider a general case across where agents are split across multiple microgrids, wherein energy consumption data must be kept private within each microgrid. Together, our work explores how the fields of personalized federated learning and RL can come together to make learning efficient across multiple tasks while kee** data secure. △ Less

Submitted 19 October, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

arXiv:2207.03469 [pdf, other]

Linearized Physics-Based Lithium-Ion Battery Model for Power System Economic Studies

Authors: Anton V. Vykhodtsev, Darren Jang, Qianpu Wang, William Rosehart, Hamidreza Zareipour

Abstract: This paper proposes the linearized physics-based model of a lithium-ion battery that can be incorporated into the optimization framework for power system economic studies. The proposed model is a linear approximation of the single particle model and it allows to characterize dynamics of the physical processes inside the battery that impact the battery operation. There is a need for such model as a… ▽ More This paper proposes the linearized physics-based model of a lithium-ion battery that can be incorporated into the optimization framework for power system economic studies. The proposed model is a linear approximation of the single particle model and it allows to characterize dynamics of the physical processes inside the battery that impact the battery operation. There is a need for such model as a simplistic power-energy model that is widely employed in operation and planning studies with the lithium-ion battery energy storage system (LIBESS) results in infeasible operation and misleading economic assessment. The proposed linearized model is computationally beneficial compared with a recently used nonlinear physics-based model. The energy arbitrage application is used to assess the advantages of the proposed model over a simple power-energy model. △ Less

Submitted 7 July, 2022; originally announced July 2022.

Comments: In proceedings of the 11th Bulk Power Systems Dynamics and Control Symposium (IREP 2022), July 25-30, 2022, Banff, Canada

Report number: IREP2022-21

arXiv:2112.14433 [pdf, other]

Fully Distributed Informative Planning for Environmental Learning with Multi-Robot Systems

Authors: Dohyun Jang, Jaehyun Yoo, Clark Youngdong Son, H. ** Kim

Abstract: This paper proposes a cooperative environmental learning algorithm working in a fully distributed manner. A multi-robot system is more effective for exploration tasks than a single robot, but it involves the following challenges: 1) online distributed learning of environmental map using multiple robots; 2) generation of safe and efficient exploration path based on the learned map; and 3) maintenan… ▽ More This paper proposes a cooperative environmental learning algorithm working in a fully distributed manner. A multi-robot system is more effective for exploration tasks than a single robot, but it involves the following challenges: 1) online distributed learning of environmental map using multiple robots; 2) generation of safe and efficient exploration path based on the learned map; and 3) maintenance of the scalability with respect to the number of robots. To this end, we divide the entire process into two stages of environmental learning and path planning. Distributed algorithms are applied in each stage and combined through communication between adjacent robots. The environmental learning algorithm uses a distributed Gaussian process, and the path planning algorithm uses a distributed Monte Carlo tree search. As a result, we build a scalable system without the constraint on the number of robots. Simulation results demonstrate the performance and scalability of the proposed system. Moreover, a real-world-dataset-based simulation validates the utility of our algorithm in a more realistic scenario. △ Less

Submitted 29 December, 2021; originally announced December 2021.

arXiv:2111.14362 [pdf, other]

Unsupervised Image Denoising with Frequency Domain Knowledge

Authors: Nahyun Kim, Donggon Jang, Sunhyeok Lee, Bomi Kim, Dae-Shik Kim

Abstract: Supervised learning-based methods yield robust denoising results, yet they are inherently limited by the need for large-scale clean/noisy paired datasets. The use of unsupervised denoisers, on the other hand, necessitates a more detailed understanding of the underlying image statistics. In particular, it is well known that apparent differences between clean and noisy images are most prominent on h… ▽ More Supervised learning-based methods yield robust denoising results, yet they are inherently limited by the need for large-scale clean/noisy paired datasets. The use of unsupervised denoisers, on the other hand, necessitates a more detailed understanding of the underlying image statistics. In particular, it is well known that apparent differences between clean and noisy images are most prominent on high-frequency bands, justifying the use of low-pass filters as part of conventional image preprocessing steps. However, most learning-based denoising methods utilize only one-sided information from the spatial domain without considering frequency domain information. To address this limitation, in this study we propose a frequency-sensitive unsupervised denoising method. To this end, a generative adversarial network (GAN) is used as a base structure. Subsequently, we include spectral discriminator and frequency reconstruction loss to transfer frequency knowledge into the generator. Results using natural and synthetic datasets indicate that our unsupervised learning method augmented with frequency information achieves state-of-the-art denoising performance, suggesting that frequency domain information could be a viable factor in improving the overall performance of unsupervised learning-based methods. △ Less

Submitted 29 November, 2021; originally announced November 2021.

Comments: Accepted to BMVC 2021

arXiv:2106.08702 [pdf, other]

doi 10.1016/j.rser.2022.112584

A Review of Lithium-Ion Battery Models in Techno-economic Analyses of Power Systems

Authors: Anton V. Vykhodtsev, Darren Jang, Qianpu Wang, Hamidreza Zareipour, William D. Rosehart

Abstract: The penetration of the lithium-ion battery energy storage system (BESS) into the power system environment occurs at a colossal rate worldwide. This is mainly because it is considered as one of the major tools to decarbonize, digitalize, and democratize the electricity grid. The economic viability and technical reliability of projects with batteries require appropriate assessment because of high ca… ▽ More The penetration of the lithium-ion battery energy storage system (BESS) into the power system environment occurs at a colossal rate worldwide. This is mainly because it is considered as one of the major tools to decarbonize, digitalize, and democratize the electricity grid. The economic viability and technical reliability of projects with batteries require appropriate assessment because of high capital expenditures, deterioration in charging/discharging performance and uncertainty with regulatory policies. Most of the power system economic studies employ a simple power-energy representation coupled with an empirical description of degradation to model the lithium-ion battery. This approach to modelling may result in violations of the safe operation and misleading estimates of the economic benefits. Recently, the number of publications on techno-economic analysis of BESS with more details on the lithium-ion battery performance has increased. The aim of this review paper is to explore these publications focused on the grid-scale BESS applications and to discuss the impacts of using more sophisticated modelling approaches. First, an overview of the three most popular battery models is given, followed by a review of the applications of such models. The possible directions of future research of employing detailed battery models in power systems' techno-economic studies are then explored. △ Less

Submitted 16 June, 2021; originally announced June 2021.

arXiv:1612.06008 [pdf, other]

Optimal Control-Based UAV Path Planning with Dynamically-Constrained TSP with Neighborhoods

Authors: Dae-Sung Jang, Hyeok-Joo Chae, Han-Lim Choi

Abstract: This paper addresses path planning of an unmanned aerial vehicle (UAV) with remote sensing capabilities (or wireless communication capabilities). The goal of the path planning is to find a minimum-flight-time closed tour of the UAV visiting all executable areas of given remote sensing and communication tasks; in order to incorporate the nonlinear vehicle dynamics, this problem is regarded as a dyn… ▽ More This paper addresses path planning of an unmanned aerial vehicle (UAV) with remote sensing capabilities (or wireless communication capabilities). The goal of the path planning is to find a minimum-flight-time closed tour of the UAV visiting all executable areas of given remote sensing and communication tasks; in order to incorporate the nonlinear vehicle dynamics, this problem is regarded as a dynamically-constrained traveling salesman problem with neighborhoods. To obtain a close-to-optimal solution for the path planning in a tractable manner, a sampling-based roadmap algorithm that embeds an optimal control-based path generation process is proposed. The algorithm improves the computational efficiency by reducing numerical computations required for optimizing inefficient local paths, and by extracting additional information from a roadmap of a fixed number of samples. Comparative numerical simulations validate the efficiency of the presented algorithm in reducing computation time and improving the solution quality compared to previous roadmap-based planning methods. △ Less

Submitted 18 December, 2016; originally announced December 2016.

Comments: 17 pages, 7 figures

arXiv:1401.4248 [pdf, other]

Complexity Analysis of Heuristic Pulse Interleaving Algorithms for Multi-Target Tracking with Multiple Simultaneous Receive Beams

Authors: Dae-Sung Jang, Han-Lim Choi

Abstract: This paper presents heuristic algorithms for interleaved pulse scheduling problems on multi-target tracking in pulse Doppler phased array radars that can process multiple simultaneous received beams. The interleaved pulse scheduling problems for element and subarray level digital beamforming architectures are formulated as the same integer program and the asymptotic time complexities of the algori… ▽ More This paper presents heuristic algorithms for interleaved pulse scheduling problems on multi-target tracking in pulse Doppler phased array radars that can process multiple simultaneous received beams. The interleaved pulse scheduling problems for element and subarray level digital beamforming architectures are formulated as the same integer program and the asymptotic time complexities of the algorithms are analyzed. △ Less

Submitted 5 December, 2014; v1 submitted 17 January, 2014; originally announced January 2014.

Comments: 29 pages, 6 figures

arXiv:1308.2272 [pdf, other]

Search Optimization for Minimum Load under Detection Performance Constraints in Multifunction Radars

Authors: Dae-Sung Jang, Han-Lim Choi, Ji-Eun Roh

Abstract: This paper presents a solution procedure of search parameter optimization for minimum load ensuring desired one-off and cumulative probabilities of detection in a multifunction phased array radar. The key approach is to convert this nonlinear optimization on four search parameters into a scalar optimization on signal-to-noise ratio by a semi-analytic process based on subproblem decomposition. The… ▽ More This paper presents a solution procedure of search parameter optimization for minimum load ensuring desired one-off and cumulative probabilities of detection in a multifunction phased array radar. The key approach is to convert this nonlinear optimization on four search parameters into a scalar optimization on signal-to-noise ratio by a semi-analytic process based on subproblem decomposition. The efficacy of the proposed solution approach is verified with theoretical analysis and numerical case studies. △ Less

Submitted 9 August, 2013; originally announced August 2013.

Comments: 11 pages, 13 figures, submitted to IEEE Transactions on Aerospace and Electronic Systems

Showing 1–10 of 10 results for author: Jang, D