-
Automatic Speech Recognition (ASR) for the Diagnosis of pronunciation of Speech Sound Disorders in Korean children
Authors:
Taekyung Ahn,
Yeonjung Hong,
Younggon Im,
Do Hyung Kim,
Dayoung Kang,
Joo Won Jeong,
Jae Won Kim,
Min Jung Kim,
Ah-ra Cho,
Dae-Hyun Jang,
Hosung Nam
Abstract:
This study presents a model of automatic speech recognition (ASR) designed to diagnose pronunciation issues in children with speech sound disorders (SSDs) to replace manual transcriptions in clinical procedures. Since ASR models trained for general purposes primarily predict input speech into real words, employing a well-known high-performance ASR model for evaluating pronunciation in children wit…
▽ More
This study presents a model of automatic speech recognition (ASR) designed to diagnose pronunciation issues in children with speech sound disorders (SSDs) to replace manual transcriptions in clinical procedures. Since ASR models trained for general purposes primarily predict input speech into real words, employing a well-known high-performance ASR model for evaluating pronunciation in children with SSDs is impractical. We fine-tuned the wav2vec 2.0 XLS-R model to recognize speech as pronounced rather than as existing words. The model was fine-tuned with a speech dataset from 137 children with inadequate speech production pronouncing 73 Korean words selected for actual clinical diagnosis. The model's predictions of the pronunciations of the words matched the human annotations with about 90% accuracy. While the model still requires improvement in recognizing unclear pronunciation, this study demonstrates that ASR models can streamline complex pronunciation error diagnostic procedures in clinical fields.
△ Less
Submitted 12 March, 2024;
originally announced March 2024.
-
Active Reinforcement Learning for Robust Building Control
Authors:
Doseok Jang,
Larry Yan,
Lucas Spangher,
Costas Spanos
Abstract:
Reinforcement learning (RL) is a powerful tool for optimal control that has found great success in Atari games, the game of Go, robotic control, and building optimization. RL is also very brittle; agents often overfit to their training environment and fail to generalize to new settings. Unsupervised environment design (UED) has been proposed as a solution to this problem, in which the agent trains…
▽ More
Reinforcement learning (RL) is a powerful tool for optimal control that has found great success in Atari games, the game of Go, robotic control, and building optimization. RL is also very brittle; agents often overfit to their training environment and fail to generalize to new settings. Unsupervised environment design (UED) has been proposed as a solution to this problem, in which the agent trains in environments that have been specially selected to help it learn. Previous UED algorithms focus on trying to train an RL agent that generalizes across a large distribution of environments. This is not necessarily desirable when we wish to prioritize performance in one environment over others. In this work, we will be examining the setting of robust RL building control, where we wish to train an RL agent that prioritizes performing well in normal weather while still being robust to extreme weather conditions. We demonstrate a novel UED algorithm, ActivePLR, that uses uncertainty-aware neural network architectures to generate new training environments at the limit of the RL agent's ability while being able to prioritize performance in a desired base environment. We show that ActivePLR is able to outperform state-of-the-art UED algorithms in minimizing energy usage while maximizing occupant comfort in the setting of building control.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.
-
Personalized Federated Hypernetworks for Privacy Preservation in Multi-Task Reinforcement Learning
Authors:
Doseok Jang,
Larry Yan,
Lucas Spangher,
Costas J. Spanos
Abstract:
Multi-Agent Reinforcement Learning currently focuses on implementations where all data and training can be centralized to one machine. But what if local agents are split across multiple tasks, and need to keep data private between each? We develop the first application of Personalized Federated Hypernetworks (PFH) to Reinforcement Learning (RL). We then present a novel application of PFH to few-sh…
▽ More
Multi-Agent Reinforcement Learning currently focuses on implementations where all data and training can be centralized to one machine. But what if local agents are split across multiple tasks, and need to keep data private between each? We develop the first application of Personalized Federated Hypernetworks (PFH) to Reinforcement Learning (RL). We then present a novel application of PFH to few-shot transfer, and demonstrate significant initial increases in learning. PFH has never been demonstrated beyond supervised learning benchmarks, so we apply PFH to an important domain: RL price-setting for energy demand response. We consider a general case across where agents are split across multiple microgrids, wherein energy consumption data must be kept private within each microgrid. Together, our work explores how the fields of personalized federated learning and RL can come together to make learning efficient across multiple tasks while kee** data secure.
△ Less
Submitted 19 October, 2022; v1 submitted 13 October, 2022;
originally announced October 2022.
-
Linearized Physics-Based Lithium-Ion Battery Model for Power System Economic Studies
Authors:
Anton V. Vykhodtsev,
Darren Jang,
Qianpu Wang,
William Rosehart,
Hamidreza Zareipour
Abstract:
This paper proposes the linearized physics-based model of a lithium-ion battery that can be incorporated into the optimization framework for power system economic studies. The proposed model is a linear approximation of the single particle model and it allows to characterize dynamics of the physical processes inside the battery that impact the battery operation. There is a need for such model as a…
▽ More
This paper proposes the linearized physics-based model of a lithium-ion battery that can be incorporated into the optimization framework for power system economic studies. The proposed model is a linear approximation of the single particle model and it allows to characterize dynamics of the physical processes inside the battery that impact the battery operation. There is a need for such model as a simplistic power-energy model that is widely employed in operation and planning studies with the lithium-ion battery energy storage system (LIBESS) results in infeasible operation and misleading economic assessment. The proposed linearized model is computationally beneficial compared with a recently used nonlinear physics-based model. The energy arbitrage application is used to assess the advantages of the proposed model over a simple power-energy model.
△ Less
Submitted 7 July, 2022;
originally announced July 2022.
-
Fully Distributed Informative Planning for Environmental Learning with Multi-Robot Systems
Authors:
Dohyun Jang,
Jaehyun Yoo,
Clark Youngdong Son,
H. ** Kim
Abstract:
This paper proposes a cooperative environmental learning algorithm working in a fully distributed manner. A multi-robot system is more effective for exploration tasks than a single robot, but it involves the following challenges: 1) online distributed learning of environmental map using multiple robots; 2) generation of safe and efficient exploration path based on the learned map; and 3) maintenan…
▽ More
This paper proposes a cooperative environmental learning algorithm working in a fully distributed manner. A multi-robot system is more effective for exploration tasks than a single robot, but it involves the following challenges: 1) online distributed learning of environmental map using multiple robots; 2) generation of safe and efficient exploration path based on the learned map; and 3) maintenance of the scalability with respect to the number of robots. To this end, we divide the entire process into two stages of environmental learning and path planning. Distributed algorithms are applied in each stage and combined through communication between adjacent robots. The environmental learning algorithm uses a distributed Gaussian process, and the path planning algorithm uses a distributed Monte Carlo tree search. As a result, we build a scalable system without the constraint on the number of robots. Simulation results demonstrate the performance and scalability of the proposed system. Moreover, a real-world-dataset-based simulation validates the utility of our algorithm in a more realistic scenario.
△ Less
Submitted 29 December, 2021;
originally announced December 2021.
-
Unsupervised Image Denoising with Frequency Domain Knowledge
Authors:
Nahyun Kim,
Donggon Jang,
Sunhyeok Lee,
Bomi Kim,
Dae-Shik Kim
Abstract:
Supervised learning-based methods yield robust denoising results, yet they are inherently limited by the need for large-scale clean/noisy paired datasets. The use of unsupervised denoisers, on the other hand, necessitates a more detailed understanding of the underlying image statistics. In particular, it is well known that apparent differences between clean and noisy images are most prominent on h…
▽ More
Supervised learning-based methods yield robust denoising results, yet they are inherently limited by the need for large-scale clean/noisy paired datasets. The use of unsupervised denoisers, on the other hand, necessitates a more detailed understanding of the underlying image statistics. In particular, it is well known that apparent differences between clean and noisy images are most prominent on high-frequency bands, justifying the use of low-pass filters as part of conventional image preprocessing steps. However, most learning-based denoising methods utilize only one-sided information from the spatial domain without considering frequency domain information. To address this limitation, in this study we propose a frequency-sensitive unsupervised denoising method. To this end, a generative adversarial network (GAN) is used as a base structure. Subsequently, we include spectral discriminator and frequency reconstruction loss to transfer frequency knowledge into the generator. Results using natural and synthetic datasets indicate that our unsupervised learning method augmented with frequency information achieves state-of-the-art denoising performance, suggesting that frequency domain information could be a viable factor in improving the overall performance of unsupervised learning-based methods.
△ Less
Submitted 29 November, 2021;
originally announced November 2021.
-
A Review of Lithium-Ion Battery Models in Techno-economic Analyses of Power Systems
Authors:
Anton V. Vykhodtsev,
Darren Jang,
Qianpu Wang,
Hamidreza Zareipour,
William D. Rosehart
Abstract:
The penetration of the lithium-ion battery energy storage system (BESS) into the power system environment occurs at a colossal rate worldwide. This is mainly because it is considered as one of the major tools to decarbonize, digitalize, and democratize the electricity grid. The economic viability and technical reliability of projects with batteries require appropriate assessment because of high ca…
▽ More
The penetration of the lithium-ion battery energy storage system (BESS) into the power system environment occurs at a colossal rate worldwide. This is mainly because it is considered as one of the major tools to decarbonize, digitalize, and democratize the electricity grid. The economic viability and technical reliability of projects with batteries require appropriate assessment because of high capital expenditures, deterioration in charging/discharging performance and uncertainty with regulatory policies. Most of the power system economic studies employ a simple power-energy representation coupled with an empirical description of degradation to model the lithium-ion battery. This approach to modelling may result in violations of the safe operation and misleading estimates of the economic benefits. Recently, the number of publications on techno-economic analysis of BESS with more details on the lithium-ion battery performance has increased. The aim of this review paper is to explore these publications focused on the grid-scale BESS applications and to discuss the impacts of using more sophisticated modelling approaches. First, an overview of the three most popular battery models is given, followed by a review of the applications of such models. The possible directions of future research of employing detailed battery models in power systems' techno-economic studies are then explored.
△ Less
Submitted 16 June, 2021;
originally announced June 2021.
-
Optimal Control-Based UAV Path Planning with Dynamically-Constrained TSP with Neighborhoods
Authors:
Dae-Sung Jang,
Hyeok-Joo Chae,
Han-Lim Choi
Abstract:
This paper addresses path planning of an unmanned aerial vehicle (UAV) with remote sensing capabilities (or wireless communication capabilities). The goal of the path planning is to find a minimum-flight-time closed tour of the UAV visiting all executable areas of given remote sensing and communication tasks; in order to incorporate the nonlinear vehicle dynamics, this problem is regarded as a dyn…
▽ More
This paper addresses path planning of an unmanned aerial vehicle (UAV) with remote sensing capabilities (or wireless communication capabilities). The goal of the path planning is to find a minimum-flight-time closed tour of the UAV visiting all executable areas of given remote sensing and communication tasks; in order to incorporate the nonlinear vehicle dynamics, this problem is regarded as a dynamically-constrained traveling salesman problem with neighborhoods. To obtain a close-to-optimal solution for the path planning in a tractable manner, a sampling-based roadmap algorithm that embeds an optimal control-based path generation process is proposed. The algorithm improves the computational efficiency by reducing numerical computations required for optimizing inefficient local paths, and by extracting additional information from a roadmap of a fixed number of samples. Comparative numerical simulations validate the efficiency of the presented algorithm in reducing computation time and improving the solution quality compared to previous roadmap-based planning methods.
△ Less
Submitted 18 December, 2016;
originally announced December 2016.
-
Complexity Analysis of Heuristic Pulse Interleaving Algorithms for Multi-Target Tracking with Multiple Simultaneous Receive Beams
Authors:
Dae-Sung Jang,
Han-Lim Choi
Abstract:
This paper presents heuristic algorithms for interleaved pulse scheduling problems on multi-target tracking in pulse Doppler phased array radars that can process multiple simultaneous received beams. The interleaved pulse scheduling problems for element and subarray level digital beamforming architectures are formulated as the same integer program and the asymptotic time complexities of the algori…
▽ More
This paper presents heuristic algorithms for interleaved pulse scheduling problems on multi-target tracking in pulse Doppler phased array radars that can process multiple simultaneous received beams. The interleaved pulse scheduling problems for element and subarray level digital beamforming architectures are formulated as the same integer program and the asymptotic time complexities of the algorithms are analyzed.
△ Less
Submitted 5 December, 2014; v1 submitted 17 January, 2014;
originally announced January 2014.
-
Search Optimization for Minimum Load under Detection Performance Constraints in Multifunction Radars
Authors:
Dae-Sung Jang,
Han-Lim Choi,
Ji-Eun Roh
Abstract:
This paper presents a solution procedure of search parameter optimization for minimum load ensuring desired one-off and cumulative probabilities of detection in a multifunction phased array radar. The key approach is to convert this nonlinear optimization on four search parameters into a scalar optimization on signal-to-noise ratio by a semi-analytic process based on subproblem decomposition. The…
▽ More
This paper presents a solution procedure of search parameter optimization for minimum load ensuring desired one-off and cumulative probabilities of detection in a multifunction phased array radar. The key approach is to convert this nonlinear optimization on four search parameters into a scalar optimization on signal-to-noise ratio by a semi-analytic process based on subproblem decomposition. The efficacy of the proposed solution approach is verified with theoretical analysis and numerical case studies.
△ Less
Submitted 9 August, 2013;
originally announced August 2013.