Showing 1–2 of 2 results for author: Bang, J
-
Boosting Active Learning for Speech Recognition with Noisy Pseudo-labeled Samples
Authors:
Jihwan Bang,
Heesu Kim,
YoungJoon Yoo,
Jung-Woo Ha
Abstract:
The cost of annotating transcriptions for large speech corpora is a bottleneck to fully exploiting the potential capacity of deep neural network-based automatic speech recognition models. In this paper, we present a new training pipeline that boosts the conventional active learning approach, which targets label-efficient learning, to resolve this problem. Existing active learning methods focus only on selecting a set of informative samples under a labeling budget. Going one step further, we suggest that training efficiency can be improved by also utilizing the unlabeled samples that exceed the labeling budget, through a carefully configured unsupervised loss that complements the supervised loss. We propose a new unsupervised loss based on consistency regularization, and we configure appropriate augmentation techniques for utterances so that consistency regularization can be adopted in the automatic speech recognition task. Through qualitative and quantitative experiments on a real-world dataset and under real-usage scenarios, we show that the proposed training pipeline can boost the efficacy of active learning approaches, thus successfully reducing a sustainable amount of human labeling cost.
Submitted 5 November, 2020; v1 submitted 19 June, 2020;
originally announced June 2020.
-
Classification of Visual Perception and Imagery based EEG Signals Using Convolutional Neural Networks
Authors:
Ji-Seon Bang,
Ji-Hoon Jeong,
Dong-Ok Won
Abstract:
Visual perception (VP) and visual imagery (VI) paradigms have recently been investigated in several brain-computer interface (BCI) studies. VP and VI are defined as changes in brain signals when perceiving and memorizing visual information, respectively. These paradigms could be alternatives to previous visual-based paradigms, which have limitations such as fatigue and low information transfer rates (ITR). In this study, we analyzed VP and VI to investigate the possibility of controlling a BCI. First, we conducted a time-frequency analysis with event-related spectral perturbation. In addition, two types of decoding accuracies were obtained with a convolutional neural network to verify whether the brain signals of each class can be distinguished in VP and whether the VP and VI paradigms can be differentiated from each other. As a result, the 6-class classification performance in VP was 32.56%, and the binary classification performance, which classifies the two paradigms, was 90.16%.
Submitted 5 February, 2021; v1 submitted 15 May, 2020;
originally announced May 2020.