Showing 1–2 of 2 results for author: Sim, J

  1. arXiv:2305.13108  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    Debiased Automatic Speech Recognition for Dysarthric Speech via Sample Reweighting with Sample Affinity Test

    Authors: Eungbeom Kim, Yunkee Chae, Jaeheon Sim, Kyogu Lee

    Abstract: Automatic speech recognition systems based on deep learning are mainly trained under empirical risk minimization (ERM). Since ERM utilizes the averaged performance on the data samples regardless of a group such as healthy or dysarthric speakers, ASR systems are unaware of the performance disparities across the groups. This results in biased ASR systems whose performance differences among groups ar…

    Submitted 27 June, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted by Interspeech 2023

  2. arXiv:2210.17143  [pdf, other]

    cs.SD cs.CL eess.AS

    Exploring Train and Test-Time Augmentations for Audio-Language Learning

    Authors: Eungbeom Kim, Jinhee Kim, Yoori Oh, Kyungsu Kim, Minju Park, Jaeheon Sim, Jinwoo Lee, Kyogu Lee

    Abstract: In this paper, we aim to unveil the impact of data augmentation in audio-language multi-modal learning, which has not been explored despite its importance. We explore various augmentation methods at not only train-time but also test-time and find out that proper data augmentation can lead to substantial improvements. Specifically, applying our proposed audio-language paired augmentation PairMix, w…

    Submitted 23 May, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: 5 pages, 4 figures