Showing 1–3 of 3 results for author: Cho, W I

  1. TutorNet: Towards Flexible Knowledge Distillation for End-to-End Speech Recognition

    Authors: Ji Won Yoon, Hyeonseung Lee, Hyung Yong Kim, Won Ik Cho, Nam Soo Kim

    Abstract: In recent years, there has been a great deal of research in developing end-to-end speech recognition models, which enable simplifying the traditional pipeline and achieving promising results. Despite their remarkable performance improvements, end-to-end models typically require expensive computational cost to show successful performance. To reduce this computational burden, knowledge distillation…

    Submitted 16 September, 2021; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: Accepted by IEEE/ACM Transactions on Audio, Speech and Language Processing

  2. arXiv:2005.08213

    cs.CL cs.SD eess.AS

    Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation

    Authors: Won Ik Cho, Donghyun Kwak, Ji Won Yoon, Nam Soo Kim

    Abstract: Speech is one of the most effective means of communication and is full of information that helps the transmission of utterer's thoughts. However, mainly due to the cumbersome processing of acoustic features, phoneme or word posterior probability has frequently been discarded in understanding the natural language. Thus, some recent spoken language understanding (SLU) modules have utilized end-to-en…

    Submitted 8 August, 2020; v1 submitted 17 May, 2020; originally announced May 2020.

    Comments: Interspeech 2020 Camera-ready

  3. arXiv:1910.09275

    cs.CL eess.AS

    Text Matters but Speech Influences: A Computational Analysis of Syntactic Ambiguity Resolution

    Authors: Won Ik Cho, Jeonghwa Cho, Woo Hyun Kang, Nam Soo Kim

    Abstract: Analyzing how human beings resolve syntactic ambiguity has long been an issue of interest in the field of linguistics. It is, at the same time, one of the most challenging issues for spoken language understanding (SLU) systems as well. As syntactic ambiguity is intertwined with issues regarding prosody and semantics, the computational approach toward speech intention identification is expected to…

    Submitted 21 May, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: CogSci 2020 Camera-ready