Showing 1–6 of 6 results for author: Jeon, W

  1. arXiv:2306.11473

    cs.CL eess.AS

    Timestamped Embedding-Matching Acoustic-to-Word CTC ASR

    Authors: Woojay Jeon

    Abstract: In this work, we describe a novel method of training an embedding-matching word-level connectionist temporal classification (CTC) automatic speech recognizer (ASR) such that it directly produces word start times and durations, required by many real-world applications, in addition to the transcription. The word timestamps enable the ASR to output word segmentations and word confusion networks witho…

    Submitted 20 June, 2023; originally announced June 2023.

  2. arXiv:2304.06237

    cs.LG eess.SP

    Deep learning based ECG segmentation for delineation of diverse arrhythmias

    Authors: Chankyu Joung, Mijin Kim, Taejin Paik, Seong-Ho Kong, Seung-Young Oh, Won Kyeong Jeon, Jae-hu Jeon, Joong-Sik Hong, Wan-Joong Kim, Woong Kook, Myung-Jin Cha, Otto van Koert

    Abstract: Accurate delineation of key waveforms in an ECG is a critical initial step in extracting relevant features to support the diagnosis and treatment of heart conditions. Although deep learning based methods using a segmentation model to locate the P, QRS, and T waves have shown promising results, their ability to handle signals exhibiting arrhythmia remains unclear. This study builds on existing rese…

    Submitted 6 September, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

  3. arXiv:2210.16726

    eess.AS cs.SD

    Improvements to Embedding-Matching Acoustic-to-Word ASR Using Multiple-Hypothesis Pronunciation-Based Embeddings

    Authors: Hao Yen, Woojay Jeon

    Abstract: In embedding-matching acoustic-to-word (A2W) ASR, every word in the vocabulary is represented by a fixed-dimension embedding vector that can be added or removed independently of the rest of the system. The approach is potentially an elegant solution for the dynamic out-of-vocabulary (OOV) words problem, where speaker- and context-dependent named entities like contact names must be incorporated int…

    Submitted 19 February, 2023; v1 submitted 29 October, 2022; originally announced October 2022.

    Comments: Accepted to ICASSP 2023

  4. arXiv:2007.10329

    eess.AS cs.LG cs.SD stat.ML

    Acoustic Neighbor Embeddings

    Authors: Woojay Jeon

    Abstract: This paper proposes a novel acoustic word embedding called Acoustic Neighbor Embeddings where speech or text of arbitrary length are mapped to a vector space of fixed, reduced dimensions by adapting stochastic neighbor embedding (SNE) to sequential inputs. The Euclidean distance between coordinates in the embedding space reflects the phonetic confusability between their corresponding sequences. Tw…

    Submitted 6 January, 2022; v1 submitted 20 July, 2020; originally announced July 2020.

    Comments: Anonymized version submitted to ICLR 2021

  5. arXiv:2003.00304

    cs.CL cs.SD eess.AS stat.ML

    Voice trigger detection from LVCSR hypothesis lattices using bidirectional lattice recurrent neural networks

    Authors: Woojay Jeon, Leo Liu, Henry Mason

    Abstract: We propose a method to reduce false voice triggers of a speech-enabled personal assistant by post-processing the hypothesis lattice of a server-side large-vocabulary continuous speech recognizer (LVCSR) via a neural network. We first discuss how an estimate of the posterior probability of the trigger phrase can be obtained from the hypothesis lattice using known techniques to perform detection, th…

    Submitted 29 February, 2020; originally announced March 2020.

    Comments: Presented at IEEE ICASSP, May 2019

    Journal ref: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, 2019, pp. 6356-6360

  6. arXiv:1907.09636

    cs.CL cs.SD eess.AS stat.ML

    On Modeling ASR Word Confidence

    Authors: Woojay Jeon, Maxwell Jordan, Mahesh Krishnamoorthy

    Abstract: We present a new method for computing ASR word confidences that effectively mitigates the effect of ASR errors for diverse downstream applications, improves the word error rate of the 1-best result, and allows better comparison of scores across different models. We propose 1) a new method for modeling word confidence using a Heterogeneous Word Confusion Network (HWCN) that addresses some key flaws…

    Submitted 2 June, 2020; v1 submitted 22 July, 2019; originally announced July 2019.

    Comments: Presented at IEEE ICASSP 2020, May 2020