Skip to main content

Showing 1–4 of 4 results for author: Hetz, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.02649  [pdf, other

    eess.AS cs.LG cs.SD

    Keyword-Guided Adaptation of Automatic Speech Recognition

    Authors: Aviv Shamsian, Aviv Navon, Neta Glazer, Gill Hetz, Joseph Keshet

    Abstract: Automatic Speech Recognition (ASR) technology has made significant progress in recent years, providing accurate transcription across various domains. However, some challenges remain, especially in noisy environments and specialized jargon. In this paper, we propose a novel approach for improved jargon word recognition by contextual biasing Whisper-based models. We employ a keyword spotting model t… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to InterSpeech 2024

  2. arXiv:2310.19708  [pdf, other

    cs.CL cs.LG

    Combining Language Models For Specialized Domains: A Colorful Approach

    Authors: Daniel Eitan, Menachem Pirchi, Neta Glazer, Shai Meital, Gil Ayach, Gidon Krendel, Aviv Shamsian, Aviv Navon, Gil Hetz, Joseph Keshet

    Abstract: General purpose language models (LMs) encounter difficulties when processing domain-specific jargon and terminology, which are frequently utilized in specialized fields such as medicine or industrial settings. Moreover, they often find it challenging to interpret mixed speech that blends general language with specialized jargon. This poses a challenge for automatic speech recognition systems opera… ▽ More

    Submitted 1 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Under Review

  3. arXiv:2309.08561  [pdf, other

    eess.AS cs.LG cs.SD

    Open-vocabulary Keyword-spotting with Adaptive Instance Normalization

    Authors: Aviv Navon, Aviv Shamsian, Neta Glazer, Gill Hetz, Joseph Keshet

    Abstract: Open vocabulary keyword spotting is a crucial and challenging task in automatic speech recognition (ASR) that focuses on detecting user-defined keywords within a spoken utterance. Keyword spotting methods commonly map the audio utterance and keyword into a joint embedding space to obtain some affinity score. In this work, we propose AdaKWS, a novel method for keyword spotting in which a text encod… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: Under Review

  4. arXiv:1904.13236  [pdf, other

    cs.LG stat.ME

    A General Spatio-Temporal Clustering-Based Non-local Formulation for Multiscale Modeling of Compartmentalized Reservoirs

    Authors: Soheil Esmaeilzadeh, Amir Salehi, Gill Hetz, Feyisayo Olalotiti-lawal, Hamed Darabi, David Castineira

    Abstract: Representing the reservoir as a network of discrete compartments with neighbor and non-neighbor connections is a fast, yet accurate method for analyzing oil and gas reservoirs. Automatic and rapid detection of coarse-scale compartments with distinct static and dynamic properties is an integral part of such high-level reservoir analysis. In this work, we present a hybrid framework specific to reser… ▽ More

    Submitted 28 April, 2019; originally announced April 2019.

    Report number: SPE-195329-MS

    Journal ref: Machine Learning Session, WRM 2019 Conference