
Showing 1–4 of 4 results for author: Gemmeke, J F

  1. arXiv:1901.10680  [pdf, other]

    cs.CL

    Effective weakly supervised semantic frame induction using expression sharing in hierarchical hidden Markov models

    Authors: Janneke van de Loo, Jort F. Gemmeke, Guy De Pauw, Bart Ons, Walter Daelemans, Hugo Van hamme

    Abstract: We present a framework for the induction of semantic frames from utterances in the context of an adaptive command-and-control interface. The system is trained on an individual user's utterances and the corresponding semantic frames representing controls. During training, no prior information on the alignment between utterance segments and frame slots and values is available. In addition, semantic…

    Submitted 30 January, 2019; originally announced January 2019.

  2. arXiv:1609.09430  [pdf, other]

    cs.SD cs.LG stat.ML

    CNN Architectures for Large-Scale Audio Classification

    Authors: Shawn Hershey, Sourish Chaudhuri, Daniel P. W. Ellis, Jort F. Gemmeke, Aren Jansen, R. Channing Moore, Manoj Plakal, Devin Platt, Rif A. Saurous, Bryan Seybold, Malcolm Slaney, Ron J. Weiss, Kevin Wilson

    Abstract: Convolutional Neural Networks (CNNs) have proven very effective in image classification and show promise for audio. We use various CNN architectures to classify the soundtracks of a dataset of 70M training videos (5.24 million hours) with 30,871 video-level labels. We examine fully connected Deep Neural Networks (DNNs), AlexNet [1], VGG [2], Inception [3], and ResNet [4]. We investigate varying th…

    Submitted 10 January, 2017; v1 submitted 29 September, 2016; originally announced September 2016.

    Comments: Accepted for publication at ICASSP 2017. Changes: Added definitions of mAP, AUC, and d-prime. Updated mAP/AUC/d-prime numbers for Audio Set based on changes in the latest Audio Set revision. Changed wording to fit the 4-page limit with the new additions.

  3. arXiv:0903.3198  [pdf, ps, other]

    cs.SD

    TR02: State dependent oracle masks for improved dynamical features

    Authors: J. F. Gemmeke, B. Cranen

    Abstract: Using the AURORA-2 digit recognition task, we show that recognition accuracies obtained with classical, SNR based oracle masks can be substantially improved by using a state-dependent mask estimation technique.

    Submitted 18 March, 2009; originally announced March 2009.

  4. arXiv:0901.2416  [pdf, ps, other]

    cs.SD

    TR01: Time-continuous Sparse Imputation

    Authors: J. F. Gemmeke, B. Cranen

    Abstract: An effective way to increase the noise robustness of automatic speech recognition is to label noisy speech features as either reliable or unreliable (missing) prior to decoding, and to replace the missing ones by clean speech estimates. We present a novel method to obtain such clean speech estimates. Unlike previous imputation frameworks which work on a frame-by-frame basis, our method focuses o…

    Submitted 16 January, 2009; originally announced January 2009.

    Comments: 9 pages, 5 figures, Technical Report