Showing 1–4 of 4 results for author: Kung, D

  1. arXiv:2005.10053

    cs.CV cs.LG eess.IV

    Map Generation from Large Scale Incomplete and Inaccurate Data Labels

    Authors: Rui Zhang, Conrad Albrecht, Wei Zhang, Xiaodong Cui, Ulrich Finkler, David Kung, Siyuan Lu

    Abstract: Accurately and globally mapping human infrastructure is an important and challenging task with applications in routing, regulation compliance monitoring, natural disaster response management, etc. In this paper we present progress in developing an algorithmic pipeline and distributed compute system that automates the process of map creation using high resolution aerial images. Unlike previous…

    Submitted 20 May, 2020; originally announced May 2020.

    Comments: This paper is accepted by KDD 2020

    ACM Class: I.2.10

  2. arXiv:2002.10502

    cs.DC cs.LG cs.SD eess.AS

    Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition

    Authors: Xiaodong Cui, Wei Zhang, Ulrich Finkler, George Saon, Michael Picheny, David Kung

    Abstract: The past decade has witnessed great progress in Automatic Speech Recognition (ASR) due to advances in deep learning. The improvements in performance can be attributed to both improved models and large-scale training data. Key to training such models is the employment of efficient distributed learning techniques. In this article, we provide an overview of distributed training techniques for deep ne…

    Submitted 24 February, 2020; originally announced February 2020.

    Comments: Accepted to IEEE Signal Processing Magazine

  3. arXiv:1907.05701

    eess.AS cs.DC cs.LG cs.SD stat.ML

    A Highly Efficient Distributed Deep Learning System For Automatic Speech Recognition

    Authors: Wei Zhang, Xiaodong Cui, Ulrich Finkler, George Saon, Abdullah Kayi, Alper Buyuktosunoglu, Brian Kingsbury, David Kung, Michael Picheny

    Abstract: Modern Automatic Speech Recognition (ASR) systems rely on distributed deep learning for quick training completion. To enable efficient distributed training, it is imperative that the training algorithms can converge with a large mini-batch size. In this work, we discovered that Asynchronous Decentralized Parallel Stochastic Gradient Descent (ADPSGD) can work with much larger batch size than com…

    Submitted 10 July, 2019; originally announced July 2019.

    Journal ref: INTERSPEECH 2019

  4. arXiv:1904.04956

    cs.SD cs.CL cs.LG eess.AS stat.ML

    Distributed Deep Learning Strategies For Automatic Speech Recognition

    Authors: Wei Zhang, Xiaodong Cui, Ulrich Finkler, Brian Kingsbury, George Saon, David Kung, Michael Picheny

    Abstract: In this paper, we propose and investigate a variety of distributed deep learning strategies for automatic speech recognition (ASR) and evaluate them with a state-of-the-art long short-term memory (LSTM) acoustic model on the 2000-hour Switchboard (SWB2000) dataset, one of the most widely used datasets for ASR performance benchmarking. We first investigate what are the proper hyper-parameters (e.g.,…

    Submitted 9 April, 2019; originally announced April 2019.

    Comments: Published in ICASSP'19