Skip to main content

Showing 1–10 of 10 results for author: Finkler, U

.
  1. arXiv:2110.11199  [pdf, other

    cs.CL

    Asynchronous Decentralized Distributed Training of Acoustic Models

    Authors: Xiaodong Cui, Wei Zhang, Abdullah Kayi, Mingrui Liu, Ulrich Finkler, Brian Kingsbury, George Saon, David Kung

    Abstract: Large-scale distributed training of deep acoustic models plays an important role in today's high-performance automatic speech recognition (ASR). In this paper we investigate a variety of asynchronous decentralized distributed training strategies based on data parallel stochastic gradient descent (SGD) to show their superior performance over the commonly-used synchronous distributed training via al… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: Accepted by IEEE/ACM Transactions on Audio, Speech and Language Processing

  2. arXiv:2105.12655  [pdf, other

    cs.SE cs.AI

    CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks

    Authors: Ruchir Puri, David S. Kung, Geert Janssen, Wei Zhang, Giacomo Domeniconi, Vladimir Zolotov, Julian Dolby, Jie Chen, Mihir Choudhury, Lindsey Decker, Veronika Thost, Luca Buratti, Saurabh Pujar, Shyam Ramji, Ulrich Finkler, Susan Malaika, Frederick Reiss

    Abstract: Over the last several decades, software has been woven into the fabric of every aspect of our society. As software development surges and code infrastructure of enterprise applications ages, it is now more critical than ever to increase software development productivity and modernize legacy applications. Advances in deep learning and machine learning algorithms have enabled numerous breakthroughs,… ▽ More

    Submitted 29 August, 2021; v1 submitted 24 May, 2021; originally announced May 2021.

    Comments: 22 pages including references

  3. arXiv:2011.10608  [pdf, other

    cs.CV

    Large Scale Neural Architecture Search with Polyharmonic Splines

    Authors: Ulrich Finkler, Michele Merler, Rameswar Panda, Mayoore S. Jaiswal, Hui Wu, Kandan Ramakrishnan, Chun-Fu Chen, Minsik Cho, David Kung, Rogerio Feris, Bishwaranjan Bhattacharjee

    Abstract: Neural Architecture Search (NAS) is a powerful tool to automatically design deep neural networks for many tasks, including image classification. Due to the significant computational burden of the search phase, most NAS methods have focused so far on small, balanced datasets. All attempts at conducting NAS at large scale have employed small proxy sets, and then transferred the learned architectures… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

  4. arXiv:2006.13314  [pdf, other

    cs.CV cs.LG cs.NE

    NASTransfer: Analyzing Architecture Transferability in Large Scale Neural Architecture Search

    Authors: Rameswar Panda, Michele Merler, Mayoore Jaiswal, Hui Wu, Kandan Ramakrishnan, Ulrich Finkler, Chun-Fu Chen, Minsik Cho, David Kung, Rogerio Feris, Bishwaranjan Bhattacharjee

    Abstract: Neural Architecture Search (NAS) is an open and challenging problem in machine learning. While NAS offers great promise, the prohibitive computational demand of most of the existing NAS methods makes it difficult to directly search the architectures on large-scale tasks. The typical way of conducting large scale NAS is to search for an architectural building block on a small dataset (either using… ▽ More

    Submitted 11 February, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: 19 pages, 19 Figures, 6 Tables

    MSC Class: 68T05 ACM Class: I.2.6; I.4

  5. arXiv:2005.10053  [pdf, other

    cs.CV cs.LG eess.IV

    Map Generation from Large Scale Incomplete and Inaccurate Data Labels

    Authors: Rui Zhang, Conrad Albrecht, Wei Zhang, Xiaodong Cui, Ulrich Finkler, David Kung, Siyuan Lu

    Abstract: Accurately and globally map** human infrastructure is an important and challenging task with applications in routing, regulation compliance monitoring, and natural disaster response management etc.. In this paper we present progress in develo** an algorithmic pipeline and distributed compute system that automates the process of map creation using high resolution aerial images. Unlike previous… ▽ More

    Submitted 20 May, 2020; originally announced May 2020.

    Comments: This paper is accepted by KDD 2020

    ACM Class: I.2.10

  6. arXiv:2002.10502  [pdf, other

    cs.DC cs.LG cs.SD eess.AS

    Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition

    Authors: Xiaodong Cui, Wei Zhang, Ulrich Finkler, George Saon, Michael Picheny, David Kung

    Abstract: The past decade has witnessed great progress in Automatic Speech Recognition (ASR) due to advances in deep learning. The improvements in performance can be attributed to both improved models and large-scale training data. Key to training such models is the employment of efficient distributed learning techniques. In this article, we provide an overview of distributed training techniques for deep ne… ▽ More

    Submitted 24 February, 2020; originally announced February 2020.

    Comments: Accepted to IEEE Signal Processing Magazine

  7. arXiv:2002.01119  [pdf, other

    cs.LG cs.DC stat.ML

    Improving Efficiency in Large-Scale Decentralized Distributed Training

    Authors: Wei Zhang, Xiaodong Cui, Abdullah Kayi, Mingrui Liu, Ulrich Finkler, Brian Kingsbury, George Saon, Youssef Mroueh, Alper Buyuktosunoglu, Payel Das, David Kung, Michael Picheny

    Abstract: Decentralized Parallel SGD (D-PSGD) and its asynchronous variant Asynchronous Parallel SGD (AD-PSGD) is a family of distributed learning algorithms that have been demonstrated to perform well for large-scale deep learning tasks. One drawback of (A)D-PSGD is that the spectral gap of the mixing matrix decreases when the number of learners in the system increases, which hampers convergence. In this p… ▽ More

    Submitted 3 February, 2020; originally announced February 2020.

    Journal ref: 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP'2020) Oral

  8. arXiv:1907.05701  [pdf, other

    eess.AS cs.DC cs.LG cs.SD stat.ML

    A Highly Efficient Distributed Deep Learning System For Automatic Speech Recognition

    Authors: Wei Zhang, Xiaodong Cui, Ulrich Finkler, George Saon, Abdullah Kayi, Alper Buyuktosunoglu, Brian Kingsbury, David Kung, Michael Picheny

    Abstract: Modern Automatic Speech Recognition (ASR) systems rely on distributed deep learning to for quick training completion. To enable efficient distributed training, it is imperative that the training algorithms can converge with a large mini-batch size. In this work, we discovered that Asynchronous Decentralized Parallel Stochastic Gradient Descent (ADPSGD) can work with much larger batch size than com… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.

    Journal ref: INTERSPEECH 2019

  9. arXiv:1904.04956  [pdf, other

    cs.SD cs.CL cs.LG eess.AS stat.ML

    Distributed Deep Learning Strategies For Automatic Speech Recognition

    Authors: Wei Zhang, Xiaodong Cui, Ulrich Finkler, Brian Kingsbury, George Saon, David Kung, Michael Picheny

    Abstract: In this paper, we propose and investigate a variety of distributed deep learning strategies for automatic speech recognition (ASR) and evaluate them with a state-of-the-art Long short-term memory (LSTM) acoustic model on the 2000-hour Switchboard (SWB2000), which is one of the most widely used datasets for ASR performance benchmark. We first investigate what are the proper hyper-parameters (e.g.,… ▽ More

    Submitted 9 April, 2019; originally announced April 2019.

    Comments: Published in ICASSP'19

  10. arXiv:1708.02188  [pdf, ps, other

    cs.DC cs.AI cs.LG

    PowerAI DDL

    Authors: Minsik Cho, Ulrich Finkler, Sameer Kumar, David Kung, Vaibhav Saxena, Dheeraj Sreedhar

    Abstract: As deep neural networks become more complex and input datasets grow larger, it can take days or even weeks to train a deep neural network to the desired accuracy. Therefore, distributed Deep Learning at a massive scale is a critical capability, since it offers the potential to reduce the training time from weeks to hours. In this paper, we present a software-hardware co-optimized distributed Deep… ▽ More

    Submitted 7 August, 2017; originally announced August 2017.