Skip to main content

Showing 1–6 of 6 results for author: Saletore, V

.
  1. arXiv:2103.10142  [pdf, other

    physics.data-an cs.AI hep-ex

    Reduced Precision Strategies for Deep Learning: A High Energy Physics Generative Adversarial Network Use Case

    Authors: Florian Rehm, Sofia Vallecorsa, Vikram Saletore, Hans Pabst, Adel Chaibi, Valeriu Codreanu, Kerstin Borras, Dirk Krücker

    Abstract: Deep learning is finding its way into high energy physics by replacing traditional Monte Carlo simulations. However, deep learning still requires an excessive amount of computational resources. A promising approach to make deep learning more efficient is to quantize the parameters of the neural networks to reduced precision. Reduced precision computing is extensively used in modern deep learning a… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

    Comments: Submitted at ICPRAM 2021; from CERN openlab - Intel collaboration

    Journal ref: ICPRAM 2021

  2. arXiv:2010.11630  [pdf, other

    astro-ph.IM astro-ph.GA eess.IV

    DeepGalaxy: Deducing the Properties of Galaxy Mergers from Images Using Deep Neural Networks

    Authors: Maxwell X. Cai, Jeroen Bédorf, Vikram A. Saletore, Valeriu Codreanu, Damian Podareanu, Adel Chaibi, Penny X. Qian

    Abstract: Galaxy mergers, the dynamical process during which two galaxies collide, are among the most spectacular phenomena in the Universe. During this process, the two colliding galaxies are tidally disrupted, producing significant visual features that evolve as a function of time. These visual features contain valuable clues for deducing the physical properties of the galaxy mergers. In this work, we pro… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: 7 pages, 7 figures. Accepted for publication at the 2020 IEEE/ACM Fifth Workshop on Deep Learning on Supercomputers (DLS)

  3. arXiv:1910.04852  [pdf, other

    eess.IV cs.CV

    Training Multiscale-CNN for Large Microscopy Image Classification in One Hour

    Authors: Kushal Datta, Imtiaz Hossain, Sun Choi, Vikram Saletore, Kyle Ambert, William J. Godinez, Xian Zhang

    Abstract: Existing approaches to train neural networks that use large images require to either crop or down-sample data during pre-processing, use small batch sizes, or split the model across devices mainly due to the prohibitively limited memory capacity available on GPUs and emerging accelerators. These techniques often lead to longer time to convergence or time to train (TTT), and in some cases, lower mo… ▽ More

    Submitted 10 March, 2020; v1 submitted 3 October, 2019; originally announced October 2019.

    Comments: 15 pages, 10 figures

    Journal ref: Workshop on Scalable Data Analytics in Scientific Computing, International SuperComputing 2019, Frankfurt, Germany

  4. arXiv:1906.00532  [pdf, other

    cs.LG

    Efficient 8-Bit Quantization of Transformer Neural Machine Language Translation Model

    Authors: Aishwarya Bhandare, Vamsi Sripathi, Deepthi Karkada, Vivek Menon, Sun Choi, Kushal Datta, Vikram Saletore

    Abstract: In this work, we quantize a trained Transformer machine language translation model leveraging INT8/VNNI instructions in the latest Intel$^\circledR$ Xeon$^\circledR$ Cascade Lake processors to improve inference performance while maintaining less than 0.5$\%$ drop in accuracy. To the best of our knowledge, this is the first attempt in the industry to quantize the Transformer model. This has high im… ▽ More

    Submitted 7 June, 2019; v1 submitted 2 June, 2019; originally announced June 2019.

    Comments: To appear at the Joint Workshop on On-Device Machine Learning & Compact Deep Neural Network Representations, 36th International Conference on Machine Learning, Long Beach, California, 2019

  5. arXiv:1905.04035  [pdf, other

    cs.LG cs.CL cs.DC

    Densifying Assumed-sparse Tensors: Improving Memory Efficiency and MPI Collective Performance during Tensor Accumulation for Parallelized Training of Neural Machine Translation Models

    Authors: Derya Cavdar, Valeriu Codreanu, Can Karakus, John A. Lockman III, Damian Podareanu, Vikram Saletore, Alexander Sergeev, Don D. Smith II, Victor Suthichai, Quy Ta, Srinivas Varadharajan, Lucas A. Wilson, Rengan Xu, Pei Yang

    Abstract: Neural machine translation - using neural networks to translate human language - is an area of active research exploring new neuron types and network topologies with the goal of dramatically improving machine translation performance. Current state-of-the-art approaches, such as the multi-head attention-based transformer, require very large translation corpuses and many epochs to produce models of… ▽ More

    Submitted 10 May, 2019; originally announced May 2019.

    Comments: 18 pages, 10 figures, accepted at the 2019 International Supercomputing Conference

  6. arXiv:1711.04291  [pdf, other

    stat.ML cs.LG

    Scale out for large minibatch SGD: Residual network training on ImageNet-1K with improved accuracy and reduced time to train

    Authors: Valeriu Codreanu, Damian Podareanu, Vikram Saletore

    Abstract: For the past 5 years, the ILSVRC competition and the ImageNet dataset have attracted a lot of interest from the Computer Vision community, allowing for state-of-the-art accuracy to grow tremendously. This should be credited to the use of deep artificial neural network designs. As these became more complex, the storage, bandwidth, and compute requirements increased. This means that with a non-distr… ▽ More

    Submitted 15 November, 2017; v1 submitted 12 November, 2017; originally announced November 2017.

    Comments: 10 pages, 4 figures, 13 tables