Showing 1–6 of 6 results for author: Giladi, N

Searching in archive cs.
  1. arXiv:2306.10598  [pdf, other]

    cs.LG

    DropCompute: simple and more robust distributed synchronous training via compute variance reduction

    Authors: Niv Giladi, Shahar Gottlieb, Moran Shkolnik, Asaf Karnieli, Ron Banner, Elad Hoffer, Kfir Yehuda Levy, Daniel Soudry

    Abstract: Background: Distributed training is essential for large scale training of deep neural networks (DNNs). The dominant methods for large scale DNN training are synchronous (e.g. All-Reduce), but these require waiting for all workers in each step. Thus, these methods are limited by the delays caused by straggling workers. Results: We study a typical scenario in which workers are straggling due to vari…

    Submitted 24 September, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: https://github.com/paper-submissions/dropcompute

    Journal ref: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  2. arXiv:2111.02780  [pdf]

    cs.LG

    Flood forecasting with machine learning models in an operational framework

    Authors: Sella Nevo, Efrat Morin, Adi Gerzi Rosenthal, Asher Metzger, Chen Barshai, Dana Weitzner, Dafi Voloshin, Frederik Kratzert, Gal Elidan, Gideon Dror, Gregory Begelman, Grey Nearing, Guy Shalev, Hila Noga, Ira Shavitt, Liora Yuklea, Moriah Royz, Niv Giladi, Nofar Peled Levi, Ofir Reich, Oren Gilon, Ronnie Maor, Shahar Timnat, Tal Shechter, Vladimir Anisimov, et al. (6 additional authors not shown)

    Abstract: The operational flood forecasting system by Google was developed to provide accurate real-time flood warnings to agencies and the public, with a focus on riverine floods in large, gauged rivers. It became operational in 2018 and has since expanded geographically. This forecasting system consists of four subsystems: data validation, stage forecasting, inundation modeling, and alert distribution. Ma…

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: 36 pages, 10 figures, 3 tables, 1 supplementary table (9 pages)

  3. arXiv:2106.07218  [pdf, other]

    cs.LG cs.CV

    Physics-Aware Downsampling with Deep Learning for Scalable Flood Modeling

    Authors: Niv Giladi, Zvika Ben-Haim, Sella Nevo, Yossi Matias, Daniel Soudry

    Abstract: Background: Floods are the most common natural disaster in the world, affecting the lives of hundreds of millions. Flood forecasting is therefore a vitally important endeavor, typically achieved using physical water flow simulations, which rely on accurate terrain elevation maps. However, such simulations, based on solving partial differential equations, are computationally prohibitive on a large…

    Submitted 31 October, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

    Journal ref: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  4. arXiv:1909.12340  [pdf, other]

    cs.LG stat.ML

    At Stability's Edge: How to Adjust Hyperparameters to Preserve Minima Selection in Asynchronous Training of Neural Networks?

    Authors: Niv Giladi, Mor Shpigel Nacson, Elad Hoffer, Daniel Soudry

    Abstract: Background: Recent developments have made it possible to accelerate neural networks training significantly using large batch sizes and data parallelism. Training in an asynchronous fashion, where delay occurs, can make training even more scalable. However, asynchronous training has its pitfalls, mainly a degradation in generalization, even after convergence of the algorithm. This gap remains not w…

    Submitted 13 February, 2020; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: ICLR 2020 Camera ready version

  5. arXiv:1905.11659  [pdf, other]

    cs.LG stat.ML

    Evaluating and Calibrating Uncertainty Prediction in Regression Tasks

    Authors: Dan Levi, Liran Gispan, Niv Giladi, Ethan Fetaya

    Abstract: Predicting not only the target but also an accurate measure of uncertainty is important for many machine learning applications and in particular safety-critical ones. In this work we study the calibration of uncertainty prediction for regression tasks which often arise in real-world systems. We show that the existing definition for calibration of a regression uncertainty [Kuleshov et al. 2018] has…

    Submitted 3 February, 2020; v1 submitted 28 May, 2019; originally announced May 2019.

  6. arXiv:1901.09335  [pdf, other]

    cs.LG stat.ML

    Augment your batch: better training with larger batches

    Authors: Elad Hoffer, Tal Ben-Nun, Itay Hubara, Niv Giladi, Torsten Hoefler, Daniel Soudry

    Abstract: Large-batch SGD is important for scaling training of deep neural networks. However, without fine-tuning hyperparameter schedules, the generalization of the model may be hampered. We propose to use batch augmentation: replicating instances of samples within the same batch with different data augmentations. Batch augmentation acts as a regularizer and an accelerator, increasing both generalization a…

    Submitted 27 January, 2019; originally announced January 2019.