Showing 1–4 of 4 results for author: Giladi, N

Searching in archive stat.
  1. arXiv:1909.12340  [pdf, other]

    cs.LG stat.ML

    At Stability's Edge: How to Adjust Hyperparameters to Preserve Minima Selection in Asynchronous Training of Neural Networks?

    Authors: Niv Giladi, Mor Shpigel Nacson, Elad Hoffer, Daniel Soudry

    Abstract: Background: Recent developments have made it possible to accelerate neural networks training significantly using large batch sizes and data parallelism. Training in an asynchronous fashion, where delay occurs, can make training even more scalable. However, asynchronous training has its pitfalls, mainly a degradation in generalization, even after convergence of the algorithm. This gap remains not w…

    Submitted 13 February, 2020; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: ICLR 2020 Camera ready version

  2. arXiv:1905.11659  [pdf, other]

    cs.LG stat.ML

    Evaluating and Calibrating Uncertainty Prediction in Regression Tasks

    Authors: Dan Levi, Liran Gispan, Niv Giladi, Ethan Fetaya

    Abstract: Predicting not only the target but also an accurate measure of uncertainty is important for many machine learning applications and in particular safety-critical ones. In this work we study the calibration of uncertainty prediction for regression tasks which often arise in real-world systems. We show that the existing definition for calibration of a regression uncertainty [Kuleshov et al. 2018] has…

    Submitted 3 February, 2020; v1 submitted 28 May, 2019; originally announced May 2019.

  3. arXiv:1901.09335  [pdf, other]

    cs.LG stat.ML

    Augment your batch: better training with larger batches

    Authors: Elad Hoffer, Tal Ben-Nun, Itay Hubara, Niv Giladi, Torsten Hoefler, Daniel Soudry

    Abstract: Large-batch SGD is important for scaling training of deep neural networks. However, without fine-tuning hyperparameter schedules, the generalization of the model may be hampered. We propose to use batch augmentation: replicating instances of samples within the same batch with different data augmentations. Batch augmentation acts as a regularizer and an accelerator, increasing both generalization a…

    Submitted 27 January, 2019; originally announced January 2019.

  4. arXiv:1705.09591  [pdf]

    stat.ME

    Estimation of Genetic Risk Function with Covariates in the Presence of Missing Genotypes

    Authors: Annie J. Lee, Karen Marder, Helen Mejia-Santana, Avi Orr-Urtreger, Nir Giladi, Susan Bressman, Yuanjia Wang

    Abstract: In genetic epidemiological studies, family history data are collected on relatives of study participants and used to estimate the age-specific risk of disease for individuals who carry a causal mutation. However, a family member's genotype data may not be collected due to the high cost of in-person interview to obtain blood sample or death of a relative. Previously, efficient nonparametric genotyp…

    Submitted 26 May, 2017; originally announced May 2017.

    Comments: 16 pages, 5 tables, 4 figures (7 Supplementary pages, 4 Supplementary tables, 13 Supplementary figures)