Skip to main content

Showing 1–6 of 6 results for author: Golan, I

Searching in archive cs. Search in all archives.
.
  1. Task Agnostic Continual Learning Using Online Variational Bayes with Fixed-Point Updates

    Authors: Chen Zeno, Itay Golan, Elad Hoffer, Daniel Soudry

    Abstract: Background: Catastrophic forgetting is the notorious vulnerability of neural networks to the changes in the data distribution during learning. This phenomenon has long been considered a major obstacle for using learning agents in realistic continual learning settings. A large body of continual learning research assumes that task boundaries are known during training. However, only a few works consi… ▽ More

    Submitted 18 October, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

    Comments: The arXiv paper "Task Agnostic Continual Learning Using Online Variational Bayes" is a preliminary pre-print of this paper. The main differences between the versions are: 1. We develop new algorithmic framework (FOO-VB). 2. We add multivariate Gaussian and matrix variate Gaussian versions of the algorithm. 3. We demonstrate the new algorithm performance in task agnostic scenarios

    Journal ref: Neural Comput 2021; 33 (11)

  2. arXiv:2002.09277  [pdf, other

    cs.LG stat.ML

    Kernel and Rich Regimes in Overparametrized Models

    Authors: Blake Woodworth, Suriya Gunasekar, Jason D. Lee, Edward Moroshko, Pedro Savarese, Itay Golan, Daniel Soudry, Nathan Srebro

    Abstract: A recent line of work studies overparametrized neural networks in the "kernel regime," i.e. when the network behaves during training as a kernelized linear predictor, and thus training with gradient descent has the effect of finding the minimum RKHS norm solution. This stands in contrast to other studies which demonstrate how gradient descent on overparametrized multilayer networks can induce rich… ▽ More

    Submitted 27 July, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: This updates and significantly extends a previous article (arXiv:1906.05827), Sections 6 and 7 are the most major additions. 31 pages. arXiv admin note: text overlap with arXiv:1906.05827

  3. arXiv:1906.05827   

    cs.LG stat.ML

    Kernel and Rich Regimes in Overparametrized Models

    Authors: Blake Woodworth, Suriya Gunasekar, Pedro Savarese, Edward Moroshko, Itay Golan, Jason Lee, Daniel Soudry, Nathan Srebro

    Abstract: A recent line of work studies overparametrized neural networks in the "kernel regime," i.e. when the network behaves during training as a kernelized linear predictor, and thus training with gradient descent has the effect of finding the minimum RKHS norm solution. This stands in contrast to other studies which demonstrate how gradient descent on overparametrized multilayer networks can induce rich… ▽ More

    Submitted 25 February, 2020; v1 submitted 13 June, 2019; originally announced June 2019.

    Comments: This paper has been substantially modified, updated, and expanded with additional content (arXiv:2002.09277). To avoid confusion with already existing citations, we are withdrawing the old version of this article

  4. arXiv:1805.10917  [pdf, other

    cs.LG stat.ML

    Deep Anomaly Detection Using Geometric Transformations

    Authors: Izhak Golan, Ran El-Yaniv

    Abstract: We consider the problem of anomaly detection in images, and present a new detection technique. Given a sample of images, all known to belong to a "normal" class (e.g., dogs), we show how to train a deep neural model that can detect out-of-distribution images (i.e., non-dog objects). The main idea behind our scheme is to train a multi-class model to discriminate between dozens of geometric transfor… ▽ More

    Submitted 9 November, 2018; v1 submitted 28 May, 2018; originally announced May 2018.

  5. arXiv:1803.10123  [pdf, other

    stat.ML cs.LG

    Task Agnostic Continual Learning Using Online Variational Bayes

    Authors: Chen Zeno, Itay Golan, Elad Hoffer, Daniel Soudry

    Abstract: Catastrophic forgetting is the notorious vulnerability of neural networks to the change of the data distribution while learning. This phenomenon has long been considered a major obstacle for allowing the use of learning agents in realistic continual learning settings. A large body of continual learning research assumes that task boundaries are known during training. However, research for scenarios… ▽ More

    Submitted 12 February, 2019; v1 submitted 27 March, 2018; originally announced March 2018.

  6. arXiv:1803.01814  [pdf, other

    stat.ML cs.LG

    Norm matters: efficient and accurate normalization schemes in deep networks

    Authors: Elad Hoffer, Ron Banner, Itay Golan, Daniel Soudry

    Abstract: Over the past few years, Batch-Normalization has been commonly used in deep networks, allowing faster training and high performance for a wide variety of applications. However, the reasons behind its merits remained unanswered, with several shortcomings that hindered its use for certain tasks. In this work, we present a novel view on the purpose and function of normalization methods and weight-dec… ▽ More

    Submitted 7 February, 2019; v1 submitted 5 March, 2018; originally announced March 2018.

    Comments: http://papers.nips.cc/paper/7485-norm-matters-efficient-and-accurate-normalization-schemes-in-deep-networks

    Journal ref: NeurIPS2018