Showing 1–7 of 7 results for author: Ben-Shaul, I

  1. arXiv:2305.15614  [pdf, other]

    cs.LG cs.AI

    Reverse Engineering Self-Supervised Learning

    Authors: Ido Ben-Shaul, Ravid Shwartz-Ziv, Tomer Galanti, Shai Dekel, Yann LeCun

    Abstract: Self-supervised learning (SSL) is a powerful tool in machine learning, but understanding the learned representations and their underlying mechanisms remains a challenge. This paper presents an in-depth empirical analysis of SSL-trained representations, encompassing diverse models, architectures, and hyperparameters. Our study reveals an intriguing aspect of the SSL training process: it inherently…

    Submitted 31 May, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  2. arXiv:2301.04605  [pdf, ps, other]

    cs.LG cs.NE math.FA

    Exploring the Approximation Capabilities of Multiplicative Neural Networks for Smooth Functions

    Authors: Ido Ben-Shaul, Tomer Galanti, Shai Dekel

    Abstract: Multiplication layers are a key component in various influential neural network modules, including self-attention and hypernetwork layers. In this paper, we investigate the approximation capabilities of deep neural networks with intermediate neurons connected by simple multiplication operations. We consider two classes of target functions: generalized bandlimited functions, which are frequently us…

    Submitted 11 January, 2023; originally announced January 2023.

    MSC Class: 41A25; 68Q32; 68T07

  3. arXiv:2202.09028  [pdf, other]

    cs.LG

    On the Implicit Bias Towards Minimal Depth of Deep Neural Networks

    Authors: Tomer Galanti, Liane Galanti, Ido Ben-Shaul

    Abstract: Recent results in the literature suggest that the penultimate (second-to-last) layer representations of neural networks that are trained for classification exhibit a clustering property called neural collapse (NC). We study the implicit bias of stochastic gradient descent (SGD) in favor of low-depth solutions when training deep neural networks. We characterize a notion of effective depth that meas…

    Submitted 27 September, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

  4. arXiv:2201.08924  [pdf, other]

    cs.LG

    Nearest Class-Center Simplification through Intermediate Layers

    Authors: Ido Ben-Shaul, Shai Dekel

    Abstract: Recent advances in theoretical Deep Learning have introduced geometric properties that occur during training, past the Interpolation Threshold, where the training error reaches zero. We inquire into the phenomenon termed Neural Collapse in the intermediate layers of the networks, and emphasize the inner workings of Nearest Class-Center Mismatch inside the deep network. We further show that these proces…

    Submitted 11 June, 2022; v1 submitted 21 January, 2022; originally announced January 2022.

  5. arXiv:2105.06849  [pdf, other]

    cs.LG math.FA

    Sparsity-Probe: Analysis tool for Deep Learning Models

    Authors: Ido Ben-Shaul, Shai Dekel

    Abstract: We propose a probe for the analysis of deep learning architectures that is based on machine learning and approximation-theoretic principles. Given a deep learning architecture and a training set, during or after training, the Sparsity Probe allows one to analyze the performance of intermediate layers by quantifying the geometrical features of representations of the training set. We show how the Spar…

    Submitted 14 May, 2021; originally announced May 2021.

  6. arXiv:2008.10548  [pdf, other]

    cs.CV

    Certainty Pooling for Multiple Instance Learning

    Authors: Jacob Gildenblat, Ido Ben-Shaul, Zvi Lapp, Eldad Klaiman

    Abstract: Multiple Instance Learning is a form of weakly supervised learning in which the data is arranged in sets of instances called bags, with one label assigned per bag. The bag-level class prediction is derived from the multiple instances through application of a permutation-invariant pooling operator on instance predictions or embeddings. We present a novel pooling operator called Certainty Poo…

    Submitted 24 August, 2020; originally announced August 2020.

  7. arXiv:2007.10205  [pdf, other]

    cs.LG math.NA stat.ML

    Solving the functional Eigen-Problem using Neural Networks

    Authors: Ido Ben-Shaul, Leah Bar, Nir Sochen

    Abstract: In this work, we explore the ability of neural networks (NNs) to serve as a tool for finding eigen-pairs of ordinary differential equations. The question we aim to address is whether, given a self-adjoint operator, we can learn its eigenfunctions and their matching eigenvalues. The topic of solving the eigen-problem is widely discussed in Image Processing, as many image processing algori…

    Submitted 20 July, 2020; originally announced July 2020.