Showing 1–18 of 18 results for author: Dekel, S

Searching in archive cs.
  1. arXiv:2406.19997  [pdf, other]

    cs.LG cs.AI cs.CV

    Wavelets Are All You Need for Autoregressive Image Generation

    Authors: Wael Mattar, Idan Levy, Nir Sharon, Shai Dekel

    Abstract: In this paper, we take a new approach to autoregressive image generation that is based on two main ingredients. The first is wavelet image coding, which allows to tokenize the visual details of an image from coarse to fine details by ordering the information starting with the most significant bits of the most significant wavelet coefficients. The second is a variant of a language transformer whose…

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 16 pages, 10 figures

    MSC Class: 65T60

    ACM Class: I.4.2; I.4.5; I.4.10

  2. arXiv:2305.15614  [pdf, other]

    cs.LG cs.AI

    Reverse Engineering Self-Supervised Learning

    Authors: Ido Ben-Shaul, Ravid Shwartz-Ziv, Tomer Galanti, Shai Dekel, Yann LeCun

    Abstract: Self-supervised learning (SSL) is a powerful tool in machine learning, but understanding the learned representations and their underlying mechanisms remains a challenge. This paper presents an in-depth empirical analysis of SSL-trained representations, encompassing diverse models, architectures, and hyperparameters. Our study reveals an intriguing aspect of the SSL training process: it inherently…

    Submitted 31 May, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  3. arXiv:2304.11706  [pdf, other]

    cs.CV

    Deep Convolutional Tables: Deep Learning without Convolutions

    Authors: Shay Dekel, Yosi Keller, Aharon Bar-Hillel

    Abstract: We propose a novel formulation of deep networks that do not use dot-product neurons and rely on a hierarchy of voting tables instead, denoted as Convolutional Tables (CT), to enable accelerated CPU-based inference. Convolutional layers are the most time-consuming bottleneck in contemporary deep learning techniques, severely limiting their use in Internet of Things and CPU-based devices. The propos…

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: Accepted for publication in IEEE Transactions on Neural Networks and Learning Systems

  4. arXiv:2303.02615  [pdf, other]

    cs.CV

    Estimating Extreme 3D Image Rotation with Transformer Cross-Attention

    Authors: Shay Dekel, Yosi Keller, Martin Cadik

    Abstract: The estimation of large and extreme image rotation plays a key role in multiple computer vision domains, where the rotated images are related by a limited or a non-overlapping field of view. Contemporary approaches apply convolutional neural networks to compute a 4D correlation volume to estimate the relative rotation between image pairs. In this work, we propose a cross-attention-based approach t…

    Submitted 8 March, 2024; v1 submitted 5 March, 2023; originally announced March 2023.

    Journal ref: CVPR 2024

  5. arXiv:2302.05322  [pdf, ps, other]

    cs.LG math.NA

    Numerical Methods For PDEs Over Manifolds Using Spectral Physics Informed Neural Networks

    Authors: Yuval Zelig, Shai Dekel

    Abstract: We introduce an approach for solving PDEs over manifolds using physics informed neural networks whose architecture aligns with spectral methods. The networks are trained to take in as input samples of an initial condition, a time stamp and point(s) on the manifold and then output the solution's value at the given time and point(s). We provide proofs of our method for the heat equation on the inter…

    Submitted 3 September, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

    Comments: 25 pages

  6. arXiv:2301.04605  [pdf, ps, other]

    cs.LG cs.NE math.FA

    Exploring the Approximation Capabilities of Multiplicative Neural Networks for Smooth Functions

    Authors: Ido Ben-Shaul, Tomer Galanti, Shai Dekel

    Abstract: Multiplication layers are a key component in various influential neural network modules, including self-attention and hypernetwork layers. In this paper, we investigate the approximation capabilities of deep neural networks with intermediate neurons connected by simple multiplication operations. We consider two classes of target functions: generalized bandlimited functions, which are frequently us…

    Submitted 11 January, 2023; originally announced January 2023.

    MSC Class: 41A25; 68Q32; 68T07

  7. arXiv:2204.09051  [pdf, ps, other]

    eess.IV cs.AI cs.CV cs.LG

    PR-DAD: Phase Retrieval Using Deep Auto-Decoders

    Authors: Leon Gugel, Shai Dekel

    Abstract: Phase retrieval is a well known ill-posed inverse problem where one tries to recover images given only the magnitude values of their Fourier transform as input. In recent years, new algorithms based on deep learning have been proposed, providing breakthrough results that surpass the results of the classical methods. In this work we provide a novel deep learning architecture PR-DAD (Phase Retrieval…

    Submitted 18 April, 2022; originally announced April 2022.

  8. arXiv:2201.08924  [pdf, other]

    cs.LG

    Nearest Class-Center Simplification through Intermediate Layers

    Authors: Ido Ben-Shaul, Shai Dekel

    Abstract: Recent advances in theoretical Deep Learning have introduced geometric properties that occur during training, past the Interpolation Threshold -- where the training error reaches zero. We inquire into the phenomena coined Neural Collapse in the intermediate layers of the networks, and emphasize the innerworkings of Nearest Class-Center Mismatch inside the deepnet. We further show that these proces…

    Submitted 11 June, 2022; v1 submitted 21 January, 2022; originally announced January 2022.

  9. arXiv:2105.06849  [pdf, other]

    cs.LG math.FA

    Sparsity-Probe: Analysis tool for Deep Learning Models

    Authors: Ido Ben-Shaul, Shai Dekel

    Abstract: We propose a probe for the analysis of deep learning architectures that is based on machine learning and approximation theoretical principles. Given a deep learning architecture and a training set, during or after training, the Sparsity Probe allows to analyze the performance of intermediate layers by quantifying the geometrical features of representations of the training set. We show how the Spar…

    Submitted 14 May, 2021; originally announced May 2021.

  10. arXiv:1805.02642  [pdf, ps, other]

    cs.LG eess.SP stat.ML

    Wavelet Decomposition of Gradient Boosting

    Authors: Shai Dekel, Oren Elisha, Ohad Morgan

    Abstract: In this paper we introduce a significant improvement to the popular tree-based Stochastic Gradient Boosting algorithm using a wavelet decomposition of the trees. This approach is based on harmonic analysis and approximation theoretical elements, and as we show through extensive experimentation, our wavelet based method generally outperforms existing methods, particularly in difficult scenarios of…

    Submitted 3 May, 2019; v1 submitted 7 May, 2018; originally announced May 2018.

  11. arXiv:1710.03263  [pdf, ps, other]

    cs.AI cs.LG stat.ML

    Function space analysis of deep learning representation layers

    Authors: Oren Elisha, Shai Dekel

    Abstract: In this paper we propose a function space approach to Representation Learning and the analysis of the representation layers in deep learning architectures. We show how to compute a weak-type Besov smoothness index that quantifies the geometry of the clustering in the feature space. This approach was already applied successfully to improve the performance of machine learning algorithms such as the…

    Submitted 9 October, 2017; originally announced October 2017.

  12. arXiv:1602.04358  [pdf, ps, other]

    cs.AI cs.DS stat.ML

    Machine olfaction using time scattering of sensor multiresolution graphs

    Authors: Leonid Gugel, Yoel Shkolnisky, Shai Dekel

    Abstract: In this paper we construct a learning architecture for high dimensional time series sampled by sensor arrangements. Using a redundant wavelet decomposition on a graph constructed over the sensor locations, our algorithm is able to construct discriminative features that exploit the mutual information between the sensors. The algorithm then applies scattering networks to the time series graphs to cr…

    Submitted 13 February, 2016; originally announced February 2016.

  13. Stable Support Recovery of Stream of Pulses with Application to Ultrasound Imaging

    Authors: Tamir Bendory, Avinoam Bar-Zion, Dan Adam, Shai Dekel, Arie Feuer

    Abstract: This paper considers the problem of estimating the delays of a weighted superposition of pulses, called stream of pulses, in a noisy environment. We show that the delays can be estimated using a tractable convex optimization problem with a localization error proportional to the square root of the noise level. Furthermore, all false detections produced by the algorithm have small amplitudes. Numeri…

    Submitted 29 December, 2015; v1 submitted 26 July, 2015; originally announced July 2015.

  14. arXiv:1501.01825  [pdf, other]

    cs.IT

    Unified Convex Optimization Approach to Super-Resolution Based on Localized Kernels

    Authors: Tamir Bendory, Shai Dekel, Arie Feuer

    Abstract: The problem of resolving the fine details of a signal from its coarse scale measurements or, as it is commonly referred to in the literature, the super-resolution problem arises naturally in engineering and physics in a variety of settings. We suggest a unified convex optimization approach for super-resolution. The key is the construction of an interpolating polynomial based on localized kernels.…

    Submitted 12 April, 2015; v1 submitted 8 January, 2015; originally announced January 2015.

  15. arXiv:1412.6254  [pdf, ps, other]

    cs.IT math.NA

    Exact recovery of non-uniform splines from the projection onto spaces of algebraic polynomials

    Authors: Tamir Bendory, Shai Dekel, Arie Feuer

    Abstract: In this work we consider the problem of recovering non-uniform splines from their projection onto spaces of algebraic polynomials. We show that under a certain Chebyshev-type separation condition on its knots, a spline whose inner-products with a polynomial basis and boundary conditions are known, can be recovered using Total Variation norm minimization. The proof of the uniqueness of the solution…

    Submitted 19 December, 2014; originally announced December 2014.

  16. arXiv:1412.3284  [pdf, other]

    cs.IT

    Exact recovery of Dirac ensembles from the projection onto spaces of spherical harmonics

    Authors: Tamir Bendory, Shai Dekel, Arie Feuer

    Abstract: In this work we consider the problem of recovering an ensemble of Diracs on the sphere from its projection onto spaces of spherical harmonics. We show that under an appropriate separation condition on the unknown locations of the Diracs, the ensemble can be recovered through Total Variation norm minimization. The proof of the uniqueness of the solution uses the method of `dual' interpolating polyn…

    Submitted 10 December, 2014; originally announced December 2014.

  17. Super-resolution on the Sphere using Convex Optimization

    Authors: Tamir Bendory, Shai Dekel, Arie Feuer

    Abstract: This paper considers the problem of recovering an ensemble of Diracs on a sphere from its low resolution measurements. The Diracs can be located at any location on the sphere, not necessarily on a grid. We show that under a separation condition, one can recover the ensemble with high precision by a three-stage algorithm, which consists of solving a semi-definite program, root finding and least-squ…

    Submitted 7 January, 2015; v1 submitted 10 December, 2014; originally announced December 2014.

  18. arXiv:1412.3262  [pdf, other]

    cs.IT

    Robust Recovery of Stream of Pulses using Convex Optimization

    Authors: Tamir Bendory, Shai Dekel, Arie Feuer

    Abstract: This paper considers the problem of recovering the delays and amplitudes of a weighted superposition of pulses. This problem is motivated by a variety of applications such as ultrasound and radar. We show that for univariate and bivariate stream of pulses, one can recover the delays and weights to any desired accuracy by solving a tractable convex optimization problem, provided that a pulse-depend…

    Submitted 27 April, 2016; v1 submitted 10 December, 2014; originally announced December 2014.

    Comments: Small modifications