Showing 1–19 of 19 results for author: Schwartz, O

  1. arXiv:2405.14880  [pdf, other]

    cs.CV cs.AI

    Dissecting Query-Key Interaction in Vision Transformers

    Authors: Xu Pan, Aaron Philip, Ziqian Xie, Odelia Schwartz

    Abstract: Self-attention in vision transformers is often thought to perform perceptual grouping where tokens attend to other tokens with similar embeddings, which could correspond to semantically similar features of an object. However, attending to dissimilar tokens can be beneficial by providing contextual information. We propose to use the Singular Value Decomposition to dissect the query-key interaction…

    Submitted 26 May, 2024; v1 submitted 4 April, 2024; originally announced May 2024.

  2. arXiv:2108.01548  [pdf, other]

    cs.CV cs.LG

    Inference via Sparse Coding in a Hierarchical Vision Model

    Authors: Joshua Bowren, Luis Sanchez-Giraldo, Odelia Schwartz

    Abstract: Sparse coding has been incorporated in models of the visual cortex for its computational advantages and connection to biology. But how the level of sparsity contributes to performance on visual tasks is not well understood. In this work, sparse coding has been integrated into an existing hierarchical V2 model (Hosoya and Hyvärinen, 2015), but replacing its independent component analysis (ICA) with…

    Submitted 16 January, 2022; v1 submitted 3 August, 2021; originally announced August 2021.

    Comments: To appear in Journal of Vision (JoV)

  3. arXiv:2008.03759  [pdf, other]

    cs.DS

    Sparsifying the Operators of Fast Matrix Multiplication Algorithms

    Authors: Gal Beniamini, Nathan Cheng, Olga Holtz, Elaye Karstadt, Oded Schwartz

    Abstract: Fast matrix multiplication algorithms may be useful, provided that their running time is good in practice. Particularly, the leading coefficient of their arithmetic complexity needs to be small. Many sub-cubic algorithms have large leading coefficients, rendering them impractical. Karstadt and Schwartz (SPAA'17, JACM'20) demonstrated how to reduce these coefficients by sparsifying an algorithm's b…

    Submitted 9 August, 2020; originally announced August 2020.

    ACM Class: F.2.1; F.2.1

  4. arXiv:2005.14150  [pdf, other]

    cs.DC

    Network Partitioning and Avoidable Contention

    Authors: Yishai Oltchik, Oded Schwartz

    Abstract: Network contention frequently dominates the run time of parallel algorithms and limits scaling performance. Most previous studies mitigate or eliminate contention by utilizing one of several approaches: communication-minimizing algorithms; hotspot-avoiding routing schemes; topology-aware task mapping; or improving global network properties, such as bisection bandwidth, edge-expansion, partitioning…

    Submitted 28 May, 2020; originally announced May 2020.

    Comments: 10 pages, 7 figures

    ACM Class: C.2.1

  5. arXiv:1806.02888  [pdf, other]

    q-bio.NC cs.CV

    Correspondence of Deep Neural Networks and the Brain for Visual Textures

    Authors: Md Nasir Uddin Laskar, Luis G Sanchez Giraldo, Odelia Schwartz

    Abstract: Deep convolutional neural networks (CNNs) trained on objects and scenes have shown intriguing ability to predict some response properties of visual cortical neurons. However, the factors and computations that give rise to such ability, and the role of intermediate processing stages in explaining changes that develop across areas of the cortical hierarchy, are poorly understood. We focused on the s…

    Submitted 7 June, 2018; originally announced June 2018.

  6. arXiv:1806.01823  [pdf, ps, other]

    q-bio.NC cs.CV

    Integrating Flexible Normalization into Mid-Level Representations of Deep Convolutional Neural Networks

    Authors: Luis Gonzalo Sanchez Giraldo, Odelia Schwartz

    Abstract: Deep convolutional neural networks (CNNs) are becoming increasingly popular models to predict neural responses in visual cortex. However, contextual effects, which are prevalent in neural processing and in perception, are not explicitly handled by current CNNs, including those used for neural prediction. In primary visual cortex, neural responses are modulated by stimuli spatially surrounding the…

    Submitted 24 December, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

  7. High-order Discretization of a Gyrokinetic Vlasov Model in Edge Plasma Geometry

    Authors: Milo R. Dorr, Phillip Colella, Mikhail A. Dorf, Debojyoti Ghosh, Jeffrey A. F. Hittinger, Peter O. Schwartz

    Abstract: We present a high-order spatial discretization of a continuum gyrokinetic Vlasov model in axisymmetric tokamak edge plasma geometries. Such models describe the phase space advection of plasma species distribution functions in the absence of collisions. The gyrokinetic model is posed in a four-dimensional phase space, upon which a grid is imposed when discretized. To mitigate the computational cost…

    Submitted 5 December, 2017; originally announced December 2017.

    MSC Class: 65M06; 86A10; 76N15

  8. arXiv:1607.06303  [pdf, ps, other]

    math.NA cs.DC

    High-Performance Algorithms for Computing the Sign Function of Triangular Matrices

    Authors: Vadim Stotland, Oded Schwartz, Sivan Toledo

    Abstract: Algorithms and implementations for computing the sign function of a triangular matrix are fundamental building blocks in algorithms for computing the sign of arbitrary square real or complex matrices. We present novel recursive and cache efficient algorithms that are based on Higham's stabilized specialization of Parlett's substitution algorithm for computing the sign of a triangular matrix. We sh…

    Submitted 21 July, 2016; originally announced July 2016.

    Comments: 18 pages, 4 figures

  9. arXiv:1603.05627  [pdf, ps, other]

    cs.DC

    Hypergraph Partitioning for Sparse Matrix-Matrix Multiplication

    Authors: Grey Ballard, Alex Druinsky, Nicholas Knight, Oded Schwartz

    Abstract: We propose a fine-grained hypergraph model for sparse matrix-matrix multiplication (SpGEMM), a key computational kernel in scientific computing and data analysis whose performance is often communication bound. This model correctly describes both the interprocessor communication volume along a critical path in a parallel computation and also the volume of data moving through the memory hierarchy in…

    Submitted 17 March, 2016; originally announced March 2016.

  10. arXiv:1510.00844  [pdf, other]

    cs.DC math.NA

    Exploiting Multiple Levels of Parallelism in Sparse Matrix-Matrix Multiplication

    Authors: Ariful Azad, Grey Ballard, Aydin Buluc, James Demmel, Laura Grigori, Oded Schwartz, Sivan Toledo, Samuel Williams

    Abstract: Sparse matrix-matrix multiplication (or SpGEMM) is a key primitive for many high-performance graph algorithms as well as for some linear solvers, such as algebraic multigrid. The scaling of existing parallel implementations of SpGEMM is heavily bound by communication. Even though 3D (or 2.5D) algorithms have been proposed and theoretically analyzed in the flat MPI model on Erdos-Renyi matrices, th…

    Submitted 16 November, 2016; v1 submitted 3 October, 2015; originally announced October 2015.

    Journal ref: SIAM Journal of Scientific Computing, Volume 38, Number 6, pp. C624-C651, 2016

  11. arXiv:1209.2184  [pdf, other]

    cs.DS cs.CC math.NA

    Graph Expansion Analysis for Communication Costs of Fast Rectangular Matrix Multiplication

    Authors: Grey Ballard, James Demmel, Olga Holtz, Benjamin Lipshitz, Oded Schwartz

    Abstract: Graph expansion analysis of computational DAGs is useful for obtaining communication cost lower bounds where previous methods, such as geometric embedding, are not applicable. This has recently been demonstrated for Strassen's and Strassen-like fast square matrix multiplication algorithms. Here we extend the expansion analysis approach to fast algorithms for rectangular matrix multiplication, obta…

    Submitted 10 September, 2012; originally announced September 2012.

    Journal ref: Design and Analysis of Algorithms Volume 7659, 2012, pp 13-36

  12. arXiv:1208.4405  [pdf, other]

    cs.IT math.NT math.RT

    Delay-Doppler Channel Estimation with Almost Linear Complexity

    Authors: Alexander Fish, Shamgar Gurevich, Ronny Hadani, Akbar Sayeed, Oded Schwartz

    Abstract: A fundamental task in wireless communication is Channel Estimation: Compute the channel parameters a signal undergoes while traveling from a transmitter to a receiver. In the case of delay-Doppler channel, a widely used method is the Matched Filter algorithm. It uses a pseudo-random sequence of length N, and, in case of non-trivial relative velocity between transmitter and receiver, its computatio…

    Submitted 23 August, 2012; v1 submitted 21 August, 2012; originally announced August 2012.

    Comments: 11 pages

  13. arXiv:1202.3177  [pdf, other]

    cs.DS cs.CC cs.DC math.CO math.NA

    Strong Scaling of Matrix Multiplication Algorithms and Memory-Independent Communication Lower Bounds

    Authors: Grey Ballard, James Demmel, Olga Holtz, Benjamin Lipshitz, Oded Schwartz

    Abstract: A parallel algorithm has perfect strong scaling if its running time on P processors is linear in 1/P, including all communication costs. Distributed-memory parallel algorithms for matrix multiplication with perfect strong scaling have only recently been found. One is based on classical matrix multiplication (Solomonik and Demmel, 2011), and one is based on Strassen's fast matrix multiplication (Ba…

    Submitted 14 February, 2012; originally announced February 2012.

    Comments: 4 pages, 1 figure

    MSC Class: 68W10; 68W40 ACM Class: F.2.1

  14. arXiv:1202.3173  [pdf, other]

    cs.DS cs.CC cs.DC math.CO math.NA

    Communication-Optimal Parallel Algorithm for Strassen's Matrix Multiplication

    Authors: Grey Ballard, James Demmel, Olga Holtz, Benjamin Lipshitz, Oded Schwartz

    Abstract: Parallel matrix multiplication is one of the most studied fundamental problems in distributed and high performance computing. We obtain a new parallel algorithm that is based on Strassen's fast matrix multiplication and minimizes communication. The algorithm outperforms all known parallel matrix multiplication algorithms, classical and Strassen-based, both asymptotically and in practice. A criti…

    Submitted 14 February, 2012; originally announced February 2012.

    Comments: 13 pages, 3 figures

    MSC Class: 68W40; 68W10 ACM Class: F.2.1

  15. arXiv:1112.4883  [pdf, other]

    cs.IT

    Computing the Matched Filter in Linear Time

    Authors: Alexander Fish, Shamgar Gurevich, Ronny Hadani, Akbar Sayeed, Oded Schwartz

    Abstract: A fundamental problem in wireless communication is the time-frequency shift (TFS) problem: Find the time-frequency shift of a signal in a noisy environment. The shift is the result of time asynchronization of a sender with a receiver, and of non-zero speed of a sender with respect to a receiver. A classical solution of a discrete analog of the TFS problem is called the matched filter algorithm. It…

    Submitted 20 December, 2011; originally announced December 2011.

    Comments: 6 pages

  16. arXiv:1109.1693  [pdf, ps, other]

    cs.DS cs.CC cs.DC math.CO math.NA

    Graph Expansion and Communication Costs of Fast Matrix Multiplication

    Authors: Grey Ballard, James Demmel, Olga Holtz, Oded Schwartz

    Abstract: The communication cost of algorithms (also known as I/O-complexity) is shown to be closely related to the expansion properties of the corresponding computation graphs. We demonstrate this on Strassen's and other fast matrix multiplication algorithms, and obtain first lower bounds on their communication costs. In the sequential case, where the processor has a fast memory of size $M$, too small to…

    Submitted 8 September, 2011; originally announced September 2011.

    Report number: UCB/EECS-2011-40 ACM Class: F.2.1

    Journal ref: Proceedings of the 23rd annual symposium on parallelism in algorithms and architectures. ACM, 1-12. 2011 (a shorter conference version)

  17. arXiv:0905.2485  [pdf, ps, other]

    cs.CC cs.DS math.NA

    Minimizing Communication in Linear Algebra

    Authors: Grey Ballard, James Demmel, Olga Holtz, Oded Schwartz

    Abstract: In 1981 Hong and Kung proved a lower bound on the amount of communication needed to perform dense, matrix-multiplication using the conventional $O(n^3)$ algorithm, where the input matrices were too large to fit in the small, fast memory. In 2004 Irony, Toledo and Tiskin gave a new proof of this result and extended it to the parallel case. In both cases the lower bound may be expressed as $Ω$(#ar…

    Submitted 15 May, 2009; originally announced May 2009.

    Comments: 27 pages, 2 tables

    Journal ref: SIAM. J. Matrix Anal. & Appl. 32 (2011), no. 3, 866-901

  18. arXiv:0904.2115  [pdf, other]

    cs.CG

    Colorful Strips

    Authors: G. Aloupis, J. Cardinal, S. Collette, S. Imahori, M. Korman, S. Langerman, O. Schwartz, S. Smorodinsky, P. Taslakian

    Abstract: Given a planar point set and an integer $k$, we wish to color the points with $k$ colors so that any axis-aligned strip containing enough points contains all colors. The goal is to bound the necessary size of such a strip, as a function of $k$. We show that if the strip size is at least $2k{-}1$, such a coloring can always be found. We prove that the size of the strip is also bounded in any fixed…

    Submitted 7 April, 2011; v1 submitted 14 April, 2009; originally announced April 2009.

  19. arXiv:0902.2537  [pdf, other]

    math.NA cs.CC cs.DS

    Communication-optimal Parallel and Sequential Cholesky Decomposition

    Authors: Grey Ballard, James Demmel, Olga Holtz, Oded Schwartz

    Abstract: Numerical algorithms have two kinds of costs: arithmetic and communication, by which we mean either moving data between levels of a memory hierarchy (in the sequential case) or over a network connecting processors (in the parallel case). Communication costs often dominate arithmetic costs, so it is of interest to design algorithms minimizing communication. In this paper we first extend known lower…

    Submitted 12 April, 2010; v1 submitted 15 February, 2009; originally announced February 2009.

    Comments: 29 pages, 2 tables, 6 figures

    ACM Class: F.2.1

    Journal ref: SIAM J. Sci. Comput. 32, (2010) pp. 3495-3523