Showing 1–14 of 14 results for author: Holtz, O

Searching in archive cs.
  1. arXiv:2204.08279  [pdf, other]

    cs.DC cs.CC cs.DS

    Communication Bounds for Convolutional Neural Networks

    Authors: Anthony Chen, James Demmel, Grace Dinh, Mason Haberle, Olga Holtz

    Abstract: Convolutional neural networks (CNNs) are important in a wide variety of machine learning tasks and applications, so optimizing their performance is essential. Moving words of data between levels of a memory hierarchy or between processors on a network is much more expensive than the cost of arithmetic, so minimizing communication is critical to optimizing performance. In this paper, we present new…

    Submitted 18 April, 2022; originally announced April 2022.

    Journal ref: PASC '22: Proceedings of the Platform for Advanced Scientific Computing Conference, June 2022, Article No. 1, Pages 1-10

  2. arXiv:2008.03759  [pdf, other]

    cs.DS

    Sparsifying the Operators of Fast Matrix Multiplication Algorithms

    Authors: Gal Beniamini, Nathan Cheng, Olga Holtz, Elaye Karstadt, Oded Schwartz

    Abstract: Fast matrix multiplication algorithms may be useful, provided that their running time is good in practice. Particularly, the leading coefficient of their arithmetic complexity needs to be small. Many sub-cubic algorithms have large leading coefficients, rendering them impractical. Karstadt and Schwartz (SPAA'17, JACM'20) demonstrated how to reduce these coefficients by sparsifying an algorithm's b…

    Submitted 9 August, 2020; originally announced August 2020.

    ACM Class: F.2.1

  3. arXiv:1209.2184  [pdf, other]

    cs.DS cs.CC math.NA

    Graph Expansion Analysis for Communication Costs of Fast Rectangular Matrix Multiplication

    Authors: Grey Ballard, James Demmel, Olga Holtz, Benjamin Lipshitz, Oded Schwartz

    Abstract: Graph expansion analysis of computational DAGs is useful for obtaining communication cost lower bounds where previous methods, such as geometric embedding, are not applicable. This has recently been demonstrated for Strassen's and Strassen-like fast square matrix multiplication algorithms. Here we extend the expansion analysis approach to fast algorithms for rectangular matrix multiplication, obta…

    Submitted 10 September, 2012; originally announced September 2012.

    Journal ref: Design and Analysis of Algorithms Volume 7659, 2012, pp 13-36

  4. arXiv:1202.3177  [pdf, other]

    cs.DS cs.CC cs.DC math.CO math.NA

    Strong Scaling of Matrix Multiplication Algorithms and Memory-Independent Communication Lower Bounds

    Authors: Grey Ballard, James Demmel, Olga Holtz, Benjamin Lipshitz, Oded Schwartz

    Abstract: A parallel algorithm has perfect strong scaling if its running time on P processors is linear in 1/P, including all communication costs. Distributed-memory parallel algorithms for matrix multiplication with perfect strong scaling have only recently been found. One is based on classical matrix multiplication (Solomonik and Demmel, 2011), and one is based on Strassen's fast matrix multiplication (Ba…

    Submitted 14 February, 2012; originally announced February 2012.

    Comments: 4 pages, 1 figure

    MSC Class: 68W10; 68W40

    ACM Class: F.2.1

  5. arXiv:1202.3173  [pdf, other]

    cs.DS cs.CC cs.DC math.CO math.NA

    Communication-Optimal Parallel Algorithm for Strassen's Matrix Multiplication

    Authors: Grey Ballard, James Demmel, Olga Holtz, Benjamin Lipshitz, Oded Schwartz

    Abstract: Parallel matrix multiplication is one of the most studied fundamental problems in distributed and high performance computing. We obtain a new parallel algorithm that is based on Strassen's fast matrix multiplication and minimizes communication. The algorithm outperforms all known parallel matrix multiplication algorithms, classical and Strassen-based, both asymptotically and in practice. A criti…

    Submitted 14 February, 2012; originally announced February 2012.

    Comments: 13 pages, 3 figures

    MSC Class: 68W40; 68W10

    ACM Class: F.2.1

  6. arXiv:1109.1693  [pdf, ps, other]

    cs.DS cs.CC cs.DC math.CO math.NA

    Graph Expansion and Communication Costs of Fast Matrix Multiplication

    Authors: Grey Ballard, James Demmel, Olga Holtz, Oded Schwartz

    Abstract: The communication cost of algorithms (also known as I/O-complexity) is shown to be closely related to the expansion properties of the corresponding computation graphs. We demonstrate this on Strassen's and other fast matrix multiplication algorithms, and obtain first lower bounds on their communication costs. In the sequential case, where the processor has a fast memory of size $M$, too small to…

    Submitted 8 September, 2011; originally announced September 2011.

    Report number: UCB/EECS-2011-40

    ACM Class: F.2.1

    Journal ref: Proceedings of the 23rd annual symposium on parallelism in algorithms and architectures. ACM, 1-12. 2011 (a shorter conference version)

  7. arXiv:0906.0687  [pdf, ps, other]

    cs.CC cs.DS math.HO math.NA math.RA

    Computational Complexity and Numerical Stability of Linear Problems

    Authors: Olga Holtz, Noam Shomron

    Abstract: We survey classical and recent developments in numerical linear algebra, focusing on two issues: computational complexity, or arithmetic costs, and numerical stability, or performance under roundoff error. We present a brief account of the algebraic complexity theory as well as the general error analysis for matrix multiplication and related problems. We emphasize the central role played by the…

    Submitted 11 September, 2009; v1 submitted 3 June, 2009; originally announced June 2009.

    Comments: 16 pages; updated to reflect referees' remarks; to appear in Proceedings of the 5th European Congress of Mathematics

    Journal ref: European Congress of Mathematics Amsterdam, 14-18 July, 2008, EMS Publishing House, pp. 381-400

  8. arXiv:0905.2485  [pdf, ps, other]

    cs.CC cs.DS math.NA

    Minimizing Communication in Linear Algebra

    Authors: Grey Ballard, James Demmel, Olga Holtz, Oded Schwartz

    Abstract: In 1981 Hong and Kung proved a lower bound on the amount of communication needed to perform dense matrix multiplication using the conventional $O(n^3)$ algorithm, where the input matrices were too large to fit in the small, fast memory. In 2004 Irony, Toledo and Tiskin gave a new proof of this result and extended it to the parallel case. In both cases the lower bound may be expressed as $Ω$(#ar…

    Submitted 15 May, 2009; originally announced May 2009.

    Comments: 27 pages, 2 tables

    Journal ref: SIAM J. Matrix Anal. & Appl. 32 (2011), no. 3, 866-901

  9. arXiv:0902.2537  [pdf, other]

    math.NA cs.CC cs.DS

    Communication-optimal Parallel and Sequential Cholesky Decomposition

    Authors: Grey Ballard, James Demmel, Olga Holtz, Oded Schwartz

    Abstract: Numerical algorithms have two kinds of costs: arithmetic and communication, by which we mean either moving data between levels of a memory hierarchy (in the sequential case) or over a network connecting processors (in the parallel case). Communication costs often dominate arithmetic costs, so it is of interest to design algorithms minimizing communication. In this paper we first extend known lower…

    Submitted 12 April, 2010; v1 submitted 15 February, 2009; originally announced February 2009.

    Comments: 29 pages, 2 tables, 6 figures

    ACM Class: F.2.1

    Journal ref: SIAM J. Sci. Comput. 32 (2010), pp. 3495-3523

  10. arXiv:0812.3137  [pdf, ps, other]

    math.HO cs.DS math.NA math.OC

    Compressive sensing: a paradigm shift in signal processing

    Authors: Olga Holtz

    Abstract: We survey a new paradigm in signal processing known as "compressive sensing". Contrary to old practices of data acquisition and reconstruction based on the Shannon-Nyquist sampling principle, the new theory shows that it is possible to reconstruct images or signals of scientific interest accurately and even exactly from a number of samples which is far smaller than the desired resolution of the…

    Submitted 16 December, 2008; originally announced December 2008.

    Comments: A short survey of compressive sensing

    MSC Class: 90C05; 90C25; 65F50; 94A08; 94A20; 68P30; 65Y20

  11. arXiv:0712.4027  [pdf, ps, other]

    math.NA cs.CC cs.DS math.RA

    Accurate and Efficient Expression Evaluation and Linear Algebra

    Authors: James Demmel, Ioana Dumitriu, Olga Holtz, Plamen Koev

    Abstract: We survey and unify recent results on the existence of accurate algorithms for evaluating multivariate polynomials, and more generally for accurate numerical linear algebra with structured matrices. By "accurate" we mean that the computed answer has relative error less than 1, i.e., has some correct leading digits. We also address efficiency, by which we mean algorithms that run in polynomial ti…

    Submitted 24 December, 2007; originally announced December 2007.

    Comments: 49 pages, 6 figures, 1 table

    MSC Class: 65Y20; 68Q05; 68Q25; 65F30; 68W40; 68W25

    Journal ref: Acta Numerica, Volume 17, May 2008, pp 87-145

  12. arXiv:math/0612264  [pdf, ps, other]

    math.NA cs.CC cs.DS

    Fast linear algebra is stable

    Authors: James Demmel, Ioana Dumitriu, Olga Holtz

    Abstract: In an earlier paper, we showed that a large class of fast recursive matrix multiplication algorithms is stable in a normwise sense, and that in fact if multiplication of $n$-by-$n$ matrices can be done by any algorithm in $O(n^{\omega + \eta})$ operations for any $\eta > 0$, then it can be done stably in $O(n^{\omega + \eta})$ operations for any $\eta > 0$. Here we extend this result to show that essentially all standard…

    Submitted 28 August, 2007; v1 submitted 10 December, 2006; originally announced December 2006.

    Comments: 26 pages; final version; to appear in Numerische Mathematik

    MSC Class: 65Y20; 65F30; 65G50; 68Q17; 68Q25

    Journal ref: Numer. Math. 108 (2007), no. 1, 59-91

  13. arXiv:math/0603207  [pdf, ps, other]

    math.NA cs.CC cs.DS math.GR

    Fast matrix multiplication is stable

    Authors: James Demmel, Ioana Dumitriu, Olga Holtz, Robert Kleinberg

    Abstract: We perform forward error analysis for a large class of recursive matrix multiplication algorithms in the spirit of [D. Bini and G. Lotti, Stability of fast algorithms for matrix multiplication, Numer. Math. 36 (1980), 63-72]. As a consequence of our analysis, we show that the exponent of matrix multiplication (the optimal running time) can be achieved by numerically stable algorithms. We also s…

    Submitted 7 December, 2006; v1 submitted 8 March, 2006; originally announced March 2006.

    Comments: 19 pages; final version, expanded and updated to reflect referees' remarks; to appear in Numerische Mathematik

    MSC Class: 65Y20; 65F30; 65G50; 68Q17; 68W40; 20C05; 20K01; 16S34; 43A30; 65T50

    Journal ref: Numer. Math. 106 (2007), no. 2, 199-224

  14. arXiv:math/0508350  [pdf, ps, other]

    math.NA cs.CC

    Toward accurate polynomial evaluation in rounded arithmetic

    Authors: James Demmel, Ioana Dumitriu, Olga Holtz

    Abstract: Given a multivariate real (or complex) polynomial $p$ and a domain $\mathcal{D}$, we would like to decide whether an algorithm exists to evaluate $p(x)$ accurately for all $x \in \mathcal{D}$ using rounded real (or complex) arithmetic. Here "accurately" means with relative error less than 1, i.e., with some correct leading digits. The answer depends on the model of rounded arithmetic: We assume that f…

    Submitted 18 January, 2006; v1 submitted 18 August, 2005; originally announced August 2005.

    Comments: 54 pages, 6 figures; refereed version; to appear in Foundations of Computational Mathematics: Santander 2005, Cambridge University Press, March 2006

    MSC Class: 65Y20; 68Q05; 68Q25; 65F30; 68W40; 68W25

    Journal ref: in Foundations of Computational Mathematics: Santander 2005 (L. Pardo et al, eds.) Cambridge University Press, 2006, pp. 36-105