Search | arXiv e-print repository

arXiv:2008.11117 [pdf, other]

Stochastic Markov Gradient Descent and Training Low-Bit Neural Networks

Authors: Jonathan Ashbrock, Alexander M. Powell

Abstract: The massive size of modern neural networks has motivated substantial recent interest in neural network quantization. We introduce Stochastic Markov Gradient Descent (SMGD), a discrete optimization method applicable to training quantized neural networks. The SMGD algorithm is designed for settings where memory is highly constrained during training. We provide theoretical guarantees of algorithm per… ▽ More The massive size of modern neural networks has motivated substantial recent interest in neural network quantization. We introduce Stochastic Markov Gradient Descent (SMGD), a discrete optimization method applicable to training quantized neural networks. The SMGD algorithm is designed for settings where memory is highly constrained during training. We provide theoretical guarantees of algorithm performance as well as encouraging numerical results. △ Less

Submitted 22 December, 2020; v1 submitted 25 August, 2020; originally announced August 2020.

Comments: 19 pages, 2 figures

arXiv:2006.09732 [pdf, other]

doi 10.1142/S0219530520400096

High order low-bit Sigma-Delta quantization for fusion frames

Authors: Zhen Gao, Felix Krahmer, Alexander M. Powell

Abstract: We construct high order low-bit Sigma-Delta $(ΣΔ)$ quantizers for the vector-valued setting of fusion frames. We prove that these $ΣΔ$ quantizers can be stably implemented to quantize fusion frame measurements on subspaces $W_n$ using $\log_2( {\rm dim}(W_n)+1)$ bits per measurement. Signal reconstruction is performed using a version of Sobolev duals for fusion frames, and numerical experiments ar… ▽ More We construct high order low-bit Sigma-Delta $(ΣΔ)$ quantizers for the vector-valued setting of fusion frames. We prove that these $ΣΔ$ quantizers can be stably implemented to quantize fusion frame measurements on subspaces $W_n$ using $\log_2( {\rm dim}(W_n)+1)$ bits per measurement. Signal reconstruction is performed using a version of Sobolev duals for fusion frames, and numerical experiments are given to validate the overall performance. △ Less

Submitted 3 October, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

Comments: 18 pages, 2 figures

arXiv:2003.09576 [pdf, ps, other]

A Schauder basis for $L_2$ consisting of non-negative functions

Authors: Daniel Freeman, Alexander M. Powell, Mitchell A. Taylor

Abstract: We prove that $L_2(\mathbb{R})$ contains a Schauder basis of non-negative functions. Similarly, $L_p(\mathbb{R})$ contains a Schauder basic sequence of non-negative functions such that $L_p(\mathbb{R})$ embeds into the closed span of the sequence. We prove as well that if $X$ is a separable Banach space with the bounded approximation property, then any set in $X$ with dense span contains a quasi-b… ▽ More We prove that $L_2(\mathbb{R})$ contains a Schauder basis of non-negative functions. Similarly, $L_p(\mathbb{R})$ contains a Schauder basic sequence of non-negative functions such that $L_p(\mathbb{R})$ embeds into the closed span of the sequence. We prove as well that if $X$ is a separable Banach space with the bounded approximation property, then any set in $X$ with dense span contains a quasi-basis (Schauder frame) for $X$. Furthermore, if $X$ is a separable Banach lattice with a bibasis then any set in $X$ with dense span contains a u-frame. △ Less

Submitted 21 March, 2020; originally announced March 2020.

Comments: 26 pages

MSC Class: 46B03; 46B15; 46E30; 42C15

arXiv:1510.04855 [pdf, ps, other]

doi 10.1016/j.acha.2016.05.001

A Sharp Balian-Low Uncertainty Principle for Shift-Invariant Spaces

Authors: Douglas P. Hardin, Michael C. Northington V., Alexander M. Powell

Abstract: A sharp version of the Balian-Low theorem is proven for the generators of finitely generated shift-invariant spaces. If generators $\{f_k\}_{k=1}^K \subset L^2(\mathbb{R}^d)$ are translated along a lattice to form a frame or Riesz basis for a shift-invariant space $V$, and if $V$ has extra invariance by a suitable finer lattice, then one of the generators $f_k$ must satisfy… ▽ More A sharp version of the Balian-Low theorem is proven for the generators of finitely generated shift-invariant spaces. If generators $\{f_k\}_{k=1}^K \subset L^2(\mathbb{R}^d)$ are translated along a lattice to form a frame or Riesz basis for a shift-invariant space $V$, and if $V$ has extra invariance by a suitable finer lattice, then one of the generators $f_k$ must satisfy $\int_{\mathbb{R}^d} |x| |f_k(x)|^2 dx = \infty$, namely, $\widehat{f_k} \notin H^{1/2}(\mathbb{R}^d)$. Similar results are proven for frames of translates that are not Riesz bases without the assumption of extra lattice invariance. The best previously existing results in the literature give a notably weaker conclusion using the Sobolev space $H^{d/2+ε}(\mathbb{R}^d)$; our results provide an absolutely sharp improvement with $H^{1/2}(\mathbb{R}^d)$. Our results are sharp in the sense that $H^{1/2}(\mathbb{R}^d)$ cannot be replaced by $H^s(\mathbb{R}^d)$ for any $s<1/2$. △ Less

Submitted 16 October, 2015; originally announced October 2015.

Comments: 20 pages

MSC Class: 42C15

arXiv:1405.7094 [pdf, ps, other]

Error bounds for consistent reconstruction: random polytopes and coverage processes

Authors: Alexander M. Powell, J. Tyler Whitehouse

Abstract: Consistent reconstruction is a method for producing an estimate $\widetilde{x} \in \mathbb{R}^d$ of a signal $x\in \mathbb{R}^d$ if one is given a collection of $N$ noisy linear measurements $q_n = \langle x, \varphi_n \rangle + ε_n$, $1 \leq n \leq N$, that have been corrupted by i.i.d. uniform noise $\{ε_n\}_{n=1}^N$. We prove mean squared error bounds for consistent reconstruction when the meas… ▽ More Consistent reconstruction is a method for producing an estimate $\widetilde{x} \in \mathbb{R}^d$ of a signal $x\in \mathbb{R}^d$ if one is given a collection of $N$ noisy linear measurements $q_n = \langle x, \varphi_n \rangle + ε_n$, $1 \leq n \leq N$, that have been corrupted by i.i.d. uniform noise $\{ε_n\}_{n=1}^N$. We prove mean squared error bounds for consistent reconstruction when the measurement vectors $\{\varphi_n\}_{n=1}^N\subset \mathbb{R}^d$ are drawn independently at random from a suitable distribution on the unit-sphere $\mathbb{S}^{d-1}$. Our main results prove that the mean squared error (MSE) for consistent reconstruction is of the optimal order $\mathbb{E}\|x - \widetilde{x}\|^2 \leq Kδ^2/N^2$ under general conditions on the measurement vectors. We also prove refined MSE bounds when the measurement vectors are i.i.d. uniformly distributed on the unit-sphere $\mathbb{S}^{d-1}$ and, in particular, show that in this case the constant $K$ is dominated by $d^3$, the cube of the ambient dimension. The proofs involve an analysis of random polytopes using coverage processes on the sphere. △ Less

Submitted 27 May, 2014; originally announced May 2014.

arXiv:1002.4076 [pdf, ps, other]

Time-frequency concentration of generating systems

Authors: Philippe Jaming, Alexander M. Powell

Abstract: Uncertainty principles for generating systems $\{e_n\}_{n=1}^{\infty} \subset \ltwo$ are proven and quantify the interplay between $\ell^r(\N)$ coefficient stability properties and time-frequency localization with respect to $|t|^p$ power weight dispersions. As a sample result, it is proven that if the unit-norm system $\{e_n\}_{n=1}^{\infty}$ is a Schauder basis or frame for $\ltwo$ then the tw… ▽ More Uncertainty principles for generating systems $\{e_n\}_{n=1}^{\infty} \subset \ltwo$ are proven and quantify the interplay between $\ell^r(\N)$ coefficient stability properties and time-frequency localization with respect to $|t|^p$ power weight dispersions. As a sample result, it is proven that if the unit-norm system $\{e_n\}_{n=1}^{\infty}$ is a Schauder basis or frame for $\ltwo$ then the two dispersion sequences $Δ(e_n)$, $Δ(\bar{e_n})$ and the one mean sequence $μ(e_n)$ cannot all be bounded. On the other hand, it is constructively proven that there exists a unit-norm exact system $\{f_n\}_{n=1}^{\infty}$ in $\ltwo$ for which all four of the sequences $Δ(f_n)$, $Δ(\bar{f_n})$, $μ(f_n)$, $μ(\bar{f_n})$ are bounded. △ Less

Submitted 22 February, 2010; originally announced February 2010.

Comments: 14 pages

MSC Class: 42A38; 42A65

Journal ref: Proc. A.M.S. 139 (2011) 3279-3290

arXiv:math/0606395 [pdf, ps, other]

doi 10.1016/j.jfa.2006.09.001

Uncertainty principles for orthonormal sequences

Authors: Philippe Jaming, Alexander M. Powell

Abstract: The aim of this paper is to provide complementary quantitative extensions of two results of H.S. Shapiro on the time-frequency concentration of orthonormal sequences in $L^2 (\R)$. More precisely, Shapiro proved that if the elements of an orthonormal sequence and their Fourier transforms are all pointwise bounded by a fixed function in $L^2(\R)$ then the sequence is finite. In a related result,… ▽ More The aim of this paper is to provide complementary quantitative extensions of two results of H.S. Shapiro on the time-frequency concentration of orthonormal sequences in $L^2 (\R)$. More precisely, Shapiro proved that if the elements of an orthonormal sequence and their Fourier transforms are all pointwise bounded by a fixed function in $L^2(\R)$ then the sequence is finite. In a related result, Shapiro also proved that if the elements of an orthonormal sequence and their Fourier transforms have uniformly bounded means and dispersions then the sequence is finite. This paper gives quantitative bounds on the size of the finite orthonormal sequences in Shapiro's uncertainty principles. The bounds are obtained by using prolate spheroïdal wave functions and combinatorial estimates on the number of elements in a spherical code. Extensions for Riesz bases and different measures of time-frequency concentration are also given. △ Less

Submitted 16 June, 2006; originally announced June 2006.

MSC Class: 42B10

Journal ref: Journal of Functional Analysis 243 (15/02/2007) 611-630

Showing 1–7 of 7 results for author: Powell, A M