
Covariance estimation with direction dependence accuracy

Authors: Pedro Abdalla, Shahar Mendelson

Abstract: We construct an estimator $\widehat{Σ}$ for covariance matrices of unknown, centred random vectors $X$, with the given data consisting of $N$ independent measurements $X_1,\dots,X_N$ of $X$ and the wanted confidence level. We show that under minimal assumptions on $X$, the estimator performs with the optimal accuracy with respect to the operator norm. In addition, the estimator is also optimal with respect to direction dependence accuracy: $\langle \widehat{Σ}u,u\rangle$ is an optimal estimator for $σ^2(u)=\mathbb{E}\langle X,u\rangle^2$ when $σ^2(u)$ is "large".

Submitted 13 February, 2024; originally announced February 2024.

arXiv:2312.06442 [pdf, ps, other]

A uniform Dvoretzky-Kiefer-Wolfowitz inequality

Authors: Daniel Bartl, Shahar Mendelson

Abstract: We show that under minimal assumptions on a class of functions $\mathcal{H}$ defined on a probability space $(\mathcal{X},μ)$, there is a threshold $Δ_0$ satisfying the following: for every $Δ\geq Δ_0$, with probability at least $1-2\exp(-cΔm)$ with respect to $μ^{\otimes m}$, \[ \sup_{h\in\mathcal{H}} \sup_{t\in\mathbb{R}} \left| \mathbb{P}(h(X)\leq t) - \frac{1}{m}\sum_{i=1}^m 1_{(-\infty,t]}(h(X_i)) \right| \leq \sqrt{Δ};\] here $X$ is distributed according to $μ$ and $(X_i)_{i=1}^m$ are independent copies of $X$. The value of $Δ_0$ is determined by an unexpected complexity parameter of the class $\mathcal{H}$ that captures the set's geometry (Talagrand's $γ_1$-functional). The bound, the probability estimate and the value of $Δ_0$ are all optimal up to a logarithmic factor.

Submitted 11 December, 2023; originally announced December 2023.

arXiv:2311.07741 [pdf, ps, other]

Exact Synthesis of Multiqubit Clifford-Cyclotomic Circuits

Authors: Matthew Amy, Andrew N. Glaudell, Shaun Kelso, William Maxwell, Samuel S. Mendelson, Neil J. Ross

Abstract: Let $n\geq 8$ be divisible by 4. The Clifford-cyclotomic gate set $\mathcal{G}_n$ is the universal gate set obtained by extending the Clifford gates with the $z$-rotation $T_n = \mathrm{diag}(1,ζ_n)$, where $ζ_n$ is a primitive $n$-th root of unity. In this note, we show that, when $n$ is a power of 2, a multiqubit unitary matrix $U$ can be exactly represented by a circuit over $\mathcal{G}_n$ if and only if the entries of $U$ belong to the ring $\mathbb{Z}[1/2,ζ_n]$. We moreover show that $\log(n)-2$ ancillas are always sufficient to construct a circuit for $U$. Our results generalize prior work to an infinite family of gate sets and show that the limitations that apply to single-qubit unitaries, for which the correspondence between Clifford-cyclotomic operators and matrices over $\mathbb{Z}[1/2,ζ_n]$ fails for all but finitely many values of $n$, can be overcome through the use of ancillas.

Submitted 12 April, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

arXiv:2311.07675 [pdf, other]

Spectral properties of random graphs with fixed equitable partition

Authors: Matthew B. Crawford, David J. Marchette, William Maxwell, Samuel S. Mendelson

Abstract: We define a graph to be $S$-regular if it contains an equitable partition given by a matrix $S$. These graphs are generalizations of both regular and bipartite, biregular graphs. An $S$-regular matrix is then defined as a matrix on an $S$-regular graph consistent with the graph's equitable partition. In this paper we derive the limiting spectral density for large, random $S$-regular matrices as well as limiting functions of certain statistics for their eigenvector coordinates as a function of eigenvalue. These limiting functions are defined in terms of spectral measures on $S$-regular trees. In general, these spectral measures do not have a closed-form expression; however, we provide a defining system of polynomials for them. Finally, we explore eigenvalue bounds of $S$-regular graphs, proving an expander mixing lemma, an Alon-Boppana bound, and other eigenvalue inequalities in terms of the eigenvalues of the matrix $S$.

Submitted 13 November, 2023; originally announced November 2023.

Comments: 24 pages, 3 figures

MSC Class: 05C75 (Primary) 60B20; 05C80 (Secondary)

arXiv:2309.12069 [pdf, ps, other]

Optimal non-gaussian Dvoretzky-Milman embeddings

Authors: Daniel Bartl, Shahar Mendelson

Abstract: We construct the first non-gaussian ensemble that yields the optimal estimate in the Dvoretzky-Milman Theorem: the ensemble exhibits almost Euclidean sections in arbitrary normed spaces of the same dimension as the gaussian embedding -- despite being very far from gaussian (in fact, it happens to be heavy-tailed).

Submitted 21 September, 2023; originally announced September 2023.

Comments: This is part two of the paper "Structure preservation via the Wasserstein distance" (arXiv:2209.07058v1) which was split into two parts

Journal ref: International Mathematics Research Notices, 2023+

arXiv:2309.02013 [pdf, ps, other]

Empirical approximation of the gaussian distribution in $\mathbb{R}^d$

Authors: Daniel Bartl, Shahar Mendelson

Abstract: Let $G_1,\dots,G_m$ be independent copies of the standard gaussian random vector in $\mathbb{R}^d$. We show that there is an absolute constant $c$ such that for any $A \subset S^{d-1}$, with probability at least $1-2\exp(-cΔm)$, for every $t\in\mathbb{R}$, \[ \sup_{x \in A} \left| \frac{1}{m}\sum_{i=1}^m 1_{ \{\langle G_i,x\rangle \leq t \}} - \mathbb{P}(\langle G,x\rangle \leq t) \right| \leq Δ + σ(t) \sqrt{Δ}. \] Here $σ(t)$ is the variance of $1_{\{\langle G,x\rangle\leq t\}}$ and $Δ\geq Δ_0$, where $Δ_0$ is determined by an unexpected complexity parameter of $A$ that captures the set's geometry (Talagrand's $γ_1$ functional). The bound, the probability estimate, and the value of $Δ_0$ are all (almost) optimal. We use this fact to show that if $Γ=\sum_{i=1}^m \langle G_i,x\rangle e_i$ is the random matrix that has $G_1,\dots,G_m$ as its rows, then the structure of $Γ(A)=\{Γx: x\in A\}$ is far more rigid and well-prescribed than was previously expected.

Submitted 5 September, 2023; originally announced September 2023.

arXiv:2308.04757 [pdf, ps, other]

On a variance dependent Dvoretzky-Kiefer-Wolfowitz inequality

Authors: Daniel Bartl, Shahar Mendelson

Abstract: Let $X$ be a real-valued random variable with distribution function $F$. Set $X_1,\dots, X_m$ to be independent copies of $X$ and let $F_m$ be the corresponding empirical distribution function. We show that there are absolute constants $c_0$ and $c_1$ such that if $Δ\geq c_0\frac{\log\log m}{m}$, then with probability at least $1-2\exp(-c_1Δm)$, for every $t\in\mathbb{R}$ that satisfies $F(t)\in[Δ,1-Δ]$, \[ |F_m(t) - F(t) | \leq \sqrt{Δ\min\{F(t),1-F(t)\} } .\] Moreover, this estimate is optimal up to the multiplicative constants $c_0$ and $c_1$.

Submitted 9 August, 2023; originally announced August 2023.
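
The variance-dependent bound in the abstract above can be explored numerically. The following is a minimal sketch, not taken from the paper: it assumes Uniform(0,1) samples (so $F(t)=t$), sets the unknown constant $c_0$ to 1 purely for illustration, and uses hypothetical helper names (`empirical_cdf`, `normalized_deviation`). It computes the largest value of $|F_m(t)-F(t)|/\sqrt{\min\{F(t),1-F(t)\}}$ over a grid of $t$, which the inequality suggests should be of order $\sqrt{Δ}$.

```python
import math
import random


def empirical_cdf(samples, t):
    """Return F_m(t): the fraction of samples that are <= t."""
    return sum(1 for x in samples if x <= t) / len(samples)


def normalized_deviation(samples, cdf, ts):
    """Max over t in ts of |F_m(t) - F(t)| / sqrt(min(F(t), 1 - F(t))).

    Grid points where F(t) is 0 or 1 are skipped, since the
    normalisation is undefined there.
    """
    worst = 0.0
    for t in ts:
        f = cdf(t)
        scale = min(f, 1.0 - f)
        if scale <= 0.0:
            continue
        deviation = abs(empirical_cdf(samples, t) - f)
        worst = max(worst, deviation / math.sqrt(scale))
    return worst


if __name__ == "__main__":
    rng = random.Random(0)
    m = 1000
    samples = [rng.random() for _ in range(m)]  # Uniform(0,1), so F(t) = t

    # Assumption: c_0 = 1. The paper only states that an absolute constant exists.
    delta = math.log(math.log(m)) / m

    # Keep only grid points with F(t) in [delta, 1 - delta], as in the abstract.
    grid = [k / 200 for k in range(1, 200)]
    ts = [t for t in grid if delta <= t <= 1.0 - delta]

    ratio = normalized_deviation(samples, lambda t: t, ts)
    print("normalized deviation:", ratio)
    print("sqrt(delta):         ", math.sqrt(delta))
```

Because the absolute constants are not specified here, the two printed numbers should be compared only up to a constant factor.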

arXiv:2307.01181 [pdf, ps, other]

Fitting an ellipsoid to a quadratic number of random points

Authors: Afonso S. Bandeira, Antoine Maillard, Shahar Mendelson, Elliot Paquette

Abstract: We consider the problem $(\mathrm{P})$ of fitting $n$ standard Gaussian random vectors in $\mathbb{R}^d$ to the boundary of a centered ellipsoid, as $n, d \to \infty$. This problem is conjectured to have a sharp feasibility transition: for any $\varepsilon > 0$, if $n \leq (1 - \varepsilon) d^2 / 4$ then $(\mathrm{P})$ has a solution with high probability, while $(\mathrm{P})$ has no solutions with high probability if $n \geq (1 + \varepsilon) d^2 /4$. So far, only a trivial bound $n \geq d^2 / 2$ is known on the negative side, while the best results on the positive side assume $n \leq d^2 / \mathrm{polylog}(d)$. In this work, we improve over previous approaches using a key result of Bartl & Mendelson on the concentration of Gram matrices of random vectors under mild assumptions on their tail behavior. This allows us to give a simple proof that $(\mathrm{P})$ is feasible with high probability when $n \leq d^2 / C$, for a (possibly large) constant $C > 0$.

Submitted 3 July, 2023; originally announced July 2023.

Comments: 17 pages

arXiv:2305.07720 [pdf, other]

Catalytic Embeddings of Quantum Circuits

Authors: M. Amy, M. Crawford, A. N. Glaudell, M. L. Macasieb, S. S. Mendelson, N. J. Ross

Abstract: If a set $\mathbb{G}$ of quantum gates is countable, then the operators that can be exactly represented by a circuit over $\mathbb{G}$ form a strict subset of the collection of all unitary operators. When $\mathbb{G}$ is universal, one circumvents this limitation by resorting to repeated gate approximations: every occurrence of a gate which cannot be exactly represented over $\mathbb{G}$ is replaced by an approximating circuit. Here, we introduce catalytic embeddings, which provide an alternative to repeated gate approximations. With catalytic embeddings, approximations are relegated to the preparation of a fixed number of reusable resource states called catalysts. Because the catalysts only need to be prepared once, when catalytic embeddings exist they always produce shorter circuits, in the limit of increasing gate count and target precision. In the present paper, we lay the foundations of the theory of catalytic embeddings and we establish several of their structural properties. In addition, we provide methods to design catalytic embeddings, showing that their construction can be reduced to that of a single fixed matrix when the gates involved have entries in well-behaved rings of algebraic numbers. Finally, we showcase some concrete examples and applications. Notably, we show that catalytic embeddings generalize a technique previously used to implement the Quantum Fourier Transform over the Clifford+$T$ gate set with $O(n)$ gate approximations.

Submitted 12 May, 2023; originally announced May 2023.

arXiv:2209.07058 [pdf, ps, other]

Structure preservation via the Wasserstein distance

Authors: Daniel Bartl, Shahar Mendelson

Abstract: We show that under minimal assumptions on a random vector $X\in\mathbb{R}^d$ and with high probability, given $m$ independent copies of $X$, the coordinate distribution of each vector $(\langle X_i,θ\rangle)_{i=1}^m$ is dictated by the distribution of the true marginal $\langle X,θ\rangle$. Specifically, we show that with high probability, \[\sup_{θ\in S^{d-1}} \left( \frac{1}{m}\sum_{i=1}^m \left|\langle X_i,θ\rangle^\sharp - λ^θ_i \right|^2 \right)^{1/2} \leq c \left( \frac{d}{m} \right)^{1/4},\] where $λ^θ_i = m\int_{(\frac{i-1}{m}, \frac{i}{m}]} F_{ \langle X,θ\rangle }^{-1}(u)\,du$ and $a^\sharp$ denotes the monotone non-decreasing rearrangement of $a$. Moreover, this estimate is optimal. The proof follows from a sharp estimate on the worst Wasserstein distance between a marginal of $X$ and its empirical counterpart, $\frac{1}{m} \sum_{i=1}^m δ_{\langle X_i, θ\rangle}$.

Submitted 21 September, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

Comments: Original paper [v1] was split into two papers. Here is the first part. Second part is now called "Optimal non-gaussian Dvoretzky-Milman embeddings"

arXiv:2204.04109 [pdf, ps, other]

Fast metric embedding into the Hamming cube

Authors: Sjoerd Dirksen, Shahar Mendelson, Alexander Stollenwerk

Abstract: We consider the problem of embedding a subset of $\mathbb{R}^n$ into a low-dimensional Hamming cube in an almost isometric way. We construct a simple, data-oblivious, and computationally efficient map that achieves this task with high probability: we first apply a specific structured random matrix, which we call the double circulant matrix; using that matrix requires linear storage and matrix-vector multiplication can be performed in near-linear time. We then binarize each vector by comparing each of its entries to a random threshold, selected uniformly at random from a well-chosen interval. We estimate the number of bits required for this encoding scheme in terms of two natural geometric complexity parameters of the set - its Euclidean covering numbers and its localized Gaussian complexity. The estimate we derive turns out to be the best that one can hope for - up to logarithmic terms. The key to the proof is a phenomenon of independent interest: we show that the double circulant matrix mimics the behavior of a Gaussian matrix in two important ways. First, it maps an arbitrary set in $\mathbb{R}^n$ into a set of well-spread vectors. Second, it yields a fast near-isometric embedding of any finite subset of $\ell_2^n$ into $\ell_1^m$. This embedding achieves the same dimension reduction as a Gaussian matrix in near-linear time, under an optimal condition - up to logarithmic factors - on the number of points to be embedded. This improves a well-known construction due to Ailon and Chazelle.

Submitted 6 September, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

Comments: Added new, near-optimal result on fast near-isometric embedding of $\ell_2^n$ into $\ell_1^m$

arXiv:2201.05204 [pdf, ps, other]

Sharp estimates on random hyperplane tessellations

Authors: Sjoerd Dirksen, Shahar Mendelson, Alexander Stollenwerk

Abstract: We study the problem of generating a hyperplane tessellation of an arbitrary set $T$ in $\mathbb{R}^n$, ensuring that the Euclidean distance between any two points corresponds to the fraction of hyperplanes separating them up to a pre-specified error $δ$. We focus on random gaussian tessellations with uniformly distributed shifts and derive sharp bounds on the number of hyperplanes $m$ that are required. Surprisingly, our lower estimates falsify the conjecture that $m\sim \ell_*^2(T)/δ^2$, where $\ell_*^2(T)$ is the gaussian width of $T$, is optimal.

Submitted 13 January, 2022; originally announced January 2022.
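
The idea of a random gaussian tessellation with uniformly distributed shifts, as described in the abstract above, can be sketched in a few lines. This is only an illustration under stated assumptions, not the authors' construction or analysis: the shift range `lam`, the sample points, and the helper names (`random_hyperplanes`, `side_bits`, `separated_fraction`) are all hypothetical. For a shift range that is large compared with the norms of the points, the fraction of separating hyperplanes is expected to be close to $\sqrt{2/π}\,\|x-y\|_2/(2λ)$.

```python
import math
import random


def random_hyperplanes(n, m, lam, rng):
    """Draw m hyperplanes in R^n: a standard gaussian normal vector g
    and a shift tau drawn uniformly from [-lam, lam]."""
    return [
        ([rng.gauss(0.0, 1.0) for _ in range(n)], rng.uniform(-lam, lam))
        for _ in range(m)
    ]


def side_bits(x, planes):
    """One bit per hyperplane: 1 if <g, x> + tau >= 0, else 0."""
    return [
        1 if sum(gi * xi for gi, xi in zip(g, x)) + tau >= 0 else 0
        for g, tau in planes
    ]


def separated_fraction(x, y, planes):
    """Fraction of hyperplanes that put x and y on different sides."""
    bits_x = side_bits(x, planes)
    bits_y = side_bits(y, planes)
    return sum(a != b for a, b in zip(bits_x, bits_y)) / len(planes)


if __name__ == "__main__":
    rng = random.Random(0)
    n, m, lam = 5, 5000, 6.0  # illustrative values, chosen arbitrarily
    planes = random_hyperplanes(n, m, lam, rng)

    x = [0.5, 0.0, -0.3, 0.2, 0.1]
    y = [-0.2, 0.4, 0.1, 0.0, 0.3]

    distance = math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))
    predicted = math.sqrt(2.0 / math.pi) * distance / (2.0 * lam)

    print("separated fraction:", separated_fraction(x, y, planes))
    print("heuristic estimate:", predicted)
```

The bit vector returned by `side_bits` is a point of the Hamming cube $\{0,1\}^m$, so the separated fraction is a normalised Hamming distance between the encodings of the two points.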

arXiv:2106.15173 [pdf, ps, other]

Random embeddings with an almost Gaussian distortion

Authors: Daniel Bartl, Shahar Mendelson

Abstract: Let $X$ be a symmetric, isotropic random vector in $\mathbb{R}^m$ and let $X_1,\dots,X_n$ be independent copies of $X$. We show that under mild assumptions on $\|X\|_2$ (a suitable thin-shell bound) and on the tail-decay of the marginals $\langle X,u\rangle$, the random matrix $A$, whose columns are $X_i/\sqrt{m}$, exhibits a Gaussian-like behaviour in the following sense: for an arbitrary subset $T\subset \mathbb{R}^n$, the distortion $\sup_{t \in T} | \|At\|_2^2 - \|t\|_2^2 |$ is almost the same as if $A$ were a Gaussian matrix. A simple outcome of our result is that if $X$ is a symmetric, isotropic, log-concave random vector and $n \leq m \leq c_1(α)n^α$ for some $α>1$, then with high probability, the extremal singular values of $A$ satisfy the optimal estimate: $1-c_2(α) \sqrt{n/m} \leq λ_{\rm min} \leq λ_{\rm max} \leq 1+c_2(α) \sqrt{n/m}$.

Submitted 4 February, 2022; v1 submitted 29 June, 2021; originally announced June 2021.

Journal ref: Advances in Mathematics, 400:108261, 2022

arXiv:2103.05237 [pdf, ps, other]

Column randomization and almost-isometric embeddings

Showing 1–50 of 79 results for author: Mendelson, S