-
Rate of Convergence in Multiple SLE using Random Matrix Theory
Authors:
Andrew Campbell,
Kyle Luh,
Vlad Margarint
Abstract:
We provide an order of convergence for a version of the Carathéodory convergence for the multiple SLE model with a Dyson Brownian motion driver towards its hydrodynamic limit, for $β=1$ and $β=2$. The result is obtained by combining techniques from the field of Schramm-Loewner Evolutions with modern techniques from random matrices. Our approach shows how one can apply modern tools used in the proof of universality in random matrix theory, in the field of Schramm-Loewner Evolutions.
Submitted 11 January, 2023;
originally announced January 2023.
-
Extreme eigenvalues of Laplacian random matrices with Gaussian entries
Authors:
Andrew Campbell,
Kyle Luh,
Sean O'Rourke,
Santiago Arenas-Velilla,
Victor Pérez-Abreu
Abstract:
A Laplacian matrix is a real symmetric matrix whose row and column sums are zero. We investigate the limiting distribution of the largest eigenvalue of a Laplacian random matrix with Gaussian entries. Unlike many classical matrix ensembles, this random matrix model contains dependent entries. After properly shifting and scaling, we show the largest eigenvalue converges to the Gumbel distribution as the dimension of the matrix tends to infinity. While the largest diagonal entry is also shown to have Gumbel fluctuations, there is a rather surprising difference between its deterministic centering term and the centering term required for the largest eigenvalue.
Submitted 30 November, 2022;
originally announced November 2022.
-
Eigenvalue Gaps of Random Perturbations of Large Matrices
Authors:
Kyle Luh,
Ryan Vogel,
Alan Yu
Abstract:
The current work applies some recent combinatorial tools due to Jain to control the eigenvalue gaps of a matrix $M_n = M + N_n$ where $M$ is deterministic, symmetric with large operator norm and $N_n$ is a random symmetric matrix with subgaussian entries. One consequence of our tail bounds is that $M_n$ has simple spectrum with probability at least $1 - \exp(-n^{2/15})$ which improves on a result of Nguyen, Tao and Vu in terms of both the probability and the size of the matrix $M$.
Submitted 1 November, 2022;
originally announced November 2022.
-
Robustness Implies Generalization via Data-Dependent Generalization Bounds
Authors:
Kenji Kawaguchi,
Zhun Deng,
Kyle Luh,
Jiaoyang Huang
Abstract:
This paper proves that robustness implies generalization via data-dependent generalization bounds. As a result, robustness and generalization are shown to be connected closely in a data-dependent manner. Our bounds improve previous bounds in two directions, to solve an open problem that has seen little development since 2010. The first is to reduce the dependence on the covering number. The second is to remove the dependence on the hypothesis space. We present several examples, including ones for lasso and deep learning, in which our bounds are provably preferable. The experiments on real-world data and theoretical models demonstrate near-exponential improvements in various situations. To achieve these improvements, we do not require additional assumptions on the unknown distribution; instead, we only incorporate an observable and computable property of the training samples. A key technical innovation is an improved concentration bound for multinomial random variables that is of independent interest beyond robustness and generalization.
Submitted 3 August, 2022; v1 submitted 27 June, 2022;
originally announced June 2022.
-
A nonuniform Littlewood-Offord inequality for all norms
Authors:
Kyle Luh,
David Xiang
Abstract:
Let $\mathbf{v}_i$ be vectors in $\mathbb{R}^d$ and $\{\varepsilon_i\}$ be independent Rademacher random variables. Then the Littlewood-Offord problem entails finding the best upper bound for $\sup_{\mathbf{x} \in \mathbb{R}^d} \mathbb{P}(\sum \varepsilon_i \mathbf{v}_i = \mathbf{x})$. Generalizing the uniform bounds of Littlewood-Offord, Erdős and Kleitman, a recent result of Dzindzalieta and Juškevičius provides a non-uniform bound that is optimal in its dependence on $\|\mathbf{x}\|_2$. In this short note, we provide a simple alternative proof of their result. Furthermore, our proof demonstrates that the bound applies to any norm on $\mathbb{R}^d$, not just the $\ell_2$ norm. This resolves a conjecture of Dzindzalieta and Juškevičius.
Submitted 1 September, 2020; v1 submitted 27 August, 2020;
originally announced August 2020.
-
Circular Law for Random Block Band Matrices with Genuinely Sublinear Bandwidth
Authors:
Vishesh Jain,
Indrajit Jana,
Kyle Luh,
Sean O'Rourke
Abstract:
We prove the circular law for a class of non-Hermitian random block band matrices with genuinely sublinear bandwidth. Namely, we show there exists $τ\in (0,1)$ so that if the bandwidth of the matrix $X$ is at least $n^{1-τ}$ and the nonzero entries are iid random variables with mean zero and slightly more than four finite moments, then the limiting empirical eigenvalue distribution of $X$, when properly normalized, converges in probability to the uniform distribution on the unit disk in the complex plane. The key technical result is a least singular value bound for shifted random band block matrices with genuinely sublinear bandwidth, which improves on a result of Cook in the band matrix setting.
Submitted 15 July, 2021; v1 submitted 9 August, 2020;
originally announced August 2020.
-
Eigenvectors and controllability of non-Hermitian random matrices and directed graphs
Authors:
Kyle Luh,
Sean O'Rourke
Abstract:
We study the eigenvectors and eigenvalues of random matrices with iid entries. Let $N$ be a random matrix with iid entries which have symmetric distribution. For each unit eigenvector $\mathbf{v}$ of $N$ our main results provide a small ball probability bound for linear combinations of the coordinates of $\mathbf{v}$. Our results generalize the works of Meehan and Nguyen as well as Touri and the second author for random symmetric matrices. Along the way, we provide an optimal estimate of the probability that an iid matrix has simple spectrum, improving a recent result of Ge. Our techniques also allow us to establish analogous results for the adjacency matrix of a random directed graph, and as an application we establish controllability properties of network control systems on directed graphs.
Submitted 22 April, 2020;
originally announced April 2020.
-
Resilience of the Rank of Random Matrices
Authors:
Asaf Ferber,
Kyle Luh,
Gweneth McKinley
Abstract:
Let $M$ be an $n \times m$ matrix of independent Rademacher ($\pm 1$) random variables. It is well known that if $n \leq m$, then $M$ is of full rank with high probability. We show that this property is resilient to adversarial changes to $M$. More precisely, if $m \geq n + n^{1-\varepsilon/6}$, then even after changing the sign of $(1-\varepsilon)m/2$ entries, $M$ is still of full rank with high probability. Note that this is asymptotically best possible as one can easily make any two rows proportional with at most $m/2$ changes. Moreover, this theorem gives an asymptotic solution to a slightly weakened version of a conjecture made by Van Vu.
Submitted 8 October, 2019;
originally announced October 2019.
-
A Fast Spectral Algorithm for Mean Estimation with Sub-Gaussian Rates
Authors:
Zhixian Lei,
Kyle Luh,
Prayaag Venkat,
Fred Zhang
Abstract:
We study the algorithmic problem of estimating the mean of a heavy-tailed random vector in $\mathbb{R}^d$, given $n$ i.i.d. samples. The goal is to design an efficient estimator that attains the optimal sub-gaussian error bound, only assuming that the random vector has bounded mean and covariance. Polynomial-time solutions to this problem are known but have high runtime due to their use of semi-definite programming (SDP). Conceptually, it remains open whether convex relaxation is truly necessary for this problem.
In this work, we show that it is possible to go beyond SDP and achieve better computational efficiency. In particular, we provide a spectral algorithm that achieves the optimal statistical performance and runs in time $\widetilde O\left(n^2 d \right)$, improving upon the previous fastest runtime $\widetilde O\left(n^{3.5}+ n^2d\right)$ by Cherapanamjeri et al. (COLT '19). Our algorithm is spectral in that it only requires (approximate) eigenvector computations, which can be implemented very efficiently by, for example, power iteration or the Lanczos method.
At the core of our algorithm is a novel connection between the furthest hyperplane problem introduced by Karnin et al. (COLT '12) and a structural lemma on heavy-tailed distributions by Lugosi and Mendelson (Ann. Stat. '19). This allows us to iteratively reduce the estimation error at a geometric rate using only the information derived from the top singular vector of the data matrix, leading to a significantly faster running time.
Submitted 17 February, 2020; v1 submitted 12 August, 2019;
originally announced August 2019.
-
Some new results in random matrices over finite fields
Authors:
Kyle Luh,
Sean Meehan,
Hoi H. Nguyen
Abstract:
In this note we give various characterizations of random walks with possibly different steps that have relatively large discrepancy from the uniform distribution modulo a prime $p$, and use these results to study the distribution of the rank of random matrices over $\mathbb{F}_p$ and the equi-distribution behavior of normal vectors of random hyperplanes. We also study the probability that a random square matrix is eigenvalue-free, or when its characteristic polynomial is divisible by a given irreducible polynomial in the limit $n \to \infty$ in $\mathbb{F}_p$. We show that these statistics are universal, extending results of Stong and Neumann-Praeger beyond the uniform model.
Submitted 26 December, 2019; v1 submitted 4 July, 2019;
originally announced July 2019.
-
On the counting problem in inverse Littlewood--Offord theory
Authors:
Asaf Ferber,
Vishesh Jain,
Kyle Luh,
Wojciech Samotij
Abstract:
Let $ε_1, \dotsc, ε_n$ be i.i.d. Rademacher random variables taking values $\pm 1$ with probability $1/2$ each. Given an integer vector $\boldsymbol{a} = (a_1, \dotsc, a_n)$, its concentration probability is the quantity $ρ(\boldsymbol{a}):=\sup_{x\in \mathbb{Z}}\Pr(ε_1 a_1+\dots+ε_n a_n = x)$. The Littlewood-Offord problem asks for bounds on $ρ(\boldsymbol{a})$ under various hypotheses on $\boldsymbol{a}$, whereas the inverse Littlewood-Offord problem, posed by Tao and Vu, asks for a characterization of all vectors $\boldsymbol{a}$ for which $ρ(\boldsymbol{a})$ is large. In this paper, we study the associated counting problem: How many integer vectors $\boldsymbol{a}$ belonging to a specified set have large $ρ(\boldsymbol{a})$? The motivation for our study is that in typical applications, the inverse Littlewood-Offord theorems are only used to obtain such counting estimates. Using a more direct approach, we obtain significantly better bounds for this problem than those obtained using the inverse Littlewood--Offord theorems of Tao and Vu and of Nguyen and Vu. Moreover, we develop a framework for deriving upper bounds on the probability of singularity of random discrete matrices that utilizes our counting result. To illustrate the methods, we present the first `exponential-type' (i.e., $\exp(-n^c)$ for some positive constant $c$) upper bounds on the singularity probability for the following two models: (i) adjacency matrices of dense signed random regular digraphs, for which the previous best known bound is $O(n^{-1/4})$ due to Cook; and (ii) dense row-regular $\{0,1\}$-matrices, for which the previous best known bound is $O_{C}(n^{-C})$ for any constant $C>0$ due to Nguyen.
Submitted 23 April, 2019;
originally announced April 2019.
-
An Improved Lower Bound for Sparse Reconstruction from Subsampled Walsh Matrices
Authors:
Jarosław Błasiok,
Patrick Lopatto,
Kyle Luh,
Jake Marcinek,
Shravas Rao
Abstract:
We give a short argument that yields a new lower bound on the number of subsampled rows from a bounded, orthonormal matrix necessary to form a matrix with the restricted isometry property. We show that a matrix formed by uniformly subsampling rows of an $N \times N$ Walsh matrix contains a $K$-sparse vector in the kernel, unless the number of subsampled rows is $Ω(K \log K \log (N/K))$ -- our lower bound applies whenever $\min(K, N/K) > \log^C N$. Containing a sparse vector in the kernel precludes not only the restricted isometry property, but more generally the application of those matrices for uniform sparse recovery.
Submitted 9 May, 2023; v1 submitted 28 March, 2019;
originally announced March 2019.
-
Four Deviations Suffice for Rank 1 Matrices
Authors:
Rasmus Kyng,
Kyle Luh,
Zhao Song
Abstract:
We prove a matrix discrepancy bound that strengthens the famous Kadison-Singer result of Marcus, Spielman, and Srivastava. Consider any independent scalar random variables $ξ_1, \ldots, ξ_n$ with finite support, e.g.
$\{ \pm 1 \}$ or $\{ 0,1 \}$-valued random variables, or some combination thereof. Let $u_1, \dots, u_n \in \mathbb{C}^m$ and $$ σ^2 = \left\| \sum_{i=1}^n \text{Var}[ ξ_i ] (u_i u_i^{*})^2 \right\|. $$ Then there exists a choice of outcomes $\varepsilon_1,\ldots,\varepsilon_n$ in the support of $ξ_1, \ldots, ξ_n$ s.t. $$ \left \|\sum_{i=1}^n \mathbb{E} [ ξ_i] u_i u_i^* - \sum_{i=1}^n \varepsilon_i u_i u_i^* \right \| \leq 4 σ. $$ A simple consequence of our result is an improvement of a Lyapunov-type theorem of Akemann and Weaver.
Submitted 4 August, 2020; v1 submitted 20 January, 2019;
originally announced January 2019.
-
Tail bounds for gaps between eigenvalues of sparse random matrices
Authors:
Patrick Lopatto,
Kyle Luh
Abstract:
We prove the first eigenvalue repulsion bound for sparse random matrices. As a consequence, we show that these matrices have simple spectrum, improving the range of sparsity and error probability from the work of the second author and Vu. As an application of our tail bounds, we show that for sparse Erdős--Rényi graphs, weak and strong nodal domains are the same, answering a question of Dekel, Lee, and Linial.
Submitted 18 December, 2020; v1 submitted 17 January, 2019;
originally announced January 2019.
-
Eigenvector Delocalization for Non-Hermitian Random Matrices and Applications
Authors:
Kyle Luh,
Sean O'Rourke
Abstract:
Improving upon results of Rudelson and Vershynin, we establish delocalization bounds for eigenvectors of independent-entry random matrices. In particular, we show that with high probability every eigenvector is delocalized, meaning any subset of its coordinates carries an appropriate proportion of its mass. Our results hold for random matrices with genuinely complex as well as real entries. In both cases, our bounds match numerical simulations, up to lower order terms, indicating the optimality of our results. As an application of our methods, we also establish delocalization bounds for normal vectors to random hyperplanes. The proofs of our main results rely on a least singular value bound for genuinely complex rectangular random matrices, which generalizes a previous bound due to the first author, and may be of independent interest.
Submitted 30 January, 2019; v1 submitted 30 September, 2018;
originally announced October 2018.
-
Sparse Random Matrices have Simple Spectrum
Authors:
Kyle Luh,
Van Vu
Abstract:
Let $M_n$ be a class of symmetric sparse random matrices, with independent entries $M_{ij} = δ_{ij} ξ_{ij}$ for $i \leq j$. $δ_{ij}$ are i.i.d. Bernoulli random variables taking the value $1$ with probability $p \geq n^{-1+δ}$ for any constant $δ> 0$ and $ξ_{ij}$ are i.i.d. centered, subgaussian random variables. We show that with high probability this class of random matrices has simple spectrum (i.e. the eigenvalues appear with multiplicity one). We can slightly modify our proof to show that the adjacency matrix of a sparse Erdős-Rényi graph has simple spectrum for $n^{-1+δ} \leq p \leq 1- n^{-1+δ}$. These results are optimal in the exponent. The result for graphs has connections to the notorious graph isomorphism problem.
Submitted 18 February, 2018; v1 submitted 10 February, 2018;
originally announced February 2018.
-
Optimal Threshold for a Random Graph to be 2-Universal
Authors:
Asaf Ferber,
Gal Kronenberg,
Kyle Luh
Abstract:
For a family of graphs $\mathcal{F}$, a graph $G$ is $\mathcal{F}$-universal if $G$ contains every graph in $\mathcal{F}$ as a (not necessarily induced) subgraph. For the family of all graphs on $n$ vertices and of maximum degree at most two, $\mathcal{H}(n,2)$, we prove that there exists a constant $C$ such that for $p \geq C \left( \frac{\log n}{n^2} \right)^{\frac{1}{3}}$, the binomial random graph $G(n,p)$ is typically $\mathcal{H}(n,2)$-universal. This bound is optimal up to the constant factor as illustrated in the seminal work of Johansson, Kahn, and Vu for triangle factors. Our result improves significantly on the previous best bound of $p \geq C \left(\frac{\log n}{n}\right)^{\frac{1}{2}}$ due to Kim and Lee. In fact, we prove the stronger result that for the family of all graphs on $n$ vertices, of maximum degree at most two and of girth at least $\ell$, $\mathcal{H}^{\ell}(n,2)$, $G(n,p)$ is typically $\mathcal H^{\ell}(n,2)$-universal when $p \geq C \left(\frac{\log n}{n^{\ell -1}}\right)^{\frac{1}{\ell}}$. This result is also optimal up to the constant factor. Our results verify (in a weak form) a classical conjecture of Kahn and Kalai.
Submitted 18 December, 2016;
originally announced December 2016.
-
Complex Random Matrices have no Real Eigenvalues
Authors:
Kyle Luh
Abstract:
Let $ζ= ξ+ iξ'$ where $ξ, ξ'$ are iid copies of a mean zero, variance one, subgaussian random variable. Let $N_n$ be an $n \times n$ random matrix with entries that are iid copies of $ζ$. We prove that there exists a $c \in (0,1)$ such that the probability that $N_n$ has any real eigenvalues is less than $c^n$ where $c$ only depends on the subgaussian moment of $ξ$. The bound is optimal up to the value of the constant $c$. The principal component of the proof is an optimal tail bound on the least singular value of matrices of the form $M_n := M + N_n$ where $M$ is a deterministic complex matrix with the condition that $\|M\| \leq K n^{1/2}$ for some constant $K$ depending on the subgaussian moment of $ξ$. For this class of random variables, this result improves on the results of Pan-Zhou and Rudelson-Vershynin. In the proof of the tail bound, we develop an optimal small-ball probability bound for complex random variables that generalizes the Littlewood-Offord theory developed by Tao-Vu and Rudelson-Vershynin.
Submitted 8 October, 2017; v1 submitted 24 September, 2016;
originally announced September 2016.
-
Packing Loose Hamilton Cycles
Authors:
Asaf Ferber,
Kyle Luh,
Daniel Montealegre,
Oanh Nguyen
Abstract:
A subset $C$ of edges in a $k$-uniform hypergraph $H$ is a \emph{loose Hamilton cycle} if $C$ covers all the vertices of $H$ and there exists a cyclic ordering of these vertices such that the edges in $C$ are segments of that order and such that every two consecutive edges share exactly one vertex. The binomial random $k$-uniform hypergraph $H^k_{n,p}$ has vertex set $[n]$ and an edge set $E$ obtained by adding each $k$-tuple $e\in \binom{[n]}{k}$ to $E$ with probability $p$, independently at random.
Here we consider the problem of finding edge-disjoint loose Hamilton cycles covering all but $o(|E|)$ edges, referred to as the \emph{packing problem}. While it is known that the threshold probability for the appearance of a loose Hamilton cycle in $H^k_{n,p}$ is $p=Θ\left(\frac{\log n}{n^{k-1}}\right)$, the best known bounds for the packing problem are around $p=\text{polylog}(n)/n$. Here we make substantial progress and prove the following asymptotically (up to a polylog$(n)$ factor) best possible result: For $p\geq \log^{C}n/n^{k-1}$, a random $k$-uniform hypergraph $H^k_{n,p}$ with high probability contains $N:=(1-o(1))\frac{\binom{n}{k}p}{n/(k-1)}$ edge-disjoint loose Hamilton cycles.
Our proof utilizes and modifies the idea of "online sprinkling" recently introduced by Vu and the first author.
Submitted 3 August, 2016;
originally announced August 2016.
-
Embedding large graphs into a random graph
Authors:
Asaf Ferber,
Kyle Luh,
Oanh Nguyen
Abstract:
In this paper we consider the problem of embedding almost-spanning, bounded degree graphs in a random graph. In particular, let $Δ\geq 5$, $\varepsilon > 0$ and let $H$ be a graph on $(1-\varepsilon)n$ vertices and with maximum degree $Δ$. We show that a random graph $G_{n,p}$ with high probability contains a copy of $H$, provided that $p\gg (n^{-1}\log^{1/Δ}n)^{2/(Δ+1)}$. Our assumption on $p$ is optimal up to the $polylog$ factor. We note that this $polylog$ term matches the conjectured threshold for the spanning case.
Submitted 2 August, 2017; v1 submitted 19 June, 2016;
originally announced June 2016.
-
Dictionary Learning with Few Samples and Matrix Concentration
Authors:
Kyle Luh,
Van Vu
Abstract:
Let $A$ be an $n \times n$ matrix, $X$ be an $n \times p$ matrix and $Y = AX$. A challenging and important problem in data analysis, motivated by dictionary learning and other practical problems, is to recover both $A$ and $X$, given $Y$. Under normal circumstances, it is clear that this problem is underdetermined. However, in the case when $X$ is sparse and random, Spielman, Wang and Wright showed that one can recover both $A$ and $X$ efficiently from $Y$ with high probability, given that $p$ (the number of samples) is sufficiently large. Their method works for $p \ge C n^2 \log^ 2 n$ and they conjectured that $p \ge C n \log n$ suffices. The bound $n \log n$ is sharp for an obvious information theoretical reason.
In this paper, we show that $p \ge C n \log^4 n$ suffices, matching the conjectural bound up to a polylogarithmic factor. The core of our proof is a theorem concerning $l_1$ concentration of random matrices, which is of independent interest.
Our proof of the concentration result is based on two ideas. The first is an economical way to apply the union bound. The second is a refined version of Bernstein's concentration inequality for the sum of independent variables. Both have nothing to do with random matrices and are applicable in general settings.
Submitted 30 March, 2015;
originally announced March 2015.
-
Martingale Couplings and Bounds on the Tails of Probability Distributions
Authors:
Kyle J. Luh,
Nicholas Pippenger
Abstract:
Hoeffding has shown that tail bounds on the distribution for sampling from a finite population with replacement also apply to the corresponding cases of sampling without replacement. (A special case of this result is that binomial tail bounds apply to the corresponding hypergeometric tails.) We give a new proof of Hoeffding's result by constructing a martingale coupling between the sampling distributions. This construction is given by an explicit combinatorial procedure involving balls and urns. We then apply this construction to create martingale couplings between other pairs of sampling distributions, both without replacement and with "surreplacement" (that is, sampling in which not only is the sampled individual replaced, but some number of "copies" of that individual are added to the population).
Submitted 7 July, 2011;
originally announced July 2011.