Skip to main content

Showing 1–50 of 79 results for author: Mendelson, S

.
  1. arXiv:2402.08288  [pdf, ps, other

    math.ST math.PR

    Covariance estimation with direction dependence accuracy

    Authors: Pedro Abdalla, Shahar Mendelson

    Abstract: We construct an estimator $\widehatΣ$ for covariance matrices of unknown, centred random vectors X, with the given data consisting of N independent measurements $X_1,...,X_N$ of X and the wanted confidence level. We show under minimal assumptions on X, the estimator performs with the optimal accuracy with respect to the operator norm. In addition, the estimator is also optimal with respect to dire… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  2. arXiv:2312.06442  [pdf, ps, other

    math.PR

    A uniform Dvoretzky-Kiefer-Wolfowitz inequality

    Authors: Daniel Bartl, Shahar Mendelson

    Abstract: We show that under minimal assumption on a class of functions $\mathcal{H}$ defined on a probability space $(\mathcal{X},μ)$, there is a threshold $Δ_0$ satisfying the following: for every $Δ\geqΔ_0$, with probability at least $1-2\exp(-cΔm)$ with respect to $μ^{\otimes m}$, \[ \sup_{h\in\mathcal{H}} \sup_{t\in\mathbb{R}} \left| \mathbb{P}(h(X)\leq t) - \frac{1}{m}\sum_{i=1}^m 1_{(-\infty,t]}(h(… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  3. arXiv:2311.07741  [pdf, ps, other

    quant-ph

    Exact Synthesis of Multiqubit Clifford-Cyclotomic Circuits

    Authors: Matthew Amy, Andrew N. Glaudell, Shaun Kelso, William Maxwell, Samuel S. Mendelson, Neil J. Ross

    Abstract: Let $n\geq 8$ be divisible by 4. The Clifford-cyclotomic gate set $\mathcal{G}_n$ is the universal gate set obtained by extending the Clifford gates with the $z$-rotation $T_n = \mathrm{diag}(1,ζ_n)$, where $ζ_n$ is a primitive $n$-th root of unity. In this note, we show that, when $n$ is a power of 2, a multiqubit unitary matrix $U$ can be exactly represented by a circuit over $\mathcal{G}_n$ if… ▽ More

    Submitted 12 April, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

  4. arXiv:2311.07675  [pdf, other

    math.CO math.SP

    Spectral properties of random graphs with fixed equitable partition

    Authors: Matthew B. Crawford, David J. Marchette, William Maxwell, Samuel S. Mendelson

    Abstract: We define a graph to be $S$-regular if it contains an equitable partition given by a matrix $S$. These graphs are generalizations of both regular and bipartite, biregular graphs. An $S$-regular matrix is defined then as a matrix on an $S$-regular graph consistent with the graph's equitable partition. In this paper we derive the limiting spectral density for large, random $S$-regular matrices as we… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 24 pages, 3 figures

    MSC Class: 05C75 (Primary) 60B20; 05C80 (Secondary)

  5. arXiv:2309.12069  [pdf, ps, other

    math.FA math.PR

    Optimal non-gaussian Dvoretzky-Milman embeddings

    Authors: Daniel Bartl, Shahar Mendelson

    Abstract: We construct the first non-gaussian ensemble that yields the optimal estimate in the Dvoretzky-Milman Theorem: the ensemble exhibits almost Euclidean sections in arbitrary normed spaces of the same dimension as the gaussian embedding -- despite being very far from gaussian (in fact, it happens to be heavy-tailed).

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: This is part two of the paper "Structure preservation via the Wasserstein distance" (arXiv:2209.07058v1) which was split into two parts

    Journal ref: International Mathematics Research Notices, 2023+

  6. arXiv:2309.02013  [pdf, ps, other

    math.PR math.FA

    Empirical approximation of the gaussian distribution in $\mathbb{R}^d$

    Authors: Daniel Bartl, Shahar Mendelson

    Abstract: Let $G_1,\dots,G_m$ be independent copies of the standard gaussian random vector in $\mathbb{R}^d$. We show that there is an absolute constant $c$ such that for any $A \subset S^{d-1}$, with probability at least $1-2\exp(-cΔm)$, for every $t\in\mathbb{R}$, \[ \sup_{x \in A} \left| \frac{1}{m}\sum_{i=1}^m 1_{ \{\langle G_i,x\rangle \leq t \}} - \mathbb{P}(\langle G,x\rangle \leq t) \right| \leq Δ+… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  7. arXiv:2308.04757  [pdf, ps, other

    math.PR math.ST

    On a variance dependent Dvoretzky-Kiefer-Wolfowitz inequality

    Authors: Daniel Bartl, Shahar Mendelson

    Abstract: Let $X$ be a real-valued random variable with distribution function $F$. Set $X_1,\dots, X_m$ to be independent copies of $X$ and let $F_m$ be the corresponding empirical distribution function. We show that there are absolute constants $c_0$ and $c_1$ such that if $Δ\geq c_0\frac{\log\log m}{m}$, then with probability at least $1-2\exp(-c_1Δm)$, for every $t\in\mathbb{R}$ that satisfies… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

  8. arXiv:2307.01181  [pdf, ps, other

    math.PR cs.DS cs.LG math.ST stat.ML

    Fitting an ellipsoid to a quadratic number of random points

    Authors: Afonso S. Bandeira, Antoine Maillard, Shahar Mendelson, Elliot Paquette

    Abstract: We consider the problem $(\mathrm{P})$ of fitting $n$ standard Gaussian random vectors in $\mathbb{R}^d$ to the boundary of a centered ellipsoid, as $n, d \to \infty$. This problem is conjectured to have a sharp feasibility transition: for any $\varepsilon > 0$, if $n \leq (1 - \varepsilon) d^2 / 4$ then $(\mathrm{P})$ has a solution with high probability, while $(\mathrm{P})$ has no solutions wit… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: 17 pages

  9. arXiv:2305.07720  [pdf, other

    quant-ph

    Catalytic Embeddings of Quantum Circuits

    Authors: M. Amy, M. Crawford, A. N. Glaudell, M. L. Macasieb, S. S. Mendelson, N. J. Ross

    Abstract: If a set $\mathbb{G}$ of quantum gates is countable, then the operators that can be exactly represented by a circuit over $\mathbb{G}$ form a strict subset of the collection of all unitary operators. When $\mathbb{G}$ is universal, one circumvents this limitation by resorting to repeated gate approximations: every occurrence of a gate which cannot be exactly represented over $\mathbb{G}$ is replac… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  10. arXiv:2209.07058  [pdf, ps, other

    math.ST math.FA math.PR

    Structure preservation via the Wasserstein distance

    Authors: Daniel Bartl, Shahar Mendelson

    Abstract: We show that under minimal assumptions on a random vector $X\in\mathbb{R}^d$ and with high probability, given $m$ independent copies of $X$, the coordinate distribution of each vector $(\langle X_i,θ\rangle)_{i=1}^m$ is dictated by the distribution of the true marginal $\langle X,θ\rangle$. Specifically, we show that with high probability, \[\sup_{θ\in S^{d-1}} \left( \frac{1}{m}\sum_{i=1}^m \left… ▽ More

    Submitted 21 September, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: Original paper [v1] was split into two papers. Here is the first part. Second part is now called "Optimal non-gaussian Dvoretzky-Milman embeddings"

  11. arXiv:2204.04109  [pdf, ps, other

    math.PR cs.IT

    Fast metric embedding into the Hamming cube

    Authors: Sjoerd Dirksen, Shahar Mendelson, Alexander Stollenwerk

    Abstract: We consider the problem of embedding a subset of $\mathbb{R}^n$ into a low-dimensional Hamming cube in an almost isometric way. We construct a simple, data-oblivious, and computationally efficient map that achieves this task with high probability: we first apply a specific structured random matrix, which we call the double circulant matrix; using that matrix requires linear storage and matrix-vect… ▽ More

    Submitted 6 September, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: Added new, near-optimal result on fast near-isometric embedding of $\ell_2^n$ into $\ell_1^m$

  12. arXiv:2201.05204  [pdf, ps, other

    math.PR cs.IT

    Sharp estimates on random hyperplane tessellations

    Authors: Sjoerd Dirksen, Shahar Mendelson, Alexander Stollenwerk

    Abstract: We study the problem of generating a hyperplane tessellation of an arbitrary set $T$ in $\mathbb{R}^n$, ensuring that the Euclidean distance between any two points corresponds to the fraction of hyperplanes separating them up to a pre-specified error $δ$. We focus on random gaussian tessellations with uniformly distributed shifts and derive sharp bounds on the number of hyperplanes $m$ that are re… ▽ More

    Submitted 13 January, 2022; originally announced January 2022.

  13. arXiv:2106.15173  [pdf, ps, other

    math.FA math.PR

    Random embeddings with an almost Gaussian distortion

    Authors: Daniel Bartl, Shahar Mendelson

    Abstract: Let $X$ be a symmetric, isotropic random vector in $\mathbb{R}^m$ and let $X_1...,X_n$ be independent copies of $X$. We show that under mild assumptions on $\|X\|_2$ (a suitable thin-shell bound) and on the tail-decay of the marginals $\langle X,u\rangle$, the random matrix $A$, whose columns are $X_i/\sqrt{m}$ exhibits a Gaussian-like behaviour in the following sense: for an arbitrary subset of… ▽ More

    Submitted 4 February, 2022; v1 submitted 29 June, 2021; originally announced June 2021.

    Journal ref: Advances in Mathematics, 400:108261, 2022

  14. arXiv:2103.05237  [pdf, ps, other

    math.ST math.FA

    Column randomization and almost-isometric embeddings

    Authors: Shahar Mendelson

    Abstract: The matrix $A:\mathbb{R}^n \to \mathbb{R}^m$ is $(δ,k)$-regular if for any $k$-sparse vector $x$, $$ \left| \|Ax\|_2^2-\|x\|_2^2\right| \leq δ\sqrt{k} \|x\|_2^2. $$ We show that if $A$ is $(δ,k)$-regular for $1 \leq k \leq 1/δ^2$, then by multiplying the columns of $A$ by independent random signs, the resulting random ensemble $A_ε$ acts on an arbitrary subset $T \subset \mathbb{R}^n$ (almost) as… ▽ More

    Submitted 9 March, 2021; originally announced March 2021.

  15. arXiv:2101.07794  [pdf, ps, other

    math.ST math.OC math.PR q-fin.MF

    On Monte-Carlo methods in convex stochastic optimization

    Authors: Daniel Bartl, Shahar Mendelson

    Abstract: We develop a novel procedure for estimating the optimizer of general convex stochastic optimization problems of the form $\min_{x\in\mathcal{X}} \mathbb{E}[F(x,ξ)]$, when the given data is a finite independent sample selected according to $ξ$. The procedure is based on a median-of-means tournament, and is the first procedure that exhibits the optimal statistical performance in heavy tailed situati… ▽ More

    Submitted 25 January, 2022; v1 submitted 19 January, 2021; originally announced January 2021.

    Journal ref: Annals of Applied Probability, 2022+

  16. arXiv:2010.14404  [pdf, ps, other

    math.FA

    An isomorphic Dvoretzky-Milman Theorem using general random ensembles

    Authors: Shahar Mendelson

    Abstract: We construct rather general random ensembles that yield the optimal (isomorphic) estimate in the Dvoretzky-Milman Theorem. This is the first construction of non gaussian/spherical ensembles that exhibit the optimal behaviour. The ensembles constructed here need not satisfy any rotation invariance and can be rather heavy-tailed.

    Submitted 27 October, 2020; originally announced October 2020.

    MSC Class: 46B06; 46B09

  17. arXiv:2010.11921  [pdf, ps, other

    math.ST math.PR stat.ML

    Multivariate mean estimation with direction-dependent accuracy

    Authors: Gabor Lugosi, Shahar Mendelson

    Abstract: We consider the problem of estimating the mean of a random vector based on $N$ independent, identically distributed observations. We prove the existence of an estimator that has a near-optimal error in all directions in which the variance of the one dimensional marginal of the random vector is not too small: with probability $1-δ$, the procedure returns $\whμ_N$ which satisfies that for every dire… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

  18. arXiv:2008.08380  [pdf, ps, other

    math.PR math.FA

    Approximating $L_p$ unit balls via random sampling

    Authors: Shahar Mendelson

    Abstract: Let $X$ be an isotropic random vector in $R^d$ that satisfies that for every $v \in S^{d-1}$, $\|<X,v>\|_{L_q} \leq L \|<X,v>\|_{L_p}$ for some $q \geq 2p$. We show that for $0<\varepsilon<1$, a set of $N = c(p,q,\varepsilon) d$ random points, selected independently according to $X$, can be used to construct a $1 \pm \varepsilon$ approximation of the $L_p$ unit ball endowed on $R^d$ by $X$. Moreov… ▽ More

    Submitted 19 August, 2020; originally announced August 2020.

  19. arXiv:2004.00303  [pdf, other

    cs.CV

    Transfer Learning of Photometric Phenotypes in Agriculture Using Metadata

    Authors: Dan Halbersberg, Aharon Bar Hillel, Shon Mendelson, Daniel Koster, Lena Karol, Boaz Lerner

    Abstract: Estimation of photometric plant phenotypes (e.g., hue, shine, chroma) in field conditions is important for decisions on the expected yield quality, fruit ripeness, and need for further breeding. Estimating these from images is difficult due to large variances in lighting conditions, shadows, and sensor properties. We combine the image and metadata regarding capturing conditions embedded into a net… ▽ More

    Submitted 1 April, 2020; originally announced April 2020.

    Comments: Paper presented at the ICLR 2020 Workshop on Computer Vision for Agriculture (CV4A)

  20. arXiv:2002.01182  [pdf, ps, other

    stat.ML cs.LG math.ST

    Learning bounded subsets of $L_p$

    Authors: Shahar Mendelson

    Abstract: We study learning problems in which the underlying class is a bounded subset of $L_p$ and the target $Y$ belongs to $L_p$. Previously, minimax sample complexity estimates were known under such boundedness assumptions only when $p=\infty$. We present a sharp sample complexity estimate that holds for any $p > 4$. It is based on a learning procedure that is suited for heavy-tailed problems.

    Submitted 4 February, 2020; originally announced February 2020.

  21. arXiv:1907.11391  [pdf, ps, other

    math.ST

    Robust multivariate mean estimation: the optimality of trimmed mean

    Authors: Gabor Lugosi, Shahar Mendelson

    Abstract: We consider the problem of estimating the mean of a random vector based on i.i.d. observations and adversarial contamination. We introduce a multivariate extension of the trimmed-mean estimator and show its optimal performance under minimal conditions.

    Submitted 22 February, 2020; v1 submitted 26 July, 2019; originally announced July 2019.

  22. arXiv:1907.07258  [pdf, ps, other

    math.PR cs.IT

    On the geometry of polytopes generated by heavy-tailed random vectors

    Authors: Olivier Guédon, Felix Krahmer, Christian Kümmerle, Shahar Mendelson, Holger Rauhut

    Abstract: We study the geometry of centrally-symmetric random polytopes, generated by $N$ independent copies of a random vector $X$ taking values in $\mathbb{R}^n$. We show that under minimal assumptions on $X$, for $N \gtrsim n$ and with high probability, the polytope contains a deterministic set that is naturally associated with the random vector---namely, the polar of a certain floating body. This solves… ▽ More

    Submitted 16 July, 2019; originally announced July 2019.

    Comments: 23 pages

    MSC Class: 52A22; 46B06; 60B20; 65K10; 52A23; 46B09; 15B52

  23. arXiv:1907.05335  [pdf, ps, other

    math.RA

    On a special presentation of matrix algebras

    Authors: Geir Agnarsson, Samuel S. Mendelson

    Abstract: Recognizing when a ring is a complete matrix ring is of significant importance in algebra. It is well-known folklore that a ring $R$ is a complete $n\times n$ matrix ring, so $R\cong M_{n}(S)$ for some ring $S$, if and only if it contains a set of $n\times n$ matrix units $\{e_{ij}\}_{i,j=1}^n$. A more recent and less known result states that a ring $R$ is a complete $(m+n)\times(m+n)$ matrix ring… ▽ More

    Submitted 11 July, 2019; originally announced July 2019.

    Comments: 32 pages

    MSC Class: 15B33; 16S15; 16S50

  24. arXiv:1906.04280  [pdf, ps, other

    math.ST cs.LG stat.ML

    Mean estimation and regression under heavy-tailed distributions--a survey

    Authors: Gabor Lugosi, Shahar Mendelson

    Abstract: We survey some of the recent advances in mean estimation and regression function estimation. In particular, we describe sub-Gaussian mean estimators for possibly heavy-tailed data both in the univariate and multivariate settings. We focus on estimators based on median-of-means techniques but other methods such as the trimmed mean and Catoni's estimator are also reviewed. We give detailed proofs fo… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

  25. arXiv:1904.08992  [pdf, other

    quant-ph cs.LG

    Quantum-Assisted Clustering Algorithms for NISQ-Era Devices

    Authors: Samuel S. Mendelson, Robert W. Strand, Guy B. Oldaker IV, Jacob M. Farinholt

    Abstract: In the NISQ-era of quantum computing, we should not expect to see quantum devices that provide an exponential improvement in runtime for practical problems, due to the lack of error correction and small number of qubits available. Nevertheless, these devices should be able to provide other performance improvements, particularly when combined with existing classical machines. In this article, we de… ▽ More

    Submitted 27 June, 2019; v1 submitted 18 April, 2019; originally announced April 2019.

    Comments: 12 pages, 11 figures, 2 tables, 1 Appendix. Latest Version: Evaluated runtimes, edited content for readability, included further discussions

  26. arXiv:1904.08532  [pdf, ps, other

    math.PR math.ST

    Stable recovery and the coordinate small-ball behaviour of random vectors

    Authors: Shahar Mendelson, Grigoris Paouris

    Abstract: Recovery procedures in various application in Data Science are based on \emph{stable point separation}. In its simplest form, stable point separation implies that if $f$ is "far away" from $0$, and one is given a random sample $(f(Z_i))_{i=1}^m$ where a proportional number of the sample points may be corrupted by noise, that information is still enough to exhibit that $f$ is far from $0$. Stable… ▽ More

    Submitted 17 April, 2019; originally announced April 2019.

  27. arXiv:1902.01664  [pdf, ps, other

    math.FA

    On the geometry of random polytopes

    Authors: Shahar Mendelson

    Abstract: We present a simple proof to a fact recently established in [5]: let $ξ$ be a symmetric random variable that has variance $1$, let $Γ=(ξ_{ij})$ be an $N \times n$ random matrix whose entries are independent copies of $ξ$, and set $X_1,...,X_N$ to be the rows of $Γ$. Then under minimal assumptions on $ξ$ and as long as $N \geq c_1n$,… ▽ More

    Submitted 5 February, 2019; originally announced February 2019.

  28. arXiv:1812.06719  [pdf, ps, other

    cs.IT eess.SP math.PR

    Robust one-bit compressed sensing with partial circulant matrices

    Authors: Sjoerd Dirksen, Shahar Mendelson

    Abstract: We present optimal sample complexity estimates for one-bit compressed sensing problems in a realistic scenario: the procedure uses a structured matrix (a randomly sub-sampled circulant matrix) and is robust to analog pre-quantization noise as well as to adversarial bit corruptions in the quantization process. Our results imply that quantization is not a statistically expensive procedure in the pre… ▽ More

    Submitted 17 December, 2018; originally announced December 2018.

  29. arXiv:1809.10462  [pdf, ps, other

    math.ST

    Robust covariance estimation under $L_4-L_2$ norm equivalence

    Authors: Shahar Mendelson, Nikita Zhivotovskiy

    Abstract: Let $X$ be a centered random vector taking values in $\mathbb{R}^d$ and let $Σ= \mathbb{E}(X\otimes X)$ be its covariance matrix. We show that if $X$ satisfies an $L_4-L_2$ norm equivalence, there is a covariance estimator $\hatΣ$ that exhibits the optimal performance one would expect had $X$ been a gaussian vector. The procedure also improves the current state-of-the-art regarding high probabilit… ▽ More

    Submitted 26 March, 2019; v1 submitted 27 September, 2018; originally announced September 2018.

    Comments: 19 pages. Referee's suggestions addressed

  30. arXiv:1806.06233  [pdf, ps, other

    math.ST

    Near-optimal mean estimators with respect to general norms

    Authors: Gábor Lugosi, Shahar Mendelson

    Abstract: We study the problem of estimating the mean of a random vector in $\mathbb{R}^d$ based on an i.i.d.\ sample, when the accuracy of the estimator is measured by a general norm on $\mathbb{R}^d$. We construct an estimator (that depends on the norm) that achieves an essentially optimal accuracy/confidence tradeoff under the only assumption that the random vector has a well-defined covariance matrix. T… ▽ More

    Submitted 16 June, 2018; originally announced June 2018.

  31. arXiv:1805.09409  [pdf, other

    cs.IT math.PR

    Non-Gaussian Hyperplane Tessellations and Robust One-Bit Compressed Sensing

    Authors: Sjoerd Dirksen, Shahar Mendelson

    Abstract: We show that a tessellation generated by a small number of random affine hyperplanes can be used to approximate Euclidean distances between any two points in an arbitrary bounded set $T$, where the random hyperplanes are generated by subgaussian or heavy-tailed normal vectors and uniformly distributed shifts. We derive quantitative bounds on the number of hyperplanes needed for constructing such t… ▽ More

    Submitted 13 August, 2018; v1 submitted 23 May, 2018; originally announced May 2018.

    Comments: Title and presentation changed, typos corrected

  32. arXiv:1804.05402  [pdf, ps, other

    stat.ML cs.LG

    Approximating the covariance ellipsoid

    Authors: Shahar Mendelson

    Abstract: We explore ways in which the covariance ellipsoid ${\cal B}=\{v \in \mathbb{R}^d : \mathbb{E} <X,v>^2 \leq 1\}$ of a centred random vector $X$ in $\mathbb{R}^d$ can be approximated by a simple set. The data one is given for constructing the approximating set consists of $X_1,...,X_N$ that are independent and distributed as $X$. We present a general method that can be used to construct such appro… ▽ More

    Submitted 15 April, 2018; originally announced April 2018.

  33. arXiv:1801.02157  [pdf, ps, other

    math.PR

    Concentration of the spectral norm of Erdős-Rényi random graphs

    Authors: Gábor Lugosi, Shahar Mendelson, Nikita Zhivotovskiy

    Abstract: We present results on the concentration properties of the spectral norm $\|A_p\|$ of the adjacency matrix $A_p$ of an Erdős-Rényi random graph $G(n,p)$. First we consider the Erdős-Rényi random graph process and prove that $\|A_p\|$ is uniformly concentrated over the range $p\in [C\log n/n,1]$. The analysis is based on delocalization arguments, uniform laws of large numbers, together with the entr… ▽ More

    Submitted 20 November, 2018; v1 submitted 7 January, 2018; originally announced January 2018.

    Comments: 23 pages, Proposition 2 was added

  34. arXiv:1712.06788  [pdf, ps, other

    math.ST

    A remark on "Robust machine learning by median-of-means"

    Authors: Gabor Lugosi, Shahar Mendelson

    Abstract: We explore the recent results announced in "Robust machine learning by median-of-means: theory and practice" by G. Lecué and M. Lerasle. We show that these results are, in fact, almost obvious outcomes of the machinery developed in [4] for the study of tournament procedures.

    Submitted 19 December, 2017; originally announced December 2017.

  35. arXiv:1709.00843  [pdf, ps, other

    stat.ML

    Extending the scope of the small-ball method

    Authors: Shahar Mendelson

    Abstract: The small-ball method was introduced as a way of obtaining a high probability, isomorphic lower bound on the quadratic empirical process, under weak assumptions on the indexing class. The key assumption was that class members satisfy a uniform small-ball estimate: that $Pr(|f| \geq κ\|f\|_{L_2}) \geq δ$ for given constants $κ$ and $δ$. Here we extend the small-ball method and obtain a high proba… ▽ More

    Submitted 15 June, 2020; v1 submitted 4 September, 2017; originally announced September 2017.

  36. arXiv:1707.05342  [pdf, ps, other

    stat.ML

    An optimal unrestricted learning procedure

    Authors: Shahar Mendelson

    Abstract: We study learning problems involving arbitrary classes of functions $F$, distributions $X$ and targets $Y$. Because proper learning procedures, i.e., procedures that are only allowed to select functions in $F$, tend to perform poorly unless the problem satisfies some additional structural property (e.g., that $F$ is convex), we consider unrestricted learning procedures that are free to choose func… ▽ More

    Submitted 14 April, 2018; v1 submitted 17 July, 2017; originally announced July 2017.

    Comments: This version contains a different presentation of the same results, written from a more CS perspective (using the notion of sample complexity rather than the accuracy/confidence trade-off for a fixed sample size)

  37. arXiv:1702.06278  [pdf, ps, other

    stat.ML

    Column normalization of a random measurement matrix

    Authors: Shahar Mendelson

    Abstract: In this note we answer a question of G. Lecué, by showing that column normalization of a random matrix with iid entries need not lead to good sparse recovery properties, even if the generating random variable has a reasonable moment growth. Specifically, for every $2 \leq p \leq c_1\log d$ we construct a random vector $X \in R^d$ with iid, mean-zero, variance $1$ coordinates, that satisfies… ▽ More

    Submitted 21 February, 2017; originally announced February 2017.

  38. arXiv:1702.00482  [pdf, ps, other

    math.ST stat.ML

    Sub-Gaussian estimators of the mean of a random vector

    Authors: Gábor Lugosi, Shahar Mendelson

    Abstract: We study the problem of estimating the mean of a random vector $X$ given a sample of $N$ independent, identically distributed points. We introduce a new estimator that achieves a purely sub-Gaussian performance under the only condition that the second moment of $X$ exists. The estimator is based on a novel concept of a multivariate median.

    Submitted 1 February, 2017; originally announced February 2017.

    Comments: 12 pages

  39. arXiv:1701.04112  [pdf, ps, other

    math.ST stat.ML

    Regularization, sparse recovery, and median-of-means tournaments

    Authors: Gábor Lugosi, Shahar Mendelson

    Abstract: A regularized risk minimization procedure for regression function estimation is introduced that achieves near optimal accuracy and confidence under general conditions, including heavy-tailed predictor and response variables. The procedure is based on median-of-means tournaments, introduced by the authors in [8]. It is shown that the new procedure outperforms standard regularized empirical risk min… ▽ More

    Submitted 29 November, 2017; v1 submitted 15 January, 2017; originally announced January 2017.

    Comments: 28 pages

  40. arXiv:1610.09287  [pdf, ps, other

    math.FA

    Generalized Dual Sudakov Minoration via Dimension Reduction - A Program

    Authors: Shahar Mendelson, Emanuel Milman, Grigoris Paouris

    Abstract: We propose a program for establishing a conjectural extension to the class of (origin-symmetric) log-concave probability measures $μ$, of the classical dual Sudakov Minoration on the expectation of the supremum of a Gaussian process: \begin{equation} \label{eq:abstract} M(Z_p(μ), C \int ||x||_K dμ\cdot K) \leq \exp(C p) \;\;\, \forall p \geq 1 . \end{equation} Here $K$ is an origin-symmetric conve… ▽ More

    Submitted 6 May, 2018; v1 submitted 28 October, 2016; originally announced October 2016.

    Comments: 44 pages, to appear in Studia Math

  41. arXiv:1610.04983  [pdf, ps, other

    cs.IT math.PR

    Improved bounds for sparse recovery from subsampled random convolutions

    Authors: Shahar Mendelson, Holger Rauhut, Rachel Ward

    Abstract: We study the recovery of sparse vectors from subsampled random convolutions via $\ell_1$-minimization. We consider the setup in which both the subsampling locations as well as the generating vector are chosen at random. For a subgaussian generator with independent entries, we improve previously known estimates: if the sparsity $s$ is small enough, i.e., $s \lesssim \sqrt{n/\log(n)}$, we show that… ▽ More

    Submitted 23 March, 2018; v1 submitted 17 October, 2016; originally announced October 2016.

    Comments: 34 pages

    MSC Class: 94A20; 60B20

  42. arXiv:1608.07681  [pdf, ps, other

    math.ST

    Regularization and the small-ball method II: complexity dependent error rates

    Authors: Guillaume Lecué, Shahar Mendelson

    Abstract: For a convex class of functions $F$, a regularization functions $Ψ(\cdot)$ and given the random data $(X_i, Y_i)_{i=1}^N$, we study estimation properties of regularization procedures of the form \begin{equation*} \hat f \in {\rm argmin}_{f\in F}\Big(\frac{1}{N}\sum_{i=1}^N\big(Y_i-f(X_i)\big)^2+λΨ(f)\Big) \end{equation*} for some well chosen regularization parameter $λ$. We obtain bounds on… ▽ More

    Submitted 27 August, 2016; originally announced August 2016.

  43. arXiv:1608.00757  [pdf, ps, other

    math.ST

    Risk minimization by median-of-means tournaments

    Authors: Gabor Lugosi, Shahar Mendelson

    Abstract: We consider the classical statistical learning/regression problem, when the value of a real random variable Y is to be predicted based on the observation of another random variable X. Given a class of functions F and a sample of independent copies of (X, Y ), one needs to choose a function f from F such that f(X) approximates Y as well as possible, in the mean-squared sense. We introduce a new pro… ▽ More

    Submitted 2 August, 2016; originally announced August 2016.

    Comments: 40 pages

  44. arXiv:1601.06523  [pdf, ps, other

    math.ST

    On multiplier processes under weak moment assumptions

    Authors: Shahar Mendelson

    Abstract: We show that if $V \subset \R^n$ satisfies a certain symmetry condition (closely related to unconditionaity) and if $X$ is an isotropic random vector for which $\|\inr{X,t}\|_{L_p} \leq L \sqrt{p}$ for every $t \in S^{n-1}$ and $p \lesssim \log n$, then the corresponding empirical and multiplier processes indexed by $V$ behave as if $X$ were $L$-subgaussian.

    Submitted 25 January, 2016; originally announced January 2016.

  45. arXiv:1601.05584  [pdf, ps, other

    math.ST

    Regularization and the small-ball method I: sparse recovery

    Authors: Guillaume Lecué, Shahar Mendelson

    Abstract: We obtain bounds on estimation error rates for regularization procedures of the form \begin{equation*} \hat f \in {\rm argmin}_{f\in F}\left(\frac{1}{N}\sum_{i=1}^N\left(Y_i-f(X_i)\right)^2+λΨ(f)\right) \end{equation*} when $Ψ$ is a norm and $F$ is convex. Our approach gives a common framework that may be used in the analysis of learning problems and regularization problems alike. In particu… ▽ More

    Submitted 3 January, 2017; v1 submitted 21 January, 2016; originally announced January 2016.

  46. arXiv:1504.02191  [pdf, ps, other

    stat.ML math.ST

    `local' vs. `global' parameters -- breaking the gaussian complexity barrier

    Authors: Shahar Mendelson

    Abstract: We show that if $F$ is a convex class of functions that is $L$-subgaussian, the error rate of learning problems generated by independent noise is equivalent to a fixed point determined by `local' covering estimates of the class, rather than by the gaussian averages. To that end, we establish new sharp upper and lower estimates on the error rate for such problems.

    Submitted 9 April, 2015; originally announced April 2015.

  47. arXiv:1502.07097  [pdf, ps, other

    math.ST stat.ML

    On aggregation for heavy-tailed classes

    Authors: Shahar Mendelson

    Abstract: We introduce an alternative to the notion of `fast rate' in Learning Theory, which coincides with the optimal error rate when the given class happens to be convex and regular in some sense. While it is well known that such a rate cannot always be attained by a learning procedure (i.e., a procedure that selects a function in the given class), we introduce an aggregation procedure that attains that… ▽ More

    Submitted 25 February, 2015; originally announced February 2015.

    ACM Class: I.2.6

  48. arXiv:1410.8003  [pdf, ps, other

    math.PR

    Upper bounds on product and multiplier empirical processes

    Authors: Shahar Mendelson

    Abstract: We study two empirical process of special structure: firstly, the centred multiplier process indexed by a class $F$, $f \to \left|\sum_{i=1}^N (ξ_i f(X_i) - \E ξf)\right|$, where the i.i.d. multipliers $(ξ_i)_{i=1}^N$ need not be independent of $(X_i)_{i=1}^N$, and secondly, $(f,h) \to \left|\sum_{i=1}^N (f(X_i)h(X_i)-\E f h) \right|$, the centred product process indexed by the classes $F$ and… ▽ More

    Submitted 2 October, 2015; v1 submitted 29 October, 2014; originally announced October 2014.

  49. arXiv:1410.6914  [pdf, ps, other

    math.FA

    Dvoretzky type theorems for subgaussian coordinate projections

    Authors: Shahar Mendelson

    Abstract: Given a class of functions $F$ on a probability space $(Ω,μ)$, we study the structure of a typical coordinate projection of the class, defined by $\{(f(X_i))_{i=1}^N : f \in F\}$, where $X_1,...,X_N$ are independent, selected according to $μ$. This notion of projection generalizes the standard linear random projection used in Asymptotic Geometric Analysis. We show that when $F$ is a subgaussian… ▽ More

    Submitted 25 October, 2014; originally announced October 2014.

  50. arXiv:1410.3192  [pdf, ps, other

    stat.ML

    Learning without Concentration for General Loss Functions

    Authors: Shahar Mendelson

    Abstract: We study prediction and estimation problems using empirical risk minimization, relative to a general convex loss function. We obtain sharp error rates even when concentration is false or is very restricted, for example, in heavy-tailed scenarios. Our results show that the error rate depends on two parameters: one captures the intrinsic complexity of the class, and essentially leads to the error ra… ▽ More

    Submitted 13 October, 2014; originally announced October 2014.

    ACM Class: K.3.2