Skip to main content

Showing 1–19 of 19 results for author: Valiant, P

.
  1. arXiv:2402.07248  [pdf, ps, other

    cs.LG stat.ML

    Depth Separations in Neural Networks: Separating the Dimension from the Accuracy

    Authors: Itay Safran, Daniel Reichman, Paul Valiant

    Abstract: We prove an exponential separation between depth 2 and depth 3 neural networks, when approximating an $\mathcal{O}(1)$-Lipschitz target function to constant accuracy, with respect to a distribution with support in $[0,1]^{d}$, assuming exponentially bounded weights. This addresses an open problem posed in \citet{safran2019depth}, and proves that the curse of dimensionality manifests in depth 2 app… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  2. arXiv:2311.12784  [pdf, ps, other

    math.ST cs.IT cs.LG stat.ML

    Optimality in Mean Estimation: Beyond Worst-Case, Beyond Sub-Gaussian, and Beyond $1+α$ Moments

    Authors: Trung Dang, Jasper C. H. Lee, Maoyuan Song, Paul Valiant

    Abstract: There is growing interest in improving our algorithmic understanding of fundamental statistical problems such as mean estimation, driven by the goal of understanding the limits of what we can extract from valuable data. The state of the art results for mean estimation in $\mathbb{R}$ are 1) the optimal sub-Gaussian mean estimator by [LV22], with the tight sub-Gaussian constant for all distribution… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 27 pages, to appear in NeurIPS 2023. Abstract shortened to fit arXiv limit

  3. arXiv:2310.09408  [pdf, other

    math.ST

    Improving Pearson's chi-squared test: hypothesis testing of distributions -- optimally

    Authors: Trung Dang, Walter McKelvie, Paul Valiant, Hongao Wang

    Abstract: Pearson's chi-squared test, from 1900, is the standard statistical tool for "hypothesis testing on distributions": namely, given samples from an unknown distribution $Q$ that may or may not equal a hypothesis distribution $P$, we want to return "yes" if $P=Q$ and "no" if $P$ is far from $Q$. While the chi-squared test is easy to use, it has been known for a while that it is not "data efficient", i… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  4. arXiv:2307.09212  [pdf, other

    cs.LG

    How Many Neurons Does it Take to Approximate the Maximum?

    Authors: Itay Safran, Daniel Reichman, Paul Valiant

    Abstract: We study the size of a neural network needed to approximate the maximum function over $d$ inputs, in the most basic setting of approximating with respect to the $L_2$ norm, for continuous distributions, for a network that uses ReLU activations. We provide new lower and upper bounds on the width required for approximation across various depths. Our results establish new depth separations between de… ▽ More

    Submitted 7 November, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

  5. arXiv:2206.02348  [pdf, other

    math.ST cs.DS cs.IT cs.LG stat.ML

    Finite-Sample Maximum Likelihood Estimation of Location

    Authors: Shivam Gupta, Jasper C. H. Lee, Eric Price, Paul Valiant

    Abstract: We consider 1-dimensional location estimation, where we estimate a parameter $λ$ from $n$ samples $λ+ η_i$, with each $η_i$ drawn i.i.d. from a known distribution $f$. For fixed $f$ the maximum-likelihood estimate (MLE) is well-known to be optimal in the limit as $n \to \infty$: it is asymptotically normal with variance matching the Cramér-Rao lower bound of $\frac{1}{n\mathcal{I}}$, where… ▽ More

    Submitted 18 July, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: Corrected an inaccuracy in the description of the experimental setup. Also updated funding acknowledgements

  6. arXiv:2011.08384  [pdf, ps, other

    math.ST cs.DS cs.IT cs.LG stat.ML

    Optimal Sub-Gaussian Mean Estimation in $\mathbb{R}$

    Authors: Jasper C. H. Lee, Paul Valiant

    Abstract: We revisit the problem of estimating the mean of a real-valued distribution, presenting a novel estimator with sub-Gaussian convergence: intuitively, "our estimator, on any distribution, is as accurate as the sample mean is for the Gaussian distribution of matching variance." Crucially, in contrast to prior works, our estimator does not require prior knowledge of the variance, and works across the… ▽ More

    Submitted 16 November, 2020; originally announced November 2020.

  7. arXiv:1911.12289  [pdf, ps, other

    physics.flu-dyn

    New relations for energy flow in terms of vorticity

    Authors: Paul Valiant

    Abstract: Considering the vorticity formulation of the Euler equations, we partition the kinetic energy into its contribution from each pair of interacting vortices. We call this contribution the "interaction energy". We show that each contribution satisfies a reciprocity relation on triples of vortices: $\boldsymbol{A}$'s action on $\boldsymbol{B}$ changes the interaction energy between $\boldsymbol{B}$ an… ▽ More

    Submitted 27 November, 2019; originally announced November 2019.

  8. arXiv:1911.03605  [pdf, other

    cs.DS cs.LG stat.ML

    Worst-Case Analysis for Randomly Collected Data

    Authors: Justin Y. Chen, Gregory Valiant, Paul Valiant

    Abstract: We introduce a framework for statistical estimation that leverages knowledge of how samples are collected but makes no distributional assumptions on the data values. Specifically, we consider a population of elements $[n]={1,\ldots,n}$ with corresponding data values $x_1,\ldots,x_n$. We observe the values for a "sample" set $A \subset [n]$ and wish to estimate some statistic of the values for a "t… ▽ More

    Submitted 26 October, 2020; v1 submitted 8 November, 2019; originally announced November 2019.

  9. arXiv:1904.09228  [pdf, other

    cs.LG cs.DS stat.ML

    Uncertainty about Uncertainty: Optimal Adaptive Algorithms for Estimating Mixtures of Unknown Coins

    Authors: Jasper C. H. Lee, Paul Valiant

    Abstract: Given a mixture between two populations of coins, "positive" coins that each have -- unknown and potentially different -- bias $\geq\frac{1}{2}+Δ$ and "negative" coins with bias $\leq\frac{1}{2}-Δ$, we consider the task of estimating the fraction $ρ$ of positive coins to within additive error $ε$. We achieve an upper and lower bound of $Θ(\fracρ{ε^2Δ^2}\log\frac{1}δ)$ samples for a $1-δ$ probabili… ▽ More

    Submitted 5 February, 2021; v1 submitted 19 April, 2019; originally announced April 2019.

    Comments: Full paper updated to reflect the new result in our SODA 2021 proceedings version: our new sample complexity lower bound includes dependence on the failure probability, and hence is simultaneously tight in all of the problem parameters up to a constant multiplicative factor

  10. arXiv:1904.09080  [pdf, other

    cs.LG stat.ML

    Implicit regularization for deep neural networks driven by an Ornstein-Uhlenbeck like process

    Authors: Guy Blanc, Neha Gupta, Gregory Valiant, Paul Valiant

    Abstract: We consider networks, trained via stochastic gradient descent to minimize $\ell_2$ loss, with the training labels perturbed by independent noise at each iteration. We characterize the behavior of the training dynamics near any parameter vector that achieves zero training error, in terms of an implicit regularization term corresponding to the sum over the data points, of the squared $\ell_2$ norm o… ▽ More

    Submitted 22 July, 2020; v1 submitted 19 April, 2019; originally announced April 2019.

  11. arXiv:1605.02646  [pdf, other

    cs.CR

    Information Theoretically Secure Databases

    Authors: Gregory Valiant, Paul Valiant

    Abstract: We introduce the notion of a database system that is information theoretically "Secure In Between Accesses"--a database system with the properties that 1) users can efficiently access their data, and 2) while a user is not accessing their data, the user's information is information theoretically secure to malicious agents, provided that certain requirements on the maintenance of the database are r… ▽ More

    Submitted 9 May, 2016; originally announced May 2016.

    Comments: 16 pages, 2 figures

  12. arXiv:1512.07898  [pdf, ps, other

    physics.flu-dyn

    Eroding dipoles and vorticity growth for Euler flows in $ \scriptstyle{\mathbb{R}}^3$ I. Axisymmetric flow without swirl

    Authors: Stephen Childress, Andrew D. Gilbert, Paul Valiant

    Abstract: A review of analyses based upon anti-parallel vortex structures suggests that structurally stable vortex structures with eroding circulation may offer a path to the study of rapid vorticity growth in solutions of Euler's equations in $ \scriptstyle{\mathbb{R}}^3$. We examine here the possible formation of such a structure in axisymmetric flow without swirl, leading to maximal growth of vorticity a… ▽ More

    Submitted 24 December, 2015; originally announced December 2015.

    Comments: 33 pages, 11 figures

    MSC Class: 76B47

  13. arXiv:1511.04466  [pdf, other

    cs.DS

    Optimizing Star-Convex Functions

    Authors: Jasper C. H. Lee, Paul Valiant

    Abstract: We introduce a polynomial time algorithm for optimizing the class of star-convex functions, under no restrictions except boundedness on a region about the origin, and Lebesgue measurability. The algorithm's performance is polynomial in the requested number of digits of accuracy, contrasting with the previous best known algorithm of Nesterov and Polyak that has exponential dependence, and that furt… ▽ More

    Submitted 11 May, 2016; v1 submitted 13 November, 2015; originally announced November 2015.

    Comments: 30 pages (including appendices)

  14. arXiv:1504.05321  [pdf, ps, other

    cs.LG

    Instance Optimal Learning

    Authors: Gregory Valiant, Paul Valiant

    Abstract: We consider the following basic learning task: given independent draws from an unknown distribution over a discrete support, output an approximation of the distribution that is as accurate as possible in $\ell_1$ distance (i.e. total variation or statistical distance). Perhaps surprisingly, it is often possible to "de-noise" the empirical distribution of the samples to return an approximation of t… ▽ More

    Submitted 11 November, 2015; v1 submitted 21 April, 2015; originally announced April 2015.

  15. arXiv:1308.3946  [pdf, ps, other

    cs.DS cs.IT cs.LG

    Optimal Algorithms for Testing Closeness of Discrete Distributions

    Authors: Siu-On Chan, Ilias Diakonikolas, Gregory Valiant, Paul Valiant

    Abstract: We study the question of closeness testing for two discrete distributions. More precisely, given samples from two distributions $p$ and $q$ over an $n$-element set, we wish to distinguish whether $p=q$ versus $p$ is at least $\eps$-far from $q$, in either $\ell_1$ or $\ell_2$ distance. Batu et al. gave the first sub-linear time algorithms for these problems, which matched the lower bounds of Valia… ▽ More

    Submitted 19 August, 2013; originally announced August 2013.

  16. arXiv:1112.5659  [pdf, ps, other

    cs.DS math.PR math.ST

    Testing $k$-Modal Distributions: Optimal Algorithms via Reductions

    Authors: Constantinos Daskalakis, Ilias Diakonikolas, Rocco A. Servedio, Gregory Valiant, Paul Valiant

    Abstract: We give highly efficient algorithms, and almost matching lower bounds, for a range of basic statistical problems that involve testing and estimating the L_1 distance between two k-modal distributions $p$ and $q$ over the discrete domain $\{1,\dots,n\}$. More precisely, we consider the following four problems: given sample access to an unknown k-modal distribution $p$, Testing identity to a known… ▽ More

    Submitted 23 December, 2011; originally announced December 2011.

  17. arXiv:0909.2030  [pdf, ps, other

    cs.DB cs.DS

    Size Bounds for Conjunctive Queries with General Functional Dependencies

    Authors: Gregory Valiant, Paul Valiant

    Abstract: This paper extends the work of Gottlob, Lee, and Valiant (PODS 2009)[GLV], and considers worst-case bounds for the size of the result Q(D) of a conjunctive query Q to a database D given an arbitrary set of functional dependencies. The bounds in [GLV] are based on a "coloring" of the query variables. In order to extend the previous bounds to the setting of arbitrary functional dependencies, we le… ▽ More

    Submitted 12 December, 2009; v1 submitted 10 September, 2009; originally announced September 2009.

    Comments: 22 pages, 2 figures

    ACM Class: H.2.4; F.2.0

  18. arXiv:0802.1604  [pdf, ps, other

    cs.GT cs.MA

    On the Complexity of Nash Equilibria of Action-Graph Games

    Authors: Constantinos Daskalakis, Grant Schoenebeck, Gregory Valiant, Paul Valiant

    Abstract: We consider the problem of computing Nash Equilibria of action-graph games (AGGs). AGGs, introduced by Bhat and Leyton-Brown, is a succinct representation of games that encapsulates both "local" dependencies as in graphical games, and partial indifference to other agents' identities as in anonymous games, which occur in many natural settings. This is achieved by specifying a graph on the set of… ▽ More

    Submitted 12 February, 2008; originally announced February 2008.

  19. arXiv:quant-ph/0211179  [pdf, ps, other

    quant-ph cs.CC

    Comparing EQP and MOD_{p^k}P using Polynomial Degree Lower Bounds

    Authors: M. de Graaf, P. Valiant

    Abstract: We show that an oracle A that contains either 1/4 or 3/4 of all strings of length n can be used to separate EQP from the counting classes MOD_{p^k}P. Our proof makes use of the degree of a representing polynomial over the finite field of size p^k. We show a linear lower bound on the degree of this polynomial. We also show an upper bound of O(n^{1/log_p m}) on the degree over the ring of intege… ▽ More

    Submitted 27 November, 2002; originally announced November 2002.

    Comments: 10 pages, no figures