Skip to main content

Showing 1–12 of 12 results for author: Barnes, L P

.
  1. arXiv:2401.16701  [pdf, ps, other

    math.ST cs.IT

    Multivariate Priors and the Linearity of Optimal Bayesian Estimators under Gaussian Noise

    Authors: Leighton P. Barnes, Alex Dytso, **gbo Liu, H. Vincent Poor

    Abstract: Consider the task of estimating a random vector $X$ from noisy observations $Y = X + Z$, where $Z$ is a standard normal vector, under the $L^p$ fidelity criterion. This work establishes that, for $1 \leq p \leq 2$, the optimal Bayesian estimator is linear and positive definite if and only if the prior distribution on $X$ is a (non-degenerate) multivariate Gaussian. Furthermore, for $p > 2$, it is… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  2. arXiv:2309.09129  [pdf, ps, other

    math.ST cs.IT stat.ML

    $L^1$ Estimation: On the Optimality of Linear Estimators

    Authors: Leighton P. Barnes, Alex Dytso, **gbo Liu, H. Vincent Poor

    Abstract: Consider the problem of estimating a random variable $X$ from noisy observations $Y = X+ Z$, where $Z$ is standard normal, under the $L^1$ fidelity criterion. It is well known that the optimal Bayesian estimator in this setting is the conditional median. This work shows that the only prior distribution on $X$ that induces linearity in the conditional median is Gaussian. Along the way, several ot… ▽ More

    Submitted 9 January, 2024; v1 submitted 16 September, 2023; originally announced September 2023.

  3. Improved Information Theoretic Generalization Bounds for Distributed and Federated Learning

    Authors: L. P. Barnes, Alex Dytso, H. V. Poor

    Abstract: We consider information-theoretic bounds on expected generalization error for statistical learning problems in a networked setting. In this setting, there are $K$ nodes, each with its own independent dataset, and the models from each node have to be aggregated into a final centralized model. We consider both simple averaging of the models as well as more complicated multi-round algorithms. We give… ▽ More

    Submitted 15 January, 2024; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: This version of the paper adds an assumption that was missing from Theorem 4 for loss functions of type (i). Thanks to Peyman Gholami for spotting this bug

  4. arXiv:2103.04014  [pdf, ps, other

    cs.IT cs.DC math.ST stat.ML

    Over-the-Air Statistical Estimation

    Authors: Chuan-Zheng Lee, Leighton Pate Barnes, Ayfer Ozgur

    Abstract: We study schemes and lower bounds for distributed minimax statistical estimation over a Gaussian multiple-access channel (MAC) under squared error loss, in a framework combining statistical estimation and wireless communication. First, we develop "analog" joint estimation-communication schemes that exploit the superposition property of the Gaussian MAC and we characterize their risk in terms of th… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

    Comments: 12 pages, 5 figures

  5. arXiv:2102.05802  [pdf, ps, other

    cs.IT math.ST

    Fisher Information and Mutual Information Constraints

    Authors: Leighton Pate Barnes, Ayfer Ozgur

    Abstract: We consider the processing of statistical samples $X\sim P_θ$ by a channel $p(y|x)$, and characterize how the statistical information from the samples for estimating the parameter $θ\in\mathbb{R}^d$ can scale with the mutual information or capacity of the channel. We show that if the statistical model has a sub-Gaussian score function, then the trace of the Fisher information matrix for estimating… ▽ More

    Submitted 8 July, 2021; v1 submitted 10 February, 2021; originally announced February 2021.

  6. arXiv:2005.10783  [pdf, ps, other

    cs.IT math.ST stat.ML

    Fisher information under local differential privacy

    Authors: Leighton Pate Barnes, Wei-Ning Chen, Ayfer Ozgur

    Abstract: We develop data processing inequalities that describe how Fisher information from statistical samples can scale with the privacy parameter $\varepsilon$ under local differential privacy constraints. These bounds are valid under general conditions on the distribution of the score of the statistical model, and they elucidate under which conditions the dependence on $\varepsilon$ is linear, quadratic… ▽ More

    Submitted 21 May, 2020; originally announced May 2020.

  7. arXiv:2005.10761  [pdf, other

    cs.LG cs.IT math.ST stat.ML

    rTop-k: A Statistical Estimation Approach to Distributed SGD

    Authors: Leighton Pate Barnes, Huseyin A. Inan, Berivan Isik, Ayfer Ozgur

    Abstract: The large communication cost for exchanging gradients between different nodes significantly limits the scalability of distributed training for large-scale learning models. Motivated by this observation, there has been significant recent interest in techniques that reduce the communication cost of distributed Stochastic Gradient Descent (SGD), with gradient sparsification techniques such as top-k a… ▽ More

    Submitted 2 December, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

  8. arXiv:2004.01277  [pdf, other

    cs.IT

    The Courtade-Kumar Most Informative Boolean Function Conjecture and a Symmetrized Li-Médard Conjecture are Equivalent

    Authors: Leighton Pate Barnes, Ayfer Özgür

    Abstract: We consider the Courtade-Kumar most informative Boolean function conjecture for balanced functions, as well as a conjecture by Li and Médard that dictatorship functions also maximize the $L^α$ norm of $T_pf$ for $1\leqα\leq2$ where $T_p$ is the noise operator and $f$ is a balanced Boolean function. By using a result due to Laguerre from the 1880's, we are able to bound how many times an $L^α$-norm… ▽ More

    Submitted 2 April, 2020; originally announced April 2020.

  9. arXiv:1910.01625  [pdf, ps, other

    cs.IT math.ST

    Minimax Bounds for Distributed Logistic Regression

    Authors: Leighton Pate Barnes, Ayfer Ozgur

    Abstract: We consider a distributed logistic regression problem where labeled data pairs $(X_i,Y_i)\in \mathbb{R}^d\times\{-1,1\}$ for $i=1,\ldots,n$ are distributed across multiple machines in a network and must be communicated to a centralized estimator using at most $k$ bits per labeled pair. We assume that the data $X_i$ come independently from some distribution $P_X$, and that the distribution of… ▽ More

    Submitted 3 October, 2019; originally announced October 2019.

  10. arXiv:1902.02890  [pdf, other

    cs.IT cs.LG math.ST

    Lower Bounds for Learning Distributions under Communication Constraints via Fisher Information

    Authors: Leighton Pate Barnes, Yanjun Han, Ayfer Ozgur

    Abstract: We consider the problem of learning high-dimensional, nonparametric and structured (e.g. Gaussian) distributions in distributed networks, where each node in the network observes an independent sample from the underlying distribution and can use $k$ bits to communicate its sample to a central processor. We consider three different models for communication. Under the independent model, each node com… ▽ More

    Submitted 31 May, 2019; v1 submitted 7 February, 2019; originally announced February 2019.

  11. arXiv:1811.10533  [pdf, ps, other

    math.PR cs.IT math.CA math.MG

    An Isoperimetric Result on High-Dimensional Spheres

    Authors: Leighton Pate Barnes, Ayfer Ozgur, Xiugang Wu

    Abstract: We consider an extremal problem for subsets of high-dimensional spheres that can be thought of as an extension of the classical isoperimetric problem on the sphere. Let $A$ be a subset of the $(m-1)$-dimensional sphere $\mathbb{S}^{m-1}$, and let $\mathbf{y}\in \mathbb{S}^{m-1}$ be a randomly chosen point on the sphere. What is the measure of the intersection of the $t$-neighborhood of the point… ▽ More

    Submitted 20 November, 2018; originally announced November 2018.

    Comments: arXiv admin note: text overlap with arXiv:1701.02043

  12. arXiv:1701.02043  [pdf, other

    cs.IT

    "The Capacity of the Relay Channel": Solution to Cover's Problem in the Gaussian Case

    Authors: Xiugang Wu, Leighton Pate Barnes, Ayfer Ozgur

    Abstract: Consider a memoryless relay channel, where the relay is connected to the destination with an isolated bit pipe of capacity $C_0$. Let $C(C_0)$ denote the capacity of this channel as a function of $C_0$. What is the critical value of $C_0$ such that $C(C_0)$ first equals $C(\infty)$? This is a long-standing open problem posed by Cover and named "The Capacity of the Relay Channel," in… ▽ More

    Submitted 7 October, 2018; v1 submitted 8 January, 2017; originally announced January 2017.

    Comments: Accepted to IEEE Trans. on Information Theory