Showing 1–13 of 13 results for author: Sefidgaran, M

  1. arXiv:2406.08193  [pdf, other]

    stat.ML cs.IT cs.LG

    Minimal Communication-Cost Statistical Learning

    Authors: Milad Sefidgaran, Abdellatif Zaidi, Piotr Krasnowski

    Abstract: A client device which has access to $n$ training data samples needs to obtain a statistical hypothesis or model $W$ and then to send it to a remote server. The client and the server devices share some common randomness sequence as well as a prior on the hypothesis space. In this problem a suitable hypothesis or model $W$ should meet two distinct design criteria simultaneously: (i) small (populatio…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted at ISIT 2024

  2. arXiv:2402.03254  [pdf, other]

    stat.ML cs.IT cs.LG

    Minimum Description Length and Generalization Guarantees for Representation Learning

    Authors: Milad Sefidgaran, Abdellatif Zaidi, Piotr Krasnowski

    Abstract: A major challenge in designing efficient statistical supervised learning algorithms is finding representations that perform well not only on available training samples but also on unseen data. While the study of representation learning has spurred much interest, most existing such approaches are heuristic; and very little is known about theoretical generalization guarantees. In this paper, we es…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted and presented at NeurIPS 2023

  3. arXiv:2306.05862  [pdf, other]

    stat.ML cs.IT cs.LG

    Lessons from Generalization Error Analysis of Federated Learning: You May Communicate Less Often!

    Authors: Milad Sefidgaran, Romain Chor, Abdellatif Zaidi, Yijun Wan

    Abstract: We investigate the generalization error of statistical learning models in a Federated Learning (FL) setting. Specifically, we study the evolution of the generalization error with the number of communication rounds $R$ between $K$ clients and a parameter server (PS), i.e., the effect on the generalization error of how often the clients' local models are aggregated at PS. In our setup, the more the…

    Submitted 10 June, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted at ICML 2024

  4. arXiv:2304.12216  [pdf, other]

    stat.ML cs.IT cs.LG

    More Communication Does Not Result in Smaller Generalization Error in Federated Learning

    Authors: Romain Chor, Milad Sefidgaran, Abdellatif Zaidi

    Abstract: We study the generalization error of statistical learning models in a Federated Learning (FL) setting. Specifically, there are $K$ devices or clients, each holding an independent own dataset of size $n$. Individual models, learned locally via Stochastic Gradient Descent, are aggregated (averaged) by a central server into a global model and then sent back to the devices. We consider multiple (say…

    Submitted 11 May, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: Extended version of paper accepted at ISIT 2023

  5. arXiv:2303.05369  [pdf, other]

    stat.ML cs.IT cs.LG

    Data-dependent Generalization Bounds via Variable-Size Compressibility

    Authors: Milad Sefidgaran, Abdellatif Zaidi

    Abstract: In this paper, we establish novel data-dependent upper bounds on the generalization error through the lens of a "variable-size compressibility" framework that we introduce newly here. In this framework, the generalization error of an algorithm is linked to a variable-size 'compression rate' of its input data. This is shown to yield bounds that depend on the empirical measure of the given input dat…

    Submitted 11 June, 2024; v1 submitted 9 March, 2023; originally announced March 2023.

    Comments: Accepted for publication in IEEE Transactions on Information Theory

  6. arXiv:2206.02604  [pdf, other]

    stat.ML cs.IT cs.LG

    Rate-Distortion Theoretic Bounds on Generalization Error for Distributed Learning

    Authors: Milad Sefidgaran, Romain Chor, Abdellatif Zaidi

    Abstract: In this paper, we use tools from rate-distortion theory to establish new upper bounds on the generalization error of statistical distributed learning algorithms. Specifically, there are $K$ clients whose individually chosen models are aggregated by a central server. The bounds depend on the compressibility of each client's algorithm while keeping other clients' algorithms un-compressed, and levera…

    Submitted 22 November, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: Accepted at NeurIPS 2022

  7. arXiv:2203.02474  [pdf, other]

    stat.ML cs.IT cs.LG

    Rate-Distortion Theoretic Generalization Bounds for Stochastic Learning Algorithms

    Authors: Milad Sefidgaran, Amin Gohari, Gaël Richard, Umut Şimşekli

    Abstract: Understanding generalization in modern machine learning settings has been one of the major challenges in statistical learning theory. In this context, recent years have witnessed the development of various generalization bounds suggesting different complexity notions such as the mutual information between the data sample and the algorithm output, compressibility of the hypothesis space, and the fr…

    Submitted 29 June, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

    Comments: Accepted for presentation at the Conference on Learning Theory (COLT) 2022

  8. arXiv:2106.03795  [pdf, other]

    stat.ML cs.LG

    Heavy Tails in SGD and Compressibility of Overparametrized Neural Networks

    Authors: Melih Barsbey, Milad Sefidgaran, Murat A. Erdogdu, Gaël Richard, Umut Şimşekli

    Abstract: Neural network compression techniques have become increasingly popular as they can drastically reduce the storage and computation requirements for very large networks. Recent empirical studies have illustrated that even simple pruning strategies can be surprisingly effective, and several theoretical studies have shown that compressible networks (in specific senses) should achieve a low generalizat…

    Submitted 7 June, 2021; originally announced June 2021.

  9. arXiv:2102.00697  [pdf, other]

    cs.IT

    Zero-Error Sum Modulo Two with a Common Observation

    Authors: Milad Sefidgaran, Aslan Tchamkerten

    Abstract: This paper investigates the classical modulo two sum problem in source coding, but with a common observation: a transmitter observes $(X,Z)$, the other transmitter observes $(Y,Z)$, and the receiver wants to compute $X \oplus Y$ without error. Through a coupling argument, this paper establishes a new lower bound on the sum-rate when $X-Z-Y$ forms a Markov chain.

    Submitted 22 March, 2021; v1 submitted 1 February, 2021; originally announced February 2021.

    Comments: Accepted for presentation at IEEE ITW 2020

  10. arXiv:2011.11341  [pdf, other]

    cs.IT

    Lower Bound on the Capacity of the Continuous-Space SSFM Model of Optical Fiber

    Authors: Milad Sefidgaran, Mansoor Yousefi

    Abstract: The capacity of a discrete-time model of optical fiber described by the split-step Fourier method (SSFM) as a function of the signal-to-noise ratio $\text{SNR}$ and the number of segments in distance $K$ is considered. It is shown that if $K\geq \text{SNR}^{2/3}$ and $\text{SNR} \rightarrow \infty$, the capacity of the resulting continuous-space lossless model is lower bounded by…

    Submitted 26 September, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: Submitted to IEEE Transactions on Information Theory

  11. arXiv:1312.3631  [pdf, ps, other]

    cs.IT

    Distributed Function Computation Over a Rooted Directed Tree

    Authors: Milad Sefidgaran, Aslan Tchamkerten

    Abstract: This paper establishes the rate region for a class of source coding function computation setups where sources of information are available at the nodes of a tree and where a function of these sources must be computed at the root. The rate region holds for any function as long as the sources' joint distribution satisfies a certain Markov criterion. This criterion is met, in particular, when the sou…

    Submitted 7 April, 2015; v1 submitted 12 December, 2013; originally announced December 2013.

    Comments: 36 pages, Submitted to IEEE Transactions on Information Theory

  12. arXiv:1303.0817  [pdf, other]

    cs.IT

    On Cooperation in Multi-Terminal Computation and Rate Distortion

    Authors: Milad Sefidgaran, Aslan Tchamkerten

    Abstract: A receiver wants to compute a function of two correlated sources separately observed by two transmitters. One of the transmitters may send a possibly private message to the other transmitter in a cooperation phase before both transmitters communicate to the receiver. For this network configuration this paper investigates both a function computation setup, wherein the receiver wants to compute a gi…

    Submitted 7 April, 2015; v1 submitted 4 March, 2013; originally announced March 2013.

    Comments: 31 pages, Submitted to IEEE Transactions on Information Theory

  13. arXiv:1107.5806  [pdf, ps, other]

    cs.IT

    On Computing a Function of Correlated Sources

    Authors: Milad Sefidgaran, Aslan Tchamkerten

    Abstract: A receiver wants to compute a function f of two correlated sources X and Y and side information Z. What is the minimum number of bits that needs to be communicated by each transmitter? In this paper, we derive inner and outer bounds to the rate region of this problem which coincide in the cases where f is partially invertible and where the sources are independent given the side information. Th…

    Submitted 11 October, 2012; v1 submitted 28 July, 2011; originally announced July 2011.

    Comments: 11 pages, Submitted to IEEE Transactions on Information Theory