Skip to main content

Showing 1–30 of 30 results for author: Konečný, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2201.02664  [pdf, other

    cs.LG cs.DC cs.IT stat.ML

    Optimizing the Communication-Accuracy Trade-off in Federated Learning with Rate-Distortion Theory

    Authors: Nicole Mitchell, Johannes Ballé, Zachary Charles, Jakub Konečný

    Abstract: A significant bottleneck in federated learning (FL) is the network communication cost of sending model updates from client devices to the central server. We present a comprehensive empirical study of the statistics of model updates in FL, as well as the role and benefits of various compression techniques. Motivated by these observations, we propose a novel method to reduce the average communicatio… ▽ More

    Submitted 19 May, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

  2. arXiv:2107.06917  [pdf, other

    cs.LG

    A Field Guide to Federated Optimization

    Authors: Jianyu Wang, Zachary Charles, Zheng Xu, Gauri Joshi, H. Brendan McMahan, Blaise Aguera y Arcas, Maruan Al-Shedivat, Galen Andrew, Salman Avestimehr, Katharine Daly, Deepesh Data, Suhas Diggavi, Hubert Eichner, Advait Gadhikar, Zachary Garrett, Antonious M. Girgis, Filip Hanzely, Andrew Hard, Chaoyang He, Samuel Horvath, Zhouyuan Huo, Alex Ingerman, Martin Jaggi, Tara Javidi, Peter Kairouz , et al. (28 additional authors not shown)

    Abstract: Federated learning and analytics are a distributed approach for collaboratively learning models (or statistics) from decentralized data, motivated by and designed for privacy protection. The distributed learning process can be formulated as solving federated optimization problems, which emphasize communication efficiency, data heterogeneity, compatibility with privacy and system requirements, and… ▽ More

    Submitted 14 July, 2021; originally announced July 2021.

  3. arXiv:2103.05032  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    Convergence and Accuracy Trade-Offs in Federated Learning and Meta-Learning

    Authors: Zachary Charles, Jakub Konečný

    Abstract: We study a family of algorithms, which we refer to as local update methods, generalizing many federated and meta-learning algorithms. We prove that for quadratic models, local update methods are equivalent to first-order optimization on a surrogate loss we exactly characterize. Moreover, fundamental algorithmic choices (such as learning rates) explicitly govern a trade-off between the condition nu… ▽ More

    Submitted 8 March, 2021; originally announced March 2021.

    Journal ref: Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS) 2021. PMLR: Volume 130

  4. arXiv:2011.04928  [pdf, ps, other

    cs.DS

    LinCbO: fast algorithm for computation of the Duquenne-Guigues basis

    Authors: Radek Janostik, Jan Konecny, Petr Krajča

    Abstract: We propose and evaluate a novel algorithm for computation of the Duquenne-Guigues basis which combines Close-by-One and LinClosure algorithms. This combination enables us to reuse attribute counters used in LinClosure and speed up the computation. Our experimental evaluation shows that it is the most efficient algorithm for computation of the Duquenne-Guigues basis.

    Submitted 22 January, 2021; v1 submitted 10 November, 2020; originally announced November 2020.

    ACM Class: F.2.2

  5. arXiv:2010.06980  [pdf, other

    cs.DS

    LCM from FCA Point of View: A CbO-style Algorithm with Speed-up Features

    Authors: Radek Janostik, Jan Konecny, Petr Krajča

    Abstract: LCM is an algorithm for enumeration of frequent closed itemsets in transaction databases. It is well known that when we ignore the required frequency, the closed itemsets are exactly intents of formal concepts in Formal Concept Analysis (FCA). We describe LCM in terms of FCA and show that LCM is basically the Close-by-One algorithm with multiple speed-up features for processing sparse data. We ana… ▽ More

    Submitted 22 January, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

    Comments: full version of a conference paper to be published in IJAR

    ACM Class: F.2.2

  6. arXiv:2007.00878  [pdf, other

    cs.LG math.OC stat.ML

    On the Outsized Importance of Learning Rates in Local Update Methods

    Authors: Zachary Charles, Jakub Konečný

    Abstract: We study a family of algorithms, which we refer to as local update methods, that generalize many federated learning and meta-learning algorithms. We prove that for quadratic objectives, local update methods perform stochastic gradient descent on a surrogate loss function which we exactly characterize. We show that the choice of client learning rate controls the condition number of that surrogate l… ▽ More

    Submitted 2 July, 2020; originally announced July 2020.

  7. arXiv:2003.00295  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    Adaptive Federated Optimization

    Authors: Sashank Reddi, Zachary Charles, Manzil Zaheer, Zachary Garrett, Keith Rush, Jakub Konečný, Sanjiv Kumar, H. Brendan McMahan

    Abstract: Federated learning is a distributed machine learning paradigm in which a large number of clients coordinate with a central server to learn a model without sharing their own training data. Standard federated optimization methods such as Federated Averaging (FedAvg) are often difficult to tune and exhibit unfavorable convergence behavior. In non-federated settings, adaptive optimization methods have… ▽ More

    Submitted 8 September, 2021; v1 submitted 29 February, 2020; originally announced March 2020.

    Comments: Published as a conference paper at ICLR 2021

  8. arXiv:1912.04977  [pdf, other

    cs.LG cs.CR stat.ML

    Advances and Open Problems in Federated Learning

    Authors: Peter Kairouz, H. Brendan McMahan, Brendan Avent, Aurélien Bellet, Mehdi Bennis, Arjun Nitin Bhagoji, Kallista Bonawitz, Zachary Charles, Graham Cormode, Rachel Cummings, Rafael G. L. D'Oliveira, Hubert Eichner, Salim El Rouayheb, David Evans, Josh Gardner, Zachary Garrett, Adrià Gascón, Badih Ghazi, Phillip B. Gibbons, Marco Gruteser, Zaid Harchaoui, Chaoyang He, Lie He, Zhouyuan Huo, Ben Hutchinson , et al. (34 additional authors not shown)

    Abstract: Federated learning (FL) is a machine learning setting where many clients (e.g. mobile devices or whole organizations) collaboratively train a model under the orchestration of a central server (e.g. service provider), while kee** the training data decentralized. FL embodies the principles of focused data collection and minimization, and can mitigate many of the systemic privacy risks and costs re… ▽ More

    Submitted 8 March, 2021; v1 submitted 10 December, 2019; originally announced December 2019.

    Comments: Published in Foundations and Trends in Machine Learning Vol 4 Issue 1. See: https://www.nowpublishers.com/article/Details/MAL-083

  9. arXiv:1912.00131  [pdf, other

    cs.DC cs.CR cs.LG stat.ML

    Federated Learning with Autotuned Communication-Efficient Secure Aggregation

    Authors: Keith Bonawitz, Fariborz Salehi, Jakub Konečný, Brendan McMahan, Marco Gruteser

    Abstract: Federated Learning enables mobile devices to collaboratively learn a shared inference model while kee** all the training data on a user's device, decoupling the ability to do machine learning from the need to store the data in the cloud. Existing work on federated learning with limited communication demonstrates how random rotation can enable users' model updates to be quantized much more effici… ▽ More

    Submitted 29 November, 2019; originally announced December 2019.

    Comments: 5 pages, 3 figures. To appear at the IEEE Asilomar Conference on Signals, Systems, and Computers 2019

  10. arXiv:1909.12488  [pdf, other

    cs.LG stat.ML

    Improving Federated Learning Personalization via Model Agnostic Meta Learning

    Authors: Yihan Jiang, Jakub Konečný, Keith Rush, Sreeram Kannan

    Abstract: Federated Learning (FL) refers to learning a high quality global model based on decentralized data storage, without ever copying the raw data. A natural scenario arises with data created on mobile phones by the activity of their users. Given the typical data heterogeneity in such situations, it is natural to ask how can the global model be personalized for every such device, individually. In this… ▽ More

    Submitted 18 January, 2023; v1 submitted 27 September, 2019; originally announced September 2019.

  11. arXiv:1904.03257  [pdf, ps, other

    cs.LG cs.DB cs.DC cs.SE stat.ML

    MLSys: The New Frontier of Machine Learning Systems

    Authors: Alexander Ratner, Dan Alistarh, Gustavo Alonso, David G. Andersen, Peter Bailis, Sarah Bird, Nicholas Carlini, Bryan Catanzaro, Jennifer Chayes, Eric Chung, Bill Dally, Jeff Dean, Inderjit S. Dhillon, Alexandros Dimakis, Pradeep Dubey, Charles Elkan, Grigori Fursin, Gregory R. Ganger, Lise Getoor, Phillip B. Gibbons, Garth A. Gibson, Joseph E. Gonzalez, Justin Gottschlich, Song Han, Kim Hazelwood , et al. (44 additional authors not shown)

    Abstract: Machine learning (ML) techniques are enjoying rapidly increasing adoption. However, designing and implementing the systems that support ML models in real-world deployments remains a significant obstacle, in large part due to the radically different development and deployment profile of modern ML methods, and the range of practical concerns that come with broader adoption. We propose to foster a ne… ▽ More

    Submitted 1 December, 2019; v1 submitted 29 March, 2019; originally announced April 2019.

  12. arXiv:1902.01046  [pdf, other

    cs.LG cs.DC stat.ML

    Towards Federated Learning at Scale: System Design

    Authors: Keith Bonawitz, Hubert Eichner, Wolfgang Grieskamp, Dzmitry Huba, Alex Ingerman, Vladimir Ivanov, Chloe Kiddon, Jakub Konečný, Stefano Mazzocchi, H. Brendan McMahan, Timon Van Overveldt, David Petrou, Daniel Ramage, Jason Roselander

    Abstract: Federated Learning is a distributed machine learning approach which enables model training on a large corpus of decentralized data. We have built a scalable production system for Federated Learning in the domain of mobile devices, based on TensorFlow. In this paper, we describe the resulting high-level design, sketch some of the challenges and their solutions, and touch upon the open problems and… ▽ More

    Submitted 22 March, 2019; v1 submitted 4 February, 2019; originally announced February 2019.

  13. arXiv:1901.09367  [pdf, other

    math.OC cs.DC cs.LG cs.MA eess.SY

    A Privacy Preserving Randomized Gossip Algorithm via Controlled Noise Insertion

    Authors: Filip Hanzely, Jakub Konečný, Nicolas Loizou, Peter Richtárik, Dmitry Grishchenko

    Abstract: In this work we present a randomized gossip algorithm for solving the average consensus problem while at the same time protecting the information about the initial private values stored at the nodes. We give iteration complexity bounds for the method and perform extensive numerical experiments.

    Submitted 27 January, 2019; originally announced January 2019.

    Comments: NeurIPS 2018, Privacy Preserving Machine Learning Workshop (camera ready version). The full-length paper, which includes a number of additional algorithms and results (including proofs of statements and experiments), is available in arXiv:1706.07636

  14. arXiv:1812.07210  [pdf, other

    cs.LG cs.DC stat.ML

    Expanding the Reach of Federated Learning by Reducing Client Resource Requirements

    Authors: Sebastian Caldas, Jakub Konečny, H. Brendan McMahan, Ameet Talwalkar

    Abstract: Communication on heterogeneous edge networks is a fundamental bottleneck in Federated Learning (FL), restricting both model capacity and user participation. To address this issue, we introduce two novel strategies to reduce communication costs: (1) the use of lossy compression on the global model sent server-to-client; and (2) Federated Dropout, which allows users to efficiently train locally on s… ▽ More

    Submitted 8 January, 2019; v1 submitted 18 December, 2018; originally announced December 2018.

  15. arXiv:1812.01097  [pdf, other

    cs.LG stat.ML

    LEAF: A Benchmark for Federated Settings

    Authors: Sebastian Caldas, Sai Meher Karthik Duddu, Peter Wu, Tian Li, Jakub Konečný, H. Brendan McMahan, Virginia Smith, Ameet Talwalkar

    Abstract: Modern federated networks, such as those comprised of wearable devices, mobile phones, or autonomous vehicles, generate massive amounts of data each day. This wealth of data can help to learn models that can improve the user experience on each device. However, the scale and heterogeneity of federated data presents new challenges in research areas such as federated learning, meta-learning, and mult… ▽ More

    Submitted 9 December, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

  16. arXiv:1711.05509  [pdf, ps, other

    cs.AI cs.DM

    Note on Representing attribute reduction and concepts in concepts lattice using graphs

    Authors: Jan Konecny

    Abstract: Mao H. (2017, Representing attribute reduction and concepts in concept lattice using graphs. Soft Computing 21(24):7293--7311) claims to make contributions to the study of reduction of attributes in concept lattices by using graph theory. We show that her results are either trivial or already well-known and all three algorithms proposed in the paper are incorrect.

    Submitted 30 May, 2018; v1 submitted 15 November, 2017; originally announced November 2017.

    Comments: 10 pages, 5 figures

  17. arXiv:1707.01155  [pdf, other

    cs.LG

    Stochastic, Distributed and Federated Optimization for Machine Learning

    Authors: Jakub Konečný

    Abstract: We study optimization algorithms for the finite sum problems frequently arising in machine learning applications. First, we propose novel variants of stochastic gradient descent with a variance reduction property that enables linear convergence for strongly convex objectives. Second, we study distributed setting, in which the data describing the optimization problem does not fit into a single comp… ▽ More

    Submitted 4 July, 2017; originally announced July 2017.

    Comments: PhD thesis

  18. arXiv:1611.07555  [pdf, other

    cs.DC math.NA stat.ML

    Randomized Distributed Mean Estimation: Accuracy vs Communication

    Authors: Jakub Konečný, Peter Richtárik

    Abstract: We consider the problem of estimating the arithmetic average of a finite collection of real vectors stored in a distributed fashion across several compute nodes subject to a communication budget constraint. Our analysis does not rely on any statistical assumptions about the source of the vectors. This problem arises as a subproblem in many applications, including reduce-all operations within algor… ▽ More

    Submitted 22 November, 2016; originally announced November 2016.

    Comments: 19 pages, 1 figure

  19. arXiv:1610.05492  [pdf, other

    cs.LG

    Federated Learning: Strategies for Improving Communication Efficiency

    Authors: Jakub Konečný, H. Brendan McMahan, Felix X. Yu, Peter Richtárik, Ananda Theertha Suresh, Dave Bacon

    Abstract: Federated Learning is a machine learning setting where the goal is to train a high-quality centralized model while training data remains distributed over a large number of clients each with unreliable and relatively slow network connections. We consider learning algorithms for this setting where on each round, each client independently computes an update to the current model based on its local dat… ▽ More

    Submitted 30 October, 2017; v1 submitted 18 October, 2016; originally announced October 2016.

  20. arXiv:1610.02527  [pdf, other

    cs.LG

    Federated Optimization: Distributed Machine Learning for On-Device Intelligence

    Authors: Jakub Konečný, H. Brendan McMahan, Daniel Ramage, Peter Richtárik

    Abstract: We introduce a new and increasingly relevant setting for distributed optimization in machine learning, where the data defining the optimization are unevenly distributed over an extremely large number of nodes. The goal is to train a high-quality centralized model. We refer to this setting as Federated Optimization. In this setting, communication efficiency is of the utmost importance and minimizin… ▽ More

    Submitted 8 October, 2016; originally announced October 2016.

    Comments: 38 pages

  21. arXiv:1608.06879  [pdf, other

    math.OC cs.LG stat.ML

    AIDE: Fast and Communication Efficient Distributed Optimization

    Authors: Sashank J. Reddi, Jakub Konečný, Peter Richtárik, Barnabás Póczós, Alex Smola

    Abstract: In this paper, we present two new communication-efficient methods for distributed minimization of an average of functions. The first algorithm is an inexact variant of the DANE algorithm that allows any local algorithm to return an approximate solution to a local subproblem. We show that such a strategy does not affect the theoretical guarantees of DANE significantly. In fact, our approach can be… ▽ More

    Submitted 24 August, 2016; originally announced August 2016.

  22. arXiv:1512.04039  [pdf, other

    cs.LG math.OC

    Distributed Optimization with Arbitrary Local Solvers

    Authors: Chenxin Ma, Jakub Konečný, Martin Jaggi, Virginia Smith, Michael I. Jordan, Peter Richtárik, Martin Takáč

    Abstract: With the growth of data and necessity for distributed optimization methods, solvers that work well on a single machine must be re-designed to leverage distributed computation. Recent work in this area has been limited by focusing heavily on develo** highly specific methods for the distributed environment. These special-purpose methods are often unable to fully leverage the competitive performanc… ▽ More

    Submitted 3 August, 2016; v1 submitted 13 December, 2015; originally announced December 2015.

  23. arXiv:1511.03575  [pdf, ps, other

    cs.LG math.OC

    Federated Optimization:Distributed Optimization Beyond the Datacenter

    Authors: Jakub Konečný, Brendan McMahan, Daniel Ramage

    Abstract: We introduce a new and increasingly relevant setting for distributed optimization in machine learning, where the data defining the optimization are distributed (unevenly) over an extremely large number of \nodes, but the goal remains to train a high-quality centralized model. We refer to this setting as Federated Optimization. In this setting, communication efficiency is of utmost importance. A… ▽ More

    Submitted 11 November, 2015; originally announced November 2015.

    Comments: NIPS workshop version

  24. arXiv:1511.01942  [pdf, other

    cs.LG math.OC stat.CO stat.ML

    Stop Wasting My Gradients: Practical SVRG

    Authors: Reza Babanezhad, Mohamed Osama Ahmed, Alim Virani, Mark Schmidt, Jakub Konečný, Scott Sallinen

    Abstract: We present and analyze several strategies for improving the performance of stochastic variance-reduced gradient (SVRG) methods. We first show that the convergence rate of these methods can be preserved under a decreasing sequence of errors in the control variate, and use this to derive variants of SVRG that use growing-batch strategies to reduce the number of gradient calculations required in the… ▽ More

    Submitted 5 November, 2015; originally announced November 2015.

  25. arXiv:1506.03930  [pdf, ps, other

    cs.LO

    Complete relations on fuzzy complete lattices

    Authors: Jan Konecny, Michal Krupka

    Abstract: We generalize the notion of complete binary relation on complete lattice to residuated lattice valued ordered sets and show its properties. Then we focus on complete fuzzy tolerances on fuzzy complete lattices and prove they are in one-to-one correspondence with extensive isotone Galois connections. Finally, we prove that fuzzy complete lattice, factorized by a complete fuzzy tolerance, is again a… ▽ More

    Submitted 12 June, 2015; originally announced June 2015.

    Comments: Preprint submitted to Fuzzy Sets and Systems

  26. Mini-Batch Semi-Stochastic Gradient Descent in the Proximal Setting

    Authors: Jakub Konečný, Jie Liu, Peter Richtárik, Martin Takáč

    Abstract: We propose mS2GD: a method incorporating a mini-batching scheme for improving the theoretical complexity and practical performance of semi-stochastic gradient descent (S2GD). We consider the problem of minimizing a strongly convex function represented as the sum of an average of a large number of smooth convex functions, and a simple nonsmooth convex regularizer. Our method first performs a determ… ▽ More

    Submitted 16 November, 2015; v1 submitted 16 April, 2015; originally announced April 2015.

  27. arXiv:1410.4744  [pdf, other

    cs.LG stat.ML

    mS2GD: Mini-Batch Semi-Stochastic Gradient Descent in the Proximal Setting

    Authors: Jakub Konečný, Jie Liu, Peter Richtárik, Martin Takáč

    Abstract: We propose a mini-batching scheme for improving the theoretical complexity and practical performance of semi-stochastic gradient descent applied to the problem of minimizing a strongly convex composite function represented as the sum of an average of a large number of smooth convex functions, and simple nonsmooth convex function. Our method first performs a deterministic step (computation of the g… ▽ More

    Submitted 17 October, 2014; originally announced October 2014.

  28. arXiv:1410.0390  [pdf, ps, other

    math.OC cs.CC

    Simple Complexity Analysis of Simplified Direct Search

    Authors: Jakub Konečný, Peter Richtárik

    Abstract: We consider the problem of unconstrained minimization of a smooth function in the derivative-free setting using. In particular, we propose and study a simplified variant of the direct search method (of direction type), which we call simplified direct search (SDS). Unlike standard direct search methods, which depend on a large number of parameters that need to be tuned, SDS depends on a single scal… ▽ More

    Submitted 13 November, 2014; v1 submitted 1 October, 2014; originally announced October 2014.

    Comments: 21 pages, 5 algorithms, 1 table

  29. arXiv:1312.4190  [pdf, other

    cs.CV

    One-Shot-Learning Gesture Recognition using HOG-HOF Features

    Authors: Jakub Konečný, Michal Hagara

    Abstract: The purpose of this paper is to describe one-shot-learning gesture recognition systems developed on the \textit{ChaLearn Gesture Dataset}. We use RGB and depth images and combine appearance (Histograms of Oriented Gradients) and motion descriptors (Histogram of Optical Flow) for parallel temporal segmentation and recognition. The Quadratic-Chi distance family is used to measure differences between… ▽ More

    Submitted 15 February, 2014; v1 submitted 15 December, 2013; originally announced December 2013.

    Comments: 20 pages, 10 figures, 2 tables To appear in Journal of Machine Learning Research subject to minor revision

  30. arXiv:1312.1666  [pdf, other

    stat.ML cs.LG math.NA math.OC

    Semi-Stochastic Gradient Descent Methods

    Authors: Jakub Konečný, Peter Richtárik

    Abstract: In this paper we study the problem of minimizing the average of a large number ($n$) of smooth convex loss functions. We propose a new method, S2GD (Semi-Stochastic Gradient Descent), which runs for one or several epochs in each of which a single full gradient and a random number of stochastic gradients is computed, following a geometric law. The total work needed for the method to output an… ▽ More

    Submitted 16 June, 2015; v1 submitted 5 December, 2013; originally announced December 2013.

    Comments: 19 pages, 3 figures, 2 algorithms, 3 tables