Stochastic Optimization from Distributed, Streaming Data in Rate-limited Networks

Nokleby, Matthew; Bajwa, Waheed U.

doi:10.1109/TSIPN.2018.2866320

arXiv:1704.07888 (stat)

[Submitted on 25 Apr 2017 (v1), last revised 6 Aug 2018 (this version, v4)]

Abstract: Motivated by machine learning applications in networks of sensors, internet-of-things (IoT) devices, and autonomous agents, we propose techniques for distributed stochastic convex learning from high-rate data streams. The setup involves a network of nodes, each of which has a stream of data arriving at a constant rate, that solve a stochastic convex optimization problem by collaborating with each other over rate-limited communication links. To this end, we present and analyze two algorithms, termed distributed stochastic approximation mirror descent (D-SAMD) and accelerated distributed stochastic approximation mirror descent (AD-SAMD), that are based on two stochastic variants of mirror descent and in which nodes collaborate via approximate averaging of the local, noisy subgradients using distributed consensus. Our main contributions are (i) bounds on the convergence rates of D-SAMD and AD-SAMD in terms of the number of nodes, network topology, and ratio of the data streaming and communication rates, and (ii) sufficient conditions for order-optimum convergence of these algorithms. In particular, we show that for sufficiently well-connected networks, distributed learning schemes can obtain order-optimum convergence even if the communication rate is small. Further, we find that the use of accelerated methods significantly enlarges the regime in which order-optimum convergence is achieved; this is in contrast to the centralized setting, where accelerated methods usually offer only a modest improvement. Finally, we demonstrate the effectiveness of the proposed algorithms using numerical experiments.
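
The idea of averaging local, noisy subgradients by distributed consensus before taking a mirror descent step can be illustrated with a small sketch. This is not the authors' D-SAMD or AD-SAMD as specified in the paper; it is a simplified illustration under assumptions that are not in the abstract: a ring topology, a toy objective E[0.5 * ||x - z||^2] with z = x_star + Gaussian noise, a Euclidean distance-generating function (so the mirror descent step reduces to a projected gradient step), a 1/(t+1) step size, and a fixed number of consensus `rounds` per sample standing in for what the communication rate allows. All function names are hypothetical.

```python
import random


def ring_weights(m):
    """Doubly stochastic averaging matrix for a ring of m nodes (assumed topology)."""
    W = [[0.0] * m for _ in range(m)]
    for i in range(m):
        W[i][i] += 0.5
        W[i][(i - 1) % m] += 0.25
        W[i][(i + 1) % m] += 0.25
    return W


def consensus_average(W, values, rounds):
    """Run `rounds` linear consensus steps on a list of m vectors.

    Each node's vector moves toward the network-wide mean; the result is
    only an approximation of that mean when `rounds` is small.
    """
    m, dim = len(values), len(values[0])
    out = [row[:] for row in values]
    for _ in range(rounds):
        out = [[sum(W[i][j] * out[j][k] for j in range(m)) for k in range(dim)]
               for i in range(m)]
    return out


def project_ball(x, radius):
    """Euclidean projection of x onto the ball of the given radius."""
    norm = sum(v * v for v in x) ** 0.5
    scale = 1.0 if norm <= radius else radius / norm
    return [scale * v for v in x]


def network_mean(X):
    """Coordinate-wise mean of the node iterates."""
    m = len(X)
    return [sum(row[k] for row in X) / m for k in range(len(X[0]))]


def run_sketch(x_star, m=8, iters=2000, rounds=3, noise=1.0, radius=10.0, seed=0):
    """Toy run: every node draws one sample per iteration from its own stream."""
    rng = random.Random(seed)
    W = ring_weights(m)
    X = [[0.0] * len(x_star) for _ in range(m)]  # one iterate per node
    for t in range(iters):
        # Local noisy gradient of 0.5 * ||x - z||^2, with z = x_star + noise.
        G = [[x - (s + noise * rng.gauss(0.0, 1.0)) for x, s in zip(X[i], x_star)]
             for i in range(m)]
        # Approximate network-wide gradient average via a few consensus rounds.
        G_avg = consensus_average(W, G, rounds)
        step = 1.0 / (t + 1)  # assumed step size, not taken from the paper
        # Euclidean mirror descent step, i.e. a projected gradient step.
        X = [project_ball([x - step * g for x, g in zip(X[i], G_avg[i])], radius)
             for i in range(m)]
    return X
```

Calling `run_sketch([1.0, -2.0, 0.5])` returns one iterate per node, and `network_mean` of that result should lie close to `x_star`. Lowering `rounds` makes the averaging cruder, which is one rough way to explore the trade-off between data rate and communication rate that the abstract describes.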

Comments:	16 pages, 6 figures; Accepted for publication in IEEE Transactions on Signal and Information Processing over Networks
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1704.07888 [stat.ML]
	(or arXiv:1704.07888v4 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1704.07888
Journal reference:	Published in IEEE Trans. Signal Inform. Proc. over Netw., vol. 5, no. 1, pp. 152-167, Mar. 2019
Related DOI:	https://doi.org/10.1109/TSIPN.2018.2866320

Submission history

From: Waheed Bajwa
[v1] Tue, 25 Apr 2017 19:52:52 UTC (69 KB)
[v2] Tue, 2 May 2017 18:41:03 UTC (91 KB)
[v3] Tue, 5 Jun 2018 10:31:27 UTC (718 KB)
[v4] Mon, 6 Aug 2018 08:42:35 UTC (790 KB)
