-
AdaDelay: Delay Adaptive Distributed Stochastic Convex Optimization
Abstract: We study distributed stochastic convex optimization under the delayed gradient model where the server nodes perform parameter updates, while the worker nodes compute stochastic gradients. We discuss, analyze, and experiment with a setup motivated by the behavior of real-world distributed computation networks, where the machines are differently slow at different time. Therefore, we allow the parame… ▽ More
Submitted 20 August, 2015; originally announced August 2015.
Comments: 19 pages
-
arXiv:math/0701907 [pdf, ps, other]
Kernel methods in machine learning
Abstract: We review machine learning methods employing positive definite kernels. These methods formulate learning and estimation problems in a reproducing kernel Hilbert space (RKHS) of functions defined on the data domain, expanded in terms of a kernel. Working in linear spaces of function has the benefit of facilitating the construction and analysis of learning algorithms while at the same time allowin… ▽ More
Submitted 1 July, 2008; v1 submitted 30 January, 2007; originally announced January 2007.
Comments: Published in at http://dx.doi.org/10.1214/009053607000000677 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-AOS-AOS0290 MSC Class: 30C40 (Primary) 68T05 (Secondary)
Journal ref: Annals of Statistics 2008, Vol. 36, No. 3, 1171-1220