Skip to main content

Showing 51–100 of 113 results for author: Mokhtari, A

.
  1. Scattering under Linear Non Self-Adjoint Operators: Case of in-Plane Elastic Waves

    Authors: Amir Ashkan Mokhtari, Yan Lu, Qiyuan Zhou, Alireza V. Amirkhizi, Ankit Srivastava

    Abstract: In this paper, we consider the problem of the scattering of in-plane waves at an interface between a homogeneous medium and a metamaterial. The relevant eigenmodes in the two regions are calculated by solving a recently described non self-adjoint eigenvalue problem particularly suited to scattering studies. The method efficiently produces all propagating and evanescent modes consistent with the ap… ▽ More

    Submitted 6 March, 2020; originally announced March 2020.

  2. arXiv:2002.09964  [pdf, other

    cs.DC cs.LG cs.MA eess.SP eess.SY

    Quantized Decentralized Stochastic Learning over Directed Graphs

    Authors: Hossein Taheri, Aryan Mokhtari, Hamed Hassani, Ramtin Pedarsani

    Abstract: We consider a decentralized stochastic learning problem where data points are distributed among computing nodes communicating over a directed graph. As the model size gets large, decentralized learning faces a major bottleneck that is the heavy communication load due to each node transmitting large messages (model updates) to its neighbors. To tackle this bottleneck, we propose the quantized decen… ▽ More

    Submitted 28 December, 2020; v1 submitted 23 February, 2020; originally announced February 2020.

  3. arXiv:2002.07948  [pdf, other

    cs.LG math.OC stat.ML

    Personalized Federated Learning: A Meta-Learning Approach

    Authors: Alireza Fallah, Aryan Mokhtari, Asuman Ozdaglar

    Abstract: In Federated Learning, we aim to train models across multiple computing units (users), while users can only communicate with a common central server, without exchanging their data samples. This mechanism exploits the computational power of all users and allows users to obtain a richer model as their models are trained over a larger set of data points. However, this scheme only develops a common ou… ▽ More

    Submitted 22 October, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: To appear in 34th Conference on Neural Information Processing Systems (NeurIPS 2020)

  4. arXiv:2002.05135  [pdf, other

    cs.LG math.OC stat.ML

    On the Convergence Theory of Debiased Model-Agnostic Meta-Reinforcement Learning

    Authors: Alireza Fallah, Kristian Georgiev, Aryan Mokhtari, Asuman Ozdaglar

    Abstract: We consider Model-Agnostic Meta-Learning (MAML) methods for Reinforcement Learning (RL) problems, where the goal is to find a policy using data from several tasks represented by Markov Decision Processes (MDPs) that can be updated by one step of stochastic policy gradient for the realized MDP. In particular, using stochastic gradients in MAML update steps is crucial for RL problems since computati… ▽ More

    Submitted 16 November, 2021; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  5. arXiv:2002.04766  [pdf, other

    cs.LG math.OC stat.ML

    Task-Robust Model-Agnostic Meta-Learning

    Authors: Liam Collins, Aryan Mokhtari, Sanjay Shakkottai

    Abstract: Meta-learning methods have shown an impressive ability to train models that rapidly learn new tasks. However, these methods only aim to perform well in expectation over tasks coming from some particular distribution that is typically equivalent across meta-training and meta-testing, rather than considering worst-case task performance. In this work we introduce the notion of "task-robustness" by re… ▽ More

    Submitted 18 June, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

  6. arXiv:1910.14380  [pdf, other

    math.OC cs.LG stat.ML

    A Decentralized Proximal Point-type Method for Saddle Point Problems

    Authors: Weijie Liu, Aryan Mokhtari, Asuman Ozdaglar, Sarath Pattathil, Zebang Shen, Nenggan Zheng

    Abstract: In this paper, we focus on solving a class of constrained non-convex non-concave saddle point problems in a decentralized manner by a group of nodes in a network. Specifically, we assume that each node has access to a summand of a global objective function and nodes are allowed to exchange information only with their neighboring nodes. We propose a decentralized variant of the proximal point metho… ▽ More

    Submitted 31 October, 2019; originally announced October 2019.

    Comments: 18 pages

  7. arXiv:1910.04322  [pdf, other

    math.OC cs.LG stat.ML

    One Sample Stochastic Frank-Wolfe

    Authors: Mingrui Zhang, Zebang Shen, Aryan Mokhtari, Hamed Hassani, Amin Karbasi

    Abstract: One of the beauties of the projected gradient descent method lies in its rather simple mechanism and yet stable behavior with inexact, stochastic gradients, which has led to its wide-spread use in many machine learning applications. However, once we replace the projection operator with a simpler linear program, as is done in the Frank-Wolfe method, both simplicity and stability take a serious hit.… ▽ More

    Submitted 9 October, 2019; originally announced October 2019.

  8. arXiv:1909.13014  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    FedPAQ: A Communication-Efficient Federated Learning Method with Periodic Averaging and Quantization

    Authors: Amirhossein Reisizadeh, Aryan Mokhtari, Hamed Hassani, Ali Jadbabaie, Ramtin Pedarsani

    Abstract: Federated learning is a distributed framework according to which a model is trained over a set of devices, while kee** data localized. This framework faces several systems-oriented challenges which include (i) communication bottleneck since a large number of devices upload their local updates to a parameter server, and (ii) scalability as the federated network consists of millions of devices. Du… ▽ More

    Submitted 7 June, 2020; v1 submitted 27 September, 2019; originally announced September 2019.

  9. arXiv:1908.10400  [pdf, other

    cs.LG math.OC stat.ML

    On the Convergence Theory of Gradient-Based Model-Agnostic Meta-Learning Algorithms

    Authors: Alireza Fallah, Aryan Mokhtari, Asuman Ozdaglar

    Abstract: We study the convergence of a class of gradient-based Model-Agnostic Meta-Learning (MAML) methods and characterize their overall complexity as well as their best achievable accuracy in terms of gradient norm for nonconvex loss functions. We start with the MAML method and its first-order approximation (FO-MAML) and highlight the challenges that emerge in their analysis. By overcoming these challeng… ▽ More

    Submitted 15 May, 2020; v1 submitted 27 August, 2019; originally announced August 2019.

    Comments: To appear in the proceedings of the $23^{rd}$ International Conference on Artificial Intelligence and Statistics (AISTATS) 2020

  10. arXiv:1907.10595  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    Robust and Communication-Efficient Collaborative Learning

    Authors: Amirhossein Reisizadeh, Hossein Taheri, Aryan Mokhtari, Hamed Hassani, Ramtin Pedarsani

    Abstract: We consider a decentralized learning problem, where a set of computing nodes aim at solving a non-convex optimization problem collaboratively. It is well-known that decentralized optimization schemes face two major system bottlenecks: stragglers' delay and communication overhead. In this paper, we tackle these bottlenecks by proposing a novel decentralized and gradient-based optimization algorithm… ▽ More

    Submitted 31 October, 2019; v1 submitted 24 July, 2019; originally announced July 2019.

  11. arXiv:1906.01115  [pdf, ps, other

    math.OC cs.LG stat.ML

    Convergence Rate of $\mathcal{O}(1/k)$ for Optimistic Gradient and Extra-gradient Methods in Smooth Convex-Concave Saddle Point Problems

    Authors: Aryan Mokhtari, Asuman Ozdaglar, Sarath Pattathil

    Abstract: We study the iteration complexity of the optimistic gradient descent-ascent (OGDA) method and the extra-gradient (EG) method for finding a saddle point of a convex-concave unconstrained min-max problem. To do so, we first show that both OGDA and EG can be interpreted as approximate variants of the proximal point method. This is similar to the approach taken in [Nemirovski, 2004] which analyzes EG… ▽ More

    Submitted 29 September, 2020; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: 19 pages

  12. arXiv:1906.00506  [pdf, ps, other

    math.OC

    DAve-QN: A Distributed Averaged Quasi-Newton Method with Local Superlinear Convergence Rate

    Authors: Saeed Soori, Konstantin Mischenko, Aryan Mokhtari, Maryam Mehri Dehnavi, Mert Gurbuzbalaban

    Abstract: In this paper, we consider distributed algorithms for solving the empirical risk minimization problem under the master/worker communication model. We develop a distributed asynchronous quasi-Newton algorithm that can achieve superlinear convergence. To our knowledge, this is the first distributed asynchronous algorithm with superlinear convergence guarantees. Our algorithm is communication-efficie… ▽ More

    Submitted 10 June, 2019; v1 submitted 2 June, 2019; originally announced June 2019.

  13. On the Properties of Phononic Eigenvalue Problems

    Authors: Amir Ashkan Mokhtari, Yan Lu, Ankit Srivastava

    Abstract: In this paper, we consider the operator properties of various phononic eigenvalue problems. We aim to answer some fundamental questions about the eigenvalues and eigenvectors of phononic operators. These include questions about the potential real and complex nature of the eigenvalues, whether the eigenvectors form a complete basis, what are the right orthogonality relationships, and how to create… ▽ More

    Submitted 7 July, 2019; v1 submitted 16 February, 2019; originally announced February 2019.

    Journal ref: Journal of the Mechanics and Physics of Solids, 2019

  14. arXiv:1902.06992  [pdf, other

    math.OC cs.LG

    Stochastic Conditional Gradient++

    Authors: Hamed Hassani, Amin Karbasi, Aryan Mokhtari, Zebang Shen

    Abstract: In this paper, we consider the general non-oblivious stochastic optimization where the underlying stochasticity may change during the optimization procedure and depends on the point at which the function is evaluated. We develop Stochastic Frank-Wolfe++ ($\text{SFW}{++} $), an efficient variant of the conditional gradient method for minimizing a smooth non-convex function subject to a convex body… ▽ More

    Submitted 8 September, 2020; v1 submitted 19 February, 2019; originally announced February 2019.

  15. arXiv:1902.06332  [pdf, other

    cs.LG cs.DC cs.DS math.OC stat.ML

    Quantized Frank-Wolfe: Faster Optimization, Lower Communication, and Projection Free

    Authors: Mingrui Zhang, Lin Chen, Aryan Mokhtari, Hamed Hassani, Amin Karbasi

    Abstract: How can we efficiently mitigate the overhead of gradient communications in distributed optimization? This problem is at the heart of training scalable machine learning models and has been mainly studied in the unconstrained setting. In this paper, we propose Quantized-Frank-Wolfe (QFW), the first projection-free and communication-efficient algorithm for solving constrained optimization problems at… ▽ More

    Submitted 30 May, 2019; v1 submitted 17 February, 2019; originally announced February 2019.

  16. arXiv:1901.08511  [pdf, ps, other

    math.OC cs.LG stat.ML

    A Unified Analysis of Extra-gradient and Optimistic Gradient Methods for Saddle Point Problems: Proximal Point Approach

    Authors: Aryan Mokhtari, Asuman Ozdaglar, Sarath Pattathil

    Abstract: In this paper we consider solving saddle point problems using two variants of Gradient Descent-Ascent algorithms, Extra-gradient (EG) and Optimistic Gradient Descent Ascent (OGDA) methods. We show that both of these algorithms admit a unified analysis as approximations of the classical proximal point method for solving saddle point problems. This viewpoint enables us to develop a new framework for… ▽ More

    Submitted 5 September, 2019; v1 submitted 24 January, 2019; originally announced January 2019.

    Comments: 25 pages, 3 figures

  17. On the Emergence of Negative Effective Density and Modulus in 2-phase Phononic Crystals

    Authors: Amir Ashkan Mokhtari, Yan Lu, Ankit Srivastava

    Abstract: In this paper we report metamaterial properties including negative and singular effective properties for what would traditionally be considered non locally resonant 2-phase phononic unit cells. The negative effective material properties reported here occur well below the homogenization limit and are, therefore, acceptable descriptions of overall behavior. The material property combinations which m… ▽ More

    Submitted 7 January, 2019; v1 submitted 11 November, 2018; originally announced November 2018.

  18. arXiv:1811.02521  [pdf, ps, other

    math.OC

    Achieving Acceleration in Distributed Optimization via Direct Discretization of the Heavy-Ball ODE

    Authors: **gzhao Zhang, César A. Uribe, Aryan Mokhtari, Ali Jadbabaie

    Abstract: We develop a distributed algorithm for convex Empirical Risk Minimization, the problem of minimizing large but finite sum of convex functions over networks. The proposed algorithm is derived from directly discretizing the second-order heavy-ball differential equation and results in an accelerated convergence rate, i.e, faster than distributed gradient descent-based methods for strongly convex obje… ▽ More

    Submitted 6 November, 2018; originally announced November 2018.

  19. arXiv:1810.11507  [pdf, other

    cs.LG stat.ML

    Efficient Distributed Hessian Free Algorithm for Large-scale Empirical Risk Minimization via Accumulating Sample Strategy

    Authors: Majid Jahani, Xi He, Chenxin Ma, Aryan Mokhtari, Dheevatsa Mudigere, Alejandro Ribeiro, Martin Takáč

    Abstract: In this paper, we propose a Distributed Accumulated Newton Conjugate gradiEnt (DANCE) method in which sample size is gradually increasing to quickly obtain a solution whose empirical loss is under satisfactory statistical accuracy. Our proposed method is multistage in which the solution of a stage serves as a warm start for the next stage which contains more samples (including the samples in the p… ▽ More

    Submitted 9 March, 2020; v1 submitted 26 October, 2018; originally announced October 2018.

    Comments: Updated numerical results

  20. arXiv:1809.02162  [pdf, ps, other

    cs.LG math.OC stat.ML

    Esca** Saddle Points in Constrained Optimization

    Authors: Aryan Mokhtari, Asuman Ozdaglar, Ali Jadbabaie

    Abstract: In this paper, we study the problem of esca** from saddle points in smooth nonconvex optimization problems subject to a convex set $\mathcal{C}$. We propose a generic framework that yields convergence to a second-order stationary point of the problem, if the convex set $\mathcal{C}$ is simple for a quadratic objective function. Specifically, our results hold if one can find a $ρ$-approximate sol… ▽ More

    Submitted 9 October, 2018; v1 submitted 6 September, 2018; originally announced September 2018.

  21. A Primal-Dual Quasi-Newton Method for Exact Consensus Optimization

    Authors: Mark Eisen, Aryan Mokhtari, Alejandro Ribeiro

    Abstract: We introduce the primal-dual quasi-Newton (PD-QN) method as an approximated second order method for solving decentralized optimization problems. The PD-QN method performs quasi-Newton updates on both the primal and dual variables of the consensus optimization problem to find the optimal point of the augmented Lagrangian. By optimizing the augmented Lagrangian, the PD-QN method is able to find the… ▽ More

    Submitted 10 July, 2019; v1 submitted 4 September, 2018; originally announced September 2018.

  22. Enhanced magnetic properties in ZnCoAlO caused by exchangecoupling to Co nanoparticles

    Authors: Qi Feng, Wala Dizayee, Xiaoli Li, David S Score, James R Neal, Anthony J Behan, Abbas Mokhtari, Marzook S Alshammari, Mohammed S Al-Qahtani, Harry J Blythe, Roy W Chantrell, Steve M Heald, Xiao-Hong Xu, A Mark Fox, Gillian A Gehring

    Abstract: Wereport the results of a sequence of magnetisation and magneto-optical studies on laser ablated thin films of ZnCoAlO and ZnCoO that contain a small amount of metallic cobalt. The results are compared to those expected when all the magnetization is due to isolated metallic clusters of cobalt and with an oxide sample that is almost free from metallic inclusions. Using a variety of direct magnetic… ▽ More

    Submitted 7 August, 2018; originally announced August 2018.

    Comments: 13 pages, 6 figures

    Journal ref: New J. Phys. 18 (2016) 113040

  23. arXiv:1806.11536  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    An Exact Quantized Decentralized Gradient Descent Algorithm

    Authors: Amirhossein Reisizadeh, Aryan Mokhtari, Hamed Hassani, Ramtin Pedarsani

    Abstract: We consider the problem of decentralized consensus optimization, where the sum of $n$ smooth and strongly convex functions are minimized over $n$ distributed agents that form a connected network. In particular, we consider the case that the communicated local decision variables among nodes are quantized in order to alleviate the communication bottleneck in distributed optimization. We propose the… ▽ More

    Submitted 1 August, 2019; v1 submitted 29 June, 2018; originally announced June 2018.

  24. arXiv:1805.09969  [pdf, ps, other

    stat.ML cs.LG

    Towards More Efficient Stochastic Decentralized Learning: Faster Convergence and Sparse Communication

    Authors: Zebang Shen, Aryan Mokhtari, Tengfei Zhou, Peilin Zhao, Hui Qian

    Abstract: Recently, the decentralized optimization problem is attracting growing attention. Most existing methods are deterministic with high per-iteration cost and have a convergence rate quadratically depending on the problem condition number. Besides, the dense communication is necessary to ensure the convergence even if the dataset is sparse. In this paper, we generalize the decentralized optimization p… ▽ More

    Submitted 24 May, 2018; originally announced May 2018.

    Comments: Accepted to ICML 2018

  25. arXiv:1805.00521  [pdf, other

    math.OC cs.LG stat.ML

    Direct Runge-Kutta Discretization Achieves Acceleration

    Authors: **gzhao Zhang, Aryan Mokhtari, Suvrit Sra, Ali Jadbabaie

    Abstract: We study gradient-based optimization methods obtained by directly discretizing a second-order ordinary differential equation (ODE) related to the continuous limit of Nesterov's accelerated gradient method. When the function is smooth enough, we show that acceleration can be achieved by a stable discretization of this ODE using standard Runge-Kutta integrators. Specifically, we prove that under Lip… ▽ More

    Submitted 27 November, 2018; v1 submitted 1 May, 2018; originally announced May 2018.

    Comments: 24 pages. 4 figures

  26. arXiv:1804.09554  [pdf, other

    math.OC cs.LG stat.ML

    Stochastic Conditional Gradient Methods: From Convex Minimization to Submodular Maximization

    Authors: Aryan Mokhtari, Hamed Hassani, Amin Karbasi

    Abstract: This paper considers stochastic optimization problems for a large class of objective functions, including convex and continuous submodular. Stochastic proximal gradient methods have been widely used to solve such problems; however, their applicability remains limited when the problem dimension is large and the projection onto a convex set is costly. Instead, stochastic conditional gradient methods… ▽ More

    Submitted 12 November, 2018; v1 submitted 24 April, 2018; originally announced April 2018.

    Comments: arXiv admin note: text overlap with arXiv:1711.01660

  27. arXiv:1802.03825  [pdf, other

    math.OC

    Decentralized Submodular Maximization: Bridging Discrete and Continuous Settings

    Authors: Aryan Mokhtari, Hamed Hassani, Amin Karbasi

    Abstract: In this paper, we showcase the interplay between discrete and continuous optimization in network-structured settings. We propose the first fully decentralized optimization method for a wide class of non-convex objective functions that possess a diminishing returns property. More specifically, given an arbitrary connected network and a global continuous submodular function, formed by a sum of local… ▽ More

    Submitted 11 February, 2018; originally announced February 2018.

  28. arXiv:1711.01660  [pdf, other

    math.OC cs.LG

    Conditional Gradient Method for Stochastic Submodular Maximization: Closing the Gap

    Authors: Aryan Mokhtari, Hamed Hassani, Amin Karbasi

    Abstract: In this paper, we study the problem of \textit{constrained} and \textit{stochastic} continuous submodular maximization. Even though the objective function is not concave (nor convex) and is defined in terms of an expectation, we develop a variant of the conditional gradient method, called \alg, which achieves a \textit{tight} approximation guarantee. More precisely, for a monotone and continuous D… ▽ More

    Submitted 5 November, 2017; originally announced November 2017.

  29. arXiv:1710.03738  [pdf, ps, other

    hep-th cond-mat.str-el

    Diffusivities bounds in the presence of Weyl corrections

    Authors: Ali Mokhtari, Seyed Ali Hosseini Mansoori, Kazem Bitaghsir Fadafan

    Abstract: In this paper, we investigate the behavior of the thermoelectric DC conductivities in the presence of Weyl corrections with momentum dissipation in the incoherent limit. Moreover, we compute the butterfly velocity and study the charge and energy diffusion with broken translational symmetry. Our results show that the Weyl coupling $γ$, violates the bounds on the charge and energy diffusivity. It is… ▽ More

    Submitted 24 September, 2018; v1 submitted 10 October, 2017; originally announced October 2017.

    Comments: v4: The appendix D and E were added

  30. arXiv:1709.00599  [pdf, other

    cs.LG math.OC

    First-Order Adaptive Sample Size Methods to Reduce Complexity of Empirical Risk Minimization

    Authors: Aryan Mokhtari, Alejandro Ribeiro

    Abstract: This paper studies empirical risk minimization (ERM) problems for large-scale datasets and incorporates the idea of adaptive sample size methods to improve the guaranteed convergence bounds for first-order stochastic and deterministic methods. In contrast to traditional methods that attempt to solve the ERM problem corresponding to the full dataset directly, adaptive sample size schemes start with… ▽ More

    Submitted 2 September, 2017; originally announced September 2017.

  31. arXiv:1707.08028  [pdf, ps, other

    math.OC

    A Newton-Based Method for Nonconvex Optimization with Fast Evasion of Saddle Points

    Authors: Santiago Paternain, Aryan Mokhtari, Alejandro Ribeiro

    Abstract: Machine learning problems such as neural network training, tensor decomposition, and matrix factorization, require local minimization of a nonconvex function. This local minimization is challenged by the presence of saddle points, of which there can be many and from which descent methods may take inordinately large number of iterations to escape. This paper presents a second-order method that modi… ▽ More

    Submitted 20 July, 2018; v1 submitted 25 July, 2017; originally announced July 2017.

  32. arXiv:1705.07957  [pdf, ps, other

    math.OC cs.LG stat.ML

    Large Scale Empirical Risk Minimization via Truncated Adaptive Newton Method

    Authors: Mark Eisen, Aryan Mokhtari, Alejandro Ribeiro

    Abstract: We consider large scale empirical risk minimization (ERM) problems, where both the problem dimension and variable size is large. In these cases, most second order methods are infeasible due to the high cost in both computing the Hessian over all samples and computing its inverse in high dimensions. In this paper, we propose a novel adaptive sample size second-order method, which reduces the cost o… ▽ More

    Submitted 22 May, 2017; originally announced May 2017.

  33. arXiv:1702.00709  [pdf, other

    math.OC cs.LG

    IQN: An Incremental Quasi-Newton Method with Local Superlinear Convergence Rate

    Authors: Aryan Mokhtari, Mark Eisen, Alejandro Ribeiro

    Abstract: The problem of minimizing an objective that can be written as the sum of a set of $n$ smooth and strongly convex functions is considered. The Incremental Quasi-Newton (IQN) method proposed here belongs to the family of stochastic and incremental methods that have a cost per iteration independent of $n$. IQN iterations are a stochastic version of BFGS iterations that use memory to reduce the varian… ▽ More

    Submitted 27 March, 2017; v1 submitted 2 February, 2017; originally announced February 2017.

  34. arXiv:1611.00347  [pdf, other

    math.OC cs.LG

    Surpassing Gradient Descent Provably: A Cyclic Incremental Method with Linear Convergence Rate

    Authors: Aryan Mokhtari, Mert Gürbüzbalaban, Alejandro Ribeiro

    Abstract: Recently, there has been growing interest in develo** optimization methods for solving large-scale machine learning problems. Most of these problems boil down to the problem of minimizing an average of a finite set of smooth and strongly convex functions where the number of functions $n$ is large. Gradient descent method (GD) is successful in minimizing convex problems at a fast linear rate; how… ▽ More

    Submitted 7 February, 2018; v1 submitted 1 November, 2016; originally announced November 2016.

  35. arXiv:1610.02143  [pdf, other

    math.OC cs.DC cs.LG stat.ML

    Stochastic Averaging for Constrained Optimization with Application to Online Resource Allocation

    Authors: Tianyi Chen, Aryan Mokhtari, Xin Wang, Alejandro Ribeiro, Georgios B. Giannakis

    Abstract: Existing approaches to resource allocation for nowadays stochastic networks are challenged to meet fast convergence and tolerable delay requirements. The present paper leverages online learning advances to facilitate stochastic resource allocation tasks. By recognizing the central role of Lagrange multipliers, the underlying constrained optimization problem is formulated as a machine learning task… ▽ More

    Submitted 26 February, 2017; v1 submitted 7 October, 2016; originally announced October 2016.

  36. arXiv:1606.04991  [pdf, other

    cs.LG math.OC stat.ML

    A Class of Parallel Doubly Stochastic Algorithms for Large-Scale Learning

    Authors: Aryan Mokhtari, Alec Koppel, Alejandro Ribeiro

    Abstract: We consider learning problems over training sets in which both, the number of training examples and the dimension of the feature vectors, are large. To solve these problems we propose the random parallel stochastic algorithm (RAPSA). We call the algorithm random parallel because it utilizes multiple parallel processors to operate on a randomly chosen subset of blocks of the feature vector. We call… ▽ More

    Submitted 15 June, 2016; originally announced June 2016.

    Comments: arXiv admin note: substantial text overlap with arXiv:1603.06782

  37. arXiv:1605.07659  [pdf, other

    cs.LG math.OC

    Adaptive Newton Method for Empirical Risk Minimization to Statistical Accuracy

    Authors: Aryan Mokhtari, Alejandro Ribeiro

    Abstract: We consider empirical risk minimization for large-scale datasets. We introduce Ada Newton as an adaptive algorithm that uses Newton's method with adaptive sample sizes. The main idea of Ada Newton is to increase the size of the training set by a factor larger than one in a way that the minimization variable for the current training set is in the local neighborhood of the optimal argument of the ne… ▽ More

    Submitted 24 May, 2016; originally announced May 2016.

  38. Decentralized Quasi-Newton Methods

    Authors: Mark Eisen, Aryan Mokhtari, Alejandro Ribeiro

    Abstract: We introduce the decentralized Broyden-Fletcher-Goldfarb-Shanno (D-BFGS) method as a variation of the BFGS quasi-Newton method for solving decentralized optimization problems. The D-BFGS method is of interest in problems that are not well conditioned, making first order decentralized methods ineffective, and in which second order information is not readily available, making second order decentrali… ▽ More

    Submitted 3 May, 2016; originally announced May 2016.

  39. arXiv:1603.08094  [pdf, other

    math.OC

    A Decentralized Second-Order Method for Dynamic Optimization

    Authors: Aryan Mokhtari, Wei Shi, Qing Ling, Alejandro Ribeiro

    Abstract: This paper considers decentralized dynamic optimization problems where nodes of a network try to minimize a sequence of time-varying objective functions in a real-time scheme. At each time slot, nodes have access to different summands of an instantaneous global objective function and they are allowed to exchange information only with their neighbors. This paper develops the application of the Exac… ▽ More

    Submitted 26 March, 2016; originally announced March 2016.

  40. arXiv:1603.07195  [pdf, other

    math.OC cs.DC cs.LG

    A Decentralized Quasi-Newton Method for Dual Formulations of Consensus Optimization

    Authors: Mark Eisen, Aryan Mokhtari, Alejandro Ribeiro

    Abstract: This paper considers consensus optimization problems where each node of a network has access to a different summand of an aggregate cost function. Nodes try to minimize the aggregate cost function, while they exchange information only with their neighbors. We modify the dual decomposition method to incorporate a curvature correction inspired by the Broyden-Fletcher-Goldfarb-Shanno (BFGS) quasi-New… ▽ More

    Submitted 23 March, 2016; originally announced March 2016.

    Comments: 8 pages

  41. arXiv:1603.06782  [pdf, other

    cs.LG math.OC

    Doubly Random Parallel Stochastic Methods for Large Scale Learning

    Authors: Aryan Mokhtari, Alec Koppel, Alejandro Ribeiro

    Abstract: We consider learning problems over training sets in which both, the number of training examples and the dimension of the feature vectors, are large. To solve these problems we propose the random parallel stochastic algorithm (RAPSA). We call the algorithm random parallel because it utilizes multiple processors to operate in a randomly chosen subset of blocks of the feature vector. We call the algo… ▽ More

    Submitted 22 March, 2016; originally announced March 2016.

  42. arXiv:1603.04954  [pdf, other

    cs.LG math.OC

    Online Optimization in Dynamic Environments: Improved Regret Rates for Strongly Convex Problems

    Authors: Aryan Mokhtari, Shahin Shahrampour, Ali Jadbabaie, Alejandro Ribeiro

    Abstract: In this paper, we address tracking of a time-varying parameter with unknown dynamics. We formalize the problem as an instance of online optimization in a dynamic setting. Using online gradient descent, we propose a method that sequentially predicts the value of the parameter and in turn suffers a loss. The objective is to minimize the accumulation of losses over the time horizon, a notion that is… ▽ More

    Submitted 16 March, 2016; originally announced March 2016.

  43. arXiv:1602.07245  [pdf, other

    hep-th cond-mat.supr-con gr-qc

    Weyl holographic superconductor in the Lifshitz black hole background

    Authors: S. A. Hosseini Mansoori, B. Mirza, A. Mokhtari, F. Lalehgani Dezaki, Z. Sherkatghanad

    Abstract: We investigate analytically the properties of the Weyl holographic superconductor in the Lifshitz black hole background. We find that the critical temperature of the Weyl superconductor decreases with increasing Lifshitz dynamical exponent, $z$, indicating that condensation becomes difficult. In addition, it is found that the critical temperature and condensation operator could be affected by appl… ▽ More

    Submitted 21 July, 2016; v1 submitted 23 February, 2016; originally announced February 2016.

    Comments: 25 pages, 22 figures

    Journal ref: JHEP07(2016)111

  44. arXiv:1602.01716  [pdf, other

    math.OC cs.IT

    Decentralized Prediction-Correction Methods for Networked Time-Varying Convex Optimization

    Authors: Andrea Simonetto, Alec Koppel, Aryan Mokhtari, Geert Leus, Alejandro Ribeiro

    Abstract: We develop algorithms that find and track the optimal solution trajectory of time-varying convex optimization problems which consist of local and network-related objectives. The algorithms are derived from the prediction-correction methodology, which corresponds to a strategy where the time-varying problem is sampled at discrete time instances and then a sequence is generated via alternatively exe… ▽ More

    Submitted 7 November, 2016; v1 submitted 4 February, 2016; originally announced February 2016.

  45. arXiv:1602.00596  [pdf, other

    math.OC cs.DC

    A Decentralized Second-Order Method with Exact Linear Convergence Rate for Consensus Optimization

    Authors: Aryan Mokhtari, Wei Shi, Qing Ling, Alejandro Ribeiro

    Abstract: This paper considers decentralized consensus optimization problems where different summands of a global objective function are available at nodes of a network that can communicate with neighbors only. The proximal method of multipliers is considered as a powerful tool that relies on proximal primal descent and dual ascent updates on a suitably defined augmented Lagrangian. The structure of the aug… ▽ More

    Submitted 1 February, 2016; originally announced February 2016.

  46. arXiv:1510.07356  [pdf, other

    math.OC

    Decentralized Quadratically Approximated Alternating Direction Method of Multipliers

    Authors: Aryan Mokhtari, Wei Shi, Qing Ling, Alejandro Ribeiro

    Abstract: This paper considers an optimization problem that components of the objective function are available at different nodes of a network and nodes are allowed to only exchange information with their neighbors. The decentralized alternating method of multipliers (DADMM) is a well-established iterative method for solving this category of problems; however, implementation of DADMM requires solving an opt… ▽ More

    Submitted 25 October, 2015; originally announced October 2015.

    Comments: arXiv admin note: substantial text overlap with arXiv:1508.02073

  47. A Class of Prediction-Correction Methods for Time-Varying Convex Optimization

    Authors: Andrea Simonetto, Aryan Mokhtari, Alec Koppel, Geert Leus, Alejandro Ribeiro

    Abstract: This paper considers unconstrained convex optimization problems with time-varying objective functions. We propose algorithms with a discrete time-sampling scheme to find and track the solution trajectory based on prediction and correction steps, while sampling the problem data at a constant rate of $1/h$, where $h$ is the length of the sampling interval. The prediction step is derived by analyzing… ▽ More

    Submitted 11 May, 2016; v1 submitted 17 September, 2015; originally announced September 2015.

    Comments: 16 pages, 8 figures

    Journal ref: IEEE Transactions on Signal Processing, vol. 64 (17), pages 4576 - 4591, 2016

  48. DQM: Decentralized Quadratically Approximated Alternating Direction Method of Multipliers

    Authors: Aryan Mokhtari, Wei Shi, Qing Ling, Alejandro Ribeiro

    Abstract: This paper considers decentralized consensus optimization problems where nodes of a network have access to different summands of a global objective function. Nodes cooperate to minimize the global objective by exchanging information with neighbors only. A decentralized version of the alternating directions method of multipliers (DADMM) is a common method for solving this category of problems. DADM… ▽ More

    Submitted 9 August, 2015; originally announced August 2015.

    Comments: 13 pages

  49. arXiv:1506.04216  [pdf, ps, other

    math.OC

    DSA: Decentralized Double Stochastic Averaging Gradient Algorithm

    Authors: Aryan Mokhtari, Alejandro Ribeiro

    Abstract: This paper considers convex optimization problems where nodes of a network have access to summands of a global objective. Each of these local objectives is further assumed to be an average of a finite set of functions. The motivation for this setup is to solve large scale machine learning problems where elements of the training set are distributed to multiple computational elements. The decentrali… ▽ More

    Submitted 12 June, 2015; originally announced June 2015.

  50. arXiv:1505.02344  [pdf, ps, other

    math.RA

    More on Lie Derivations of Generalized Matrix Algebras

    Authors: A. H. Mokhtari, H. R. Ebrahimi Vishki

    Abstract: Motivated by the Cheung's elaborate work [Linear Multilinear Algebra, 51 (2003), 299-310], we investigate the construction of a Lie derivation on a generalized matrix algebra and apply it to give a characterization for such a Lie derivation to be proper. Our approach not only provides a direct proof for some known results in the theory, but also it presents several sufficient conditions assuring t… ▽ More

    Submitted 10 May, 2015; originally announced May 2015.

    Comments: 11 pages

    MSC Class: 16W25; 15A78; 47B47