Showing 1–46 of 46 results for author: Lavaei, J

  1. arXiv:2405.16601  [pdf, other]

    cs.LG

    A CMDP-within-online framework for Meta-Safe Reinforcement Learning

    Authors: Vanshaj Khattar, Yuhao Ding, Bilgehan Sel, Javad Lavaei, Ming Jin

    Abstract: Meta-reinforcement learning has widely been used as a learning-to-learn framework to solve unseen tasks with limited experience. However, the aspect of constraint violations has not been adequately addressed in the existing works, making their application restricted in real-world settings. In this paper, we study the problem of meta-safe reinforcement learning (Meta-SRL) through the CMDP-within-on…

    Submitted 26 May, 2024; originally announced May 2024.

    Journal ref: ICLR 2023

  2. arXiv:2405.16053  [pdf, other]

    cs.LG

    Pausing Policy Learning in Non-stationary Reinforcement Learning

    Authors: Hyunin Lee, Ming Jin, Javad Lavaei, Somayeh Sojoudi

    Abstract: Real-time inference is a challenge of real-world reinforcement learning due to temporal differences in time-varying environments: the system collects data from the past, updates the decision model in the present, and deploys it in the future. We tackle a common belief that continually updating the decision is optimal to minimize the temporal gap. We propose forecasting an online reinforcement lear…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: conference

  3. arXiv:2403.15099  [pdf, other]

    math.OC math.NA stat.AP

    Optimal Contract Design for End-of-Life Care Payments

    Authors: Muyan Jiang, Ying Chen, Xin Chen, Javad Lavaei, Anil Aswani

    Abstract: A large fraction of total healthcare expenditure occurs due to end-of-life (EOL) care, which means it is important to study the problem of more carefully incentivizing necessary versus unnecessary EOL care because this has the potential to reduce overall healthcare spending. This paper introduces a principal-agent model that integrates a mixed payment system of fee-for-service and pay-for-performa…

    Submitted 22 March, 2024; originally announced March 2024.

  4. arXiv:2403.06056  [pdf, other]

    math.OC cs.LG eess.SP

    Absence of spurious solutions far from ground truth: A low-rank analysis with high-order losses

    Authors: Ziye Ma, Ying Chen, Javad Lavaei, Somayeh Sojoudi

    Abstract: Matrix sensing problems exhibit pervasive non-convexity, plaguing optimization with a proliferation of suboptimal spurious solutions. Avoiding convergence to these critical points poses a major challenge. This work provides new theoretical insights that help demystify the intricacies of the non-convex landscape. In this work, we prove that under certain conditions, critical points sufficiently dis…

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: Accepted by AISTATS 2024

  5. arXiv:2310.15549  [pdf, other]

    math.OC cs.LG

    Algorithmic Regularization in Tensor Optimization: Towards a Lifted Approach in Matrix Sensing

    Authors: Ziye Ma, Javad Lavaei, Somayeh Sojoudi

    Abstract: Gradient descent (GD) is crucial for generalization in machine learning models, as it induces implicit regularization, promoting compact representations. In this work, we examine the role of GD in inducing implicit regularization for tensor optimization, particularly within the context of the lifted matrix sensing framework. This framework has been recently proposed to address the non-convex matri…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: NeurIPS23 Poster

  6. arXiv:2309.14989  [pdf, other]

    cs.LG

    Tempo Adaptation in Non-stationary Reinforcement Learning

    Authors: Hyunin Lee, Yuhao Ding, Jongmin Lee, Ming Jin, Javad Lavaei, Somayeh Sojoudi

    Abstract: We first raise and tackle a ``time synchronization'' issue between the agent and the environment in non-stationary reinforcement learning (RL), a crucial factor hindering its real-world applications. In reality, environmental changes occur over wall-clock time ($t$) rather than episode progress ($k$), where wall-clock time signifies the actual elapsed time within the fixed duration $t \in [0, T]$.…

    Submitted 27 October, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: 53 pages. To be published in Neural Information Processing Systems (NeurIPS), 2023

  7. arXiv:2305.17568  [pdf, other]

    cs.LG math.OC

    Scalable Primal-Dual Actor-Critic Method for Safe Multi-Agent RL with General Utilities

    Authors: Donghao Ying, Yunkai Zhang, Yuhao Ding, Alec Koppel, Javad Lavaei

    Abstract: We investigate safe multi-agent reinforcement learning, where agents seek to collectively maximize an aggregate sum of local objectives while satisfying their own safety constraints. The objective and constraints are described by {\it general utilities}, i.e., nonlinear functions of the long-term state-action occupancy measure, which encompass broader decision-making goals such as risk, exploratio…

    Submitted 27 May, 2023; originally announced May 2023.

    Comments: 50 pages

  8. arXiv:2305.17567  [pdf, other]

    cs.GT math.OC

    No-Regret Learning in Dynamic Competition with Reference Effects Under Logit Demand

    Authors: Mengzi Amy Guo, Donghao Ying, Javad Lavaei, Zuo-Jun Max Shen

    Abstract: This work is dedicated to the algorithm design in a competitive framework, with the primary goal of learning a stable equilibrium. We consider the dynamic price competition between two firms operating within an opaque marketplace, where each firm lacks information about its competitor. The demand follows the multinomial logit (MNL) choice model, which depends on the consumers' observed price and t…

    Submitted 27 May, 2023; originally announced May 2023.

  9. arXiv:2305.10506  [pdf, other]

    cs.LG math.OC

    Exact Recovery for System Identification with More Corrupt Data than Clean Data

    Authors: Baturalp Yalcin, Haixiang Zhang, Javad Lavaei, Murat Arcak

    Abstract: This paper investigates the system identification problem for linear discrete-time systems under adversaries and analyzes two lasso-type estimators. We examine both asymptotic and non-asymptotic properties of these estimators in two separate scenarios, corresponding to deterministic and stochastic models for the attack times. Since the samples collected from the system are correlated, the existing…

    Submitted 24 April, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

    MSC Class: 62; 90; 93

  10. arXiv:2302.11190  [pdf, other]

    math.OC

    A Hitting Time Analysis for Stochastic Time-Varying Functions with Applications to Adversarial Attacks on Computation of Markov Decision Processes

    Authors: Ali Yekkehkhany, Han Feng, Donghao Ying, Javad Lavaei

    Abstract: Stochastic time-varying optimization is an integral part of learning in which the shape of the function changes over time in a non-deterministic manner. This paper considers multiple models of stochastic time variation and analyzes the corresponding notion of hitting time for each model, i.e., the period after which optimizing the stochastic time-varying function reveals informative statistics on…

    Submitted 22 February, 2023; originally announced February 2023.

  11. arXiv:2302.07938  [pdf, ps, other]

    cs.LG cs.AI cs.MA

    Scalable Multi-Agent Reinforcement Learning with General Utilities

    Authors: Donghao Ying, Yuhao Ding, Alec Koppel, Javad Lavaei

    Abstract: We study the scalable multi-agent reinforcement learning (MARL) with general utilities, defined as nonlinear functions of the team's long-term state-action occupancy measure. The objective is to find a localized policy that maximizes the average of the team's local utility functions without the full observability of each agent in the team. By exploiting the spatial correlation decay property of th…

    Submitted 26 August, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: Supplementary material for the contribution to American Control Conference 2023 under the same title

  12. arXiv:2302.07828  [pdf, other]

    math.OC cs.LG

    Over-parametrization via Lifting for Low-rank Matrix Sensing: Conversion of Spurious Solutions to Strict Saddle Points

    Authors: Ziye Ma, Igor Molybog, Javad Lavaei, Somayeh Sojoudi

    Abstract: This paper studies the role of over-parametrization in solving non-convex optimization problems. The focus is on the important class of low-rank matrix sensing, where we propose an infinite hierarchy of non-convex problems via the lifting technique and the Burer-Monteiro factorization. This contrasts with the existing over-parametrization technique where the search rank is limited by the dimension…

    Submitted 15 February, 2023; originally announced February 2023.

  13. arXiv:2211.10815  [pdf, other]

    cs.LG math.OC

    Non-stationary Risk-sensitive Reinforcement Learning: Near-optimal Dynamic Regret, Adaptive Detection, and Separation Design

    Authors: Yuhao Ding, Ming Jin, Javad Lavaei

    Abstract: We study risk-sensitive reinforcement learning (RL) based on an entropic risk measure in episodic non-stationary Markov decision processes (MDPs). Both the reward functions and the state transition kernels are unknown and allowed to vary arbitrarily over time with a budget on their cumulative variations. When this variation budget is known a priori, we propose two restart-based algorithms, namely R…

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: 33 pages, 3 figures, AAAI 2023. arXiv admin note: text overlap with arXiv:2111.03947, arXiv:2102.05406 by other authors

  14. arXiv:2210.01421  [pdf, other]

    eess.SY math.ST

    Learning of Dynamical Systems under Adversarial Attacks -- Null Space Property Perspective

    Authors: Han Feng, Baturalp Yalcin, Javad Lavaei

    Abstract: We study the identification of a linear time-invariant dynamical system affected by large-and-sparse disturbances modeling adversarial attacks or faults. Under the assumption that the states are measurable, we develop necessary and sufficient conditions for the recovery of the system matrices by solving a constrained lasso-type optimization problem. In addition, we provide an upper bound on the es…

    Submitted 5 October, 2022; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: 8 pages, 2 figures

    MSC Class: 93

  15. arXiv:2208.07469  [pdf, ps, other]

    math.OC cs.LG

    Semidefinite Programming versus Burer-Monteiro Factorization for Matrix Sensing

    Authors: Baturalp Yalcin, Ziye Ma, Javad Lavaei, Somayeh Sojoudi

    Abstract: Many fundamental low-rank optimization problems, such as matrix completion, phase synchronization/retrieval, power system state estimation, and robust PCA, can be formulated as the matrix sensing problem. Two main approaches for solving matrix sensing are based on semidefinite programming (SDP) and Burer-Monteiro (B-M) factorization. The SDP method suffers from high computational and space complex…

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: 21 pages

    MSC Class: 90C22; 90C26

  16. arXiv:2205.10715  [pdf, other]

    cs.LG math.OC

    Policy-based Primal-Dual Methods for Concave CMDP with Variance Reduction

    Authors: Donghao Ying, Mengzi Amy Guo, Hyunin Lee, Yuhao Ding, Javad Lavaei, Zuo-Jun Max Shen

    Abstract: We study Concave Constrained Markov Decision Processes (Concave CMDPs) where both the objective and constraints are defined as concave functions of the state-action occupancy measure. We propose the Variance-Reduced Primal-Dual Policy Gradient Algorithm (VR-PDPG), which updates the primal variable via policy gradient ascent and the dual variable via projected sub-gradient descent. Despite the chal…

    Submitted 26 May, 2024; v1 submitted 21 May, 2022; originally announced May 2022.

  17. arXiv:2204.02364  [pdf, other]

    math.OC

    A New Complexity Metric for Nonconvex Rank-one Generalized Matrix Completion

    Authors: Haixiang Zhang, Baturalp Yalcin, Javad Lavaei, Somayeh Sojoudi

    Abstract: In this work, we develop a new complexity metric for an important class of low-rank matrix optimization problems in both symmetric and asymmetric cases, where the metric aims to quantify the complexity of the nonconvex optimization landscape of each problem and the success of local search methods in solving the problem. The existing literature has focused on two complexity bounds. The RIP constant…

    Submitted 21 July, 2023; v1 submitted 5 April, 2022; originally announced April 2022.

  18. arXiv:2201.11965  [pdf, ps, other]

    cs.LG

    Provably Efficient Primal-Dual Reinforcement Learning for CMDPs with Non-stationary Objectives and Constraints

    Authors: Yuhao Ding, Javad Lavaei

    Abstract: We consider primal-dual-based reinforcement learning (RL) in episodic constrained Markov decision processes (CMDPs) with non-stationary objectives and constraints, which plays a central role in ensuring the safety of RL in time-varying environments. In this problem, the reward/utility functions and the state transition functions are both allowed to vary arbitrarily over time as long as their cumul…

    Submitted 19 November, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: 32 pages, AAAI 2023

  19. arXiv:2110.10279  [pdf, other]

    math.OC cs.LG

    Factorization Approach for Low-complexity Matrix Completion Problems: Exponential Number of Spurious Solutions and Failure of Gradient Methods

    Authors: Baturalp Yalcin, Haixiang Zhang, Javad Lavaei, Somayeh Sojoudi

    Abstract: It is well-known that the Burer-Monteiro (B-M) factorization approach can efficiently solve low-rank matrix optimization problems under the RIP condition. It is natural to ask whether B-M factorization-based methods can succeed on any low-rank matrix optimization problems with a low information-theoretic complexity, i.e., polynomial-time solvable problems that have a unique solution. In this work,…

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: 21 pages, 1 figure

  20. arXiv:2110.10117  [pdf, other]

    cs.LG

    Beyond Exact Gradients: Convergence of Stochastic Soft-Max Policy Gradient Methods with Entropy Regularization

    Authors: Yuhao Ding, Junzi Zhang, Javad Lavaei

    Abstract: Entropy regularization is an efficient technique for encouraging exploration and preventing a premature convergence of (vanilla) policy gradient methods in reinforcement learning (RL). However, the theoretical understanding of entropy regularized RL algorithms has been limited. In this paper, we revisit the classical entropy regularized policy gradient methods with the soft-max policy parametrizat…

    Submitted 10 February, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

  21. arXiv:2110.10116  [pdf, ps, other]

    cs.LG math.OC

    On the Global Optimum Convergence of Momentum-based Policy Gradient

    Authors: Yuhao Ding, Junzi Zhang, Javad Lavaei

    Abstract: Policy gradient (PG) methods are popular and efficient for large-scale reinforcement learning due to their relative stability and incremental nature. In recent years, the empirical success of PG methods has led to the development of a theoretical foundation for these methods. In this work, we generalize this line of research by studying the global convergence of stochastic PG methods with momentum…

    Submitted 22 May, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

    Comments: AISTATS 2022

  22. arXiv:2110.08923  [pdf, ps, other]

    cs.LG math.OC

    A Dual Approach to Constrained Markov Decision Processes with Entropy Regularization

    Authors: Donghao Ying, Yuhao Ding, Javad Lavaei

    Abstract: We study entropy-regularized constrained Markov decision processes (CMDPs) under the soft-max parameterization, in which an agent aims to maximize the entropy-regularized value function while satisfying constraints on the expected total utility. By leveraging the entropy regularization, our theoretical analysis shows that its Lagrangian dual function is smooth and the Lagrangian duality gap can be…

    Submitted 7 April, 2023; v1 submitted 17 October, 2021; originally announced October 2021.

    Comments: 24 pages, AISTATS22

  23. arXiv:2105.08232  [pdf, other]

    math.OC cs.LG stat.ML

    Sharp Restricted Isometry Property Bounds for Low-rank Matrix Recovery Problems with Corrupted Measurements

    Authors: Ziye Ma, Yingjie Bi, Javad Lavaei, Somayeh Sojoudi

    Abstract: In this paper, we study a general low-rank matrix recovery problem with linear measurements corrupted by some noise. The objective is to understand under what conditions on the restricted isometry property (RIP) of the problem local search methods can find the ground truth with a small error. By analyzing the landscape of the non-convex problem, we first propose a global guarantee on the maximum d…

    Submitted 25 July, 2023; v1 submitted 17 May, 2021; originally announced May 2021.

  24. arXiv:2104.13348  [pdf, other]

    math.OC

    Local and Global Linear Convergence of General Low-rank Matrix Recovery Problems

    Authors: Yingjie Bi, Haixiang Zhang, Javad Lavaei

    Abstract: We study the convergence rate of gradient-based local search methods for solving low-rank matrix recovery problems with general objectives in both symmetric and asymmetric cases, under the assumption of the restricted isometry property. First, we develop a new technique to verify the Polyak-Lojasiewicz inequality in a neighborhood of the global minimizers, which leads to a local linear convergence…

    Submitted 8 March, 2022; v1 submitted 27 April, 2021; originally announced April 2021.

  25. arXiv:2104.10356  [pdf, ps, other]

    math.OC

    General Low-rank Matrix Optimization: Geometric Analysis and Sharper Bounds

    Authors: Haixiang Zhang, Yingjie Bi, Javad Lavaei

    Abstract: This paper considers the global geometry of general low-rank minimization problems via the Burer-Monteiro factorization approach. For the rank-$1$ case, we prove that there is no spurious second-order critical point for both symmetric and asymmetric problems if the rank-$2$ RIP constant $δ$ is less than $1/2$. Combining with a counterexample with $δ=1/2$, we show that the derived bound is the shar…

    Submitted 21 April, 2021; originally announced April 2021.

  26. arXiv:2012.02427  [pdf, other]

    math.OC

    Stochastic Localization Methods for Convex Discrete Optimization via Simulation

    Authors: Haixiang Zhang, Zeyu Zheng, Javad Lavaei

    Abstract: We develop and analyze a set of new sequential simulation-optimization algorithms for large-scale multi-dimensional discrete optimization via simulation problems with a convexity structure. The "large-scale" notion refers to the fact that the decision variable has a large number of values to choose from on each dimension. The proposed algorithms are targeted to identify a solution that is close to the opti…

    Submitted 18 January, 2022; v1 submitted 4 December, 2020; originally announced December 2020.

  27. arXiv:2010.16250  [pdf, other]

    math.OC

    Gradient-based Algorithms for Convex Discrete Optimization via Simulation

    Authors: Haixiang Zhang, Zeyu Zheng, Javad Lavaei

    Abstract: We propose new sequential simulation-optimization algorithms for general convex optimization via simulation problems with high-dimensional discrete decision space. The performance of each choice of discrete decision variables is evaluated via stochastic simulation replications. If an upper bound on the overall level of uncertainties is known, our proposed simulation-optimization algorithms utilize…

    Submitted 11 February, 2022; v1 submitted 30 October, 2020; originally announced October 2020.

    Comments: Accepted by Operations Research. Title changed from "Discrete Convex Simulation Optimization" to "Gradient-based Algorithms for Convex Discrete Optimization via Simulation"

  28. arXiv:2010.04349  [pdf, other]

    math.OC

    Global and Local Analyses of Nonlinear Low-Rank Matrix Recovery Problems

    Authors: Yingjie Bi, Javad Lavaei

    Abstract: The restricted isometry property (RIP) is a well-known condition that guarantees the absence of spurious local minima in low-rank matrix recovery problems with linear measurements. In this paper, we introduce a novel property named bound difference property (BDP) to study low-rank matrix recovery problems with nonlinear measurements. Using RIP and BDP jointly, we first focus on the rank-1 matrix r…

    Submitted 10 December, 2020; v1 submitted 8 October, 2020; originally announced October 2020.

  29. arXiv:2006.00453  [pdf, ps, other]

    cs.LG math.OC stat.ML

    When Does MAML Objective Have Benign Landscape?

    Authors: Igor Molybog, Javad Lavaei

    Abstract: The paper studies the complexity of the optimization problem behind the Model-Agnostic Meta-Learning (MAML) algorithm. The goal of the study is to determine the global convergence of MAML on sequential decision-making tasks possessing a common structure. We are curious to know when, if at all, the benign landscape of the underlying tasks results in a benign landscape of the corresponding MAML obje…

    Submitted 10 December, 2020; v1 submitted 31 May, 2020; originally announced June 2020.

    Comments: 12 pages, 3 figures

  30. arXiv:2004.14328  [pdf, other]

    math.OC

    Penalized Semidefinite Programming for Quadratically-Constrained Quadratic Optimization

    Authors: Ramtin Madani, Mohsen Kheirandishfard, Javad Lavaei, Alper Atamturk

    Abstract: In this paper, we give a new penalized semidefinite programming approach for non-convex quadratically-constrained quadratic programs (QCQPs). We incorporate penalty terms into the objective of convex relaxations in order to retrieve feasible and near-optimal solutions for non-convex QCQPs. We introduce a generalized linear independence constraint qualification (GLICQ) criterion and prove that any…

    Submitted 29 April, 2020; originally announced April 2020.

  31. arXiv:1912.00561  [pdf, other]

    math.OC eess.SY

    Escaping spurious local minimum trajectories in online time-varying nonconvex optimization

    Authors: Yuhao Ding, Javad Lavaei, Murat Arcak

    Abstract: A major limitation of online algorithms that track the optimizers of time-varying nonconvex optimization problems is that they focus on a specific local minimum trajectory, which may lead to poor spurious local solutions. In this paper, we show that the natural temporal variation may help simple online tracking methods find and track time-varying global minima. To this end, we investigate the prop…

    Submitted 25 January, 2021; v1 submitted 1 December, 2019; originally announced December 2019.

  32. Large-Scale Traffic Signal Offset Optimization

    Authors: Yi Ouyang, Richard Y. Zhang, Javad Lavaei, Pravin Varaiya

    Abstract: The offset optimization problem seeks to coordinate and synchronize the timing of traffic signals throughout a network in order to enhance traffic flow and reduce stops and delays. Recently, offset optimization was formulated into a continuous optimization problem without integer variables by modeling traffic flow as sinusoidal. In this paper, we present a novel algorithm to solve this new formula…

    Submitted 19 November, 2019; originally announced November 2019.

    Journal ref: IEEE Transactions on Control of Network Systems 2020

  33. arXiv:1908.10315  [pdf, other]

    eess.SP stat.AP

    Boundary Defense against Cyber Threat for Power System Operation

    Authors: Ming Jin, Javad Lavaei, Somayeh Sojoudi, Ross Baldick

    Abstract: The operation of power grids is becoming increasingly data-centric. While the abundance of data could improve the efficiency of the system, it poses major reliability challenges. In particular, state estimation aims to learn the behavior of the network from data but an undetected attack on this problem could lead to a large-scale blackout. Nevertheless, understanding vulnerability of state estimat…

    Submitted 4 August, 2019; originally announced August 2019.

  34. arXiv:1905.09937  [pdf, other]

    math.OC

    On the Absence of Spurious Local Trajectories in Time-varying Nonconvex Optimization

    Authors: S. Fattahi, C. Josz, Y. Ding, R. Mohammadi, J. Lavaei, S. Sojoudi

    Abstract: In this paper, we study the landscape of an online nonconvex optimization problem, for which the input data vary over time and the solution is a trajectory rather than a single point. To understand the complexity of finding a global solution of this problem, we introduce the notion of \textit{spurious (i.e., non-global) local trajectory} as a generalization to the notion of spurious local solution…

    Submitted 30 October, 2020; v1 submitted 23 May, 2019; originally announced May 2019.

  35. arXiv:1905.09915  [pdf, other]

    math.OC

    Escaping Locally Optimal Decentralized Control Polices via Damping

    Authors: Han Feng, Javad Lavaei

    Abstract: We study the evolution of locally optimal decentralized controllers with the damping of the control system. Empirically it is shown that even for instances with an exponential number of connected components, damping merges all local solutions to the one global solution. We characterize the evolution of locally optimal solutions with the notion of hemi-continuity and further derive asymptotic prope…

    Submitted 23 May, 2019; originally announced May 2019.

    Comments: 20 pages, 9 figures

  36. arXiv:1903.08634  [pdf, other]

    math.OC

    Aggressive Local Search for Constrained Optimal Control Problems with Many Local Minima

    Authors: Yuhao Ding, Han Feng, Javad Lavaei

    Abstract: This paper is concerned with numerically finding a global solution of constrained optimal control problems with many local minima. The focus is on the optimal decentralized control (ODC) problem, whose feasible set is recently shown to have an exponential number of connected components and consequently an exponential number of local minima. The rich literature of numerical algorithms for nonlinear…

    Submitted 20 March, 2019; originally announced March 2019.

  37. arXiv:1901.01631  [pdf, other]

    cs.LG math.OC stat.ML

    Sharp Restricted Isometry Bounds for the Inexistence of Spurious Local Minima in Nonconvex Matrix Recovery

    Authors: Richard Y. Zhang, Somayeh Sojoudi, Javad Lavaei

    Abstract: Nonconvex matrix recovery is known to contain no spurious local minima under a restricted isometry property (RIP) with a sufficiently small RIP constant $δ$. If $δ$ is too large, however, then counterexamples containing spurious local minima are known to exist. In this paper, we introduce a proof technique that is capable of establishing sharp thresholds on $δ$ to guarantee the inexistence of spur…

    Submitted 13 June, 2019; v1 submitted 6 January, 2019; originally announced January 2019.

    Comments: v2: fixed several typos; v3: accepted at JMLR

    Journal ref: Journal of Machine Learning Research 20 (114): 1-34, 2019

  38. arXiv:1810.11505  [pdf, other]

    eess.SY cs.LG

    Stability-certified reinforcement learning: A control-theoretic perspective

    Authors: Ming Jin, Javad Lavaei

    Abstract: We investigate the important problem of certifying stability of reinforcement learning policies when interconnected with nonlinear dynamical systems. We show that by regulating the input-output gradients of policies, strong guarantees of robust stability can be obtained based on a proposed semidefinite programming feasibility problem. The method is able to certify a large set of stabilizing contro…

    Submitted 26 October, 2018; originally announced October 2018.

  39. arXiv:1805.10251  [pdf, other]

    cs.LG math.OC stat.ML

    How Much Restricted Isometry is Needed In Nonconvex Matrix Recovery?

    Authors: Richard Y. Zhang, Cédric Josz, Somayeh Sojoudi, Javad Lavaei

    Abstract: When the linear measurements of an instance of low-rank matrix recovery satisfy a restricted isometry property (RIP)---i.e. they are approximately norm-preserving---the problem is known to contain no spurious local minima, so exact recovery is guaranteed. In this paper, we show that moderate RIP is not enough to eliminate spurious local minima, so existing results can only hold for near-perfect RI…

    Submitted 30 October, 2018; v1 submitted 25 May, 2018; originally announced May 2018.

    Comments: 32nd Conference on Neural Information Processing Systems (NIPS 2018)

  40. arXiv:1805.08204  [pdf, other]

    math.OC

    A theory on the absence of spurious solutions for nonconvex and nonsmooth optimization

    Authors: Cedric Josz, Yi Ouyang, Richard Y. Zhang, Javad Lavaei, Somayeh Sojoudi

    Abstract: We study the set of continuous functions that admit no spurious local optima (i.e. local minima that are not global minima) which we term \textit{global functions}. They satisfy various powerful properties for analyzing nonconvex and nonsmooth optimization problems. For instance, they satisfy a theorem akin to the fundamental uniform limit theorem in the analysis regarding continuous functions. Gl…

    Submitted 31 October, 2018; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: 22 pages, 13 figures

    MSC Class: 90C26

  41. arXiv:1711.10428  [pdf, other]

    math.OC

    A Bound Strengthening Method for Optimal Transmission Switching in Power Systems

    Authors: Salar Fattahi, Javad Lavaei, Alper Atamturk

    Abstract: This paper studies the optimal transmission switching (OTS) problem for power systems, where certain lines are fixed (uncontrollable) and the remaining ones are controllable via on/off switches. The goal is to identify a topology of the power grid that minimizes the cost of the system operation while satisfying the physical and operational constraints. Most of the existing methods for the problem…

    Submitted 28 November, 2017; originally announced November 2017.

    Report number: BCOL Research Report 17.06, IEOR, University of California-Berkeley

  42. Sparse Semidefinite Programs with Guaranteed Near-Linear Time Complexity via Dualized Clique Tree Conversion

    Authors: Richard Y. Zhang, Javad Lavaei

    Abstract: Clique tree conversion solves large-scale semidefinite programs by splitting an $n\times n$ matrix variable into up to $n$ smaller matrix variables, each representing a principal submatrix of up to $ω\timesω$. Its fundamental weakness is the need to introduce overlap constraints that enforce agreement between different matrix variables, because these can result in dense coupling. In this paper, we…

    Submitted 26 April, 2020; v1 submitted 10 October, 2017; originally announced October 2017.

    Comments: [v1] appeared in IEEE CDC 2018; [v2+] To appear in Mathematical Programming

    Journal ref: Mathematical Programming 2020

  43. arXiv:1704.00133  [pdf, ps, other]

    math.OC

    Conic Relaxations for Power System State Estimation with Line Measurements

    Authors: Yu Zhang, Ramtin Madani, Javad Lavaei

    Abstract: This paper deals with the non-convex power system state estimation (PSSE) problem, which plays a central role in the monitoring and operation of electric power networks. Given a set of noisy measurements, PSSE aims at estimating the vector of complex voltages at all buses of the network. This is a challenging task due to the inherent nonlinearity of power flows, for which existing methods lack gua…

    Submitted 1 April, 2017; originally announced April 2017.

    Comments: Technical report: 14 pages, 5 figures

  44. Modified Interior-Point Method for Large-and-Sparse Low-Rank Semidefinite Programs

    Authors: Richard Y. Zhang, Javad Lavaei

    Abstract: Semidefinite programs (SDPs) are powerful theoretical tools that have been studied for over two decades, but their practical use remains limited due to computational difficulties in solving large-scale, realistic-sized problems. In this paper, we describe a modified interior-point method for the efficient solution of large-and-sparse low-rank SDPs, which finds applications in graph theory, approxi…

    Submitted 5 September, 2017; v1 submitted 31 March, 2017; originally announced March 2017.

    Comments: 8 pages, 2 figures

  45. arXiv:1204.4419  [pdf, ps, other]

    math.OC cs.IT eess.SY

    Geometry of Power Flows and Optimization in Distribution Networks

    Authors: Javad Lavaei, David Tse, Baosen Zhang

    Abstract: We investigate the geometry of injection regions and its relationship to optimization of power flows in tree networks. The injection region is the set of all vectors of bus power injections that satisfy the network and operation constraints. The geometrical object of interest is the set of Pareto-optimal points of the injection region. If the voltage magnitudes are fixed, the injection region of a…

    Submitted 19 August, 2013; v1 submitted 19 April, 2012; originally announced April 2012.

    Comments: To appear in IEEE Transactions on Power Systems

  46. arXiv:1204.1106  [pdf, ps, other]

    math.OC cs.DC eess.SY

    Message Passing for Dynamic Network Energy Management

    Authors: Matt Kraning, Eric Chu, Javad Lavaei, Stephen Boyd

    Abstract: We consider a network of devices, such as generators, fixed loads, deferrable loads, and storage devices, each with its own dynamic constraints and objective, connected by lossy capacitated lines. The problem is to minimize the total network objective subject to the device and line constraints, over a given time horizon. This is a large optimization problem, with variables for consumption or gener…

    Submitted 4 April, 2012; originally announced April 2012.

    Comments: Submitted to IEEE Transactions on Smart Grid