Skip to main content

Showing 1–26 of 26 results for author: Ghadimi, S

.
  1. arXiv:2404.00158  [pdf, ps, other

    math.OC cs.LG

    Fully Zeroth-Order Bilevel Programming via Gaussian Smoothing

    Authors: Alireza Aghasi, Saeed Ghadimi

    Abstract: In this paper, we study and analyze zeroth-order stochastic approximation algorithms for solving bilvel problems, when neither the upper/lower objective values, nor their unbiased gradient estimates are available. In particular, exploiting Stein's identity, we first use Gaussian smoothing to estimate first- and second-order partial derivatives of functions with two independent block of variables.… ▽ More

    Submitted 29 March, 2024; originally announced April 2024.

  2. arXiv:2307.05384  [pdf, other

    math.OC cs.DS cs.LG stat.ML

    Stochastic Nested Compositional Bi-level Optimization for Robust Feature Learning

    Authors: Xuxing Chen, Krishnakumar Balasubramanian, Saeed Ghadimi

    Abstract: We develop and analyze stochastic approximation algorithms for solving nested compositional bi-level optimization problems. These problems involve a nested composition of $T$ potentially non-convex smooth functions in the upper-level, and a smooth and strongly convex function in the lower-level. Our proposed algorithm does not rely on matrix inversions or mini-batches and can achieve an $ε$-statio… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

  3. arXiv:2304.11220  [pdf, other

    cs.CL

    Learn What NOT to Learn: Towards Generative Safety in Chatbots

    Authors: Leila Khalatbari, Ye** Bang, Dan Su, Willy Chung, Saeed Ghadimi, Hossein Sameti, Pascale Fung

    Abstract: Conversational models that are generative and open-domain are particularly susceptible to generating unsafe content since they are trained on web-based social data. Prior approaches to mitigating this issue have drawbacks, such as disrupting the flow of conversation, limited generalization to unseen toxic input contexts, and sacrificing the quality of the dialogue for the sake of safety. In this p… ▽ More

    Submitted 25 April, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

    Comments: 9 pages, 3 tables, 3 figures

  4. arXiv:2302.09766  [pdf, other

    math.OC cs.DC cs.LG stat.ML

    A One-Sample Decentralized Proximal Algorithm for Non-Convex Stochastic Composite Optimization

    Authors: Tesi Xiao, Xuxing Chen, Krishnakumar Balasubramanian, Saeed Ghadimi

    Abstract: We focus on decentralized stochastic non-convex optimization, where $n$ agents work together to optimize a composite objective function which is a sum of a smooth term and a non-smooth convex term. To solve this problem, we propose two single-time scale algorithms: Prox-DASA and Prox-DASA-GT. These algorithms can find $ε$-stationary points in $\mathcal{O}(n^{-1}ε^{-2})$ iterations using constant b… ▽ More

    Submitted 22 June, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: UAI 2023

  5. arXiv:2206.11346  [pdf, other

    math.OC cs.LG stat.ML

    Constrained Stochastic Nonconvex Optimization with State-dependent Markov Data

    Authors: Abhishek Roy, Krishnakumar Balasubramanian, Saeed Ghadimi

    Abstract: We study stochastic optimization algorithms for constrained nonconvex stochastic optimization problems with Markovian data. In particular, we focus on the case when the transition kernel of the Markov chain is state-dependent. Such stochastic optimization problems arise in various machine learning problems including strategic classification and reinforcement learning. For this problem, we study bo… ▽ More

    Submitted 8 November, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: 2 figures

  6. arXiv:2205.13635  [pdf, other

    cs.LG math.OC math.ST

    RIGID: Robust Linear Regression with Missing Data

    Authors: Alireza Aghasi, MohammadJavad Feizollahi, Saeed Ghadimi

    Abstract: We present a robust framework to perform linear regression with missing entries in the features. By considering an elliptical data distribution, and specifically a multivariate normal model, we are able to conditionally formulate a distribution for the missing entries and present a robust framework, which minimizes the worst case error caused by the uncertainty about the missing data. We show that… ▽ More

    Submitted 8 November, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

  7. arXiv:2204.07317  [pdf, other

    math.OC

    Stochastic Search for a Parametric Cost Function Approximation: Energy storage with rolling forecasts

    Authors: Saeed Ghadimi, Warren B. Powell

    Abstract: Rolling forecasts have been almost overlooked in the renewable energy storage literature. In this paper, we provide a new approach for handling uncertainty not just in the accuracy of a forecast, but in the evolution of forecasts over time. Our approach shifts the focus from modeling the uncertainty in a lookahead model to accurate simulations in a stochastic base model. We develop a robust policy… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

  8. arXiv:2202.04296  [pdf, ps, other

    math.OC math.ST stat.ML

    A Projection-free Algorithm for Constrained Stochastic Multi-level Composition Optimization

    Authors: Tesi Xiao, Krishnakumar Balasubramanian, Saeed Ghadimi

    Abstract: We propose a projection-free conditional gradient-type algorithm for smooth stochastic multi-level composition optimization, where the objective function is a nested composition of $T$ functions and the constraint set is a closed convex set. Our algorithm assumes access to noisy evaluations of the functions and their gradients, through a stochastic first-order oracle satisfying certain standard un… ▽ More

    Submitted 9 October, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: To appear in NeurIPS 2022

  9. arXiv:2201.00258  [pdf, other

    math.OC cs.AI

    The Parametric Cost Function Approximation: A new approach for multistage stochastic programming

    Authors: Warren B Powell, Saeed Ghadimi

    Abstract: The most common approaches for solving multistage stochastic programming problems in the research literature have been to either use value functions ("dynamic programming") or scenario trees ("stochastic programming") to approximate the impact of a decision now on the future. By contrast, common industry practice is to use a deterministic approximation of the future which is easier to understand a… ▽ More

    Submitted 1 January, 2022; originally announced January 2022.

    Comments: 3 figures

    MSC Class: 68 ACM Class: F.2; I.2

  10. arXiv:2012.04000  [pdf, other

    eess.IV

    Deep Networks to Automatically Detect Late-activating Regions of the Heart

    Authors: Jiarui Xing, Sona Ghadimi, Mohammad Abdishektaei, Kenneth C. Bilchick, Frederick H. Epstein, Miaomiao Zhang

    Abstract: This paper presents a novel method to automatically identify late-activating regions of the left ventricle from cine Displacement Encoding with Stimulated Echo (DENSE) MR images. We develop a deep learning framework that identifies late mechanical activation in heart failure patients by detecting the Time to the Onset of circumferential Shortening (TOS). In particular, we build a cascade network p… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

  11. arXiv:2009.13016  [pdf, ps, other

    stat.ML cs.LG math.OC math.ST

    Esca** Saddle-Points Faster under Interpolation-like Conditions

    Authors: Abhishek Roy, Krishnakumar Balasubramanian, Saeed Ghadimi, Prasant Mohapatra

    Abstract: In this paper, we show that under over-parametrization several standard stochastic optimization algorithms escape saddle-points and converge to local-minimizers much faster. One of the fundamental aspects of over-parametrized models is that they are capable of interpolating the training data. We show that, under interpolation-like assumptions satisfied by the stochastic gradients in an over-parame… ▽ More

    Submitted 27 September, 2020; originally announced September 2020.

    Comments: To appear in NeurIPS, 2020

  12. arXiv:2008.10526  [pdf, other

    math.OC cs.DS cs.LG math.ST stat.ML

    Stochastic Multi-level Composition Optimization Algorithms with Level-Independent Convergence Rates

    Authors: Krishnakumar Balasubramanian, Saeed Ghadimi, Anthony Nguyen

    Abstract: In this paper, we study smooth stochastic multi-level composition optimization problems, where the objective function is a nested composition of $T$ functions. We assume access to noisy evaluations of the functions and their gradients, through a stochastic first-order oracle. For solving this class of problems, we propose two algorithms using moving-average stochastic estimates, and analyze their… ▽ More

    Submitted 14 February, 2022; v1 submitted 24 August, 2020; originally announced August 2020.

    Comments: Fixed some typos

  13. arXiv:2006.08167  [pdf, other

    math.OC cs.LG stat.ML

    Improved Complexities for Stochastic Conditional Gradient Methods under Interpolation-like Conditions

    Authors: Tesi Xiao, Krishnakumar Balasubramanian, Saeed Ghadimi

    Abstract: We analyze stochastic conditional gradient methods for constrained optimization problems arising in over-parametrized machine learning. We show that one could leverage the interpolation-like conditions satisfied by such models to obtain improved oracle complexities. Specifically, when the objective function is convex, we show that the conditional gradient method requires $\mathcal{O}(ε^{-2})$ call… ▽ More

    Submitted 26 January, 2022; v1 submitted 15 June, 2020; originally announced June 2020.

  14. arXiv:2001.00831  [pdf, other

    math.OC

    Reinforcement Learning via Parametric Cost Function Approximation for Multistage Stochastic Programming

    Authors: Saeed Ghadimi, Raymond T. Perkins, Warren B. Powell

    Abstract: The most common approaches for solving stochastic resource allocation problems in the research literature is to either use value functions ("dynamic programming") or scenario trees ("stochastic programming") to approximate the impact of a decision now on the future. By contrast, common industry practice is to use a deterministic approximation of the future which is easier to understand and solve,… ▽ More

    Submitted 2 January, 2020; originally announced January 2020.

    Comments: arXiv admin note: text overlap with arXiv:1703.04644

  15. arXiv:1910.03591  [pdf, other

    quant-ph physics.app-ph

    Robust and efficient algorithms for high-dimensional black-box quantum optimization

    Authors: Zhaoqi Leng, Pranav Mundada, Saeed Ghadimi, Andrew Houck

    Abstract: Hybrid quantum-classical optimization using near-term quantum technology is an emerging direction for exploring quantum advantage in high-dimensional systems. However, precise characterization of all experimental parameters is often impractical and challenging. A viable approach is to use algorithms that rely only on black-box inference rather than analytical gradients. Here, we combine randomized… ▽ More

    Submitted 10 October, 2019; v1 submitted 8 October, 2019; originally announced October 2019.

  16. arXiv:1907.13616  [pdf, ps, other

    stat.ML cs.DS cs.LG math.OC math.ST

    Multi-Point Bandit Algorithms for Nonstationary Online Nonconvex Optimization

    Authors: Abhishek Roy, Krishnakumar Balasubramanian, Saeed Ghadimi, Prasant Mohapatra

    Abstract: Bandit algorithms have been predominantly analyzed in the convex setting with function-value based stationary regret as the performance measure. In this paper, motivated by online reinforcement learning problems, we propose and analyze bandit algorithms for both general and structured nonconvex problems with nonstationary (or dynamic) regret as the performance measure, in both stochastic and non-s… ▽ More

    Submitted 11 September, 2019; v1 submitted 31 July, 2019; originally announced July 2019.

  17. arXiv:1902.01373  [pdf, ps, other

    math.ST math.OC stat.ML

    Stochastic Zeroth-order Discretizations of Langevin Diffusions for Bayesian Inference

    Authors: Abhishek Roy, Lingqing Shen, Krishnakumar Balasubramanian, Saeed Ghadimi

    Abstract: Discretizations of Langevin diffusions provide a powerful method for sampling and Bayesian inference. However, such discretizations require evaluation of the gradient of the potential function. In several real-world scenarios, obtaining gradient evaluations might either be computationally expensive, or simply impossible. In this work, we propose and analyze stochastic zeroth-order sampling algorit… ▽ More

    Submitted 17 January, 2021; v1 submitted 4 February, 2019; originally announced February 2019.

  18. arXiv:1812.01094  [pdf, ps, other

    math.OC

    A Single Time-Scale Stochastic Approximation Method for Nested Stochastic Optimization

    Authors: Saeed Ghadimi, Andrzej Ruszczyński, Mengdi Wang

    Abstract: We study constrained nested stochastic optimization problems in which the objective function is a composition of two smooth functions whose exact values and derivatives are not available. We propose a single time-scale stochastic approximation algorithm, which we call the Nested Averaged Stochastic Approximation (NASA), to find an approximate stationary point of the problem. The algorithm has two… ▽ More

    Submitted 6 September, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

  19. arXiv:1809.06474  [pdf, ps, other

    math.OC cs.DS cs.LG math.ST stat.ML

    Zeroth-order Nonconvex Stochastic Optimization: Handling Constraints, High-Dimensionality and Saddle-Points

    Authors: Krishnakumar Balasubramanian, Saeed Ghadimi

    Abstract: In this paper, we propose and analyze zeroth-order stochastic approximation algorithms for nonconvex and convex optimization, with a focus on addressing constrained optimization, high-dimensional setting and saddle-point avoiding. To handle constrained optimization, we first propose generalizations of the conditional gradient algorithm achieving rates similar to the standard stochastic gradient al… ▽ More

    Submitted 13 January, 2019; v1 submitted 17 September, 2018; originally announced September 2018.

  20. arXiv:1802.02246  [pdf, ps, other

    math.OC

    Approximation Methods for Bilevel Programming

    Authors: Saeed Ghadimi, Mengdi Wang

    Abstract: In this paper, we study a class of bilevel programming problem where the inner objective function is strongly convex. More specifically, under some mile assumptions on the partial derivatives of both inner and outer objective functions, we present an approximation algorithm for solving this class of problem and provide its finite-time convergence analysis under different convexity assumption on th… ▽ More

    Submitted 6 February, 2018; originally announced February 2018.

  21. arXiv:1710.05782  [pdf, other

    math.OC

    Second-Order Methods with Cubic Regularization Under Inexact Information

    Authors: Saeed Ghadimi, Han Liu, Tong Zhang

    Abstract: In this paper, we generalize (accelerated) Newton's method with cubic regularization under inexact second-order information for (strongly) convex optimization problems. Under mild assumptions, we provide global rate of convergence of these methods and show the explicit dependence of the rate of convergence on the problem parameters. While the complexity bounds of our presented algorithms are theor… ▽ More

    Submitted 16 October, 2017; originally announced October 2017.

  22. arXiv:1602.00961  [pdf, ps, other

    math.OC

    Conditional gradient type methods for composite nonlinear and stochastic optimization

    Authors: Saeed Ghadimi

    Abstract: In this paper, we present a conditional gradient type (CGT) method for solving a class of composite optimization problems where the objective function consists of a (weakly) smooth term and a (strongly) convex regularization term. While including a strongly convex term in the subproblems of the classical conditional gradient (CG) method improves its rate of convergence, it does not cost per iterat… ▽ More

    Submitted 1 January, 2018; v1 submitted 2 February, 2016; originally announced February 2016.

  23. arXiv:1508.07384  [pdf, ps, other

    math.OC stat.ML

    Generalized Uniformly Optimal Methods for Nonlinear Programming

    Authors: Saeed Ghadimi, Guanghui Lan, Hongchao Zhang

    Abstract: In this paper, we present a generic framework to extend existing uniformly optimal convex programming algorithms to solve more general nonlinear, possibly nonconvex, optimization problems. The basic idea is to incorporate a local search step (gradient descent or Quasi-Newton iteration) into these uniformly optimal convex programming methods, and then enforce a monotone decreasing property of the f… ▽ More

    Submitted 12 September, 2015; v1 submitted 28 August, 2015; originally announced August 2015.

  24. arXiv:1310.3787  [pdf, ps, other

    math.OC

    Accelerated Gradient Methods for Nonconvex Nonlinear and Stochastic Programming

    Authors: Saeed Ghadimi, Guanghui Lan

    Abstract: In this paper, we generalize the well-known Nesterov's accelerated gradient (AG) method, originally designed for convex smooth optimization, to solve nonconvex and possibly stochastic optimization problems. We demonstrate that by properly specifying the stepsize policy, the AG method exhibits the best known rate of convergence for solving general nonconvex smooth optimization problems by using fir… ▽ More

    Submitted 14 October, 2013; originally announced October 2013.

  25. arXiv:1309.5549  [pdf, ps, other

    math.OC cs.CC stat.ML

    Stochastic First- and Zeroth-order Methods for Nonconvex Stochastic Programming

    Authors: Saeed Ghadimi, Guanghui Lan

    Abstract: In this paper, we introduce a new stochastic approximation (SA) type algorithm, namely the randomized stochastic gradient (RSG) method, for solving an important class of nonlinear (possibly nonconvex) stochastic programming (SP) problems. We establish the complexity of this method for computing an approximate stationary point of a nonlinear programming problem. We also show that this method posses… ▽ More

    Submitted 21 September, 2013; originally announced September 2013.

  26. arXiv:1308.6594  [pdf, ps, other

    math.OC

    Mini-batch Stochastic Approximation Methods for Nonconvex Stochastic Composite Optimization

    Authors: Saeed Ghadimi, Guanghui Lan, Hongchao Zhang

    Abstract: This paper considers a class of constrained stochastic composite optimization problems whose objective function is given by the summation of a differentiable (possibly nonconvex) component, together with a certain non-differentiable (but convex) component. In order to solve these problems, we propose a randomized stochastic projected gradient (RSPG) algorithm, in which proper mini-batch of samples… ▽ More

    Submitted 5 September, 2013; v1 submitted 29 August, 2013; originally announced August 2013.

    Comments: 32 pages