Skip to main content

Showing 1–10 of 10 results for author: Thekumparampil, K K

Searching in archive math. Search in all archives.
.
  1. arXiv:2403.05054  [pdf, other

    math.OC cs.LG

    A Sinkhorn-type Algorithm for Constrained Optimal Transport

    Authors: Xun Tang, Holakou Rahmanian, Michael Shavlovsky, Kiran Koshy Thekumparampil, Tesi Xiao, Lexing Ying

    Abstract: Entropic optimal transport (OT) and the Sinkhorn algorithm have made it practical for machine learning practitioners to perform the fundamental task of calculating transport distance between statistical distributions. In this work, we focus on a general class of OT problems under a combination of equality and inequality constraints. We derive the corresponding entropy regularization formulation an… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  2. arXiv:2401.12253  [pdf, other

    math.OC cs.LG stat.ML

    Accelerating Sinkhorn Algorithm with Sparse Newton Iterations

    Authors: Xun Tang, Michael Shavlovsky, Holakou Rahmanian, Elisa Tardini, Kiran Koshy Thekumparampil, Tesi Xiao, Lexing Ying

    Abstract: Computing the optimal transport distance between statistical distributions is a fundamental task in machine learning. One remarkable recent advancement is entropic regularization and the Sinkhorn algorithm, which utilizes only matrix scaling and guarantees an approximated solution with near-linear runtime. Despite the success of the Sinkhorn algorithm, its runtime may still be slow due to the pote… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: In ICLR 2024

  3. arXiv:2310.09639  [pdf, other

    cs.LG cs.CR math.OC stat.ML

    DPZero: Private Fine-Tuning of Language Models without Backpropagation

    Authors: Liang Zhang, Bingcong Li, Kiran Koshy Thekumparampil, Sewoong Oh, Niao He

    Abstract: The widespread practice of fine-tuning large language models (LLMs) on domain-specific data faces two major challenges in memory and privacy. First, as the size of LLMs continues to grow, the memory demands of gradient-based training methods via backpropagation become prohibitively high. Second, given the tendency of LLMs to memorize training data, it is important to protect potentially sensitive… ▽ More

    Submitted 6 June, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: ICML 2024

  4. arXiv:2206.00363  [pdf, ps, other

    cs.LG cs.CR math.OC stat.ML

    Bring Your Own Algorithm for Optimal Differentially Private Stochastic Minimax Optimization

    Authors: Liang Zhang, Kiran Koshy Thekumparampil, Sewoong Oh, Niao He

    Abstract: We study differentially private (DP) algorithms for smooth stochastic minimax optimization, with stochastic minimization as a byproduct. The holy grail of these settings is to guarantee the optimal trade-off between the privacy and the excess population loss, using an algorithm with a linear time-complexity in the number of training samples. We provide a general framework for solving differentiall… ▽ More

    Submitted 19 October, 2022; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022

  5. arXiv:2201.07427  [pdf, other

    math.OC cs.LG stat.ML

    Lifted Primal-Dual Method for Bilinearly Coupled Smooth Minimax Optimization

    Authors: Kiran Koshy Thekumparampil, Niao He, Sewoong Oh

    Abstract: We study the bilinearly coupled minimax problem: $\min_{x} \max_{y} f(x) + y^\top A x - h(y)$, where $f$ and $h$ are both strongly convex smooth functions and admit first-order gradient oracles. Surprisingly, no known first-order algorithms have hitherto achieved the lower complexity bound of… ▽ More

    Submitted 19 January, 2022; originally announced January 2022.

    Comments: Submitted for review on Oct 15, 2021. Accepted to AISTATS 2022 on Jan 18, 2022

  6. arXiv:2108.06869  [pdf, other

    cs.LG cs.DC math.OC

    FedChain: Chained Algorithms for Near-Optimal Communication Cost in Federated Learning

    Authors: Charlie Hou, Kiran K. Thekumparampil, Giulia Fanti, Sewoong Oh

    Abstract: Federated learning (FL) aims to minimize the communication complexity of training a model over heterogeneous data distributed across many clients. A common approach is local methods, where clients take multiple optimization steps over local data before communicating with the server (e.g., FedAvg). Local methods can exploit similarity between clients' data. However, in existing analyses, this comes… ▽ More

    Submitted 16 April, 2023; v1 submitted 15 August, 2021; originally announced August 2021.

    Comments: abstract typo correction

  7. arXiv:2105.08306  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Sample Efficient Linear Meta-Learning by Alternating Minimization

    Authors: Kiran Koshy Thekumparampil, Prateek Jain, Praneeth Netrapalli, Sewoong Oh

    Abstract: Meta-learning synthesizes and leverages the knowledge from a given set of tasks to rapidly learn new tasks using very little data. Meta-learning of linear regression tasks, where the regressors lie in a low-dimensional subspace, is an extensively-studied fundamental problem in this domain. However, existing results either guarantee highly suboptimal estimation errors, or require $Ω(d)$ samples per… ▽ More

    Submitted 18 May, 2021; originally announced May 2021.

  8. arXiv:2102.06333  [pdf, other

    cs.LG cs.DC math.OC

    Efficient Algorithms for Federated Saddle Point Optimization

    Authors: Charlie Hou, Kiran K. Thekumparampil, Giulia Fanti, Sewoong Oh

    Abstract: We consider strongly convex-concave minimax problems in the federated setting, where the communication constraint is the main bottleneck. When clients are arbitrarily heterogeneous, a simple Minibatch Mirror-prox achieves the best performance. As the clients become more homogeneous, using multiple local gradient updates at the clients significantly improves upon Minibatch Mirror-prox by communicat… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  9. arXiv:2010.01848  [pdf, other

    math.OC cs.LG stat.ML

    Projection Efficient Subgradient Method and Optimal Nonsmooth Frank-Wolfe Method

    Authors: Kiran Koshy Thekumparampil, Prateek Jain, Praneeth Netrapalli, Sewoong Oh

    Abstract: We consider the classical setting of optimizing a nonsmooth Lipschitz continuous convex function over a convex constraint set, when having access to a (stochastic) first-order oracle (FO) for the function and a projection oracle (PO) for the constraint set. It is well known that to achieve $ε$-suboptimality in high-dimensions, $Θ(ε^{-2})$ FO calls are necessary. This is achieved by the projected s… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

  10. arXiv:1907.01543  [pdf, other

    math.OC cs.LG stat.ML

    Efficient Algorithms for Smooth Minimax Optimization

    Authors: Kiran Koshy Thekumparampil, Prateek Jain, Praneeth Netrapalli, Sewoong Oh

    Abstract: This paper studies first order methods for solving smooth minimax optimization problems $\min_x \max_y g(x,y)$ where $g(\cdot,\cdot)$ is smooth and $g(x,\cdot)$ is concave for each $x$. In terms of $g(\cdot,y)$, we consider two settings -- strongly convex and nonconvex -- and improve upon the best known rates in both. For strongly-convex $g(\cdot, y),\ \forall y$, we propose a new algorithm combin… ▽ More

    Submitted 2 July, 2019; originally announced July 2019.