Skip to main content

Showing 1–50 of 57 results for author: Ryu, E

.
  1. arXiv:2405.03958  [pdf, other

    cs.CV cs.AI cs.LG

    Simple Drop-in LoRA Conditioning on Attention Layers Will Improve Your Diffusion Model

    Authors: Joo Young Choi, Jaesung R. Park, Inkyu Park, Jaewoong Cho, Albert No, Ernest K. Ryu

    Abstract: Current state-of-the-art diffusion models employ U-Net architectures containing convolutional and (qkv) self-attention layers. The U-Net processes images while being conditioned on the time embedding input for each sampling step and the class or caption embedding input corresponding to the desired conditional generation. Such conditioning involves scale-and-shift operations to the convolutional la… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  2. arXiv:2404.13228  [pdf, other

    math.OC

    Optimal Acceleration for Minimax and Fixed-Point Problems is Not Unique

    Authors: TaeHo Yoon, Jaeyeon Kim, Jaewook J. Suh, Ernest K. Ryu

    Abstract: Recently, accelerated algorithms using the anchoring mechanism for minimax optimization and fixed-point problems have been proposed, and matching complexity lower bounds establish their optimality. In this work, we present the surprising observation that the optimal acceleration mechanism in minimax optimization and fixed-point problems is not unique. Our new algorithms achieve exactly the same wo… ▽ More

    Submitted 23 April, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

  3. arXiv:2403.17199  [pdf, other

    cs.CL

    Extracting Social Support and Social Isolation Information from Clinical Psychiatry Notes: Comparing a Rule-based NLP System and a Large Language Model

    Authors: Braja Gopal Patra, Lauren A. Lepow, Praneet Kasi Reddy Jagadeesh Kumar, Veer Vekaria, Mohit Manoj Sharma, Prakash Adekkanattu, Brian Fennessy, Gavin Hynes, Isotta Landi, Jorge A. Sanchez-Ruiz, Euijung Ryu, Joanna M. Biernacka, Girish N. Nadkarni, Ardesheer Talati, Myrna Weissman, Mark Olfson, J. John Mann, Alexander W. Charney, Jyotishman Pathak

    Abstract: Background: Social support (SS) and social isolation (SI) are social determinants of health (SDOH) associated with psychiatric outcomes. In electronic health records (EHRs), individual-level SS/SI is typically documented as narrative clinical notes rather than structured coded data. Natural language processing (NLP) algorithms can automate the otherwise labor-intensive process of data extraction.… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 2 figures, 3 tables

  4. arXiv:2403.04616  [pdf, other

    cs.GT

    Modeling reputation-based behavioral biases in school choice

    Authors: Jon Kleinberg, Sigal Oren, Emily Ryu, Éva Tardos

    Abstract: A fundamental component in the theoretical school choice literature is the problem a student faces in deciding which schools to apply to. Recent models have considered a set of schools of different selectiveness and a student who is unsure of their strength and can apply to at most $k$ schools. Such models assume that the student cares solely about maximizing the quality of the school that they at… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 22 pages, 8 figures

  5. arXiv:2403.03937  [pdf, ps, other

    cs.GT

    Settling the Competition Complexity of Additive Buyers over Independent Items

    Authors: Mahsa Derakhshan, Emily Ryu, S. Matthew Weinberg, Eric Xue

    Abstract: The competition complexity of an auction setting is the number of additional bidders needed such that the simple mechanism of selling items separately (with additional bidders) achieves greater revenue than the optimal but complex (randomized, prior-dependent, Bayesian-truthful) optimal mechanism without the additional bidders. Our main result settles the competition complexity of $n$ bidders with… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 50 pages

  6. arXiv:2402.11867  [pdf, other

    cs.LG math.OC

    LoRA Training in the NTK Regime has No Spurious Local Minima

    Authors: Uijeong Jang, Jason D. Lee, Ernest K. Ryu

    Abstract: Low-rank adaptation (LoRA) has become the standard approach for parameter-efficient fine-tuning of large language models (LLM), but our theoretical understanding of LoRA has been limited. In this work, we theoretically analyze LoRA fine-tuning in the neural tangent kernel (NTK) regime with $N$ data points, showing: (i) full fine-tuning (without LoRA) admits a low-rank solution of rank… ▽ More

    Submitted 28 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: 23 pages

  7. arXiv:2311.17296  [pdf, other

    math.OC

    Mirror Duality in Convex Optimization

    Authors: Jaeyeon Kim, Chanwoo Park, Asuman Ozdaglar, Jelena Diakonikolas, Ernest K. Ryu

    Abstract: While first-order optimization methods are usually designed to efficiently reduce the function value $f(x)$, there has been recent interest in methods efficiently reducing the magnitude of $\nabla f(x)$, and the findings show that the two types of methods exhibit a certain symmetry. In this work, we present mirror duality, a one-to-one correspondence between mirror-descent-type methods reducing fu… ▽ More

    Submitted 15 May, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

  8. arXiv:2310.18297  [pdf, other

    cs.CV cs.AI

    Image Clustering Conditioned on Text Criteria

    Authors: Sehyun Kwon, Jaeseung Park, Minkyu Kim, Jaewoong Cho, Ernest K. Ryu, Kangwook Lee

    Abstract: Classical clustering methods do not provide users with direct control of the clustering results, and the clustering results may not be consistent with the relevant criterion that a user has in mind. In this work, we present a new methodology for performing image clustering based on user-specified text criteria by leveraging modern vision-language models and large language models. We call our metho… ▽ More

    Submitted 21 February, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

  9. arXiv:2307.02770  [pdf, other

    cs.CV cs.AI

    Censored Sampling of Diffusion Models Using 3 Minutes of Human Feedback

    Authors: TaeHo Yoon, Kibeom Myoung, Keon Lee, Jaewoong Cho, Albert No, Ernest K. Ryu

    Abstract: Diffusion models have recently shown remarkable success in high-quality image generation. Sometimes, however, a pre-trained diffusion model exhibits partial misalignment in the sense that the model can generate good images, but it sometimes outputs undesirable images. If so, we simply need to prevent the generation of the bad images, and we call this task censoring. In this work, we present censor… ▽ More

    Submitted 30 October, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: Published in NeurIPS 2023

  10. arXiv:2305.16569  [pdf, ps, other

    cs.LG math.OC

    Accelerating Value Iteration with Anchoring

    Authors: Jongmin Lee, Ernest K. Ryu

    Abstract: Value Iteration (VI) is foundational to the theory and practice of modern reinforcement learning, and it is known to converge at a $\mathcal{O}(γ^k)$-rate, where $γ$ is the discount factor. Surprisingly, however, the optimal rate for the VI setup was not known, and finding a general acceleration mechanism has been an open problem. In this paper, we present the first accelerated VI for both the Bel… ▽ More

    Submitted 28 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Journal ref: Neural Information Processing System 2023

  11. arXiv:2305.15704  [pdf, ps, other

    math.OC

    Computer-Assisted Design of Accelerated Composite Optimization Methods: OptISTA

    Authors: Uijeong Jang, Shuvomoy Das Gupta, Ernest K. Ryu

    Abstract: The accelerated composite optimization method FISTA (Beck, Teboulle 2009) is suboptimal, and we present a new method OptISTA that improves upon it by a factor of 2. The performance estimation problem (PEP) has recently been introduced as a new computer-assisted paradigm for designing optimal first-order methods, but the methodology was largely limited to unconstrained optimization with a single fu… ▽ More

    Submitted 1 November, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 54 pages. There are two major changes. 1. In our prior submission, the method termed "OPM" was identified as existing work. Consequently, we have made appropriate modifications to the manuscript. 2. The proof for Theorem 1 has been replaced with an alternative proof (Section 2.2 and Appendix C), which we believe is more intuitive

  12. arXiv:2305.12211  [pdf, other

    math.OC

    Coordinate-Update Algorithms can Efficiently Detect Infeasible Optimization Problems

    Authors: **hee Paeng, Jisun Park, Ernest K. Ryu

    Abstract: Coordinate update/descent algorithms are widely used in large-scale optimization due to their low per-iteration cost and scalability, but their behavior on infeasible or misspecified problems has not been much studied compared to the algorithms that use full updates. For coordinate-update methods to be as widely adopted to the extent so that they can be used as engines of general-purpose solvers,… ▽ More

    Submitted 19 November, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

  13. arXiv:2305.06628  [pdf, ps, other

    math.OC

    Time-Reversed Dissipation Induces Duality Between Minimizing Gradient Norm and Function Value

    Authors: Jaeyeon Kim, Asuman Ozdaglar, Chanwoo Park, Ernest K. Ryu

    Abstract: In convex optimization, first-order optimization methods efficiently minimizing function values have been a central subject study since Nesterov's seminal work of 1983. Recently, however, Kim and Fessler's OGM-G and Lee et al.'s FISTA-G have been presented as alternatives that efficiently minimize the gradient magnitude instead. In this paper, we present H-duality, which represents a surprising on… ▽ More

    Submitted 31 October, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  14. arXiv:2304.13995  [pdf, other

    cs.CV cs.AI

    Rotation and Translation Invariant Representation Learning with Implicit Neural Representations

    Authors: Sehyun Kwon, Joo Young Choi, Ernest K. Ryu

    Abstract: In many computer vision applications, images are acquired with arbitrary or random rotations and translations, and in such setups, it is desirable to obtain semantic representations disentangled from the image orientation. Examples of such applications include semiconductor wafer defect inspection, plankton microscope images, and inference on single-particle cryo-electron microscopy (cryo-EM) micr… ▽ More

    Submitted 12 June, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

  15. arXiv:2304.00771  [pdf, other

    math.OC

    Continuous-time Analysis of Anchor Acceleration

    Authors: Jaewook J. Suh, Jisun Park, Ernest K. Ryu

    Abstract: Recently, the anchor acceleration, an acceleration mechanism distinct from Nesterov's, has been discovered for minimax optimization and fixed-point problems, but its mechanism is not understood well, much less so than Nesterov acceleration. In this work, we analyze continuous-time models of anchor acceleration. We provide tight, unified analyses for characterizing the convergence rate as a functio… ▽ More

    Submitted 2 November, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

  16. arXiv:2303.15876  [pdf, other

    math.OC

    Accelerated Infeasibility Detection of Constrained Optimization and Fixed-Point Iterations

    Authors: Jisun Park, Ernest K. Ryu

    Abstract: As first-order optimization methods become the method of choice for solving large-scale optimization problems, optimization solvers based on first-order algorithms are being built. Such general-purpose solvers must robustly detect infeasible or misspecified problem instances, but the computational complexity of first-order methods for doing so has yet to be formally studied. In this work, we chara… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

  17. arXiv:2302.03239  [pdf, ps, other

    cs.DS cs.GT cs.SI

    Calibrated Recommendations for Users with Decaying Attention

    Authors: Jon Kleinberg, Emily Ryu, Éva Tardos

    Abstract: Recommendation systems capable of providing diverse sets of results are a focus of increasing importance, with motivations ranging from fairness to novelty and other aspects of optimizing user experience. One form of diversity of recent interest is calibration, the notion that personalized recommendations should reflect the full distribution of a user's interests, rather than a single predominant… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: 24 pages, 1 figure. This paper incorporates and supersedes our earlier paper arXiv:2203.00233

  18. arXiv:2211.15604  [pdf, other

    math.OC

    Convergence Analyses of Davis-Yin Splitting via Scaled Relative Graphs II: Convex Optimization Problems

    Authors: Soheun Yi, Ernest K. Ryu

    Abstract: The prior work of [arXiv:2207.04015, 2022] used scaled relative graphs (SRG) to analyze the convergence of Davis-Yin splitting (DYS) iterations on monotone inclusion problems. In this work, we use this machinery to analyze DYS iterations on convex optimization problems and obtain state-of-the-art linear convergence rates.

    Submitted 28 November, 2022; originally announced November 2022.

  19. arXiv:2207.04015  [pdf, other

    math.OC

    Convergence Analyses of Davis-Yin Splitting via Scaled Relative Graphs

    Authors: Jongmin Lee, Soheun Yi, Ernest K. Ryu

    Abstract: Davis-Yin splitting (DYS) has found a wide range of applications in optimization, but its linear rates of convergence have not been studied extensively. The scaled relative graph (SRG) simplifies the convergence analysis of operator splitting methods by map** the action of the operator onto the complex plane, but the prior SRG theory did not fully apply to the DYS operator. In this work, we form… ▽ More

    Submitted 21 April, 2024; v1 submitted 8 July, 2022; originally announced July 2022.

  20. arXiv:2205.11093  [pdf, other

    math.OC

    Accelerated Minimax Algorithms Flock Together

    Authors: TaeHo Yoon, Ernest K. Ryu

    Abstract: Several new accelerated methods in minimax optimization and fixed-point iterations have recently been discovered, and, interestingly, they rely on a mechanism distinct from Nesterov's momentum-based acceleration. In this work, we show that these accelerated algorithms exhibit what we call the merging path (MP) property; the trajectories of these algorithms merge quickly. Using this novel MP proper… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

  21. arXiv:2203.07305  [pdf, other

    math.OC

    Branch-and-Bound Performance Estimation Programming: A Unified Methodology for Constructing Optimal Optimization Methods

    Authors: Shuvomoy Das Gupta, Bart P. G. Van Parys, Ernest K. Ryu

    Abstract: We present the Branch-and-Bound Performance Estimation Programming (BnB-PEP), a unified methodology for constructing optimal first-order methods for convex and nonconvex optimization. BnB-PEP poses the problem of finding the optimal optimization method as a nonconvex but practically tractable quadratically constrained quadratic optimization problem and solves it to certifiable global optimality us… ▽ More

    Submitted 8 June, 2023; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: Published in Mathematical Programming Series A

  22. arXiv:2203.00233  [pdf, ps, other

    cs.DS cs.GT cs.SI

    Ordered Submodularity and its Applications to Diversifying Recommendations

    Authors: Jon Kleinberg, Emily Ryu, Éva Tardos

    Abstract: A fundamental task underlying many important optimization problems, from influence maximization to sensor placement to content recommendation, is to select the optimal group of $k$ items from a larger set. Submodularity has been very effective in allowing approximation algorithms for such subset selection problems. However, in several applications, we are interested not only in the elements of a s… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

    Comments: 17 pages

  23. arXiv:2202.11910  [pdf, other

    cs.LG

    Robust Probabilistic Time Series Forecasting

    Authors: TaeHo Yoon, Youngsuk Park, Ernest K. Ryu, Yuyang Wang

    Abstract: Probabilistic time series forecasting has played critical role in decision-making processes due to its capability to quantify uncertainties. Deep forecasting models, however, could be prone to input perturbations, and the notion of such perturbations, together with that of robustness, has not even been completely established in the regime of probabilistic forecasting. In this work, we propose a fr… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: AISTATS 2022 camera ready version

  24. arXiv:2202.05501  [pdf, ps, other

    math.OC

    Continuous-Time Analysis of Accelerated Gradient Methods via Conservation Laws in Dilated Coordinate Systems

    Authors: Jaewook J. Suh, Gyumin Roh, Ernest K. Ryu

    Abstract: We analyze continuous-time models of accelerated gradient methods through deriving conservation laws in dilated coordinate systems. Namely, instead of analyzing the dynamics of $X(t)$, we analyze the dynamics of $W(t)=t^α(X(t)-X_c)$ for some $α$ and $X_c$ and derive a conserved quantity, analogous to physical energy, in this dilated coordinate system. Through this methodology, we recover many know… ▽ More

    Submitted 24 June, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

  25. arXiv:2202.02981  [pdf, other

    cs.LG math.OC stat.ML

    Neural Tangent Kernel Analysis of Deep Narrow Neural Networks

    Authors: Jongmin Lee, Joo Young Choi, Ernest K. Ryu, Albert No

    Abstract: The tremendous recent progress in analyzing the training dynamics of overparameterized neural networks has primarily focused on wide networks and therefore does not sufficiently address the role of depth in deep learning. In this work, we present the first trainability guarantee of infinitely deep but narrow neural networks. We study the infinite-depth limit of a multilayer perceptron (MLP) with a… ▽ More

    Submitted 27 June, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Journal ref: Published in International Conference on Machine Learning, 2022

  26. arXiv:2201.11413  [pdf, other

    math.OC

    Exact Optimal Accelerated Complexity for Fixed-Point Iterations

    Authors: Jisun Park, Ernest K. Ryu

    Abstract: Despite the broad use of fixed-point iterations throughout applied mathematics, the optimal convergence rate of general fixed-point problems with nonexpansive nonlinear operators has not been established. This work presents an acceleration mechanism for fixed-point iterations with nonexpansive operators, contractive operators, and nonexpansive operators satisfying a Hölder-type growth condition. W… ▽ More

    Submitted 27 June, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: ICML 2022 Long Talk

  27. arXiv:2201.09077  [pdf, other

    cs.CV cs.AI

    LTC-GIF: Attracting More Clicks on Feature-length Sports Videos

    Authors: Ghulam Mujtaba, Jaehyuk Choi, Eun-Seok Ryu

    Abstract: This paper proposes a lightweight method to attract users and increase views of the video by presenting personalized artistic media -- i.e, static thumbnails and animated GIFs. This method analyzes lightweight thumbnail containers (LTC) using computational resources of the client device to recognize personalized events from full-length sports videos. In addition, instead of processing the entire v… ▽ More

    Submitted 22 January, 2022; originally announced January 2022.

  28. LTC-SUM: Lightweight Client-driven Personalized Video Summarization Framework Using 2D CNN

    Authors: Ghulam Mujtaba, Adeel Malik, Eun-Seok Ryu

    Abstract: This paper proposes a novel lightweight thumbnail container-based summarization (LTC-SUM) framework for full feature-length videos. This framework generates a personalized keyshot summary for concurrent users by using the computational resource of the end-user device. State-of-the-art methods that acquire and process entire video data to generate video summaries are highly computationally intensiv… ▽ More

    Submitted 4 October, 2022; v1 submitted 22 January, 2022; originally announced January 2022.

    Comments: 14

    Journal ref: in IEEE Access, vol. 10, pp. 103041-103055, 2022

  29. arXiv:2112.09379  [pdf

    cs.CV

    Enhanced Frame and Event-Based Simulator and Event-Based Video Interpolation Network

    Authors: Adam Radomski, Andreas Georgiou, Thomas Debrunner, Chenghan Li, Luca Longinotti, Minwon Seo, Moosung Kwak, Chang-Woo Shin, Paul K. J. Park, Hyunsurk Eric Ryu, Kynan Eng

    Abstract: Fast neuromorphic event-based vision sensors (Dynamic Vision Sensor, DVS) can be combined with slower conventional frame-based sensors to enable higher-quality inter-frame interpolation than traditional methods relying on fixed motion approximations using e.g. optical flow. In this work we present a new, advanced event simulator that can produce realistic scenes recorded by a camera rig with an ar… ▽ More

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: 10 pages, 19 figures

  30. arXiv:2110.11035  [pdf, ps, other

    math.OC

    Optimal First-Order Algorithms as a Function of Inequalities

    Authors: Chanwoo Park, Ernest K. Ryu

    Abstract: In this work, we present a novel algorithm design methodology that finds the optimal algorithm as a function of inequalities. Specifically, we restrict convergence analyses of algorithms to use a prespecified subset of inequalities, rather than utilizing all true inequalities, and find the optimal algorithm subject to this restriction. This methodology allows us to design algorithms with certain d… ▽ More

    Submitted 21 March, 2024; v1 submitted 21 October, 2021; originally announced October 2021.

  31. arXiv:2106.10439  [pdf, other

    math.OC

    A Geometric Structure of Acceleration and Its Role in Making Gradients Small Fast

    Authors: Jongmin Lee, Chanwoo Park, Ernest K. Ryu

    Abstract: Since Nesterov's seminal 1983 work, many accelerated first-order optimization methods have been proposed, but their analyses lacks a common unifying structure. In this work, we identify a geometric structure satisfied by a wide range of first-order accelerated methods. Using this geometric insight, we present several novel generalizations of accelerated methods. Most interesting among them is a me… ▽ More

    Submitted 4 November, 2021; v1 submitted 19 June, 2021; originally announced June 2021.

    Journal ref: Published in the Neural Information Processing Systems, 2021

  32. arXiv:2104.09644  [pdf

    cs.CL cs.AI cs.IR

    Neural Language Models with Distant Supervision to Identify Major Depressive Disorder from Clinical Notes

    Authors: Bhavani Singh Agnikula Kshatriya, Nicolas A Nunez, Manuel Gardea- Resendez, Euijung Ryu, Brandon J Coombes, Sunyang Fu, Mark A Frye, Joanna M Biernacka, Yanshan Wang

    Abstract: Major depressive disorder (MDD) is a prevalent psychiatric disorder that is associated with significant healthcare burden worldwide. Phenoty** of MDD can help early diagnosis and consequently may have significant advantages in patient management. In prior research MDD phenotypes have been extracted from structured Electronic Health Records (EHR) or using Electroencephalographic (EEG) data with t… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

  33. arXiv:2102.07922  [pdf, other

    math.OC

    Accelerated Algorithms for Smooth Convex-Concave Minimax Problems with $\mathcal{O}(1/k^2)$ Rate on Squared Gradient Norm

    Authors: TaeHo Yoon, Ernest K. Ryu

    Abstract: In this work, we study the computational complexity of reducing the squared gradient magnitude for smooth minimax optimization problems. First, we present algorithms with accelerated $\mathcal{O}(1/k^2)$ last-iterate rates, faster than the existing $\mathcal{O}(1/k)$ or slower rates for extragradient, Popov, and gradient descent with anchoring. The acceleration mechanism combines extragradient ste… ▽ More

    Submitted 10 June, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: Published at ICML 2021 as a long talk

  34. arXiv:2102.07541  [pdf, other

    cs.LG math.OC

    WGAN with an Infinitely Wide Generator Has No Spurious Stationary Points

    Authors: Albert No, TaeHo Yoon, Sehyun Kwon, Ernest K. Ryu

    Abstract: Generative adversarial networks (GAN) are a widely used class of deep generative models, but their minimax training dynamics are not understood very well. In this work, we show that GANs with a 2-layer infinite-width generator and a 2-layer finite-width discriminator trained with stochastic gradient ascent-descent have no spurious stationary points. We then show that when the width of the generato… ▽ More

    Submitted 9 June, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: Published at ICML 2021

  35. arXiv:2102.07366  [pdf, ps, other

    math.OC

    Factor-$\sqrt{2}$ Acceleration of Accelerated Gradient Methods

    Authors: Chanwoo Park, Jisun Park, Ernest K. Ryu

    Abstract: The optimized gradient method (OGM) provides a factor-$\sqrt{2}$ speedup upon Nesterov's celebrated accelerated gradient method in the convex (but non-strongly convex) setup. However, this improved acceleration mechanism has not been well understood; prior analyses of OGM relied on a computer-assisted proof methodology, so the proofs were opaque for humans despite being verifiable and correct. In… ▽ More

    Submitted 24 May, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

  36. arXiv:2001.02061  [pdf, ps, other

    math.NA math.OC

    Scaled Relative Graph of Normal Matrices

    Authors: Xinmeng Huang, Ernest K. Ryu, Wotao Yin

    Abstract: The Scaled Relative Graph (SRG) by Ryu, Hannah, and Yin (arXiv:1902.09788, 2019) is a geometric tool that maps the action of a multi-valued nonlinear operator onto the 2D plane, used to analyze the convergence of a wide range of iterative methods. As the SRG includes the spectrum for linear operators, we can view the SRG as a generalization of the spectrum to multi-valued nonlinear operators. In t… ▽ More

    Submitted 8 January, 2020; v1 submitted 27 December, 2019; originally announced January 2020.

  37. arXiv:1912.01593  [pdf, ps, other

    math.OC

    Tight Coefficients of Averaged Operators via Scaled Relative Graph

    Authors: Xinmeng Huang, Ernest K. Ryu, Wotao Yin

    Abstract: Many iterative methods in optimization are fixed-point iterations with averaged operators. As such methods converge at an $\mathcal{O}(1/k)$ rate with the constant determined by the averagedness coefficient, establishing small averagedness coefficients for operators is of broad interest. In this paper, we show that the averagedness coefficients of the composition of averaged operators by Ogura and… ▽ More

    Submitted 27 April, 2020; v1 submitted 3 December, 2019; originally announced December 2019.

  38. arXiv:1909.09747  [pdf, ps, other

    math.OC

    Finding the forward-Douglas-Rachford-forward method

    Authors: Ernest K. Ryu, Bang Cong Vu

    Abstract: We consider the monotone inclusion problem with a sum of 3 operators, in which 2 are monotone and 1 is monotone-Lipschitz. The classical Douglas--Rachford and Forward-backward-forward methods respectively solve the monotone inclusion problem with a sum of 2 monotone operators and a sum of 1 monotone and 1 monotone-Lipschitz operators. We first present a method that naturally combines Douglas--Rach… ▽ More

    Submitted 16 October, 2019; v1 submitted 20 September, 2019; originally announced September 2019.

    Comments: To appear in Journal of Optimization Theory and Applications

    MSC Class: 47H05; 47H09; 90C25

  39. arXiv:1909.06479  [pdf, other

    math.OC

    Decentralized Proximal Gradient Algorithms with Linear Convergence Rates

    Authors: Sulaiman A. Alghunaim, Ernest K. Ryu, Kun Yuan, Ali H. Sayed

    Abstract: This work studies a class of non-smooth decentralized multi-agent optimization problems where the agents aim at minimizing a sum of local strongly-convex smooth components plus a common non-smooth term. We propose a general primal-dual algorithmic framework that unifies many existing state-of-the-art algorithms. We establish linear convergence of the proposed method to the exact solution in the pr… ▽ More

    Submitted 9 July, 2020; v1 submitted 13 September, 2019; originally announced September 2019.

    Comments: To appear in IEEE Transactions on Automatic Control

  40. arXiv:1906.12141  [pdf, other

    cs.CG

    MGOS: A Library for Molecular Geometry and its Operating System

    Authors: Deok-Soo Kima, Joonghyun Ryua, Youngsong Choa, Mokwon Leeb, Jehyun Cha, Chanyoung Song, Sangwha Kim, Roman A Laskowskid, Kokichi Sugihara, Jong Bhak, Seong Eon Ryu

    Abstract: The geometry of atomic arrangement underpins the structural understanding of molecules in many fields. However, no general framework of mathematical/computational theory for the geometry of atomic arrangement exists. Here we present "Molecular Geometry (MG)" as a theoretical framework accompanied by "MG Operating System (MGOS)" which consists of callable functions implementing the MG theory. MG al… ▽ More

    Submitted 28 June, 2019; originally announced June 2019.

  41. arXiv:1905.10899  [pdf, other

    cs.LG stat.ML

    ODE Analysis of Stochastic Gradient Methods with Optimism and Anchoring for Minimax Problems

    Authors: Ernest K. Ryu, Kun Yuan, Wotao Yin

    Abstract: Despite remarkable empirical success, the training dynamics of generative adversarial networks (GAN), which involves solving a minimax game using stochastic gradients, is still poorly understood. In this work, we analyze last-iterate convergence of simultaneous gradient descent (simGD) and its variants under the assumption of convex-concavity, guided by a continuous-time analysis with differential… ▽ More

    Submitted 11 October, 2020; v1 submitted 26 May, 2019; originally announced May 2019.

  42. arXiv:1905.05406  [pdf, other

    cs.CV eess.IV

    Plug-and-Play Methods Provably Converge with Properly Trained Denoisers

    Authors: Ernest K. Ryu, Jialin Liu, Sicheng Wang, Xiaohan Chen, Zhangyang Wang, Wotao Yin

    Abstract: Plug-and-play (PnP) is a non-convex framework that integrates modern denoising priors, such as BM3D or deep learning-based denoisers, into ADMM or other proximal algorithms. An advantage of PnP is that one can use pre-trained denoisers when there is not sufficient data for end-to-end training. Although PnP has been recently studied extensively with great empirical success, theoretical analysis add… ▽ More

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: Published in the International Conference on Machine Learning, 2019

  43. Scaled Relative Graph: Nonexpansive operators via 2D Euclidean Geometry

    Authors: Ernest K. Ryu, Robert Hannah, Wotao Yin

    Abstract: Many iterative methods in applied mathematics can be thought of as fixed-point iterations, and such algorithms are usually analyzed analytically, with inequalities. In this paper, we present a geometric approach to analyzing contractive and nonexpansive fixed point iterations with a new tool called the scaled relative graph (SRG). The SRG provides a correspondence between nonlinear operators and s… ▽ More

    Submitted 16 June, 2021; v1 submitted 26 February, 2019; originally announced February 2019.

    Comments: Published in Mathematical Programming

    MSC Class: 47H05; 47H09; 51M04; 90C25

  44. arXiv:1812.00146  [pdf, other

    math.OC

    Operator Splitting Performance Estimation: Tight contraction factors and optimal parameter selection

    Authors: Ernest K. Ryu, Adrien B. Taylor, Carolina Bergeling, Pontus Giselsson

    Abstract: We propose a methodology for studying the performance of common splitting methods through semidefinite programming. We prove tightness of the methodology and demonstrate its value by presenting two applications of it. First, we use the methodology as a tool for computer-assisted proofs to prove tight analytical contraction factors for Douglas--Rachford splitting that are likely too complicated for… ▽ More

    Submitted 30 April, 2020; v1 submitted 1 December, 2018; originally announced December 2018.

    Comments: Published in the SIAM Journal on Optimization

    MSC Class: 47H05 47H09 68Q25 90C22 90C25 90C30 90C60

  45. arXiv:1810.13100  [pdf, other

    math.OC

    Splitting with Near-Circulant Linear Systems: Applications to Total Variation CT and PET

    Authors: Ernest K. Ryu, Seyoon Ko, Joong-Ho Won

    Abstract: Many imaging problems, such as total variation reconstruction of X-ray computed tomography (CT) and positron-emission tomography (PET), are solved via a convex optimization problem with near-circulant, but not actually circulant, linear systems. The popular methods to solve these problems, alternating direction method of multipliers (ADMM) and primal-dual hybrid gradient (PDHG), do not directly ut… ▽ More

    Submitted 29 November, 2019; v1 submitted 31 October, 2018; originally announced October 2018.

    Comments: Published in SIAM Journal on Scientific Computing

  46. Linear Convergence of Cyclic SAGA

    Authors: Youngsuk Park, Ernest K. Ryu

    Abstract: In this work, we present and analyze C-SAGA, a (deterministic) cyclic variant of SAGA. C-SAGA is an incremental gradient method that minimizes a sum of differentiable convex functions by cyclically accessing their gradients. Even though the theory of stochastic algorithms is more mature than that of cyclic counterparts in general, practitioners often prefer cyclic algorithms. We prove C-SAGA conve… ▽ More

    Submitted 8 January, 2020; v1 submitted 25 October, 2018; originally announced October 2018.

    Comments: Published in Optimization Letters

  47. arXiv:1802.07534  [pdf, other

    math.OC

    Uniqueness of DRS as the 2 Operator Resolvent-Splitting and Impossibility of 3 Operator Resolvent-Splitting

    Authors: Ernest K. Ryu

    Abstract: Given the success of Douglas--Rachford splitting (DRS), it is natural to ask whether DRS can be generalized. Are there other 2 operator resolvent-splittings sharing the favorable properties of DRS? Can DRS be generalized to 3 operators? This work presents the answers: no and no. In a certain sense, DRS is the unique 2 operator resolvent-splitting, and generalizing DRS to 3 operators is impossible… ▽ More

    Submitted 20 May, 2019; v1 submitted 21 February, 2018; originally announced February 2018.

    Comments: Published in Mathematical Programming

  48. Douglas--Rachford Splitting and ADMM for Pathological Convex Optimization

    Authors: Ernest K. Ryu, Yanli Liu, Wotao Yin

    Abstract: Despite the vast literature on DRS and ADMM, there has been very little work analyzing their behavior under pathologies. Most analyses assume a primal solution exists, a dual solution exists, and strong duality holds. When these assumptions are not met, i.e., under pathologies, the theory often breaks down and the empirical performance may degrade significantly. In this paper, we establish that DR… ▽ More

    Submitted 9 September, 2019; v1 submitted 19 January, 2018; originally announced January 2018.

    Comments: Published in Computational Optimization and Applications

    MSC Class: 90C46; 49N15; 90C25

  49. arXiv:1712.10279  [pdf, other

    math.OC eess.SY math.FA

    Vector and Matrix Optimal Mass Transport: Theory, Algorithm, and Applications

    Authors: Ernest K. Ryu, Yongxin Chen, Wuchen Li, Stanley Osher

    Abstract: In many applications such as color image processing, data has more than one piece of information associated with each spatial coordinate, and in such cases the classical optimal mass transport (OMT) must be generalized to handle vector-valued or matrix-valued densities. In this paper, we discuss the vector and matrix optimal mass transport and present three contributions. We first present a rigoro… ▽ More

    Submitted 16 June, 2018; v1 submitted 29 December, 2017; originally announced December 2017.

    Comments: 22 pages, 5 figures, 3 tables

    MSC Class: 65K10; 65K05; 90C25

  50. arXiv:1709.02838  [pdf, other

    math.OC

    Cosmic Divergence, Weak Cosmic Convergence, and Fixed Points at Infinity

    Authors: Ernest K. Ryu

    Abstract: To characterize the asymptotic behavior of fixed-point iterations of non-expansive operators with no fixed points, Bauschke et al. [Fixed Point Theory Appl. (2016)] recently studied cosmic convergence and conjectured that cosmic convergence always holds. This paper presents a cosmically divergent counter example, which disproves this conjecture. This paper also demonstrates, with a counter example… ▽ More

    Submitted 4 June, 2018; v1 submitted 8 September, 2017; originally announced September 2017.

    MSC Class: 47H09 (Primary); 90C25 (Secondary)