Skip to main content

Showing 1–50 of 95 results for author: Ye, C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.02628  [pdf, ps, other

    stat.ML cs.CC cs.DS cs.LG

    Replicability in High Dimensional Statistics

    Authors: Max Hopkins, Russell Impagliazzo, Daniel Kane, Sihan Liu, Christopher Ye

    Abstract: The replicability crisis is a major issue across nearly all areas of empirical science, calling for the formal study of replicability in statistics. Motivated in this context, [Impagliazzo, Lei, Pitassi, and Sorrell STOC 2022] introduced the notion of replicable learning algorithms, and gave basic procedures for $1$-dimensional tasks including statistical queries. In this work, we study the comput… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 119 pages

    ACM Class: F.2.0

  2. arXiv:2403.14830  [pdf, other

    stat.ML cs.LG

    Deep Clustering Evaluation: How to Validate Internal Clustering Validation Measures

    Authors: Zeya Wang, Chenglong Ye

    Abstract: Deep clustering, a method for partitioning complex, high-dimensional data using deep neural networks, presents unique evaluation challenges. Traditional clustering validation measures, designed for low-dimensional spaces, are problematic for deep clustering, which involves projecting data into lower-dimensional embeddings before partitioning. Two key issues are identified: 1) the curse of dimensio… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  3. arXiv:2403.14183  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation

    Authors: Kwanyoung Kim, Yu** Oh, Jong Chul Ye

    Abstract: The recent success of CLIP has demonstrated promising results in zero-shot semantic segmentation by transferring muiltimodal knowledge to pixel-level classification. However, leveraging pre-trained CLIP knowledge to closely align text embeddings with pixel embeddings still has limitations in existing approaches. To address this issue, we propose OTSeg, a novel multimodal attention mechanism aimed… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 22 pages, 7 figures

  4. arXiv:2402.08991  [pdf, ps, other

    stat.ML cs.LG

    Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption

    Authors: Chenlu Ye, Jiafan He, Quanquan Gu, Tong Zhang

    Abstract: This study tackles the challenges of adversarial corruption in model-based reinforcement learning (RL), where the transition dynamics can be corrupted by an adversary. Existing studies on corruption-robust RL mostly focus on the setting of model-free RL, where robust least-square regression is often employed for value function estimation. However, these techniques cannot be directly applied to mod… ▽ More

    Submitted 14 February, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  5. arXiv:2402.08222  [pdf, other

    stat.ME

    Integration of multiview microbiome data for deciphering microbiome-metabolome-disease pathways

    Authors: Lei Fang, Yue Wang, Chenglong Ye

    Abstract: The intricate interplay between host organisms and their gut microbiota has catalyzed research into the microbiome's role in disease, shedding light on novel aspects of disease pathogenesis. However, the mechanisms through which the microbiome exerts its influence on disease remain largely unclear. In this study, we first introduce a structural equation model to delineate the pathways connecting t… ▽ More

    Submitted 16 February, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  6. arXiv:2402.07314  [pdf, other

    cs.LG stat.ML

    Online Iterative Reinforcement Learning from Human Feedback with General Preference Model

    Authors: Chenlu Ye, Wei Xiong, Yuheng Zhang, Nan Jiang, Tong Zhang

    Abstract: We study Reinforcement Learning from Human Feedback (RLHF) under a general preference oracle. In particular, we do not assume that there exists a reward function and the preference signal is drawn from the Bradley-Terry model as most of the prior works do. We consider a standard mathematical formulation, the reverse-KL regularized minimax game between two LLMs for RLHF under general preference ora… ▽ More

    Submitted 25 April, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

    Comments: RLHF, Preference Learning, Alignment for LLMs

  7. arXiv:2312.11456  [pdf, other

    cs.LG cs.AI stat.ML

    Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint

    Authors: Wei Xiong, Hanze Dong, Chenlu Ye, Ziqi Wang, Han Zhong, Heng Ji, Nan Jiang, Tong Zhang

    Abstract: This paper studies the alignment process of generative models with Reinforcement Learning from Human Feedback (RLHF). We first identify the primary challenges of existing popular methods like offline PPO and offline DPO as lacking in strategical exploration of the environment. Then, to understand the mathematical principle of RLHF, we consider a standard mathematical formulation, the reverse-KL re… ▽ More

    Submitted 1 May, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 53 pages; theoretical study and algorithmic design of iterative RLHF and DPO

  8. arXiv:2311.13180  [pdf, other

    stat.ML cs.LG

    Provably Efficient High-Dimensional Bandit Learning with Batched Feedbacks

    Authors: Jianqing Fan, Zhaoran Wang, Zhuoran Yang, Chenlu Ye

    Abstract: We study high-dimensional multi-armed contextual bandits with batched feedback where the $T$ steps of online interactions are divided into $L$ batches. In specific, each batch collects data according to a policy that depends on previous batches and the rewards are revealed only at the end of the batch. Such a feedback structure is popular in applications such as personalized medicine and online ad… ▽ More

    Submitted 24 November, 2023; v1 submitted 22 November, 2023; originally announced November 2023.

  9. arXiv:2310.02712  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    ED-NeRF: Efficient Text-Guided Editing of 3D Scene with Latent Space NeRF

    Authors: Jangho Park, Gihyun Kwon, Jong Chul Ye

    Abstract: Recently, there has been a significant advancement in text-to-image diffusion models, leading to groundbreaking performance in 2D image generation. These advancements have been extended to 3D models, enabling the generation of novel 3D objects from textual descriptions. This has evolved into NeRF editing methods, which allow the manipulation of existing 3D objects through textual conditioning. How… ▽ More

    Submitted 21 March, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: ICLR 2024; Project Page: https://jhq1234.github.io/ed-nerf.github.io/

  10. arXiv:2310.01110  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Prompt-tuning latent diffusion models for inverse problems

    Authors: Hyung** Chung, Jong Chul Ye, Peyman Milanfar, Mauricio Delbracio

    Abstract: We propose a new method for solving imaging inverse problems using text-to-image latent diffusion models as general priors. Existing methods using latent diffusion models for inverse problems typically rely on simple null text prompts, which can lead to suboptimal performance. To address this limitation, we introduce a method for prompt tuning, which jointly optimizes the text embedding on-the-fly… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 22 pages, 10 figures

  11. arXiv:2310.01107  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models

    Authors: Hyeonho Jeong, Jong Chul Ye

    Abstract: Recent endeavors in video editing have showcased promising results in single-attribute editing or style transfer tasks, either by training text-to-video (T2V) models on text-video data or adopting training-free methods. However, when confronted with the complexities of multi-attribute editing scenarios, they exhibit shortcomings such as omitting or overlooking intended attribute changes, modifying… ▽ More

    Submitted 24 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024, Project Page: http://ground-a-video.github.io

  12. arXiv:2309.02476  [pdf, other

    stat.ML cs.LG

    Optimal Sample Selection Through Uncertainty Estimation and Its Application in Deep Learning

    Authors: Yong Lin, Chen Liu, Chenlu Ye, Qing Lian, Yuan Yao, Tong Zhang

    Abstract: Modern deep learning heavily relies on large labeled datasets, which often comse with high costs in terms of both manual labeling and computational resources. To mitigate these challenges, researchers have explored the use of informative subset selection techniques, including coreset selection and active learning. Specifically, coreset selection involves sampling data with both input ($\bx$) and o… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  13. arXiv:2308.07418  [pdf, other

    cs.LG stat.ML

    Locally Adaptive and Differentiable Regression

    Authors: Mingxuan Han, Varun Shankar, Jeff M Phillips, Chenglong Ye

    Abstract: Over-parameterized models like deep nets and random forests have become very popular in machine learning. However, the natural goals of continuity and differentiability, common in regression models, are now often ignored in modern overparametrized, locally-adaptive models. We propose a general framework to construct a global continuous and differentiable model based on a weighted average of locall… ▽ More

    Submitted 12 October, 2023; v1 submitted 14 August, 2023; originally announced August 2023.

    Journal ref: Journal of Machine Learning for Modeling and Computing 2023

  14. arXiv:2306.04396  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Improving Diffusion-based Image Translation using Asymmetric Gradient Guidance

    Authors: Gihyun Kwon, Jong Chul Ye

    Abstract: Diffusion models have shown significant progress in image translation tasks recently. However, due to their stochastic nature, there's often a trade-off between style transformation and content preservation. Current strategies aim to disentangle style and content, preserving the source image's structure while successfully transitioning from a source to a target domain under text or one-shot image… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

  15. arXiv:2305.19809  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Direct Diffusion Bridge using Data Consistency for Inverse Problems

    Authors: Hyung** Chung, Jeongsol Kim, Jong Chul Ye

    Abstract: Diffusion model-based inverse problem solvers have shown impressive performance, but are limited in speed, mostly as they require reverse diffusion sampling starting from noise. Several recent works have tried to alleviate this problem by building a diffusion process, directly bridging the clean and the corrupted for specific inverse problems. In this paper, we first unify these existing works und… ▽ More

    Submitted 24 October, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 camera-ready. 16 pages, 6 figures

  16. arXiv:2305.16375  [pdf, other

    cs.LG cs.AI stat.ML

    Data Topology-Dependent Upper Bounds of Neural Network Widths

    Authors: Sangmin Lee, Jong Chul Ye

    Abstract: This paper investigates the relationship between the universal approximation property of deep neural networks and topological characteristics of datasets. Our primary contribution is to introduce data topology-dependent upper bounds on the network width. Specifically, we first show that a three-layer neural network, applying a ReLU activation function and max pooling, can be designed to approximat… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  17. arXiv:2305.15086  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Unpaired Image-to-Image Translation via Neural Schrödinger Bridge

    Authors: Beomsu Kim, Gihyun Kwon, Kwanyoung Kim, Jong Chul Ye

    Abstract: Diffusion models are a powerful class of generative models which simulate stochastic differential equations (SDEs) to generate data from noise. While diffusion models have achieved remarkable progress, they have limitations in unpaired image-to-image (I2I) translation tasks due to the Gaussian prior assumption. Schrödinger Bridge (SB), which learns an SDE to translate between two arbitrary distrib… ▽ More

    Submitted 2 March, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: ICLR 2024

  18. arXiv:2305.00520  [pdf, other

    stat.ML cs.LG

    The ART of Transfer Learning: An Adaptive and Robust Pipeline

    Authors: Boxiang Wang, Yunan Wu, Chenglong Ye

    Abstract: Transfer learning is an essential tool for improving the performance of primary tasks by leveraging information from auxiliary data resources. In this work, we propose Adaptive Robust Transfer Learning (ART), a flexible pipeline of performing transfer learning with generic machine learning algorithms. We establish the non-asymptotic learning theory of ART, providing a provable theoretical guarante… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

  19. arXiv:2303.08622  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style Transfer

    Authors: Serin Yang, Hyunmin Hwang, Jong Chul Ye

    Abstract: Diffusion models have shown great promise in text-guided image style transfer, but there is a trade-off between style transformation and content preservation due to their stochastic nature. Existing methods require computationally expensive fine-tuning of diffusion models or additional neural network. To address this, here we propose a zero-shot contrastive loss for diffusion models that doesn't r… ▽ More

    Submitted 12 April, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

  20. arXiv:2303.05754  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Decomposed Diffusion Sampler for Accelerating Large-Scale Inverse Problems

    Authors: Hyung** Chung, Suhyeon Lee, Jong Chul Ye

    Abstract: Krylov subspace, which is generated by multiplying a given vector by the matrix of a linear transformation and its successive powers, has been extensively studied in classical optimization literature to design algorithms that converge quickly for large linear inverse problems. For example, the conjugate gradient method (CG), one of the most popular Krylov subspace methods, is based on the idea of… ▽ More

    Submitted 19 February, 2024; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: ICLR 2024; 28 pages, 9 figures

  21. arXiv:2302.03900  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Zero-shot Generation of Coherent Storybook from Plain Text Story using Diffusion Models

    Authors: Hyeonho Jeong, Gihyun Kwon, Jong Chul Ye

    Abstract: Recent advancements in large scale text-to-image models have opened new possibilities for guiding the creation of images through human-devised natural language. However, while prior literature has primarily focused on the generation of individual images, it is essential to consider the capability of these models to ensure coherency within a sequence of images to fulfill the demands of real-world a… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

  22. arXiv:2301.12334  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Don't Play Favorites: Minority Guidance for Diffusion Models

    Authors: Soobin Um, Suhyeon Lee, Jong Chul Ye

    Abstract: We explore the problem of generating minority samples using diffusion models. The minority samples are instances that lie on low-density regions of a data manifold. Generating a sufficient number of such minority instances is important, since they often contain some unique attributes of the data. However, the conventional generation process of the diffusion models mostly yields majority samples (t… ▽ More

    Submitted 26 February, 2024; v1 submitted 28 January, 2023; originally announced January 2023.

    Comments: ICLR 2024

  23. arXiv:2301.12171  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    ZegOT: Zero-shot Segmentation Through Optimal Transport of Text Prompts

    Authors: Kwanyoung Kim, Yu** Oh, Jong Chul Ye

    Abstract: Recent success of large-scale Contrastive Language-Image Pre-training (CLIP) has led to great promise in zero-shot semantic segmentation by transferring image-text aligned knowledge to pixel-level classification. However, existing methods usually require an additional image encoder or retraining/tuning the CLIP module. Here, we propose a novel Zero-shot segmentation with Optimal Transport (ZegOT)… ▽ More

    Submitted 30 May, 2023; v1 submitted 28 January, 2023; originally announced January 2023.

    Comments: 18pages, 8 figures

  24. arXiv:2301.12003  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Minimizing Trajectory Curvature of ODE-based Generative Models

    Authors: Sangyun Lee, Beomsu Kim, Jong Chul Ye

    Abstract: Recent ODE/SDE-based generative models, such as diffusion models, rectified flows, and flow matching, define a generative process as a time reversal of a fixed forward process. Even though these models show impressive performance on large-scale datasets, numerical simulation requires multiple evaluations of a neural network, leading to a slow sampling speed. We attribute the reason to the high cur… ▽ More

    Submitted 25 May, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: ICML 2023

  25. arXiv:2212.05949  [pdf, ps, other

    stat.ML cs.LG

    Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes

    Authors: Chenlu Ye, Wei Xiong, Quanquan Gu, Tong Zhang

    Abstract: Despite the significant interest and progress in reinforcement learning (RL) problems with adversarial corruption, current works are either confined to the linear setting or lead to an undesired $\tilde{O}(\sqrt{T}ζ)$ regret bound, where $T$ is the number of rounds and $ζ$ is the total amount of corruption. In this paper, we consider the contextual bandit with general function approximation and pr… ▽ More

    Submitted 10 February, 2024; v1 submitted 12 December, 2022; originally announced December 2022.

    Comments: We study the corruption-robust MDPs and contextual bandits with general function approximation

    Journal ref: ICML 2023

  26. arXiv:2211.10656  [pdf, other

    cs.CV cs.LG stat.ML

    Parallel Diffusion Models of Operator and Image for Blind Inverse Problems

    Authors: Hyung** Chung, Jeongsol Kim, Sehui Kim, Jong Chul Ye

    Abstract: Diffusion model-based inverse problem solvers have demonstrated state-of-the-art performance in cases where the forward operator is known (i.e. non-blind). However, the applicability of the method to blind inverse problems has yet to be explored. In this work, we show that we can indeed solve a family of blind inverse problems by constructing another diffusion prior for the forward operator. Speci… ▽ More

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: 25 pages, 13 figures

  27. arXiv:2210.05248  [pdf, other

    cs.LG cs.AI stat.ML

    Self-supervised debiasing using low rank regularization

    Authors: Geon Yeong Park, Chanyong Jung, Sangmin Lee, Jong Chul Ye, Sang Wan Lee

    Abstract: Spurious correlations can cause strong biases in deep neural networks, impairing generalization ability. While most existing debiasing methods require full supervision on either spurious attributes or target labels, training a debiased model from a limited amount of both annotations is still an open question. To address this issue, we investigate an interesting phenomenon using the spectral analys… ▽ More

    Submitted 8 October, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

  28. arXiv:2210.05247  [pdf, other

    cs.LG cs.AI stat.ML

    Training Debiased Subnetworks with Contrastive Weight Pruning

    Authors: Geon Yeong Park, Sangmin Lee, Sang Wan Lee, Jong Chul Ye

    Abstract: Neural networks are often biased to spuriously correlated features that provide misleading statistical evidence that does not generalize. This raises an interesting question: ``Does an optimal unbiased functional subnetwork exist in a severely biased network? If so, how to extract such subnetwork?" While empirical evidence has been accumulated about the existence of such unbiased subnetworks, thes… ▽ More

    Submitted 26 June, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: CVPR 2023, code: https://github.com/ParkGeonYeong/DCWP

  29. arXiv:2209.15264  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Diffusion-based Image Translation using Disentangled Style and Content Representation

    Authors: Gihyun Kwon, Jong Chul Ye

    Abstract: Diffusion-based image translation guided by semantic texts or a single target image has enabled flexible style transfer which is not limited to the specific domains. Unfortunately, due to the stochastic nature of diffusion models, it is often difficult to maintain the original content of the image during the reverse diffusion. To address this, here we present a novel diffusion-based unsupervised i… ▽ More

    Submitted 1 February, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: ICLR 2023 camera ready

  30. arXiv:2209.14687  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    Diffusion Posterior Sampling for General Noisy Inverse Problems

    Authors: Hyung** Chung, Jeongsol Kim, Michael T. Mccann, Marc L. Klasky, Jong Chul Ye

    Abstract: Diffusion models have been recently studied as powerful generative inverse problem solvers, owing to their high quality reconstructions and the ease of combining existing iterative solvers. However, most works focus on solving simple linear inverse problems in noiseless settings, which significantly under-represents the complexity of real-world problems. In this work, we extend diffusion solvers t… ▽ More

    Submitted 20 May, 2024; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: ICLR 2023 spotlight

  31. arXiv:2208.01864  [pdf, other

    cs.CV cs.LG stat.ML

    Pyramidal Denoising Diffusion Probabilistic Models

    Authors: Dohoon Ryu, Jong Chul Ye

    Abstract: Recently, diffusion model have demonstrated impressive image generation performances, and have been extensively studied in various computer vision tasks. Unfortunately, training and evaluating diffusion models consume a lot of time and computational resources. To address this problem, here we present a novel pyramidal diffusion model that can generate high resolution images starting from much coar… ▽ More

    Submitted 30 September, 2022; v1 submitted 3 August, 2022; originally announced August 2022.

  32. arXiv:2206.11944  [pdf, ps, other

    stat.ME

    High-dimensional Variable Screening via Conditional Martingale Difference Divergence

    Authors: Lei Fang, Qingcong Yuan, Xiangrong Yin, Chenglong Ye

    Abstract: Variable screening has been a useful research area that deals with ultrahigh-dimensional data. When there exist both marginally and jointly dependent predictors to the response, existing methods such as conditional screening or iterative screening often suffer from instability against the selection of the conditional set or the computational burden, respectively. In this article, we propose a new… ▽ More

    Submitted 6 July, 2023; v1 submitted 23 June, 2022; originally announced June 2022.

  33. arXiv:2206.00941  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Improving Diffusion Models for Inverse Problems using Manifold Constraints

    Authors: Hyung** Chung, Byeongsu Sim, Dohoon Ryu, Jong Chul Ye

    Abstract: Recently, diffusion models have been used to solve various inverse problems in an unsupervised manner with appropriate modifications to the sampling process. However, the current solvers, which recursively apply a reverse diffusion step followed by a projection-based measurement consistency step, often produce suboptimal results. By studying the generative sampling path, here we show that current… ▽ More

    Submitted 20 May, 2024; v1 submitted 2 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022 camera-ready; 29 pages, 16 figures

  34. arXiv:2203.09301  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    One-Shot Adaptation of GAN in Just One CLIP

    Authors: Gihyun Kwon, Jong Chul Ye

    Abstract: There are many recent research efforts to fine-tune a pre-trained generator with a few target images to generate images of a novel domain. Unfortunately, these methods often suffer from overfitting or under-fitting when fine-tuned with a single target image. To address this, here we present a novel single-shot GAN adaptation method through unified CLIP space manipulations. Specifically, our model… ▽ More

    Submitted 30 January, 2023; v1 submitted 17 March, 2022; originally announced March 2022.

  35. arXiv:2202.05510  [pdf, other

    cs.LG cs.AI stat.ML

    Support Vectors and Gradient Dynamics of Single-Neuron ReLU Networks

    Authors: Sangmin Lee, Byeongsu Sim, Jong Chul Ye

    Abstract: Understanding implicit bias of gradient descent for generalization capability of ReLU networks has been an important research topic in machine learning research. Unfortunately, even for a single ReLU neuron trained with the square loss, it was recently shown impossible to characterize the implicit regularization in terms of a norm of model parameters (Vardi & Shamir, 2021). In order to close the g… ▽ More

    Submitted 13 June, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

  36. arXiv:2112.05146  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Come-Closer-Diffuse-Faster: Accelerating Conditional Diffusion Models for Inverse Problems through Stochastic Contraction

    Authors: Hyung** Chung, Byeongsu Sim, Jong Chul Ye

    Abstract: Diffusion models have recently attained significant interest within the community owing to their strong performance as generative models. Furthermore, its application to inverse problems have demonstrated state-of-the-art performance. Unfortunately, diffusion models have a critical downside - they are inherently slow to sample from, needing few thousand steps of iteration to generate images from p… ▽ More

    Submitted 19 March, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: Accepted to CVPR 2022

  37. arXiv:2112.03696  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Noise Distribution Adaptive Self-Supervised Image Denoising using Tweedie Distribution and Score Matching

    Authors: Kwanyoung Kim, Taesung Kwon, Jong Chul Ye

    Abstract: Tweedie distributions are a special case of exponential dispersion models, which are often used in classical statistics as distributions for generalized linear models. Here, we reveal that Tweedie distributions also play key roles in modern deep learning era, leading to a distribution independent self-supervised image denoising formula without clean reference images. Specifically, by combining wit… ▽ More

    Submitted 4 December, 2021; originally announced December 2021.

  38. arXiv:2106.09246  [pdf, other

    cs.CV cs.LG stat.ML

    Federated CycleGAN for Privacy-Preserving Image-to-Image Translation

    Authors: Joonyoung Song, Jong Chul Ye

    Abstract: Unsupervised image-to-image translation methods such as CycleGAN learn to convert images from one domain to another using unpaired training data sets from different domains. Unfortunately, these approaches still require centrally collected unpaired records, potentially violating privacy and security issues. Although the recent federated learning (FL) allows a neural network to be trained without d… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

  39. arXiv:2104.09435  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Deep learning enables reference-free isotropic super-resolution for volumetric fluorescence microscopy

    Authors: Hyoungjun Park, Myeongsu Na, Bumju Kim, Soohyun Park, Ki Hean Kim, Sunghoe Chang, Jong Chul Ye

    Abstract: Volumetric imaging by fluorescence microscopy is often limited by anisotropic spatial resolution from inferior axial resolution compared to the lateral resolution. To address this problem, here we present a deep-learning-enabled unsupervised super-resolution technique that enhances anisotropic images in volumetric fluorescence microscopy. In contrast to the existing deep learning approaches that r… ▽ More

    Submitted 6 June, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

  40. arXiv:2104.08538  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Cycle-free CycleGAN using Invertible Generator for Unsupervised Low-Dose CT Denoising

    Authors: Taesung Kwon, Jong Chul Ye

    Abstract: Recently, CycleGAN was shown to provide high-performance, ultra-fast denoising for low-dose X-ray computed tomography (CT) without the need for a paired training dataset. Although this was possible thanks to cycle consistency, CycleGAN requires two generators and two discriminators to enforce cycle consistency, demanding significant GPU resources and technical skills for training. A recent proposa… ▽ More

    Submitted 17 April, 2021; originally announced April 2021.

    Comments: 12 pages, 12 figures

  41. arXiv:2103.00874  [pdf, ps, other

    eess.SP stat.AP

    Path-specific Underwater Acoustic Channel Tracking and its Application in Passive Time Reversal Mirror

    Authors: Xiuqing Li, Wei Li, Xinlin Yi, Qihang Huang, Yuhang Wang, Chenzhe Ye

    Abstract: We consider the underwater acoustic channel which is time-variant and doubly-spread in this work. Since conventional channel estimation and decision feedback equalizer (DFE) can not work well for this type of channel, a path-specific underwater acoustic channel tracking is proposed. It is based on the framework of Kalman filter. We provide a simplified sound propagation model as the state transiti… ▽ More

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: Submitted to IEEE Journal of Oceanic Engineering

  42. arXiv:2012.03842  [pdf, other

    cs.CV cs.LG stat.ML

    CycleQSM: Unsupervised QSM Deep Learning using Physics-Informed CycleGAN

    Authors: Gyutaek Oh, Hyokyoung Bae, Hyun-Seo Ahn, Sung-Hong Park, Jong Chul Ye

    Abstract: Quantitative susceptibility map** (QSM) is a useful magnetic resonance imaging (MRI) technique which provides spatial distribution of magnetic susceptibility values of tissues. QSMs can be obtained by deconvolving the dipole kernel from phase images, but the spectral nulls in the dipole kernel make the inversion ill-posed. In recent times, deep learning approaches have shown a comparable QSM rec… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

  43. arXiv:2011.10475  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    DeepPhaseCut: Deep Relaxation in Phase for Unsupervised Fourier Phase Retrieval

    Authors: Eunju Cha, Chanseok Lee, Mooseok Jang, Jong Chul Ye

    Abstract: Fourier phase retrieval is a classical problem of restoring a signal only from the measured magnitude of its Fourier transform. Although Fienup-type algorithms, which use prior knowledge in both spatial and Fourier domains, have been widely used in practice, they can often stall in local minima. Modern methods such as PhaseLift and PhaseCut may offer performance guarantees with the help of convex… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

  44. arXiv:2011.06337  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    Unsupervised MR Motion Artifact Deep Learning using Outlier-Rejecting Bootstrap Aggregation

    Authors: Gyutaek Oh, Jeong Eun Lee, Jong Chul Ye

    Abstract: Recently, deep learning approaches for MR motion artifact correction have been extensively studied. Although these approaches have shown high performance and reduced computational complexity compared to classical methods, most of them require supervised training using paired artifact-free and artifact-corrupted images, which may prohibit its use in many important clinical applications. For example… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

  45. arXiv:2008.13646  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Switchable Deep Beamformer

    Authors: Shujaat Khan, Jaeyoung Huh, Jong Chul Ye

    Abstract: Recent proposals of deep beamformers using deep neural networks have attracted significant attention as computational efficient alternatives to adaptive and compressive beamformers. Moreover, deep beamformers are versatile in that image post-processing algorithms can be combined with the beamforming. Unfortunately, in the current technology, a separate beamformer should be trained and stored for e… ▽ More

    Submitted 4 September, 2020; v1 submitted 31 August, 2020; originally announced August 2020.

  46. arXiv:2008.12967  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Unpaired Deep Learning for Accelerated MRI using Optimal Transport Driven CycleGAN

    Authors: Gyutaek Oh, Byeongsu Sim, Hyung** Chung, Leonard Sunwoo, Jong Chul Ye

    Abstract: Recently, deep learning approaches for accelerated MRI have been extensively studied thanks to their high performance reconstruction in spite of significantly reduced runtime complexity. These neural networks are usually trained in a supervised manner, so matched pairs of subsampled and fully sampled k-space data are required. Unfortunately, it is often difficult to acquire matched fully sampled k… ▽ More

    Submitted 29 August, 2020; originally announced August 2020.

    Comments: Accepted for IEEE Transactions on Computational Imaging

  47. arXiv:2008.05772  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    CycleMorph: Cycle Consistent Unsupervised Deformable Image Registration

    Authors: Boah Kim, Dong Hwan Kim, Seong Ho Park, Jieun Kim, June-Goo Lee, Jong Chul Ye

    Abstract: Image registration is a fundamental task in medical image analysis. Recently, deep learning based image registration methods have been extensively investigated due to their excellent performance despite the ultra-fast computational time. However, the existing deep learning methods still have limitation in the preservation of original topology during the deformation with registration vector fields.… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

  48. arXiv:2008.05753  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    AdaIN-Switchable CycleGAN for Efficient Unsupervised Low-Dose CT Denoising

    Authors: Jawook Gu, Jong Chul Ye

    Abstract: Recently, deep learning approaches have been extensively studied for low-dose CT denoising thanks to its superior performance despite the fast computational time. In particular, cycleGAN has been demonstrated as a powerful unsupervised learning scheme to improve the low-dose CT image quality without requiring matched high-dose reference data. Unfortunately, one of the main limitations of the cycle… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: 12 pages, 10 figures

  49. arXiv:2008.01362  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Two-Stage Deep Learning for Accelerated 3D Time-of-Flight MRA without Matched Training Data

    Authors: Hyung** Chung, Eunju Cha, Leonard Sunwoo, Jong Chul Ye

    Abstract: Time-of-flight magnetic resonance angiography (TOF-MRA) is one of the most widely used non-contrast MR imaging methods to visualize blood vessels, but due to the 3-D volume acquisition highly accelerated acquisition is necessary. Accordingly, high quality reconstruction from undersampled TOF-MRA is an important research topic for deep learning. However, most existing deep learning works require ma… ▽ More

    Submitted 4 August, 2020; originally announced August 2020.

  50. arXiv:2007.05205  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    OT-driven Multi-Domain Unsupervised Ultrasound Image Artifact Removal using a Single CNN

    Authors: Jaeyoung Huh, Shujaat Khan, Jong Chul Ye

    Abstract: Ultrasound imaging (US) often suffers from distinct image artifacts from various sources. Classic approaches for solving these problems are usually model-based iterative approaches that have been developed specifically for each type of artifact, which are often computationally intensive. Recently, deep learning approaches have been proposed as computationally efficient and high performance alterna… ▽ More

    Submitted 10 July, 2020; originally announced July 2020.