Skip to main content

Showing 1–50 of 161 results for author: Ye, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.07756  [pdf, other

    stat.ME

    The Exchangeability Assumption for Permutation Tests of Multiple Regression Models: Implications for Statistics and Data Science

    Authors: Johanna Hardin, Lauren Quesada, Julie Ye, Nicholas J. Horton

    Abstract: Permutation tests are a powerful and flexible approach to inference via resampling. As computational methods become more ubiquitous in the statistics curriculum, use of permutation tests has become more tractable. At the heart of the permutation approach is the exchangeability assumption, which determines the appropriate null sampling distribution. We explore the exchangeability assumption in the… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2403.14183  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation

    Authors: Kwanyoung Kim, Yu** Oh, Jong Chul Ye

    Abstract: The recent success of CLIP has demonstrated promising results in zero-shot semantic segmentation by transferring muiltimodal knowledge to pixel-level classification. However, leveraging pre-trained CLIP knowledge to closely align text embeddings with pixel embeddings still has limitations in existing approaches. To address this issue, we propose OTSeg, a novel multimodal attention mechanism aimed… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 22 pages, 7 figures

  3. arXiv:2402.10062  [pdf, other

    cs.LG stat.ML

    Optimal Parameter and Neuron Pruning for Out-of-Distribution Detection

    Authors: Chao Chen, Zhihang Fu, Kai Liu, Ze Chen, Mingyuan Tao, Jie** Ye

    Abstract: For a machine learning model deployed in real world scenarios, the ability of detecting out-of-distribution (OOD) samples is indispensable and challenging. Most existing OOD detection methods focused on exploring advanced training skills or training-free tricks to prevent the model from yielding overconfident confidence score for unknown samples. The training-based methods require expensive traini… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted by NeurIPS 2023. 19 pages

    Journal ref: NeurIPS 2023

  4. arXiv:2310.20579  [pdf, other

    stat.ML cs.CR cs.LG

    Initialization Matters: Privacy-Utility Analysis of Overparameterized Neural Networks

    Authors: Jiayuan Ye, Zhenyu Zhu, Fanghui Liu, Reza Shokri, Volkan Cevher

    Abstract: We analytically investigate how over-parameterization of models in randomized machine learning algorithms impacts the information leakage about their training data. Specifically, we prove a privacy bound for the KL divergence between model distributions on worst-case neighboring datasets, and explore its dependence on the initialization, width, and depth of fully connected neural networks. We find… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  5. arXiv:2310.19973  [pdf, other

    stat.ML cs.CR cs.LG math.ST stat.ME

    Unified Enhancement of Privacy Bounds for Mixture Mechanisms via $f$-Differential Privacy

    Authors: Chendi Wang, Buxin Su, Jiayuan Ye, Reza Shokri, Weijie J. Su

    Abstract: Differentially private (DP) machine learning algorithms incur many sources of randomness, such as random initialization, random batch subsampling, and shuffling. However, such randomness is difficult to take into account when proving differential privacy bounds because it induces mixture distributions for the algorithm's output that are difficult to analyze. This paper focuses on improving privacy… ▽ More

    Submitted 1 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

  6. arXiv:2310.02712  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    ED-NeRF: Efficient Text-Guided Editing of 3D Scene with Latent Space NeRF

    Authors: Jangho Park, Gihyun Kwon, Jong Chul Ye

    Abstract: Recently, there has been a significant advancement in text-to-image diffusion models, leading to groundbreaking performance in 2D image generation. These advancements have been extended to 3D models, enabling the generation of novel 3D objects from textual descriptions. This has evolved into NeRF editing methods, which allow the manipulation of existing 3D objects through textual conditioning. How… ▽ More

    Submitted 21 March, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: ICLR 2024; Project Page: https://jhq1234.github.io/ed-nerf.github.io/

  7. arXiv:2310.01110  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Prompt-tuning latent diffusion models for inverse problems

    Authors: Hyung** Chung, Jong Chul Ye, Peyman Milanfar, Mauricio Delbracio

    Abstract: We propose a new method for solving imaging inverse problems using text-to-image latent diffusion models as general priors. Existing methods using latent diffusion models for inverse problems typically rely on simple null text prompts, which can lead to suboptimal performance. To address this limitation, we introduce a method for prompt tuning, which jointly optimizes the text embedding on-the-fly… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 22 pages, 10 figures

  8. arXiv:2310.01107  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models

    Authors: Hyeonho Jeong, Jong Chul Ye

    Abstract: Recent endeavors in video editing have showcased promising results in single-attribute editing or style transfer tasks, either by training text-to-video (T2V) models on text-video data or adopting training-free methods. However, when confronted with the complexities of multi-attribute editing scenarios, they exhibit shortcomings such as omitting or overlooking intended attribute changes, modifying… ▽ More

    Submitted 24 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024, Project Page: http://ground-a-video.github.io

  9. arXiv:2309.05505  [pdf, other

    cs.LG stat.ML

    Share Your Representation Only: Guaranteed Improvement of the Privacy-Utility Tradeoff in Federated Learning

    Authors: Zebang Shen, Jiayuan Ye, Anmin Kang, Hamed Hassani, Reza Shokri

    Abstract: Repeated parameter sharing in federated learning causes significant information leakage about private data, thus defeating its main purpose: data privacy. Mitigating the risk of this information leakage, using state of the art differentially private algorithms, also does not come for free. Randomized mechanisms can prevent convergence of models on learning even the useful representation functions,… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: ICLR 2023 revised

  10. arXiv:2308.12859  [pdf, ps, other

    cs.SD cs.LG eess.AS stat.ME

    Towards Automated Animal Density Estimation with Acoustic Spatial Capture-Recapture

    Authors: Yuheng Wang, Juan Ye, David L. Borchers

    Abstract: Passive acoustic monitoring can be an effective way of monitoring wildlife populations that are acoustically active but difficult to survey visually. Digital recorders allow surveyors to gather large volumes of data at low cost, but identifying target species vocalisations in these data is non-trivial. Machine learning (ML) methods are often used to do the identification. They can process large vo… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: 35 pages, 5 figures

  11. arXiv:2307.05050  [pdf

    stat.AP

    Considerations for Master Protocols Using External Controls

    Authors: Jie Chen, Xiaoyun, Li, Chengxing, Lu, Sammy Yuan, Godwin Yung, **g**g Ye, Hong Tian, Jianchang Lin

    Abstract: There has been an increasing use of master protocols in oncology clinical trials because of its efficiency and flexibility to accelerate cancer drug development. Depending on the study objective and design, a master protocol trial can be a basket trial, an umbrella trial, a platform trial, or any other form of trials in which multiple investigational products and/or subpopulations are studied unde… ▽ More

    Submitted 10 November, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

  12. arXiv:2306.04396  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Improving Diffusion-based Image Translation using Asymmetric Gradient Guidance

    Authors: Gihyun Kwon, Jong Chul Ye

    Abstract: Diffusion models have shown significant progress in image translation tasks recently. However, due to their stochastic nature, there's often a trade-off between style transformation and content preservation. Current strategies aim to disentangle style and content, preserving the source image's structure while successfully transitioning from a source to a target domain under text or one-shot image… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

  13. arXiv:2305.19809  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Direct Diffusion Bridge using Data Consistency for Inverse Problems

    Authors: Hyung** Chung, Jeongsol Kim, Jong Chul Ye

    Abstract: Diffusion model-based inverse problem solvers have shown impressive performance, but are limited in speed, mostly as they require reverse diffusion sampling starting from noise. Several recent works have tried to alleviate this problem by building a diffusion process, directly bridging the clean and the corrupted for specific inverse problems. In this paper, we first unify these existing works und… ▽ More

    Submitted 24 October, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 camera-ready. 16 pages, 6 figures

  14. arXiv:2305.16375  [pdf, other

    cs.LG cs.AI stat.ML

    Data Topology-Dependent Upper Bounds of Neural Network Widths

    Authors: Sangmin Lee, Jong Chul Ye

    Abstract: This paper investigates the relationship between the universal approximation property of deep neural networks and topological characteristics of datasets. Our primary contribution is to introduce data topology-dependent upper bounds on the network width. Specifically, we first show that a three-layer neural network, applying a ReLU activation function and max pooling, can be designed to approximat… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  15. arXiv:2305.15086  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Unpaired Image-to-Image Translation via Neural Schrödinger Bridge

    Authors: Beomsu Kim, Gihyun Kwon, Kwanyoung Kim, Jong Chul Ye

    Abstract: Diffusion models are a powerful class of generative models which simulate stochastic differential equations (SDEs) to generate data from noise. While diffusion models have achieved remarkable progress, they have limitations in unpaired image-to-image (I2I) translation tasks due to the Gaussian prior assumption. Schrödinger Bridge (SB), which learns an SDE to translate between two arbitrary distrib… ▽ More

    Submitted 2 March, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: ICLR 2024

  16. arXiv:2305.03555  [pdf, other

    cs.LG stat.ML

    Contrastive Graph Clustering in Curvature Spaces

    Authors: Li Sun, Feiyang Wang, Junda Ye, Hao Peng, Philip S. Yu

    Abstract: Graph clustering is a longstanding research topic, and has achieved remarkable success with the deep learning methods in recent years. Nevertheless, we observe that several important issues largely remain open. On the one hand, graph clustering from the geometric perspective is appealing but has rarely been touched before, as it lacks a promising space for geometric clustering. On the other hand,… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: Accepted by IJCAI'23

  17. arXiv:2303.08622  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style Transfer

    Authors: Serin Yang, Hyunmin Hwang, Jong Chul Ye

    Abstract: Diffusion models have shown great promise in text-guided image style transfer, but there is a trade-off between style transformation and content preservation due to their stochastic nature. Existing methods require computationally expensive fine-tuning of diffusion models or additional neural network. To address this, here we propose a zero-shot contrastive loss for diffusion models that doesn't r… ▽ More

    Submitted 12 April, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

  18. arXiv:2303.05754  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Decomposed Diffusion Sampler for Accelerating Large-Scale Inverse Problems

    Authors: Hyung** Chung, Suhyeon Lee, Jong Chul Ye

    Abstract: Krylov subspace, which is generated by multiplying a given vector by the matrix of a linear transformation and its successive powers, has been extensively studied in classical optimization literature to design algorithms that converge quickly for large linear inverse problems. For example, the conjugate gradient method (CG), one of the most popular Krylov subspace methods, is based on the idea of… ▽ More

    Submitted 19 February, 2024; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: ICLR 2024; 28 pages, 9 figures

  19. arXiv:2303.04746  [pdf, other

    stat.ME

    Necessary and sufficient conditions for multiple objective optimal regression designs

    Authors: Lucy L. Gao, Jane J. Ye, Shangzhi Zeng, Julie Zhou

    Abstract: We typically construct optimal designs based on a single objective function. To better capture the breadth of an experiment's goals, we could instead construct a multiple objective optimal design based on multiple objective functions. While algorithms have been developed to find multi-objective optimal designs (e.g. efficiency-constrained and maximin optimal designs), it is far less clear how to v… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

  20. arXiv:2302.03900  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Zero-shot Generation of Coherent Storybook from Plain Text Story using Diffusion Models

    Authors: Hyeonho Jeong, Gihyun Kwon, Jong Chul Ye

    Abstract: Recent advancements in large scale text-to-image models have opened new possibilities for guiding the creation of images through human-devised natural language. However, while prior literature has primarily focused on the generation of individual images, it is essential to consider the capability of these models to ensure coherency within a sequence of images to fulfill the demands of real-world a… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

  21. arXiv:2301.12334  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Don't Play Favorites: Minority Guidance for Diffusion Models

    Authors: Soobin Um, Suhyeon Lee, Jong Chul Ye

    Abstract: We explore the problem of generating minority samples using diffusion models. The minority samples are instances that lie on low-density regions of a data manifold. Generating a sufficient number of such minority instances is important, since they often contain some unique attributes of the data. However, the conventional generation process of the diffusion models mostly yields majority samples (t… ▽ More

    Submitted 26 February, 2024; v1 submitted 28 January, 2023; originally announced January 2023.

    Comments: ICLR 2024

  22. arXiv:2301.12171  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    ZegOT: Zero-shot Segmentation Through Optimal Transport of Text Prompts

    Authors: Kwanyoung Kim, Yu** Oh, Jong Chul Ye

    Abstract: Recent success of large-scale Contrastive Language-Image Pre-training (CLIP) has led to great promise in zero-shot semantic segmentation by transferring image-text aligned knowledge to pixel-level classification. However, existing methods usually require an additional image encoder or retraining/tuning the CLIP module. Here, we propose a novel Zero-shot segmentation with Optimal Transport (ZegOT)… ▽ More

    Submitted 30 May, 2023; v1 submitted 28 January, 2023; originally announced January 2023.

    Comments: 18pages, 8 figures

  23. arXiv:2301.12003  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Minimizing Trajectory Curvature of ODE-based Generative Models

    Authors: Sangyun Lee, Beomsu Kim, Jong Chul Ye

    Abstract: Recent ODE/SDE-based generative models, such as diffusion models, rectified flows, and flow matching, define a generative process as a time reversal of a fixed forward process. Even though these models show impressive performance on large-scale datasets, numerical simulation requires multiple evaluations of a neural network, leading to a slow sampling speed. We attribute the reason to the high cur… ▽ More

    Submitted 25 May, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: ICML 2023

  24. arXiv:2211.13329  [pdf, other

    stat.AP

    Extent of Safety Database in Pediatric Drug Development: Types of Assessment, Analytical Precision, and Pathway for Extrapolation through On-Target Effects

    Authors: Margaret Gamalo, Yihua Zhao, Aijun Gao, **g**g Ye, Ralph DeMasi, Eiji Eshida, YJ Choi, Robert Nelson

    Abstract: Pediatric patients should have access to medicines that have been appropriately evaluated for safety and efficacy. Given this goal of revised labelling, the adequacy of the pediatric clinical development plan and resulting safety database must inform a favorable benefit-risk assessment for the intended use of the medicinal product. While extrapolation from adults can be used to support efficacy of… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

  25. arXiv:2211.10656  [pdf, other

    cs.CV cs.LG stat.ML

    Parallel Diffusion Models of Operator and Image for Blind Inverse Problems

    Authors: Hyung** Chung, Jeongsol Kim, Sehui Kim, Jong Chul Ye

    Abstract: Diffusion model-based inverse problem solvers have demonstrated state-of-the-art performance in cases where the forward operator is known (i.e. non-blind). However, the applicability of the method to blind inverse problems has yet to be explored. In this work, we show that we can indeed solve a family of blind inverse problems by constructing another diffusion prior for the forward operator. Speci… ▽ More

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: 25 pages, 13 figures

  26. arXiv:2210.05248  [pdf, other

    cs.LG cs.AI stat.ML

    Self-supervised debiasing using low rank regularization

    Authors: Geon Yeong Park, Chanyong Jung, Sangmin Lee, Jong Chul Ye, Sang Wan Lee

    Abstract: Spurious correlations can cause strong biases in deep neural networks, impairing generalization ability. While most existing debiasing methods require full supervision on either spurious attributes or target labels, training a debiased model from a limited amount of both annotations is still an open question. To address this issue, we investigate an interesting phenomenon using the spectral analys… ▽ More

    Submitted 8 October, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

  27. arXiv:2210.05247  [pdf, other

    cs.LG cs.AI stat.ML

    Training Debiased Subnetworks with Contrastive Weight Pruning

    Authors: Geon Yeong Park, Sangmin Lee, Sang Wan Lee, Jong Chul Ye

    Abstract: Neural networks are often biased to spuriously correlated features that provide misleading statistical evidence that does not generalize. This raises an interesting question: ``Does an optimal unbiased functional subnetwork exist in a severely biased network? If so, how to extract such subnetwork?" While empirical evidence has been accumulated about the existence of such unbiased subnetworks, thes… ▽ More

    Submitted 26 June, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: CVPR 2023, code: https://github.com/ParkGeonYeong/DCWP

  28. arXiv:2209.15264  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Diffusion-based Image Translation using Disentangled Style and Content Representation

    Authors: Gihyun Kwon, Jong Chul Ye

    Abstract: Diffusion-based image translation guided by semantic texts or a single target image has enabled flexible style transfer which is not limited to the specific domains. Unfortunately, due to the stochastic nature of diffusion models, it is often difficult to maintain the original content of the image during the reverse diffusion. To address this, here we present a novel diffusion-based unsupervised i… ▽ More

    Submitted 1 February, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: ICLR 2023 camera ready

  29. arXiv:2209.14687  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    Diffusion Posterior Sampling for General Noisy Inverse Problems

    Authors: Hyung** Chung, Jeongsol Kim, Michael T. Mccann, Marc L. Klasky, Jong Chul Ye

    Abstract: Diffusion models have been recently studied as powerful generative inverse problem solvers, owing to their high quality reconstructions and the ease of combining existing iterative solvers. However, most works focus on solving simple linear inverse problems in noiseless settings, which significantly under-represents the complexity of real-world problems. In this work, we extend diffusion solvers t… ▽ More

    Submitted 20 May, 2024; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: ICLR 2023 spotlight

  30. arXiv:2208.01864  [pdf, other

    cs.CV cs.LG stat.ML

    Pyramidal Denoising Diffusion Probabilistic Models

    Authors: Dohoon Ryu, Jong Chul Ye

    Abstract: Recently, diffusion model have demonstrated impressive image generation performances, and have been extensively studied in various computer vision tasks. Unfortunately, training and evaluating diffusion models consume a lot of time and computational resources. To address this problem, here we present a novel pyramidal diffusion model that can generate high resolution images starting from much coar… ▽ More

    Submitted 30 September, 2022; v1 submitted 3 August, 2022; originally announced August 2022.

  31. arXiv:2206.00941  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Improving Diffusion Models for Inverse Problems using Manifold Constraints

    Authors: Hyung** Chung, Byeongsu Sim, Dohoon Ryu, Jong Chul Ye

    Abstract: Recently, diffusion models have been used to solve various inverse problems in an unsupervised manner with appropriate modifications to the sampling process. However, the current solvers, which recursively apply a reverse diffusion step followed by a projection-based measurement consistency step, often produce suboptimal results. By studying the generative sampling path, here we show that current… ▽ More

    Submitted 20 May, 2024; v1 submitted 2 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022 camera-ready; 29 pages, 16 figures

  32. arXiv:2203.09301  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    One-Shot Adaptation of GAN in Just One CLIP

    Authors: Gihyun Kwon, Jong Chul Ye

    Abstract: There are many recent research efforts to fine-tune a pre-trained generator with a few target images to generate images of a novel domain. Unfortunately, these methods often suffer from overfitting or under-fitting when fine-tuned with a single target image. To address this, here we present a novel single-shot GAN adaptation method through unified CLIP space manipulations. Specifically, our model… ▽ More

    Submitted 30 January, 2023; v1 submitted 17 March, 2022; originally announced March 2022.

  33. arXiv:2203.05363  [pdf, other

    stat.ML cs.CR cs.LG

    Differentially Private Learning Needs Hidden State (Or Much Faster Convergence)

    Authors: Jiayuan Ye, Reza Shokri

    Abstract: Prior work on differential privacy analysis of randomized SGD algorithms relies on composition theorems, where the implicit (unrealistic) assumption is that the internal state of the iterative algorithm is revealed to the adversary. As a result, the Rényi DP bounds derived by such composition-based analyses linearly grow with the number of training epochs. When the internal state of the algorithm… ▽ More

    Submitted 17 October, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

  34. arXiv:2202.10887  [pdf, other

    stat.ME cs.LG stat.ML

    Policy Evaluation for Temporal and/or Spatial Dependent Experiments

    Authors: Shikai Luo, Ying Yang, Chengchun Shi, Fang Yao, Jie** Ye, Hongtu Zhu

    Abstract: The aim of this paper is to establish a causal link between the policies implemented by technology companies and the outcomes they yield within intricate temporal and/or spatial dependent experiments. We propose a novel temporal/spatio-temporal Varying Coefficient Decision Process (VCDP) model, capable of effectively capturing the evolving treatment effects in situations characterized by temporal… ▽ More

    Submitted 3 December, 2023; v1 submitted 22 February, 2022; originally announced February 2022.

  35. arXiv:2202.05510  [pdf, other

    cs.LG cs.AI stat.ML

    Support Vectors and Gradient Dynamics of Single-Neuron ReLU Networks

    Authors: Sangmin Lee, Byeongsu Sim, Jong Chul Ye

    Abstract: Understanding implicit bias of gradient descent for generalization capability of ReLU networks has been an important research topic in machine learning research. Unfortunately, even for a single ReLU neuron trained with the square loss, it was recently shown impossible to characterize the implicit regularization in terms of a norm of model parameters (Vardi & Shamir, 2021). In order to close the g… ▽ More

    Submitted 13 June, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

  36. arXiv:2112.05146  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Come-Closer-Diffuse-Faster: Accelerating Conditional Diffusion Models for Inverse Problems through Stochastic Contraction

    Authors: Hyung** Chung, Byeongsu Sim, Jong Chul Ye

    Abstract: Diffusion models have recently attained significant interest within the community owing to their strong performance as generative models. Furthermore, its application to inverse problems have demonstrated state-of-the-art performance. Unfortunately, diffusion models have a critical downside - they are inherently slow to sample from, needing few thousand steps of iteration to generate images from p… ▽ More

    Submitted 19 March, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: Accepted to CVPR 2022

  37. arXiv:2112.03696  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Noise Distribution Adaptive Self-Supervised Image Denoising using Tweedie Distribution and Score Matching

    Authors: Kwanyoung Kim, Taesung Kwon, Jong Chul Ye

    Abstract: Tweedie distributions are a special case of exponential dispersion models, which are often used in classical statistics as distributions for generalized linear models. Here, we reveal that Tweedie distributions also play key roles in modern deep learning era, leading to a distribution independent self-supervised image denoising formula without clean reference images. Specifically, by combining wit… ▽ More

    Submitted 4 December, 2021; originally announced December 2021.

  38. arXiv:2111.15155  [pdf, other

    cs.LG stat.ML

    gCastle: A Python Toolbox for Causal Discovery

    Authors: Keli Zhang, Shengyu Zhu, Marcus Kalander, Ignavier Ng, Junjian Ye, Zhitang Chen, Lujia Pan

    Abstract: $\texttt{gCastle}… ▽ More

    Submitted 30 November, 2021; originally announced November 2021.

    Comments: Tech report describing the gCastle toolbox. More details can be found in the github repository https://github.com/huawei-noah/trustworthyAI/tree/master/gcastle

  39. arXiv:2111.09679  [pdf, other

    cs.LG cs.CR stat.ML

    Enhanced Membership Inference Attacks against Machine Learning Models

    Authors: Jiayuan Ye, Aadyaa Maddi, Sasi Kumar Murakonda, Vincent Bindschaedler, Reza Shokri

    Abstract: How much does a machine learning algorithm leak about its training data, and why? Membership inference attacks are used as an auditing tool to quantify this leakage. In this paper, we present a comprehensive \textit{hypothesis testing framework} that enables us not only to formally express the prior work in a consistent way, but also to design new membership inference attacks that use reference mo… ▽ More

    Submitted 13 September, 2022; v1 submitted 18 November, 2021; originally announced November 2021.

    Comments: To appear at ACM CCS 2022

  40. arXiv:2110.09771  [pdf, ps, other

    cs.LG cs.GT stat.ML

    On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game

    Authors: Shuang Qiu, Jie** Ye, Zhaoran Wang, Zhuoran Yang

    Abstract: To achieve sample efficiency in reinforcement learning (RL), it necessitates efficiently exploring the underlying environment. Under the offline setting, addressing the exploration challenge lies in collecting an offline dataset with sufficient coverage. Motivated by such a challenge, we study the reward-free RL problem, where an agent aims to thoroughly explore the environment without any pre-spe… ▽ More

    Submitted 13 February, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

    Comments: ICML 2021

  41. arXiv:2109.00650  [pdf, other

    cs.LG cs.CV stat.ML

    Dash: Semi-Supervised Learning with Dynamic Thresholding

    Authors: Yi Xu, Lei Shang, **xing Ye, Qi Qian, Yu-Feng Li, Baigui Sun, Hao Li, Rong **

    Abstract: While semi-supervised learning (SSL) has received tremendous attentions in many machine learning tasks due to its successful use of unlabeled data, existing SSL algorithms use either all unlabeled examples or the unlabeled examples with a fixed high-confidence prediction during the training progress. However, it is possible that too many correct/wrong pseudo labeled examples are eliminated/selecte… ▽ More

    Submitted 1 September, 2021; originally announced September 2021.

    Comments: ICML 2021

  42. arXiv:2106.09246  [pdf, other

    cs.CV cs.LG stat.ML

    Federated CycleGAN for Privacy-Preserving Image-to-Image Translation

    Authors: Joonyoung Song, Jong Chul Ye

    Abstract: Unsupervised image-to-image translation methods such as CycleGAN learn to convert images from one domain to another using unpaired training data sets from different domains. Unfortunately, these approaches still require centrally collected unpaired records, potentially violating privacy and security issues. Although the recent federated learning (FL) allows a neural network to be trained without d… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

  43. arXiv:2106.03500  [pdf, other

    cs.LG stat.ML

    Density estimation on smooth manifolds with normalizing flows

    Authors: Dimitris Kalatzis, Johan Ziruo Ye, Alison Pouplin, Jesper Wohlert, Søren Hauberg

    Abstract: We present a framework for learning probability distributions on topologically non-trivial manifolds, utilizing normalizing flows. Current methods focus on manifolds that are homeomorphic to Euclidean space, enforce strong structural priors on the learned models or use operations that do not easily scale to high dimensions. In contrast, our method learns distributions on a data manifold by "gluing… ▽ More

    Submitted 9 July, 2022; v1 submitted 7 June, 2021; originally announced June 2021.

  44. arXiv:2104.09435  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Deep learning enables reference-free isotropic super-resolution for volumetric fluorescence microscopy

    Authors: Hyoungjun Park, Myeongsu Na, Bumju Kim, Soohyun Park, Ki Hean Kim, Sunghoe Chang, Jong Chul Ye

    Abstract: Volumetric imaging by fluorescence microscopy is often limited by anisotropic spatial resolution from inferior axial resolution compared to the lateral resolution. To address this problem, here we present a deep-learning-enabled unsupervised super-resolution technique that enhances anisotropic images in volumetric fluorescence microscopy. In contrast to the existing deep learning approaches that r… ▽ More

    Submitted 6 June, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

  45. arXiv:2104.08538  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Cycle-free CycleGAN using Invertible Generator for Unsupervised Low-Dose CT Denoising

    Authors: Taesung Kwon, Jong Chul Ye

    Abstract: Recently, CycleGAN was shown to provide high-performance, ultra-fast denoising for low-dose X-ray computed tomography (CT) without the need for a paired training dataset. Although this was possible thanks to cycle consistency, CycleGAN requires two generators and two discriminators to enforce cycle consistency, demanding significant GPU resources and technical skills for training. A recent proposa… ▽ More

    Submitted 17 April, 2021; originally announced April 2021.

    Comments: 12 pages, 12 figures

  46. arXiv:2102.05855  [pdf, ps, other

    stat.ML cs.CR cs.LG

    Differential Privacy Dynamics of Langevin Diffusion and Noisy Gradient Descent

    Authors: Rishav Chourasia, Jiayuan Ye, Reza Shokri

    Abstract: What is the information leakage of an iterative randomized learning algorithm about its training data, when the internal state of the algorithm is \emph{private}? How much is the contribution of each specific training epoch to the information leakage through the released model? We study this problem for noisy gradient descent algorithms, and model the \emph{dynamics} of Rényi differential privacy… ▽ More

    Submitted 8 September, 2022; v1 submitted 11 February, 2021; originally announced February 2021.

  47. arXiv:2102.05805  [pdf, other

    math.OC stat.AP

    Graph-Based Equilibrium Metrics for Dynamic Supply-Demand Systems with Applications to Ride-sourcing Platforms

    Authors: Fan Zhou, Shikai Luo, Xiaohu Qie, Jie** Ye, Hongtu Zhu

    Abstract: How to dynamically measure the local-to-global spatio-temporal coherence between demand and supply networks is a fundamental task for ride-sourcing platforms, such as DiDi. Such coherence measurement is critically important for the quantification of the market efficiency and the comparison of different platform policies, such as dispatching. The aim of this paper is to introduce a graph-based equi… ▽ More

    Submitted 23 March, 2021; v1 submitted 10 February, 2021; originally announced February 2021.

    Comments: Accepted by Journal of the American Statistical Association

  48. arXiv:2012.03842  [pdf, other

    cs.CV cs.LG stat.ML

    CycleQSM: Unsupervised QSM Deep Learning using Physics-Informed CycleGAN

    Authors: Gyutaek Oh, Hyokyoung Bae, Hyun-Seo Ahn, Sung-Hong Park, Jong Chul Ye

    Abstract: Quantitative susceptibility map** (QSM) is a useful magnetic resonance imaging (MRI) technique which provides spatial distribution of magnetic susceptibility values of tissues. QSMs can be obtained by deconvolving the dipole kernel from phase images, but the spectral nulls in the dipole kernel make the inversion ill-posed. In recent times, deep learning approaches have shown a comparable QSM rec… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

  49. arXiv:2011.10475  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    DeepPhaseCut: Deep Relaxation in Phase for Unsupervised Fourier Phase Retrieval

    Authors: Eunju Cha, Chanseok Lee, Mooseok Jang, Jong Chul Ye

    Abstract: Fourier phase retrieval is a classical problem of restoring a signal only from the measured magnitude of its Fourier transform. Although Fienup-type algorithms, which use prior knowledge in both spatial and Fourier domains, have been widely used in practice, they can often stall in local minima. Modern methods such as PhaseLift and PhaseCut may offer performance guarantees with the help of convex… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

  50. arXiv:2011.06337  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    Unsupervised MR Motion Artifact Deep Learning using Outlier-Rejecting Bootstrap Aggregation

    Authors: Gyutaek Oh, Jeong Eun Lee, Jong Chul Ye

    Abstract: Recently, deep learning approaches for MR motion artifact correction have been extensively studied. Although these approaches have shown high performance and reduced computational complexity compared to classical methods, most of them require supervised training using paired artifact-free and artifact-corrupted images, which may prohibit its use in many important clinical applications. For example… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.