Skip to main content

Showing 1–50 of 232 results for author: Zhu, Y

Searching in archive stat. Search in all archives.
.
  1. Causal Inference with Latent Variables: Recent Advances and Future Prospectives

    Authors: Yaochen Zhu, Yinhan He, **g Ma, Mengxuan Hu, Sheng Li, Jundong Li

    Abstract: Causality lays the foundation for the trajectory of our world. Causal inference (CI), which aims to infer intrinsic causal relations among variables of interest, has emerged as a crucial research topic. Nevertheless, the lack of observation of important variables (e.g., confounders, mediators, exogenous variables, etc.) severely compromises the reliability of CI methods. The issue may arise from t… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted by KDD'24 Survey Track

  2. arXiv:2406.12171  [pdf, other

    stat.ME stat.AP

    Model Selection for Causal Modeling in Missing Exposure Problems

    Authors: Yuliang Shi, Yeying Zhu, Joel A. Dubin

    Abstract: In causal inference, properly selecting the propensity score (PS) model is a popular topic and has been widely investigated in observational studies. In addition, there is a large literature concerning the missing data problem. However, there are very few studies investigating the model selection issue for causal inference when the exposure is missing at random (MAR). In this paper, we discuss how… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2406.08819  [pdf, other

    cs.LG cs.AI stat.ML

    AIM: Attributing, Interpreting, Mitigating Data Unfairness

    Authors: Zhining Liu, Ruizhong Qiu, Zhichen Zeng, Yada Zhu, Hendrik Hamann, Hanghang Tong

    Abstract: Data collected in the real world often encapsulates historical discrimination against disadvantaged groups and individuals. Existing fair machine learning (FairML) research has predominantly focused on mitigating discriminative bias in the model prediction, with far less effort dedicated towards exploring how to trace biases present in the data, despite its importance for the transparency and inte… ▽ More

    Submitted 18 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 12 pages, 6 figures, accepted by ACM SIGKDD 2024. Webpage: https://github.com/ZhiningLiu1998/AIM

  4. arXiv:2406.08668  [pdf, other

    stat.ME

    Causal Inference on Missing Exposure via Robust Estimation

    Authors: Yuliang Shi, Yeying Zhu, Joel A. Dubin

    Abstract: How to deal with missing data in observational studies is a common concern for causal inference. When the covariates are missing at random (MAR), multiple approaches have been provided to help solve the issue. However, if the exposure is MAR, few approaches are available and careful adjustments on both missingness and confounding issues are required to ensure a consistent estimate of the true caus… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  5. arXiv:2406.05745  [pdf, other

    stat.ML cs.AI cs.LG

    Structured Learning of Compositional Sequential Interventions

    Authors: Jialin Yu, Andreas Koukorinis, Nicolò Colombo, Yuchen Zhu, Ricardo Silva

    Abstract: We consider sequential treatment regimes where each unit is exposed to combinations of interventions over time. When interventions are described by qualitative labels, such as ``close schools for a month due to a pandemic'' or ``promote this podcast to this user during this week'', it is unclear which appropriate structural assumptions allow us to generalize behavioral predictions to previously un… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  6. arXiv:2405.20418  [pdf, other

    stat.AP stat.ME

    A Bayesian joint model of multiple nonlinear longitudinal and competing risks outcomes for dynamic prediction in multiple myeloma: joint estimation and corrected two-stage approaches

    Authors: Danilo Alvares, Jessica K. Barrett, François Mercier, Spyros Roumpanis, Sean Yiu, Felipe Castro, Jochen Schulze, Ya**g Zhu

    Abstract: Predicting cancer-associated clinical events is challenging in oncology. In Multiple Myeloma (MM), a cancer of plasma cells, disease progression is determined by changes in biomarkers, such as serum concentration of the paraprotein secreted by plasma cells (M-protein). Therefore, the time-dependent behaviour of M-protein and the transition across lines of therapy (LoT) that may be a consequence of… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 38 pages, 13 figures

  7. arXiv:2405.16577  [pdf, other

    stat.ML cs.LG

    Reflected Flow Matching

    Authors: Tianyu Xie, Yu Zhu, Longlin Yu, Tong Yang, Ziheng Cheng, Shiyue Zhang, Xiangyu Zhang, Cheng Zhang

    Abstract: Continuous normalizing flows (CNFs) learn an ordinary differential equation to transform prior samples into data. Flow matching (FM) has recently emerged as a simulation-free approach for training CNFs by regressing a velocity model towards the conditional velocity field. However, on constrained domains, the learned velocity model may lead to undesirable flows that result in highly unnatural sampl… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: ICML 2024 camera-ready

  8. arXiv:2405.16381  [pdf, other

    cs.LG cs.AI stat.ML

    Trivialized Momentum Facilitates Diffusion Generative Modeling on Lie Groups

    Authors: Yuchen Zhu, Tianrong Chen, Lingkai Kong, Evangelos A. Theodorou, Molei Tao

    Abstract: The generative modeling of data on manifold is an important task, for which diffusion models in flat spaces typically need nontrivial adaptations. This article demonstrates how a technique called `trivialization' can transfer the effectiveness of diffusion models in Euclidean spaces to Lie groups. In particular, an auxiliary momentum variable was algorithmically introduced to help transport the po… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  9. arXiv:2405.08920  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    Neural Collapse Meets Differential Privacy: Curious Behaviors of NoisyGD with Near-perfect Representation Learning

    Authors: Chendi Wang, Yuqing Zhu, Weijie J. Su, Yu-Xiang Wang

    Abstract: A recent study by De et al. (2022) has reported that large-scale representation learning through pre-training on a public dataset significantly enhances differentially private (DP) learning in downstream tasks, despite the high dimensionality of the feature space. To theoretically explain this phenomenon, we consider the setting of a layer-peeled model in representation learning, which results in… ▽ More

    Submitted 16 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: To appear in ICML 2024

  10. arXiv:2404.12312  [pdf, ps, other

    cs.LG math.OC stat.ML

    A Mean-Field Analysis of Neural Stochastic Gradient Descent-Ascent for Functional Minimiax Optimization

    Authors: Yuchen Zhu, Yufeng Zhang, Zhaoran Wang, Zhuoran Yang, Xiaohong Chen

    Abstract: This paper studies minimax optimization problems defined over infinite-dimensional function classes of overparameterized two-layer neural networks. In particular, we consider the minimax optimization problem stemming from estimating linear functional equations defined by conditional expectations, where the objective functions are quadratic in the functional spaces. We address (i) the convergence o… ▽ More

    Submitted 25 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Submitted

  11. arXiv:2404.06336  [pdf, other

    quant-ph cs.LG stat.ML

    Quantum State Generation with Structure-Preserving Diffusion Model

    Authors: Yuchen Zhu, Tianrong Chen, Evangelos A. Theodorou, Xie Chen, Molei Tao

    Abstract: This article considers the generative modeling of the (mixed) states of quantum systems, and an approach based on denoising diffusion model is proposed. The key contribution is an algorithmic innovation that respects the physical nature of quantum states. More precisely, the commonly used density matrix representation of mixed-state has to be complex-valued Hermitian, positive semi-definite, and t… ▽ More

    Submitted 25 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  12. Statistical Inference For Noisy Matrix Completion Incorporating Auxiliary Information

    Authors: Shujie Ma, Po-Yao Niu, Yichong Zhang, Yinchu Zhu

    Abstract: This paper investigates statistical inference for noisy matrix completion in a semi-supervised model when auxiliary covariates are available. The model consists of two parts. One part is a low-rank matrix induced by unobserved latent factors; the other part models the effects of the observed covariates through a coefficient matrix which is composed of high-dimensional column vectors. We model the… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  13. arXiv:2403.11163  [pdf, ps, other

    stat.ME cs.LG math.ST stat.CO

    A Selective Review on Statistical Methods for Massive Data Computation: Distributed Computing, Subsampling, and Minibatch Techniques

    Authors: Xuetong Li, Yuan Gao, Hong Chang, Danyang Huang, Yingying Ma, Rui Pan, Haobo Qi, Feifei Wang, Shuyuan Wu, Ke Xu, **g Zhou, Xuening Zhu, Yingqiu Zhu, Hansheng Wang

    Abstract: This paper presents a selective review of statistical computation methods for massive data analysis. A huge amount of statistical methods for massive data computation have been rapidly developed in the past decades. In this work, we focus on three categories of statistical computation methods: (1) distributed computing, (2) subsampling methods, and (3) minibatch gradient techniques. The first clas… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  14. arXiv:2402.08845  [pdf, other

    cs.LG stat.ME

    Feature Attribution with Necessity and Sufficiency via Dual-stage Perturbation Test for Causal Explanation

    Authors: Xuexin Chen, Ruichu Cai, Zhengting Huang, Yuxuan Zhu, Julien Horwood, Zhifeng Hao, Zijian Li, Jose Miguel Hernandez-Lobato

    Abstract: We investigate the problem of explainability for machine learning models, focusing on Feature Attribution Methods (FAMs) that evaluate feature importance through perturbation tests. Despite their utility, FAMs struggle to distinguish the contributions of different features, when their prediction changes are similar after perturbation. To enhance FAMs' discriminative power, we introduce Feature Att… ▽ More

    Submitted 4 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: Accepted in the Proceedings of the 41st International Conference on Machine Learning (ICML2024)

  15. arXiv:2402.05336  [pdf, other

    stat.AP cs.SI

    Treatment Effect Estimation Amidst Dynamic Network Interference in Online Gaming Experiments

    Authors: Yu Zhu, Zehang Richard Li, Yang Su, Zhenyu Zhao

    Abstract: The evolving landscape of online multiplayer gaming presents unique challenges in assessing the causal impacts of game features. Traditional A/B testing methodologies fall short due to complex player interactions, leading to violations of fundamental assumptions like the Stable Unit Treatment Value Assumption (SUTVA). Unlike traditional social networks with stable and long-term connections, networ… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  16. arXiv:2402.04010  [pdf, other

    cs.LG stat.ML

    Efficient Availability Attacks against Supervised and Contrastive Learning Simultaneously

    Authors: Yihan Wang, Yifan Zhu, Xiao-Shan Gao

    Abstract: Availability attacks can prevent the unauthorized use of private data and commercial datasets by generating imperceptible noise and making unlearnable examples before release. Ideally, the obtained unlearnability prevents algorithms from training usable models. When supervised learning (SL) algorithms have failed, a malicious data collector possibly resorts to contrastive learning (CL) algorithms… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  17. arXiv:2312.17065  [pdf, other

    stat.ME cs.SE stat.AP

    CluBear: A Subsampling Package for Interactive Statistical Analysis with Massive Data on A Single Machine

    Authors: Ke Xu, Yingqiu Zhu, Yi**g Liu, Hansheng Wang

    Abstract: This article introduces CluBear, a Python-based open-source package for interactive massive data analysis. The key feature of CluBear is that it enables users to conduct convenient and interactive statistical analysis of massive data with only a traditional single-computer system. Thus, CluBear provides a cost-effective solution when mining large-scale datasets. In addition, the CluBear package in… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  18. arXiv:2312.08728  [pdf, other

    stat.CO

    Mini-batch Gradient Descent with Buffer

    Authors: Haobo Qi, Du Huang, Yingqiu Zhu, Danyang Huang, Hansheng Wang

    Abstract: In this paper, we studied a buffered mini-batch gradient descent (BMGD) algorithm for training complex model on massive datasets. The algorithm studied here is designed for fast training on a GPU-CPU system, which contains two steps: the buffering step and the computation step. In the buffering step, a large batch of data (i.e., a buffer) are loaded from the hard drive to the graphical memory of G… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  19. arXiv:2310.17153  [pdf, other

    cs.LG stat.ME

    Hierarchical Semi-Implicit Variational Inference with Application to Diffusion Model Acceleration

    Authors: Longlin Yu, Tianyu Xie, Yu Zhu, Tong Yang, Xiangyu Zhang, Cheng Zhang

    Abstract: Semi-implicit variational inference (SIVI) has been introduced to expand the analytical variational families by defining expressive semi-implicit distributions in a hierarchical manner. However, the single-layer architecture commonly used in current SIVI methods can be insufficient when the target posterior has complicated structures. In this paper, we propose hierarchical semi-implicit variationa… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 25 pages, 13 figures, NeurIPS 2023

  20. arXiv:2310.08843  [pdf

    stat.AP

    A Longitudinal Analysis about the Effect of Air Pollution on Astigmatism for Children and Young Adults

    Authors: Lin An, Qiuyue Hu, Jieying Guan, Yingting Zhu, Chenyao Jiang, Xiaoyun Zhong, Shuyue Ma, Dongmei Yu, Canyang Zhang, Yehong Zhuo, Peiwu Qin

    Abstract: Purpose: This study aimed to investigate the correlation between air pollution and astigmatism, considering the detrimental effects of air pollution on respiratory, cardiovascular, and eye health. Methods: A longitudinal study was conducted with 127,709 individuals aged 4-27 years from 9 cities in Guangdong Province, China, spanning from 2019 to 2021. Astigmatism was measured using cylinder values… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  21. arXiv:2310.03218  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Energy-Based Prior Model with Diffusion-Amortized MCMC

    Authors: Peiyu Yu, Yaxuan Zhu, Sirui Xie, Xiaojian Ma, Ruiqi Gao, Song-Chun Zhu, Ying Nian Wu

    Abstract: Latent space Energy-Based Models (EBMs), also known as energy-based priors, have drawn growing interests in the field of generative modeling due to its flexibility in the formulation and strong modeling power of the latent space. However, the common practice of learning latent space EBMs with non-convergent short-run MCMC for prior and posterior sampling is hindering the model from further progres… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  22. arXiv:2309.08335  [pdf, other

    stat.ME

    A Multi-Companion Method to Periodically Integrated Autoregressive Models

    Authors: Yueyun Zhu, Georgi N. Boshnakov

    Abstract: There has been an enormous interest in analysing and modelling periodic time series. The research on periodically integrated autoregressive (PIAR) models which capture the periodic structure and the presence of unit roots is widely applied in environmental, financial and energy areas. In this paper, we propose a multi-companion method which uses the eigen information of the multi-companion matrix… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  23. arXiv:2309.05153  [pdf, other

    stat.ML cs.LG

    Learning Energy-Based Models by Cooperative Diffusion Recovery Likelihood

    Authors: Yaxuan Zhu, Jianwen Xie, Yingnian Wu, Ruiqi Gao

    Abstract: Training energy-based models (EBMs) on high-dimensional data can be both challenging and time-consuming, and there exists a noticeable gap in sample quality between EBMs and other generative frameworks like GANs and diffusion models. To close this gap, inspired by the recent efforts of learning EBMs by maximizing diffusion recovery likelihood (DRL), we propose cooperative diffusion recovery likeli… ▽ More

    Submitted 18 April, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

  24. arXiv:2308.15709  [pdf, other

    cs.LG cs.CR cs.GT stat.ML

    Threshold KNN-Shapley: A Linear-Time and Privacy-Friendly Approach to Data Valuation

    Authors: Jiachen T. Wang, Yuqing Zhu, Yu-Xiang Wang, Ruoxi Jia, Prateek Mittal

    Abstract: Data valuation aims to quantify the usefulness of individual data sources in training machine learning (ML) models, and is a critical aspect of data-centric ML research. However, data valuation faces significant yet frequently overlooked privacy challenges despite its importance. This paper studies these challenges with a focus on KNN-Shapley, one of the most practical data valuation methods nowad… ▽ More

    Submitted 25 November, 2023; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: NeurIPS 2023 Spotlight

  25. arXiv:2308.11838  [pdf, other

    cs.LG cs.AI stat.ML

    A Benchmark Study on Calibration

    Authors: Linwei Tao, Younan Zhu, Haolan Guo, Min**g Dong, Chang Xu

    Abstract: Deep neural networks are increasingly utilized in various machine learning tasks. However, as these models grow in complexity, they often face calibration issues, despite enhanced prediction accuracy. Many studies have endeavored to improve calibration performance through the use of specific loss functions, data preprocessing and training frameworks. Yet, investigations into calibration properties… ▽ More

    Submitted 22 March, 2024; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: ICLR 2024 poster

  26. arXiv:2308.00894  [pdf, other

    cs.IR cs.LG stat.ME

    User-Controllable Recommendation via Counterfactual Retrospective and Prospective Explanations

    Authors: Juntao Tan, Yingqiang Ge, Yan Zhu, Yinglong Xia, Jiebo Luo, Jianchao Ji, Yongfeng Zhang

    Abstract: Modern recommender systems utilize users' historical behaviors to generate personalized recommendations. However, these systems often lack user controllability, leading to diminished user satisfaction and trust in the systems. Acknowledging the recent advancements in explainable recommender systems that enhance users' understanding of recommendation mechanisms, we propose leveraging these advancem… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: Accepted for presentation at 26th European Conference on Artificial Intelligence (ECAI2023)

  27. arXiv:2307.15205  [pdf, other

    stat.ME math.ST

    Robust graph-based methods for overcoming the curse of dimensionality

    Authors: Yejiong Zhu, Hao Chen

    Abstract: Graph-based two-sample tests and graph-based change-point detection that utilize a similarity graph provide a powerful tool for analyzing high-dimensional and non-Euclidean data as these methods do not impose distributional assumptions on data and have good performance across various scenarios. Current graph-based tests that deliver efficacy across a broad spectrum of alternatives typically reply… ▽ More

    Submitted 19 October, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

  28. arXiv:2306.06262  [pdf, other

    stat.ML cs.LG

    Spectral gap-based deterministic tensor completion

    Authors: Kameron Decker Harris, Oscar López, Angus Read, Yizhe Zhu

    Abstract: Tensor completion is a core machine learning algorithm used in recommender systems and other domains with missing data. While the matrix case is well-understood, theoretical results for tensor problems are limited, particularly when the sampling patterns are deterministic. Here we bound the generalization error of the solutions of two tensor completion methods, Poisson loss and atomic norm minimiz… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: In proceedings of SampTA 2023

  29. arXiv:2306.01253  [pdf, other

    stat.ML cs.LG

    Mixture Proportion Estimation Beyond Irreducibility

    Authors: Yilun Zhu, Aaron Fjeldsted, Darren Holland, George Landon, Azaree Lintereur, Clayton Scott

    Abstract: The task of mixture proportion estimation (MPE) is to estimate the weight of a component distribution in a mixture, given observations from both the component and mixture. Previous work on MPE adopts the irreducibility assumption, which ensures identifiablity of the mixture proportion. In this paper, we propose a more general sufficient condition that accommodates several settings of interest wher… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:42962-42982, 2023

  30. arXiv:2305.08457  [pdf, other

    cs.LG stat.ML

    MolHF: A Hierarchical Normalizing Flow for Molecular Graph Generation

    Authors: Yiheng Zhu, Zhenqiu Ouyang, Ben Liao, Jialu Wu, Yixuan Wu, Chang-Yu Hsieh, Tingjun Hou, Jian Wu

    Abstract: Molecular de novo design is a critical yet challenging task in scientific fields, aiming to design novel molecular structures with desired property profiles. Significant progress has been made by resorting to generative models for graphs. However, limited attention is paid to hierarchical generative models, which can exploit the inherent hierarchical structure (with rich semantic information) of t… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: IJCAI 2023

  31. arXiv:2305.02894  [pdf, other

    cs.LG math.AP math.OC stat.ML

    FedCBO: Reaching Group Consensus in Clustered Federated Learning through Consensus-based Optimization

    Authors: Jose A. Carrillo, Nicolas Garcia Trillos, Sixu Li, Yuhua Zhu

    Abstract: Federated learning is an important framework in modern machine learning that seeks to integrate the training of learning models from multiple users, each user having their own local data set, in a way that is sensitive to data privacy and to communication loss constraints. In clustered federated learning, one assumes an additional unknown group structure among users, and the goal is to train model… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

  32. arXiv:2305.00164  [pdf, other

    math.ST stat.ME

    Estimation and inference for minimizer and minimum of convex functions: optimality, adaptivity and uncertainty principles

    Authors: T. Tony Cai, Ran Chen, Yuancheng Zhu

    Abstract: Optimal estimation and inference for both the minimizer and minimum of a convex regression function under the white noise and nonparametric regression models are studied in a nonasymptotic local minimax framework, where the performance of a procedure is evaluated at individual functions. Fully adaptive and computationally efficient algorithms are proposed and sharp minimax lower bounds are given f… ▽ More

    Submitted 9 March, 2024; v1 submitted 29 April, 2023; originally announced May 2023.

    Journal ref: Ann. Statist. 52(1): 392-411 (February 2024)

  33. arXiv:2304.11625  [pdf, ps, other

    cs.AI stat.ML

    Meaningful Causal Aggregation and Paradoxical Confounding

    Authors: Yuchen Zhu, Kailash Budhathoki, Jonas Kuebler, Dominik Janzing

    Abstract: In aggregated variables the impact of interventions is typically ill-defined because different micro-realizations of the same macro-intervention can result in different changes of downstream macro-variables. We show that this ill-definedness of causality on aggregated variables can turn unconfounded causal relations into confounded ones and vice versa, depending on the respective micro-realization… ▽ More

    Submitted 22 February, 2024; v1 submitted 23 April, 2023; originally announced April 2023.

    Comments: CLeaR 2024

  34. arXiv:2304.09754  [pdf, other

    stat.AP

    Characterizing Alzheimer's Disease Biomarker Cascade Through Non-linear Mixed Effect Models

    Authors: Zhuojun Tang, Yuxin Zhu, Zheyu Wang

    Abstract: Alzheimer's Disease (AD) research has shifted to focus on biomarker trajectories and their potential use in understanding the underlying AD-related pathological process. A conceptual framework was proposed in such modern AD research that hypothesized biomarker cascades as a result of underlying AD pathology. In this paper, we leverage the idea of biomarker cascades and develop methods that use a n… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: 28 pages, 2 figures, 3 tables

  35. arXiv:2304.07918  [pdf, other

    cs.CV stat.ML

    Likelihood-Based Generative Radiance Field with Latent Space Energy-Based Model for 3D-Aware Disentangled Image Representation

    Authors: Yaxuan Zhu, Jianwen Xie, ** Li

    Abstract: We propose the NeRF-LEBM, a likelihood-based top-down 3D-aware 2D image generative model that incorporates 3D representation via Neural Radiance Fields (NeRF) and 2D imaging process via differentiable volume rendering. The model represents an image as a rendering process from 3D object to 2D image and is conditioned on some latent variables that account for object characteristics and are assumed t… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

  36. arXiv:2304.06292  [pdf, ps, other

    cs.LG stat.AP stat.ME

    Improved Naive Bayes with Mislabeled Data

    Authors: Qianhan Zeng, Yingqiu Zhu, Xuening Zhu, Feifei Wang, Weichen Zhao, Shuning Sun, Meng Su, Hansheng Wang

    Abstract: Labeling mistakes are frequently encountered in real-world applications. If not treated well, the labeling mistakes can deteriorate the classification performances of a model seriously. To address this issue, we propose an improved Naive Bayes method for text classification. It is analytically simple and free of subjective judgements on the correct and incorrect labels. By specifying the generatin… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

  37. arXiv:2302.11971   

    stat.CO cs.DS cs.LG math.OC

    Efficiently handling constraints with Metropolis-adjusted Langevin algorithm

    Authors: **yuan Chang, Cheng Yong Tang, Yuanzheng Zhu

    Abstract: In this study, we investigate the performance of the Metropolis-adjusted Langevin algorithm in a setting with constraints on the support of the target distribution. We provide a rigorous analysis of the resulting Markov chain, establishing its convergence and deriving an upper bound for its mixing time. Our results demonstrate that the Metropolis-adjusted Langevin algorithm is highly effective in… ▽ More

    Submitted 14 May, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: We find some error in the proof of Theorem 2 and the associated result may not be correct

  38. arXiv:2302.04611  [pdf, other

    cs.LG cs.AI q-bio.QM stat.ML

    A Text-guided Protein Design Framework

    Authors: Shengchao Liu, Yan**g Li, Zhuoxinran Li, Anthony Gitter, Yutao Zhu, Jiarui Lu, Zhao Xu, Weili Nie, Arvind Ramanathan, Chaowei Xiao, Jian Tang, Hongyu Guo, Anima Anandkumar

    Abstract: Current AI-assisted protein design mainly utilizes protein sequential and structural information. Meanwhile, there exists tremendous knowledge curated by humans in the text format describing proteins' high-level functionalities. Yet, whether the incorporation of such text data can help protein design tasks has not been explored. To bridge this gap, we propose ProteinDT, a multi-modal framework tha… ▽ More

    Submitted 3 December, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

  39. arXiv:2302.04040  [pdf, other

    cs.LG stat.ML

    Sample-efficient Multi-objective Molecular Optimization with GFlowNets

    Authors: Yiheng Zhu, Jialu Wu, Chaowen Hu, Jiahuan Yan, Chang-Yu Hsieh, Tingjun Hou, Jian Wu

    Abstract: Many crucial scientific problems involve designing novel molecules with desired properties, which can be formulated as a black-box optimization problem over the discrete chemical space. In practice, multiple conflicting objectives and costly evaluations (e.g., wet-lab experiments) make the diversity of candidates paramount. Computational methods have achieved initial success but still struggle wit… ▽ More

    Submitted 2 November, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: NeurIPS 2023

  40. arXiv:2301.11604   

    cs.LG eess.SY stat.AP

    A critical look at deep neural network for dynamic system modeling

    Authors: **ming Zhou, Yucai Zhu

    Abstract: Neural network models become increasingly popular as dynamic modeling tools in the control community. They have many appealing features including nonlinear structures, being able to approximate any functions. While most researchers hold optimistic attitudes towards such models, this paper questions the capability of (deep) neural networks for the modeling of dynamic systems using input-output data… ▽ More

    Submitted 20 October, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: The failure of NARX model modeling a noiseless LTI system is mainly due to some initilization issues with the current Matlab SYSID Toolbox. If this procedure is done purely in the Neural Network Toolbox, the situation can be improved for great extent

  41. Prioritizing Variables for Observational Study Design using the Joint Variable Importance Plot

    Authors: Lauren D. Liao, Yeyi Zhu, Amanda L. Ngo, Rana F. Chehab, Samuel D. Pimentel

    Abstract: Observational studies of treatment effects require adjustment for confounding variables. However, causal inference methods typically cannot deliver perfect adjustment on all measured baseline variables, and there is often ambiguity about which variables should be prioritized. Standard prioritization methods based on treatment imbalance alone neglect variables' relationships with the outcome. We pr… ▽ More

    Submitted 15 February, 2024; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: 30 pages, 2 figures

    Journal ref: The American Statistician, 2024, p. 1-17

  42. arXiv:2301.09300  [pdf, other

    stat.ML cs.LG

    A Tale of Two Latent Flows: Learning Latent Space Normalizing Flow with Short-run Langevin Flow for Approximate Inference

    Authors: Jianwen Xie, Yaxuan Zhu, Yifei Xu, Dingcheng Li, ** Li

    Abstract: We study a normalizing flow in the latent space of a top-down generator model, in which the normalizing flow model plays the role of the informative prior model of the generator. We propose to jointly learn the latent space normalizing flow prior model and the top-down generator model by a Markov chain Monte Carlo (MCMC)-based maximum likelihood algorithm, where a short-run Langevin sampling from… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

    Comments: The Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI) 2023

  43. arXiv:2301.05135  [pdf, ps, other

    math.ST stat.ME

    On Existence Theorems for Conditional Inferential Models

    Authors: Rongrong Zhang, Michael Y. Zhu, Chuanhai Liu

    Abstract: The framework of Inferential Models (IMs) has recently been developed in search of what is referred to as the holy grail of statistical theory, that is, prior-free probabilistic inference. Its method of Conditional IMs (CIMs) is a critical component in that it serves as a desirable extension of the Bayes theorem for combining information when no prior distribution is available. The general form of… ▽ More

    Submitted 12 January, 2023; originally announced January 2023.

  44. arXiv:2211.14692  [pdf, other

    math.ST stat.ME

    Radial Neighbors for Provably Accurate Scalable Approximations of Gaussian Processes

    Authors: Yichen Zhu, Michele Peruzzi, Cheng Li, David B. Dunson

    Abstract: In geostatistical problems with massive sample size, Gaussian processes can be approximated using sparse directed acyclic graphs to achieve scalable $O(n)$ computational complexity. In these models, data at each location are typically assumed conditionally dependent on a small set of parents which usually include a subset of the nearest neighbors. These methodologies often exhibit excellent empiri… ▽ More

    Submitted 20 June, 2024; v1 submitted 26 November, 2022; originally announced November 2022.

  45. arXiv:2211.06077  [pdf, other

    math.ST cs.LG math.PR stat.ML

    Overparameterized random feature regression with nearly orthogonal data

    Authors: Zhichao Wang, Yizhe Zhu

    Abstract: We investigate the properties of random feature ridge regression (RFRR) given by a two-layer neural network with random Gaussian initialization. We study the non-asymptotic behaviors of the RFRR with nearly orthogonal deterministic unit-length input data vectors in the overparameterized regime, where the width of the first layer is much larger than the sample size. Our analysis shows high-probabil… ▽ More

    Submitted 13 August, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: 39 pages. A condition on the activation function is added in Assumption 2.2

  46. arXiv:2210.08367  [pdf, ps, other

    cs.LG stat.ML

    Active Learning with Neural Networks: Insights from Nonparametric Statistics

    Authors: Yinglun Zhu, Robert Nowak

    Abstract: Deep neural networks have great representation power, but typically require large numbers of training examples. This motivates deep active learning methods that can significantly reduce the amount of labeled training data. Empirical successes of deep active learning have been recently reported in the literature, however, rigorous label complexity guarantees of deep active learning have remained el… ▽ More

    Submitted 15 October, 2022; originally announced October 2022.

    Comments: To appear at NeurIPS 2022

  47. arXiv:2210.07513  [pdf, ps, other

    math.OC cs.LG math.NA stat.ML

    Continuous-in-time Limit for Bayesian Bandits

    Authors: Yuhua Zhu, Zachary Izzo, Lexing Ying

    Abstract: This paper revisits the bandit problem in the Bayesian setting. The Bayesian approach formulates the bandit problem as an optimization problem, and the goal is to find the optimal policy which minimizes the Bayesian regret. One of the main challenges facing the Bayesian approach is that computation of the optimal policy is often intractable, especially when the length of the problem horizon or the… ▽ More

    Submitted 29 September, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

  48. arXiv:2208.12035  [pdf, ps, other

    cs.LG cs.AI stat.ME

    Seamless Tracking of Group Targets and Ungrouped Targets Using Belief Propagation

    Authors: Xuqi Zhang, Fanqin Meng, Haiqi Liu, Xiao**g Shen, Yunmin Zhu

    Abstract: This paper considers the problem of tracking a large-scale number of group targets. Usually, multi-target in most tracking scenarios are assumed to have independent motion and are well-separated. However, for group target tracking (GTT), the targets within groups are closely spaced and move in a coordinated manner, the groups can split or merge, and the numbers of targets in groups may be large, w… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

  49. arXiv:2207.05849  [pdf, other

    cs.LG stat.ML

    Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action Spaces

    Authors: Yinglun Zhu, Paul Mineiro

    Abstract: Designing efficient general-purpose contextual bandit algorithms that work with large -- or even continuous -- action spaces would facilitate application to important scenarios such as information retrieval, recommendation systems, and continuous control. While obtaining standard regret guarantees can be hopeless, alternative regret notions have been proposed to tackle the large action setting. We… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: To appear at ICML 2022

  50. arXiv:2207.05836  [pdf, other

    cs.LG stat.ML

    Contextual Bandits with Large Action Spaces: Made Practical

    Authors: Yinglun Zhu, Dylan J. Foster, John Langford, Paul Mineiro

    Abstract: A central problem in sequential decision making is to develop algorithms that are practical and computationally efficient, yet support the use of flexible, general-purpose models. Focusing on the contextual bandit problem, recent progress provides provably efficient algorithms with strong empirical performance when the number of possible alternatives ("actions") is small, but guarantees for decisi… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: To appear at ICML 2022