Showing 1–50 of 302 results for author: Yang, J

Searching in archive stat.
  1. arXiv:2407.02689  [pdf, ps, other]

    cs.LG cs.DC math.OC stat.ML

    Accelerating Distributed Optimization: A Primal-Dual Perspective on Local Steps

    Authors: Junchi Yang, Murat Yildirim, Qiu Feng

    Abstract: In distributed machine learning, efficient training across multiple agents with different data distributions poses significant challenges. Even with a centralized coordinator, current algorithms that achieve optimal communication complexity typically require either large minibatches or compromise on gradient complexity. In this work, we tackle both centralized and decentralized settings across str…

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2406.12588  [pdf, other]

    cs.LG cs.AI cs.CR stat.ML

    UIFV: Data Reconstruction Attack in Vertical Federated Learning

    Authors: Jirui Yang, Peng Chen, Zhihui Lu, Qiang Duan, Yubing Bao

    Abstract: Vertical Federated Learning (VFL) facilitates collaborative machine learning without the need for participants to share raw private data. However, recent studies have revealed privacy risks where adversaries might reconstruct sensitive features through data leakage during the learning process. Although data reconstruction methods based on gradient or model information are somewhat effective, they…

    Submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2406.10262  [pdf, other]

    cs.IR cs.AI math.OC stat.CO

    Fast solution to the fair ranking problem using the Sinkhorn algorithm

    Authors: Yuki Uehara, Shunnosuke Ikeda, Naoki Nishimura, Koya Ohashi, Yilin Li, Jie Yang, Deddy Jobson, Xingxia Zha, Takeshi Matsumoto, Noriyoshi Sukegawa, Yuichi Takano

    Abstract: In two-sided marketplaces such as online flea markets, recommender systems for providing consumers with personalized item rankings play a key role in promoting transactions between providers and consumers. Meanwhile, two-sided marketplaces face the problem of balancing consumer satisfaction and fairness among items to stimulate activity of item providers. Saito and Joachims (2022) devised an impac…

    Submitted 10 June, 2024; originally announced June 2024.

  4. arXiv:2406.09253  [pdf, other]

    stat.ML cs.LG

    Deep Sketched Output Kernel Regression for Structured Prediction

    Authors: Tamim El Ahmad, Junjie Yang, Pierre Laforgue, Florence d'Alché-Buc

    Abstract: By leveraging the kernel trick in the output space, kernel-induced losses provide a principled way to define structured output prediction tasks for a wide variety of output modalities. In particular, they have been successfully used in the context of surrogate non-parametric regression, where the kernel trick is typically exploited in the input space as well. However, when inputs are images or tex…

    Submitted 13 June, 2024; originally announced June 2024.

  5. arXiv:2405.13785  [pdf, other]

    cs.LG cs.AI math.PR stat.ML

    Efficient Two-Stage Gaussian Process Regression Via Automatic Kernel Search and Subsampling

    Authors: Shifan Zhao, Jiaying Lu, Ji Yang, Edmond Chow, Yuanzhe Xi

    Abstract: Gaussian Process Regression (GPR) is widely used in statistics and machine learning for prediction tasks requiring uncertainty measures. Its efficacy depends on the appropriate specification of the mean function, covariance kernel function, and associated hyperparameters. Severe misspecifications can lead to inaccurate results and problematic consequences, especially in safety-critical application…

    Submitted 22 May, 2024; originally announced May 2024.

    ACM Class: G.3; J.3

  6. arXiv:2405.11046  [pdf, other]

    stat.AP

    Temporal and spatial downscaling for solar radiation

    Authors: Maggie Bailey, Doug Nychka, Manajit Sengupta, Jaemo Yang, Soutir Bandyopadhyay

    Abstract: Global and regional climate model projections are useful for gauging future patterns of climate variables, including solar radiation, but data from these models is often too coarse to assess local impacts. Within the context of solar radiation, the changing climate may have an effect on photovoltaic (PV) production, especially as the PV industry moves to extend plant lifetimes to 50 years. Predict…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 35 pages, 14 figures

  7. arXiv:2405.08631  [pdf, other]

    stat.CO cs.LG cs.MS cs.SE

    A Fast and Scalable Pathwise-Solver for Group Lasso and Elastic Net Penalized Regression via Block-Coordinate Descent

    Authors: James Yang, Trevor Hastie

    Abstract: We develop fast and scalable algorithms based on block-coordinate descent to solve the group lasso and the group elastic net for generalized linear models along a regularization path. Special attention is given when the loss is the usual least squares loss (Gaussian loss). We show that each block-coordinate update can be solved efficiently using Newton's method and further improved using an adapti…

    Submitted 14 May, 2024; originally announced May 2024.

  8. arXiv:2405.07791  [pdf, ps, other]

    cs.LG cs.DC stat.ML

    Decentralized Kernel Ridge Regression Based on Data-dependent Random Feature

    Authors: Ruikai Yang, Fan He, Mingzhen He, Jie Yang, Xiaolin Huang

    Abstract: Random feature (RF) has been widely used for node consistency in decentralized kernel ridge regression (KRR). Currently, the consistency is guaranteed by imposing constraints on coefficients of features, necessitating that the random features on different nodes are identical. However, in many applications, data on different nodes varies significantly on the number or distribution, which calls for…

    Submitted 13 May, 2024; originally announced May 2024.

  9. arXiv:2404.18980  [pdf, other]

    econ.GN physics.soc-ph stat.AP

    The Impact of COVID-19 on Co-authorship and Economics Scholars' Productivity

    Authors: Hanqiao Zhang, Joy D. Xiuyao Yang

    Abstract: The COVID-19 pandemic has disrupted traditional academic collaboration patterns, prompting a unique opportunity to analyze the influence of peer effects and coauthorship dynamics on research output. Using a novel dataset, this paper endeavors to make a first cut at investigating the role of peer effects on the productivity of economics scholars, measured by the number of publications, in both pre-…

    Submitted 29 April, 2024; originally announced April 2024.

  10. arXiv:2404.07457  [pdf, ps, other]

    math.ST stat.CO

    From Poisson Observations to Fitted Negative Binomial Distribution

    Authors: Yingying Yang, Niloufar Dousti Mousavi, Zhou Yu, Jie Yang

    Abstract: The Kolmogorov-Smirnov (KS) test has been widely used for testing whether a random sample comes from a specific distribution, possibly with estimated parameters. If the data come from a Poisson distribution, however, one can hardly tell that they do not come from a negative binomial distribution by running a KS test, even with a large sample size. In this paper, we rigorously justify that the KS t…

    Submitted 10 April, 2024; originally announced April 2024.

  11. arXiv:2404.01730  [pdf, other]

    cs.LG cs.IT stat.ML

    Asymptotics of Language Model Alignment

    Authors: Joy Qiping Yang, Salman Salamatian, Ziteng Sun, Ananda Theertha Suresh, Ahmad Beirami

    Abstract: Let $p$ denote a generative language model. Let $r$ denote a reward model that returns a scalar that captures the degree at which a draw from $p$ is preferred. The goal of language model alignment is to alter $p$ to a new distribution $φ$ that results in a higher expected reward while keeping $φ$ close to $p.$ A popular alignment method is the KL-constrained reinforcement learning (RL), which choo…

    Submitted 2 April, 2024; originally announced April 2024.

  12. arXiv:2402.15127  [pdf, other]

    cs.LG cs.IT stat.ML

    Multi-Armed Bandits with Abstention

    Authors: Junwen Yang, Tianyuan Jin, Vincent Y. F. Tan

    Abstract: We introduce a novel extension of the canonical multi-armed bandit problem that incorporates an additional strategic element: abstention. In this enhanced framework, the agent is not only tasked with selecting an arm at each time step, but also has the option to abstain from accepting the stochastic instantaneous reward before observing it. When opting for abstention, the agent either suffers a fi…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: Preprint

  13. arXiv:2402.09723  [pdf, other]

    stat.ML cs.AI cs.CL cs.LG

    Efficient Prompt Optimization Through the Lens of Best Arm Identification

    Authors: Chengshuai Shi, Kun Yang, Zihan Chen, Jundong Li, Jing Yang, Cong Shen

    Abstract: The remarkable instruction-following capability of large language models (LLMs) has sparked a growing interest in automatically finding good prompts, i.e., prompt optimization. Most existing works follow the scheme of selecting from a pre-generated pool of candidate prompts. However, these designs mainly focus on the generation strategy, while limited attention has been paid to the selection metho…

    Submitted 30 May, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  14. arXiv:2402.04691  [pdf, ps, other]

    stat.ML cs.LG math.FA math.ST

    Learning Operators with Stochastic Gradient Descent in General Hilbert Spaces

    Authors: Lei Shi, Jia-Qi Yang

    Abstract: This study investigates leveraging stochastic gradient descent (SGD) to learn operators between general Hilbert spaces. We propose weak and strong regularity conditions for the target operator to depict its intrinsic structure and complexity. Under these conditions, we establish upper bounds for convergence rates of the SGD algorithm and conduct a minimax lower bound analysis, further illustrating…

    Submitted 13 February, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: 56 pages

  15. arXiv:2402.02949  [pdf, other]

    cs.LG stat.ML

    Kernel PCA for Out-of-Distribution Detection

    Authors: Kun Fang, Qinghua Tao, Kexin Lv, Mingzhen He, Xiaolin Huang, Jie Yang

    Abstract: Out-of-Distribution (OoD) detection is vital for the reliability of Deep Neural Networks (DNNs). Existing works have shown the insufficiency of Principal Component Analysis (PCA) straightforwardly applied on the features of DNNs in detecting OoD data from In-Distribution (InD) data. The failure of PCA suggests that the network features residing in OoD and InD are not well separated by simply proce…

    Submitted 5 February, 2024; originally announced February 2024.

  16. arXiv:2402.01460  [pdf, other]

    stat.ML cs.LG

    Deep conditional distribution learning via conditional Föllmer flow

    Authors: Jinyuan Chang, Zhao Ding, Yuling Jiao, Ruoxuan Li, Jerry Zhijian Yang

    Abstract: We introduce an ordinary differential equation (ODE) based deep generative method for learning conditional distributions, named Conditional Föllmer Flow. Starting from a standard Gaussian distribution, the proposed flow could approximate the target conditional distribution very well when the time is close to 1. For effective implementation, we discretize the flow with Euler's method where we estim…

    Submitted 13 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: The original title of this paper is "Deep Conditional Generative Learning: Model and Error Analysis"

  17. arXiv:2401.06403  [pdf, other]

    stat.ME math.ST

    Fourier analysis of spatial point processes

    Authors: Junho Yang, Yongtao Guan

    Abstract: In this article, we develop comprehensive frequency domain methods for estimating and inferring the second-order structure of spatial point processes. The main element here is on utilizing the discrete Fourier transform (DFT) of the point pattern and its tapered counterpart. Under second-order stationarity, we show that both the DFTs and the tapered DFTs are asymptotically jointly independent Gaus…

    Submitted 12 January, 2024; originally announced January 2024.

  18. arXiv:2401.04535  [pdf, other]

    stat.ML cs.LG

    Semi-Supervised Deep Sobolev Regression: Estimation, Variable Selection and Beyond

    Authors: Zhao Ding, Chenguang Duan, Yuling Jiao, Jerry Zhijian Yang

    Abstract: We propose SDORE, a semi-supervised deep Sobolev regressor, for the nonparametric estimation of the underlying regression function and its gradient. SDORE employs deep neural networks to minimize empirical risk with gradient norm regularization, allowing computation of the gradient norm on unlabeled data. We conduct a comprehensive analysis of the convergence rates of SDORE and establish a minimax…

    Submitted 9 January, 2024; originally announced January 2024.

    MSC Class: 62G05; 62G08; 65N21

  19. arXiv:2401.02529  [pdf, other]

    stat.ME

    Simulation-based transition density approximation for the inference of SDE models

    Authors: Xin Cai, Jingyu Yang, Zhibao Li, Hongqiao Wang, Miao Huang

    Abstract: Stochastic Differential Equations (SDEs) serve as a powerful modeling tool in various scientific domains, including systems science, engineering, and ecological science. While the specific form of SDEs is typically known for a given problem, certain model parameters remain unknown. Efficiently inferring these unknown parameters based on observations of the state in discrete time series represents…

    Submitted 25 February, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

    MSC Class: 62M20

  20. arXiv:2312.16260  [pdf, other]

    stat.ME

    Multinomial Link Models

    Authors: Tianmeng Wang, Liping Tong, Jie Yang

    Abstract: We propose a unified multinomial link model for analyzing categorical responses. It not only covers the existing multinomial logistic models and their extensions as special cases, but also includes new models that can incorporate the observations with NA or Unknown responses in the data analysis. We provide explicit formulae and detailed algorithms for finding the maximum likelihood estimates of t…

    Submitted 18 June, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: 39 pages, 5 figures

  21. arXiv:2312.15489  [pdf, other]

    cs.CY cs.IR cs.SI physics.soc-ph stat.AP

    Browsing behavior exposes identities on the Web

    Authors: Marcos Oliveira, Junran Yang, Daniel Griffiths, Denis Bonnay, Juhi Kulshrestha

    Abstract: How easy is it to uniquely identify a person based solely on their web browsing behavior? Here we show that when people navigate the Web, their online traces produce fingerprints that identify them. Merely the four most visited web domains are enough to identify 95% of the individuals. These digital fingerprints are stable and render high re-identifiability. We demonstrate that we can re-identify…

    Submitted 14 June, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

    Comments: 13 pages, 1 figure

  22. arXiv:2312.15023  [pdf, other]

    cs.LG stat.ML

    Federated Q-Learning: Linear Regret Speedup with Low Communication Cost

    Authors: Zhong Zheng, Fengyu Gao, Lingzhou Xue, Jing Yang

    Abstract: In this paper, we consider federated reinforcement learning for tabular episodic Markov Decision Processes (MDP) where, under the coordination of a central server, multiple agents collaboratively explore the environment and learn an optimal policy without sharing their raw data. While linear speedup in the number of agents has been achieved for some metrics, such as convergence rate and sample com…

    Submitted 7 May, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: 51 pages

  23. arXiv:2312.02447  [pdf, other]

    q-bio.BM stat.ML

    Fast non-autoregressive inverse folding with discrete diffusion

    Authors: John J. Yang, Jason Yim, Regina Barzilay, Tommi Jaakkola

    Abstract: Generating protein sequences that fold into an intended 3D structure is a fundamental step in de novo protein design. De facto methods utilize autoregressive generation, but this eschews higher order interactions that could be exploited to improve inference speed. We describe a non-autoregressive alternative that performs inference using a constant number of calls resulting in a 23 times speed up w…

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: NeurIPS Machine learning for Structural Biology workshop

  24. arXiv:2312.01260  [pdf, other]

    cs.LG cs.CR stat.ML

    Rethinking PGD Attack: Is Sign Function Necessary?

    Authors: Junjie Yang, Tianlong Chen, Xuxi Chen, Zhangyang Wang, Yingbin Liang

    Abstract: Neural networks have demonstrated success in various domains, yet their performance can be significantly degraded by even a small input perturbation. Consequently, the construction of such perturbations, known as adversarial attacks, has gained significant attention, many of which fall within "white-box" scenarios where we have full access to the neural network. Existing attack algorithms, such as…

    Submitted 20 May, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

  25. arXiv:2311.04408  [pdf, other]

    stat.AP

    Bayesian modelling of response to therapy and drug-sensitivity in acute lymphoblastic leukemia

    Authors: Andrea Cremaschi, Wenjian Yang, Maria De Iorio, William E. Evans, Jun J. Yang, Gary L. Rosner

    Abstract: Acute lymphoblastic leukemia (ALL) is a heterogeneous hematologic malignancy involving the abnormal proliferation of immature lymphocytes, accounting for most pediatric cancer cases. ALL management in children has seen great improvement in the last decades thanks to better understanding of the disease leading to improved treatment strategies evidenced through clinical trials. Commonly a first cour…

    Submitted 7 November, 2023; originally announced November 2023.

  26. arXiv:2311.03252  [pdf, other]

    math.OC cs.LG stat.ML

    Parameter-Agnostic Optimization under Relaxed Smoothness

    Authors: Florian Hübler, Junchi Yang, Xiang Li, Niao He

    Abstract: Tuning hyperparameters, such as the stepsize, presents a major challenge of training machine learning models. To address this challenge, numerous adaptive optimization algorithms have been developed that achieve near-optimal complexities, even when stepsizes are independent of problem-specific parameters, provided that the loss function is $L$-smooth. However, as the assumption is relaxed to the m…

    Submitted 6 November, 2023; originally announced November 2023.

  27. arXiv:2310.19360  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Balance, Imbalance, and Rebalance: Understanding Robust Overfitting from a Minimax Game Perspective

    Authors: Yifei Wang, Liangchen Li, Jiansheng Yang, Zhouchen Lin, Yisen Wang

    Abstract: Adversarial Training (AT) has become arguably the state-of-the-art algorithm for extracting robust features. However, researchers recently notice that AT suffers from severe robust overfitting problems, particularly after learning rate (LR) decay. In this paper, we explain this phenomenon by viewing adversarial training as a dynamic minimax game between the model trainer and the attacker. Specific…

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023

  28. arXiv:2310.17817  [pdf, other]

    stat.ML cs.AI cs.LG math.PR

    Bayesian imaging inverse problem with SA-Roundtrip prior via HMC-pCN sampler

    Authors: Jiayu Qian, Yuanyuan Liu, Jingya Yang, Qingping Zhou

    Abstract: Bayesian inference with deep generative prior has received considerable interest for solving imaging inverse problems in many scientific and engineering fields. The selection of the prior distribution is learned from, and therefore an important representation learning of, available prior measurements. The SA-Roundtrip, a novel deep generative prior, is introduced to enable controlled sampling gene…

    Submitted 24 October, 2023; originally announced October 2023.

  29. arXiv:2310.17759  [pdf, other]

    cs.LG math.OC stat.ML

    Optimal Guarantees for Algorithmic Reproducibility and Gradient Complexity in Convex Optimization

    Authors: Liang Zhang, Junchi Yang, Amin Karbasi, Niao He

    Abstract: Algorithmic reproducibility measures the deviation in outputs of machine learning algorithms upon minor changes in the training process. Previous work suggests that first-order methods would need to trade-off convergence rate (gradient complexity) for better reproducibility. In this work, we challenge this perception and demonstrate that both optimal reproducibility and near-optimal convergence gu…

    Submitted 9 January, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023 Spotlight

  30. arXiv:2310.16238  [pdf, other]

    stat.CO

    Efficient GPU-accelerated fitting of observational health-scaled stratified and time-varying Cox models

    Authors: Jianxiao Yang, Martijn J. Schuemie, Marc A. Suchard

    Abstract: The Cox proportional hazards model stands as a widely-used semi-parametric approach for survival analysis in medical research and many other fields. Numerous extensions of the Cox model have further expanded its versatility. Statistical computing challenges arise, however, when applying many of these extensions with the increasing complexity and volume of modern observational health datasets. To a…

    Submitted 24 October, 2023; originally announced October 2023.

  31. arXiv:2310.13550  [pdf, other]

    cs.LG stat.ML

    Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes

    Authors: Ruiquan Huang, Yuan Cheng, Jing Yang, Vincent Tan, Yingbin Liang

    Abstract: In multi-task reinforcement learning (RL) under Markov decision processes (MDPs), the presence of shared latent structures among multiple MDPs has been shown to yield significant benefits to the sample efficiency compared to single-task RL. In this paper, we investigate whether such a benefit can extend to more general sequential decision making problems, such as partially observable MDPs (POMDPs)…

    Submitted 20 October, 2023; originally announced October 2023.

  32. arXiv:2310.10393  [pdf, other]

    stat.ME

    Statistical and Causal Robustness for Causal Null Hypothesis Tests

    Authors: Junhui Yang, Rohit Bhattacharya, Youjin Lee, Ted Westling

    Abstract: Prior work applying semiparametric theory to causal inference has primarily focused on deriving estimators that exhibit statistical robustness under a prespecified causal model that permits identification of a desired causal parameter. However, a fundamental challenge is correct specification of such a model, which usually involves making untestable assumptions. Evidence factors is an approach to…

    Submitted 29 June, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

  33. arXiv:2310.06713  [pdf, other]

    cs.LG stat.AP

    Interpretable Traffic Event Analysis with Bayesian Networks

    Authors: Tong Yuan, Jian Yang, Zeyi Wen

    Abstract: Although existing machine learning-based methods for traffic accident analysis can provide good quality results to downstream tasks, they lack interpretability which is crucial for this critical problem. This paper proposes an interpretable framework based on Bayesian Networks for traffic accident prediction. To enable the ease of interpretability, we design a dataset construction pipeline to feed…

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 11 pages, 7 figures

    MSC Class: 62F15 ACM Class: G.3

  34. arXiv:2310.06333  [pdf, ps, other]

    cs.LG cs.DS math.PR math.ST stat.ML

    Learning bounded-degree polytrees with known skeleton

    Authors: Davin Choo, Joy Qiping Yang, Arnab Bhattacharyya, Clément L. Canonne

    Abstract: We establish finite-sample guarantees for efficient proper learning of bounded-degree polytrees, a rich class of high-dimensional probability distributions and a subclass of Bayesian networks, a widely-studied type of graphical model. Recently, Bhattacharyya et al. (2021) obtained finite-sample guarantees for recovering tree-structured Bayesian networks, i.e., 1-polytrees. We extend their results…

    Submitted 21 January, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Fixed some typos. Added some discussions. Accepted to ALT 2024

  35. arXiv:2310.00551  [pdf, other]

    math.NA math.PR stat.CO

    Derivative based global sensitivity analysis and its entropic link

    Authors: Jiannan Yang

    Abstract: Variance-based Sobol' sensitivity is one of the most well known measures in global sensitivity analysis (GSA). However, uncertainties with certain distributions, such as highly skewed distributions or those with a heavy tail, cannot be adequately characterised using the second central moment only. Entropy-based GSA can consider the entire probability density function, but its application has been…

    Submitted 9 May, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

    Comments: 17 pages, 4 figures, 8 tables

  36. arXiv:2309.16604  [pdf, other]

    stat.ML cs.LG

    Exploiting Edge Features in Graphs with Fused Network Gromov-Wasserstein Distance

    Authors: Junjie Yang, Matthieu Labeau, Florence d'Alché-Buc

    Abstract: Pairwise comparison of graphs is key to many applications in Machine learning ranging from clustering, kernel-based classification/regression and more recently supervised graph prediction. Distances between graphs usually rely on informative representations of these structured objects such as bag of substructures or other graph embeddings. A recently popular solution consists in representing graph…

    Submitted 28 September, 2023; originally announced September 2023.

  37. arXiv:2309.12658  [pdf, other]

    cs.LG stat.ML

    Neural Operator Variational Inference based on Regularized Stein Discrepancy for Deep Gaussian Processes

    Authors: Jian Xu, Shian Du, Junmei Yang, Qianli Ma, Delu Zeng

    Abstract: Deep Gaussian Process (DGP) models offer a powerful nonparametric approach for Bayesian inference, but exact inference is typically intractable, motivating the use of various approximations. However, existing approaches, such as mean-field Gaussian assumptions, limit the expressiveness and efficacy of DGP models, while stochastic approximation can be computationally expensive. To tackle these chal…

    Submitted 22 September, 2023; originally announced September 2023.

  38. arXiv:2309.09367  [pdf, other]

    stat.CO stat.ME

    ForLion: A New Algorithm for D-optimal Designs under General Parametric Statistical Models with Mixed Factors

    Authors: Yifei Huang, Keren Li, Abhyuday Mandal, Jie Yang

    Abstract: In this paper, we address the problem of designing an experimental plan with both discrete and continuous factors under fairly general parametric statistical models. We propose a new algorithm, named ForLion, to search for locally optimal approximate designs under the D-criterion. The algorithm performs an exhaustive search in a design space with mixed factors while keeping high efficiency and red…

    Submitted 22 May, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

    Comments: 36 pages, 7 tables, 5 figures

  39. arXiv:2309.09222  [pdf, other]

    cs.LG stat.ML

    Data-driven Modeling and Inference for Bayesian Gaussian Process ODEs via Double Normalizing Flows

    Authors: Jian Xu, Shian Du, Junmei Yang, Xinghao Ding, John Paisley, Delu Zeng

    Abstract: Recently, Gaussian processes have been used to model the vector field of continuous dynamical systems, referred to as GPODEs, which are characterized by a probabilistic ODE equation. Bayesian inference for these models has been extensively studied and applied in tasks such as time series prediction. However, the use of standard GPs with basic kernels like squared exponential kernels has been commo…

    Submitted 2 January, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

  40. arXiv:2308.16816  [pdf, other]

    stat.ME

    A General Equivalence Theorem for Crossover Designs under Generalized Linear Models

    Authors: Jeevan Jankar, Jie Yang, Abhyuday Mandal

    Abstract: With the help of Generalized Estimating Equations, we identify locally D-optimal crossover designs for generalized linear models. We adopt the variance of parameters of interest as the objective function, which is minimized using constrained optimization to obtain optimal crossover designs. In this case, the traditional general equivalence theorem could not be used directly to check the optimality…

    Submitted 7 September, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

  41. arXiv:2308.08858  [pdf, ps, other]

    cs.LG cs.AI cs.GT stat.ML

    Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games

    Authors: Songtao Feng, Ming Yin, Yu-Xiang Wang, Jing Yang, Yingbin Liang

    Abstract: The problem of two-player zero-sum Markov games has recently attracted increasing interests in theoretical studies of multi-agent reinforcement learning (RL). In particular, for finite-horizon episodic Markov decision processes (MDPs), it has been shown that model-based algorithms can find an $ε$-optimal Nash Equilibrium (NE) with the sample complexity of $O(H^3SAB/ε^2)$, which is optimal in the d…

    Submitted 5 June, 2024; v1 submitted 17 August, 2023; originally announced August 2023.

  42. arXiv:2307.09295  [pdf, other]

    cs.LG stat.ML

    Nested Elimination: A Simple Algorithm for Best-Item Identification from Choice-Based Feedback

    Authors: Junwen Yang, Yifan Feng

    Abstract: We study the problem of best-item identification from choice-based feedback. In this problem, a company sequentially and adaptively shows display sets to a population of customers and collects their choices. The objective is to identify the most preferred item with the least number of samples and at a high confidence level. We propose an elimination-based algorithm, namely Nested Elimination (NE),…

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: Accepted to ICML 2023

  43. arXiv:2307.00405  [pdf, ps, other]

    cs.LG stat.ML

    Provably Efficient UCB-type Algorithms For Learning Predictive State Representations

    Authors: Ruiquan Huang, Yingbin Liang, Jing Yang

    Abstract: The general sequential decision-making problem, which includes Markov decision processes (MDPs) and partially observable MDPs (POMDPs) as special cases, aims at maximizing a cumulative reward by making a sequence of decisions based on a history of observations and actions over time. Recent studies have shown that the sequential decision-making problem is statistically learnable if it admits a low-…

    Submitted 6 February, 2024; v1 submitted 1 July, 2023; originally announced July 2023.

    Comments: Accepted by ICLR 2024

  44. arXiv:2306.11058  [pdf, other]

    physics.flu-dyn physics.app-ph stat.AP

    Reciprocal hydrodynamic response estimation in a random spreading sea

    Authors: Jiannan Yang, Robin Langley, Richard Lines

    Abstract: Direct estimation of the hydrodynamic response of an offshore structure in a random spreading sea can lead to large computational costs. In this paper the actual spreading sea is replaced by an idealised diffuse wave field and the diffuse field reciprocity (DFR) relationship is derived analytically and verified against diffraction analysis for offshore application. The DFR approach provides an ana…

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: 5 figures; 1 table

  45. arXiv:2306.08364  [pdf, other]

    stat.ML cs.IT cs.LG

    Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources

    Authors: Chengshuai Shi, Wei Xiong, Cong Shen, Jing Yang

    Abstract: Existing theoretical studies on offline reinforcement learning (RL) mostly consider a dataset sampled directly from the target task. In practice, however, data often come from several heterogeneous but related sources. Motivated by this gap, this work aims at rigorously understanding offline RL with multiple datasets that are collected from randomly perturbed versions of the target task instead of…

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: ICML 2023

  46. arXiv:2306.08280  [pdf, other]

    cs.IT cs.CR cs.LG eess.SP stat.ML

    Differentially Private Wireless Federated Learning Using Orthogonal Sequences

    Authors: Xizixiang Wei, Tianhao Wang, Ruiquan Huang, Cong Shen, Jing Yang, H. Vincent Poor

    Abstract: We propose a privacy-preserving uplink over-the-air computation (AirComp) method, termed FLORAS, for single-input single-output (SISO) wireless federated learning (FL) systems. From the perspective of communication designs, FLORAS eliminates the requirement of channel state information at the transmitters (CSIT) by leveraging the properties of orthogonal sequences. From the privacy perspective, we…

    Submitted 21 November, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: 33 pages, 5 figures

  47. arXiv:2306.07652  [pdf]

    stat.AP q-bio.TO

    Inactivated COVID-19 Vaccination did not affect In vitro fertilization (IVF) / Intra-Cytoplasmic Sperm Injection (ICSI) cycle outcomes

    Authors: Qi Wan, Ying Ling Yao, XingYu Lv, Li Hong Geng, Yue Wang, Enoch Appiah Adu-Gyamfi, Xue Jiao Wang, Yue Qian, Juan Yang, Ming Xing Chend, Zhao Hui Zhong, Yuan Li, Yu Bin Ding

    Abstract: Background: The objective of this study is to evaluate the impact of COVID-19 inactivated vaccine administration on the outcomes of in vitro fertilization (IVF) and intracytoplasmic sperm injection (ICSI) cycles in infertile couples in China. Methods: We collected data from the CYART prospective cohort, which included couples undergoing IVF treatment from January 2021 to September 2022 at Sichuan…

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: 26 pages, 4 figures and 5 tables

  48. arXiv:2306.07464  [pdf, other]

    cs.AI cs.LG stat.ML

    Unlocking Sales Growth: Account Prioritization Engine with Explainable AI

    Authors: Suvendu Jena, Jilei Yang, Fangfang Tan

    Abstract: B2B sales requires effective prediction of customer growth, identification of upsell potential, and mitigation of churn risks. LinkedIn sales representatives traditionally relied on intuition and fragmented data signals to assess customer performance. This resulted in significant time investment in data understanding as well as strategy formulation and under-investment in active selling. To overco…

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: 9 pages, 11 figures, 2 tables

  49. arXiv:2306.06265  [pdf, other]

    cs.LG cs.IT stat.ML

    Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints

    Authors: Donghao Li, Ruiquan Huang, Cong Shen, Jing Yang

    Abstract: This paper investigates conservative exploration in reinforcement learning where the performance of the learning agent is guaranteed to be above a certain threshold throughout the learning process. It focuses on the tabular episodic Markov Decision Process (MDP) setting that has finite states and actions. With the knowledge of an existing safe baseline policy, an algorithm termed as StepMix is pro…

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted by ICML 2023

  50. arXiv:2306.05275  [pdf, ps, other]

    cs.LG cs.CR cs.IT stat.ML

    Federated Linear Contextual Bandits with User-level Differential Privacy

    Authors: Ruiquan Huang, Huanyu Zhang, Luca Melis, Milan Shen, Meisam Hajzinia, Jing Yang

    Abstract: This paper studies federated linear contextual bandits under the notion of user-level differential privacy (DP). We first introduce a unified federated bandits framework that can accommodate various definitions of DP in the sequential decision-making setting. We then formally introduce user-level central DP (CDP) and local DP (LDP) in the federated bandits framework, and investigate the fundamenta…

    Submitted 9 June, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: Accepted by ICML 2023