Skip to main content

Showing 1–50 of 5,384 results for author: Li

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.02671  [pdf, other

    stat.ME stat.AP

    When Do Natural Mediation Effects Differ from Their Randomized Interventional Analogues: Test and Theory

    Authors: Ang Yu, Li Ge, Felix Elwert

    Abstract: In causal mediation analysis, the natural direct and indirect effects (natural effects) are nonparametrically unidentifiable in the presence of treatment-induced confounding, which motivated the development of randomized interventional analogues (RIAs) of the natural effects. The RIAs are easier to identify and widely used in practice. Applied researchers often interpret RIA estimates as if they w… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2407.02010  [pdf, other

    stat.ML cs.LG

    Feynman-Kac Operator Expectation Estimator

    Authors: **gyuan Li, Wei Liu

    Abstract: The Feynman-Kac Operator Expectation Estimator (FKEE) is an innovative method for estimating the target Mathematical Expectation $\mathbb{E}_{X\sim P}[f(X)]$ without relying on a large number of samples, in contrast to the commonly used Markov Chain Monte Carlo (MCMC) Expectation Estimator. FKEE comprises diffusion bridge models and approximation of the Feynman-Kac operator. The key idea is to use… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  3. arXiv:2407.01763  [pdf, other

    stat.ME stat.CO

    A Cepstral Model for Efficient Spectral Analysis of Covariate-dependent Time Series

    Authors: Zeda Li, Yuexiao Dong

    Abstract: This article introduces a novel and computationally fast model to study the association between covariates and power spectra of replicated time series. A random covariate-dependent Cramér spectral representation and a semiparametric log-spectral model are used to quantify the association between the log-spectra and covariates. Each replicate-specific log-spectrum is represented by the cepstrum, in… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  4. arXiv:2407.01621  [pdf, other

    cs.LG q-bio.QM stat.ME stat.ML

    Deciphering interventional dynamical causality from non-intervention systems

    Authors: Jifan Shi, Yang Li, Juan Zhao, Siyang Leng, Kazuyuki Aihara, Luonan Chen, Wei Lin

    Abstract: Detecting and quantifying causality is a focal topic in the fields of science, engineering, and interdisciplinary studies. However, causal studies on non-intervention systems attract much attention but remain extremely challenging. To address this challenge, we propose a framework named Interventional Dynamical Causality (IntDC) for such non-intervention systems, along with its computational crite… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

  5. arXiv:2407.01607  [pdf, other

    cs.LG cs.IR stat.ML

    Multi-Epoch learning with Data Augmentation for Deep Click-Through Rate Prediction

    Authors: Zhongxiang Fan, Zhaocheng Liu, Jian Liang, Dongying Kong, Han Li, Peng Jiang, Shuang Li, Kun Gai

    Abstract: This paper investigates the one-epoch overfitting phenomenon in Click-Through Rate (CTR) models, where performance notably declines at the start of the second epoch. Despite extensive research, the efficacy of multi-epoch training over the conventional one-epoch approach remains unclear. We identify the overfitting of the embedding layer, caused by high-dimensional data sparsity, as the primary is… ▽ More

    Submitted 27 June, 2024; originally announced July 2024.

  6. arXiv:2407.01316  [pdf, other

    cs.LG cs.CY stat.ML

    Evaluating Model Performance Under Worst-case Subpopulations

    Authors: Mike Li, Hongseok Namkoong, Shangzhou Xia

    Abstract: The performance of ML models degrades when the training population is different from that seen under operation. Towards assessing distributional robustness, we study the worst-case performance of a model over all subpopulations of a given size, defined with respect to core attributes Z. This notion of robustness can consider arbitrary (continuous) attributes Z, and automatically accounts for compl… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Earlier version appeared in the proceedings of Advances in Neural Information Processing Systems 34 (NeurIPS 2021): https://proceedings.neurips.cc/paper_files/paper/2021/file/908075ea2c025c335f4865f7db427062-Paper.pdf

  7. arXiv:2407.01111  [pdf, other

    cs.LG cs.AI stat.ML

    Proximity Matters: Local Proximity Preserved Balancing for Treatment Effect Estimation

    Authors: Hao Wang, Zhichao Chen, Yuan Shen, Jiajun Fan, Zhaoran Liu, Degui Yang, Xinggao Liu, Haoxuan Li

    Abstract: Heterogeneous treatment effect (HTE) estimation from observational data poses significant challenges due to treatment selection bias. Existing methods address this bias by minimizing distribution discrepancies between treatment groups in latent space, focusing on global alignment. However, the fruitful aspect of local proximity, where similar units exhibit similar outcomes, is often overlooked. In… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Code is available at https://anonymous.4open.science/status/ncr-B697

  8. arXiv:2407.01079  [pdf, ps, other

    stat.ML cs.AI cs.LG

    On Statistical Rates and Provably Efficient Criteria of Latent Diffusion Transformers (DiTs)

    Authors: Jerry Yao-Chieh Hu, Weimin Wu, Zhuoru Li, Zhao Song, Han Liu

    Abstract: We investigate the statistical and computational limits of latent \textbf{Di}ffusion \textbf{T}ransformers (\textbf{DiT}s) under the low-dimensional linear latent space assumption. Statistically, we study the universal approximation and sample complexity of the DiTs score function, as well as the distribution recovery property of the initial data. Specifically, under mild data assumptions, we deri… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  9. arXiv:2407.00397  [pdf, other

    cs.LG stat.ML

    Markovian Gaussian Process: A Universal State-Space Representation for Stationary Temporal Gaussian Process

    Authors: Weihan Li, Yule Wang, Chengrui Li, Anqi Wu

    Abstract: Gaussian Processes (GPs) and Linear Dynamical Systems (LDSs) are essential time series and dynamic system modeling tools. GPs can handle complex, nonlinear dynamics but are computationally demanding, while LDSs offer efficient computation but lack the expressive power of GPs. To combine their benefits, we introduce a universal method that allows an LDS to mirror stationary temporal GPs. This state… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  10. arXiv:2406.18829  [pdf, other

    stat.ME stat.ML

    Full Information Linked ICA: addressing missing data problem in multimodal fusion

    Authors: Ruiyang Li, F. DuBois Bowman, Seonjoo Lee

    Abstract: Recent advances in multimodal imaging acquisition techniques have allowed us to measure different aspects of brain structure and function. Multimodal fusion, such as linked independent component analysis (LICA), is popularly used to integrate complementary information. However, it has suffered from missing data, commonly occurring in neuroimaging data. Therefore, in this paper, we propose a Full I… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 17 pages, 6 figures

  11. arXiv:2406.18189  [pdf, other

    stat.ME math.ST

    Functional knockoffs selection with applications to functional data analysis in high dimensions

    Authors: Xinghao Qiao, Mingya Long, Qizhai Li

    Abstract: The knockoffs is a recently proposed powerful framework that effectively controls the false discovery rate (FDR) for variable selection. However, none of the existing knockoff solutions are directly suited to handle multivariate or high-dimensional functional data, which has become increasingly prevalent in various scientific applications. In this paper, we propose a novel functional model-X knock… ▽ More

    Submitted 27 June, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

  12. arXiv:2406.18137  [pdf, ps, other

    stat.ML cs.LG

    Sparse deep neural networks for nonparametric estimation in high-dimensional sparse regression

    Authors: Dongya Wu, Xin Li

    Abstract: Generalization theory has been established for sparse deep neural networks under high-dimensional regime. Beyond generalization, parameter estimation is also important since it is crucial for variable selection and interpretability of deep neural networks. Current theoretical studies concerning parameter estimation mainly focus on two-layer neural networks, which is due to the fact that the conver… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  13. arXiv:2406.17863  [pdf, other

    cs.AI stat.ML

    What type of inference is planning?

    Authors: Miguel Lázaro-Gredilla, Li Yang Ku, Kevin P. Murphy, Dileep George

    Abstract: Multiple types of inference are available for probabilistic graphical models, e.g., marginal, maximum-a-posteriori, and even marginal maximum-a-posteriori. Which one do researchers mean when they talk about "planning as inference"? There is no consistency in the literature, different types are used, and their ability to do planning is further entangled with specific approximations or additional co… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  14. arXiv:2406.17698  [pdf, other

    stat.ML cs.LG

    Identifying Nonstationary Causal Structures with High-Order Markov Switching Models

    Authors: Carles Balsells-Rodas, Yixin Wang, Pedro A. M. Mediano, Yingzhen Li

    Abstract: Causal discovery in time series is a rapidly evolving field with a wide variety of applications in other areas such as climate science and neuroscience. Traditional approaches assume a stationary causal graph, which can be adapted to nonstationary time series with time-dependent effects or heterogeneous noise. In this work we address nonstationarity via regime-dependent causal structures. We first… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: CI4TS Workshop @UAI2024

  15. arXiv:2406.16708  [pdf, other

    cs.LG stat.ME

    CausalFormer: An Interpretable Transformer for Temporal Causal Discovery

    Authors: Lingbai Kong, Wengen Li, Hanchen Yang, Yichao Zhang, Jihong Guan, Shuigeng Zhou

    Abstract: Temporal causal discovery is a crucial task aimed at uncovering the causal relations within time series data. The latest temporal causal discovery methods usually train deep learning models on prediction tasks to uncover the causality between time series. They capture causal relations by analyzing the parameters of some components of the trained models, e.g., attention weights and convolution weig… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  16. arXiv:2406.16306  [pdf, other

    cs.CL cs.LG stat.ML

    Cascade Reward Sampling for Efficient Decoding-Time Alignment

    Authors: Bolian Li, Yifan Wang, Ananth Grama, Ruqi Zhang

    Abstract: Aligning large language models (LLMs) with human preferences is critical for their deployment. Recently, decoding-time alignment has emerged as an effective plug-and-play technique that requires no fine-tuning of model parameters. However, generating text that achieves both high reward and high likelihood remains a significant challenge. Existing methods often fail to generate high-reward text or… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  17. arXiv:2406.15762  [pdf, other

    cs.LG stat.ML

    Rethinking the Diffusion Models for Numerical Tabular Data Imputation from the Perspective of Wasserstein Gradient Flow

    Authors: Zhichao Chen, Haoxuan Li, Fangyikang Wang, Odin Zhang, Hu Xu, Xiaoyu Jiang, Zhihuan Song, Eric H. Wang

    Abstract: Diffusion models (DMs) have gained attention in Missing Data Imputation (MDI), but there remain two long-neglected issues to be addressed: (1). Inaccurate Imputation, which arises from inherently sample-diversification-pursuing generative process of DMs. (2). Difficult Training, which stems from intricate design required for the mask matrix in model training stage. To address these concerns within… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  18. arXiv:2406.15523  [pdf, other

    cs.LG stat.ML

    Unifying Unsupervised Graph-Level Anomaly Detection and Out-of-Distribution Detection: A Benchmark

    Authors: Yili Wang, Yixin Liu, Xu Shen, Chenyu Li, Kaize Ding, Rui Miao, Ying Wang, Shirui Pan, Xin Wang

    Abstract: To build safe and reliable graph machine learning systems, unsupervised graph-level anomaly detection (GLAD) and unsupervised graph-level out-of-distribution (OOD) detection (GLOD) have received significant attention in recent years. Though those two lines of research indeed share the same objective, they have been studied independently in the community due to distinct evaluation setups, creating… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  19. arXiv:2406.14742  [pdf, other

    cs.LG stat.ML

    Latent Variable Sequence Identification for Cognitive Models with Neural Bayes Estimation

    Authors: Ti-Fen Pan, **g-**g Li, Bill Thompson, Anne Collins

    Abstract: Extracting time-varying latent variables from computational cognitive models is a key step in model-based neural analysis, which aims to understand the neural correlates of cognitive processes. However, existing methods only allow researchers to infer latent variables that explain subjects' behavior in a relatively small class of cognitive models. For example, a broad class of relevant cognitive m… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  20. arXiv:2406.14699  [pdf, other

    cs.LG math.OC stat.ML

    Preferential Multi-Objective Bayesian Optimization

    Authors: Raul Astudillo, Kejun Li, Maegan Tucker, Chu Xin Cheng, Aaron D. Ames, Yisong Yue

    Abstract: Preferential Bayesian optimization (PBO) is a framework for optimizing a decision-maker's latent preferences over available design choices. While preferences often involve multiple conflicting objectives, existing work in PBO assumes that preferences can be encoded by a single objective function. For example, in robotic assistive devices, technicians often attempt to maximize user comfort while si… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  21. Causal Inference with Latent Variables: Recent Advances and Future Prospectives

    Authors: Yaochen Zhu, Yinhan He, **g Ma, Mengxuan Hu, Sheng Li, Jundong Li

    Abstract: Causality lays the foundation for the trajectory of our world. Causal inference (CI), which aims to infer intrinsic causal relations among variables of interest, has emerged as a crucial research topic. Nevertheless, the lack of observation of important variables (e.g., confounders, mediators, exogenous variables, etc.) severely compromises the reliability of CI methods. The issue may arise from t… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted by KDD'24 Survey Track

  22. arXiv:2406.13936  [pdf, other

    stat.ML cs.LG math.OC

    Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods

    Authors: Tim Tsz-Kit Lau, Weijian Li, Chenwei Xu, Han Liu, Mladen Kolar

    Abstract: Modern deep neural networks often require distributed training with many workers due to their large size. As worker numbers increase, communication overheads become the main bottleneck in data-parallel minibatch stochastic gradient methods with per-iteration gradient synchronization. Local gradient methods like Local SGD reduce communication by only syncing after several local steps. Despite under… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  23. arXiv:2406.13154  [pdf, other

    stat.ML cs.AI cs.LG

    Conditional score-based diffusion models for solving inverse problems in mechanics

    Authors: Agnimitra Dasgupta, Harisankar Ramaswamy, Javier Murgoitio Esandi, Ken Foo, Runze Li, Qifa Zhou, Brendan Kennedy, Assad Oberai

    Abstract: We propose a framework to perform Bayesian inference using conditional score-based diffusion models to solve a class of inverse problems in mechanics involving the inference of a specimen's spatially varying material properties from noisy measurements of its mechanical response to loading. Conditional score-based diffusion models are generative models that learn to approximate the score function o… ▽ More

    Submitted 21 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  24. arXiv:2406.13036  [pdf, other

    stat.ML cs.LG math.PR math.ST stat.CO

    Sharp detection of low-dimensional structure in probability measures via dimensional logarithmic Sobolev inequalities

    Authors: Matthew T. C. Li, Tiangang Cui, Fengyi Li, Youssef Marzouk, Olivier Zahm

    Abstract: Identifying low-dimensional structure in high-dimensional probability measures is an essential pre-processing step for efficient sampling. We introduce a method for identifying and approximating a target measure $π$ as a perturbation of a given reference measure $μ$ along a few significant directions of $\mathbb{R}^{d}$. The reference measure can be a Gaussian or a nonlinear transformation of a Ga… ▽ More

    Submitted 21 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  25. arXiv:2406.12474  [pdf, other

    cs.CL stat.ME

    Exploring Intra and Inter-language Consistency in Embeddings with ICA

    Authors: Rongzhi Li, Takeru Matsuda, Hitomi Yanaka

    Abstract: Word embeddings represent words as multidimensional real vectors, facilitating data analysis and processing, but are often challenging to interpret. Independent Component Analysis (ICA) creates clearer semantic axes by identifying independent key features. Previous research has shown ICA's potential to reveal universal semantic axes across languages. However, it lacked verification of the consiste… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  26. arXiv:2406.11666  [pdf, other

    math.ST cs.LG stat.ML

    ROTI-GCV: Generalized Cross-Validation for right-ROTationally Invariant Data

    Authors: Kevin Luo, Yufan Li, Pragya Sur

    Abstract: Two key tasks in high-dimensional regularized regression are tuning the regularization strength for good predictions and estimating the out-of-sample risk. It is known that the standard approach -- $k$-fold cross-validation -- is inconsistent in modern high-dimensional settings. While leave-one-out and generalized cross-validation remain consistent in some high-dimensional cases, they become incon… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 25 pages, 3 figures

  27. arXiv:2406.11501  [pdf, other

    cs.LG cs.AI stat.ME

    Teleporter Theory: A General and Simple Approach for Modeling Cross-World Counterfactual Causality

    Authors: Jiangmeng Li, Bin Qin, Qirui Ji, Yi Li, Wenwen Qiang, Jianwen Cao, Fanjiang Xu

    Abstract: Leveraging the development of structural causal model (SCM), researchers can establish graphical models for exploring the causal mechanisms behind machine learning techniques. As the complexity of machine learning applications rises, single-world interventionism causal analysis encounters theoretical adaptation limitations. Accordingly, cross-world counterfactual approach extends our understanding… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  28. arXiv:2406.11490  [pdf, other

    cs.LG stat.ME

    Interventional Imbalanced Multi-Modal Representation Learning via $β$-Generalization Front-Door Criterion

    Authors: Yi Li, Jiangmeng Li, Fei Song, Qingmeng Zhu, Changwen Zheng, Wenwen Qiang

    Abstract: Multi-modal methods establish comprehensive superiority over uni-modal methods. However, the imbalanced contributions of different modalities to task-dependent predictions constantly degrade the discriminative performance of canonical multi-modal methods. Based on the contribution to task-dependent predictions, modalities can be identified as predominant and auxiliary modalities. Benchmark methods… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  29. arXiv:2406.10962  [pdf, other

    stat.ME stat.AP stat.ML

    SynthTree: Co-supervised Local Model Synthesis for Explainable Prediction

    Authors: Evgenii Kuriabov, Jia Li

    Abstract: Explainable machine learning (XML) has emerged as a major challenge in artificial intelligence (AI). Although black-box models such as Deep Neural Networks and Gradient Boosting often exhibit exceptional predictive accuracy, their lack of interpretability is a notable drawback, particularly in domains requiring transparency and trust. This paper tackles this core AI problem by proposing a novel me… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  30. arXiv:2406.10554  [pdf, other

    stat.ME stat.AP

    Causal Inference with Outcomes Truncated by Death and Missing Not at Random

    Authors: Wei Li, Yuan Liu, Shanshan Luo, Zhi Geng

    Abstract: In clinical trials, principal stratification analysis is commonly employed to address the issue of truncation by death, where a subject dies before the outcome can be measured. However, in practice, many survivor outcomes may remain uncollected or be missing not at random, posing a challenge to standard principal stratification analyses. In this paper, we explore the identification, estimation, an… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  31. arXiv:2406.10262  [pdf, other

    cs.IR cs.AI math.OC stat.CO

    Fast solution to the fair ranking problem using the Sinkhorn algorithm

    Authors: Yuki Uehara, Shunnosuke Ikeda, Naoki Nishimura, Koya Ohashi, Yilin Li, Jie Yang, Deddy Jobson, Xingxia Zha, Takeshi Matsumoto, Noriyoshi Sukegawa, Yuichi Takano

    Abstract: In two-sided marketplaces such as online flea markets, recommender systems for providing consumers with personalized item rankings play a key role in promoting transactions between providers and consumers. Meanwhile, two-sided marketplaces face the problem of balancing consumer satisfaction and fairness among items to stimulate activity of item providers. Saito and Joachims (2022) devised an impac… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  32. arXiv:2406.09694  [pdf, other

    stat.ML cs.LG

    An Efficient Approach to Regression Problems with Tensor Neural Networks

    Authors: Yongxin Li

    Abstract: This paper introduces a tensor neural network (TNN) to address nonparametric regression problems. Characterized by its distinct sub-network structure, the TNN effectively facilitates variable separation, thereby enhancing the approximation of complex, unknown functions. Our comparative analysis reveals that the TNN outperforms conventional Feed-Forward Networks (FFN) and Radial Basis Function Netw… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    MSC Class: 62J02; 68T05

  33. arXiv:2406.08709  [pdf, other

    cs.LG stat.ME

    Introducing Diminutive Causal Structure into Graph Representation Learning

    Authors: Hang Gao, Peng Qiao, Yifan **, Fengge Wu, Jiangmeng Li, Changwen Zheng

    Abstract: When engaging in end-to-end graph representation learning with Graph Neural Networks (GNNs), the intricate causal relationships and rules inherent in graph data pose a formidable challenge for the model in accurately capturing authentic data relationships. A proposed mitigating strategy involves the direct integration of rules or relationships corresponding to the graph data into the model. Howeve… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  34. arXiv:2406.08209  [pdf, other

    stat.ML cs.LG math.OC

    Forward-Euler time-discretization for Wasserstein gradient flows can be wrong

    Authors: Yewei Xu, Qin Li

    Abstract: In this note, we examine the forward-Euler discretization for simulating Wasserstein gradient flows. We provide two counter-examples showcasing the failure of this discretization even for a simple case where the energy functional is defined as the KL divergence against some nicely structured probability densities. A simple explanation of this failure is also discussed.

    Submitted 12 June, 2024; originally announced June 2024.

    MSC Class: 65M12

  35. arXiv:2406.06980  [pdf, other

    stat.ME

    Sensitivity Analysis for the Test-Negative Design

    Authors: Soumyabrata Kundu, Peng Ding, Xinran Li, **gshu Wang

    Abstract: The test-negative design has become popular for evaluating the effectiveness of post-licensure vaccines using observational data. In addition to its logistical convenience on data collection, the design is also believed to control for the differential health-care-seeking behavior between vaccinated and unvaccinated individuals, which is an important while often unmeasured confounder between the va… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  36. arXiv:2406.06941  [pdf, other

    stat.ME math.ST

    Efficient combination of observational and experimental datasets under general restrictions on outcome mean functions

    Authors: Harrison H. Li

    Abstract: A researcher collecting data from a randomized controlled trial (RCT) often has access to an auxiliary observational dataset that may be confounded or otherwise biased for estimating causal effects. Common modeling assumptions impose restrictions on the outcome mean function - the conditional expectation of the outcome of interest given observed covariates - in the two datasets. Running examples f… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 52 pages, 4 figures

  37. arXiv:2406.06920  [pdf, other

    stat.AP

    Where to place a mosquito trap for West Nile Virus surveillance?

    Authors: Anwesha Chakravarti, Bo Li, Dan Bartlett, Patrick Irwin, Rebecca Smith

    Abstract: The rapid spread of West Nile Virus (WNV) is a growing concern. With no vaccines or specific medications available, prevention through mosquito control is the only solution to curb the spread. Mosquito traps, used to detect viral presence in mosquito populations, are essential tools for WNV surveillance. But how do we decide where to place a mosquito trap? And what makes a good trap location, anyw… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 22 pages, 9 figures

  38. arXiv:2406.06767  [pdf

    stat.ME q-bio.QM stat.CO

    ULV: A robust statistical method for clustered data, with applications to multisubject, single-cell omics data

    Authors: Mingyu Du, Kevin Johnston, Veronica Berrocal, Wei Li, Xiangmin Xu, Zhaoxia Yu

    Abstract: Molecular and genomic technological advancements have greatly enhanced our understanding of biological processes by allowing us to quantify key biological variables such as gene expression, protein levels, and microbiome compositions. These breakthroughs have enabled us to achieve increasingly higher levels of resolution in our measurements, exemplified by our ability to comprehensively profile bi… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  39. arXiv:2406.05855  [pdf, other

    cs.LG cs.AI stat.ML

    Self-Distilled Disentangled Learning for Counterfactual Prediction

    Authors: Xinshu Li, Mingming Gong, Lina Yao

    Abstract: The advancements in disentangled representation learning significantly enhance the accuracy of counterfactual predictions by granting precise control over instrumental variables, confounders, and adjustable variables. An appealing method for achieving the independent separation of these factors is mutual information minimization, a task that presents challenges in numerous machine learning scenari… ▽ More

    Submitted 14 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  40. arXiv:2406.05666  [pdf, other

    cs.LG cs.IR stat.ML

    General Distribution Learning: A theoretical framework for Deep Learning

    Authors: Binchuan Qi, Li Li, Wei Gong

    Abstract: There remain numerous unanswered research questions on deep learning (DL) within the classical learning theory framework. These include the remarkable generalization capabilities of overparametrized neural networks (NNs), the efficient optimization performance despite non-convexity of objectives, the mechanism of flat minima for generalization, and the exceptional performance of deep architectures… ▽ More

    Submitted 26 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2105.04026 by other authors. arXiv admin note: text overlap with arXiv:2105.04026 by other authors

  41. arXiv:2406.05637  [pdf, ps, other

    math.OC cs.LG math.PR stat.ML

    A Generalized Version of Chung's Lemma and its Applications

    Authors: Li Jiang, Xiao Li, Andre Milzarek, Junwen Qiu

    Abstract: Chung's lemma is a classical tool for establishing asymptotic convergence rates of (stochastic) optimization methods under strong convexity-type assumptions and appropriate polynomial diminishing step sizes. In this work, we develop a generalized version of Chung's lemma, which provides a simple non-asymptotic convergence framework for a more general family of step size rules. We demonstrate broad… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 43 pages, 5 figures

    MSC Class: 90C15; 90C30; 90C26

  42. arXiv:2406.05340  [pdf, other

    stat.ME stat.ML

    Selecting the Number of Communities for Weighted Degree-Corrected Stochastic Block Models

    Authors: Yucheng Liu, Xiaodong Li

    Abstract: We investigate how to select the number of communities for weighted networks without a full likelihood modeling. First, we propose a novel weighted degree-corrected stochastic block model (DCSBM), in which the mean adjacency matrix is modeled as the same as in standard DCSBM, while the variance profile matrix is assumed to be related to the mean adjacency matrix through a given variance function.… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 3 figures, 2 tables

  43. arXiv:2406.05260  [pdf, other

    stat.ML cs.LG

    Generative modeling of density regression through tree flows

    Authors: Zhuoqun Wang, Naoki Awaya, Li Ma

    Abstract: A common objective in the analysis of tabular data is estimating the conditional distribution (in contrast to only producing predictions) of a set of "outcome" variables given a set of "covariates", which is sometimes referred to as the "density regression" problem. Beyond estimation on the conditional distribution, the generative ability of drawing synthetic samples from the learned conditional d… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 24 pages, 9 figures

  44. arXiv:2406.04374  [pdf, other

    cs.IR cs.GT cs.LG stat.ML

    Dynamic Online Recommendation for Two-Sided Market with Bayesian Incentive Compatibility

    Authors: Yuantong Li, Guang Cheng, Xiaowu Dai

    Abstract: Recommender systems play a crucial role in internet economies by connecting users with relevant products or services. However, designing effective recommender systems faces two key challenges: (1) the exploration-exploitation tradeoff in balancing new product exploration against exploiting known preferences, and (2) dynamic incentive compatibility in accounting for users' self-interested behaviors… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  45. arXiv:2406.04201  [pdf, ps, other

    cs.LG cs.MA math.OC stat.ML

    Towards Principled Superhuman AI for Multiplayer Symmetric Games

    Authors: Jiawei Ge, Yuanhao Wang, Wenzhe Li, Chi **

    Abstract: Multiplayer games, when the number of players exceeds two, present unique challenges that fundamentally distinguish them from the extensively studied two-player zero-sum games. These challenges arise from the non-uniqueness of equilibria and the risk of agents performing highly suboptimally when adopting equilibrium strategies. While a line of recent works developed learning systems successfully a… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  46. arXiv:2406.03849  [pdf

    cs.LG stat.AP stat.ML

    A Noise-robust Multi-head Attention Mechanism for Formation Resistivity Prediction: Frequency Aware LSTM

    Authors: Yongan Zhang, Junfeng Zhao, Jian Li, Xuanran Wang, Youzhuang Sun, Yuntian Chen, Dongxiao Zhang

    Abstract: The prediction of formation resistivity plays a crucial role in the evaluation of oil and gas reservoirs, identification and assessment of geothermal energy resources, groundwater detection and monitoring, and carbon capture and storage. However, traditional well logging techniques fail to measure accurate resistivity in cased boreholes, and the transient electromagnetic method for cased borehole… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  47. arXiv:2406.03707  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    What Should Embeddings Embed? Autoregressive Models Represent Latent Generating Distributions

    Authors: Liyi Zhang, Michael Y. Li, Thomas L. Griffiths

    Abstract: Autoregressive language models have demonstrated a remarkable ability to extract latent structure from text. The embeddings from large language models have been shown to capture aspects of the syntax and semantics of language. But what {\em should} embeddings represent? We connect the autoregressive prediction objective to the idea of constructing predictive sufficient statistics to summarize the… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 15 pages, 8 figures

    ACM Class: I.2; I.5

  48. arXiv:2406.03683  [pdf, other

    cs.LG stat.ML

    Bayesian Power Steering: An Effective Approach for Domain Adaptation of Diffusion Models

    Authors: Ding Huang, Ting Li, Jian Huang

    Abstract: We propose a Bayesian framework for fine-tuning large diffusion models with a novel network structure called Bayesian Power Steering (BPS). We clarify the meaning behind adaptation from a \textit{large probability space} to a \textit{small probability space} and explore the task of fine-tuning pre-trained models using learnable modules from a Bayesian perspective. BPS extracts task-specific knowle… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 25 pages, 26 figures, and 4 tables

    MSC Class: 62G05; 68T07

  49. arXiv:2406.03628  [pdf, other

    stat.ML cs.LG

    Synthetic Oversampling: Theory and A Practical Approach Using LLMs to Address Data Imbalance

    Authors: Ryumei Nakada, Yichen Xu, Lexin Li, Linjun Zhang

    Abstract: Imbalanced data and spurious correlations are common challenges in machine learning and data science. Oversampling, which artificially increases the number of instances in the underrepresented classes, has been widely adopted to tackle these challenges. In this article, we introduce OPAL (\textbf{O}versam\textbf{P}ling with \textbf{A}rtificial \textbf{L}LM-generated data), a systematic oversamplin… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 59 pages, 7 figures

  50. arXiv:2406.03596  [pdf

    stat.ME

    A Multivariate Equivalence Test Based on Mahalanobis Distance with a Data-Driven Margin

    Authors: Chao Wang, Yu-Ting Weng, Shaobo Liu, Tengfei Li, Meiyu Shen, Yi Tsong

    Abstract: Multivariate equivalence testing is needed in a variety of scenarios for drug development. For example, drug products obtained from natural sources may contain many components for which the individual effects and/or their interactions on clinical efficacy and safety cannot be completely characterized. Such lack of sufficient characterization poses a challenge for both generic drug developers to de… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.