Skip to main content

Showing 1–50 of 154 results for author: Li, T

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.13036  [pdf, other

    stat.ML cs.LG math.PR math.ST stat.CO

    Sharp detection of low-dimensional structure in probability measures via dimensional logarithmic Sobolev inequalities

    Authors: Matthew T. C. Li, Tiangang Cui, Fengyi Li, Youssef Marzouk, Olivier Zahm

    Abstract: Identifying low-dimensional structure in high-dimensional probability measures is an essential pre-processing step for efficient sampling. We introduce a method for identifying and approximating a target measure $π$ as a perturbation of a given reference measure $μ$ along a few significant directions of $\mathbb{R}^{d}$. The reference measure can be a Gaussian or a nonlinear transformation of a Ga… ▽ More

    Submitted 21 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2406.03683  [pdf, other

    cs.LG stat.ML

    Bayesian Power Steering: An Effective Approach for Domain Adaptation of Diffusion Models

    Authors: Ding Huang, Ting Li, Jian Huang

    Abstract: We propose a Bayesian framework for fine-tuning large diffusion models with a novel network structure called Bayesian Power Steering (BPS). We clarify the meaning behind adaptation from a \textit{large probability space} to a \textit{small probability space} and explore the task of fine-tuning pre-trained models using learnable modules from a Bayesian perspective. BPS extracts task-specific knowle… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 25 pages, 26 figures, and 4 tables

    MSC Class: 62G05; 68T07

  3. arXiv:2406.03596  [pdf

    stat.ME

    A Multivariate Equivalence Test Based on Mahalanobis Distance with a Data-Driven Margin

    Authors: Chao Wang, Yu-Ting Weng, Shaobo Liu, Tengfei Li, Meiyu Shen, Yi Tsong

    Abstract: Multivariate equivalence testing is needed in a variety of scenarios for drug development. For example, drug products obtained from natural sources may contain many components for which the individual effects and/or their interactions on clinical efficacy and safety cannot be completely characterized. Such lack of sufficient characterization poses a challenge for both generic drug developers to de… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  4. arXiv:2406.00317  [pdf, other

    stat.ML cs.LG stat.ME

    Combining Experimental and Historical Data for Policy Evaluation

    Authors: Ting Li, Chengchun Shi, Qianglin Wen, Yang Sui, Yongli Qin, Chunbo Lai, Hongtu Zhu

    Abstract: This paper studies policy evaluation with multiple data sources, especially in scenarios that involve one experimental dataset with two arms, complemented by a historical dataset generated under a single control arm. We propose novel data integration methods that linearly integrate base policy value estimators constructed based on the experimental and historical data, with weights optimized to min… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  5. arXiv:2405.20782  [pdf, other

    cs.CR cs.IT stat.ML

    Universal Exact Compression of Differentially Private Mechanisms

    Authors: Yanxiao Liu, Wei-Ning Chen, Ayfer Özgür, Cheuk Ting Li

    Abstract: To reduce the communication cost of differential privacy mechanisms, we introduce a novel construction, called Poisson private representation (PPR), designed to compress and simulate any local randomizer while ensuring local differential privacy. Unlike previous simulation-based local differential privacy mechanisms, PPR exactly preserves the joint distribution of the data and the output of the or… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 30 pages, 3 figures

  6. arXiv:2405.12838  [pdf, ps, other

    quant-ph stat.CO

    Quantum Non-Identical Mean Estimation: Efficient Algorithms and Fundamental Limits

    Authors: Jiachen Hu, Tongyang Li, Xinzhao Wang, Yecheng Xue, Chenyi Zhang, Han Zhong

    Abstract: We systematically investigate quantum algorithms and lower bounds for mean estimation given query access to non-identically distributed samples. On the one hand, we give quantum mean estimators with quadratic quantum speed-up given samples from different bounded or sub-Gaussian random variables. On the other hand, we prove that, in general, it is impossible for any quantum algorithm to achieve qua… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 31 pages, 0 figure. To appear in the 19th Theory of Quantum Computation, Communication and Cryptography (TQC 2024)

  7. arXiv:2403.18951  [pdf, other

    stat.ME stat.AP stat.CO stat.OT

    Robust estimations from distribution structures: V. Non-asymptotic

    Authors: Tuobang Li

    Abstract: Due to the complexity of order statistics, the finite sample behaviour of robust statistics is generally not analytically solvable. While the Monte Carlo method can provide approximate solutions, its convergence rate is typically very slow, making the computational cost to achieve the desired accuracy unaffordable for ordinary users. In this paper, we propose an approach analogous to the Fourier t… ▽ More

    Submitted 13 June, 2024; v1 submitted 26 February, 2024; originally announced March 2024.

  8. arXiv:2403.12110  [pdf, other

    math.ST stat.AP stat.CO stat.ME stat.OT

    Robust estimations from distribution structures: I. Mean

    Authors: Tuobang Li

    Abstract: As the most fundamental problem in statistics, robust location estimation has many prominent solutions, such as the trimmed mean, Winsorized mean, Hodges Lehmann estimator, Huber M estimator, and median of means. Recent studies suggest that their maximum biases concerning the mean can be quite different, but the underlying mechanisms largely remain unclear. This study exploited a semiparametric me… ▽ More

    Submitted 13 June, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

  9. arXiv:2402.17287  [pdf, other

    cs.LG cs.CV stat.ML

    An Interpretable Evaluation of Entropy-based Novelty of Generative Models

    Authors: **gwei Zhang, Cheuk Ting Li, Farzan Farnia

    Abstract: The massive developments of generative model frameworks require principled methods for the evaluation of a model's novelty compared to a reference dataset. While the literature has extensively studied the evaluation of the quality, diversity, and generalizability of generative models, the assessment of a model's novelty compared to a reference model has not been adequately explored in the machine… ▽ More

    Submitted 13 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  10. arXiv:2402.05802  [pdf, other

    cs.LG stat.AP stat.ML

    Unsupervised Discovery of Clinical Disease Signatures Using Probabilistic Independence

    Authors: Thomas A. Lasko, John M. Still, Thomas Z. Li, Marco Barbero Mota, William W. Stead, Eric V. Strobl, Bennett A. Landman, Fabien Maldonado

    Abstract: Insufficiently precise diagnosis of clinical disease is likely responsible for many treatment failures, even for common conditions and treatments. With a large enough dataset, it may be possible to use unsupervised machine learning to define clinical disease patterns more precisely. We present an approach to learning these patterns by using probabilistic independence to disentangle the imprint on… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 29 Pages, 8 figures

    ACM Class: I.2.6; I.2.1; J.3

  11. arXiv:2401.16320  [pdf, ps, other

    quant-ph stat.ML

    A Strategy for Preparing Quantum Squeezed States Using Reinforcement Learning

    Authors: X. L. Zhao, Y. M. Zhao, M. Li, T. T. Li, Q. Liu, S. Guo, X. X. Yi

    Abstract: We propose a scheme leveraging reinforcement learning to engineer control fields for generating non-classical states. It is exemplified by the application to prepare spin-squeezed states for an open collective spin model where a linear control field is designed to govern the dynamics. The reinforcement learning agent determines the temporal sequence of control pulses, commencing from a coherent sp… ▽ More

    Submitted 14 June, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  12. arXiv:2312.05579  [pdf, other

    stat.ML cs.LG

    Conditional Stochastic Interpolation for Generative Learning

    Authors: Ding Huang, Jian Huang, Ting Li, Guohao Shen

    Abstract: We propose a conditional stochastic interpolation (CSI) approach to learning conditional distributions. CSI learns probability flow equations or stochastic differential equations that transport a reference distribution to the target conditional distribution. This is achieved by first learning the drift function and the conditional score function based on conditional stochastic interpolation, which… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: 44 pages, 4 figures

  13. arXiv:2311.15598  [pdf, other

    math.ST cs.LG cs.SI stat.ME stat.ML

    Optimal Clustering of Discrete Mixtures: Binomial, Poisson, Block Models, and Multi-layer Networks

    Authors: Zhongyuan Lyu, Ting Li, Dong Xia

    Abstract: In this paper, we first study the fundamental limit of clustering networks when a multi-layer network is present. Under the mixture multi-layer stochastic block model (MMSBM), we show that the minimax optimal network clustering error rate, which takes an exponential form and is characterized by the Renyi divergence between the edge probability distributions of the component networks. We propose a… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  14. arXiv:2311.05248  [pdf, other

    stat.ME math.ST

    A General Space of Belief Updates for Model Misspecification in Bayesian Networks

    Authors: Tian** Li

    Abstract: In an ideal setting for Bayesian agents, a perfect description of the rules of the environment (i.e., the objective observation model) is available, allowing them to reason through the Bayesian posterior to update their beliefs in an optimal way. But such an ideal setting hardly ever exists in the natural world, so agents have to make do with reasoning about how they should update their beliefs si… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: 14 pages, 4 figures

  15. arXiv:2311.02532  [pdf, other

    stat.ME

    Optimal Treatment Allocation for Efficient Policy Evaluation in Sequential Decision Making

    Authors: Ting Li, Chengchun Shi, Jianing Wang, Fan Zhou, Hongtu Zhu

    Abstract: A/B testing is critical for modern technological companies to evaluate the effectiveness of newly developed products against standard baselines. This paper studies optimal designs that aim to maximize the amount of information obtained from online experiments to estimate treatment effects accurately. We propose three optimal allocation strategies in a dynamic setting where treatments are sequentia… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

  16. arXiv:2309.08809  [pdf

    stat.AP

    Associations Between Sleep Efficiency Variability and Cognition Among Older Adults: Cross-Sectional Accelerometer Study

    Authors: Collin Sakal, Tingyou Li, Juan Li, Xinyue Li

    Abstract: Objective: We aimed to determine the relationship between day-to-day sleep efficiency variability and cognitive function among older adults using accelerometer data and three cognitive tests. Methods: Older adults aged 65+ with 5 days of accelerometer data from the National Health and Nutrition Examination Survey (NHANES) who completed the Digit Symbol Substitution Test (DSST), the Consortium to… ▽ More

    Submitted 5 November, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: Revised study design and figures

  17. Predicting Battery Lifetime Under Varying Usage Conditions from Early Aging Data

    Authors: Tingkai Li, Zihao Zhou, Adam Thelen, David Howey, Chao Hu

    Abstract: Accurate battery lifetime prediction is important for preventative maintenance, warranties, and improved cell design and manufacturing. However, manufacturing variability and usage-dependent degradation make life prediction challenging. Here, we investigate new features derived from capacity-voltage data in early life to predict the lifetime of cells cycled under widely varying charge rates, disch… ▽ More

    Submitted 20 October, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Journal ref: Cell Reports Physical Science. 5(4), 101891. 2024

  18. arXiv:2307.01497  [pdf, other

    math.OC cs.LG stat.CO stat.ML

    Accelerated stochastic approximation with state-dependent noise

    Authors: Sasila Ilandarideva, Anatoli Juditsky, Guanghui Lan, Tianjiao Li

    Abstract: We consider a class of stochastic smooth convex optimization problems under rather general assumptions on the noise in the stochastic gradient observation. As opposed to the classical problem setting in which the variance of noise is assumed to be uniformly bounded, herein we assume that the variance of stochastic gradients is related to the "sub-optimality" of the approximate solutions delivered… ▽ More

    Submitted 13 July, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

  19. arXiv:2306.10405  [pdf, other

    stat.ME stat.CO

    A semi-parametric estimation method for quantile coherence with an application to bivariate financial time series clustering

    Authors: Cristian F. Jiménez-Varón, Ying Sun, Ta-Hsin Li

    Abstract: In multivariate time series analysis, spectral coherence measures the linear dependency between two time series at different frequencies. However, real data applications often exhibit nonlinear dependency in the frequency domain. Conventional coherence analysis fails to capture such dependency. The quantile coherence, on the other hand, characterizes nonlinear dependency by defining the coherence… ▽ More

    Submitted 29 February, 2024; v1 submitted 17 June, 2023; originally announced June 2023.

    Comments: 39 pages, 11 figures

  20. arXiv:2306.06581  [pdf, other

    stat.ML cs.DS cs.LG math.OC

    Importance Sparsification for Sinkhorn Algorithm

    Authors: Mengyu Li, Jun Yu, Tao Li, Cheng Meng

    Abstract: Sinkhorn algorithm has been used pervasively to approximate the solution to optimal transport (OT) and unbalanced optimal transport (UOT) problems. However, its practical application is limited due to the high computational complexity. To alleviate the computational burden, we propose a novel importance sparsification method, called Spar-Sink, to efficiently approximate entropy-regularized OT and… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: Accepted by Journal of Machine Learning Research

  21. arXiv:2306.02826  [pdf, ps, other

    quant-ph cs.AI cs.DS cs.LG stat.ML

    Near-Optimal Quantum Coreset Construction Algorithms for Clustering

    Authors: Yecheng Xue, Xiaoyu Chen, Tongyang Li, Shaofeng H. -C. Jiang

    Abstract: $k$-Clustering in $\mathbb{R}^d$ (e.g., $k$-median and $k$-means) is a fundamental machine learning problem. While near-linear time approximation algorithms were known in the classical setting for a dataset with cardinality $n$, it remains open to find sublinear-time quantum algorithms. We give quantum algorithms that find coresets for $k$-clustering in $\mathbb{R}^d$ with $\tilde{O}(\sqrt{nk}d^{3… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Comments: 32 pages, 0 figures, 1 table. To appear in the Fortieth International Conference on Machine Learning (ICML 2023)

  22. arXiv:2305.10187  [pdf, other

    stat.ME cs.LG stat.ML

    Evaluating Dynamic Conditional Quantile Treatment Effects with Applications in Ridesharing

    Authors: Ting Li, Chengchun Shi, Zhaohua Lu, Yi Li, Hongtu Zhu

    Abstract: Many modern tech companies, such as Google, Uber, and Didi, utilize online experiments (also known as A/B testing) to evaluate new policies against existing ones. While most studies concentrate on average treatment effects, situations with skewed and heavy-tailed outcome distributions may benefit from alternative criteria, such as quantiles. However, assessing dynamic quantile treatment effects (Q… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  23. arXiv:2305.06172  [pdf, other

    stat.CO math.PR math.ST

    Principal Feature Detection via $Φ$-Sobolev Inequalities

    Authors: Matthew T. C. Li, Youssef Marzouk, Olivier Zahm

    Abstract: We investigate the approximation of high-dimensional target measures as low-dimensional updates of a dominating reference measure. This approximation class replaces the associated density with the composition of: (i) a feature map that identifies the leading principal components or features of the target measure, relative to the reference, and (ii) a low-dimensional profile function. When the refe… ▽ More

    Submitted 16 January, 2024; v1 submitted 10 May, 2023; originally announced May 2023.

    Comments: To appear in Bernoulli, but this version contains both the main file and the supplementary material

  24. arXiv:2305.02500  [pdf

    stat.AP

    Identifying the most predictive risk factors for future cognitive impairment among elderly Chinese

    Authors: Collin Sakal, Tingyou Li, Juan Li, Xinyue Li

    Abstract: Introduction. The societal burden of cognitive impairments in China has prompted researchers to develop clinical prediction models aimed at making risk assessments that enable preventative interventions. However, it is unclear which risk factors best predict future cognitive impairment and if predictive ability is consistent across different socioeconomic groups. Methods. We quantified the ability… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: 3 figures, 2 tables

  25. arXiv:2303.11230  [pdf, other

    cs.SI cs.LG stat.ML

    Fitting Low-rank Models on Egocentrically Sampled Partial Networks

    Authors: Angus Chan, Tianxi Li

    Abstract: The statistical modeling of random networks has been widely used to uncover interaction mechanisms in complex systems and to predict unobserved links in real-world networks. In many applications, network connections are collected via egocentric sampling: a subset of nodes is sampled first, after which all links involving this subset are recorded; all other information is missing. Compared with the… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

  26. arXiv:2303.10599  [pdf, ps, other

    stat.ML math.OC

    Convergence Analysis of Stochastic Gradient Descent with MCMC Estimators

    Authors: Tianyou Li, Fan Chen, Huajie Chen, Zaiwen Wen

    Abstract: Understanding stochastic gradient descent (SGD) and its variants is essential for machine learning. However, most of the preceding analyses are conducted under amenable conditions such as unbiased gradient estimator and bounded objective functions, which does not encompass many sophisticated applications, such as variational Monte Carlo, entropy-regularized reinforcement learning and variational i… ▽ More

    Submitted 23 March, 2024; v1 submitted 19 March, 2023; originally announced March 2023.

  27. arXiv:2302.10796  [pdf, ps, other

    quant-ph cs.AI cs.LG stat.ML

    Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret

    Authors: Han Zhong, Jiachen Hu, Yecheng Xue, Tongyang Li, Liwei Wang

    Abstract: While quantum reinforcement learning (RL) has attracted a surge of attention recently, its theoretical understanding is limited. In particular, it remains elusive how to design provably efficient quantum RL algorithms that can address the exploration-exploitation trade-off. To this end, we propose a novel UCRL-style algorithm that takes advantage of quantum computing for tabular Markov decision pr… ▽ More

    Submitted 13 June, 2024; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: ICML 2024

  28. arXiv:2302.04437  [pdf, other

    stat.ML cs.LG stat.AP

    rMultiNet: An R Package For Multilayer Networks Analysis

    Authors: Ting Li, Zhongyuan Lyu, Chenyu Ren, Dong Xia

    Abstract: This paper develops an R package rMultiNet to analyze multilayer network data. We provide two general frameworks from recent literature, e.g. mixture multilayer stochastic block model(MMSBM) and mixture multilayer latent space model(MMLSM) to generate the multilayer network. We also provide several methods to reveal the embedding of both nodes and layers followed by further data analysis methods,… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

  29. arXiv:2211.09221  [pdf, other

    stat.ML cs.LG

    The non-overlap** statistical approximation to overlap** group lasso

    Authors: Mingyu Qi, Tianxi Li

    Abstract: Group lasso is a commonly used regularization method in statistical learning in which parameters are eliminated from the model according to predefined groups. However, when the groups overlap, optimizing the group lasso penalized objective can be time-consuming on large-scale problems because of the non-separability induced by the overlap** groups. This bottleneck has seriously limited the appli… ▽ More

    Submitted 20 February, 2024; v1 submitted 16 November, 2022; originally announced November 2022.

  30. arXiv:2211.05844  [pdf, ps, other

    stat.ME stat.CO

    Quantile Fourier Transform, Quantile Series, and Nonparametric Estimation of Quantile Spectra

    Authors: Ta-Hsin Li

    Abstract: A nonparametric method is proposed for estimating the quantile spectra and cross-spectra introduced in Li (2012; 2014) as bivariate functions of frequency and quantile level. The method is based on the quantile discrete Fourier transform (QDFT) defined by trigonometric quantile regression and the quantile series (QSER) defined by the inverse Fourier transform of the QDFT. A nonparametric spectral… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

  31. arXiv:2210.14086  [pdf, ps, other

    math.ST stat.ME

    A Global Wavelet Based Bootstrapped Test of Covariance Stationarity

    Authors: Jonathan B. Hill, Tianqi Li

    Abstract: We propose a covariance stationarity test for an otherwise dependent and possibly globally non-stationary time series. We work in a generalized version of the new setting in **, Wang and Wang (2015), who exploit Walsh (1923) functions in order to compare sub-sample covariances with the full sample counterpart. They impose strict stationarity under the null, only consider linear processes under ei… ▽ More

    Submitted 21 May, 2024; v1 submitted 25 October, 2022; originally announced October 2022.

    MSC Class: 62G10; 62M10; 62F40

  32. arXiv:2210.09217  [pdf, other

    stat.AP q-bio.NC

    Statistical learning methods for neuroimaging data analysis with applications

    Authors: Hongtu Zhu, Tengfei Li, Bingxin Zhao

    Abstract: The aim of this paper is to provide a comprehensive review of statistical challenges in neuroimaging data analysis from neuroimaging techniques to large-scale neuroimaging studies to statistical learning methods. We briefly review eight popular neuroimaging techniques and their potential applications in neuroscience research and clinical translation. We delineate the four common themes of neuroima… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: 73 pages, 4 Figures

  33. arXiv:2210.01084  [pdf, other

    stat.ME stat.AP

    A Partially Functional Linear Modeling Framework for Integrating Genetic, Imaging, and Clinical Data

    Authors: Ting Li, Yang Yu, J. S. Marron, Hongtu Zhu

    Abstract: This paper is motivated by the joint analysis of genetic, imaging, and clinical (GIC) data collected in the Alzheimer's Disease Neuroimaging Initiative (ADNI) study. We propose a regression framework based on partially functional linear regression models to map high-dimensional GIC-related pathways for Alzheimer's Disease (AD). We develop a joint model selection and estimation procedure by embeddi… ▽ More

    Submitted 22 February, 2023; v1 submitted 30 September, 2022; originally announced October 2022.

  34. arXiv:2209.10433  [pdf, ps, other

    eess.SY stat.ME

    Arithmetic Average Density Fusion -- Part II: Unified Derivation for Unlabeled and Labeled RFS Fusion

    Authors: Tiancheng Li

    Abstract: As a fundamental information fusion approach, the arithmetic average (AA) fusion has recently been investigated for various random finite set (RFS) filter fusion in the context of multi-sensor multi-target tracking. It is not a straightforward extension of the ordinary density-AA fusion to the RFS distribution but has to preserve the form of the fusing multi-target density. In this work, we first… ▽ More

    Submitted 22 November, 2023; v1 submitted 21 September, 2022; originally announced September 2022.

    Comments: 13 pages, 4 figures, 1 table

    Journal ref: IEEE Transactions on Aerospace and Electronics Systems, 2024

  35. arXiv:2208.02161  [pdf, other

    cs.LG stat.ME

    A Screening Strategy for Structured Optimization Involving Nonconvex $\ell_{q,p}$ Regularization

    Authors: Tiange Li, Xiangyu Yang, Hao Wang

    Abstract: In this paper, we develop a simple yet effective screening rule strategy to improve the computational efficiency in solving structured optimization involving nonconvex $\ell_{q,p}$ regularization. Based on an iteratively reweighted $\ell_1$ (IRL1) framework, the proposed screening rule works like a preprocessing module that potentially removes the inactive groups before starting the subproblem sol… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

  36. arXiv:2207.05301  [pdf, other

    cs.SI physics.data-an physics.soc-ph stat.ML

    Edge Augmentation on Disconnected Graphs via Eigenvalue Elevation

    Authors: Tianyi Li

    Abstract: The graph-theoretical task of determining most likely inter-community edges based on disconnected subgraphs' intra-community connectivity is proposed. An algorithm is developed for this edge augmentation task, based on elevating the zero eigenvalues of graph's spectrum. Upper bounds for eigenvalue elevation amplitude and for the corresponding augmented edge density are derived and are authenticate… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: 6 pages, 3 figures

  37. arXiv:2206.10240  [pdf, other

    stat.CO stat.ME

    Core-Elements for Classical Linear Regression

    Authors: Mengyu Li, Jun Yu, Tao Li, Cheng Meng

    Abstract: The coresets approach, also called subsampling or subset selection, aims to select a subsample as a surrogate for the observed sample. Such an approach has been used pervasively in large-scale data analysis. Existing coresets methods construct the subsample using a subset of rows from the predictor matrix. Such methods can be significantly inefficient when the predictor matrix is sparse or numeric… ▽ More

    Submitted 17 March, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

  38. arXiv:2206.08649  [pdf

    stat.ME stat.AP stat.OT

    On the probability of invalidating a causal inference due to limited external validity

    Authors: Tenglong Li

    Abstract: External validity is often questionable in empirical research, especially in randomized experiments due to the trade-off between internal validity and external validity. To quantify the robustness of external validity, one must first conceptualize the gap between a sample that is fully representative of the target population (i.e., the ideal sample) and the observed sample. Drawing on Frank & Min… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: 57 pages, 3 figures

  39. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  40. arXiv:2206.03861  [pdf, ps, other

    cs.LG eess.SY stat.ML

    Decentralized Online Regularized Learning Over Random Time-Varying Graphs

    Authors: Xiwei Zhang, Tao Li, Xiaozheng Fu

    Abstract: We study the decentralized online regularized linear regression algorithm over random time-varying graphs. At each time step, every node runs an online estimation algorithm consisting of an innovation term processing its own new measurement, a consensus term taking a weighted sum of estimations of its own and its neighbors with additive and multiplicative communication noises and a regularization… ▽ More

    Submitted 21 April, 2024; v1 submitted 7 June, 2022; originally announced June 2022.

  41. arXiv:2206.01341  [pdf, other

    cs.LG eess.SY stat.ML

    Equip** Black-Box Policies with Model-Based Advice for Stable Nonlinear Control

    Authors: Tongxin Li, Ruixiao Yang, Guannan Qu, Yiheng Lin, Steven Low, Adam Wierman

    Abstract: Machine-learned black-box policies are ubiquitous for nonlinear control problems. Meanwhile, crude model information is often available for these problems from, e.g., linear approximations of nonlinear dynamics. We study the problem of equip** a black-box control policy with model-based advice for nonlinear control on a single trajectory. We first show a general negative result that a naive conv… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

    Comments: 33 pages, 7 figures

  42. arXiv:2205.15059  [pdf, other

    cs.LG stat.ML

    Hilbert Curve Projection Distance for Distribution Comparison

    Authors: Tao Li, Cheng Meng, Hongteng Xu, Jun Yu

    Abstract: Distribution comparison plays a central role in many machine learning tasks like data classification and generative modeling. In this study, we propose a novel metric, called Hilbert curve projection (HCP) distance, to measure the distance between two probability distributions with low complexity. In particular, we first project two high-dimensional probability distributions using Hilbert curve to… ▽ More

    Submitted 6 February, 2024; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: 33 pages, 11 figures

  43. arXiv:2205.05800  [pdf, other

    cs.LG math.OC stat.ML

    Stochastic first-order methods for average-reward Markov decision processes

    Authors: Tianjiao Li, Feiyang Wu, Guanghui Lan

    Abstract: We study the problem of average-reward Markov decision processes (AMDPs) and develop novel first-order methods with strong theoretical guarantees for both policy evaluation and optimization. Existing on-policy evaluation methods suffer from sub-optimal convergence rates as well as failure in handling insufficiently random policies, e.g., deterministic policies, for lack of exploration. To remedy t… ▽ More

    Submitted 14 September, 2022; v1 submitted 11 May, 2022; originally announced May 2022.

  44. arXiv:2202.05963  [pdf, other

    cs.LG cs.CR stat.ML

    Private Adaptive Optimization with Side Information

    Authors: Tian Li, Manzil Zaheer, Sashank J. Reddi, Virginia Smith

    Abstract: Adaptive optimization methods have become the default solvers for many machine learning tasks. Unfortunately, the benefits of adaptivity may degrade when training with differential privacy, as the noise added to ensure privacy reduces the effectiveness of the adaptive preconditioner. To this end, we propose AdaDPS, a general framework that uses non-sensitive side information to precondition the gr… ▽ More

    Submitted 24 June, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

    Comments: ICML 2022

  45. arXiv:2112.13109  [pdf, other

    stat.ML cs.LG math.OC

    Accelerated and instance-optimal policy evaluation with linear function approximation

    Authors: Tianjiao Li, Guanghui Lan, Ashwin Pananjady

    Abstract: We study the problem of policy evaluation with linear function approximation and present efficient and practical algorithms that come with strong optimality guarantees. We begin by proving lower bounds that establish baselines on both the deterministic error and stochastic error in this problem. In particular, we prove an oracle complexity lower bound on the deterministic error in an instance-depe… ▽ More

    Submitted 13 August, 2022; v1 submitted 24 December, 2021; originally announced December 2021.

  46. arXiv:2112.08507  [pdf, other

    cs.LG stat.ML

    Algorithms for Adaptive Experiments that Trade-off Statistical Analysis with Reward: Combining Uniform Random Assignment and Reward Maximization

    Authors: Tong Li, Jacob Nogas, Haochen Song, Harsh Kumar, Audrey Durand, Anna Rafferty, Nina Deliu, Sofia S. Villar, Joseph J. Williams

    Abstract: Multi-armed bandit algorithms like Thompson Sampling (TS) can be used to conduct adaptive experiments, in which maximizing reward means that data is used to progressively assign participants to more effective arms. Such assignment strategies increase the risk of statistical hypothesis tests identifying a difference between arms when there is not one, and failing to conclude there is a difference i… ▽ More

    Submitted 23 November, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

  47. arXiv:2111.14069  [pdf, other

    math.OC cs.LG stat.ML

    Escape saddle points by a simple gradient-descent based algorithm

    Authors: Chenyi Zhang, Tongyang Li

    Abstract: Esca** saddle points is a central research topic in nonconvex optimization. In this paper, we propose a simple gradient-based algorithm such that for a smooth function $f\colon\mathbb{R}^n\to\mathbb{R}$, it outputs an $ε$-approximate second-order stationary point in $\tilde{O}(\log n/ε^{1.75})$ iterations. Compared to the previous state-of-the-art algorithms by ** et al. with… ▽ More

    Submitted 28 November, 2021; originally announced November 2021.

    Comments: 34 pages, 8 figures, to appear in the 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  48. arXiv:2111.03721  [pdf, other

    stat.ME stat.AP

    Compressed spectral screening for large-scale differential correlation analysis with application in selecting Glioblastoma gene modules

    Authors: Tianxi Li, Xiwei Tang, Ajay Chatrath

    Abstract: Differential co-expression analysis has been widely applied by scientists in understanding the biological mechanisms of diseases. However, the unknown differential patterns are often complicated; thus, models based on simplified parametric assumptions can be ineffective in identifying the differences. Meanwhile, the gene expression data involved in such analysis are in extremely high dimensions by… ▽ More

    Submitted 12 January, 2022; v1 submitted 5 November, 2021; originally announced November 2021.

  49. arXiv:2109.06141  [pdf, other

    cs.LG cs.IT math.OC stat.ML

    On Tilted Losses in Machine Learning: Theory and Applications

    Authors: Tian Li, Ahmad Beirami, Maziar Sanjabi, Virginia Smith

    Abstract: Exponential tilting is a technique commonly used in fields such as statistics, probability, information theory, and optimization to create parametric distribution shifts. Despite its prevalence in related fields, tilting has not seen widespread use in machine learning. In this work, we aim to bridge this gap by exploring the use of tilting in risk minimization. We study a simple extension to ERM -… ▽ More

    Submitted 1 June, 2023; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2007.01162

  50. arXiv:2109.00171  [pdf

    stat.ME econ.EM stat.AP stat.OT

    A generalized bootstrap procedure of the standard error and confidence interval estimation for inverse probability of treatment weighting

    Authors: Tenglong Li, Jordan Lawson

    Abstract: The inverse probability of treatment weighting (IPTW) approach is commonly used in propensity score analysis to infer causal effects in regression models. Due to oversized IPTW weights and errors associated with propensity score estimation, the IPTW approach can underestimate the standard error of causal effect. To remediate this, bootstrap standard errors have been recommended to replace the IPTW… ▽ More

    Submitted 31 August, 2021; originally announced September 2021.