Skip to main content

Showing 1–50 of 111 results for author: Yang, F

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.01868  [pdf, other

    stat.ME stat.AP stat.CO

    Forecast Linear Augmented Projection (FLAP): A free lunch to reduce forecast error variance

    Authors: Yangzhuoran Fin Yang, George Athanasopoulos, Rob J. Hyndman, Anastasios Panagiotelis

    Abstract: A novel forecast linear augmented projection (FLAP) method is introduced, which reduces the forecast error variance of any unbiased multivariate forecast without introducing bias. The method first constructs new component series which are linear combinations of the original series. Forecasts are then generated for both the original and component series. Finally, the full vector of forecasts is pro… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.18072  [pdf, ps, other

    stat.ML cs.LG

    Learning for Bandits under Action Erasures

    Authors: Osama Hanna, Merve Karakas, Lin F. Yang, Christina Fragouli

    Abstract: We consider a novel multi-arm bandit (MAB) setup, where a learner needs to communicate the actions to distributed agents over erasure channels, while the rewards for the actions are directly available to the learner through external sensors. In our model, while the distributed agents know if an action is erased, the central learner does not (there is no feedback), and thus does not know whether th… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  3. arXiv:2404.18905  [pdf, other

    stat.ME cs.LG stat.ML

    Detecting critical treatment effect bias in small subgroups

    Authors: Piersilvio De Bartolomeis, Javier Abad, Konstantin Donhauser, Fanny Yang

    Abstract: Randomized trials are considered the gold standard for making informed decisions in medicine, yet they often lack generalizability to the patient populations in clinical practice. Observational studies, on the other hand, cover a broader patient population but are prone to various biases. Thus, before using an observational study for decision-making, it is crucial to benchmark its treatment effect… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted for presentation at the Conference on Uncertainty in Artificial Intelligence (UAI) 2024

  4. arXiv:2402.15691  [pdf, other

    cs.LG stat.ML

    Orthogonal Gradient Boosting for Simpler Additive Rule Ensembles

    Authors: Fan Yang, Pierre Le Bodic, Michael Kamp, Mario Boley

    Abstract: Gradient boosting of prediction rules is an efficient approach to learn potentially interpretable yet accurate probabilistic models. However, actual interpretability requires to limit the number and size of the generated rules, and existing boosting variants are not designed for this purpose. Though corrective boosting refits all rule weights in each iteration to minimise prediction risk, the incl… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: 21 pages, 11 figures, accepted at AISTATS 2024

  5. arXiv:2401.02708  [pdf, other

    cs.LG cs.AI stat.ML

    TripleSurv: Triplet Time-adaptive Coordinate Loss for Survival Analysis

    Authors: Liwen Zhang, Lianzhen Zhong, Fan Yang, Di Dong, Hui Hui, Jie Tian

    Abstract: A core challenge in survival analysis is to model the distribution of censored time-to-event data, where the event of interest may be a death, failure, or occurrence of a specific event. Previous studies have showed that ranking and maximum likelihood estimation (MLE)loss functions are widely-used for survival analysis. However, ranking loss only focus on the ranking of survival time and does not… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 9 pages,6 figures

  6. arXiv:2401.02154  [pdf, other

    cs.LG cs.AI cs.CR stat.ME

    Disentangle Estimation of Causal Effects from Cross-Silo Data

    Authors: Yuxuan Liu, Haozhao Wang, Shuang Wang, Zhiming He, Wenchao Xu, Jialiang Zhu, Fan Yang

    Abstract: Estimating causal effects among different events is of great importance to critical fields such as drug development. Nevertheless, the data features associated with events may be distributed across various silos and remain private within respective parties, impeding direct information exchange between them. This, in turn, can result in biased estimations of local causal effects, which rely on the… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: Accepted by ICASSP 2024

  7. arXiv:2312.04464  [pdf, other

    cs.LG stat.ML

    Horizon-Free and Instance-Dependent Regret Bounds for Reinforcement Learning with General Function Approximation

    Authors: Jiayi Huang, Han Zhong, Liwei Wang, Lin F. Yang

    Abstract: To tackle long planning horizon problems in reinforcement learning with general function approximation, we propose the first algorithm, termed as UCRL-WVTR, that achieves both \emph{horizon-free} and \emph{instance-dependent}, since it eliminates the polynomial dependency on the planning horizon. The derived regret bound is deemed \emph{sharp}, as it matches the minimax lower bound when specialize… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  8. arXiv:2312.03871  [pdf, other

    stat.ML cs.LG

    Hidden yet quantifiable: A lower bound for confounding strength using randomized trials

    Authors: Piersilvio De Bartolomeis, Javier Abad, Konstantin Donhauser, Fanny Yang

    Abstract: In the era of fast-paced precision medicine, observational studies play a major role in properly evaluating new treatments in clinical practice. Yet, unobserved confounding can significantly compromise causal conclusions drawn from non-randomized data. We propose a novel strategy that leverages randomized trials to quantify unobserved confounding. First, we design a statistical test to detect unob… ▽ More

    Submitted 1 May, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Accepted for presentation at the International Conference on Artificial Intelligence and Statistics (AISTATS) 2024

  9. arXiv:2311.18557  [pdf, other

    cs.LG stat.ML

    Can semi-supervised learning use all the data effectively? A lower bound perspective

    Authors: Alexandru Ţifrea, Gizem Yüce, Amartya Sanyal, Fanny Yang

    Abstract: Prior works have shown that semi-supervised learning algorithms can leverage unlabeled data to improve over the labeled sample complexity of supervised learning (SL) algorithms. However, existing theoretical analyses focus on regimes where the unlabeled data is sufficient to learn a good decision boundary using unsupervised learning (UL) alone. This begs the question: Can SSL algorithms simultaneo… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: Published in Advances in Neural Information Processing Systems 2023

  10. arXiv:2311.04686  [pdf, other

    cs.LG cs.DC stat.ML

    Robust and Communication-Efficient Federated Domain Adaptation via Random Features

    Authors: Zhanbo Feng, Yuanjie Wang, Jie Li, Fan Yang, Jiong Lou, Tiebin Mi, Robert. C. Qiu, Zhenyu Liao

    Abstract: Modern machine learning (ML) models have grown to a scale where training them on a single machine becomes impractical. As a result, there is a growing trend to leverage federated learning (FL) techniques to train large ML models in a distributed and collaborative manner. These models, however, when deployed on new devices, might struggle to generalize well due to domain shifts. In this context, fe… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 21 pages

  11. arXiv:2309.02073  [pdf, other

    stat.ME math.ST

    Debiased Regression Adjustment in Completely Randomized Experiments with Moderately High-dimensional Covariates

    Authors: Xin Lu, Fan Yang, Yuhao Wang

    Abstract: Completely randomized experiment is the gold standard for causal inference. When the covariate information for each experimental candidate is available, one typical way is to include them in covariate adjustments for more accurate treatment effect estimation. In this paper, we investigate this problem under the randomization-based framework, i.e., that the covariates and potential outcomes of all… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  12. arXiv:2306.06836  [pdf, other

    cs.LG cs.AI stat.ML

    Tackling Heavy-Tailed Rewards in Reinforcement Learning with Function Approximation: Minimax Optimal and Instance-Dependent Regret Bounds

    Authors: Jiayi Huang, Han Zhong, Liwei Wang, Lin F. Yang

    Abstract: While numerous works have focused on devising efficient algorithms for reinforcement learning (RL) with uniformly bounded rewards, it remains an open question whether sample or time-efficient algorithms for RL with large state-action space exist when the rewards are \emph{heavy-tailed}, i.e., with only finite $(1+ε)$-th moments for some $ε\in(0,1]$. In this work, we address the challenge of such r… ▽ More

    Submitted 7 March, 2024; v1 submitted 11 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

  13. arXiv:2306.03962  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    PILLAR: How to make semi-private learning more effective

    Authors: Francesco Pinto, Yaxi Hu, Fanny Yang, Amartya Sanyal

    Abstract: In Semi-Supervised Semi-Private (SP) learning, the learner has access to both public unlabelled and private labelled data. We propose a computationally efficient algorithm that, under mild assumptions on the data, provably achieves significantly lower private labelled sample complexity and can be efficiently run on real-world datasets. For this purpose, we leverage the features extracted by networ… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

  14. arXiv:2305.19562  [pdf, other

    cs.LG cs.AI stat.ML

    Replicability in Reinforcement Learning

    Authors: Amin Karbasi, Grigoris Velegkas, Lin F. Yang, Felix Zhou

    Abstract: We initiate the mathematical study of replicability as an algorithmic property in the context of reinforcement learning (RL). We focus on the fundamental setting of discounted tabular MDPs with access to a generative model. Inspired by Impagliazzo et al. [2022], we say that an RL algorithm is replicable if, with high probability, it outputs the exact same policy after two executions on i.i.d. samp… ▽ More

    Submitted 27 October, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: to be published in neurips 2023

  15. arXiv:2305.09798  [pdf

    cs.CL cs.HC eess.SY stat.AP

    The Ways of Words: The Impact of Word Choice on Information Engagement and Decision Making

    Authors: Nimrod Dvir, Elaine Friedman, Suraj Commuri, Fan Yang, Jennifer Romano

    Abstract: Little research has explored how information engagement (IE), the degree to which individuals interact with and use information in a manner that manifests cognitively, behaviorally, and affectively. This study explored the impact of phrasing, specifically word choice, on IE and decision making. Synthesizing two theoretical models, User Engagement Theory UET and Information Behavior Theory IBT, a t… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    MSC Class: 28-08 ACM Class: H.5.2; H.1.2

  16. arXiv:2301.07605  [pdf, other

    stat.ML cs.LG

    Strong inductive biases provably prevent harmless interpolation

    Authors: Michael Aerni, Marco Milanta, Konstantin Donhauser, Fanny Yang

    Abstract: Classical wisdom suggests that estimators should avoid fitting noise to achieve good generalization. In contrast, modern overparameterized models can yield small test error despite interpolating noise -- a phenomenon often called "benign overfitting" or "harmless interpolation". This paper argues that the degree to which interpolation is harmless hinges upon the strength of an estimator's inductiv… ▽ More

    Submitted 1 March, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

    Comments: Accepted at ICLR 2023

  17. arXiv:2212.05577  [pdf, other

    stat.ME

    Mediation analysis with the mediator and outcome missing not at random

    Authors: Shuozhi Zuo, Debashis Ghosh, Peng Ding, Fan Yang

    Abstract: Mediation analysis is widely used for investigating direct and indirect causal pathways through which an effect arises. However, many mediation analysis studies are challenged by missingness in the mediator and outcome. In general, when the mediator and outcome are missing not at random, the direct and indirect effects are not identifiable without further assumptions. In this work, we study the id… ▽ More

    Submitted 22 September, 2023; v1 submitted 11 December, 2022; originally announced December 2022.

  18. arXiv:2212.03783  [pdf, ps, other

    stat.ML cs.LG

    Tight bounds for maximum $\ell_1$-margin classifiers

    Authors: Stefan Stojanovic, Konstantin Donhauser, Fanny Yang

    Abstract: Popular iterative algorithms such as boosting methods and coordinate descent on linear models converge to the maximum $\ell_1$-margin classifier, a.k.a. sparse hard-margin SVM, in high dimensional regimes where the data is linearly separable. Previous works consistently show that many estimators relying on the $\ell_1$-norm achieve improved statistical rates for hard sparse ground truths. We show… ▽ More

    Submitted 20 January, 2023; v1 submitted 7 December, 2022; originally announced December 2022.

  19. arXiv:2211.05632  [pdf, ps, other

    stat.ML cs.LG

    Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit Algorithms

    Authors: Osama A. Hanna, Lin F. Yang, Christina Fragouli

    Abstract: In this paper, we address the stochastic contextual linear bandit problem, where a decision maker is provided a context (a random set of actions drawn from a distribution). The expected reward of each action is specified by the inner product of the action and an unknown parameter. The goal is to design an algorithm that learns to play as close as possible to the unknown optimal policy after a numb… ▽ More

    Submitted 26 May, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

  20. arXiv:2211.00128  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    SIMPLE-RC: Group Network Inference with Non-Sharp Nulls and Weak Signals

    Authors: Jianqing Fan, Yingying Fan, **chi Lv, Fan Yang

    Abstract: Large-scale network inference with uncertainty quantification has important applications in natural, social, and medical sciences. The recent work of Fan, Fan, Han and Lv (2022) introduced a general framework of statistical inference on membership profiles in large networks (SIMPLE) for testing the sharp null hypothesis that a pair of given nodes share the same membership profiles. In real applica… ▽ More

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: 71 pages, 4 figures

  21. arXiv:2206.06270  [pdf, other

    cs.LG stat.ML

    Near-Optimal Sample Complexity Bounds for Constrained MDPs

    Authors: Sharan Vaswani, Lin F. Yang, Csaba Szepesvári

    Abstract: In contrast to the advances in characterizing the sample complexity for solving Markov decision processes (MDPs), the optimal statistical complexity for solving constrained MDPs (CMDPs) remains unknown. We resolve this question by providing minimax upper and lower bounds on the sample complexity for learning near-optimal policies in a discounted CMDP with access to a generative model (simulator).… ▽ More

    Submitted 19 November, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: NeurIPS'22

  22. arXiv:2206.03985  [pdf, other

    cs.LG cs.CR stat.ML

    How unfair is private learning ?

    Authors: Amartya Sanyal, Yaxi Hu, Fanny Yang

    Abstract: As machine learning algorithms are deployed on sensitive data in critical decision making processes, it is becoming increasingly important that they are also private and fair. In this paper, we show that, when the data has a long-tailed structure, it is not possible to build accurate learning algorithms that are both private and results in higher accuracy on minority subpopulations. We further sho… ▽ More

    Submitted 24 December, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: Accepted as an Oral paper in UAI '2022, Major update on 23 Dec, 2022

  23. arXiv:2206.03718  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Interpretable Decision Rule Sets: A Submodular Optimization Approach

    Authors: Fan Yang, Kai He, Linxiao Yang, Hongxia Du, **gbang Yang, Bo Yang, Liang Sun

    Abstract: Rule sets are highly interpretable logical models in which the predicates for decision are expressed in disjunctive normal form (DNF, OR-of-ANDs), or, equivalently, the overall model comprises an unordered collection of if-then decision rules. In this paper, we consider a submodular optimization based approach for learning rule sets. The learning problem is framed as a subset selection task in whi… ▽ More

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2021 (Spotlight)

  24. arXiv:2206.00270  [pdf, ps, other

    cs.LG stat.ML

    Provably Efficient Lifelong Reinforcement Learning with Linear Function Approximation

    Authors: Sanae Amani, Lin F. Yang, Ching-An Cheng

    Abstract: We study lifelong reinforcement learning (RL) in a regret minimization setting of linear contextual Markov decision process (MDP), where the agent needs to learn a multi-task policy while solving a streaming sequence of tasks. We propose an algorithm, called UCB Lifelong Value Distillation (UCBlvd), that provably achieves sublinear regret for any sequence of tasks, which may be adaptively chosen b… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

  25. arXiv:2205.13170  [pdf, other

    cs.LG stat.ML

    Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost

    Authors: Sanae Amani, Tor Lattimore, András György, Lin F. Yang

    Abstract: We study distributed contextual linear bandits with stochastic contexts, where $N$ agents act cooperatively to solve a linear bandit-optimization problem with $d$-dimensional features over the course of $T$ rounds. For this problem, we derive the first ever information-theoretic lower bound $Ω(dN)$ on the communication cost of any algorithm that performs optimally in a regret minimization setup. W… ▽ More

    Submitted 7 December, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

  26. arXiv:2204.00492  [pdf, other

    cs.LG stat.ME

    Provable concept learning for interpretable predictions using variational autoencoders

    Authors: Armeen Taeb, Nicolo Ruggeri, Carina Schnuck, Fanny Yang

    Abstract: In safety-critical applications, practitioners are reluctant to trust neural networks when no interpretable explanations are available. Many attempts to provide such explanations revolve around pixel-based attributions or use previously known concepts. In this paper we aim to provide explanations by provably identifying \emph{high-level, previously unknown ground-truth concepts}. To this end, we p… ▽ More

    Submitted 22 July, 2022; v1 submitted 1 April, 2022; originally announced April 2022.

  27. arXiv:2203.06335  [pdf, other

    stat.ME

    Doubly Coupled Designs for Computer Experiments with both Qualitative and Quantitative Factors

    Authors: Feng Yang, C. Devon Lin, Yongdao Zhou, Yuanzhen He

    Abstract: Computer experiments with both qualitative and quantitative input variables occur frequently in many scientific and engineering applications. How to choose input settings for such experiments is an important issue for accurate statistical analysis, uncertainty quantification and decision making. Sliced Latin hypercube designs are the first systematic approach to address this issue. However, it com… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: Statistica Sinica (2021)

  28. arXiv:2203.03597  [pdf, other

    stat.ML cs.LG

    Fast Rates for Noisy Interpolation Require Rethinking the Effects of Inductive Bias

    Authors: Konstantin Donhauser, Nicolo Ruggeri, Stefan Stojanovic, Fanny Yang

    Abstract: Good generalization performance on high-dimensional data crucially hinges on a simple structure of the ground truth and a corresponding strong inductive bias of the estimator. Even though this intuition is valid for regularized models, in this paper we caution against a strong inductive bias for interpolation in the presence of noise: While a stronger inductive bias encourages a simpler structure… ▽ More

    Submitted 26 October, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

  29. arXiv:2203.02006  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    Why adversarial training can hurt robust accuracy

    Authors: Jacob Clarysse, Julia Hörrmann, Fanny Yang

    Abstract: Machine learning classifiers with high test accuracy often perform poorly under adversarial attacks. It is commonly believed that adversarial training alleviates this issue. In this paper, we demonstrate that, surprisingly, the opposite may be true -- Even though adversarial training helps when enough data is available, it may hurt robust generalization in the small sample size regime. We first pr… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

  30. arXiv:2111.05987  [pdf, other

    math.ST cs.IT cs.LG stat.ML

    Tight bounds for minimum l1-norm interpolation of noisy data

    Authors: Guillaume Wang, Konstantin Donhauser, Fanny Yang

    Abstract: We provide matching upper and lower bounds of order $σ^2/\log(d/n)$ for the prediction error of the minimum $\ell_1$-norm interpolator, a.k.a. basis pursuit. Our result is tight up to negligible terms when $d \gg n$, and is the first to imply asymptotic consistency of noisy minimum-norm interpolation for isotropic features and sparse ground truths. Our work complements the literature on "benign ov… ▽ More

    Submitted 7 March, 2022; v1 submitted 10 November, 2021; originally announced November 2021.

    Comments: 33 pages, 1 figure; accepted to AISTATS 2022

  31. arXiv:2111.00633  [pdf, ps, other

    cs.LG cs.AI cs.DS math.OC stat.ML

    Settling the Horizon-Dependence of Sample Complexity in Reinforcement Learning

    Authors: Yuanzhi Li, Ruosong Wang, Lin F. Yang

    Abstract: Recently there is a surge of interest in understanding the horizon-dependence of the sample complexity in reinforcement learning (RL). Notably, for an RL environment with horizon length $H$, previous work have shown that there is a probably approximately correct (PAC) algorithm that learns an $O(1)$-optimal policy using $\mathrm{polylog}(H)$ episodes of environment interactions when the number of… ▽ More

    Submitted 31 October, 2021; originally announced November 2021.

  32. arXiv:2108.02883  [pdf, other

    stat.ML cs.LG

    Interpolation can hurt robust generalization even when there is no noise

    Authors: Konstantin Donhauser, Alexandru Ţifrea, Michael Aerni, Reinhard Heckel, Fanny Yang

    Abstract: Numerous recent works show that overparameterization implicitly reduces variance for min-norm interpolators and max-margin classifiers. These findings suggest that ridge regularization has vanishing benefits in high dimensions. We challenge this narrative by showing that, even in the absence of noise, avoiding interpolation through ridge regularization can significantly improve generalization. We… ▽ More

    Submitted 16 December, 2021; v1 submitted 5 August, 2021; originally announced August 2021.

  33. arXiv:2107.11014  [pdf

    stat.ME stat.AP

    Post-Treatment Confounding in Causal Mediation Studies: A Cutting-Edge Problem and A Novel Solution via Sensitivity Analysis

    Authors: Guanglei Hong, Fan Yang, Xu Qin

    Abstract: In causal mediation studies that decompose an average treatment effect into a natural indirect effect (NIE) and a natural direct effect (NDE), examples of post-treatment confounding are abundant. Past research has generally considered it infeasible to adjust for a post-treatment confounder of the mediator-outcome relationship due to incomplete information: it is observed under the actual treatment… ▽ More

    Submitted 22 July, 2021; originally announced July 2021.

  34. arXiv:2106.07841  [pdf, other

    cs.LG stat.ML

    Randomized Exploration for Reinforcement Learning with General Value Function Approximation

    Authors: Haque Ishfaq, Qiwen Cui, Viet Nguyen, Alex Ayoub, Zhuoran Yang, Zhaoran Wang, Doina Precup, Lin F. Yang

    Abstract: We propose a model-free reinforcement learning algorithm inspired by the popular randomized least squares value iteration (RLSVI) algorithm as well as the optimism principle. Unlike existing upper-confidence-bound (UCB) based approaches, which are often computationally intractable, our algorithm drives exploration by simply perturbing the training data with judiciously chosen i.i.d. scalar noises.… ▽ More

    Submitted 25 October, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: 32 page, 5 figures, in Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021

  35. arXiv:2106.07203  [pdf, ps, other

    cs.LG cs.AI math.OC stat.ML

    Online Sub-Sampling for Reinforcement Learning with General Function Approximation

    Authors: Dingwen Kong, Ruslan Salakhutdinov, Ruosong Wang, Lin F. Yang

    Abstract: Most of the existing works for reinforcement learning (RL) with general function approximation (FA) focus on understanding the statistical complexity or regret bounds. However, the computation complexity of such approaches is far from being understood -- indeed, a simple optimization problem over the function class might be as well intractable. In this paper, we tackle this problem by establishing… ▽ More

    Submitted 18 April, 2023; v1 submitted 14 June, 2021; originally announced June 2021.

  36. arXiv:2106.06239  [pdf, ps, other

    cs.LG stat.ML

    Safe Reinforcement Learning with Linear Function Approximation

    Authors: Sanae Amani, Christos Thrampoulidis, Lin F. Yang

    Abstract: Safety in reinforcement learning has become increasingly important in recent years. Yet, existing solutions either fail to strictly avoid choosing unsafe actions, which may lead to catastrophic results in safety-critical systems, or fail to provide regret guarantees for settings where safety constraints need to be learned. In this paper, we address both problems by first modeling safety as an unkn… ▽ More

    Submitted 11 June, 2021; originally announced June 2021.

  37. arXiv:2106.03591  [pdf, other

    stat.ML cs.LG stat.ME

    Calibrating multi-dimensional complex ODE from noisy data via deep neural networks

    Authors: Kexuan Li, Fangfang Wang, Ruiqi Liu, Fan Yang, Zuofeng Shang

    Abstract: Ordinary differential equations (ODEs) are widely used to model complex dynamics that arises in biology, chemistry, engineering, finance, physics, etc. Calibration of a complicated ODE system using noisy data is generally very difficult. In this work, we propose a two-stage nonparametric approach to address this problem. We first extract the de-noised data and their higher order derivatives using… ▽ More

    Submitted 18 September, 2023; v1 submitted 7 June, 2021; originally announced June 2021.

  38. arXiv:2104.04244  [pdf, other

    math.ST cs.LG stat.ML

    How rotational invariance of common kernels prevents generalization in high dimensions

    Authors: Konstantin Donhauser, Mingqi Wu, Fanny Yang

    Abstract: Kernel ridge regression is well-known to achieve minimax optimal rates in low-dimensional settings. However, its behavior in high dimensions is much less understood. Recent work establishes consistency for kernel regression under certain assumptions on the ground truth function and the distribution of the input data. In this paper, we show that the rotational invariance property of commonly studie… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

  39. arXiv:2103.11559  [pdf, other

    cs.LG stat.ML

    Provably Correct Optimization and Exploration with Non-linear Policies

    Authors: Fei Feng, Wotao Yin, Alekh Agarwal, Lin F. Yang

    Abstract: Policy optimization methods remain a powerful workhorse in empirical Reinforcement Learning (RL), with a focus on neural policies that can easily reason over complex and continuous state and/or action spaces. Theoretical understanding of strategic exploration in policy-based methods with non-linear function approximation, however, is largely missing. In this paper, we address this question by desi… ▽ More

    Submitted 21 March, 2021; originally announced March 2021.

  40. arXiv:2102.12948  [pdf, ps, other

    cs.LG stat.ML

    Provably Breaking the Quadratic Error Compounding Barrier in Imitation Learning, Optimally

    Authors: Nived Rajaraman, Yanjun Han, Lin F. Yang, Kannan Ramchandran, Jiantao Jiao

    Abstract: We study the statistical limits of Imitation Learning (IL) in episodic Markov Decision Processes (MDPs) with a state space $\mathcal{S}$. We focus on the known-transition setting where the learner is provided a dataset of $N$ length-$H$ trajectories from a deterministic expert policy and knows the MDP transition. We establish an upper bound $O(|\mathcal{S}|H^{3/2}/N)$ for the suboptimality using t… ▽ More

    Submitted 25 February, 2021; originally announced February 2021.

    Comments: 30 pages, 2 figures

  41. arXiv:2101.00494  [pdf, ps, other

    cs.LG cs.AI stat.ML

    A Provably Efficient Algorithm for Linear Markov Decision Process with Low Switching Cost

    Authors: Minbo Gao, Tianle Xie, Simon S. Du, Lin F. Yang

    Abstract: Many real-world applications, such as those in medical domains, recommendation systems, etc, can be formulated as large state space reinforcement learning problems with only a small budget of the number of policy changes, i.e., low switching cost. This paper focuses on the linear Markov Decision Process (MDP) recently studied in [Yang et al 2019, ** et al 2020] where the linear function approxima… ▽ More

    Submitted 2 January, 2021; originally announced January 2021.

  42. arXiv:2011.14267  [pdf, ps, other

    cs.LG cs.GT stat.ML

    Minimax Sample Complexity for Turn-based Stochastic Game

    Authors: Qiwen Cui, Lin F. Yang

    Abstract: The empirical success of Multi-agent reinforcement learning is encouraging, while few theoretical guarantees have been revealed. In this work, we prove that the plug-in solver approach, probably the most natural reinforcement learning algorithm, achieves minimax sample complexity for turn-based stochastic game (TBSG). Specifically, we plan in an empirical TBSG by utilizing a `simulator' that allow… ▽ More

    Submitted 28 November, 2020; originally announced November 2020.

    Comments: 15 pages

  43. arXiv:2011.13034  [pdf, other

    cs.LG stat.ML

    Accommodating Picky Customers: Regret Bound and Exploration Complexity for Multi-Objective Reinforcement Learning

    Authors: **gfeng Wu, Vladimir Braverman, Lin F. Yang

    Abstract: In this paper we consider multi-objective reinforcement learning where the objectives are balanced using preferences. In practice, the preferences are often given in an adversarial manner, e.g., customers can be picky in many applications. We formalize this problem as an episodic learning problem on a Markov decision process, where transitions are unknown and a reward function is the inner product… ▽ More

    Submitted 27 October, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: NeurIPS 2021 Camera Ready Version

  44. arXiv:2010.11750  [pdf, other

    stat.ML cs.LG

    Precise High-Dimensional Asymptotics for Quantifying Heterogeneous Transfers

    Authors: Fan Yang, Hongyang R. Zhang, Sen Wu, Christopher Ré, Weijie J. Su

    Abstract: The problem of learning one task with samples from another task has received much interest recently. In this paper, we ask a fundamental question: when is combining data from two tasks better than learning one task alone? Intuitively, the transfer effect from one task to another task depends on dataset shifts such as sample sizes and covariance matrices. However, quantifying such a transfer effect… ▽ More

    Submitted 10 August, 2023; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: 64 pages, 6 figures; We thoroughly revised the paper by adding new results and reorganizing the presentation

  45. arXiv:2009.05990  [pdf, ps, other

    cs.LG cs.AI math.OC stat.ML

    Toward the Fundamental Limits of Imitation Learning

    Authors: Nived Rajaraman, Lin F. Yang, Jiantao Jiao, Kannan Ramachandran

    Abstract: Imitation learning (IL) aims to mimic the behavior of an expert policy in a sequential decision-making problem given only demonstrations. In this paper, we focus on understanding the minimax statistical limits of IL in episodic Markov Decision Processes (MDPs). We first consider the setting where the learner is provided a dataset of $N$ expert trajectories ahead of time, and cannot interact with t… ▽ More

    Submitted 13 September, 2020; originally announced September 2020.

    Comments: 45 pages, 3 figures

  46. arXiv:2008.06736  [pdf, other

    cs.LG stat.ML

    Obtaining Adjustable Regularization for Free via Iterate Averaging

    Authors: **gfeng Wu, Vladimir Braverman, Lin F. Yang

    Abstract: Regularization for optimization is a crucial technique to avoid overfitting in machine learning. In order to obtain the best performance, we usually train a model by tuning the regularization parameters. It becomes costly, however, when a single round of training takes significant amount of time. Very recently, Neu and Rosasco show that if we run stochastic gradient descent (SGD) on linear regress… ▽ More

    Submitted 15 August, 2020; originally announced August 2020.

    Comments: ICML 2020 camera ready

  47. A Calibration Approach to Transportability and Data-Fusion with Observational Data

    Authors: Kevin P. Josey, Fan Yang, Debashis Ghosh, Sridharan Raghavan

    Abstract: Two important considerations in clinical research studies are proper evaluations of internal and external validity. While randomized clinical trials can overcome several threats to internal validity, they may be prone to poor external validity. Conversely, large prospective observational studies sampled from a broadly generalizable population may be externally valid, yet susceptible to threats to… ▽ More

    Submitted 7 July, 2022; v1 submitted 14 August, 2020; originally announced August 2020.

  48. arXiv:2007.07461  [pdf, ps, other

    cs.LG cs.GT cs.MA math.OC stat.ML

    Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity

    Authors: Kaiqing Zhang, Sham M. Kakade, Tamer Başar, Lin F. Yang

    Abstract: Model-based reinforcement learning (RL), which finds an optimal policy using an empirical model, has long been recognized as one of the corner stones of RL. It is especially suitable for multi-agent RL (MARL), as it naturally decouples the learning and the planning phases, and avoids the non-stationarity problem when all agents are improving their policies simultaneously using samples. Though intu… ▽ More

    Submitted 8 August, 2023; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: Updated version accepted to Journal of Machine Learning Research (JMLR)

  49. arXiv:2006.13485  [pdf, other

    cs.LG stat.ML

    Fairness with Overlap** Groups

    Authors: Forest Yang, Moustapha Cisse, Sanmi Koyejo

    Abstract: In algorithmically fair prediction problems, a standard goal is to ensure the equality of fairness metrics across multiple overlap** groups simultaneously. We reconsider this standard fair classification problem using a probabilistic population analysis, which, in turn, reveals the Bayes-optimal classifier. Our approach unifies a variety of existing group-fair classification methods and enables… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

  50. arXiv:2006.11274  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    On Reward-Free Reinforcement Learning with Linear Function Approximation

    Authors: Ruosong Wang, Simon S. Du, Lin F. Yang, Ruslan Salakhutdinov

    Abstract: Reward-free reinforcement learning (RL) is a framework which is suitable for both the batch RL setting and the setting where there are many reward functions of interest. During the exploration phase, an agent collects samples without using a pre-specified reward function. After the exploration phase, a reward function is given, and the agent uses samples collected during the exploration phase to c… ▽ More

    Submitted 19 June, 2020; originally announced June 2020.