Skip to main content

Showing 1–50 of 68 results for author: Blanchet, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.19619  [pdf, other

    stat.ML cs.LG math.ST

    ScoreFusion: fusing score-based generative models via Kullback-Leibler barycenters

    Authors: Hao Liu, Junze, Ye, Jose Blanchet, Nian Si

    Abstract: We study the problem of fusing pre-trained (auxiliary) generative models to enhance the training of a target generative model. We propose using KL-divergence weighted barycenters as an optimal fusion mechanism, in which the barycenter weights are optimally trained to minimize a suitable loss for the target population. While computing the optimal KL-barycenter weights can be challenging, we demonst… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 40 pages, 6 figures

  2. arXiv:2406.11281  [pdf, ps, other

    stat.ML cs.LG

    Statistical Learning of Distributionally Robust Stochastic Control in Continuous State Spaces

    Authors: Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou

    Abstract: We explore the control of stochastic systems with potentially continuous state and action spaces, characterized by the state dynamics $X_{t+1} = f(X_t, A_t, W_t)$. Here, $X$, $A$, and $W$ represent the state, action, and exogenous random noise processes, respectively, with $f$ denoting a known function that describes state transitions. Traditionally, the noise process $\{W_t, t \geq 0\}$ is assume… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2405.20435  [pdf, other

    cs.LG math.PR stat.ML

    Deep Learning for Computing Convergence Rates of Markov Chains

    Authors: Yanlin Qu, Jose Blanchet, Peter Glynn

    Abstract: Convergence rate analysis for general state-space Markov chains is fundamentally important in areas such as Markov chain Monte Carlo and algorithmic analysis (for computing explicit convergence bounds). This problem, however, is notoriously difficult because traditional analytical methods often do not generate practically useful convergence bounds for realistic Markov chains. We propose the Deep C… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  4. arXiv:2405.16436  [pdf, other

    cs.LG cs.AI stat.ML

    Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer

    Authors: Zhihan Liu, Miao Lu, Shenao Zhang, Boyi Liu, Hongyi Guo, Yingxiang Yang, Jose Blanchet, Zhaoran Wang

    Abstract: Aligning generative models with human preference via RLHF typically suffers from overoptimization, where an imperfectly learned reward model can misguide the generative model to output undesired responses. We investigate this problem in a principled manner by identifying the source of the misalignment as a form of distributional shift and uncertainty in learning human preferences. To mitigate over… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 27 pages, 7 figures

  5. arXiv:2405.15673  [pdf, other

    cs.LG cs.AI stat.ML

    Consistency of Neural Causal Partial Identification

    Authors: Jiyuan Tan, Jose Blanchet, Vasilis Syrgkanis

    Abstract: Recent progress in Neural Causal Models (NCMs) showcased how identification and partial identification of causal effects can be automatically carried out via training of neural generative models that respect the constraints encoded in a given causal graph [Xia et al. 2022, Balazadeh et al. 2022]. However, formal consistency of these methods has only been proven for the case of discrete variables o… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 37 pages, 8 figures

  6. arXiv:2405.03198  [pdf, other

    stat.ML cs.LG math.OC

    Stability Evaluation via Distributional Perturbation Analysis

    Authors: Jose Blanchet, Peng Cui, Jia** Li, Jiashuo Liu

    Abstract: The performance of learning models often deteriorates when deployed in out-of-sample environments. To ensure reliable deployment, we propose a stability evaluation criterion based on distributional perturbations. Conceptually, our stability evaluation criterion is defined as the minimal perturbation required on our observed dataset to induce a prescribed deterioration in risk evaluation. In this p… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML 2024

  7. arXiv:2404.19145  [pdf, other

    stat.ME cs.LG econ.EM math.ST stat.ML

    Orthogonal Bootstrap: Efficient Simulation of Input Uncertainty

    Authors: Kaizhao Liu, Jose Blanchet, Lexing Ying, Yi** Lu

    Abstract: Bootstrap is a popular methodology for simulating input uncertainty. However, it can be computationally expensive when the number of samples is large. We propose a new approach called \textbf{Orthogonal Bootstrap} that reduces the number of required Monte Carlo replications. We decomposes the target being simulated into two parts: the \textit{non-orthogonal part} which has a closed-form result kno… ▽ More

    Submitted 30 April, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  8. arXiv:2404.03578  [pdf, ps, other

    cs.LG stat.ML

    Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm

    Authors: Miao Lu, Han Zhong, Tong Zhang, Jose Blanchet

    Abstract: The sim-to-real gap, which represents the disparity between training and testing environments, poses a significant challenge in reinforcement learning (RL). A promising approach to addressing this challenge is distributionally robust RL, often framed as a robust Markov decision process (RMDP). In this framework, the objective is to find a robust policy that achieves good performance under the wors… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  9. arXiv:2404.01431  [pdf, other

    stat.CO math.NA

    When are Unbiased Monte Carlo Estimators More Preferable than Biased Ones?

    Authors: Guanyang Wang, Jose Blanchet, Peter W. Glynn

    Abstract: Due to the potential benefits of parallelization, designing unbiased Monte Carlo estimators, primarily in the setting of randomized multilevel Monte Carlo, has recently become very popular in operations research and computational statistics. However, existing work primarily substantiates the benefits of unbiased estimators at an intuitive level or using empirical evaluations. The intuition being t… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 35 pages

  10. arXiv:2403.14067  [pdf, other

    stat.ML cs.LG math.OC stat.ME

    Automatic Outlier Rectification via Optimal Transport

    Authors: Jose Blanchet, Jia** Li, Markus Pelger, Greg Zanotti

    Abstract: In this paper, we propose a novel conceptual framework to detect outliers using optimal transport with a concave cost function. Conventional outlier detection approaches typically use a two-stage procedure: first, outliers are detected and removed, and then estimation is performed on the cleaned data. However, this approach does not inform outlier removal with the estimation task, leaving room for… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  11. arXiv:2401.14655  [pdf, other

    stat.ME

    Distributionally Robust Optimization and Robust Statistics

    Authors: Jose Blanchet, Jia** Li, Sirui Lin, Xuhui Zhang

    Abstract: We review distributionally robust optimization (DRO), a principled approach for constructing statistical estimators that hedge against the impact of deviations in the expected loss between the training and deployment environments. Many well-known estimators in statistics and machine learning (e.g. AdaBoost, LASSO, ridge regression, dropout training, etc.) are distributionally robust in a precise s… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  12. arXiv:2312.09862  [pdf, other

    math.ST stat.ME

    Wasserstein-based Minimax Estimation of Dependence in Multivariate Regularly Varying Extremes

    Authors: Xuhui Zhang, Jose Blanchet, Youssef Marzouk, Viet Anh Nguyen, Sven Wang

    Abstract: We study minimax risk bounds for estimators of the spectral measure in multivariate linear factor models, where observations are linear combinations of regularly varying latent factors. Non-asymptotic convergence rates are derived for the multivariate Peak-over-Threshold estimator in terms of the $p$-th order Wasserstein distance, and information-theoretic lower bounds for the minimax risks are es… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  13. arXiv:2311.09018  [pdf, ps, other

    cs.LG eess.SY math.OC stat.ML

    On the Foundation of Distributionally Robust Reinforcement Learning

    Authors: Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou

    Abstract: Motivated by the need for a robust policy in the face of environment shifts between training and the deployment, we contribute to the theoretical foundation of distributionally robust reinforcement learning (DRRL). This is accomplished through a comprehensive modeling framework centered around distributionally robust Markov decision processes (DRMDPs). This framework obliges the decision maker to… ▽ More

    Submitted 19 January, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  14. arXiv:2310.08833  [pdf, other

    cs.LG math.OC stat.ML

    Optimal Sample Complexity for Average Reward Markov Decision Processes

    Authors: Shengbo Wang, Jose Blanchet, Peter Glynn

    Abstract: We resolve the open question regarding the sample complexity of policy learning for maximizing the long-run average reward associated with a uniformly ergodic Markov decision process (MDP), assuming a generative model. In this context, the existing literature provides a sample complexity upper bound of $\widetilde O(|S||A|t_{\text{mix}}^2 ε^{-2})$ and a lower bound of… ▽ More

    Submitted 12 February, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  15. arXiv:2308.05414  [pdf, other

    math.OC stat.ML

    Unifying Distributionally Robust Optimization via Optimal Transport Theory

    Authors: Jose Blanchet, Daniel Kuhn, Jia** Li, Bahar Taskesen

    Abstract: In the past few years, there has been considerable interest in two prominent approaches for Distributionally Robust Optimization (DRO): Divergence-based and Wasserstein-based methods. The divergence approach models misspecification in terms of likelihood ratios, while the latter models it through a measure of distance or cost in actual outcomes. Building upon these advances, this paper introduces… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  16. arXiv:2305.18420  [pdf, other

    cs.LG math.OC stat.ML

    Sample Complexity of Variance-reduced Distributionally Robust Q-learning

    Authors: Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou

    Abstract: Dynamic decision making under distributional shifts is of fundamental interest in theory and applications of reinforcement learning: The distribution of the environment on which the data is collected can differ from that of the environment on which the model is deployed. This paper presents two novel model-free algorithms, namely the distributionally robust Q-learning and its variance-reduced coun… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

  17. arXiv:2305.16527  [pdf, other

    math.ST cs.IT math.NA stat.ML

    When can Regression-Adjusted Control Variates Help? Rare Events, Sobolev Embedding and Minimax Optimality

    Authors: Jose Blanchet, Haoxuan Chen, Yi** Lu, Lexing Ying

    Abstract: This paper studies the use of a machine learning-based estimator as a control variate for mitigating the variance of Monte Carlo sampling. Specifically, we seek to uncover the key factors that influence the efficiency of control variates in reducing variance. We examine a prototype estimation problem that involves simulating the moments of a Sobolev function based on observations obtained from (ra… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  18. arXiv:2305.09659  [pdf, ps, other

    cs.LG cs.AI math.OC stat.ML

    Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage

    Authors: Jose Blanchet, Miao Lu, Tong Zhang, Han Zhong

    Abstract: In this paper, we study distributionally robust offline reinforcement learning (robust offline RL), which seeks to find an optimal policy purely from an offline dataset that can perform well in perturbed environments. In specific, we propose a generic algorithm framework called Doubly Pessimistic Model-based Policy Optimization ($P^2MPO$), which features a novel combination of a flexible model est… ▽ More

    Submitted 22 August, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: V2 adds results on robust offline Markov games

  19. arXiv:2302.13203  [pdf, other

    cs.LG stat.ML

    A Finite Sample Complexity Bound for Distributionally Robust Q-learning

    Authors: Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou

    Abstract: We consider a reinforcement learning setting in which the deployment environment is different from the training environment. Applying a robust Markov decision processes formulation, we extend the distributionally robust $Q$-learning framework studied in Liu et al. [2022]. Further, we improve the design and analysis of their multi-level Monte Carlo estimator. Assuming access to a simulator, we prov… ▽ More

    Submitted 2 March, 2023; v1 submitted 25 February, 2023; originally announced February 2023.

    Comments: Accepted by AISTATS 2023

  20. arXiv:2302.07477  [pdf, ps, other

    cs.LG math.OC stat.ML

    Optimal Sample Complexity of Reinforcement Learning for Mixing Discounted Markov Decision Processes

    Authors: Shengbo Wang, Jose Blanchet, Peter Glynn

    Abstract: We consider the optimal sample complexity theory of tabular reinforcement learning (RL) for maximizing the infinite horizon discounted reward in a Markov decision process (MDP). Optimal worst-case complexity results have been developed for tabular RL problems in this setting, leading to a sample complexity dependence on $γ$ and $ε$ of the form $\tilde Θ((1-γ)^{-3}ε^{-2})$, where $γ$ denotes the di… ▽ More

    Submitted 30 September, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

  21. arXiv:2301.11721  [pdf, other

    stat.ML cs.AI cs.LG

    Single-Trajectory Distributionally Robust Reinforcement Learning

    Authors: Zhipeng Liang, Xiaoteng Ma, Jose Blanchet, Jiheng Zhang, Zhengyuan Zhou

    Abstract: As a framework for sequential decision-making, Reinforcement Learning (RL) has been regarded as an essential component leading to Artificial General Intelligence (AGI). However, RL is often criticized for having the same training environment as the test one, which also hinders its application in the real world. To mitigate this problem, Distributionally Robust RL (DRRL) is proposed to improve the… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: First two authors contribute equally

  22. arXiv:2212.12978  [pdf, other

    math.OC cs.LG stat.ML

    Universal Gradient Descent Ascent Method for Nonconvex-Nonconcave Minimax Optimization

    Authors: Taoli Zheng, Linglingzhi Zhu, Anthony Man-Cho So, Jose Blanchet, Jia** Li

    Abstract: Nonconvex-nonconcave minimax optimization has received intense attention over the last decade due to its broad applications in machine learning. Most existing algorithms rely on one-sided information, such as the convexity (resp. concavity) of the primal (resp. dual) functions, or other specific structures, such as the Polyak-Łojasiewicz (PŁ) and Kurdyka-Łojasiewicz (KŁ) conditions. However, verif… ▽ More

    Submitted 30 October, 2023; v1 submitted 25 December, 2022; originally announced December 2022.

  23. arXiv:2211.15241  [pdf, other

    econ.EM cs.LG math.OC stat.ML

    Synthetic Principal Component Design: Fast Covariate Balancing with Synthetic Controls

    Authors: Yi** Lu, Jia** Li, Lexing Ying, Jose Blanchet

    Abstract: The optimal design of experiments typically involves solving an NP-hard combinatorial optimization problem. In this paper, we aim to develop a globally convergent and practically efficient optimization algorithm. Specifically, we consider a setting where the pre-treatment outcome data is available and the synthetic control estimator is invoked. The average treatment effect is estimated via the dif… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  24. arXiv:2210.01413  [pdf, other

    math.OC cs.LG stat.ML

    Tikhonov Regularization is Optimal Transport Robust under Martingale Constraints

    Authors: Jia** Li, Sirui Lin, Jose Blanchet, Viet Anh Nguyen

    Abstract: Distributionally robust optimization has been shown to offer a principled way to regularize learning models. In this paper, we find that Tikhonov regularization is distributionally robust in an optimal transport sense (i.e., if an adversary chooses distributions in a suitable optimal transport neighborhood of the empirical measure), provided that suitable martingale constraints are also imposed. F… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: Accepted by NeurIPS 2022

  25. arXiv:2209.14430  [pdf, other

    cs.LG econ.EM math.NA math.ST stat.ML

    Minimax Optimal Kernel Operator Learning via Multilevel Training

    Authors: Jikai **, Yi** Lu, Jose Blanchet, Lexing Ying

    Abstract: Learning map**s between infinite-dimensional function spaces has achieved empirical success in many disciplines of machine learning, including generative modeling, functional data analysis, causal inference, and multi-agent reinforcement learning. In this paper, we study the statistical limit of learning a Hilbert-Schmidt operator between two infinite-dimensional Sobolev reproducing kernel Hilbe… ▽ More

    Submitted 24 July, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: ICLR 2023 spotlight

  26. arXiv:2209.06620  [pdf, other

    cs.LG cs.AI stat.ML

    Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation

    Authors: Xiaoteng Ma, Zhipeng Liang, Jose Blanchet, Mingwen Liu, Li Xia, Jiheng Zhang, Qianchuan Zhao, Zhengyuan Zhou

    Abstract: Among the reasons hindering reinforcement learning (RL) applications to real-world problems, two factors are critical: limited data and the mismatch between the testing environment (real environment in which the policy is deployed) and the training environment (e.g., a simulator). This paper attempts to address these issues simultaneously with distributionally robust offline RL, where we learn a d… ▽ More

    Submitted 27 January, 2023; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: First two authors contribute equally

  27. arXiv:2205.07331  [pdf, other

    math.NA cs.LG math.ST physics.comp-ph stat.ML

    Sobolev Acceleration and Statistical Optimality for Learning Elliptic Equations via Gradient Descent

    Authors: Yi** Lu, Jose Blanchet, Lexing Ying

    Abstract: In this paper, we study the statistical limits in terms of Sobolev norms of gradient descent for solving inverse problem from randomly sampled noisy observations using a general class of objective functions. Our class of objective functions includes Sobolev training for kernel regression, Deep Ritz Methods (DRM), and Physics Informed Neural Networks (PINN) for solving elliptic partial differential… ▽ More

    Submitted 19 September, 2022; v1 submitted 15 May, 2022; originally announced May 2022.

  28. arXiv:2202.11685  [pdf, other

    cs.LG stat.ME stat.ML

    A Class of Geometric Structures in Transfer Learning: Minimax Bounds and Optimality

    Authors: Xuhui Zhang, Jose Blanchet, Soumyadip Ghosh, Mark S. Squillante

    Abstract: We study the problem of transfer learning, observing that previous efforts to understand its information-theoretic limits do not fully exploit the geometric structure of the source and target domains. In contrast, our study first illustrates the benefits of incorporating a natural geometric structure within a linear regression model, which corresponds to the generalized eigenvalue problem formed b… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

    Comments: AISTATS 2022

  29. arXiv:2202.06383  [pdf, other

    cs.LG stat.AP

    Surgical Scheduling via Optimization and Machine Learning with Long-Tailed Data

    Authors: Yuan Shi, Saied Mahdian, Jose Blanchet, Peter Glynn, Andrew Y. Shin, David Scheinker

    Abstract: Using data from cardiovascular surgery patients with long and highly variable post-surgical lengths of stay (LOS), we develop a modeling framework to reduce recovery unit congestion. We estimate the LOS and its probability distribution using machine learning models, schedule procedures on a rolling basis using a variety of optimization models, and estimate performance with simulation. The machine… ▽ More

    Submitted 28 November, 2022; v1 submitted 13 February, 2022; originally announced February 2022.

  30. arXiv:2202.00871  [pdf, other

    stat.ME q-fin.MF

    Bayesian Imputation with Optimal Look-Ahead-Bias and Variance Tradeoff

    Authors: Jose Blanchet, Fernando Hernandez, Viet Anh Nguyen, Markus Pelger, Xuhui Zhang

    Abstract: Missing time-series data is a prevalent problem in many prescriptive analytics models in operations management, healthcare and finance. Imputation methods for time-series data are usually applied to the full panel data with the purpose of training a prescriptive model for a downstream out-of-sample task. For example, the imputation of missing asset returns may be applied before estimating an optim… ▽ More

    Submitted 11 April, 2023; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: This work merges and supersedes arXiv:2102.12736

  31. arXiv:2110.06897  [pdf, other

    math.NA cs.LG math.ST physics.comp-ph stat.ML

    Machine Learning For Elliptic PDEs: Fast Rate Generalization Bound, Neural Scaling Law and Minimax Optimality

    Authors: Yi** Lu, Haoxuan Chen, Jianfeng Lu, Lexing Ying, Jose Blanchet

    Abstract: In this paper, we study the statistical limits of deep learning techniques for solving elliptic partial differential equations (PDEs) from random samples using the Deep Ritz Method (DRM) and Physics-Informed Neural Networks (PINNs). To simplify the problem, we focus on a prototype elliptic PDE: the Schrödinger equation on a hypercube with zero Dirichlet boundary condition, which has wide applicati… ▽ More

    Submitted 12 November, 2021; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: add a proof Proof Sketch in section 4.1

  32. arXiv:2109.14875  [pdf, other

    stat.ML cs.LG math.OC

    Adversarial Regression with Doubly Non-negative Weighting Matrices

    Authors: Tam Le, Truyen Nguyen, Makoto Yamada, Jose Blanchet, Viet Anh Nguyen

    Abstract: Many machine learning tasks that involve predicting an output response can be solved by training a weighted regression model. Unfortunately, the predictive power of this type of models may severely deteriorate under low sample sizes or under covariate perturbations. Reweighting the training samples has aroused as an effective mitigation strategy to these problems. In this paper, we propose a novel… ▽ More

    Submitted 30 September, 2021; originally announced September 2021.

    Comments: Accepted to the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS2021)

  33. arXiv:2108.02120  [pdf, other

    math.ST cs.LG math.OC stat.ML

    Statistical Analysis of Wasserstein Distributionally Robust Estimators

    Authors: Jose Blanchet, Karthyek Murthy, Viet Anh Nguyen

    Abstract: We consider statistical methods which invoke a min-max distributionally robust formulation to extract good out-of-sample performance in data-driven optimization and learning problems. Acknowledging the distributional uncertainty in learning from limited samples, the min-max formulations introduce an adversarial inner player to explore unseen covariate data. The resulting Distributionally Robust Op… ▽ More

    Submitted 4 August, 2021; originally announced August 2021.

  34. arXiv:2106.02263  [pdf, other

    stat.CO math.PR q-fin.CP

    Unbiased Optimal Stop** via the MUSE

    Authors: Zhengqing Zhou, Guanyang Wang, Jose Blanchet, Peter W. Glynn

    Abstract: We propose a new unbiased estimator for estimating the utility of the optimal stop** problem. The MUSE, short for Multilevel Unbiased Stop** Estimator, constructs the unbiased Multilevel Monte Carlo (MLMC) estimator at every stage of the optimal stop** problem in a backward recursive way. In contrast to traditional sequential methods, the MUSE can be implemented in parallel. We prove the MUS… ▽ More

    Submitted 26 December, 2022; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: 39 pages, add several numerical experiments and technical results, accepted by Stochastic Processes and their Applications

    MSC Class: 62C05; 60G40; 62L15

  35. arXiv:2106.01070  [pdf, ps, other

    stat.ML cs.CY cs.LG math.ST

    Testing Group Fairness via Optimal Transport Projections

    Authors: Nian Si, Karthyek Murthy, Jose Blanchet, Viet Anh Nguyen

    Abstract: We present a statistical testing framework to detect if a given machine learning classifier fails to satisfy a wide range of group fairness notions. The proposed test is a flexible, interpretable, and statistically rigorous tool for auditing whether exhibited biases are intrinsic to the algorithm or due to the randomness in the data. The statistical challenges, which may arise from multiple impact… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Journal ref: International Conference on Machine Learning 2021

  36. arXiv:2106.00322  [pdf, other

    cs.LG math.OC stat.ML

    Sequential Domain Adaptation by Synthesizing Distributionally Robust Experts

    Authors: Bahar Taskesen, Man-Chung Yue, Jose Blanchet, Daniel Kuhn, Viet Anh Nguyen

    Abstract: Least squares estimators, when trained on a few target domain samples, may predict poorly. Supervised domain adaptation aims to improve the predictive accuracy by exploiting additional labeled training samples from a source distribution that is close to the target distribution. Given available data, we investigate novel strategies to synthesize a family of least squares estimator experts that are… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

  37. arXiv:2105.05352  [pdf, ps, other

    stat.CO

    Frank-Wolfe Methods in Probability Space

    Authors: Carson Kent, Jose Blanchet, Peter Glynn

    Abstract: We introduce a new class of Frank-Wolfe algorithms for minimizing differentiable functionals over probability measures. This framework can be shown to encompass a diverse range of tasks in areas such as artificial intelligence, reinforcement learning, and optimization. Concrete computational complexities for these algorithms are established and demonstrate that these methods enjoy convergence in r… ▽ More

    Submitted 11 May, 2021; originally announced May 2021.

  38. arXiv:2103.16451  [pdf, other

    q-fin.PM math.OC stat.ML

    Robustifying Conditional Portfolio Decisions via Optimal Transport

    Authors: Viet Anh Nguyen, Fan Zhang, Shanshan Wang, Jose Blanchet, Erick Delage, Yinyu Ye

    Abstract: We propose a data-driven portfolio selection model that integrates side information, conditional estimation and robustness using the framework of distributionally robust optimization. Conditioning on the observed side information, the portfolio manager solves an allocation problem that minimizes the worst-case conditional risk-return trade-off, subject to all possible perturbations of the covariat… ▽ More

    Submitted 9 April, 2024; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: 1 figure

  39. arXiv:2102.12736   

    stat.ML cs.LG

    Time-Series Imputation with Wasserstein Interpolation for Optimal Look-Ahead-Bias and Variance Tradeoff

    Authors: Jose Blanchet, Fernando Hernandez, Viet Anh Nguyen, Markus Pelger, Xuhui Zhang

    Abstract: Missing time-series data is a prevalent practical problem. Imputation methods in time-series data often are applied to the full panel data with the purpose of training a model for a downstream out-of-sample task. For example, in finance, imputation of missing returns may be applied prior to training a portfolio optimization model. Unfortunately, this practice may result in a look-ahead-bias in the… ▽ More

    Submitted 11 April, 2023; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: This paper has been superseded by arXiv:2202.00871

  40. arXiv:2102.09042  [pdf, other

    stat.ML cs.LG stat.CO

    Modeling Extremes with d-max-decreasing Neural Networks

    Authors: Ali Hasan, Khalil Elkhalil, Yuting Ng, Joao M. Pereira, Sina Farsiu, Jose H. Blanchet, Vahid Tarokh

    Abstract: We propose a novel neural network architecture that enables non-parametric calibration and generation of multivariate extreme value distributions (MEVs). MEVs arise from Extreme Value Theory (EVT) as the necessary class of models when extrapolating a distributional fit over large spatial and temporal scales based on data observed in intermediate scales. In turn, EVT dictates that $d$-max-decreasin… ▽ More

    Submitted 1 March, 2022; v1 submitted 17 February, 2021; originally announced February 2021.

  41. arXiv:2012.04800  [pdf, other

    cs.LG cs.CY stat.ML

    A Statistical Test for Probabilistic Fairness

    Authors: Bahar Taskesen, Jose Blanchet, Daniel Kuhn, Viet Anh Nguyen

    Abstract: Algorithms are now routinely used to make consequential decisions that affect human lives. Examples include college admissions, medical interventions or law enforcement. While algorithms empower us to harness all information hidden in vast amounts of data, they may inadvertently amplify existing biases in the available datasets. This concern has sparked increasing interest in fair machine learning… ▽ More

    Submitted 8 December, 2020; originally announced December 2020.

  42. arXiv:2010.05373  [pdf, other

    stat.ML cs.LG math.ST

    Distributionally Robust Local Non-parametric Conditional Estimation

    Authors: Viet Anh Nguyen, Fan Zhang, Jose Blanchet, Erick Delage, Yinyu Ye

    Abstract: Conditional estimation given specific covariate values (i.e., local conditional estimation or functional estimation) is ubiquitously useful with applications in engineering, social and natural sciences. Existing data-driven non-parametric estimators mostly focus on structured homogeneous data (e.g., weakly independent and stationary data), thus they are sensitive to adversarial noise and may perfo… ▽ More

    Submitted 11 October, 2020; originally announced October 2020.

  43. arXiv:2010.05321  [pdf, ps, other

    stat.ML cs.LG math.ST

    Distributionally Robust Parametric Maximum Likelihood Estimation

    Authors: Viet Anh Nguyen, Xuhui Zhang, Jose Blanchet, Angelos Georghiou

    Abstract: We consider the parameter estimation problem of a probabilistic generative model prescribed using a natural exponential family of distributions. For this problem, the typical maximum likelihood estimator usually overfits under limited training sample size, is sensitive to noise and may perform poorly on downstream predictive tasks. To mitigate these issues, we propose a distributionally robust max… ▽ More

    Submitted 11 October, 2020; originally announced October 2020.

  44. arXiv:2009.06111  [pdf, other

    stat.ML cs.LG

    Machine Learning's Dropout Training is Distributionally Robust Optimal

    Authors: Jose Blanchet, Yang Kang, Jose Luis Montiel Olea, Viet Anh Nguyen, Xuhui Zhang

    Abstract: This paper shows that dropout training in Generalized Linear Models is the minimax solution of a two-player, zero-sum game where an adversarial nature corrupts a statistician's covariates using a multiplicative nonparametric errors-in-variables model. In this game, nature's least favorable distribution is dropout noise, where nature independently deletes entries of the covariate vector with some f… ▽ More

    Submitted 14 April, 2021; v1 submitted 13 September, 2020; originally announced September 2020.

  45. arXiv:2007.09530  [pdf, other

    cs.LG stat.ML

    A Distributionally Robust Approach to Fair Classification

    Authors: Bahar Taskesen, Viet Anh Nguyen, Daniel Kuhn, Jose Blanchet

    Abstract: We propose a distributionally robust logistic regression model with an unfairness penalty that prevents discrimination with respect to sensitive attributes such as gender or ethnicity. This model is equivalent to a tractable convex optimization problem if a Wasserstein ball centered at the empirical distribution on the training data is used to model distributional uncertainty and if a new convex u… ▽ More

    Submitted 18 July, 2020; originally announced July 2020.

  46. arXiv:2007.04458  [pdf, other

    cs.LG stat.ML

    Robust Bayesian Classification Using an Optimistic Score Ratio

    Authors: Viet Anh Nguyen, Nian Si, Jose Blanchet

    Abstract: We build a Bayesian contextual classification model using an optimistic score ratio for robust binary classification when there is limited information on the class-conditional, or contextual, distribution. The optimistic score searches for the distribution that is most plausible to explain the observed outcomes in the testing sample among all distributions belonging to the contextual ambiguity set… ▽ More

    Submitted 8 July, 2020; originally announced July 2020.

  47. arXiv:2006.05630  [pdf, other

    cs.LG math.OC math.ST stat.ML

    Distributionally Robust Batch Contextual Bandits

    Authors: Nian Si, Fan Zhang, Zhengyuan Zhou, Jose Blanchet

    Abstract: Policy learning using historical observational data is an important problem that has found widespread applications. Examples include selecting offers, prices, advertisements to send to customers, as well as selecting which medication to prescribe to a patient. However, existing literature rests on the crucial assumption that the future environment where the learned policy will be deployed is the s… ▽ More

    Submitted 11 September, 2023; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: The short version has been accepted in ICML 2020

  48. arXiv:2004.06321  [pdf, ps, other

    cs.LG cs.IT stat.ML

    Sequential Batch Learning in Finite-Action Linear Contextual Bandits

    Authors: Yanjun Han, Zhengqing Zhou, Zhengyuan Zhou, Jose Blanchet, Peter W. Glynn, Yinyu Ye

    Abstract: We study the sequential batch learning problem in linear contextual bandits with finite action sets, where the decision maker is constrained to split incoming individuals into (at most) a fixed number of batches and can only observe outcomes for the individuals within a batch at the batch's end. Compared to both standard online contextual bandits learning or offline policy learning in contexutal b… ▽ More

    Submitted 14 April, 2020; originally announced April 2020.

  49. arXiv:2003.05174  [pdf, ps, other

    cs.LG stat.ML

    Delay-Adaptive Learning in Generalized Linear Contextual Bandits

    Authors: Jose Blanchet, Renyuan Xu, Zhengyuan Zhou

    Abstract: In this paper, we consider online learning in generalized linear contextual bandits where rewards are not immediately observed. Instead, rewards are available to the decision-maker only after some delay, which is unknown and stochastic. We study the performance of two well-known algorithms adapted to this delayed setting: one based on upper confidence bounds, and the other based on Thompson sampli… ▽ More

    Submitted 11 March, 2020; originally announced March 2020.

  50. arXiv:1906.03317  [pdf, ps, other

    stat.ML cs.LG math.OC math.ST

    Optimal Transport Relaxations with Application to Wasserstein GANs

    Authors: Saied Mahdian, Jose Blanchet, Peter Glynn

    Abstract: We propose a family of relaxations of the optimal transport problem which regularize the problem by introducing an additional minimization step over a small region around one of the underlying transporting measures. The type of regularization that we obtain is related to smoothing techniques studied in the optimization literature. When using our approach to estimate optimal transport costs based o… ▽ More

    Submitted 7 June, 2019; originally announced June 2019.