Showing 1–50 of 98 results for author: Blanchet, J

Searching in archive math.
  1. arXiv:2406.19619  [pdf, other]

    stat.ML cs.LG math.ST

    ScoreFusion: fusing score-based generative models via Kullback-Leibler barycenters

    Authors: Hao Liu, Junze Ye, Jose Blanchet, Nian Si

    Abstract: We study the problem of fusing pre-trained (auxiliary) generative models to enhance the training of a target generative model. We propose using KL-divergence weighted barycenters as an optimal fusion mechanism, in which the barycenter weights are optimally trained to minimize a suitable loss for the target population. While computing the optimal KL-barycenter weights can be challenging, we demonst…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 40 pages, 6 figures

  2. arXiv:2405.20435  [pdf, other]

    cs.LG math.PR stat.ML

    Deep Learning for Computing Convergence Rates of Markov Chains

    Authors: Yanlin Qu, Jose Blanchet, Peter Glynn

    Abstract: Convergence rate analysis for general state-space Markov chains is fundamentally important in areas such as Markov chain Monte Carlo and algorithmic analysis (for computing explicit convergence bounds). This problem, however, is notoriously difficult because traditional analytical methods often do not generate practically useful convergence bounds for realistic Markov chains. We propose the Deep C…

    Submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2405.03198  [pdf, other]

    stat.ML cs.LG math.OC

    Stability Evaluation via Distributional Perturbation Analysis

    Authors: Jose Blanchet, Peng Cui, Jiajin Li, Jiashuo Liu

    Abstract: The performance of learning models often deteriorates when deployed in out-of-sample environments. To ensure reliable deployment, we propose a stability evaluation criterion based on distributional perturbations. Conceptually, our stability evaluation criterion is defined as the minimal perturbation required on our observed dataset to induce a prescribed deterioration in risk evaluation. In this p…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML 2024

  4. arXiv:2404.19145  [pdf, other]

    stat.ME cs.LG econ.EM math.ST stat.ML

    Orthogonal Bootstrap: Efficient Simulation of Input Uncertainty

    Authors: Kaizhao Liu, Jose Blanchet, Lexing Ying, Yiping Lu

    Abstract: Bootstrap is a popular methodology for simulating input uncertainty. However, it can be computationally expensive when the number of samples is large. We propose a new approach called Orthogonal Bootstrap that reduces the number of required Monte Carlo replications. We decompose the target being simulated into two parts: the non-orthogonal part which has a closed-form result kno…

    Submitted 30 April, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  5. arXiv:2404.09064  [pdf, ps, other]

    math.PR

    On the First Passage Times of Branching Random Walks in $\mathbb R^d$

    Authors: Jose Blanchet, Wei Cai, Shaswat Mohanty, Zhenyuan Zhang

    Abstract: We study the first passage times of discrete-time branching random walks in ${\mathbb R}^d$ where $d\geq 1$. Here, the genealogy of the particles follows a supercritical Galton-Watson process. We provide asymptotics of the first passage times to a ball of radius one with a distance $x$ from the origin, conditioned upon survival. We provide explicitly the linear dominating term and the logarithmic…

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: 40 pages, 8 figures

    MSC Class: 60G70; 60J80; 60J85; 60G50

  6. arXiv:2404.01431  [pdf, other]

    stat.CO math.NA

    When are Unbiased Monte Carlo Estimators More Preferable than Biased Ones?

    Authors: Guanyang Wang, Jose Blanchet, Peter W. Glynn

    Abstract: Due to the potential benefits of parallelization, designing unbiased Monte Carlo estimators, primarily in the setting of randomized multilevel Monte Carlo, has recently become very popular in operations research and computational statistics. However, existing work primarily substantiates the benefits of unbiased estimators at an intuitive level or using empirical evaluations. The intuition being t…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 35 pages

  7. arXiv:2403.14067  [pdf, other]

    stat.ML cs.LG math.OC stat.ME

    Automatic Outlier Rectification via Optimal Transport

    Authors: Jose Blanchet, Jiajin Li, Markus Pelger, Greg Zanotti

    Abstract: In this paper, we propose a novel conceptual framework to detect outliers using optimal transport with a concave cost function. Conventional outlier detection approaches typically use a two-stage procedure: first, outliers are detected and removed, and then estimation is performed on the cleaned data. However, this approach does not inform outlier removal with the estimation task, leaving room for…

    Submitted 20 March, 2024; originally announced March 2024.

  8. arXiv:2401.12197  [pdf, other]

    math.PR

    Empirical martingale projections via the adapted Wasserstein distance

    Authors: Jose Blanchet, Johannes Wiesel, Erica Zhang, Zhenyuan Zhang

    Abstract: Given a collection of multidimensional pairs $\{(X_i,Y_i):1 \leq i\leq n\}$, we study the problem of projecting the associated suitably smoothed empirical measure onto the space of martingale couplings (i.e. distributions satisfying $\mathbb{E}[Y|X]=X$) using the adapted Wasserstein distance. We call the resulting distance the smoothed empirical martingale projection distance (SE-MPD), for which w…

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 55 pages, 7 figures

  9. arXiv:2401.05016  [pdf, other]

    math.ST

    Exploring first and second-order spatio-temporal structures of lightning strike impacts in the French Alps using subsampling

    Authors: Jean-François Coeurjolly, J Blanchet, Alexis Pellerin

    Abstract: We model cloud-to-ground lightning strike impacts in the French Alps over the period 2011-2021 (approximately 1.4 million events) using spatio-temporal point processes. We investigate first and higher-order structure for this point pattern and address the questions of homogeneity of the intensity function, first-order separability and dependence between events. The tuning of nonparametric metho…

    Submitted 11 January, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

  10. arXiv:2312.09862  [pdf, other]

    math.ST stat.ME

    Wasserstein-based Minimax Estimation of Dependence in Multivariate Regularly Varying Extremes

    Authors: Xuhui Zhang, Jose Blanchet, Youssef Marzouk, Viet Anh Nguyen, Sven Wang

    Abstract: We study minimax risk bounds for estimators of the spectral measure in multivariate linear factor models, where observations are linear combinations of regularly varying latent factors. Non-asymptotic convergence rates are derived for the multivariate Peak-over-Threshold estimator in terms of the $p$-th order Wasserstein distance, and information-theoretic lower bounds for the minimax risks are es…

    Submitted 15 December, 2023; originally announced December 2023.

  11. arXiv:2311.09018  [pdf, ps, other]

    cs.LG eess.SY math.OC stat.ML

    On the Foundation of Distributionally Robust Reinforcement Learning

    Authors: Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou

    Abstract: Motivated by the need for a robust policy in the face of environment shifts between training and deployment, we contribute to the theoretical foundation of distributionally robust reinforcement learning (DRRL). This is accomplished through a comprehensive modeling framework centered around distributionally robust Markov decision processes (DRMDPs). This framework obliges the decision maker to…

    Submitted 19 January, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  12. arXiv:2311.02423  [pdf, other]

    cs.GT cs.LG math.OC quant-ph

    Payoff-based learning with matrix multiplicative weights in quantum games

    Authors: Kyriakos Lotidis, Panayotis Mertikopoulos, Nicholas Bambos, Jose Blanchet

    Abstract: In this paper, we study the problem of learning in quantum games - and other classes of semidefinite games - with scalar, payoff-based feedback. For concreteness, we focus on the widely used matrix multiplicative weights (MMW) algorithm and, instead of requiring players to have full knowledge of the game (and/or each other's chosen states), we introduce a suite of minimal-information matrix multip…

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: 39 pages, 21 figures, 2 tables

    MSC Class: Primary 91A10; 91A26; 37N40; secondary 68Q32; 81Q93

  13. arXiv:2310.18551  [pdf, ps, other]

    math-ph math.PR

    Modeling Shortest Paths in Polymeric Networks using Spatial Branching Processes

    Authors: Zhenyuan Zhang, Shaswat Mohanty, Jose Blanchet, Wei Cai

    Abstract: Recent studies have established a connection between the macroscopic mechanical response of polymeric materials and the statistics of the shortest path (SP) length between distant nodes in the polymer network. Since these statistics can be costly to compute and difficult to study theoretically, we introduce a branching random walk (BRW) model to describe the SP statistics from the coarse-grained m…

    Submitted 30 March, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: 37 pages, 17 figures

  14. arXiv:2310.08833  [pdf, other]

    cs.LG math.OC stat.ML

    Optimal Sample Complexity for Average Reward Markov Decision Processes

    Authors: Shengbo Wang, Jose Blanchet, Peter Glynn

    Abstract: We resolve the open question regarding the sample complexity of policy learning for maximizing the long-run average reward associated with a uniformly ergodic Markov decision process (MDP), assuming a generative model. In this context, the existing literature provides a sample complexity upper bound of $\widetilde O(|S||A|t_{\text{mix}}^2 ε^{-2})$ and a lower bound of…

    Submitted 12 February, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  15. arXiv:2308.10341  [pdf, ps, other]

    math.PR math.OC

    Computable Bounds on Convergence of Markov Chains in Wasserstein Distance

    Authors: Yanlin Qu, Jose Blanchet, Peter Glynn

    Abstract: We introduce a unified framework to estimate the convergence of Markov chains to equilibrium using Wasserstein distance. The framework provides convergence bounds with various rates, ranging from polynomial to exponential, all derived from a single contractive drift condition. This approach removes the need for finding a specific set with drift outside and contraction inside. The convergence bound…

    Submitted 20 August, 2023; originally announced August 2023.

    MSC Class: 60J05

  16. arXiv:2308.05414  [pdf, other]

    math.OC stat.ML

    Unifying Distributionally Robust Optimization via Optimal Transport Theory

    Authors: Jose Blanchet, Daniel Kuhn, Jiajin Li, Bahar Taskesen

    Abstract: In the past few years, there has been considerable interest in two prominent approaches for Distributionally Robust Optimization (DRO): Divergence-based and Wasserstein-based methods. The divergence approach models misspecification in terms of likelihood ratios, while the latter models it through a measure of distance or cost in actual outcomes. Building upon these advances, this paper introduces…

    Submitted 10 August, 2023; originally announced August 2023.

  17. arXiv:2305.18420  [pdf, other]

    cs.LG math.OC stat.ML

    Sample Complexity of Variance-reduced Distributionally Robust Q-learning

    Authors: Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou

    Abstract: Dynamic decision making under distributional shifts is of fundamental interest in theory and applications of reinforcement learning: The distribution of the environment on which the data is collected can differ from that of the environment on which the model is deployed. This paper presents two novel model-free algorithms, namely the distributionally robust Q-learning and its variance-reduced coun…

    Submitted 28 May, 2023; originally announced May 2023.

  18. arXiv:2305.16527  [pdf, other]

    math.ST cs.IT math.NA stat.ML

    When can Regression-Adjusted Control Variates Help? Rare Events, Sobolev Embedding and Minimax Optimality

    Authors: Jose Blanchet, Haoxuan Chen, Yiping Lu, Lexing Ying

    Abstract: This paper studies the use of a machine learning-based estimator as a control variate for mitigating the variance of Monte Carlo sampling. Specifically, we seek to uncover the key factors that influence the efficiency of control variates in reducing variance. We examine a prototype estimation problem that involves simulating the moments of a Sobolev function based on observations obtained from (ra…

    Submitted 25 May, 2023; originally announced May 2023.

  19. arXiv:2305.09659  [pdf, ps, other]

    cs.LG cs.AI math.OC stat.ML

    Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage

    Authors: Jose Blanchet, Miao Lu, Tong Zhang, Han Zhong

    Abstract: In this paper, we study distributionally robust offline reinforcement learning (robust offline RL), which seeks to find an optimal policy purely from an offline dataset that can perform well in perturbed environments. Specifically, we propose a generic algorithm framework called Doubly Pessimistic Model-based Policy Optimization ($P^2MPO$), which features a novel combination of a flexible model est…

    Submitted 22 August, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: V2 adds results on robust offline Markov games

  20. arXiv:2303.14867  [pdf, ps, other]

    math.OC math.ST

    Statistical Limit Theorems in Distributionally Robust Optimization

    Authors: Jose Blanchet, Alexander Shapiro

    Abstract: The goal of this paper is to develop methodology for the systematic analysis of asymptotic statistical properties of data driven DRO formulations based on their corresponding non-DRO counterparts. We illustrate our approach in various settings, including both phi-divergence and Wasserstein uncertainty sets. Different types of asymptotic behaviors are obtained depending on the rate at which the unc…

    Submitted 26 March, 2023; originally announced March 2023.

    MSC Class: 90C15

  21. arXiv:2303.06595  [pdf, other]

    cs.CG cs.LG math.OC

    A Convergent Single-Loop Algorithm for Relaxation of Gromov-Wasserstein in Graph Data

    Authors: Jiajin Li, Jianheng Tang, Lemin Kong, Huikang Liu, Jia Li, Anthony Man-Cho So, Jose Blanchet

    Abstract: In this work, we present the Bregman Alternating Projected Gradient (BAPG) method, a single-loop algorithm that offers an approximate solution to the Gromov-Wasserstein (GW) distance. We introduce a novel relaxation technique that balances accuracy and computational efficiency, albeit with some compromises in the feasibility of the coupling map. Our analysis is based on the observation that the GW…

    Submitted 12 March, 2023; originally announced March 2023.

    Comments: Accepted by ICLR 2023

  22. arXiv:2302.07477  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Optimal Sample Complexity of Reinforcement Learning for Mixing Discounted Markov Decision Processes

    Authors: Shengbo Wang, Jose Blanchet, Peter Glynn

    Abstract: We consider the optimal sample complexity theory of tabular reinforcement learning (RL) for maximizing the infinite horizon discounted reward in a Markov decision process (MDP). Optimal worst-case complexity results have been developed for tabular RL problems in this setting, leading to a sample complexity dependence on $γ$ and $ε$ of the form $\tilde Θ((1-γ)^{-3}ε^{-2})$, where $γ$ denotes the di…

    Submitted 30 September, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

  23. arXiv:2212.12978  [pdf, other]

    math.OC cs.LG stat.ML

    Universal Gradient Descent Ascent Method for Nonconvex-Nonconcave Minimax Optimization

    Authors: Taoli Zheng, Linglingzhi Zhu, Anthony Man-Cho So, Jose Blanchet, Jiajin Li

    Abstract: Nonconvex-nonconcave minimax optimization has received intense attention over the last decade due to its broad applications in machine learning. Most existing algorithms rely on one-sided information, such as the convexity (resp. concavity) of the primal (resp. dual) functions, or other specific structures, such as the Polyak-Łojasiewicz (PŁ) and Kurdyka-Łojasiewicz (KŁ) conditions. However, verif…

    Submitted 30 October, 2023; v1 submitted 25 December, 2022; originally announced December 2022.

  24. arXiv:2211.15241  [pdf, other]

    econ.EM cs.LG math.OC stat.ML

    Synthetic Principal Component Design: Fast Covariate Balancing with Synthetic Controls

    Authors: Yiping Lu, Jiajin Li, Lexing Ying, Jose Blanchet

    Abstract: The optimal design of experiments typically involves solving an NP-hard combinatorial optimization problem. In this paper, we aim to develop a globally convergent and practically efficient optimization algorithm. Specifically, we consider a setting where the pre-treatment outcome data is available and the synthetic control estimator is invoked. The average treatment effect is estimated via the dif…

    Submitted 28 November, 2022; originally announced November 2022.

  25. arXiv:2210.01413  [pdf, other]

    math.OC cs.LG stat.ML

    Tikhonov Regularization is Optimal Transport Robust under Martingale Constraints

    Authors: Jiajin Li, Sirui Lin, Jose Blanchet, Viet Anh Nguyen

    Abstract: Distributionally robust optimization has been shown to offer a principled way to regularize learning models. In this paper, we find that Tikhonov regularization is distributionally robust in an optimal transport sense (i.e., if an adversary chooses distributions in a suitable optimal transport neighborhood of the empirical measure), provided that suitable martingale constraints are also imposed. F…

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: Accepted by NeurIPS 2022

  26. arXiv:2209.14430  [pdf, other]

    cs.LG econ.EM math.NA math.ST stat.ML

    Minimax Optimal Kernel Operator Learning via Multilevel Training

    Authors: Jikai Jin, Yiping Lu, Jose Blanchet, Lexing Ying

    Abstract: Learning mappings between infinite-dimensional function spaces has achieved empirical success in many disciplines of machine learning, including generative modeling, functional data analysis, causal inference, and multi-agent reinforcement learning. In this paper, we study the statistical limit of learning a Hilbert-Schmidt operator between two infinite-dimensional Sobolev reproducing kernel Hilbe…

    Submitted 24 July, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: ICLR 2023 spotlight

  27. arXiv:2205.13111  [pdf, other]

    math.OC math.PR

    Distributionally Robust Gaussian Process Regression and Bayesian Inverse Problems

    Authors: Xuhui Zhang, Jose Blanchet, Youssef Marzouk, Viet Anh Nguyen, Sven Wang

    Abstract: We study a distributionally robust optimization formulation (i.e., a min-max game) for two representative problems in Bayesian nonparametric estimation: Gaussian process regression and, more generally, linear inverse problems. Our formulation seeks the best mean-squared error predictor, in an infinite-dimensional space, against an adversary who chooses the worst-case model in a Wasserstein ball ar…

    Submitted 20 October, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

  28. arXiv:2205.07331  [pdf, other]

    math.NA cs.LG math.ST physics.comp-ph stat.ML

    Sobolev Acceleration and Statistical Optimality for Learning Elliptic Equations via Gradient Descent

    Authors: Yiping Lu, Jose Blanchet, Lexing Ying

    Abstract: In this paper, we study the statistical limits in terms of Sobolev norms of gradient descent for solving inverse problems from randomly sampled noisy observations using a general class of objective functions. Our class of objective functions includes Sobolev training for kernel regression, Deep Ritz Methods (DRM), and Physics Informed Neural Networks (PINN) for solving elliptic partial differential…

    Submitted 19 September, 2022; v1 submitted 15 May, 2022; originally announced May 2022.

  29. arXiv:2202.10799  [pdf, other]

    math.PR

    Large deviations asymptotics for unbounded additive functionals of diffusion processes

    Authors: Mihail Bazhba, Jose Blanchet, Roger J. A. Laeven, Bert Zwart

    Abstract: We study large deviations asymptotics for a class of unbounded additive functionals, interpreted as normalized accumulated areas, of one-dimensional Langevin diffusions with sub-linear gradient drifts. Our results provide parametric insights on the speed and the rate functions in terms of the growth rate of the drift and the growth rate of the additive functional. We find a critical value in terms…

    Submitted 20 October, 2023; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: In this revision, we have: fixed a mistake in the proof of Lemma 4.3; suppressed some elementary technical details; fixed some typos; and further improved the presentation

    MSC Class: 60F10 (Primary); 60J60 (Secondary)

  30. arXiv:2110.06897  [pdf, other]

    math.NA cs.LG math.ST physics.comp-ph stat.ML

    Machine Learning For Elliptic PDEs: Fast Rate Generalization Bound, Neural Scaling Law and Minimax Optimality

    Authors: Yiping Lu, Haoxuan Chen, Jianfeng Lu, Lexing Ying, Jose Blanchet

    Abstract: In this paper, we study the statistical limits of deep learning techniques for solving elliptic partial differential equations (PDEs) from random samples using the Deep Ritz Method (DRM) and Physics-Informed Neural Networks (PINNs). To simplify the problem, we focus on a prototype elliptic PDE: the Schrödinger equation on a hypercube with zero Dirichlet boundary condition, which has wide applicati…

    Submitted 12 November, 2021; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: add a Proof Sketch in Section 4.1

  31. arXiv:2109.14875  [pdf, other]

    stat.ML cs.LG math.OC

    Adversarial Regression with Doubly Non-negative Weighting Matrices

    Authors: Tam Le, Truyen Nguyen, Makoto Yamada, Jose Blanchet, Viet Anh Nguyen

    Abstract: Many machine learning tasks that involve predicting an output response can be solved by training a weighted regression model. Unfortunately, the predictive power of this type of model may severely deteriorate under low sample sizes or under covariate perturbations. Reweighting the training samples has arisen as an effective mitigation strategy for these problems. In this paper, we propose a novel…

    Submitted 30 September, 2021; originally announced September 2021.

    Comments: Accepted to the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS2021)

  32. arXiv:2108.02120  [pdf, other]

    math.ST cs.LG math.OC stat.ML

    Statistical Analysis of Wasserstein Distributionally Robust Estimators

    Authors: Jose Blanchet, Karthyek Murthy, Viet Anh Nguyen

    Abstract: We consider statistical methods which invoke a min-max distributionally robust formulation to extract good out-of-sample performance in data-driven optimization and learning problems. Acknowledging the distributional uncertainty in learning from limited samples, the min-max formulations introduce an adversarial inner player to explore unseen covariate data. The resulting Distributionally Robust Op…

    Submitted 4 August, 2021; originally announced August 2021.

  33. arXiv:2106.07191  [pdf, ps, other]

    math.PR math.OC math.ST q-fin.CP

    Distributionally Robust Martingale Optimal Transport

    Authors: Zhengqing Zhou, Jose Blanchet, Peter W. Glynn

    Abstract: We study the problem of bounding path-dependent expectations (within any finite time horizon $d$) over the class of discrete-time martingales whose marginal distributions lie within a prescribed tolerance of a given collection of benchmark marginal distributions. This problem is a relaxation of the martingale optimal transport (MOT) problem and is motivated by applications to super-hedging in fina…

    Submitted 29 November, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

  34. arXiv:2106.02263  [pdf, other]

    stat.CO math.PR q-fin.CP

    Unbiased Optimal Stopping via the MUSE

    Authors: Zhengqing Zhou, Guanyang Wang, Jose Blanchet, Peter W. Glynn

    Abstract: We propose a new unbiased estimator for estimating the utility of the optimal stopping problem. The MUSE, short for Multilevel Unbiased Stopping Estimator, constructs the unbiased Multilevel Monte Carlo (MLMC) estimator at every stage of the optimal stopping problem in a backward recursive way. In contrast to traditional sequential methods, the MUSE can be implemented in parallel. We prove the MUS…

    Submitted 26 December, 2022; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: 39 pages, add several numerical experiments and technical results, accepted by Stochastic Processes and their Applications

    MSC Class: 62C05; 60G40; 62L15

  35. arXiv:2106.01070  [pdf, ps, other]

    stat.ML cs.CY cs.LG math.ST

    Testing Group Fairness via Optimal Transport Projections

    Authors: Nian Si, Karthyek Murthy, Jose Blanchet, Viet Anh Nguyen

    Abstract: We present a statistical testing framework to detect if a given machine learning classifier fails to satisfy a wide range of group fairness notions. The proposed test is a flexible, interpretable, and statistically rigorous tool for auditing whether exhibited biases are intrinsic to the algorithm or due to the randomness in the data. The statistical challenges, which may arise from multiple impact…

    Submitted 2 June, 2021; originally announced June 2021.

    Journal ref: International Conference on Machine Learning 2021

  36. arXiv:2106.00322  [pdf, other]

    cs.LG math.OC stat.ML

    Sequential Domain Adaptation by Synthesizing Distributionally Robust Experts

    Authors: Bahar Taskesen, Man-Chung Yue, Jose Blanchet, Daniel Kuhn, Viet Anh Nguyen

    Abstract: Least squares estimators, when trained on a few target domain samples, may predict poorly. Supervised domain adaptation aims to improve the predictive accuracy by exploiting additional labeled training samples from a source distribution that is close to the target distribution. Given available data, we investigate novel strategies to synthesize a family of least squares estimator experts that are…

    Submitted 1 June, 2021; originally announced June 2021.

  37. arXiv:2103.16451  [pdf, other]

    q-fin.PM math.OC stat.ML

    Robustifying Conditional Portfolio Decisions via Optimal Transport

    Authors: Viet Anh Nguyen, Fan Zhang, Shanshan Wang, Jose Blanchet, Erick Delage, Yinyu Ye

    Abstract: We propose a data-driven portfolio selection model that integrates side information, conditional estimation and robustness using the framework of distributionally robust optimization. Conditioning on the observed side information, the portfolio manager solves an allocation problem that minimizes the worst-case conditional risk-return trade-off, subject to all possible perturbations of the covariat…

    Submitted 9 April, 2024; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: 1 figure

  38. arXiv:2010.05373  [pdf, other]

    stat.ML cs.LG math.ST

    Distributionally Robust Local Non-parametric Conditional Estimation

    Authors: Viet Anh Nguyen, Fan Zhang, Jose Blanchet, Erick Delage, Yinyu Ye

    Abstract: Conditional estimation given specific covariate values (i.e., local conditional estimation or functional estimation) is ubiquitously useful with applications in engineering, social and natural sciences. Existing data-driven non-parametric estimators mostly focus on structured homogeneous data (e.g., weakly independent and stationary data), thus they are sensitive to adversarial noise and may perfo…

    Submitted 11 October, 2020; originally announced October 2020.

  39. arXiv:2010.05321  [pdf, ps, other]

    stat.ML cs.LG math.ST

    Distributionally Robust Parametric Maximum Likelihood Estimation

    Authors: Viet Anh Nguyen, Xuhui Zhang, Jose Blanchet, Angelos Georghiou

    Abstract: We consider the parameter estimation problem of a probabilistic generative model prescribed using a natural exponential family of distributions. For this problem, the typical maximum likelihood estimator usually overfits under limited training sample size, is sensitive to noise and may perform poorly on downstream predictive tasks. To mitigate these issues, we propose a distributionally robust max…

    Submitted 11 October, 2020; originally announced October 2020.

  40. arXiv:2007.09320  [pdf, ps, other]

    q-fin.RM math.OC math.PR q-fin.MF

    Convolution Bounds on Quantile Aggregation

    Authors: Jose Blanchet, Henry Lam, Yang Liu, Ruodu Wang

    Abstract: Quantile aggregation with dependence uncertainty has a long history in probability theory with wide applications in finance, risk management, statistics, and operations research. Using a recent result on inf-convolution of quantile-based risk measures, we establish new analytical bounds for quantile aggregation which we call convolution bounds. Convolution bounds both unify every analytical result…

    Submitted 24 April, 2024; v1 submitted 17 July, 2020; originally announced July 2020.

  41. arXiv:2006.05630  [pdf, other]

    cs.LG math.OC math.ST stat.ML

    Distributionally Robust Batch Contextual Bandits

    Authors: Nian Si, Fan Zhang, Zhengyuan Zhou, Jose Blanchet

    Abstract: Policy learning using historical observational data is an important problem that has found widespread applications. Examples include selecting offers, prices, advertisements to send to customers, as well as selecting which medication to prescribe to a patient. However, existing literature rests on the crucial assumption that the future environment where the learned policy will be deployed is the s…

    Submitted 11 September, 2023; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: The short version has been accepted in ICML 2020

  42. arXiv:2003.14381  [pdf, other]

    math.PR

    Sample-path large deviations for unbounded additive functionals of the reflected random walk

    Authors: Mihail Bazhba, Jose Blanchet, Chang-Han Rhee, Bert Zwart

    Abstract: We prove a sample path large deviation principle (LDP) with sub-linear speed for unbounded functionals of certain Markov chains induced by the Lindley recursion. The LDP holds in the Skorokhod space $\mathbb{D}[0,T]$ equipped with the $M_1'$ topology. Our technique hinges on a suitable decomposition of the Markov chain in terms of regeneration cycles. Each regeneration cycle denotes the area accum…

    Submitted 30 September, 2023; v1 submitted 31 March, 2020; originally announced March 2020.

    MSC Class: 60F10 (Primary); 60G17 (Secondary)

  43. arXiv:2002.03205  [pdf, other]

    math.PR cs.SI econ.EM

    Asymptotically Optimal Control of a Centralized Dynamic Matching Market with General Utilities

    Authors: Jose H. Blanchet, Martin I. Reiman, Viragh Shah, Lawrence M. Wein, Linjia Wu

    Abstract: We consider a matching market where buyers and sellers arrive according to independent Poisson processes at the same rate and independently abandon the market if not matched after an exponential amount of time with the same mean. In this centralized market, the utility for the system manager from matching any buyer and any seller is a general random variable. We consider a sequence of systems inde…

    Submitted 10 June, 2021; v1 submitted 8 February, 2020; originally announced February 2020.

    Comments: 81 pages

    MSC Class: 90B50 (primary); 90B22 (secondary) ACM Class: G.3

  44. arXiv:2002.02149  [pdf, other]

    math.OC math.PR

    Efficient Scenario Generation for Heavy-tailed Chance Constrained Optimization

    Authors: Jose Blanchet, Fan Zhang, Bert Zwart

    Abstract: We consider a generic class of chance-constrained optimization problems with heavy-tailed (i.e., power-law type) risk factors. In this setting, we use the scenario approach to obtain a constant approximation to the optimal solution with a computational complexity that is uniform in the risk tolerance parameter. We additionally illustrate the efficiency of our algorithm in the context of solvency i…

    Submitted 7 May, 2023; v1 submitted 6 February, 2020; originally announced February 2020.

    Comments: 31 pages, 7 figures

  45. arXiv:2001.08384  [pdf, other]

    math.PR

    Efficient Steady-state Simulation of High-dimensional Stochastic Networks

    Authors: Jose Blanchet, Xinyun Chen, Peter Glynn, Nian Si

    Abstract: We propose and study an asymptotically optimal Monte Carlo estimator for steady-state expectations of a d-dimensional reflected Brownian motion. Our estimator is asymptotically optimal in the sense that it requires $\tilde{O}(d)$ (up to logarithmic factors in $d$) i.i.d. Gaussian random variables in order to output an estimate with a controlled error. Our construction is based on the analysis of a…

    Submitted 27 January, 2020; v1 submitted 23 January, 2020; originally announced January 2020.

  46. arXiv:1906.03317  [pdf, ps, other]

    stat.ML cs.LG math.OC math.ST

    Optimal Transport Relaxations with Application to Wasserstein GANs

    Authors: Saied Mahdian, Jose Blanchet, Peter Glynn

    Abstract: We propose a family of relaxations of the optimal transport problem which regularize the problem by introducing an additional minimization step over a small region around one of the underlying transporting measures. The type of regularization that we obtain is related to smoothing techniques studied in the optimization literature. When using our approach to estimate optimal transport costs based o…

    Submitted 7 June, 2019; originally announced June 2019.

  47. arXiv:1906.01614  [pdf, ps, other]

    math.ST math.OC stat.ML

    Confidence Regions in Wasserstein Distributionally Robust Estimation

    Authors: Jose Blanchet, Karthyek Murthy, Nian Si

    Abstract: Wasserstein distributionally robust optimization estimators are obtained as solutions of min-max problems in which the statistician selects a parameter minimizing the worst-case loss among all probability models within a certain distance (in a Wasserstein sense) from the underlying empirical measure. While motivated by the need to identify optimal model parameters or decision choices that are robu…

    Submitted 3 March, 2021; v1 submitted 4 June, 2019; originally announced June 2019.

  48. arXiv:1905.12231  [pdf, other]

    math.ST

    Multivariate Distributionally Robust Convex Regression under Absolute Error Loss

    Authors: Jose Blanchet, Peter W. Glynn, Jun Yan, Zhengqing Zhou

    Abstract: This paper proposes a novel non-parametric multidimensional convex regression estimator which is designed to be robust to adversarial perturbations in the empirical measure. We minimize over convex functions the maximum (over Wasserstein perturbations of the empirical measure) of the absolute regression errors. The inner maximization is solved in closed form resulting in a regularization penalty i…

    Submitted 25 July, 2020; v1 submitted 29 May, 2019; originally announced May 2019.

    Comments: v3. 17 pages, 2 figures

    MSC Class: 62H12; 62G20; 62G05

  49. arXiv:1905.07845  [pdf, other]

    stat.ML cs.LG math.OC

    A Distributionally Robust Boosting Algorithm

    Authors: Jose Blanchet, Yang Kang, Fan Zhang, Zhangyi Hu

    Abstract: Distributionally Robust Optimization (DRO) has been shown to provide a flexible framework for decision making under uncertainty and statistical estimation. For example, recent works in DRO have shown that popular statistical estimators can be interpreted as the solutions of suitably formulated data-driven DRO problems. In turn, this connection is used to optimally select tuning parameters in terms…

    Submitted 19 May, 2019; originally announced May 2019.

    Comments: 13 pages, 1 figure

  50. arXiv:1904.09929  [pdf, other]

    math.ST math.OC stat.CO

    Unbiased Multilevel Monte Carlo: Stochastic Optimization, Steady-state Simulation, Quantiles, and Other Applications

    Authors: Jose H. Blanchet, Peter W. Glynn, Yanan Pei

    Abstract: We present general principles for the design and analysis of unbiased Monte Carlo estimators in a wide range of settings. Our estimators possess finite work-normalized variance under mild regularity conditions. We apply our estimators to various settings of interest, including unbiased optimization in Sample Average Approximations, unbiased steady-state simulation of regenerative processes, quantil…

    Submitted 22 April, 2019; originally announced April 2019.

    Comments: 20 pages, 2 figures