Skip to main content

Showing 1–13 of 13 results for author: Si, N

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.19619  [pdf, other

    stat.ML cs.LG math.ST

    ScoreFusion: fusing score-based generative models via Kullback-Leibler barycenters

    Authors: Hao Liu, Junze, Ye, Jose Blanchet, Nian Si

    Abstract: We study the problem of fusing pre-trained (auxiliary) generative models to enhance the training of a target generative model. We propose using KL-divergence weighted barycenters as an optimal fusion mechanism, in which the barycenter weights are optimally trained to minimize a suitable loss for the target population. While computing the optimal KL-barycenter weights can be challenging, we demonst… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 40 pages, 6 figures

  2. arXiv:2406.11281  [pdf, ps, other

    stat.ML cs.LG

    Statistical Learning of Distributionally Robust Stochastic Control in Continuous State Spaces

    Authors: Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou

    Abstract: We explore the control of stochastic systems with potentially continuous state and action spaces, characterized by the state dynamics $X_{t+1} = f(X_t, A_t, W_t)$. Here, $X$, $A$, and $W$ represent the state, action, and exogenous random noise processes, respectively, with $f$ denoting a known function that describes state transitions. Traditionally, the noise process $\{W_t, t \geq 0\}$ is assume… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2401.15811  [pdf, other

    stat.ME cs.IR

    Seller-Side Experiments under Interference Induced by Feedback Loops in Two-Sided Platforms

    Authors: Zhihua Zhu, Zheng Cai, Liang Zheng, Nian Si

    Abstract: Two-sided platforms are central to modern commerce and content sharing and often utilize A/B testing for develo** new features. While user-side experiments are common, seller-side experiments become crucial for specific interventions and metrics. This paper investigates the effects of interference caused by feedback loops on seller-side experiments in two-sided platforms, with a particular focus… ▽ More

    Submitted 9 February, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

  4. arXiv:2311.09018  [pdf, ps, other

    cs.LG eess.SY math.OC stat.ML

    On the Foundation of Distributionally Robust Reinforcement Learning

    Authors: Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou

    Abstract: Motivated by the need for a robust policy in the face of environment shifts between training and the deployment, we contribute to the theoretical foundation of distributionally robust reinforcement learning (DRRL). This is accomplished through a comprehensive modeling framework centered around distributionally robust Markov decision processes (DRMDPs). This framework obliges the decision maker to… ▽ More

    Submitted 19 January, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  5. arXiv:2310.17496  [pdf, other

    stat.ME cs.LG econ.EM

    Tackling Interference Induced by Data Training Loops in A/B Tests: A Weighted Training Approach

    Authors: Nian Si

    Abstract: In modern recommendation systems, the standard pipeline involves training machine learning models on historical data to predict user behaviors and improve recommendations continuously. However, these data training loops can introduce interference in A/B tests, where data generated by control and treatment algorithms, potentially with different distributions, are combined. To address these challeng… ▽ More

    Submitted 4 April, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

  6. arXiv:2305.18420  [pdf, other

    cs.LG math.OC stat.ML

    Sample Complexity of Variance-reduced Distributionally Robust Q-learning

    Authors: Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou

    Abstract: Dynamic decision making under distributional shifts is of fundamental interest in theory and applications of reinforcement learning: The distribution of the environment on which the data is collected can differ from that of the environment on which the model is deployed. This paper presents two novel model-free algorithms, namely the distributionally robust Q-learning and its variance-reduced coun… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

  7. arXiv:2302.13203  [pdf, other

    cs.LG stat.ML

    A Finite Sample Complexity Bound for Distributionally Robust Q-learning

    Authors: Shengbo Wang, Nian Si, Jose Blanchet, Zhengyuan Zhou

    Abstract: We consider a reinforcement learning setting in which the deployment environment is different from the training environment. Applying a robust Markov decision processes formulation, we extend the distributionally robust $Q$-learning framework studied in Liu et al. [2022]. Further, we improve the design and analysis of their multi-level Monte Carlo estimator. Assuming access to a simulator, we prov… ▽ More

    Submitted 2 March, 2023; v1 submitted 25 February, 2023; originally announced February 2023.

    Comments: Accepted by AISTATS 2023

  8. arXiv:2205.09809  [pdf, other

    cs.LG stat.ME

    Calibration Matters: Tackling Maximization Bias in Large-scale Advertising Recommendation Systems

    Authors: Yewen Fan, Nian Si, Kun Zhang

    Abstract: Calibration is defined as the ratio of the average predicted click rate to the true click rate. The optimization of calibration is essential to many online advertising recommendation systems because it directly affects the downstream bids in ads auctions and the amount of money charged to advertisers. Despite its importance, calibration optimization often suffers from a problem called "maximizatio… ▽ More

    Submitted 21 March, 2023; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: Accepted in ICLR 2023

  9. arXiv:2201.03065  [pdf, ps, other

    stat.ME math.OC math.ST stat.ML

    Selecting the Best Optimizing System

    Authors: Nian Si, Zeyu Zheng

    Abstract: We formulate selecting the best optimizing system (SBOS) problems and provide solutions for those problems. In an SBOS problem, a finite number of systems are contenders. Inside each system, a continuous decision variable affects the system's expected performance. An SBOS problem compares different systems based on their expected performances under their own optimally chosen decision to select the… ▽ More

    Submitted 9 January, 2022; originally announced January 2022.

    Comments: Code in https://github.com/nian-si/SelectOptSys

  10. arXiv:2106.01070  [pdf, ps, other

    stat.ML cs.CY cs.LG math.ST

    Testing Group Fairness via Optimal Transport Projections

    Authors: Nian Si, Karthyek Murthy, Jose Blanchet, Viet Anh Nguyen

    Abstract: We present a statistical testing framework to detect if a given machine learning classifier fails to satisfy a wide range of group fairness notions. The proposed test is a flexible, interpretable, and statistically rigorous tool for auditing whether exhibited biases are intrinsic to the algorithm or due to the randomness in the data. The statistical challenges, which may arise from multiple impact… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Journal ref: International Conference on Machine Learning 2021

  11. arXiv:2007.04458  [pdf, other

    cs.LG stat.ML

    Robust Bayesian Classification Using an Optimistic Score Ratio

    Authors: Viet Anh Nguyen, Nian Si, Jose Blanchet

    Abstract: We build a Bayesian contextual classification model using an optimistic score ratio for robust binary classification when there is limited information on the class-conditional, or contextual, distribution. The optimistic score searches for the distribution that is most plausible to explain the observed outcomes in the testing sample among all distributions belonging to the contextual ambiguity set… ▽ More

    Submitted 8 July, 2020; originally announced July 2020.

  12. arXiv:2006.05630  [pdf, other

    cs.LG math.OC math.ST stat.ML

    Distributionally Robust Batch Contextual Bandits

    Authors: Nian Si, Fan Zhang, Zhengyuan Zhou, Jose Blanchet

    Abstract: Policy learning using historical observational data is an important problem that has found widespread applications. Examples include selecting offers, prices, advertisements to send to customers, as well as selecting which medication to prescribe to a patient. However, existing literature rests on the crucial assumption that the future environment where the learned policy will be deployed is the s… ▽ More

    Submitted 11 September, 2023; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: The short version has been accepted in ICML 2020

  13. arXiv:1906.01614  [pdf, ps, other

    math.ST math.OC stat.ML

    Confidence Regions in Wasserstein Distributionally Robust Estimation

    Authors: Jose Blanchet, Karthyek Murthy, Nian Si

    Abstract: Wasserstein distributionally robust optimization estimators are obtained as solutions of min-max problems in which the statistician selects a parameter minimizing the worst-case loss among all probability models within a certain distance (in a Wasserstein sense) from the underlying empirical measure. While motivated by the need to identify optimal model parameters or decision choices that are robu… ▽ More

    Submitted 3 March, 2021; v1 submitted 4 June, 2019; originally announced June 2019.