Showing 1–50 of 75 results for author: Wager, S

  1. arXiv:2405.05534  [pdf, ps, other]

    econ.EM

    Sequential Validation of Treatment Heterogeneity

    Authors: Stefan Wager

    Abstract: We use the martingale construction of Luedtke and van der Laan (2016) to develop tests for the presence of treatment heterogeneity. The resulting sequential validation approach can be instantiated using various validation metrics, such as BLPs, GATES, QINI curves, etc., and provides an alternative to cross-validation-like cross-fold application of these metrics.

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: This note was prepared as a comment on the Fisher-Schultz paper by Chernozhukov, Demirer, Duflo and Fernandez-Val, forthcoming in Econometrica

  2. arXiv:2403.01386  [pdf, other]

    stat.ME econ.EM

    Minimax-Regret Sample Selection in Randomized Experiments

    Authors: Yuchen Hu, Henry Zhu, Emma Brunskill, Stefan Wager

    Abstract: Randomized controlled trials are often run in settings with many subpopulations that may have differential benefits from the treatment being evaluated. We consider the problem of sample selection, i.e., whom to enroll in a randomized trial, such as to optimize welfare in a heterogeneous population. We formalize this problem within the minimax-regret framework, and derive optimal sample-selection s…

    Submitted 25 June, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

  3. arXiv:2402.08201  [pdf, other]

    stat.ML cs.LG

    Off-Policy Evaluation in Markov Decision Processes under Weak Distributional Overlap

    Authors: Mohammad Mehrabi, Stefan Wager

    Abstract: Doubly robust methods hold considerable promise for off-policy evaluation in Markov decision processes (MDPs) under sequential ignorability: They have been shown to converge as $1/\sqrt{T}$ with the horizon $T$, to be statistically efficient in large samples, and to allow for modular implementation where preliminary estimation tasks can be executed using standard reinforcement learning techniques.…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: 50 pages, 4 figures

  4. arXiv:2312.02482  [pdf, other]

    stat.CO stat.AP stat.ME

    Treatment heterogeneity with right-censored outcomes using grf

    Authors: Erik Sverdrup, Stefan Wager

    Abstract: This article walks through how to estimate conditional average treatment effects (CATEs) with right-censored time-to-event outcomes using the function causal_survival_forest (Cui et al., 2023) in the R package grf (Athey et al., 2019, Tibshirani et al., 2024) using data from the National Job Training Partnership Act.

    Submitted 25 February, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Software review article prepared for January 2024 ASA Lifetime Data Science newsletter

  5. arXiv:2306.11979  [pdf, other]

    stat.ME

    Qini Curves for Multi-Armed Treatment Rules

    Authors: Erik Sverdrup, Han Wu, Susan Athey, Stefan Wager

    Abstract: Qini curves have emerged as an attractive and popular approach for evaluating the benefit of data-driven targeting rules for treatment allocation. We propose a generalization of the Qini curve to multiple costly treatment arms, that quantifies the value of optimally selecting among both units and treatment arms at different budget levels. We develop an efficient algorithm for computing these curve…

    Submitted 23 April, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

  6. arXiv:2304.11735  [pdf, other]

    econ.EM math.ST stat.ME

    Policy Learning under Biased Sample Selection

    Authors: Lihua Lei, Roshni Sahoo, Stefan Wager

    Abstract: Practitioners often use data from a randomized controlled trial to learn a treatment assignment policy that can be deployed on a target population. A recurring concern in doing so is that, even if the randomized trial was well-executed (i.e., internal validity holds), the study participants may not represent a random sample of the target population (i.e., external validity fails)--and this may lea…

    Submitted 23 April, 2023; originally announced April 2023.

  7. arXiv:2302.12093  [pdf, other]

    eess.SY math.OC stat.ME

    Experimenting under Stochastic Congestion

    Authors: Shuangning Li, Ramesh Johari, Xu Kuang, Stefan Wager

    Abstract: We study randomized experiments in a service system when stochastic congestion can arise from temporarily limited supply and/or demand. Such congestion gives rise to cross-unit interference between the waiting customers, and analytic strategies that do not account for this interference may be biased. In current practice, one of the most widely used ways to address stochastic congestion is to use s…

    Submitted 25 September, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

  8. arXiv:2209.01754  [pdf, other]

    stat.ME cs.LG stat.ML

    Learning from a Biased Sample

    Authors: Roshni Sahoo, Lihua Lei, Stefan Wager

    Abstract: The empirical risk minimization approach to data-driven decision making assumes that we can learn a decision rule from training data drawn under the same conditions as the ones we want to deploy it in. However, in a number of settings, we may be concerned that our training sample is biased, and that some groups (characterized by either observable or unobservable attributes) may be under- or over-r…

    Submitted 5 January, 2023; v1 submitted 5 September, 2022; originally announced September 2022.

  9. arXiv:2209.00197  [pdf, other]

    stat.ME econ.EM

    Switchback Experiments under Geometric Mixing

    Authors: Yuchen Hu, Stefan Wager

    Abstract: The switchback is an experimental design that measures treatment effects by repeatedly turning an intervention on and off for a whole system. Switchback experiments are a robust way to overcome cross-unit spillover effects; however, they are vulnerable to bias from temporal carryovers. In this paper, we consider properties of switchback experiments in Markovian systems that mix at a geometric rate…

    Submitted 2 April, 2024; v1 submitted 31 August, 2022; originally announced September 2022.

  10. arXiv:2207.07758  [pdf, other]

    stat.AP

    Treatment Heterogeneity for Survival Outcomes

    Authors: Yizhe Xu, Nikolaos Ignatiadis, Erik Sverdrup, Scott Fleming, Stefan Wager, Nigam Shah

    Abstract: Estimation of conditional average treatment effects (CATEs) plays an essential role in modern medicine by informing treatment decision-making at a patient level. Several metalearners have been proposed recently to estimate CATEs in an effective and flexible way by re-purposing predictive machine learning models for causal estimation. In this chapter, we summarize the literature on metalearners and…

    Submitted 6 September, 2022; v1 submitted 15 July, 2022; originally announced July 2022.

    Comments: A chapter of the 'Handbook of Matching and Weighting Adjustments for Causal Inference'

  11. arXiv:2206.10323  [pdf, other]

    stat.ME stat.ML

    What Makes Forest-Based Heterogeneous Treatment Effect Estimators Work?

    Authors: Susanne Dandl, Torsten Hothorn, Heidi Seibold, Erik Sverdrup, Stefan Wager, Achim Zeileis

    Abstract: Estimation of heterogeneous treatment effects (HTE) is of prime importance in many disciplines, ranging from personalized medicine to economics among many others. Random forests have been shown to be a flexible and powerful approach to HTE estimation in both randomized trials and observational studies. In particular "causal forests", introduced by Athey, Tibshirani and Wager (2019), along with the…

    Submitted 20 December, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: Contribution has been accepted for publication in the Annals of Applied Statistics

  12. arXiv:2204.01884  [pdf, other]

    stat.ML cs.LG econ.EM

    Policy Learning with Competing Agents

    Authors: Roshni Sahoo, Stefan Wager

    Abstract: Decision makers often aim to learn a treatment assignment policy under a capacity constraint on the number of agents that they can treat. When agents can respond strategically to such policies, competition arises, complicating estimation of the optimal policy. In this paper, we study capacity-constrained treatment assignment in the presence of such interference. We consider a dynamic model where t…

    Submitted 17 April, 2024; v1 submitted 4 April, 2022; originally announced April 2022.

  13. arXiv:2203.12053  [pdf, other]

    eess.AS cs.SD

    Upmixing via style transfer: a variational autoencoder for disentangling spatial images and musical content

    Authors: Haici Yang, Sanna Wager, Spencer Russell, Mike Luo, Minje Kim, Wontak Kim

    Abstract: In the stereo-to-multichannel upmixing problem for music, one of the main tasks is to set the directionality of the instrument sources in the multichannel rendering results. In this paper, we propose a modified variational autoencoder model that learns a latent space to describe the spatial images in multichannel music. We seek to disentangle the spatial images and music content, so the learned la…

    Submitted 22 March, 2022; originally announced March 2022.

  14. arXiv:2203.00820  [pdf, other]

    stat.ME cs.LG stat.AP stat.ML

    Partial Likelihood Thompson Sampling

    Authors: Han Wu, Stefan Wager

    Abstract: We consider the problem of deciding how best to target and prioritize existing vaccines that may offer protection against new variants of an infectious disease. Sequential experiments are a promising approach; however, challenges due to delayed feedback and the overall ebb and flow of disease prevalence make available methods inapplicable for this task. We present a method, partial likelihood Thom…

    Submitted 19 June, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

  15. arXiv:2202.12431  [pdf, other]

    cs.LG math.ST stat.ME

    Thompson Sampling with Unrestricted Delays

    Authors: Han Wu, Stefan Wager

    Abstract: We investigate properties of Thompson Sampling in the stochastic multi-armed bandit problem with delayed feedback. In a setting with i.i.d delays, we establish to our knowledge the first regret bounds for Thompson Sampling with arbitrary delay distributions, including ones with unbounded expectation. Our bounds are qualitatively comparable to the best available bounds derived via ad-hoc algorithms…

    Submitted 22 May, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

  16. arXiv:2202.05356  [pdf, ps, other]

    stat.ME

    Network Interference in Micro-Randomized Trials

    Authors: Shuangning Li, Stefan Wager

    Abstract: The micro-randomized trial (MRT) is an experimental design that can be used to develop optimal mobile health interventions. In MRTs, interventions in the form of notifications or messages are sent through smart phones to individuals, targeting a health-related outcome such as physical activity or weight management. Often, mobile health interventions have a social media component; an individual's o…

    Submitted 10 February, 2022; originally announced February 2022.

  17. arXiv:2112.04723  [pdf, other]

    econ.EM stat.ME

    Covariate Balancing Sensitivity Analysis for Extrapolating Randomized Trials across Locations

    Authors: Xinkun Nie, Guido Imbens, Stefan Wager

    Abstract: The ability to generalize experimental results from randomized control trials (RCTs) across locations is crucial for informing policy decisions in targeted regions. Such generalization is often hindered by the lack of identifiability due to unmeasured effect modifiers that compromise direct transport of treatment effect estimates from one location to another. We build upon sensitivity analysis in…

    Submitted 9 December, 2021; originally announced December 2021.

  18. arXiv:2111.07966  [pdf, other]

    stat.ME stat.ML

    Evaluating Treatment Prioritization Rules via Rank-Weighted Average Treatment Effects

    Authors: Steve Yadlowsky, Scott Fleming, Nigam Shah, Emma Brunskill, Stefan Wager

    Abstract: There are a number of available methods for selecting whom to prioritize for treatment, including ones based on treatment effect estimation, risk scoring, and hand-crafted rules. We propose rank-weighted average treatment effect (RATE) metrics as a simple and general family of metrics for comparing and testing the quality of treatment prioritization rules. RATE metrics are agnostic as to how the p…

    Submitted 28 November, 2023; v1 submitted 15 November, 2021; originally announced November 2021.

  19. arXiv:2110.12343  [pdf, other]

    cs.LG math.ST stat.ME

    Off-Policy Evaluation in Partially Observed Markov Decision Processes under Sequential Ignorability

    Authors: Yuchen Hu, Stefan Wager

    Abstract: We consider off-policy evaluation of dynamic treatment rules under sequential ignorability, given an assumption that the underlying system can be modeled as a partially observed Markov decision process (POMDP). We propose an estimator, partial history importance weighting, and show that it can consistently estimate the stationary mean rewards of a target policy given long enough draws from the beh…

    Submitted 9 May, 2023; v1 submitted 23 October, 2021; originally announced October 2021.

  20. arXiv:2109.11647  [pdf, other]

    econ.EM stat.ME

    Treatment Effects in Market Equilibrium

    Authors: Evan Munro, Xu Kuang, Stefan Wager

    Abstract: Policy-relevant treatment effect estimation in a marketplace setting requires taking into account both the direct benefit of the treatment and any spillovers induced by changes to the market equilibrium. The standard way to address these challenges is to evaluate interventions via cluster-randomized experiments, where each cluster corresponds to an isolated market. This approach, however, cannot b…

    Submitted 17 June, 2024; v1 submitted 23 September, 2021; originally announced September 2021.

  21. arXiv:2104.03802  [pdf, other]

    stat.ME econ.EM

    Average Direct and Indirect Causal Effects under Interference

    Authors: Yuchen Hu, Shuangning Li, Stefan Wager

    Abstract: We propose a definition for the average indirect effect of a binary treatment in the potential outcomes model for causal inference under cross-unit interference. Our definition is analogous to the standard definition of the average direct effect, and can be expressed without needing to compare outcomes across multiple randomized experiments. We show that the proposed indirect effect satisfies a de…

    Submitted 11 January, 2022; v1 submitted 8 April, 2021; originally announced April 2021.

  22. arXiv:2103.11066  [pdf, other]

    stat.ME

    Treatment Allocation under Uncertain Costs

    Authors: Hao Sun, Evan Munro, Georgy Kalashnov, Shuyang Du, Stefan Wager

    Abstract: We consider the problem of learning how to optimally allocate treatments whose cost is uncertain and can vary with pre-treatment covariates. This setting may arise in medicine if we need to prioritize access to a scarce resource that different patients would use for different amounts of time, or in marketing if we want to target discounts whose cost to the company depends on how much the discounts…

    Submitted 11 March, 2024; v1 submitted 19 March, 2021; originally announced March 2021.

  23. arXiv:2101.09855  [pdf, other]

    math.ST cs.LG

    Weak Signal Asymptotics for Sequentially Randomized Experiments

    Authors: Xu Kuang, Stefan Wager

    Abstract: We use the lens of weak signal asymptotics to study a class of sequentially randomized experiments, including those that arise in solving multi-armed bandit problems. In an experiment with $n$ time steps, we let the mean reward gaps between actions scale to the order $1/\sqrt{n}$ so as to preserve the difficulty of the learning task as $n$ grows. In this regime, we show that the sample paths of a…

    Submitted 22 June, 2023; v1 submitted 24 January, 2021; originally announced January 2021.

    Comments: Forthcoming in Management Science. An earlier draft of this paper was circulated under the title "Diffusion Asymptotics for Sequential Experiments." Xu Kuang published under a different full name in earlier versions of this manuscript. Please use X. Kuang and S. Wager when citing this paper

    MSC Class: 62B15; 60J70

  24. arXiv:2007.13302  [pdf, other]

    math.ST

    Random Graph Asymptotics for Treatment Effect Estimation under Network Interference

    Authors: Shuangning Li, Stefan Wager

    Abstract: The network interference model for causal inference places all experimental units at the vertices of an undirected exposure graph, such that treatment assigned to one unit may affect the outcome of another unit if and only if these two units are connected by an edge. This model has recently gained popularity as means of incorporating interference effects into the Neyman--Rubin potential outcomes f…

    Submitted 16 March, 2022; v1 submitted 27 July, 2020; originally announced July 2020.

  25. arXiv:2007.12581  [pdf, other]

    eess.AS cs.LG cs.SD

    Dereverberation using joint estimation of dry speech signal and acoustic system

    Authors: Sanna Wager, Keunwoo Choi, Simon Durand

    Abstract: The purpose of speech dereverberation is to remove quality-degrading effects of a time-invariant impulse response filter from the signal. In this report, we describe an approach to speech dereverberation that involves joint estimation of the dry speech signal and of the room impulse response. We explore deep learning models that apply to each task separately, and how these can be combined in a joi…

    Submitted 24 July, 2020; originally announced July 2020.

  26. arXiv:2004.09458  [pdf, other]

    stat.ME econ.EM

    Noise-Induced Randomization in Regression Discontinuity Designs

    Authors: Dean Eckles, Nikolaos Ignatiadis, Stefan Wager, Han Wu

    Abstract: Regression discontinuity designs assess causal effects in settings where treatment is determined by whether an observed running variable crosses a pre-specified threshold. Here we propose a new approach to identification, estimation, and inference in regression discontinuity designs that uses knowledge about exogenous noise (e.g., measurement error) in the running variable. In our strategy, we wei…

    Submitted 26 November, 2023; v1 submitted 20 April, 2020; originally announced April 2020.

  27. arXiv:2002.05511  [pdf, other]

    cs.SD cs.LG eess.AS stat.ML

    Deep Autotuner: a Pitch Correcting Network for Singing Performances

    Authors: Sanna Wager, George Tzanetakis, Cheng-i Wang, Minje Kim

    Abstract: We introduce a data-driven approach to automatic pitch correction of solo singing performances. The proposed approach predicts note-wise pitch shifts from the relationship between the respective spectrograms of the singing and accompaniment. This approach differs from commercial systems, where vocal track notes are usually shifted to be centered around pitches in a user-defined score, or mapped to…

    Submitted 11 February, 2020; originally announced February 2020.

    Comments: arXiv admin note: text overlap with arXiv:1902.00956

    Journal ref: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

  28. Fully Learnable Front-End for Multi-Channel Acoustic Modeling using Semi-Supervised Learning

    Authors: Sanna Wager, Aparna Khare, Minhua Wu, Kenichi Kumatani, Shiva Sundaram

    Abstract: In this work, we investigated the teacher-student training paradigm to train a fully learnable multi-channel acoustic model for far-field automatic speech recognition (ASR). Using a large offline teacher model trained on beamformed audio, we trained a simpler multi-channel student acoustic model used in the speech recognition system. For the student, both multi-channel feature extraction layers an…

    Submitted 31 January, 2020; originally announced February 2020.

    Comments: To appear in ICASSP 2020

  29. arXiv:2001.09887  [pdf, other]

    stat.ME cs.LG stat.ML

    Estimating heterogeneous treatment effects with right-censored data via causal survival forests

    Authors: Yifan Cui, Michael R. Kosorok, Erik Sverdrup, Stefan Wager, Ruoqing Zhu

    Abstract: Forest-based methods have recently gained in popularity for non-parametric treatment effect estimation. Building on this line of work, we introduce causal survival forests, which can be used to estimate heterogeneous treatment effects in a survival and observational setting where outcomes may be right-censored. Our approach relies on orthogonal estimating equations to robustly adjust for both cens…

    Submitted 28 February, 2023; v1 submitted 27 January, 2020; originally announced January 2020.

    Comments: To appear in the Journal of the Royal Statistical Society, Series B

    MSC Class: 62N01

  30. arXiv:1911.02768  [pdf, other]

    stat.ML cs.LG stat.ME

    Confidence Intervals for Policy Evaluation in Adaptive Experiments

    Authors: Vitor Hadad, David A. Hirshberg, Ruohan Zhan, Stefan Wager, Susan Athey

    Abstract: Adaptive experiment designs can dramatically improve statistical efficiency in randomized trials, but they also complicate statistical inference. For example, it is now well known that the sample mean is biased in adaptive trials. Inferential challenges are exacerbated when our parameter of interest differs from the parameter the trial was designed to target, such as when we are interested in esti…

    Submitted 12 February, 2021; v1 submitted 7 November, 2019; originally announced November 2019.

  31. arXiv:1910.10624  [pdf, other]

    stat.ME

    Doubly robust treatment effect estimation with missing attributes

    Authors: Imke Mayer, Erik Sverdrup, Tobias Gauss, Jean-Denis Moyer, Stefan Wager, Julie Josse

    Abstract: Missing attributes are ubiquitous in causal inference, as they are in most applied statistical work. In this paper, we consider various sets of assumptions under which causal inference is possible despite missing attributes and discuss corresponding approaches to average treatment effect estimation, including generalized propensity score methods and multiple imputation. Across an extensive simulat…

    Submitted 22 May, 2020; v1 submitted 23 October, 2019; originally announced October 2019.

    MSC Class: 93C41; 62G35; 62F35; 62P10

  32. arXiv:1910.09714  [pdf, other]

    cs.LG stat.ML

    Smoothness-Adaptive Contextual Bandits

    Authors: Yonatan Gur, Ahmadreza Momeni, Stefan Wager

    Abstract: We study a non-parametric multi-armed bandit problem with stochastic covariates, where a key complexity driver is the smoothness of payoff functions with respect to covariates. Previous studies have focused on deriving minimax-optimal algorithms in cases where it is a priori known how smooth the payoff functions are. In practice, however, the smoothness of payoff functions is typically not known i…

    Submitted 15 October, 2021; v1 submitted 21 October, 2019; originally announced October 2019.

  33. arXiv:1909.11696  [pdf, other]

    stat.ME

    Cross-Validation, Risk Estimation, and Model Selection

    Authors: Stefan Wager

    Abstract: Cross-validation is a popular non-parametric method for evaluating the accuracy of a predictive rule. The usefulness of cross-validation depends on the task we want to employ it for. In this note, I discuss a simple non-parametric setting, and find that cross-validation is asymptotically uninformative about the expected test error of any given predictive rule, but allows for asymptotically consist…

    Submitted 25 September, 2019; originally announced September 2019.

    Comments: This note was prepared as a comment on a paper by Rosset and Tibshirani, forthcoming in the Journal of the American Statistical Association

  34. arXiv:1908.09874  [pdf, other]

    stat.ML cs.LG

    Sufficient Representations for Categorical Variables

    Authors: Jonathan Johannemann, Vitor Hadad, Susan Athey, Stefan Wager

    Abstract: Many learning algorithms require categorical data to be transformed into real vectors before it can be used as input. Often, categorical variables are encoded as one-hot (or dummy) vectors. However, this mode of representation can be wasteful since it adds many low-signal regressors, especially when the number of unique categories is large. In this paper, we investigate simple alternative solution…

    Submitted 28 October, 2021; v1 submitted 26 August, 2019; originally announced August 2019.

  35. arXiv:1906.01611  [pdf, other]

    stat.ME stat.ML

    Covariate-Powered Empirical Bayes Estimation

    Authors: Nikolaos Ignatiadis, Stefan Wager

    Abstract: We study methods for simultaneous analysis of many noisy experiments in the presence of rich covariate information. The goal of the analyst is to optimally estimate the true effect underlying each experiment. Both the noisy experimental results and the auxiliary covariates are useful for this purpose, but neither data source on its own captures all the information available to the analyst. In this…

    Submitted 12 January, 2020; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: Advances in Neural Information Processing Systems 32 (NeurIPS 2019)

  36. arXiv:1905.11622  [pdf, other]

    stat.ME

    Nonparametric Heterogeneous Treatment Effect Estimation in Repeated Cross Sectional Designs

    Authors: Xinkun Nie, Chen Lu, Stefan Wager

    Abstract: Identifying heterogeneity in a population's response to a health or policy intervention is crucial for evaluating and informing policy decisions. We propose a novel heterogeneous treatment effect estimator in the difference-in-differences design with repeated cross sectional data, where we observe different samples of a population at two time periods separated by the onset of a policy intervention…

    Submitted 22 August, 2021; v1 submitted 28 May, 2019; originally announced May 2019.

  37. arXiv:1905.09751  [pdf, other]

    stat.ME stat.ML

    Learning When-to-Treat Policies

    Authors: Xinkun Nie, Emma Brunskill, Stefan Wager

    Abstract: Many applied decision-making problems have a dynamic component: The policymaker needs not only to choose whom to treat, but also when to start which treatment. For example, a medical doctor may choose between postponing treatment (watchful waiting) and prescribing one of several available treatments during the many visits from a patient. We develop an "advantage doubly robust" estimator for learni…

    Submitted 30 April, 2020; v1 submitted 23 May, 2019; originally announced May 2019.

  38. arXiv:1905.00744  [pdf, ps, other]

    math.ST econ.EM stat.ME

    Sparsity Double Robust Inference of Average Treatment Effects

    Authors: Jelena Bradic, Stefan Wager, Yinchu Zhu

    Abstract: Many popular methods for building confidence intervals on causal effects under high-dimensional confounding require strong "ultra-sparsity" assumptions that may be difficult to validate in practice. To alleviate this difficulty, we here study a new method for average treatment effect estimation that yields asymptotically exact confidence intervals assuming that either the conditional response surf…

    Submitted 2 May, 2019; originally announced May 2019.

  39. arXiv:1903.02124  [pdf, other]

    math.OC econ.EM stat.ME

    Experimenting in Equilibrium

    Authors: Stefan Wager, Kuang Xu

    Abstract: Classical approaches to experimental design assume that intervening on one unit does not affect other units. There are many important settings, however, where this non-interference assumption does not hold, as when running experiments on supply-side incentives on a ride-sharing platform or subsidies in an energy marketplace. In this paper, we introduce a new approach to experimental design in larg…

    Submitted 30 June, 2020; v1 submitted 5 March, 2019; originally announced March 2019.

    Comments: Forthcoming in Management Science

  40. arXiv:1902.07409  [pdf, other]

    stat.ME

    Estimating Treatment Effects with Causal Forests: An Application

    Authors: Susan Athey, Stefan Wager

    Abstract: We apply causal forests to a dataset derived from the National Study of Learning Mindsets, and consider resulting practical and conceptual challenges. In particular, we discuss how causal forests use estimated propensity scores to be more robust to confounding, and how they handle data with clustered errors.

    Submitted 20 February, 2019; originally announced February 2019.

    Comments: This note will appear in an upcoming issue of Observational Studies, Empirical Investigation of Methods for Heterogeneity, that compiles several analyses of the same dataset

  41. arXiv:1902.02774  [pdf, other]

    stat.ME

    Confidence Intervals for Nonparametric Empirical Bayes Analysis

    Authors: Nikolaos Ignatiadis, Stefan Wager

    Abstract: In an empirical Bayes analysis, we use data from repeated sampling to imitate inferences made by an oracle Bayesian with extensive knowledge of the data-generating distribution. Existing results provide a comprehensive characterization of when and why empirical Bayes point estimates accurately recover oracle Bayes behavior. In this paper, we develop flexible and practical confidence intervals that…

    Submitted 8 September, 2021; v1 submitted 7 February, 2019; originally announced February 2019.

  42. arXiv:1902.00956  [pdf, ps, other]

    cs.SD cs.LG eess.AS stat.ML

    Deep Autotuner: A Data-Driven Approach to Natural-Sounding Pitch Correction for Singing Voice in Karaoke Performances

    Authors: Sanna Wager, George Tzanetakis, Cheng-i Wang, Lijiang Guo, Aswin Sivaraman, Minje Kim

    Abstract: We describe a machine-learning approach to pitch correcting a solo singing performance in a karaoke setting, where the solo voice and accompaniment are on separate tracks. The proposed approach addresses the situation where no musical score of the vocals nor the accompaniment exists: It predicts the amount of correction from the relationship between the spectral contents of the vocal and accompani…

    Submitted 3 February, 2019; originally announced February 2019.

  43. arXiv:1812.09970  [pdf, other]

    stat.ME

    Synthetic Difference in Differences

    Authors: Dmitry Arkhangelsky, Susan Athey, David A. Hirshberg, Guido W. Imbens, Stefan Wager

    Abstract: We present a new estimator for causal effects with panel data that builds on insights behind the widely used difference in differences and synthetic control methods. Relative to these methods we find, both theoretically and empirically, that this "synthetic difference in differences" estimator has desirable robustness properties, and that it performs well in settings where the conventional estimat…

    Submitted 2 July, 2021; v1 submitted 24 December, 2018; originally announced December 2018.

  44. arXiv:1811.02547  [pdf, other]

    math.ST

    Debiased Inference of Average Partial Effects in Single-Index Models

    Authors: David A. Hirshberg, Stefan Wager

    Abstract: We propose a method for average partial effect estimation in high-dimensional single-index models that is root-n-consistent and asymptotically unbiased given sparsity assumptions on the underlying regression model. This note was prepared as a comment on Wooldridge and Zhu [2018], forthcoming in the Journal of Business and Economic Statistics.

    Submitted 6 November, 2018; originally announced November 2018.

  45. arXiv:1810.04778  [pdf, other]

    stat.ML cs.LG econ.EM

    Offline Multi-Action Policy Learning: Generalization and Optimization

    Authors: Zhengyuan Zhou, Susan Athey, Stefan Wager

    Abstract: In many settings, a decision-maker wishes to learn a rule, or policy, that maps from observable characteristics of an individual to an action. Examples include selecting offers, prices, advertisements, or emails to send to consumers, as well as the problem of determining which medication to prescribe to a patient. While there is a growing body of literature devoted to this problem, most existing r…

    Submitted 19 November, 2018; v1 submitted 10 October, 2018; originally announced October 2018.

  46. arXiv:1807.11408  [pdf, other]

    stat.ML cs.LG econ.EM math.ST

    Local Linear Forests

    Authors: Rina Friedberg, Julie Tibshirani, Susan Athey, Stefan Wager

    Abstract: Random forests are a powerful method for non-parametric regression, but are limited in their ability to fit smooth signals, and can show poor predictive performance in the presence of strong, smooth effects. Taking the perspective of random forests as an adaptive kernel method, we pair the forest kernel with a local linear regression adjustment to better capture smoothness. The resulting procedure…

    Submitted 4 September, 2020; v1 submitted 30 July, 2018; originally announced July 2018.

    Comments: Forthcoming in the Journal of Computational and Graphical Statistics

  47. arXiv:1805.02603  [pdf, ps, other]

    cs.SD eess.AS

    A Data-Driven Approach to Smooth Pitch Correction for Singing Voice in Pop Music

    Authors: Sanna Wager, Lijiang Guo, Aswin Sivaraman, Minje Kim

    Abstract: In this paper, we present a machine-learning approach to pitch correction for voice in a karaoke setting, where the vocals and accompaniment are on separate tracks and time-aligned. The network takes as input the time-frequency representation of the two tracks and predicts the amount of pitch-shifting in cents required to make the voice sound in-tune with the accompaniment. It is trained on exampl…

    Submitted 7 May, 2018; originally announced May 2018.

  48. arXiv:1712.04912  [pdf, other]

    stat.ML econ.EM math.ST

    Quasi-Oracle Estimation of Heterogeneous Treatment Effects

    Authors: Xinkun Nie, Stefan Wager

    Abstract: Flexible estimation of heterogeneous treatment effects lies at the heart of many statistical challenges, such as personalized medicine and optimal resource allocation. In this paper, we develop a general class of two-step algorithms for heterogeneous treatment effect estimation in observational studies. We first estimate marginal effects and treatment propensities in order to form an objective fun…

    Submitted 6 August, 2020; v1 submitted 13 December, 2017; originally announced December 2017.

    Comments: Biometrika, forthcoming

  49. arXiv:1712.00038  [pdf, other]

    stat.ME

    Augmented Minimax Linear Estimation

    Authors: David A. Hirshberg, Stefan Wager

    Abstract: Many statistical estimands can be expressed as continuous linear functionals of a conditional expectation function. This includes the average treatment effect under unconfoundedness and generalizations for continuous-valued and personalized treatments. In this paper, we discuss a general approach to estimating such quantities: we begin with a simple plug-in estimator based on an estimate of the condi…

    Submitted 19 November, 2020; v1 submitted 30 November, 2017; originally announced December 2017.

    Comments: 67 pages, 3 figures

    MSC Class: 62F12

  50. arXiv:1706.07550  [pdf, other]

    math.ST

    Shape-constrained partial identification of a population mean under unknown probabilities of sample selection

    Authors: Luke W. Miratrix, Stefan Wager, Jose R. Zubizarreta

    Abstract: A prevailing challenge in the biomedical and social sciences is to estimate a population mean from a sample obtained with unknown selection probabilities. Using a well-known ratio estimator, Aronow and Lee (2013) proposed a method for partial identification of the mean by allowing the unknown selection probabilities to vary arbitrarily between two fixed extreme values. In this paper, we show how t…

    Submitted 22 June, 2017; originally announced June 2017.