Skip to main content

Showing 1–50 of 61 results for author: Song, R

Searching in archive stat. Search in all archives.
.
  1. arXiv:2401.13098  [pdf, other

    cs.LG cs.AI cs.SI stat.AP

    Gravity-Informed Deep Learning Framework for Predicting Ship Traffic Flow and Invasion Risk of Non-Indigenous Species via Ballast Water Discharge

    Authors: Ruixin Song, Gabriel Spadon, Ronald Pelot, Stan Matwin, Amilcar Soares

    Abstract: Invasive species in water bodies pose a major threat to the environment and biodiversity globally. Due to increased transportation and trade, non-native species have been introduced to new environments, causing damage to ecosystems and leading to economic losses in agriculture, forestry, and fisheries. Therefore, there is a pressing need for risk assessment and management techniques to mitigate th… ▽ More

    Submitted 29 January, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: 26 pages, 7 figures, under review

  2. arXiv:2401.05517  [pdf, other

    stat.ME econ.EM math.ST

    On Efficient Inference of Causal Effects with Multiple Mediators

    Authors: Haoyu Wei, Hengrui Cai, Chengchun Shi, Rui Song

    Abstract: This paper provides robust estimators and efficient inference of causal effects involving multiple interacting mediators. Most existing works either impose a linear model assumption among the mediators or are restricted to handle conditionally independent mediators given the exposure. To overcome these limitations, we define causal and individual mediation effects in a general setting, and employ… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    MSC Class: 62A09; 62G05; 62G35

  3. arXiv:2401.00139  [pdf, other

    cs.AI cs.CL cs.LG stat.ME

    Is Knowledge All Large Language Models Needed for Causal Reasoning?

    Authors: Hengrui Cai, Shengjie Liu, Rui Song

    Abstract: This paper explores the causal reasoning of large language models (LLMs) to enhance their interpretability and reliability in advancing artificial intelligence. Despite the proficiency of LLMs in a range of tasks, their potential for understanding causality requires further exploration. We propose a novel causal attribution model that utilizes ``do-operators" for constructing counterfactual scenar… ▽ More

    Submitted 5 June, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

    Comments: A Python implementation of our proposed method is available at https://github.com/ncsulsj/Causal_LLM

  4. arXiv:2312.17122  [pdf, other

    cs.CL cs.AI stat.ML

    Large Language Model for Causal Decision Making

    Authors: Haitao Jiang, Lin Ge, Yuhe Gao, Jianian Wang, Rui Song

    Abstract: Large Language Models (LLMs) have shown their success in language understanding and reasoning on general topics. However, their capability to perform inference based on user-specified structured data and knowledge in corpus-rare concepts, such as causal decision-making is still limited. In this work, we explore the possibility of fine-tuning an open-sourced LLM into LLM4Causal, which can identify… ▽ More

    Submitted 11 April, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

  5. arXiv:2312.15595  [pdf, other

    stat.ML cs.LG econ.EM

    Zero-Inflated Bandits

    Authors: Haoyu Wei, Runzhe Wan, Lei Shi, Rui Song

    Abstract: Many real applications of bandits have sparse non-zero rewards, leading to slow learning rates. A careful distribution modeling that utilizes problem-specific structures is known as critical to estimation efficiency in the statistics literature, yet is under-explored in bandits. To fill the gap, we initiate the study of zero-inflated bandits, where the reward is modeled as a classic semi-parametri… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

  6. arXiv:2312.12871  [pdf, other

    cs.LG stat.ML

    Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches

    Authors: Yu Liu, Runzhe Wan, James McQueen, Doug Hains, **xiang Gu, Rui Song

    Abstract: The selection of the assumed effect size (AES) critically determines the duration of an experiment, and hence its accuracy and efficiency. Traditionally, experimenters determine AES based on domain knowledge. However, this method becomes impractical for online experimentation services managing numerous experiments, and a more automated approach is hence of great demand. We initiate the study of da… ▽ More

    Submitted 17 April, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

  7. arXiv:2301.13348  [pdf, other

    stat.ML cs.LG stat.ME

    A Reinforcement Learning Framework for Dynamic Mediation Analysis

    Authors: Lin Ge, Jitao Wang, Chengchun Shi, Zhenke Wu, Rui Song

    Abstract: Mediation analysis learns the causal effect transmitted via mediator variables between treatments and outcomes and receives increasing attention in various scientific domains to elucidate causal relations. Most existing works focus on point-exposure studies where each subject only receives one treatment at a single time point. However, there are a number of applications (e.g., mobile health) where… ▽ More

    Submitted 2 September, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

  8. arXiv:2301.12389  [pdf, other

    cs.LG cs.AI stat.AP stat.ME stat.ML

    On Learning Necessary and Sufficient Causal Graphs

    Authors: Hengrui Cai, Yixin Wang, Michael Jordan, Rui Song

    Abstract: The causal revolution has stimulated interest in understanding complex relationships in various fields. Most of the existing methods aim to discover causal relationships among all variables within a complex large-scale graph. However, in practice, only a small subset of variables in the graph are relevant to the outcomes of interest. Consequently, causal estimation with the full causal graph -- pa… ▽ More

    Submitted 1 November, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

    Comments: Advances in Neural Information Processing Systems 37 (Spotlight)

  9. arXiv:2301.12383  [pdf, other

    stat.ME cs.LG stat.AP stat.ML

    On Heterogeneous Treatment Effects in Heterogeneous Causal Graphs

    Authors: Richard A Watson, Hengrui Cai, Xinming An, Samuel McLean, Rui Song

    Abstract: Heterogeneity and comorbidity are two interwoven challenges associated with various healthcare problems that greatly hampered research on develo** effective treatment and understanding of the underlying neurobiological mechanism. Very few studies have been conducted to investigate heterogeneous causal effects (HCEs) in graphical contexts due to the lack of statistical methods. To characterize th… ▽ More

    Submitted 25 June, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

    Comments: In Proceedings of the 40th International Conference on Machine Learning (ICML) Code implementing the proposed algorithm is open-source and publicly available at: https://github.com/richard-watson/ISL

  10. arXiv:2301.00927  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    Deep Spectral Q-learning with Application to Mobile Health

    Authors: Yuhe Gao, Chengchun Shi, Rui Song

    Abstract: Dynamic treatment regimes assign personalized treatments to patients sequentially over time based on their baseline information and time-varying covariates. In mobile health applications, these covariates are typically collected at different frequencies over a long time horizon. In this paper, we propose a deep spectral Q-learning algorithm, which integrates principal component analysis (PCA) with… ▽ More

    Submitted 2 January, 2023; originally announced January 2023.

  11. arXiv:2212.14580  [pdf, ps, other

    stat.ML cs.LG math.ST stat.ME

    Heterogeneous Synthetic Learner for Panel Data

    Authors: Ye Shen, Runzhe Wan, Hengrui Cai, Rui Song

    Abstract: In the new era of personalization, learning the heterogeneous treatment effect (HTE) becomes an inevitable trend with numerous applications. Yet, most existing HTE estimation methods focus on independently and identically distributed observations and cannot handle the non-stationarity and temporal dependency in the common panel data setting. The treatment evaluators developed for panel data, on th… ▽ More

    Submitted 29 January, 2023; v1 submitted 30 December, 2022; originally announced December 2022.

  12. arXiv:2212.14468  [pdf, other

    stat.ML cs.LG stat.ME

    An Instrumental Variable Approach to Confounded Off-Policy Evaluation

    Authors: Yang Xu, ** Zhu, Chengchun Shi, Shikai Luo, Rui Song

    Abstract: Off-policy evaluation (OPE) is a method for estimating the return of a target policy using some pre-collected observational data generated by a potentially different behavior policy. In some cases, there may be unmeasured variables that can confound the action-reward or action-next-state relationships, rendering many existing OPE approaches ineffective. This paper develops an instrumental variable… ▽ More

    Submitted 2 February, 2023; v1 submitted 29 December, 2022; originally announced December 2022.

  13. arXiv:2212.14466  [pdf, other

    stat.ML cs.LG

    Quantile Off-Policy Evaluation via Deep Conditional Generative Learning

    Authors: Yang Xu, Chengchun Shi, Shikai Luo, Lan Wang, Rui Song

    Abstract: Off-Policy evaluation (OPE) is concerned with evaluating a new target policy using offline data generated by a potentially different behavior policy. It is critical in a number of sequential decision making problems ranging from healthcare to technology industries. Most of the work in existing literature is focused on evaluating the mean outcome of a given policy, and ignores the variability of th… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

  14. arXiv:2212.12845  [pdf, ps, other

    stat.ME cs.LG

    Mining the Factor Zoo: Estimation of Latent Factor Models with Sufficient Proxies

    Authors: Runzhe Wan, Yingying Li, Wenbin Lu, Rui Song

    Abstract: Latent factor model estimation typically relies on either using domain knowledge to manually pick several observed covariates as factor proxies, or purely conducting multivariate analysis such as principal component analysis. However, the former approach may suffer from the bias while the latter can not incorporate additional information. We propose to bridge these two approaches while allowing th… ▽ More

    Submitted 2 January, 2023; v1 submitted 24 December, 2022; originally announced December 2022.

  15. arXiv:2209.11363  [pdf, other

    stat.ME stat.AP stat.CO

    Sure Screening for Transelliptical Graphical Models

    Authors: Yuxiang Xie, Chengchun Shi, Rui Song

    Abstract: We propose a sure screening approach for recovering the structure of a transelliptical graphical model in the high dimensional setting. We estimate the partial correlation graph by thresholding the elements of an estimator of the sample correlation matrix obtained using Kendall's tau statistic. Under a simple assumption on the relationship between the correlation and partial correlation graphs, we… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: The paper won the David Byar travel award in the Joint Statistical Meetings (JSM) 2016

  16. arXiv:2204.04052  [pdf, other

    stat.ME

    Transformation-Invariant Learning of Optimal Individualized Decision Rules with Time-to-Event Outcomes

    Authors: Yu Zhou, Lan Wang, Rui Song, Tuoyi Zhao

    Abstract: In many important applications of precision medicine, the outcome of interest is time to an event (e.g., death, relapse of disease) and the primary goal is to identify the optimal individualized decision rule (IDR) to prolong survival time. Existing work in this area have been mostly focused on estimating the optimal IDR to maximize the We propose a new robust framework for estimating an optimal s… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

  17. arXiv:2203.06509  [pdf, other

    stat.CO

    Distributed Community Detection in Large Networks

    Authors: Sheng Zhang, Rui Song, Wenbin Lu, Ji Zhu

    Abstract: Community detection for large networks is a challenging task due to the high computational cost as well as the heterogeneous community structure. Stochastic block model (SBM) is a popular model to analyze community structure where nodes belonging to the same communities are connected with equal probability. Modularity optimization methods provide a fast and effective way for community detection un… ▽ More

    Submitted 12 March, 2022; originally announced March 2022.

  18. arXiv:2203.02318  [pdf, ps, other

    stat.ME stat.ML

    Adaptive Semi-Supervised Inference for Optimal Treatment Decisions with Electronic Medical Record Data

    Authors: Kevin Gunn, Wenbin Lu, Rui Song

    Abstract: A treatment regime is a rule that assigns a treatment to patients based on their covariate information. Recently, estimation of the optimal treatment regime that yields the greatest overall expected clinical outcome of interest has attracted a lot of attention. In this work, we consider estimation of the optimal treatment regime with electronic medical record data under a semi-supervised setting.… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

  19. arXiv:2202.13163  [pdf, other

    stat.ML cs.LG

    Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite Horizons

    Authors: Chengchun Shi, Shikai Luo, Yuan Le, Hongtu Zhu, Rui Song

    Abstract: We consider reinforcement learning (RL) methods in offline domains without additional online data collection, such as mobile health applications. Most of existing policy optimization algorithms in the computer science literature are developed in online settings where data are easy to collect or simulate. Their generalizations to mobile health applications with a pre-collected offline dataset remai… ▽ More

    Submitted 26 July, 2022; v1 submitted 26 February, 2022; originally announced February 2022.

  20. arXiv:2202.12819  [pdf, other

    stat.AP stat.ML

    Exploratory Hidden Markov Factor Models for Longitudinal Mobile Health Data: Application to Adverse Posttraumatic Neuropsychiatric Sequelae

    Authors: Lin Ge, Xinming An, Donglin Zeng, Samuel McLean, Ronald Kessler, Rui Song

    Abstract: Adverse posttraumatic neuropsychiatric sequelae (APNS) are common among veterans and millions of Americans after traumatic exposures, resulting in substantial burdens for trauma survivors and society. Despite numerous studies conducted on APNS over the past decades, there has been limited progress in understanding the underlying neurobiological mechanisms due to several unique challenges. One of t… ▽ More

    Submitted 4 June, 2023; v1 submitted 25 February, 2022; originally announced February 2022.

  21. arXiv:2202.12440  [pdf, other

    stat.ML cs.LG

    On Learning and Testing of Counterfactual Fairness through Data Preprocessing

    Authors: Haoyu Chen, Wenbin Lu, Rui Song, Pulak Ghosh

    Abstract: Machine learning has become more important in real-life decision-making but people are concerned about the ethical problems it may bring when used improperly. Recent work brings the discussion of machine learning fairness into the causal framework and elaborates on the concept of Counterfactual Fairness. In this paper, we develop the Fair Learning through dAta Preprocessing (FLAP) algorithm to lea… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

  22. arXiv:2202.10589  [pdf, other

    stat.ML cs.LG

    Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process

    Authors: Chengchun Shi, ** Zhu, Ye Shen, Shikai Luo, Hongtu Zhu, Rui Song

    Abstract: This paper is concerned with constructing a confidence interval for a target policy's value offline based on a pre-collected observational data in infinite horizon settings. Most of the existing works assume no unmeasured variables exist that confound the observed actions. This assumption, however, is likely to be violated in real applications such as healthcare and technological industries. In th… ▽ More

    Submitted 3 November, 2022; v1 submitted 21 February, 2022; originally announced February 2022.

  23. arXiv:2202.10574  [pdf, other

    stat.ML cs.LG

    A Multi-Agent Reinforcement Learning Framework for Off-Policy Evaluation in Two-sided Markets

    Authors: Chengchun Shi, Runzhe Wan, Ge Song, Shikai Luo, Rui Song, Hongtu Zhu

    Abstract: The two-sided markets such as ride-sharing companies often involve a group of subjects who are making sequential decisions across time and/or location. With the rapid development of smart phones and internet of things, they have substantially transformed the transportation landscape of human beings. In this paper we consider large-scale fleet management in ride-sharing companies that involve multi… ▽ More

    Submitted 26 March, 2023; v1 submitted 21 February, 2022; originally announced February 2022.

  24. arXiv:2202.00088  [pdf, other

    cs.LG stat.ME

    Reinforcement Learning with Heterogeneous Data: Estimation and Inference

    Authors: Elynn Y. Chen, Rui Song, Michael I. Jordan

    Abstract: Reinforcement Learning (RL) has the promise of providing data-driven support for decision-making in a wide range of problems in healthcare, education, business, and other domains. Classical RL methods focus on the mean of the total return and, thus, may provide misleading results in the setting of the heterogeneous populations that commonly underlie large-scale datasets. We introduce the K-Heterog… ▽ More

    Submitted 31 January, 2022; originally announced February 2022.

  25. arXiv:2201.07998  [pdf, ps, other

    stat.ML math.ST

    Statistical Learning for Individualized Asset Allocation

    Authors: Yi Ding, Yingying Li, Rui Song

    Abstract: We establish a high-dimensional statistical learning framework for individualized asset allocation. Our proposed methodology addresses continuous-action decision-making with a large number of characteristics. We develop a discretization approach to model the effect of continuous actions and allow the discretization frequency to be large and diverge with the number of observations. The value functi… ▽ More

    Submitted 8 November, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

  26. arXiv:2111.15367  [pdf, other

    q-fin.ST cs.LG stat.AP

    A Review on Graph Neural Network Methods in Financial Applications

    Authors: Jianian Wang, Sheng Zhang, Yanghua Xiao, Rui Song

    Abstract: With multiple components and relations, financial data are often presented as graph data, since it could represent both the individual features and the complicated relations. Due to the complexity and volatility of the financial market, the graph constructed on the financial data is often heterogeneous or time-varying, which imposes challenges on modeling technology. Among the graph modeling techn… ▽ More

    Submitted 26 April, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

  27. arXiv:2111.10425  [pdf, other

    stat.ME

    Flexible Inference of Optimal Individualized Treatment Strategy in Covariate Adjusted Randomization with Multiple Covariates

    Authors: Trinetri Ghosh, Yanyuan Ma, Rui Song, **shou Zhong

    Abstract: To maximize clinical benefit, clinicians routinely tailor treatment to the individual characteristics of each patient, where individualized treatment rules are needed and are of significant research interest to statisticians. In the covariate-adjusted randomization clinical trial with many covariates, we model the treatment effect with an unspecified function of a single index of the covariates an… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: 28 pages, 7 figures

  28. arXiv:2111.08885  [pdf, other

    stat.ME cs.LG math.ST stat.ML

    Jump Interval-Learning for Individualized Decision Making

    Authors: Hengrui Cai, Chengchun Shi, Rui Song, Wenbin Lu

    Abstract: An individualized decision rule (IDR) is a decision function that assigns each individual a given treatment based on his/her observed characteristics. Most of the existing works in the literature consider settings with binary or finitely many treatment options. In this paper, we focus on the continuous treatment setting and propose a jump interval-learning to develop an individualized interval-val… ▽ More

    Submitted 28 January, 2023; v1 submitted 16 November, 2021; originally announced November 2021.

  29. arXiv:2111.03943  [pdf, ps, other

    cs.LG stat.CO stat.ML

    A Probit Tensor Factorization Model For Relational Learning

    Authors: Ye Liu, Rui Song, Wenbin Lu, Yanghua Xiao

    Abstract: With the proliferation of knowledge graphs, modeling data with complex multirelational structure has gained increasing attention in the area of statistical relational learning. One of the most important goals of statistical relational learning is link prediction, i.e., predicting whether certain relations exist in the knowledge graph. A large number of models and algorithms have been proposed to p… ▽ More

    Submitted 8 November, 2021; v1 submitted 6 November, 2021; originally announced November 2021.

    Comments: 30 pages

  30. arXiv:2111.03908  [pdf, other

    stat.ME math.ST

    An Online Sequential Test for Qualitative Treatment Effects

    Authors: Chengchun Shi, Shikai Luo, Hongtu Zhu, Rui Song

    Abstract: Tech companies (e.g., Google or Facebook) often use randomized online experiments and/or A/B testing primarily based on the average treatment effects to compare their new product with an old one. However, it is also critically important to detect qualitative treatment effects such that the new one may significantly outperform the existing one only under some specific circumstances. The aim of this… ▽ More

    Submitted 6 November, 2021; originally announced November 2021.

  31. arXiv:2110.15501  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Doubly Robust Interval Estimation for Optimal Policy Evaluation in Online Learning

    Authors: Ye Shen, Hengrui Cai, Rui Song

    Abstract: Evaluating the performance of an ongoing policy plays a vital role in many areas such as medicine and economics, to provide crucial instruction on the early-stop of the online experiment and timely feedback from the environment. Policy evaluation in online learning thus attracts increasing attention by inferring the mean outcome of the optimal policy (i.e., the value) in real-time. Yet, such a pro… ▽ More

    Submitted 28 January, 2023; v1 submitted 28 October, 2021; originally announced October 2021.

  32. arXiv:2109.00712  [pdf, other

    stat.ME

    Online Testing of Subgroup Treatment Effects Based on Value Difference

    Authors: Miao Yu, Wenbin Lu, Rui Song

    Abstract: Online A/B testing plays a critical role in the high-tech industry to guide product development and accelerate innovation. It performs a null hypothesis statistical test to determine which variant is better. However, a typical A/B test presents two problems: (i) a fixed-horizon framework inflates the false-positive errors under continuous monitoring; (ii) the homogeneous effects assumption fails t… ▽ More

    Submitted 2 September, 2021; originally announced September 2021.

  33. arXiv:2107.04839  [pdf, other

    stat.ME stat.AP

    On Estimating Optimal Regime for Treatment Initiation Time Based on Restricted Mean Residual Lifetime

    Authors: Xin Chen, Rui Song, Jiajia Zhang, Swann Arp Adams, Liuquan Sun, Wenbin Lu

    Abstract: When to initiate treatment on patients is an important problem in many medical studies such as AIDS and cancer. In this article, we formulate the treatment initiation time problem for time-to-event data and propose an optimal individualized regime that determines the best treatment initiation time for individual patients based on their characteristics. Different from existing optimal treatment reg… ▽ More

    Submitted 29 September, 2021; v1 submitted 10 July, 2021; originally announced July 2021.

  34. arXiv:2105.04646  [pdf, other

    stat.ML cs.AI cs.LG

    Deeply-Debiased Off-Policy Interval Estimation

    Authors: Chengchun Shi, Runzhe Wan, Victor Chernozhukov, Rui Song

    Abstract: Off-policy evaluation learns a target policy's value with a historical dataset generated by a different behavior policy. In addition to a point estimate, many applications would benefit significantly from having a confidence interval (CI) that quantifies the uncertainty of the point estimate. In this paper, we propose a novel deeply-debiasing procedure to construct an efficient, robust, and flexib… ▽ More

    Submitted 7 June, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

  35. arXiv:2104.10573  [pdf, other

    stat.ME math.ST stat.AP stat.ML

    GEAR: On Optimal Decision Making with Auxiliary Data

    Authors: Hengrui Cai, Rui Song, Wenbin Lu

    Abstract: Personalized optimal decision making, finding the optimal decision rule (ODR) based on individual characteristics, has attracted increasing attention recently in many fields, such as education, economics, and medicine. Current ODR methods usually require the primary outcome of interest in samples for assessing treatment effects, namely the experimental sample. However, in many studies, treatments… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

  36. arXiv:2104.10554  [pdf, other

    stat.ME math.ST stat.AP stat.ML

    Calibrated Optimal Decision Making with Multiple Data Sources and Limited Outcome

    Authors: Hengrui Cai, Wenbin Lu, Rui Song

    Abstract: We consider the optimal decision-making problem in a primary sample of interest with multiple auxiliary sources available. The outcome of interest is limited in the sense that it is only observed in the primary sample. In reality, such multiple data sources may belong to heterogeneous studies and thus cannot be combined directly. This paper proposes a new framework to handle heterogeneous samples… ▽ More

    Submitted 21 September, 2022; v1 submitted 21 April, 2021; originally announced April 2021.

  37. arXiv:2103.04215  [pdf, ps, other

    stat.ML cs.LG

    Hierarchical Causal Bandit

    Authors: Ruiyang Song, Stefano Rini, Kuang Xu

    Abstract: Causal bandit is a nascent learning model where an agent sequentially experiments in a causal network of variables, in order to identify the reward-maximizing intervention. Despite the model's wide applicability, existing analytical results are largely restricted to a parallel bandit version where all variables are mutually independent. We introduce in this work the hierarchical causal bandit mode… ▽ More

    Submitted 6 March, 2021; originally announced March 2021.

  38. arXiv:2010.15963  [pdf, other

    stat.ML cs.LG

    Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings

    Authors: Hengrui Cai, Chengchun Shi, Rui Song, Wenbin Lu

    Abstract: We consider off-policy evaluation (OPE) in continuous treatment settings, such as personalized dose-finding. In OPE, one aims to estimate the mean outcome under a new treatment decision rule using historical data generated by a different decision rule. Most existing works on OPE focus on discrete treatment settings. To handle continuous treatments, we develop a novel estimation method for OPE usin… ▽ More

    Submitted 4 November, 2021; v1 submitted 29 October, 2020; originally announced October 2020.

  39. Statistical Inference for Online Decision Making via Stochastic Gradient Descent

    Authors: Haoyu Chen, Wenbin Lu, Rui Song

    Abstract: Online decision making aims to learn the optimal decision rule by making personalized decisions and updating the decision rule recursively. It has become easier than before with the help of big data, but new challenges also come along. Since the decision rule should be updated once per step, an offline update which uses all the historical data is inefficient in computation and storage. To this end… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

    Comments: Accepted by the Journal of the American Statistical Association

  40. Statistical Inference for Online Decision-Making: In a Contextual Bandit Setting

    Authors: Haoyu Chen, Wenbin Lu, Rui Song

    Abstract: Online decision-making problem requires us to make a sequence of decisions based on incremental information. Common solutions often need to learn a reward model of different actions given the contextual information and then maximize the long-term reward. It is meaningful to know if the posited model is reasonable and how the model performs in the asymptotic sense. We study this problem under the s… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

    Comments: Accepted by the Journal of the American Statistical Association

  41. arXiv:2009.04607  [pdf, other

    cs.LG stat.ML

    Multi-Objective Model-based Reinforcement Learning for Infectious Disease Control

    Authors: Runzhe Wan, Xinyu Zhang, Rui Song

    Abstract: Severe infectious diseases such as the novel coronavirus (COVID-19) pose a huge threat to public health. Stringent control measures, such as school closures and stay-at-home orders, while having significant effects, also bring huge economic losses. In the face of an emerging infectious disease, a crucial question for policymakers is how to make the trade-off and implement the appropriate intervent… ▽ More

    Submitted 26 February, 2022; v1 submitted 9 September, 2020; originally announced September 2020.

  42. arXiv:2007.09812  [pdf, ps, other

    stat.ME

    Causal Effect Estimation and Optimal Dose Suggestions in Mobile Health

    Authors: Liangyu Zhu, Wenbin Lu, Rui Song

    Abstract: In this article, we propose novel structural nested models to estimate causal effects of continuous treatments based on mobile health data. To find the treatment regime that optimizes the expected short-term outcomes for patients, we define a weighted lag-K advantage as the value function. The optimal treatment regime is then defined to be the one that maximizes the value function. Our method impo… ▽ More

    Submitted 23 July, 2020; v1 submitted 19 July, 2020; originally announced July 2020.

    Comments: Accepted for ICML 2020

  43. arXiv:2007.09811  [pdf, ps, other

    stat.ME math.ST stat.AP stat.ML

    Kernel Assisted Learning for Personalized Dose Finding

    Authors: Liangyu Zhu, Wenbin Lu, Michael R. Kosorok, Rui Song

    Abstract: An individualized dose rule recommends a dose level within a continuous safe dose range based on patient level information such as physical conditions, genetic factors and medication histories. Traditionally, personalized dose finding process requires repeating clinical visits of the patient and frequent adjustments of the dosage. Thus the patient is constantly exposed to the risk of underdosing a… ▽ More

    Submitted 19 July, 2020; originally announced July 2020.

    Comments: Accepted for KDD 2020

  44. arXiv:2005.04353  [pdf, other

    cs.SD cs.LG stat.ML

    Dual-track Music Generation using Deep Learning

    Authors: Sudi Lyu, Anxiang Zhang, Rong Song

    Abstract: Music generation is always interesting in a sense that there is no formalized recipe. In this work, we propose a novel dual-track architecture for generating classical piano music, which is able to model the inter-dependency of left-hand and right-hand piano music. Particularly, we experimented with a lot of different models of neural network as well as different representations of music, and the… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: 8 pages, 7 figures

  45. arXiv:2002.03277  [pdf, ps, other

    stat.ME stat.AP

    A New Framework for Online Testing of Heterogeneous Treatment Effect

    Authors: Miao Yu, Wenbin Lu, Rui Song

    Abstract: We propose a new framework for online testing of heterogeneous treatment effects. The proposed test, named sequential score test (SST), is able to control type I error under continuous monitoring and detect multi-dimensional heterogeneous treatment effects. We provide an online p-value calculation for SST, making it convenient for continuous monitoring, and extend our tests to online multiple test… ▽ More

    Submitted 8 February, 2020; originally announced February 2020.

    Comments: 8 pages, no figures. To be published on AAAI 2020 proceedings

  46. arXiv:2002.01751  [pdf, other

    stat.ML cs.LG

    Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making

    Authors: Chengchun Shi, Runzhe Wan, Rui Song, Wenbin Lu, Ling Leng

    Abstract: The Markov assumption (MA) is fundamental to the empirical validity of reinforcement learning. In this paper, we propose a novel Forward-Backward Learning procedure to test MA in sequential decision making. The proposed test does not assume any parametric form on the joint distribution of the observed data and plays an important role for identifying the optimal policy in high-order Markov decision… ▽ More

    Submitted 5 February, 2020; originally announced February 2020.

  47. arXiv:2002.01711  [pdf, other

    cs.LG stat.ML

    Dynamic Causal Effects Evaluation in A/B Testing with a Reinforcement Learning Framework

    Authors: Chengchun Shi, Xiaoyu Wang, Shikai Luo, Hongtu Zhu, Jie** Ye, Rui Song

    Abstract: A/B testing, or online experiment is a standard business strategy to compare a new product with an old one in pharmaceutical, technological, and traditional industries. Major challenges arise in online experiments of two-sided marketplace platforms (e.g., Uber) where there is only one unit that receives a sequence of treatments over time. In those experiments, the treatment at a given time impacts… ▽ More

    Submitted 3 November, 2022; v1 submitted 5 February, 2020; originally announced February 2020.

  48. arXiv:2001.04515  [pdf, other

    stat.ML cs.LG

    Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings

    Authors: C. Shi, S. Zhang, W. Lu, R. Song

    Abstract: Reinforcement learning is a general technique that allows an agent to learn an optimal policy and interact with an environment in sequential decision making problems. The goodness of a policy is measured by its value function starting from some initial state. The focus of this paper is to construct confidence intervals (CIs) for a policy's value in infinite horizon settings where the number of dec… ▽ More

    Submitted 20 June, 2021; v1 submitted 13 January, 2020; originally announced January 2020.

  49. Efficient Multivariate Bandit Algorithm with Path Planning

    Authors: Keyu Nie, Zezhong Zhang, Ted Tao Yuan, Rong Song, Pauline Berry Burke

    Abstract: In this paper, we solve the arms exponential exploding issue in multivariate Multi-Armed Bandit (Multivariate-MAB) problem when the arm dimension hierarchy is considered. We propose a framework called path planning (TS-PP) which utilizes decision graph/trees to model arm reward success rate with m-way dimension interaction, and adopts Thompson sampling (TS) for heuristic search of arm selection. N… ▽ More

    Submitted 15 September, 2020; v1 submitted 6 September, 2019; originally announced September 2019.

    Comments: Multi-Armed Bandit, Monte Carlo Tree Search, Decision Tree, Path Planning

    Journal ref: Proceedings of the 32nd IEEE International Conference on Tools with Artificial Intelligence (ICTAI '20), November 09--11, 2020, Baltimore, MD, USA

  50. arXiv:1903.05212  [pdf, ps, other

    stat.ME

    Doubly Robust Inference when Combining Probability and Non-probability Samples with High-dimensional Data

    Authors: Shu Yang, Jae Kwang Kim, Rui Song

    Abstract: Non-probability samples become increasingly popular in survey statistics but may suffer from selection biases that limit the generalizability of results to the target population. We consider integrating a non-probability sample with a probability sample which provides high-dimensional representative covariate information of the target population. We propose a two-step approach for variable selecti… ▽ More

    Submitted 23 August, 2019; v1 submitted 12 March, 2019; originally announced March 2019.