Skip to main content

Showing 1–37 of 37 results for author: Murphy, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2403.10946  [pdf, other

    stat.ML cs.LG

    The Fallacy of Minimizing Local Regret in the Sequential Task Setting

    Authors: Zi** Xu, Kelly W. Zhang, Susan A. Murphy

    Abstract: In the realm of Reinforcement Learning (RL), online RL is often conceptualized as an optimization problem, where an algorithm interacts with an unknown environment to minimize cumulative regret. In a stationary setting, strong theoretical guarantees, like a sublinear ($\sqrt{T}$) regret bound, can be obtained, which typically implies the convergence to an optimal policy and the cessation of explor… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  2. arXiv:2308.07843  [pdf, other

    cs.LG stat.AP stat.ML

    Dyadic Reinforcement Learning

    Authors: Shuangning Li, Lluis Salvat Niell, Sung Won Choi, Inbal Nahum-Shani, Guy Shani, Susan Murphy

    Abstract: Mobile health aims to enhance health outcomes by delivering interventions to individuals as they go about their daily life. The involvement of care partners and social support networks often proves crucial in hel** individuals managing burdensome medical conditions. This presents opportunities in mobile health to design interventions that target the dyadic relationship -- the relationship betwee… ▽ More

    Submitted 1 November, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

  3. arXiv:2307.13916  [pdf, other

    stat.ML cs.LG

    Online learning in bandits with predicted context

    Authors: Yongyi Guo, Zi** Xu, Susan Murphy

    Abstract: We consider the contextual bandit problem where at each time, the agent only has access to a noisy version of the context and the error variance (or an estimator of this variance). This setting is motivated by a wide range of applications where the true context for decision-making is unobserved, and only a prediction of the context by a potentially complex machine learning algorithm is available.… ▽ More

    Submitted 17 March, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

  4. arXiv:2306.11208  [pdf, other

    cs.LG cs.AI stat.ML

    The Unintended Consequences of Discount Regularization: Improving Regularization in Certainty Equivalence Reinforcement Learning

    Authors: Sarah Rathnam, Sonali Parbhoo, Weiwei Pan, Susan A. Murphy, Finale Doshi-Velez

    Abstract: Discount regularization, using a shorter planning horizon when calculating the optimal policy, is a popular choice to restrict planning to a less complex set of policies when estimating an MDP from sparse or noisy data (Jiang et al., 2015). It is commonly understood that discount regularization functions by de-emphasizing or ignoring delayed effects. In this paper, we reveal an alternate view of d… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

  5. arXiv:2306.10983  [pdf, other

    stat.ML cs.LG

    Effect-Invariant Mechanisms for Policy Generalization

    Authors: Sorawit Saengkyongam, Niklas Pfister, Predrag Klasnja, Susan Murphy, Jonas Peters

    Abstract: Policy learning is an important component of many real-world learning systems. A major challenge in policy learning is how to adapt efficiently to unseen environments or tasks. Recently, it has been suggested to exploit invariant conditional distributions to learn models that generalize better to unseen environments. However, assuming invariance of entire conditional distributions (which we call f… ▽ More

    Submitted 27 June, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

  6. arXiv:2304.05365  [pdf, other

    cs.LG stat.AP stat.ME stat.ML

    Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling

    Authors: Susobhan Ghosh, Raphael Kim, Prasidh Chhabria, Raaz Dwivedi, Predrag Klasnja, Peng Liao, Kelly Zhang, Susan Murphy

    Abstract: There is a growing interest in using reinforcement learning (RL) to personalize sequences of treatments in digital health to support users in adopting healthier behaviors. Such sequential decision-making problems involve decisions about when to treat and how to treat based on the user's context (e.g., prior activity level, location, etc.). Online RL is a promising data-driven approach for this pro… ▽ More

    Submitted 7 August, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: The first two authors contributed equally

  7. arXiv:2211.14297  [pdf, ps, other

    stat.ML cs.LG

    Doubly robust nearest neighbors in factor models

    Authors: Raaz Dwivedi, Katherine Tian, Sabina Tomkins, Predrag Klasnja, Susan Murphy, Devavrat Shah

    Abstract: We introduce and analyze an improved variant of nearest neighbors (NN) for estimation with missing data in latent factor models. We consider a matrix completion problem with missing data, where the $(i, t)$-th entry, when observed, is given by its mean $f(u_i, v_t)$ plus mean-zero noise for an unknown function $f$ and latent factors $u_i$ and $v_t$. Prior NN strategies, like unit-unit NN, for esti… ▽ More

    Submitted 29 January, 2024; v1 submitted 25 November, 2022; originally announced November 2022.

  8. arXiv:2203.00097  [pdf

    stat.ME cs.AI cs.LG econ.EM math.OC

    Estimating causal effects with optimization-based methods: A review and empirical comparison

    Authors: Martin Cousineau, Vedat Verter, Susan A. Murphy, Joelle Pineau

    Abstract: In the absence of randomized controlled and natural experiments, it is necessary to balance the distributions of (observable) covariates of the treated and control groups in order to obtain an unbiased estimate of a causal effect of interest; otherwise, a different effect size may be estimated, and incorrect recommendations may be given. To achieve this balance, there exist a wide variety of metho… ▽ More

    Submitted 28 February, 2022; originally announced March 2022.

    Comments: In Press, Corrected Proof

    Journal ref: European Journal of Operational Research, 2022, 14 pages

  9. arXiv:2202.07098  [pdf, ps, other

    cs.LG stat.ME

    Statistical Inference After Adaptive Sampling for Longitudinal Data

    Authors: Kelly W. Zhang, Lucas Janson, Susan A. Murphy

    Abstract: Online reinforcement learning and other adaptive sampling algorithms are increasingly used in digital intervention experiments to optimize treatment delivery for users over time. In this work, we focus on longitudinal user data collected by a large class of adaptive sampling algorithms that are designed to optimize treatment decisions online using accruing data from multiple users. Combining or "p… ▽ More

    Submitted 19 April, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Fixing typos

  10. arXiv:2202.06891  [pdf, other

    stat.ML cs.LG

    Counterfactual inference for sequential experiments

    Authors: Raaz Dwivedi, Katherine Tian, Sabina Tomkins, Predrag Klasnja, Susan Murphy, Devavrat Shah

    Abstract: We consider after-study statistical inference for sequentially designed experiments wherein multiple units are assigned treatments for multiple time points using treatment policies that adapt over time. Our goal is to provide inference guarantees for the counterfactual mean at the smallest possible scale -- mean outcome under different treatments for each unit and each time -- with minimal assumpt… ▽ More

    Submitted 16 April, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

  11. arXiv:2109.08134  [pdf, other

    cs.LG stat.ML

    Comparison and Unification of Three Regularization Methods in Batch Reinforcement Learning

    Authors: Sarah Rathnam, Susan A. Murphy, Finale Doshi-Velez

    Abstract: In batch reinforcement learning, there can be poorly explored state-action pairs resulting in poorly learned, inaccurate models and poorly performing associated policies. Various regularization methods can mitigate the problem of learning overly-complex models in Markov decision processes (MDPs), however they operate in technically and intuitively distinct ways and lack a common form in which to c… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: ICML Workshop on Reinforcement Learning Theory 2021

  12. arXiv:2107.09949  [pdf, other

    cs.LG stat.ML

    Online structural kernel selection for mobile health

    Authors: Eura Shin, Pedja Klasnja, Susan Murphy, Finale Doshi-Velez

    Abstract: Motivated by the need for efficient and personalized learning in mobile health, we investigate the problem of online kernel selection for Gaussian Process regression in the multi-task setting. We propose a novel generative process on the kernel composition for this purpose. Our method demonstrates that trajectories of kernel evolutions can be transferred between users to improve learning and that… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

    Comments: Workshop paper in ICML IMLH 2021

  13. arXiv:2107.03544  [pdf

    stat.AP stat.ME

    The Micro-Randomized Trial for Develo** Digital Interventions: Experimental Design and Data Analysis Considerations

    Authors: Tianchen Qian, Ashley E. Walton, Linda M. Collins, Predrag Klasnja, Stephanie T. Lanza, Inbal Nahum-Shani, Mashifiqui Rabbi, Michael A. Russell, Maureen A. Walton, Hyesun Yoo, Susan A. Murphy

    Abstract: Just-in-time adaptive interventions (JITAIs) are time-varying adaptive interventions that use frequent opportunities for the intervention to be adapted--weekly, daily, or even many times a day. The micro-randomized trial (MRT) has emerged for use in informing the construction of JITAIs. MRTs can be used to address research questions about whether and under what circumstances JITAI components are e… ▽ More

    Submitted 25 November, 2021; v1 submitted 7 July, 2021; originally announced July 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2005.05880, arXiv:2004.10241

  14. arXiv:2012.11646  [pdf, other

    cs.LG cs.CY stat.ML

    Fast Physical Activity Suggestions: Efficient Hyperparameter Learning in Mobile Health

    Authors: Marianne Menictas, Sabina Tomkins, Susan Murphy

    Abstract: Users can be supported to adopt healthy behaviors, such as regular physical activity, via relevant and timely suggestions on their mobile devices. Recently, reinforcement learning algorithms have been found to be effective for learning the optimal context under which to provide suggestions. However, these algorithms are not necessarily designed for the constraints posed by mobile health (mHealth)… ▽ More

    Submitted 21 December, 2020; originally announced December 2020.

    Comments: Neurips 2020 workshop: Machine Learning in Mobile Health. arXiv admin note: substantial text overlap with arXiv:2003.12881

  15. arXiv:2008.03869  [pdf

    stat.ML cs.LG q-bio.QM

    Individualized Prediction of COVID-19 Adverse outcomes with MLHO

    Authors: Hossein Estiri, Zachary H. Strasser, Shawn N. Murphy

    Abstract: We developed MLHO (pronounced as melo), an end-to-end Machine Learning framework that leverages iterative feature and algorithm selection to predict Health Outcomes. MLHO implements iterative sequential representation mining, and feature and model selection, for predicting the patient-level risk of hospitalization, ICU admission, need for mechanical ventilation, and death. It bases this prediction… ▽ More

    Submitted 29 December, 2020; v1 submitted 9 August, 2020; originally announced August 2020.

  16. arXiv:2008.01571  [pdf, other

    cs.LG cs.CY stat.ML

    IntelligentPooling: Practical Thompson Sampling for mHealth

    Authors: Sabina Tomkins, Peng Liao, Predrag Klasnja, Susan Murphy

    Abstract: In mobile health (mHealth) smart devices deliver behavioral treatments repeatedly over time to a user with the goal of hel** the user adopt and maintain healthy behaviors. Reinforcement learning appears ideal for learning how to optimally make these sequential treatment decisions. However, significant challenges must be overcome before reinforcement learning can be effectively deployed in a mobi… ▽ More

    Submitted 12 December, 2020; v1 submitted 31 July, 2020; originally announced August 2020.

    Comments: arXiv admin note: text overlap with arXiv:2002.09971

  17. arXiv:2007.11771  [pdf, other

    math.ST stat.ML

    Batch Policy Learning in Average Reward Markov Decision Processes

    Authors: Peng Liao, Zhengling Qi, Runzhe Wan, Predrag Klasnja, Susan Murphy

    Abstract: We consider the batch (off-line) policy learning problem in the infinite horizon Markov Decision Process. Motivated by mobile health applications, we focus on learning a policy that maximizes the long-term average reward. We propose a doubly robust estimator for the average reward and show that it achieves semiparametric efficiency. Further we develop an optimization algorithm to compute the optim… ▽ More

    Submitted 17 September, 2022; v1 submitted 22 July, 2020; originally announced July 2020.

  18. arXiv:2005.05880  [pdf

    cs.HC stat.ME

    The Micro-Randomized Trial for Develo** Digital Interventions: Experimental Design Considerations

    Authors: Ashley E. Walton, Linda M. Collins, Predrag Klasnja, Inbal Nahum-Shani, Mashfiqui Rabbi, Maureen A. Walton, Susan A. Murphy

    Abstract: Just-in-time adaptive interventions (JITAIs) are time-varying adaptive interventions that use frequent opportunities for the intervention to be adapted such as weekly, daily, or even many times a day. This high intensity of adaptation is facilitated by the ability of digital technology to continuously collect information about an individual's current context and deliver treatments adapted to this… ▽ More

    Submitted 23 April, 2020; originally announced May 2020.

    MSC Class: 62P15

  19. arXiv:2004.10241  [pdf

    stat.AP

    The Micro-Randomized Trial for Develo** Digital Interventions: Data Analysis Methods

    Authors: Tianchen Qian, Michael A. Russell, Linda M. Collins, Predrag Klasnja, Stephanie T. Lanza, Hyesun Yoo, Susan A. Murphy

    Abstract: Although there is much excitement surrounding the use of mobile and wearable technology for the purposes of delivering interventions as people go through their day-to-day lives, data analysis methods for constructing and optimizing digital interventions lag behind. Here, we elucidate data analysis methods for primary and secondary analyses of micro-randomized trials (MRTs), an experimental design… ▽ More

    Submitted 21 April, 2020; originally announced April 2020.

  20. arXiv:2004.06230  [pdf, other

    cs.LG stat.ML

    Power Constrained Bandits

    Authors: Jiayu Yao, Emma Brunskill, Weiwei Pan, Susan Murphy, Finale Doshi-Velez

    Abstract: Contextual bandits often provide simple and effective personalization in decision making problems, making them popular tools to deliver personalized interventions in mobile health as well as other health applications. However, when bandits are deployed in the context of a scientific study -- e.g. a clinical trial to test if a mobile health intervention is effective -- the aim is not only to person… ▽ More

    Submitted 27 July, 2021; v1 submitted 13 April, 2020; originally announced April 2020.

    Comments: Accepted at MLHC 2021

  21. arXiv:2003.12881  [pdf, other

    stat.ML cs.LG

    Streamlined Empirical Bayes Fitting of Linear Mixed Models in Mobile Health

    Authors: Marianne Menictas, Sabina Tomkins, Susan A Murphy

    Abstract: To effect behavior change a successful algorithm must make high-quality decisions in real-time. For example, a mobile health (mHealth) application designed to increase physical activity must make contextually relevant suggestions to motivate users. While machine learning offers solutions for certain stylized settings, such as when batch data can be processed offline, there is a dearth of approache… ▽ More

    Submitted 28 March, 2020; originally announced March 2020.

  22. arXiv:2002.09971  [pdf, other

    cs.LG cs.CY stat.ML

    Rapidly Personalizing Mobile Health Treatment Policies with Limited Data

    Authors: Sabina Tomkins, Peng Liao, Predrag Klasnja, Serena Yeung, Susan Murphy

    Abstract: In mobile health (mHealth), reinforcement learning algorithms that adapt to one's context without learning personalized policies might fail to distinguish between the needs of individuals. Yet the high amount of noise due to the in situ delivery of mHealth interventions can cripple the ability of an algorithm to learn when given access to only a single user's data, making personalization challengi… ▽ More

    Submitted 23 February, 2020; originally announced February 2020.

  23. arXiv:2002.03217  [pdf, other

    cs.LG stat.ML

    Inference for Batched Bandits

    Authors: Kelly W. Zhang, Lucas Janson, Susan A. Murphy

    Abstract: As bandit algorithms are increasingly utilized in scientific studies and industrial applications, there is an associated increasing need for reliable inference methods based on the resulting adaptively-collected data. In this work, we develop methods for inference on data collected in batches using a bandit algorithm. We first prove that the ordinary least squares estimator (OLS), which is asympto… ▽ More

    Submitted 8 January, 2021; v1 submitted 8 February, 2020; originally announced February 2020.

    Journal ref: NeurIPS 2020

  24. arXiv:1912.13088  [pdf, ps, other

    cs.LG math.ST stat.ML

    Off-Policy Estimation of Long-Term Average Outcomes with Applications to Mobile Health

    Authors: Peng Liao, Predrag Klasnja, Susan Murphy

    Abstract: Due to the recent advancements in wearables and sensing technology, health scientists are increasingly develo** mobile health (mHealth) interventions. In mHealth interventions, mobile devices are used to deliver treatment to individuals as they go about their daily lives. These treatments are generally designed to impact a near time, proximal outcome such as stress or physical activity. The mHea… ▽ More

    Submitted 22 July, 2020; v1 submitted 30 December, 2019; originally announced December 2019.

  25. arXiv:1906.00528  [pdf, other

    stat.ME

    Estimating Time-Varying Causal Excursion Effect in Mobile Health with Binary Outcomes

    Authors: Tianchen Qian, Hyesun Yoo, Predrag Klasnja, Daniel Almirall, Susan A. Murphy

    Abstract: Advances in wearables and digital technology now make it possible to deliver behavioral mobile health interventions to individuals in their everyday life. The micro-randomized trial (MRT) is increasingly used to provide data to inform the construction of these interventions. In an MRT, each individual is repeatedly randomized among multiple intervention options, often hundreds or even thousands of… ▽ More

    Submitted 29 July, 2020; v1 submitted 2 June, 2019; originally announced June 2019.

  26. arXiv:1902.10861  [pdf, ps, other

    stat.ME

    Linear mixed models with endogenous covariates: modeling sequential treatment effects with application to a mobile health study

    Authors: Tianchen Qian, Predrag Klasnja, Susan A. Murphy

    Abstract: Mobile health is a rapidly develo** field in which behavioral treatments are delivered to individuals via wearables or smartphones to facilitate health-related behavior change. Micro-randomized trials (MRT) are an experimental design for develo** mobile health interventions. In an MRT the treatments are randomized numerous times for each individual over course of the trial. Along with assessin… ▽ More

    Submitted 10 May, 2019; v1 submitted 27 February, 2019; originally announced February 2019.

  27. Practical Considerations for Data Collection and Management in Mobile Health Micro-randomized Trials

    Authors: Nicholas J. Seewald, Shawna N. Smith, Andy **seok Lee, Predrag Klasnja, Susan A. Murphy

    Abstract: There is a growing interest in leveraging the prevalence of mobile technology to improve health by delivering momentary, contextualized interventions to individuals' smartphones. A just-in-time adaptive intervention (JITAI) adjusts to an individual's changing state and/or context to provide the right treatment, at the right time, in the right place. Micro-randomized trials (MRTs) allow for the col… ▽ More

    Submitted 27 December, 2018; originally announced December 2018.

    Comments: Author accepted manuscript

  28. arXiv:1812.00463  [pdf, other

    cs.LG stat.ML

    Personalizing Intervention Probabilities By Pooling

    Authors: Sabina Tomkins, Predrag Klasnja, Susan Murphy

    Abstract: In many mobile health interventions, treatments should only be delivered in a particular context, for example when a user is currently stressed, walking or sedentary. Even in an optimal context, concerns about user burden can restrict which treatments are sent. To diffuse the treatment delivery over times when a user is in a desired context, it is critical to predict the future number of times the… ▽ More

    Submitted 2 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:cs/0101200

    Report number: ML4H/2018/90

  29. arXiv:1711.03596  [pdf, other

    stat.ME

    Action Centered Contextual Bandits

    Authors: Kristjan Greenewald, Ambuj Tewari, Predrag Klasnja, Susan Murphy

    Abstract: Contextual bandits have become popular as they offer a middle ground between very simple approaches based on multi-armed bandits and very complex approaches using the full power of reinforcement learning. They have demonstrated success in web applications and have a rich body of associated theoretical guarantees. Linear models are well understood theoretically and preferred by practitioners becaus… ▽ More

    Submitted 9 November, 2017; originally announced November 2017.

    Comments: to appear at NIPS 2017

  30. arXiv:1711.03587  [pdf, other

    stat.AP

    The stratified micro-randomized trial design: sample size considerations for testing nested causal effects of time-varying treatments

    Authors: Walter Dempsey, Peng Liao, Santosh Kumar, Susan A. Murphy

    Abstract: Technological advancements in the field of mobile devices and wearable sensors have helped overcome obstacles in the delivery of care, making it possible to deliver behavioral treatments anytime and anywhere. Increasingly the delivery of these treatments is triggered by predictions of risk or engagement which may have been impacted by prior treatments. Furthermore the treatments are often designed… ▽ More

    Submitted 9 November, 2017; originally announced November 2017.

  31. arXiv:1706.09090  [pdf, other

    stat.ML cs.LG

    An Actor-Critic Contextual Bandit Algorithm for Personalized Mobile Health Interventions

    Authors: Huitian Lei, Yangyi Lu, Ambuj Tewari, Susan A. Murphy

    Abstract: Increasing technological sophistication and widespread use of smartphones and wearable devices provide opportunities for innovative and highly personalized health interventions. A Just-In-Time Adaptive Intervention (JITAI) uses real-time data collection and communication capabilities of modern mobile devices to deliver interventions in real-time that are adapted to the in-the-moment needs of the u… ▽ More

    Submitted 22 April, 2022; v1 submitted 27 June, 2017; originally announced June 2017.

    Comments: The theoretical analyses in this version are stronger compared to the previous one. This manuscript is not intended for publication

  32. arXiv:1607.05047  [pdf, other

    stat.ML cs.LG

    A Batch, Off-Policy, Actor-Critic Algorithm for Optimizing the Average Reward

    Authors: S. A. Murphy, Y. Deng, E. B. Laber, H. R. Maei, R. S. Sutton, K. Witkiewitz

    Abstract: We develop an off-policy actor-critic algorithm for learning an optimal policy from a training set composed of data from multiple individuals. This algorithm is developed with a view towards its use in mobile health.

    Submitted 18 July, 2016; originally announced July 2016.

  33. arXiv:1601.00237  [pdf, ps, other

    stat.ME

    Assessing Time-Varying Causal Effect Moderation in Mobile Health

    Authors: Audrey Boruvka, Daniel Almirall, Katie Witkiewitz, Susan A. Murphy

    Abstract: In mobile health interventions aimed at behavior change and maintenance, treatments are provided in real time to manage current or impending high risk situations or promote healthy behaviors in near real time. Currently there is great scientific interest in develo** data analysis approaches to guide the development of mobile interventions. In particular data from mobile health studies might be u… ▽ More

    Submitted 16 August, 2016; v1 submitted 2 January, 2016; originally announced January 2016.

    Comments: 24 pages plus Supplemental Appendix (18 pages), Github link for R code in Supplemental Appendix E

  34. Sample Size Calculations for Micro-randomized Trials in mHealth

    Authors: Peng Liao, Predrag Klasnja, Ambuj Tewari, Susan A. Murphy

    Abstract: The use and development of mobile interventions are experiencing rapid growth. In "just-in-time" mobile interventions, treatments are provided via a mobile device and they are intended to help an individual make healthy decisions "in the moment," and thus have a proximal, near future impact. Currently the development of mobile interventions is proceeding at a much faster pace than that of associat… ▽ More

    Submitted 22 July, 2020; v1 submitted 1 April, 2015; originally announced April 2015.

    Comments: 29 pages, 5 figures, 18 tables

    Journal ref: Statistics in medicine 35, no. 12 (2016): 1944-1971

  35. arXiv:1206.3274  [pdf

    cs.LG stat.ML

    Small Sample Inference for Generalization Error in Classification Using the CUD Bound

    Authors: Eric B. Laber, Susan A. Murphy

    Abstract: Confidence measures for the generalization error are crucial when small training samples are used to construct classifiers. A common approach is to estimate the generalization error by resampling and then assume the resampled estimator follows a known distribution to form a confidence set [Kohavi 1995, Martin 1996,Yang 2006]. Alternatively, one might bootstrap the resampled estimator of the genera… ▽ More

    Submitted 13 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI2008)

    Report number: UAI-P-2008-PG-357-365

  36. arXiv:1202.3714  [pdf

    cs.LG stat.ML

    Active Learning for Develo** Personalized Treatment

    Authors: Kun Deng, Joelle Pineau, Susan A. Murphy

    Abstract: The personalization of treatment via bio-markers and other risk categories has drawn increasing interest among clinical scientists. Personalized treatment strategies can be learned using data from clinical trials, but such trials are very costly to run. This paper explores the use of active learning techniques to design more efficient trials, addressing issues such as whom to recruit, at what poin… ▽ More

    Submitted 14 February, 2012; originally announced February 2012.

    Report number: UAI-P-2011-PG-161-168

  37. arXiv:1006.5831  [pdf, other

    stat.ME stat.ML stat.OT

    Statistical Inference in Dynamic Treatment Regimes

    Authors: Eric B. Laber, Min Qian, Dan J. Lizotte, William E. Pelham, Susan A. Murphy

    Abstract: Dynamic treatment regimes are of growing interest across the clinical sciences as these regimes provide one way to operationalize and thus inform sequential personalized clinical decision making. A dynamic treatment regime is a sequence of decision rules, with a decision rule per stage of clinical intervention; each decision rule maps up-to-date patient information to a recommended treatment. We b… ▽ More

    Submitted 26 November, 2013; v1 submitted 30 June, 2010; originally announced June 2010.

    MSC Class: 47N30