Skip to main content

Showing 1–3 of 3 results for author: Sonabend-W, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2206.05581  [pdf, other

    stat.ML cs.LG stat.ME

    Federated Offline Reinforcement Learning

    Authors: Doudou Zhou, Yufeng Zhang, Aaron Sonabend-W, Zhaoran Wang, Junwei Lu, Tianxi Cai

    Abstract: Evidence-based or data-driven dynamic treatment regimes are essential for personalized medicine, which can benefit from offline reinforcement learning (RL). Although massive healthcare data are available across medical institutions, they are prohibited from sharing due to privacy constraints. Besides, heterogeneity exists in different sites. As a result, federated offline RL algorithms are necessa… ▽ More

    Submitted 27 January, 2024; v1 submitted 11 June, 2022; originally announced June 2022.

  2. arXiv:2012.04809  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Semi-Supervised Off Policy Reinforcement Learning

    Authors: Aaron Sonabend-W, Nilanjana Laha, Ashwin N. Ananthakrishnan, Tianxi Cai, Rajarshi Mukherjee

    Abstract: Reinforcement learning (RL) has shown great success in estimating sequential treatment strategies which take into account patient heterogeneity. However, health-outcome information, which is used as the reward for reinforcement learning methods, is often not well coded but rather embedded in clinical notes. Extracting precise outcome information is a resource intensive task, so most of the availab… ▽ More

    Submitted 22 February, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

  3. arXiv:2006.13189  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Expert-Supervised Reinforcement Learning for Offline Policy Learning and Evaluation

    Authors: Aaron Sonabend-W, Junwei Lu, Leo A. Celi, Tianxi Cai, Peter Szolovits

    Abstract: Offline Reinforcement Learning (RL) is a promising approach for learning optimal policies in environments where direct exploration is expensive or unfeasible. However, the adoption of such policies in practice is often challenging, as they are hard to interpret within the application context, and lack measures of uncertainty for the learned policy value and its decisions. To overcome these issues,… ▽ More

    Submitted 30 October, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: to be published in NeurIPS 2020