Search | arXiv e-print repository

arXiv:2404.17187 [pdf, other]

An Explainable Deep Reinforcement Learning Model for Warfarin Maintenance Dosing Using Policy Distillation and Action Forging

Authors: Sadjad Anzabi Zadeh, W. Nick Street, Barrett W. Thomas

Abstract: Deep Reinforcement Learning is an effective tool for drug dosing for chronic condition management. However, the final protocol is generally a black box without any justification for its prescribed doses. This paper addresses this issue by proposing an explainable dosing protocol for warfarin using a Proximal Policy Optimization method combined with Policy Distillation. We introduce Action Forging… ▽ More Deep Reinforcement Learning is an effective tool for drug dosing for chronic condition management. However, the final protocol is generally a black box without any justification for its prescribed doses. This paper addresses this issue by proposing an explainable dosing protocol for warfarin using a Proximal Policy Optimization method combined with Policy Distillation. We introduce Action Forging as an effective tool to achieve explainability. Our focus is on the maintenance dosing protocol. Results show that the final model is as easy to understand and deploy as the current dosing protocols and outperforms the baseline dosing algorithms. △ Less

Submitted 26 April, 2024; originally announced April 2024.

arXiv:2202.03486 [pdf, other]

doi 10.1016/j.jbi.2022.104267

Optimizing Warfarin Dosing using Deep Reinforcement Learning

Authors: Sadjad Anzabi Zadeh, W. Nick Street, Barrett W. Thomas

Abstract: Warfarin is a widely used anticoagulant, and has a narrow therapeutic range. Dosing of warfarin should be individualized, since slight overdosing or underdosing can have catastrophic or even fatal consequences. Despite much research on warfarin dosing, current dosing protocols do not live up to expectations, especially for patients sensitive to warfarin. We propose a deep reinforcement learning-ba… ▽ More Warfarin is a widely used anticoagulant, and has a narrow therapeutic range. Dosing of warfarin should be individualized, since slight overdosing or underdosing can have catastrophic or even fatal consequences. Despite much research on warfarin dosing, current dosing protocols do not live up to expectations, especially for patients sensitive to warfarin. We propose a deep reinforcement learning-based dosing model for warfarin. To overcome the issue of relatively small sample sizes in dosing trials, we use a Pharmacokinetic/ Pharmacodynamic (PK/PD) model of warfarin to simulate dose-responses of virtual patients. Applying the proposed algorithm on virtual test patients shows that this model outperforms a set of clinically accepted dosing protocols by a wide margin. We tested the robustness of our dosing protocol on a second PK/PD model and showed that its performance is comparable to the set of baseline protocols. △ Less

Submitted 23 December, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

Comments: 32 pages (including 3 appendices)

Journal ref: Journal of Biomedical Informatics, 137 (2023) 104267

arXiv:1511.02402 [pdf, ps, other]

Max-Sum Diversification, Monotone Submodular Functions and Semi-metric Spaces

Authors: Sepehr Abbasi Zadeh, Mehrdad Ghadiri

Abstract: In many applications such as web-based search, document summarization, facility location and other applications, the results are preferable to be both representative and diversified subsets of documents. The goal of this study is to select a good "quality", bounded-size subset of a given set of items, while maintaining their diversity relative to a semi-metric distance function. This problem was f… ▽ More In many applications such as web-based search, document summarization, facility location and other applications, the results are preferable to be both representative and diversified subsets of documents. The goal of this study is to select a good "quality", bounded-size subset of a given set of items, while maintaining their diversity relative to a semi-metric distance function. This problem was first studied by Borodin et al\cite{borodin}, but a crucial property used throughout their proof is the triangle inequality. In this modified proof, we want to relax the triangle inequality and relate the approximation ratio of max-sum diversification problem to the parameter of the relaxed triangle inequality in the normal form of the problem (i.e., a uniform matroid) and also in an arbitrary matroid. △ Less

Submitted 7 November, 2015; originally announced November 2015.

Comments: This article draws heavily from arXiv:1203.6397 by other authors

Showing 1–3 of 3 results for author: Zadeh, S A