Melrose Roderick is qualified to endorse.
Projected Off-Policy Q-Learning (POP-QL) for Stabilizing Offline Reinforcement Learning
Melrose Roderick: | Is registered as an author of this paper. Can endorse for cs.AI, cs.LG, stat.ML. (why?) |
Gaurav Manek, Felix Berkenkamp and J. Zico Kolter are not registered as owners of this paper. (why?)