Melrose Roderick is qualified to endorse.

Projected Off-Policy Q-Learning (POP-QL) for Stabilizing Offline Reinforcement Learning

Melrose Roderick is registered as an author of this paper and can endorse for cs.AI, cs.LG, and stat.ML.

Gaurav Manek, Felix Berkenkamp and J. Zico Kolter are not registered as owners of this paper.