Zhirui Chen is qualified to endorse.
Order-Optimal Instance-Dependent Bounds for Offline Reinforcement Learning with Preference Feedback
Zhirui Chen: | Is registered as an author of this paper. Can endorse for cs.AI, cs.CV, cs.IT, cs.LG, math.IT, stat.ML. (why?) |
Vincent Y. F. Tan is not registered as an owner of this paper. (why?)