Pedro Cisneros-Velarde, Boxiang Lyu, Sanmi Koyejo... are qualified to endorse.
One Policy is Enough: Parallel Exploration with a Single Policy is Near-Optimal for Reward-Free Reinforcement Learning
Pedro Cisneros-Velarde: | Is registered as an author of this paper. Can endorse for cs.AI, cs.GT, cs.LG, cs.SI, cs.SY, eess.SY, stat.ML. (why?) |
Boxiang Lyu: | Is registered as an author of this paper. Can endorse for cs.GT, cs.IR, cs.LG, stat.ML. (why?) |
Sanmi Koyejo: | Is registered as an author of this paper. Can endorse for cs.LG, stat.ML. (why?) |
Mladen Kolar: | Is registered as an author of this paper. Can endorse for math.ST, stat.ME, stat.ML, stat.TH. (why?) |