We gratefully acknowledge support from
the Simons Foundation and member institutions.

Nasrin Sultana is qualified to endorse.

Sample-Efficient, Exploration-Based Policy Optimisation for Routing Problems

Nasrin Sultana: Is registered as an author of this paper.
Can endorse for cs.AI, cs.LG. (why?)

Jeffrey Chan, Tabinda Sarwar and A. K. Qin are not registered as owners of this paper. (why?)