We gratefully acknowledge support from
the Simons Foundation and member institutions.

Daniele Calandriello and Michal Valko are qualified to endorse.

Human Alignment of Large Language Models through Online Preference Optimisation

Daniele Calandriello: Is registered as an author of this paper.
Can endorse for cs.LG, stat.ML. (why?)
Michal Valko: Is registered as an author of this paper.
Can endorse for cs.AI, cs.CV, cs.DS, cs.GT, cs.LG, cs.MA, cs.SI, stat.AP, stat.CO, stat.ML. (why?)

Daniel Guo, Remi Munos, Mark Rowland, Yunhao Tang, Bernardo Avila Pires, Pierre Harvey Richemond, Charline Le Lan, Tianqi Liu, Rishabh Joshi, Zeyu Zheng and Bilal Piot are not registered as owners of this paper. (why?)