We gratefully acknowledge support from
the Simons Foundation and member institutions.

Samuel Dooley is qualified to endorse.

Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive

Samuel Dooley: Is registered as an author of this paper.
Can endorse for cs.AI, cs.CL, cs.CY, cs.GT, cs.LG. (why?)

Arka Pal, Deep Karkhanis, Manley Roberts, Siddartha Naidu and Colin White are not registered as owners of this paper. (why?)