We gratefully acknowledge support from
the Simons Foundation and member institutions.

Zeqiu Wu is qualified to endorse.

Fine-Grained Human Feedback Gives Better Rewards for Language Model Training

Zeqiu Wu: Is registered as an author of this paper.
Can endorse for cs.CL. (why?)

Yushi Hu, Weijia Shi, Nouha Dziri, Alane Suhr, Prithviraj Ammanabrolu, Noah A. Smith, Mari Ostendorf and Hannaneh Hajishirzi are not registered as owners of this paper. (why?)