Zeqiu Wu is qualified to endorse.
Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
Zeqiu Wu: | Is registered as an author of this paper. Can endorse for cs.CL. (why?) |
Yushi Hu, Weijia Shi, Nouha Dziri, Alane Suhr, Prithviraj Ammanabrolu, Noah A. Smith, Mari Ostendorf and Hannaneh Hajishirzi are not registered as owners of this paper. (why?)