Songyang Gao is qualified to endorse.

Secrets of RLHF in Large Language Models Part I: PPO

Songyang Gao is registered as an author of this paper and can endorse for cs.AI, cs.CL, and cs.LG.

Rui Zheng, Shihan Dou, Yuan Hua, Wei Shen, Binghai Wang, Yan Liu, Senjie **, Qin Liu, Yuhao Zhou, Limao Xiong, Lu Chen, Zhiheng Xi, Nuo Xu, Wenbin Lai, Minghao Zhu, Cheng Chang, Zhangyue Yin, Rongxiang Weng, Wensen Cheng, Haoran Huang, Tianxiang Sun, Hang Yan, Tao Gui, Qi Zhang, Xipeng Qiu and Xuan**g Huang are not registered as owners of this paper.