Yang Yu is qualified to endorse.
Language Model Self-improvement by Reinforcement Learning Contemplation
Yang Yu: | Is registered as an author of this paper. Can endorse for cs.AI, cs.CL, cs.IR, cs.LG, cs.MA, cs.NE, stat.ML. (why?) |
**g-Cheng Pang, Pengyuan Wang, Kaiyuan Li, Xiong-Hui Chen, Jiacheng Xu and Zongzhang Zhang are not registered as owners of this paper. (why?)