Skip to main content

Showing 1–1 of 1 results for author: Chaimalas, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.20304  [pdf, other

    cs.CL cs.LG

    Group Robust Preference Optimization in Reward-free RLHF

    Authors: Shyam Sundhar Ramesh, Yifan Hu, Iason Chaimalas, Viraj Mehta, Pier Giuseppe Sessa, Haitham Bou Ammar, Ilija Bogunovic

    Abstract: Adapting large language models (LLMs) for specific tasks usually involves fine-tuning through reinforcement learning with human feedback (RLHF) on preference data. While these data often come from diverse labelers' groups (e.g., different demographics, ethnicities, company teams, etc.), traditional RLHF approaches adopt a "one-size-fits-all" approach, i.e., they indiscriminately assume and optimiz… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Preprint