Showing 1–1 of 1 results for author: Ziheng, Z

  1. arXiv:2312.05503

    cs.CL cs.AI cs.LG

    Aligner: One Global Token is Worth Millions of Parameters When Aligning Large Language Models

    Authors: Zhou Ziheng, Yingnian Wu, Song-Chun Zhu, Demetri Terzopoulos

    Abstract: We introduce Aligner, a novel Parameter-Efficient Fine-Tuning (PEFT) method for aligning multi-billion-parameter-sized Large Language Models (LLMs). Aligner employs a unique design that constructs a globally shared set of tunable tokens that modify the attention of every layer. Remarkably, with this method, even when using one token accounting for a mere 5,000 parameters, Aligner can still perform…

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: 81 pages, 77 figures

    ACM Class: I.2; I.2.6; I.2.7