Skip to main content

Showing 1–1 of 1 results for author: Suppattarachai, N

.
  1. arXiv:2401.00788  [pdf, other

    cs.CL cs.AI cs.SE

    Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models

    Authors: Terry Yue Zhuo, Armel Zebaze, Nitchakarn Suppattarachai, Leandro von Werra, Harm de Vries, Qian Liu, Niklas Muennighoff

    Abstract: The high cost of full-parameter fine-tuning (FFT) of Large Language Models (LLMs) has led to a series of parameter-efficient fine-tuning (PEFT) methods. However, it remains unclear which methods provide the best cost-performance trade-off at different model scales. We introduce Astraios, a suite of 28 instruction-tuned OctoCoder models using 7 tuning methods and 4 model sizes up to 16 billion para… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

    Comments: 25 pages (12 main), 19 figures, 8 tables