Showing 1–1 of 1 results for author: Benarroch-Lelong, S

  1. arXiv:2312.00949 [pdf, other]

    cs.CL math.OC

    Hyperparameter Optimization for Large Language Model Instruction-Tuning

    Authors: Christophe Tribes, Sacha Benarroch-Lelong, Peng Lu, Ivan Kobyzev

    Abstract: The fine-tuning of Large Language Models (LLMs) has enabled them to recently achieve milestones in natural language processing applications. The emergence of ever larger LLMs has paved the way for more efficient fine-tuning methods. Among these, the Low-Rank Adaptation (LoRA) method keeps most of the weights of the pre-trained LLM frozen while introducing a low-rank decomposition of the weight mat…

    Submitted 30 January, 2024; v1 submitted 1 December, 2023; originally announced December 2023.