Showing 1–1 of 1 results for author: Yannam, P K R

  1. arXiv:2404.00213  [pdf, other]

    cs.CL

    Injecting New Knowledge into Large Language Models via Supervised Fine-Tuning

    Authors: Nick Mecklenburg, Yiyou Lin, Xiaoxiao Li, Daniel Holstein, Leonardo Nunes, Sara Malvar, Bruno Silva, Ranveer Chandra, Vijay Aski, Pavan Kumar Reddy Yannam, Tolga Aktas, Todd Hendry

    Abstract: In recent years, Large Language Models (LLMs) have shown remarkable performance in generating human-like text, proving to be a valuable asset across various applications. However, adapting these models to incorporate new, out-of-domain knowledge remains a challenge, particularly for facts and events that occur after the model's knowledge cutoff date. This paper investigates the effectiveness of Su…

    Submitted 2 April, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

    Comments: 16 pages; 7 figures. Updated author list