Skip to main content

Showing 1–1 of 1 results for author: Kennedy, S J J

.
  1. arXiv:2407.00996  [pdf, other

    cs.CL cs.LG

    Can Small Language Models Learn, Unlearn, and Retain Noise Patterns?

    Authors: Nicy Scaria, Silvester John Joseph Kennedy, Deepak Subramani

    Abstract: Small Language Models (SLMs) are generally considered to be more compact versions of large language models (LLMs), typically having fewer than 7 billion parameters. This study investigates the ability of small language models to learn, retain, and subsequently eliminate noise that is typically not found on the internet, where most pretraining datasets are sourced. For this, four pre-trained SLMs w… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.