Showing 1–1 of 1 results for author: Gidron, J

  1. arXiv:2210.03588

    cs.CL

    Understanding Transformer Memorization Recall Through Idioms

    Authors: Adi Haviv, Ido Cohen, Jacob Gidron, Roei Schuster, Yoav Goldberg, Mor Geva

    Abstract: To produce accurate predictions, language models (LMs) must balance between generalization and memorization. Yet, little is known about the mechanism by which transformer LMs employ their memorization capacity. When does a model decide to output a memorized phrase, and how is this phrase then retrieved from memory? In this work, we offer the first methodological framework for probing and character…

    Submitted 13 February, 2023; v1 submitted 7 October, 2022; originally announced October 2022.