Skip to main content

Showing 1–1 of 1 results for author: Van Landeghem, S

.
  1. arXiv:2212.09255  [pdf, other

    cs.CL

    Multi hash embeddings in spaCy

    Authors: Lester James Miranda, Ákos Kádár, Adriane Boyd, Sofie Van Landeghem, Anders Søgaard, Matthew Honnibal

    Abstract: The distributed representation of symbols is one of the key technologies in machine learning systems today, playing a pivotal role in modern natural language processing. Traditional word embeddings associate a separate vector with each word. While this approach is simple and leads to good performance, it requires a lot of memory for representing a large vocabulary. To reduce the memory footprint,… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

    ACM Class: I.2.7