Skip to main content

Showing 1–3 of 3 results for author: Ghaemmaghami, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.15881  [pdf, other

    cs.LG cs.AI

    Enhancing Cross-Category Learning in Recommendation Systems with Multi-Layer Embedding Training

    Authors: Zihao Deng, Benjamin Ghaemmaghami, Ashish Kumar Singh, Benjamin Cho, Leo Orshansky, Mattan Erez, Michael Orshansky

    Abstract: Modern DNN-based recommendation systems rely on training-derived embeddings of sparse features. Input sparsity makes obtaining high-quality embeddings for rarely-occurring categories harder as their representations are updated infrequently. We demonstrate a training-time technique to produce superior embeddings via effective cross-category learning and theoretically explain its surprising effectiv… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: This is the preprint of our paper accepted at ACML 2023

  2. arXiv:2203.15837  [pdf

    cs.IR cs.AI cs.DC cs.LG

    Learning to Collide: Recommendation System Model Compression with Learned Hash Functions

    Authors: Benjamin Ghaemmaghami, Mustafa Ozdal, Rakesh Komuravelli, Dmitriy Korchev, Dheevatsa Mudigere, Krishnakumar Nair, Maxim Naumov

    Abstract: A key characteristic of deep recommendation models is the immense memory requirements of their embedding tables. These embedding tables can often reach hundreds of gigabytes which increases hardware requirements and training cost. A common technique to reduce model size is to hash all of the categorical variable identifiers (ids) into a smaller space. This hashing reduces the number of unique repr… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

  3. arXiv:2006.05623  [pdf, other

    cs.LG stat.ML

    Training with Multi-Layer Embeddings for Model Reduction

    Authors: Benjamin Ghaemmaghami, Zihao Deng, Benjamin Cho, Leo Orshansky, Ashish Kumar Singh, Mattan Erez, Michael Orshansky

    Abstract: Modern recommendation systems rely on real-valued embeddings of categorical features. Increasing the dimension of embedding vectors improves model accuracy but comes at a high cost to model size. We introduce a multi-layer embedding training (MLET) architecture that trains embeddings via a sequence of linear layers to derive superior embedding accuracy vs. model size trade-off. Our approach is f… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

    Comments: 10 pages, 3 figures