Skip to main content

Showing 1–1 of 1 results for author: Pumma, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2211.05239  [pdf, other

    cs.LG cs.DC cs.IR cs.PF

    RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure

    Authors: Mark Zhao, Dhruv Choudhary, Devashish Tyagi, Ajay Somani, Max Kaplan, Sung-Han Lin, Sarunya Pumma, Jongsoo Park, Aarti Basant, Niket Agarwal, Carole-Jean Wu, Christos Kozyrakis

    Abstract: We present RecD (Recommendation Deduplication), a suite of end-to-end infrastructure optimizations across the Deep Learning Recommendation Model (DLRM) training pipeline. RecD addresses immense storage, preprocessing, and training overheads caused by feature duplication inherent in industry-scale DLRM training datasets. Feature duplication arises because DLRM datasets are generated from interactio… ▽ More

    Submitted 1 May, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

    Comments: Published in the Proceedings of the Sixth Conference on Machine Learning and Systems (MLSys 2023)