Skip to main content

Showing 1–4 of 4 results for author: Dilipkumar, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2007.14523  [pdf, other

    cs.SI cs.LG stat.ML

    Model Size Reduction Using Frequency Based Double Hashing for Recommender Systems

    Authors: Cao** Zhang, Yicun Liu, Yuanpu Xie, Sofia Ira Ktena, Alykhan Tejani, Akshay Gupta, Pranay Kumar Myana, Deepak Dilipkumar, Suvadip Paul, Ikuhiro Ihara, Prasang Upadhyaya, Ferenc Huszar, Wenzhe Shi

    Abstract: Deep Neural Networks (DNNs) with sparse input features have been widely used in recommender systems in industry. These models have large memory requirements and need a huge amount of training data. The large model size usually entails a cost, in the range of millions of dollars, for storage and communication with the inference services. In this paper, we propose a hybrid hashing method to combine… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: Paper is accepted to RecSys 2020

  2. arXiv:1907.06558  [pdf, other

    stat.ML cs.LG

    Addressing Delayed Feedback for Continuous Training with Neural Networks in CTR prediction

    Authors: Sofia Ira Ktena, Alykhan Tejani, Lucas Theis, Pranay Kumar Myana, Deepak Dilipkumar, Ferenc Huszar, Steven Yoo, Wenzhe Shi

    Abstract: One of the challenges in display advertising is that the distribution of features and click through rate (CTR) can exhibit large shifts over time due to seasonality, changes to ad campaigns and other factors. The predominant strategy to keep up with these shifts is to train predictive models continuously, on fresh data, in order to prevent them from becoming stale. However, in many ad systems posi… ▽ More

    Submitted 23 April, 2021; v1 submitted 15 July, 2019; originally announced July 2019.

    Comments: Accepted at RecSys '19

  3. arXiv:1801.03137  [pdf, other

    cs.LG cs.AI stat.ML

    Convergence Analysis of Gradient Descent Algorithms with Proportional Updates

    Authors: Igor Gitman, Deepak Dilipkumar, Ben Parr

    Abstract: The rise of deep learning in recent years has brought with it increasingly clever optimization methods to deal with complex, non-linear loss functions. These methods are often designed with convex optimization in mind, but have been shown to work well in practice even for the highly non-convex optimization associated with neural networks. However, one significant drawback of these methods when the… ▽ More

    Submitted 9 January, 2018; originally announced January 2018.

    Comments: Source code (uses TensorFlow): https://github.com/bparr/lars

  4. arXiv:1712.03280  [pdf, other

    cs.AI

    Nintendo Super Smash Bros. Melee: An "Untouchable" Agent

    Authors: Ben Parr, Deepak Dilipkumar, Yuan Liu

    Abstract: Nintendo's Super Smash Bros. Melee fighting game can be emulated on modern hardware allowing us to inspect internal memory states, such as character positions. We created an AI that avoids being hit by training using these internal memory states and outputting controller button presses. After training on a month's worth of Melee matches, our best agent learned to avoid the toughest AI built into t… ▽ More

    Submitted 8 December, 2017; originally announced December 2017.