Skip to main content

Showing 1–6 of 6 results for author: Ramanath, R

.
  1. arXiv:2108.05839  [pdf, ps, other

    cs.LG cs.AI cs.CV stat.ML

    Logit Attenuating Weight Normalization

    Authors: Aman Gupta, Rohan Ramanath, Jun Shi, Anika Ramachandran, Sirou Zhou, Mingzhou Zhou, S. Sathiya Keerthi

    Abstract: Over-parameterized deep networks trained using gradient-based optimizers are a popular choice for solving classification and ranking problems. Without appropriately tuned $\ell_2$ regularization or weight decay, such networks have the tendency to make output scores (logits) and network weights large, causing training loss to become too small and the network to lose its adaptivity (ability to move… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

    Comments: 23 pages

  2. arXiv:2103.05277  [pdf, ps, other

    cs.AI cs.LG stat.ML

    Efficient Vertex-Oriented Polytopic Projection for Web-scale Applications

    Authors: Rohan Ramanath, S. Sathiya Keerthi, Yao Pan, Konstantin Salomatin, Kinjal Basu

    Abstract: We consider applications involving a large set of instances of projecting points to polytopes. We develop an intuition guided by theoretical and empirical analysis to show that when these instances follow certain structures, a large majority of the projections lie on vertices of the polytopes. To do these projections efficiently we derive a vertex-oriented incremental algorithm to project a point… ▽ More

    Submitted 6 January, 2022; v1 submitted 9 March, 2021; originally announced March 2021.

    ACM Class: G.1.6; I.2.11

  3. arXiv:2010.05154  [pdf, other

    cs.LG cs.AI stat.ML

    Lambda Learner: Fast Incremental Learning on Data Streams

    Authors: Rohan Ramanath, Konstantin Salomatin, Jeffrey D. Gee, Kirill Talanine, Onkar Dalal, Gungor Polatkan, Sara Smoot, Deepak Kumar

    Abstract: One of the most well-established applications of machine learning is in deciding what content to show website visitors. When observation data comes from high-velocity, user-generated data streams, machine learning methods perform a balancing act between model complexity, training time, and computational costs. Furthermore, when model freshness is critical, the training of models becomes time-const… ▽ More

    Submitted 28 June, 2021; v1 submitted 11 October, 2020; originally announced October 2020.

    Journal ref: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

  4. arXiv:1904.02874  [pdf, other

    cs.LG stat.ML

    An Attentive Survey of Attention Models

    Authors: Sneha Chaudhari, Varun Mithal, Gungor Polatkan, Rohan Ramanath

    Abstract: Attention Model has now become an important concept in neural networks that has been researched within diverse application domains. This survey provides a structured and comprehensive overview of the developments in modeling attention. In particular, we propose a taxonomy which groups existing techniques into coherent categories. We review salient neural architectures in which attention has been i… ▽ More

    Submitted 12 July, 2021; v1 submitted 5 April, 2019; originally announced April 2019.

    Comments: accepted to Transactions on Intelligent Systems and Technology(TIST); 33 pages

  5. arXiv:1809.06473  [pdf, other

    cs.LG cs.AI stat.ML

    Towards Deep and Representation Learning for Talent Search at LinkedIn

    Authors: Rohan Ramanath, Hakan Inan, Gungor Polatkan, Bo Hu, Qi Guo, Cagri Ozcaglar, Xianren Wu, Krishnaram Kenthapadi, Sahin Cem Geyik

    Abstract: Talent search and recommendation systems at LinkedIn strive to match the potential candidates to the hiring needs of a recruiter or a hiring manager expressed in terms of a search query or a job posting. Recent work in this domain has mainly focused on linear models, which do not take complex relationships between features into account, as well as ensemble tree models, which introduce non-linearit… ▽ More

    Submitted 17 September, 2018; originally announced September 2018.

    Comments: This paper has been accepted for publication in ACM CIKM 2018

  6. arXiv:1806.02281  [pdf, other

    cs.IR cs.AI cs.SI

    Deploying Deep Ranking Models for Search Verticals

    Authors: Rohan Ramanath, Gungor Polatkan, Liqin Xu, Harold Lee, Bo Hu, Shan Zhou

    Abstract: In this paper, we present an architecture executing a complex machine learning model such as a neural network capturing semantic similarity between a query and a document; and deploy to a real-world production system serving 500M+users. We present the challenges that arise in a real-world system and how we solve them. We demonstrate that our architecture provides competitive modeling capability wi… ▽ More

    Submitted 6 June, 2018; originally announced June 2018.

    Comments: Published at the SysML Conference - 2018