Showing 1–8 of 8 results for author: Keshavan, R

  1. arXiv:2404.00245  [pdf, other]

    cs.IR

    Aligning Large Language Models with Recommendation Knowledge

    Authors: Yuwei Cao, Nikhil Mehta, Xinyang Yi, Raghunandan Keshavan, Lukasz Heldt, Lichan Hong, Ed H. Chi, Maheswaran Sathiamoorthy

    Abstract: Large language models (LLMs) have recently been used as backbones for recommender systems. However, their performance often lags behind conventional methods in standard tasks like retrieval. We attribute this to a mismatch between LLMs' knowledge and the knowledge crucial for effective recommendations. While LLMs excel at natural language reasoning, they cannot model complex user-item interactions…

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: Accepted to the NAACL 2024 Findings

  2. arXiv:2306.08121  [pdf, other]

    cs.IR cs.LG

    Better Generalization with Semantic IDs: A Case Study in Ranking for Recommendations

    Authors: Anima Singh, Trung Vu, Nikhil Mehta, Raghunandan Keshavan, Maheswaran Sathiamoorthy, Yilin Zheng, Lichan Hong, Lukasz Heldt, Li Wei, Devansh Tandon, Ed H. Chi, Xinyang Yi

    Abstract: Randomly-hashed item ids are used ubiquitously in recommendation models. However, the learned representations from random hashing prevent generalization across similar items, causing problems of learning unseen and long-tail items, especially when the item corpus is large, power-law distributed, and evolving dynamically. In this paper, we propose using content-derived features as a replacement for ra…

    Submitted 30 May, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

  3. arXiv:2305.05065  [pdf, other]

    cs.IR cs.LG

    Recommender Systems with Generative Retrieval

    Authors: Shashank Rajput, Nikhil Mehta, Anima Singh, Raghunandan H. Keshavan, Trung Vu, Lukasz Heldt, Lichan Hong, Yi Tay, Vinh Q. Tran, Jonah Samost, Maciej Kula, Ed H. Chi, Maheswaran Sathiamoorthy

    Abstract: Modern recommender systems perform large-scale retrieval by first embedding queries and item candidates in the same unified space, followed by approximate nearest neighbor search to select top candidates given a query embedding. In this paper, we propose a novel generative retrieval approach, where the retrieval model autoregressively decodes the identifiers of the target candidates. To that end,…

    Submitted 3 November, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: To appear in The 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  4. A Gradient Descent Algorithm on the Grassman Manifold for Matrix Completion

    Authors: Raghunandan H. Keshavan, Sewoong Oh

    Abstract: We consider the problem of reconstructing a low-rank matrix from a small subset of its entries. In this paper, we describe the implementation of an efficient algorithm called OptSpace, based on singular value decomposition followed by local manifold optimization, for solving the low-rank matrix completion problem. It has been shown that if the number of revealed entries is large enough, the outp…

    Submitted 3 November, 2009; v1 submitted 27 October, 2009; originally announced October 2009.

    Comments: 26 pages, 15 figures

  5. arXiv:0910.0921  [pdf, ps, other]

    cs.LG math.NA

    Low-rank Matrix Completion with Noisy Observations: a Quantitative Comparison

    Authors: Raghunandan H. Keshavan, Andrea Montanari, Sewoong Oh

    Abstract: We consider a problem of significant practical importance, namely, the reconstruction of a low-rank data matrix from a small subset of its entries. This problem appears in many areas such as collaborative filtering, computer vision and wireless sensor networks. In this paper, we focus on the matrix completion problem in the case when the observed samples are corrupted by noise. We compare the pe…

    Submitted 3 November, 2009; v1 submitted 6 October, 2009; originally announced October 2009.

    Comments: 7 pages, 7 figures, 47th Allerton Conference on Communication Control and Computing, 2009, invited paper

  6. arXiv:0906.2027  [pdf, ps, other]

    cs.LG stat.ML

    Matrix Completion from Noisy Entries

    Authors: Raghunandan H. Keshavan, Andrea Montanari, Sewoong Oh

    Abstract: Given a matrix M of low-rank, we consider the problem of reconstructing it from noisy observations of a small, random subset of its entries. The problem arises in a variety of applications, from collaborative filtering (the `Netflix problem') to structure-from-motion and positioning. We study a low complexity algorithm introduced by Keshavan et al. (2009), based on a combination of spectral techniq…

    Submitted 9 April, 2012; v1 submitted 10 June, 2009; originally announced June 2009.

    Comments: 22 pages, 3 figures

  7. arXiv:0901.3150  [pdf, ps, other]

    cs.LG stat.ML

    Matrix Completion from a Few Entries

    Authors: Raghunandan H. Keshavan, Andrea Montanari, Sewoong Oh

    Abstract: Let M be a random (alpha n) x n matrix of rank r<<n, and assume that a uniformly random subset E of its entries is observed. We describe an efficient algorithm that reconstructs M from |E| = O(rn) observed entries with relative root mean square error RMSE <= C(rn/|E|)^0.5 . Further, if r=O(1), M can be reconstructed exactly from |E| = O(n log(n)) entries. These results apply beyond random matric…

    Submitted 17 September, 2009; v1 submitted 20 January, 2009; originally announced January 2009.

    Comments: 30 pages, 1 figure, journal version (v1, v2: Conference version ISIT 2009)

  8. arXiv:0812.2599  [pdf, ps, other]

    cs.DS

    Learning Low Rank Matrices from O(n) Entries

    Authors: Raghunandan H. Keshavan, Andrea Montanari, Sewoong Oh

    Abstract: How many random entries of an n by m, rank r matrix are necessary to reconstruct the matrix within an accuracy d? We address this question in the case of a random matrix with bounded rank, whereby the observed entries are chosen uniformly at random. We prove that, for any d>0, C(r,d)n observations are sufficient. Finally we discuss the question of reconstructing the matrix efficiently, and demon…

    Submitted 14 December, 2008; originally announced December 2008.

    Comments: 8 pages, 11 figures, Forty-sixth Allerton Conference on Communication, Control and Computing, invited paper