Skip to main content

Showing 1–17 of 17 results for author: Rendle, S

.
  1. arXiv:2310.15454  [pdf, other

    cs.LG cs.CR stat.ML

    Private Learning with Public Features

    Authors: Walid Krichene, Nicolas Mayoraz, Steffen Rendle, Shuang Song, Abhradeep Thakurta, Li Zhang

    Abstract: We study a class of private learning problems in which the data is a join of private and public features. This is often the case in private personalization tasks such as recommendation or ad prediction, in which features related to individuals are sensitive, while features related to items (the movies or songs to be recommended, or the ads to be shown to users) are publicly available and do not re… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  2. arXiv:2306.04133  [pdf, ps, other

    cs.IR cs.LG

    Answering Compositional Queries with Set-Theoretic Embeddings

    Authors: Shib Dasgupta, Andrew McCallum, Steffen Rendle, Li Zhang

    Abstract: The need to compactly and robustly represent item-attribute relations arises in many important tasks, such as faceted browsing and recommendation systems. A popular machine learning approach for this task denotes that an item has an attribute by a high dot-product between vectors for the item and attribute -- a representation that is not only dense, but also tends to correct noisy and incomplete d… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

  3. arXiv:2112.02194  [pdf, other

    cs.LG cs.DC

    ALX: Large Scale Matrix Factorization on TPUs

    Authors: Harsh Mehta, Steffen Rendle, Walid Krichene, Li Zhang

    Abstract: We present ALX, an open-source library for distributed matrix factorization using Alternating Least Squares, written in JAX. Our design allows for efficient use of the TPU architecture and scales well to matrix factorization problems of O(B) rows/columns by scaling the number of available TPU cores. In order to spur future research on large scale matrix factorization methods and to illustrate the… ▽ More

    Submitted 29 March, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

  4. arXiv:2110.14044  [pdf, other

    cs.LG cs.IR

    iALS++: Speeding up Matrix Factorization with Subspace Optimization

    Authors: Steffen Rendle, Walid Krichene, Li Zhang, Yehuda Koren

    Abstract: iALS is a popular algorithm for learning matrix factorization models from implicit feedback with alternating least squares. This algorithm was invented over a decade ago but still shows competitive quality compared to recent approaches like VAE, EASE, SLIM, or NCF. Due to a computational trick that avoids negative sampling, iALS is very efficient especially for large item catalogues. However, iALS… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

  5. arXiv:2110.14037  [pdf, other

    cs.IR cs.LG

    Revisiting the Performance of iALS on Item Recommendation Benchmarks

    Authors: Steffen Rendle, Walid Krichene, Li Zhang, Yehuda Koren

    Abstract: Matrix factorization learned by implicit alternating least squares (iALS) is a popular baseline in recommender system research publications. iALS is known to be one of the most computationally efficient and scalable collaborative filtering methods. However, recent studies suggest that its prediction quality is not competitive with the current state of the art, in particular autoencoders and other… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

  6. arXiv:2107.09802  [pdf, other

    cs.LG cs.CR stat.ML

    Private Alternating Least Squares: Practical Private Matrix Completion with Tighter Rates

    Authors: Steve Chien, Prateek Jain, Walid Krichene, Steffen Rendle, Shuang Song, Abhradeep Thakurta, Li Zhang

    Abstract: We study the problem of differentially private (DP) matrix completion under user-level privacy. We design a joint differentially private variant of the popular Alternating-Least-Squares (ALS) method that achieves: i) (nearly) optimal sample complexity for matrix completion (in terms of number of items, users), and ii) the best known privacy/utility trade-off both theoretically, as well as on bench… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

  7. arXiv:2101.08769  [pdf, ps, other

    cs.IR cs.LG

    Item Recommendation from Implicit Feedback

    Authors: Steffen Rendle

    Abstract: The task of item recommendation is to select the best items for a user from a large catalogue of items. Item recommenders are commonly trained from implicit feedback which consists of past actions that are positive only. Core challenges of item recommendation are (1) how to formulate a training objective from implicit feedback and (2) how to efficiently train models over a large item catalogue. Th… ▽ More

    Submitted 21 January, 2021; originally announced January 2021.

  8. arXiv:2008.02930  [pdf, other

    cs.LG cs.IR stat.ML

    Zero-Shot Heterogeneous Transfer Learning from Recommender Systems to Cold-Start Search Retrieval

    Authors: Tao Wu, Ellie Ka-In Chio, Heng-Tze Cheng, Yu Du, Steffen Rendle, Dima Kuzmin, Ritesh Agarwal, Li Zhang, John Anderson, Sarvjeet Singh, Tushar Chandra, Ed H. Chi, Wen Li, Ankit Kumar, Xiang Ma, Alex Soares, Nitin **dal, Pei Cao

    Abstract: Many recent advances in neural information retrieval models, which predict top-K items given a query, learn directly from a large training set of (query, item) pairs. However, they are often insufficient when there are many previously unseen (query, item) combinations, often referred to as the cold start problem. Furthermore, the search system can be biased towards items that are frequently shown… ▽ More

    Submitted 18 August, 2020; v1 submitted 6 August, 2020; originally announced August 2020.

    Comments: Accepted at CIKM 2020

  9. arXiv:2005.09683  [pdf, other

    cs.IR cs.LG stat.ML

    Neural Collaborative Filtering vs. Matrix Factorization Revisited

    Authors: Steffen Rendle, Walid Krichene, Li Zhang, John Anderson

    Abstract: Embedding based models have been the state of the art in collaborative filtering for over a decade. Traditionally, the dot product or higher order equivalents have been used to combine two or more embeddings, e.g., most notably in matrix factorization. In recent years, it was suggested to replace the dot product with a learned similarity e.g. using a multilayer perceptron (MLP). This approach is o… ▽ More

    Submitted 1 June, 2020; v1 submitted 19 May, 2020; originally announced May 2020.

  10. arXiv:2002.04723  [pdf, other

    cs.LG cs.CL stat.ML

    Superbloom: Bloom filter meets Transformer

    Authors: John Anderson, Qingqing Huang, Walid Krichene, Steffen Rendle, Li Zhang

    Abstract: We extend the idea of word pieces in natural language models to machine learning tasks on opaque ids. This is achieved by applying hash functions to map each id to multiple hash tokens in a much smaller space, similarly to a Bloom filter. We show that by applying a multi-layer Transformer to these Bloom filter digests, we are able to obtain models with high accuracy. They outperform models of a si… ▽ More

    Submitted 11 February, 2020; originally announced February 2020.

  11. arXiv:1912.02263  [pdf, other

    cs.IR cs.LG

    Evaluation Metrics for Item Recommendation under Sampling

    Authors: Steffen Rendle

    Abstract: The task of item recommendation requires ranking a large catalogue of items given a context. Item recommendation algorithms are evaluated using ranking metrics that depend on the positions of relevant items. To speed up the computation of metrics, recent work often uses sampled metrics where only a smaller set of random items and the relevant items are ranked. This paper investigates sampled metri… ▽ More

    Submitted 4 December, 2019; originally announced December 2019.

  12. arXiv:1905.01395  [pdf, other

    cs.IR cs.LG

    On the Difficulty of Evaluating Baselines: A Study on Recommender Systems

    Authors: Steffen Rendle, Li Zhang, Yehuda Koren

    Abstract: Numerical evaluations with comparisons to baselines play a central role when judging research in recommender systems. In this paper, we show that running baselines properly is difficult. We demonstrate this issue on two extensively studied datasets. First, we show that results for baselines that have been used in numerous publications over the past five years for the Movielens 10M benchmark are su… ▽ More

    Submitted 3 May, 2019; originally announced May 2019.

  13. arXiv:1807.07187  [pdf, other

    stat.ML cs.CL cs.LG

    Efficient Training on Very Large Corpora via Gramian Estimation

    Authors: Walid Krichene, Nicolas Mayoraz, Steffen Rendle, Li Zhang, Xinyang Yi, Lichan Hong, Ed Chi, John Anderson

    Abstract: We study the problem of learning similarity functions over very large corpora using neural network embedding models. These models are typically trained using SGD with sampling of random observed and unobserved pairs, with a number of samples that grows quadratically with the corpus size, making it expensive to scale to very large corpora. We propose new efficient methods to train these models with… ▽ More

    Submitted 18 July, 2018; originally announced July 2018.

  14. arXiv:1712.00527  [pdf, other

    cs.LG

    Adaptive Sampled Softmax with Kernel Based Sampling

    Authors: Guy Blanc, Steffen Rendle

    Abstract: Softmax is the most commonly used output function for multiclass problems and is widely used in areas such as vision, natural language processing, and recommendation. A softmax model has linear costs in the number of classes which makes it too expensive for many real-world problems. A common approach to speed up training involves sampling only some of the classes at each training step. It is known… ▽ More

    Submitted 1 August, 2018; v1 submitted 1 December, 2017; originally announced December 2017.

  15. Graph Based Relational Features for Collective Classification

    Authors: Immanuel Bayer, Uwe Nagel, Steffen Rendle

    Abstract: Statistical Relational Learning (SRL) methods have shown that classification accuracy can be improved by integrating relations between samples. Techniques such as iterative classification or relaxation labeling achieve this by propagating information between related samples during the inference process. When only a few samples are labeled and connections between samples are sparse, collective infe… ▽ More

    Submitted 9 February, 2017; originally announced February 2017.

    Comments: Pacific-Asia Conference on Knowledge Discovery and Data Mining

  16. arXiv:1611.04666  [pdf, other

    cs.IR cs.LG

    A Generic Coordinate Descent Framework for Learning from Implicit Feedback

    Authors: Immanuel Bayer, Xiangnan He, Bhargav Kanagal, Steffen Rendle

    Abstract: In recent years, interest in recommender research has shifted from explicit feedback towards implicit feedback data. A diversity of complex models has been proposed for a wide variety of applications. Despite this, learning from implicit feedback is still computationally challenging. So far, most work relies on stochastic gradient descent (SGD) solvers which are easy to derive, but in practice cha… ▽ More

    Submitted 14 November, 2016; originally announced November 2016.

  17. arXiv:1205.2618  [pdf

    cs.IR cs.LG stat.ML

    BPR: Bayesian Personalized Ranking from Implicit Feedback

    Authors: Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, Lars Schmidt-Thieme

    Abstract: Item recommendation is the task of predicting a personalized ranking on a set of items (e.g. websites, movies, products). In this paper, we investigate the most common scenario with implicit feedback (e.g. clicks, purchases). There are many methods for item recommendation from implicit feedback like matrix factorization (MF) or adaptive knearest-neighbor (kNN). Even though these methods are design… ▽ More

    Submitted 9 May, 2012; originally announced May 2012.

    Comments: Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)

    Report number: UAI-P-2009-PG-452-461