Showing 1–22 of 22 results for author: Krichene, W

  1. arXiv:2401.15246  [pdf, other]

    cs.LG cs.CR cs.IR

    Training Differentially Private Ad Prediction Models with Semi-Sensitive Features

    Authors: Lynn Chua, Qiliang Cui, Badih Ghazi, Charlie Harrison, Pritish Kamath, Walid Krichene, Ravi Kumar, Pasin Manurangsi, Krishna Giri Narra, Amer Sinha, Avinash Varadarajan, Chiyuan Zhang

    Abstract: Motivated by problems arising in digital advertising, we introduce the task of training differentially private (DP) machine learning models with semi-sensitive features. In this setting, a subset of the features is known to the attacker (and thus need not be protected) while the remaining features as well as the label are unknown to the attacker and should be protected by the DP guarantee. This ta…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 7 pages, 4 figures

  2. arXiv:2310.15454  [pdf, other]

    cs.LG cs.CR stat.ML

    Private Learning with Public Features

    Authors: Walid Krichene, Nicolas Mayoraz, Steffen Rendle, Shuang Song, Abhradeep Thakurta, Li Zhang

    Abstract: We study a class of private learning problems in which the data is a join of private and public features. This is often the case in private personalization tasks such as recommendation or ad prediction, in which features related to individuals are sensitive, while features related to items (the movies or songs to be recommended, or the ads to be shown to users) are publicly available and do not re…

    Submitted 23 October, 2023; originally announced October 2023.

  3. arXiv:2309.11516  [pdf, other]

    cs.IR cs.CR cs.LG

    Private Matrix Factorization with Public Item Features

    Authors: Mihaela Curmei, Walid Krichene, Li Zhang, Mukund Sundararajan

    Abstract: We consider the problem of training private recommendation models with access to public item features. Training with Differential Privacy (DP) offers strong privacy guarantees, at the expense of loss in recommendation quality. We show that incorporating public item features during training can help mitigate this loss in quality. We propose a general approach based on collective matrix factorizatio…

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: Presented at ACM Recsys 2023

  4. arXiv:2302.07975  [pdf, other]

    cs.LG cs.CR stat.ML

    Multi-Task Differential Privacy Under Distribution Skew

    Authors: Walid Krichene, Prateek Jain, Shuang Song, Mukund Sundararajan, Abhradeep Thakurta, Li Zhang

    Abstract: We study the problem of multi-task learning under user-level differential privacy, in which $n$ users contribute data to $m$ tasks, each involving a subset of users. One important aspect of the problem, that can significantly impact quality, is the distribution skew among tasks. Certain tasks may have much fewer data samples than others, making them more susceptible to the noise added for privacy.…

    Submitted 15 February, 2023; originally announced February 2023.

  5. arXiv:2211.13403  [pdf, other]

    cs.LG cs.CR cs.CV

    Differentially Private Image Classification from Features

    Authors: Harsh Mehta, Walid Krichene, Abhradeep Thakurta, Alexey Kurakin, Ashok Cutkosky

    Abstract: Leveraging transfer learning has recently been shown to be an effective strategy for training large models with Differential Privacy (DP). Moreover, somewhat surprisingly, recent works have found that privately training just the last layer of a pre-trained model provides the best utility with DP. While past studies largely rely on algorithms like DP-SGD for training large models, in the specific c…

    Submitted 23 November, 2022; originally announced November 2022.

  6. arXiv:2202.09480  [pdf, other]

    cs.LG cs.AI econ.GN

    Reciprocity in Machine Learning

    Authors: Mukund Sundararajan, Walid Krichene

    Abstract: Machine learning is pervasive. It powers recommender systems such as Spotify, Instagram and YouTube, and health-care systems via models that predict sleep patterns, or the risk of disease. Individuals contribute data to these models and benefit from them. Are these contributions (outflows of influence) and benefits (inflows of influence) reciprocal? We propose measures of outflows, inflows and rec…

    Submitted 18 February, 2022; originally announced February 2022.

  7. arXiv:2112.02194  [pdf, other]

    cs.LG cs.DC

    ALX: Large Scale Matrix Factorization on TPUs

    Authors: Harsh Mehta, Steffen Rendle, Walid Krichene, Li Zhang

    Abstract: We present ALX, an open-source library for distributed matrix factorization using Alternating Least Squares, written in JAX. Our design allows for efficient use of the TPU architecture and scales well to matrix factorization problems of O(B) rows/columns by scaling the number of available TPU cores. In order to spur future research on large scale matrix factorization methods and to illustrate the…

    Submitted 29 March, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

  8. arXiv:2110.14044  [pdf, other]

    cs.LG cs.IR

    iALS++: Speeding up Matrix Factorization with Subspace Optimization

    Authors: Steffen Rendle, Walid Krichene, Li Zhang, Yehuda Koren

    Abstract: iALS is a popular algorithm for learning matrix factorization models from implicit feedback with alternating least squares. This algorithm was invented over a decade ago but still shows competitive quality compared to recent approaches like VAE, EASE, SLIM, or NCF. Due to a computational trick that avoids negative sampling, iALS is very efficient especially for large item catalogues. However, iALS…

    Submitted 26 October, 2021; originally announced October 2021.

  9. arXiv:2110.14037  [pdf, other]

    cs.IR cs.LG

    Revisiting the Performance of iALS on Item Recommendation Benchmarks

    Authors: Steffen Rendle, Walid Krichene, Li Zhang, Yehuda Koren

    Abstract: Matrix factorization learned by implicit alternating least squares (iALS) is a popular baseline in recommender system research publications. iALS is known to be one of the most computationally efficient and scalable collaborative filtering methods. However, recent studies suggest that its prediction quality is not competitive with the current state of the art, in particular autoencoders and other…

    Submitted 26 October, 2021; originally announced October 2021.

  10. arXiv:2107.09802  [pdf, other]

    cs.LG cs.CR stat.ML

    Private Alternating Least Squares: Practical Private Matrix Completion with Tighter Rates

    Authors: Steve Chien, Prateek Jain, Walid Krichene, Steffen Rendle, Shuang Song, Abhradeep Thakurta, Li Zhang

    Abstract: We study the problem of differentially private (DP) matrix completion under user-level privacy. We design a joint differentially private variant of the popular Alternating-Least-Squares (ALS) method that achieves: i) (nearly) optimal sample complexity for matrix completion (in terms of number of items, users), and ii) the best known privacy/utility trade-off both theoretically, as well as on bench…

    Submitted 20 July, 2021; originally announced July 2021.

  11. arXiv:2007.06852  [pdf, other]

    math.OC cs.AI cs.LG

    Global Convergence of Second-order Dynamics in Two-layer Neural Networks

    Authors: Walid Krichene, Kenneth F. Caluya, Abhishek Halder

    Abstract: Recent results have shown that for two-layer fully connected neural networks, gradient flow converges to a global optimum in the infinite width limit, by making a connection between the mean field dynamics and the Wasserstein gradient flow. These results were derived for first-order gradient flow, and a natural question is whether second-order dynamics, i.e., dynamics with momentum, exhibit a simi…

    Submitted 14 July, 2020; originally announced July 2020.

  12. arXiv:2005.09683  [pdf, other]

    cs.IR cs.LG stat.ML

    Neural Collaborative Filtering vs. Matrix Factorization Revisited

    Authors: Steffen Rendle, Walid Krichene, Li Zhang, John Anderson

    Abstract: Embedding based models have been the state of the art in collaborative filtering for over a decade. Traditionally, the dot product or higher order equivalents have been used to combine two or more embeddings, e.g., most notably in matrix factorization. In recent years, it was suggested to replace the dot product with a learned similarity e.g. using a multilayer perceptron (MLP). This approach is o…

    Submitted 1 June, 2020; v1 submitted 19 May, 2020; originally announced May 2020.

  13. arXiv:2002.04723  [pdf, other]

    cs.LG cs.CL stat.ML

    Superbloom: Bloom filter meets Transformer

    Authors: John Anderson, Qingqing Huang, Walid Krichene, Steffen Rendle, Li Zhang

    Abstract: We extend the idea of word pieces in natural language models to machine learning tasks on opaque ids. This is achieved by applying hash functions to map each id to multiple hash tokens in a much smaller space, similarly to a Bloom filter. We show that by applying a multi-layer Transformer to these Bloom filter digests, we are able to obtain models with high accuracy. They outperform models of a si…

    Submitted 11 February, 2020; originally announced February 2020.

  14. arXiv:1905.09874  [pdf, other]

    cs.LG cs.IR stat.ML

    Scaling Up Collaborative Filtering Data Sets through Randomized Fractal Expansions

    Authors: Francois Belletti, Karthik Lakshmanan, Walid Krichene, Nicolas Mayoraz, Yi-Fan Chen, John Anderson, Taylor Robie, Tayo Oguntebi, Dan Shirron, Amit Bleiwess

    Abstract: Recommender system research suffers from a disconnect between the size of academic data sets and the scale of industrial production systems. In order to bridge that gap, we propose to generate large-scale user/item interaction data sets by expanding pre-existing public data sets. Our key contribution is a technique that expands user/item incidence matrices to large numbers of rows (users)…

    Submitted 8 April, 2019; originally announced May 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1901.08910

    ACM Class: I.2.6; H.3.3

  15. arXiv:1901.08910  [pdf, other]

    cs.IR cs.LG stat.ML

    Scalable Realistic Recommendation Datasets through Fractal Expansions

    Authors: Francois Belletti, Karthik Lakshmanan, Walid Krichene, Yi-Fan Chen, John Anderson

    Abstract: Recommender System research suffers currently from a disconnect between the size of academic data sets and the scale of industrial production systems. In order to bridge that gap we propose to generate more massive user/item interaction data sets by expanding pre-existing public data sets. User/item incidence matrices record interactions between users and items on a given platform as a large spars…

    Submitted 20 February, 2019; v1 submitted 23 January, 2019; originally announced January 2019.

    ACM Class: I.2.6; H.3.3

  16. arXiv:1807.07187  [pdf, other]

    stat.ML cs.CL cs.LG

    Efficient Training on Very Large Corpora via Gramian Estimation

    Authors: Walid Krichene, Nicolas Mayoraz, Steffen Rendle, Li Zhang, Xinyang Yi, Lichan Hong, Ed Chi, John Anderson

    Abstract: We study the problem of learning similarity functions over very large corpora using neural network embedding models. These models are typically trained using SGD with sampling of random observed and unobserved pairs, with a number of samples that grows quadratically with the corpus size, making it expensive to scale to very large corpora. We propose new efficient methods to train these models with…

    Submitted 18 July, 2018; originally announced July 2018.

  17. arXiv:1606.01261  [pdf, other]

    cs.LG

    Minimizing Regret on Reflexive Banach Spaces and Learning Nash Equilibria in Continuous Zero-Sum Games

    Authors: Maximilian Balandat, Walid Krichene, Claire Tomlin, Alexandre Bayen

    Abstract: We study a general version of the adversarial online learning problem. We are given a decision set $\mathcal{X}$ in a reflexive Banach space $X$ and a sequence of reward vectors in the dual space of $X$. At each iteration, we choose an action from $\mathcal{X}$, based on the observed sequence of previous rewards. Our goal is to minimize regret, defined as the gap between the realized reward and th…

    Submitted 3 June, 2016; originally announced June 2016.

  18. Differential Privacy of Populations in Routing Games

    Authors: Roy Dong, Walid Krichene, Alexandre M. Bayen, S. Shankar Sastry

    Abstract: As our ground transportation infrastructure modernizes, the large amount of data being measured, transmitted, and stored motivates an analysis of the privacy aspect of these emerging cyber-physical technologies. In this paper, we consider privacy in the routing game, where the origins and destinations of drivers are considered private. This is motivated by the fact that this spatiotemporal informa…

    Submitted 15 January, 2016; originally announced January 2016.

    Comments: Extended draft of paper that appears in 2015 IEEE CDC

  19. arXiv:1504.07720  [pdf, other]

    cs.LG math.OC

    Dual Averaging on Compactly-Supported Distributions And Application to No-Regret Learning on a Continuum

    Authors: Walid Krichene

    Abstract: We consider an online learning problem on a continuum. A decision maker is given a compact feasible set $S$, and is faced with the following sequential problem: at iteration $t$, the decision maker chooses a distribution $x^{(t)} \in \Delta(S)$, then a loss function $\ell^{(t)} : S \to \mathbb{R}_+$ is revealed, and the decision maker incurs expected loss…

    Submitted 29 April, 2015; originally announced April 2015.

  20. arXiv:1408.0017  [pdf, other]

    cs.LG cs.GT

    Learning Nash Equilibria in Congestion Games

    Authors: Walid Krichene, Benjamin Drighès, Alexandre M. Bayen

    Abstract: We study the repeated congestion game, in which multiple populations of players share resources, and make, at each iteration, a decentralized decision on which resources to utilize. We investigate the following question: given a model of how individual players update their strategies, does the resulting dynamics of strategy profiles converge to the set of Nash equilibria of the one-shot game? We c…

    Submitted 31 July, 2014; originally announced August 2014.

  21. arXiv:1312.1075  [pdf, ps, other]

    cs.GT eess.SY math.OC

    A Necessary and Sufficient Condition for the Existence of Potential Functions for Heterogeneous Routing Games

    Authors: Farhad Farokhi, Walid Krichene, Alexandre M. Bayen, Karl H. Johansson

    Abstract: We study a heterogeneous routing game in which vehicles might belong to more than one type. The type determines the cost of traveling along an edge as a function of the flow of various types of vehicles over that edge. We relax the assumptions needed for the existence of a Nash equilibrium in this heterogeneous routing game. We extend the available results to present necessary and sufficient condi…

    Submitted 3 February, 2014; v1 submitted 4 December, 2013; originally announced December 2013.

    Comments: Improved Literature Review; Updated Introduction

  22. arXiv:1302.6617  [pdf, other]

    cs.LG cs.AI

    Arriving on time: estimating travel time distributions on large-scale road networks

    Authors: Timothy Hunter, Aude Hofleitner, Jack Reilly, Walid Krichene, Jerome Thai, Anastasios Kouvelas, Pieter Abbeel, Alexandre Bayen

    Abstract: Most optimal routing problems focus on minimizing travel time or distance traveled. Oftentimes, a more useful objective is to maximize the probability of on-time arrival, which requires statistical distributions of travel times, rather than just mean values. We propose a method to estimate travel time distributions on large-scale road networks, using probe vehicle data collected from GPS. We prese…

    Submitted 26 February, 2013; originally announced February 2013.