Skip to main content

Showing 1–50 of 129 results for author: Mirrokni, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.11206  [pdf, other

    cs.LG cs.CR stat.ML

    Retraining with Predicted Hard Labels Provably Increases Model Accuracy

    Authors: Rudrajit Das, Inderjit S. Dhillon, Alessandro Epasto, Adel Javanmard, Jieming Mao, Vahab Mirrokni, Sujay Sanghavi, Peilin Zhong

    Abstract: The performance of a model trained with \textit{noisy labels} is often improved by simply \textit{retraining} the model with its own predicted \textit{hard} labels (i.e., $1$/$0$ labels). Yet, a detailed theoretical characterization of this phenomenon is lacking. In this paper, we theoretically analyze retraining in a linearly separable setting with randomly corrupted labels given to us and prove… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2406.04868  [pdf, ps, other

    cs.LG cs.CR cs.DS

    Perturb-and-Project: Differentially Private Similarities and Marginals

    Authors: Vincent Cohen-Addad, Tommaso d'Orsi, Alessandro Epasto, Vahab Mirrokni, Peilin Zhong

    Abstract: We revisit the input perturbations framework for differential privacy where noise is added to the input $A\in \mathcal{S}$ and the result is then projected back to the space of admissible datasets $\mathcal{S}$. Through this framework, we first design novel efficient algorithms to privately release pair-wise cosine similarities. Second, we derive a novel algorithm to compute $k$-way marginal queri… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 21 ppages, ICML 2024

    ACM Class: F.2; G.3

  3. arXiv:2406.02910  [pdf, other

    cs.DS

    High-Dimensional Geometric Streaming for Nearly Low Rank Data

    Authors: Hossein Esfandiari, Vahab Mirrokni, Praneeth Kacham, David P. Woodruff, Peilin Zhong

    Abstract: We study streaming algorithms for the $\ell_p$ subspace approximation problem. Given points $a_1, \ldots, a_n$ as an insertion-only stream and a rank parameter $k$, the $\ell_p$ subspace approximation problem is to find a $k$-dimensional subspace $V$ such that $(\sum_{i=1}^n d(a_i, V)^p)^{1/p}$ is minimized, where $d(a, V)$ denotes the Euclidean distance between $a$ and $V$ defined as… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  4. arXiv:2406.00487  [pdf, other

    cs.LG cs.AI stat.ML

    Optimistic Rates for Learning from Label Proportions

    Authors: Gene Li, Lin Chen, Adel Javanmard, Vahab Mirrokni

    Abstract: We consider a weakly supervised learning problem called Learning from Label Proportions (LLP), where examples are grouped into ``bags'' and only the average label within each bag is revealed to the learner. We study various learning rules for LLP that achieve PAC learning guarantees for classification loss. We establish that the classical Empirical Proportional Risk Minimization (EPRM) learning ru… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Accepted to COLT 2024. Comments welcome

  5. arXiv:2405.19504  [pdf, other

    cs.DS cs.DB cs.IR

    MUVERA: Multi-Vector Retrieval via Fixed Dimensional Encodings

    Authors: Laxman Dhulipala, Majid Hadian, Rajesh Jayaram, Jason Lee, Vahab Mirrokni

    Abstract: Neural embedding models have become a fundamental component of modern information retrieval (IR) pipelines. These models produce a single embedding $x \in \mathbb{R}^d$ per data-point, allowing for fast retrieval via highly optimized maximum inner product search (MIPS) algorithms. Recently, beginning with the landmark ColBERT paper, multi-vector models, which produce a set of embedding per data po… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  6. arXiv:2405.18512  [pdf, ps, other

    cs.LG cs.AI

    Understanding Transformer Reasoning Capabilities via Graph Algorithms

    Authors: Clayton Sanford, Bahare Fatemi, Ethan Hall, Anton Tsitsulin, Mehran Kazemi, Jonathan Halcrow, Bryan Perozzi, Vahab Mirrokni

    Abstract: Which transformer scaling regimes are able to perfectly solve different classes of algorithmic problems? While tremendous empirical advances have been attained by transformer-based neural networks, a theoretical understanding of their algorithmic reasoning capabilities in realistic parameter regimes is lacking. We investigate this question in terms of the network's depth, width, and number of extr… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 43 pages, 8 figures

  7. arXiv:2403.20307  [pdf, ps, other

    cs.DS

    Optimal Communication for Classic Functions in the Coordinator Model and Beyond

    Authors: Hossein Esfandiari, Praneeth Kacham, Vahab Mirrokni, David P. Woodruff, Peilin Zhong

    Abstract: In the coordinator model of communication with $s$ servers, given an arbitrary non-negative function $f$, we study the problem of approximating the sum $\sum_{i \in [n]}f(x_i)$ up to a $1 \pm \varepsilon$ factor. Here the vector $x \in R^n$ is defined to be $x = x(1) + \cdots + x(s)$, where $x(j) \ge 0$ denotes the non-negative vector held by the $j$-th server. A special case of the problem is whe… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 52 pages. To appear in STOC 2024

  8. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  9. arXiv:2402.17902  [pdf, other

    cs.LG

    SequentialAttention++ for Block Sparsification: Differentiable Pruning Meets Combinatorial Optimization

    Authors: Taisuke Yasuda, Kyriakos Axiotis, Gang Fu, MohammadHossein Bateni, Vahab Mirrokni

    Abstract: Neural network pruning is a key technique towards engineering large yet scalable, interpretable, and generalizable models. Prior work on the subject has developed largely along two orthogonal directions: (1) differentiable pruning for efficiently and accurately scoring the importance of parameters, and (2) combinatorial optimization for efficiently searching over the space of sparse models. We uni… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  10. arXiv:2402.17327  [pdf, other

    cs.LG cs.DS

    Data-Efficient Learning via Clustering-Based Sensitivity Sampling: Foundation Models and Beyond

    Authors: Kyriakos Axiotis, Vincent Cohen-Addad, Monika Henzinger, Sammy Jerome, Vahab Mirrokni, David Saulpic, David Woodruff, Michael Wunder

    Abstract: We study the data selection problem, whose aim is to select a small representative subset of data that can be used to efficiently train a machine learning model. We present a new data selection approach based on $k$-means clustering and sensitivity sampling. Assuming access to an embedding representation of the data with respect to which the model loss is Hölder continuous, our approach provably a… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  11. arXiv:2402.06082  [pdf, other

    cs.LG cs.AI cs.DS

    SubGen: Token Generation in Sublinear Time and Memory

    Authors: Amir Zandieh, Insu Han, Vahab Mirrokni, Amin Karbasi

    Abstract: Despite the significant success of large language models (LLMs), their extensive memory requirements pose challenges for deploying them in long-context token generation. The substantial memory footprint of LLM decoders arises from the necessity to store all previous tokens in the attention module, a requirement imposed by key-value (KV) caching. In this work, our focus is on develo** an efficien… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  12. arXiv:2402.04987  [pdf, other

    cs.LG cs.DS

    PriorBoost: An Adaptive Algorithm for Learning from Aggregate Responses

    Authors: Adel Javanmard, Matthew Fahrbach, Vahab Mirrokni

    Abstract: This work studies algorithms for learning from aggregate responses. We focus on the construction of aggregation sets (called bags in the literature) for event-level loss functions. We prove for linear regression and generalized linear models (GLMs) that the optimal bagging problem reduces to one-dimensional size-constrained $k$-means clustering. Further, we theoretically quantify the advantage of… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 29 pages, 4 figures

  13. arXiv:2401.11081  [pdf, other

    cs.LG cs.AI math.ST stat.ML

    Learning from Aggregate responses: Instance Level versus Bag Level Loss Functions

    Authors: Adel Javanmard, Lin Chen, Vahab Mirrokni, Ashwinkumar Badanidiyuru, Gang Fu

    Abstract: Due to the rise of privacy concerns, in many practical applications the training data is aggregated before being shared with the learner, in order to protect privacy of users' sensitive responses. In an aggregate learning framework, the dataset is grouped into bags of samples, where each bag is available only with an aggregate response, providing a summary of individuals' responses in that bag. In… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: To appear in the Twelfth International Conference on Learning Representations (ICLR 2024)

  14. arXiv:2311.10679  [pdf, other

    cs.GT

    Non-uniform Bid-scaling and Equilibria for Different Auctions: An Empirical Study

    Authors: Yuan Deng, Jieming Mao, Vahab Mirrokni, Yifeng Teng, Song Zuo

    Abstract: In recent years, the growing adoption of autobidding has motivated the study of auction design with value-maximizing auto-bidders. It is known that under mild assumptions, uniform bid-scaling is an optimal bidding strategy in truthful auctions, e.g., Vickrey-Clarke-Groves auction (VCG), and the price of anarchy for VCG is $2$. However, for other auction formats like First-Price Auction (FPA) and G… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  15. arXiv:2310.10826  [pdf, ps, other

    cs.GT econ.TH

    Mechanism Design for Large Language Models

    Authors: Paul Duetting, Vahab Mirrokni, Renato Paes Leme, Haifeng Xu, Song Zuo

    Abstract: We investigate auction mechanisms for AI-generated content, focusing on applications like ad creative generation. In our model, agents' preferences over stochastically generated content are encoded as large language models (LLMs). We propose an auction format that operates on a token-by-token basis, and allows LLM agents to influence content creation through single dimensional bids. We formulate t… ▽ More

    Submitted 2 July, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: WWW'24 Best Paper

  16. arXiv:2310.05869  [pdf, other

    cs.LG cs.AI

    HyperAttention: Long-context Attention in Near-Linear Time

    Authors: Insu Han, Rajesh Jayaram, Amin Karbasi, Vahab Mirrokni, David P. Woodruff, Amir Zandieh

    Abstract: We present an approximate attention mechanism named HyperAttention to address the computational challenges posed by the growing complexity of long contexts used in Large Language Models (LLMs). Recent work suggests that in the worst-case scenario, quadratic time is necessary unless the entries of the attention matrix are bounded or the matrix has low stable rank. We introduce two parameters which… ▽ More

    Submitted 1 December, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

  17. arXiv:2310.04015  [pdf, other

    cs.LG math.ST stat.ML

    Anonymous Learning via Look-Alike Clustering: A Precise Analysis of Model Generalization

    Authors: Adel Javanmard, Vahab Mirrokni

    Abstract: While personalized recommendations systems have become increasingly popular, ensuring user data protection remains a top concern in the development of these learning systems. A common approach to enhancing privacy involves training models using anonymous data rather than individual data. In this paper, we explore a natural technique called \emph{look-alike clustering}, which involves replacing sen… ▽ More

    Submitted 1 November, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: accepted at the Conference on Neural Information Processing Systems (NeurIPS 2023)

  18. arXiv:2310.03105  [pdf, other

    cs.GT

    Efficiency of the Generalized Second-Price Auction for Value Maximizers

    Authors: Yuan Deng, Mohammad Mahdian, Jieming Mao, Vahab Mirrokni, Hanrui Zhang, Song Zuo

    Abstract: We study the price of anarchy of the generalized second-price auction where bidders are value maximizers (i.e., autobidders). We show that in general the price of anarchy can be as bad as $0$. For comparison, the price of anarchy of running VCG is $1/2$ in the autobidding world. We further show a fined-grained price of anarchy with respect to the discount factors (i.e., the ratios of click probabi… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  19. arXiv:2310.01655  [pdf, other

    cs.LG

    PolySketchFormer: Fast Transformers via Sketching Polynomial Kernels

    Authors: Praneeth Kacham, Vahab Mirrokni, Peilin Zhong

    Abstract: The quadratic time and memory complexity inherent to self-attention mechanisms, with respect to sequence length, presents a critical computational bottleneck in the training and deployment of large-scale Transformer-based language models. Recent theoretical results indicate the intractability of sub-quadratic softmax attention approximation under reasonable complexity assumptions. This paper addre… ▽ More

    Submitted 17 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Added results of more experiments. Added a link to our JAX implementation of models

  20. arXiv:2308.03578  [pdf, other

    cs.DS cs.DB cs.DC cs.IR

    TeraHAC: Hierarchical Agglomerative Clustering of Trillion-Edge Graphs

    Authors: Laxman Dhulipala, Jason Lee, Jakub Łącki, Vahab Mirrokni

    Abstract: We introduce TeraHAC, a $(1+ε)$-approximate hierarchical agglomerative clustering (HAC) algorithm which scales to trillion-edge graphs. Our algorithm is based on a new approach to computing $(1+ε)$-approximate HAC, which is a novel combination of the nearest-neighbor chain algorithm and the notion of $(1+ε)$-approximate HAC. Our approach allows us to partition the graph among multiple machines and… ▽ More

    Submitted 11 June, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: SIGMOD 2024

  21. arXiv:2308.00957  [pdf, other

    stat.ML cs.CR cs.LG stat.ME

    Causal Inference with Differentially Private (Clustered) Outcomes

    Authors: Adel Javanmard, Vahab Mirrokni, Jean Pouget-Abadie

    Abstract: Estimating causal effects from randomized experiments is only feasible if participants agree to reveal their potentially sensitive responses. Of the many ways of ensuring privacy, label differential privacy is a widely used measure of an algorithm's privacy guarantee, which might encourage participants to share responses without running the risk of de-anonymization. Many differentially private mec… ▽ More

    Submitted 30 April, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: 41 pages, 10 figures

  22. arXiv:2308.00503  [pdf, ps, other

    cs.DS

    Massively Parallel Algorithms for High-Dimensional Euclidean Minimum Spanning Tree

    Authors: Rajesh Jayaram, Vahab Mirrokni, Shyam Narayanan, Peilin Zhong

    Abstract: We study the classic Euclidean Minimum Spanning Tree (MST) problem in the Massively Parallel Computation (MPC) model. Given a set $X \subset \mathbb{R}^d$ of $n$ points, the goal is to produce a spanning tree for $X$ with weight within a small factor of optimal. Euclidean MST is one of the most fundamental hierarchical geometric clustering algorithms, and with the proliferation of enormous high-di… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  23. arXiv:2306.00485  [pdf, other

    stat.ME cs.LG econ.EM

    Causal Estimation of User Learning in Personalized Systems

    Authors: Evan Munro, David Jones, Jennifer Brennan, Roland Nelet, Vahab Mirrokni, Jean Pouget-Abadie

    Abstract: In online platforms, the impact of a treatment on an observed outcome may change over time as 1) users learn about the intervention, and 2) the system personalization, such as individualized recommendations, change over time. We introduce a non-parametric causal model of user actions in a personalized system. We show that the Cookie-Cookie-Day (CCD) experiment, designed for the measurement of the… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: EC 2023

  24. arXiv:2305.15723  [pdf, other

    cs.LG cs.CR math.OC

    Learning across Data Owners with Joint Differential Privacy

    Authors: Yangsibo Huang, Haotian Jiang, Daogao Liu, Mohammad Mahdian, Jieming Mao, Vahab Mirrokni

    Abstract: In this paper, we study the setting in which data owners train machine learning models collaboratively under a privacy notion called joint differential privacy [Kearns et al., 2018]. In this setting, the model trained for each data owner $j$ uses $j$'s data without privacy consideration and other owners' data with differential privacy guarantees. This setting was initiated in [Jain et al., 2021] w… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  25. arXiv:2305.09557  [pdf, other

    cs.LG cs.AI stat.ML

    Learning from Aggregated Data: Curated Bags versus Random Bags

    Authors: Lin Chen, Gang Fu, Amin Karbasi, Vahab Mirrokni

    Abstract: Protecting user privacy is a major concern for many machine learning systems that are deployed at scale and collect from a diverse set of population. One way to address this concern is by collecting and releasing data labels in an aggregated manner so that the information about a single user is potentially combined with others. In this paper, we explore the possibility of training machine learning… ▽ More

    Submitted 18 May, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

  26. arXiv:2304.11741  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Robust and differentially private stochastic linear bandits

    Authors: Vasileios Charisopoulos, Hossein Esfandiari, Vahab Mirrokni

    Abstract: In this paper, we study the stochastic linear bandit problem under the additional requirements of differential privacy, robustness and batched observations. In particular, we assume an adversary randomly chooses a constant fraction of the observed rewards in each batch, replacing them with arbitrary numbers. We present differentially private and robust variants of the arm elimination algorithm usi… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: 25 pages

  27. arXiv:2304.07210  [pdf, other

    cs.CR cs.LG

    Measuring Re-identification Risk

    Authors: CJ Carey, Travis Dick, Alessandro Epasto, Adel Javanmard, Josh Karlin, Shankar Kumar, Andres Munoz Medina, Vahab Mirrokni, Gabriel Henrique Nunes, Sergei Vassilvitskii, Peilin Zhong

    Abstract: Compact user representations (such as embeddings) form the backbone of personalization services. In this work, we present a new theoretical framework to measure re-identification risk in such user representations. Our framework, based on hypothesis testing, formally bounds the probability that an attacker may be able to obtain the identity of a user from their representation. As an application, we… ▽ More

    Submitted 31 July, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

  28. arXiv:2303.15634  [pdf, other

    cs.LG math.OC stat.ML

    Learning Rate Schedules in the Presence of Distribution Shift

    Authors: Matthew Fahrbach, Adel Javanmard, Vahab Mirrokni, Pratik Worah

    Abstract: We design learning rate schedules that minimize regret for SGD-based online learning in the presence of a changing data distribution. We fully characterize the optimal learning rate schedule for online linear regression via a novel analysis with stochastic differential equations. For general convex loss functions, we propose new learning rate schedules that are robust to distribution shift and we… ▽ More

    Submitted 20 August, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: 33 pages, 6 figures

    Journal ref: Proceedings of the 40th International Conference on Machine Learning (ICML 2023) 9523-9546

  29. Optimal Fully Dynamic $k$-Center Clustering for Adaptive and Oblivious Adversaries

    Authors: MohammadHossein Bateni, Hossein Esfandiari, Hendrik Fichtenberger, Monika Henzinger, Rajesh Jayaram, Vahab Mirrokni, Andreas Wiese

    Abstract: In fully dynamic clustering problems, a clustering of a given data set in a metric space must be maintained while it is modified through insertions and deletions of individual points. In this paper, we resolve the complexity of fully dynamic $k$-center clustering against both adaptive and oblivious adversaries. Against oblivious adversaries, we present the first algorithm for fully dynamic $k$-cen… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2112.07050, arXiv:2112.07217

  30. arXiv:2302.10359  [pdf, other

    cs.LG cs.DS

    Replicable Clustering

    Authors: Hossein Esfandiari, Amin Karbasi, Vahab Mirrokni, Grigoris Velegkas, Felix Zhou

    Abstract: We design replicable algorithms in the context of statistical clustering under the recently introduced notion of replicability from Impagliazzo et al. [2022]. According to this definition, a clustering algorithm is replicable if, with high probability, its output induces the exact same partition of the sample space after two executions on different inputs drawn from the same distribution, when its… ▽ More

    Submitted 26 October, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: to be published in NeurIPS 2023

  31. arXiv:2302.08530  [pdf, other

    cs.GT

    A Field Guide for Pacing Budget and ROS Constraints

    Authors: Santiago R. Balseiro, Kshipra Bhawalkar, Zhe Feng, Haihao Lu, Vahab Mirrokni, Balasubramanian Sivan, Di Wang

    Abstract: Budget pacing is a popular service that has been offered by major internet advertising platforms since their inception. Budget pacing systems seek to optimize advertiser returns subject to budget constraints by smoothly spending advertiser budgets. In the past few years, autobidding products that provide real-time bidding as a service to advertisers have seen a prominent rise in adoption. A popula… ▽ More

    Submitted 15 December, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

  32. arXiv:2302.03886  [pdf, other

    cs.DS cs.LG math.CO

    Approximately Optimal Core Shapes for Tensor Decompositions

    Authors: Mehrdad Ghadiri, Matthew Fahrbach, Gang Fu, Vahab Mirrokni

    Abstract: This work studies the combinatorial optimization problem of finding an optimal core tensor shape, also called multilinear rank, for a size-constrained Tucker decomposition. We give an algorithm with provable approximation guarantees for its reconstruction error via connections to higher-order singular values. Specifically, we introduce a novel Tucker packing problem, which we prove is NP-hard, and… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

    Comments: 18 pages, 4 figures

    Journal ref: Proceedings of the 40th International Conference on Machine Learning (ICML 2023) 11237-11254

  33. arXiv:2302.02006  [pdf, other

    cs.LG cs.DS math.OC

    Robust Budget Pacing with a Single Sample

    Authors: Santiago Balseiro, Rachitesh Kumar, Vahab Mirrokni, Balasubramanian Sivan, Di Wang

    Abstract: Major Internet advertising platforms offer budget pacing tools as a standard service for advertisers to manage their ad campaigns. Given the inherent non-stationarity in an advertiser's value and also competing advertisers' values over time, a commonly used approach is to learn a target expenditure plan that specifies a target spend as a function of time, and then run a controller that tracks this… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

  34. arXiv:2302.01523  [pdf, other

    cs.GT cs.LG

    Multi-channel Autobidding with Budget and ROI Constraints

    Authors: Yuan Deng, Negin Golrezaei, Patrick Jaillet, Jason Cheuk Nam Liang, Vahab Mirrokni

    Abstract: In digital online advertising, advertisers procure ad impressions simultaneously on multiple platforms, or so-called channels, such as Google Ads, Meta Ads Manager, etc., each of which consists of numerous ad auctions. We study how an advertiser maximizes total conversion (e.g. ad clicks) while satisfying aggregate return-on-investment (ROI) and budget constraints across all channels. In practice,… ▽ More

    Submitted 14 June, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

  35. arXiv:2302.00377  [pdf, ps, other

    cs.GT

    Autobidding Auctions in the Presence of User Costs

    Authors: Yuan Deng, Jieming Mao, Vahab Mirrokni, Hanrui Zhang, Song Zuo

    Abstract: We study autobidding ad auctions with user costs, where each bidder is value-maximizing subject to a return-over-investment (ROI) constraint, and the seller aims to maximize the social welfare taking into consideration the user's cost of viewing an ad. We show that in the worst case, the approximation ratio of social welfare by running the vanilla VCG auctions with user costs could as bad as 0. To… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  36. arXiv:2302.00037  [pdf, other

    cs.LG cs.CR cs.DS

    Differentially-Private Hierarchical Clustering with Provable Approximation Guarantees

    Authors: Jacob Imola, Alessandro Epasto, Mohammad Mahdian, Vincent Cohen-Addad, Vahab Mirrokni

    Abstract: Hierarchical Clustering is a popular unsupervised machine learning method with decades of history and numerous applications. We initiate the study of differentially private approximation algorithms for hierarchical clustering under the rigorous framework introduced by (Dasgupta, 2016). We show strong lower bounds for the problem: that any $ε$-DP algorithm must exhibit $O(|V|^2/ ε)$-additive error… ▽ More

    Submitted 23 May, 2023; v1 submitted 31 January, 2023; originally announced February 2023.

    Comments: 28 pages, 1 figure

  37. arXiv:2301.05605  [pdf, ps, other

    cs.DS

    Differentially Private Continual Releases of Streaming Frequency Moment Estimations

    Authors: Alessandro Epasto, Jieming Mao, Andres Munoz Medina, Vahab Mirrokni, Sergei Vassilvitskii, Peilin Zhong

    Abstract: The streaming model of computation is a popular approach for working with large-scale data. In this setting, there is a stream of items and the goal is to compute the desired quantities (usually data statistics) while making a single pass through the stream and using as little space as possible. Motivated by the importance of data privacy, we develop differentially private streaming algorithms u… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

  38. arXiv:2212.14334  [pdf, ps, other

    cs.DS cs.LG cs.SI

    Constant Approximation for Normalized Modularity and Associations Clustering

    Authors: Jakub Łącki, Vahab Mirrokni, Christian Sohler

    Abstract: We study the problem of graph clustering under a broad class of objectives in which the quality of a cluster is defined based on the ratio between the number of edges in the cluster, and the total weight of vertices in the cluster. We show that our definition is closely related to popular clustering measures, namely normalized associations, which is a dual of the normalized cut objective, and norm… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

    MSC Class: 68W25 ACM Class: F.2.2

  39. arXiv:2212.02635  [pdf, ps, other

    cs.LG cs.DS

    Stars: Tera-Scale Graph Building for Clustering and Graph Learning

    Authors: CJ Carey, Jonathan Halcrow, Rajesh Jayaram, Vahab Mirrokni, Warren Schudy, Peilin Zhong

    Abstract: A fundamental procedure in the analysis of massive datasets is the construction of similarity graphs. Such graphs play a key role for many downstream tasks, including clustering, classification, graph learning, and nearest neighbor search. For these tasks, it is critical to build graphs which are sparse yet still representative of the underlying data. The benefits of sparsity are twofold: firstly,… ▽ More

    Submitted 9 January, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: NeurIPS 2022

  40. arXiv:2210.12198  [pdf, other

    cs.LG

    Anonymous Bandits for Multi-User Systems

    Authors: Hossein Esfandiari, Vahab Mirrokni, Jon Schneider

    Abstract: In this work, we present and study a new framework for online learning in systems with multiple users that provide user anonymity. Specifically, we extend the notion of bandits to obey the standard $k$-anonymity constraint by requiring each observation to be an aggregation of rewards for at least $k$ users. This provides a simple yet effective framework where one can learn a clustering of users in… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

  41. arXiv:2210.01898  [pdf, ps, other

    cs.LG

    Replicable Bandits

    Authors: Hossein Esfandiari, Alkis Kalavasis, Amin Karbasi, Andreas Krause, Vahab Mirrokni, Grigoris Velegkas

    Abstract: In this paper, we introduce the notion of replicable policies in the context of stochastic bandits, one of the canonical problems in interactive learning. A policy in the bandit environment is called replicable if it pulls, with high probability, the exact same sequence of arms in two different and independent executions (i.e., under independent reward realizations). We show that not only do repli… ▽ More

    Submitted 14 February, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

  42. arXiv:2209.14881  [pdf, other

    cs.LG stat.ML

    Sequential Attention for Feature Selection

    Authors: Taisuke Yasuda, MohammadHossein Bateni, Lin Chen, Matthew Fahrbach, Gang Fu, Vahab Mirrokni

    Abstract: Feature selection is the problem of selecting a subset of features for a machine learning model that maximizes model quality subject to a budget constraint. For neural networks, prior methods, including those based on $\ell_1$ regularization, attention, and other techniques, typically select the entire feature subset in one evaluation round, ignoring the residual value of features during selection… ▽ More

    Submitted 25 April, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: Accepted to ICLR 2023

    Journal ref: Proceedings of the 11th International Conference on Learning Representations (ICLR 2023)

  43. arXiv:2209.04748  [pdf, other

    cs.GT

    Individual Welfare Guarantees in the Autobidding World with Machine-learned Advice

    Authors: Yuan Deng, Negin Golrezaei, Patrick Jaillet, Jason Cheuk Nam Liang, Vahab Mirrokni

    Abstract: Online advertising channels have commonly focused on maximizing total advertiser value (or welfare) to enhance long-run retention and channel healthiness. Previous literature has studied auction design by incorporating machine learning predictions on advertiser values (also known as machine-learned advice) through various forms to improve total welfare. Yet, such improvements could come at the cos… ▽ More

    Submitted 14 June, 2023; v1 submitted 10 September, 2022; originally announced September 2022.

  44. arXiv:2208.10650  [pdf, other

    cs.GT cs.DS

    Efficiency of the First-Price Auction in the Autobidding World

    Authors: Yuan Deng, Jieming Mao, Vahab Mirrokni, Hanrui Zhang, Song Zuo

    Abstract: We study the price of anarchy of the first-price auction in the autobidding world, where bidders can be either utility maximizers (i.e., traditional bidders) or value maximizers (i.e., autobidders). We show that with autobidders only, the price of anarchy of the first-price auction is $1/2$, and with both kinds of bidders, the price of anarchy degrades to about $0.457$ (the precise number is given… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

  45. arXiv:2207.06944  [pdf, ps, other

    cs.CR cs.LG cs.SI stat.ML

    Differentially Private Graph Learning via Sensitivity-Bounded Personalized PageRank

    Authors: Alessandro Epasto, Vahab Mirrokni, Bryan Perozzi, Anton Tsitsulin, Peilin Zhong

    Abstract: Personalized PageRank (PPR) is a fundamental tool in unsupervised learning of graph representations such as node ranking, labeling, and graph embedding. However, while data privacy is one of the most important recent concerns, existing PPR algorithms are not designed to protect user privacy. PPR is highly sensitive to the input graph edges: the difference of only one edge may cause a big change in… ▽ More

    Submitted 14 February, 2024; v1 submitted 14 July, 2022; originally announced July 2022.

  46. arXiv:2207.06358  [pdf, other

    cs.CR cs.LG

    Smooth Anonymity for Sparse Graphs

    Authors: Alessandro Epasto, Hossein Esfandiari, Vahab Mirrokni, Andres Munoz Medina

    Abstract: When working with user data providing well-defined privacy guarantees is paramount. In this work, we aim to manipulate and share an entire sparse dataset with a third party privately. In fact, differential privacy has emerged as the gold standard of privacy, however, when it comes to sharing sparse datasets, e.g. sparse networks, as one of our main results, we prove that \emph{any} differentially… ▽ More

    Submitted 14 May, 2024; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: WWW 2024 Short Paper

  47. arXiv:2207.03522  [pdf, other

    cs.LG cs.NE cs.SI physics.soc-ph stat.ML

    TF-GNN: Graph Neural Networks in TensorFlow

    Authors: Oleksandr Ferludin, Arno Eigenwillig, Martin Blais, Dustin Zelle, Jan Pfeifer, Alvaro Sanchez-Gonzalez, Wai Lok Sibon Li, Sami Abu-El-Haija, Peter Battaglia, Neslihan Bulut, Jonathan Halcrow, Filipe Miguel Gonçalves de Almeida, Pedro Gonnet, Liangze Jiang, Parth Kothari, Silvio Lattanzi, André Linhares, Brandon Mayer, Vahab Mirrokni, John Palowitch, Mihir Paradkar, Jennifer She, Anton Tsitsulin, Kevin Villela, Lisa Wang , et al. (2 additional authors not shown)

    Abstract: TensorFlow-GNN (TF-GNN) is a scalable library for Graph Neural Networks in TensorFlow. It is designed from the bottom up to support the kinds of rich heterogeneous graph data that occurs in today's information ecosystems. In addition to enabling machine learning researchers and advanced developers, TF-GNN offers low-code solutions to empower the broader developer community in graph learning. Many… ▽ More

    Submitted 23 July, 2023; v1 submitted 7 July, 2022; originally announced July 2022.

  48. arXiv:2206.08646  [pdf, other

    cs.DS cs.CR cs.LG

    Scalable Differentially Private Clustering via Hierarchically Separated Trees

    Authors: Vincent Cohen-Addad, Alessandro Epasto, Silvio Lattanzi, Vahab Mirrokni, Andres Munoz, David Saulpic, Chris Schwiegelshohn, Sergei Vassilvitskii

    Abstract: We study the private $k$-median and $k$-means clustering problem in $d$ dimensional Euclidean space. By leveraging tree embeddings, we give an efficient and easy to implement algorithm, that is empirically competitive with state of the art non private methods. We prove that our method computes a solution with cost at most $O(d^{3/2}\log n)\cdot OPT + O(k d^2 \log^2 n / ε^2)$, where $ε$ is the priv… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: To appear at KDD'22

  49. arXiv:2206.07554  [pdf, other

    cs.DS

    Hierarchical Clustering in Graph Streams: Single-Pass Algorithms and Space Lower Bounds

    Authors: Sepehr Assadi, Vaggos Chatziafratis, Jakub Łącki, Vahab Mirrokni, Chen Wang

    Abstract: The Hierarchical Clustering (HC) problem consists of building a hierarchy of clusters to represent a given dataset. Motivated by the modern large-scale applications, we study the problem in the \streaming model, in which the memory is heavily limited and only a single or very few passes over the input are allowed. Specifically, we investigate whether a good hierarchical clustering can be obtained,… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: Full version of the paper accepted to COLT 2022. 55 pages, 3 figures

  50. arXiv:2205.10403  [pdf, other

    cs.LG cs.CC

    Tackling Provably Hard Representative Selection via Graph Neural Networks

    Authors: Mehran Kazemi, Anton Tsitsulin, Hossein Esfandiari, MohammadHossein Bateni, Deepak Ramachandran, Bryan Perozzi, Vahab Mirrokni

    Abstract: Representative Selection (RS) is the problem of finding a small subset of exemplars from a dataset that is representative of the dataset. In this paper, we study RS for attributed graphs, and focus on finding representative nodes that optimize the accuracy of a model trained on the selected representatives. Theoretically, we establish a new hardness result forRS (in the absence of a graph structur… ▽ More

    Submitted 19 July, 2023; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: Accepted at the Transactions of Machine Learning Research (TMLR) Journal