Skip to main content

Showing 1–33 of 33 results for author: Vakilian, A

.
  1. arXiv:2405.10378  [pdf, other

    cs.DS cs.AI cs.LG

    A Polynomial-Time Approximation for Pairwise Fair $k$-Median Clustering

    Authors: Sayan Bandyapadhyay, Eden Chlamtáč, Yury Makarychev, Ali Vakilian

    Abstract: In this work, we study pairwise fair clustering with $\ell \ge 2$ groups, where for every cluster $C$ and every group $i \in [\ell]$, the number of points in $C$ from group $i$ must be at most $t$ times the number of points in $C$ from any other group $j \in [\ell]$, for a given integer $t$. To the best of our knowledge, only bi-criteria approximation and exponential-time algorithms follow for thi… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  2. arXiv:2403.10365  [pdf, ps, other

    cs.DS cs.AI cs.CY cs.LG

    Scalable Algorithms for Individual Preference Stable Clustering

    Authors: Ron Mosenzon, Ali Vakilian

    Abstract: In this paper, we study the individual preference (IP) stability, which is an notion capturing individual fairness and stability in clustering. Within this setting, a clustering is $α$-IP stable when each data point's average distance to its cluster is no more than $α$ times its average distance to any other cluster. In this paper, we study the natural local search algorithm for IP stable clusteri… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 59 pages, 9 figures, submitted to AIStats2024

  3. arXiv:2402.17736  [pdf, other

    cs.DS cs.AI cs.LG

    Learning-Based Algorithms for Graph Searching Problems

    Authors: Adela Frances DePavia, Erasmo Tani, Ali Vakilian

    Abstract: We consider the problem of graph searching with prediction recently introduced by Banerjee et al. (2022). In this problem, an agent, starting at some vertex $r$ has to traverse a (potentially unknown) graph $G$ to find a hidden goal node $g$ while minimizing the total distance travelled. We study a setting in which at any node $v$, the agent receives a noisy estimate of the distance from $v$ to… ▽ More

    Submitted 16 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: AISTATS 2024

  4. arXiv:2402.10806  [pdf, other

    cs.DS

    Streaming Algorithms for Connectivity Augmentation

    Authors: Ce **, Michael Kapralov, Sepideh Mahabadi, Ali Vakilian

    Abstract: We study the $k$-connectivity augmentation problem ($k$-CAP) in the single-pass streaming model. Given a $(k-1)$-edge connected graph $G=(V,E)$ that is stored in memory, and a stream of weighted edges $L$ with weights in $\{0,1,\dots,W\}$, the goal is to choose a minimum weight subset $L'\subseteq L$ such that $G'=(V,E\cup L')$ is $k$-edge connected. We give a $(2+ε)$-approximation algorithm for t… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  5. arXiv:2402.08758  [pdf, ps, other

    cs.LG cs.GT

    Bayesian Strategic Classification

    Authors: Lee Cohen, Saeed Sharifi-Malvajerdi, Kevin Stangl, Ali Vakilian, Juba Ziani

    Abstract: In strategic classification, agents modify their features, at a cost, to ideally obtain a positive classification from the learner's classifier. The typical response of the learner is to carefully modify their classifier to be robust to such strategic behavior. When reasoning about agent manipulations, most papers that study strategic classification rely on the following strong assumption: agents… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  6. arXiv:2312.07535  [pdf, other

    cs.DS cs.LG

    Improved Frequency Estimation Algorithms with and without Predictions

    Authors: Anders Aamand, Justin Y. Chen, Huy Lê Nguyen, Sandeep Silwal, Ali Vakilian

    Abstract: Estimating frequencies of elements appearing in a data stream is a key task in large-scale data analysis. Popular sketching approaches to this problem (e.g., CountMin and CountSketch) come with worst-case guarantees that probabilistically bound the error of the estimated frequencies for any possible input. The work of Hsu et al. (2019) introduced the idea of using machine learning to tailor sketch… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  7. arXiv:2310.00175  [pdf, other

    cs.DS cs.LG

    Tight Bounds for Volumetric Spanners and Applications

    Authors: Aditya Bhaskara, Sepideh Mahabadi, Ali Vakilian

    Abstract: Given a set of points of interest, a volumetric spanner is a subset of the points using which all the points can be expressed using "small" coefficients (measured in an appropriate norm). Formally, given a set of vectors $X = \{v_1, v_2, \dots, v_n\}$, the goal is to find $T \subseteq [n]$ such that every $v \in X$ can be expressed as $\sum_{i\in T} α_i v_i$, with $\|α\|$ being small. This notion,… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  8. arXiv:2309.16840  [pdf, other

    cs.DS cs.LG

    Constant Approximation for Individual Preference Stable Clustering

    Authors: Anders Aamand, Justin Y. Chen, Allen Liu, Sandeep Silwal, Pattara Sukprasert, Ali Vakilian, Fred Zhang

    Abstract: Individual preference (IP) stability, introduced by Ahmadi et al. (ICML 2022), is a natural clustering objective inspired by stability and fairness constraints. A clustering is $α$-IP stable if the average distance of every data point to its own cluster is at most $α$ times the average distance to any other cluster. Unfortunately, determining if a dataset admits a $1$-IP stable clustering is NP-Ha… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 20 pages

  9. arXiv:2306.06778  [pdf, other

    cs.LG cs.AI cs.DS

    Approximation Algorithms for Fair Range Clustering

    Authors: Sèdjro S. Hotegni, Sepideh Mahabadi, Ali Vakilian

    Abstract: This paper studies the fair range clustering problem in which the data points are from different demographic groups and the goal is to pick $k$ centers with the minimum clustering cost such that each group is at least minimally represented in the centers set and no group dominates the centers set. More precisely, given a set of $n$ points in a metric space $(P,d)$ where each point belongs to one o… ▽ More

    Submitted 22 June, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

    Comments: ICML 2023

  10. arXiv:2306.06611  [pdf, other

    cs.LG cs.DS

    Learning the Positions in CountSketch

    Authors: Yi Li, Honghao Lin, Simin Liu, Ali Vakilian, David P. Woodruff

    Abstract: We consider sketching algorithms which first compress data by multiplication with a random sketch matrix, and then apply the sketch to quickly solve an optimization problem, e.g., low-rank approximation and regression. In the learning-based sketching paradigm proposed by~\cite{indyk2019learning}, the sketch matrix is found by choosing a random sparse matrix, e.g., CountSketch, and then the values… ▽ More

    Submitted 10 April, 2024; v1 submitted 11 June, 2023; originally announced June 2023.

    Comments: Corrected the proof of Theorem 5.1. arXiv admin note: text overlap with arXiv:2007.09890

  11. arXiv:2302.00213  [pdf, other

    cs.DS

    Approximating Red-Blue Set Cover and Minimum Monotone Satisfying Assignment

    Authors: Eden Chlamtáč, Yury Makarychev, Ali Vakilian

    Abstract: We provide new approximation algorithms for the Red-Blue Set Cover and Circuit Minimum Monotone Satisfying Assignment (MMSA) problems. Our algorithm for Red-Blue Set Cover achieves $\tilde O(m^{1/3})$-approximation improving on the $\tilde O(m^{1/2})$-approximation due to Elkin and Peleg (where $m$ is the number of sets). Our approximation algorithm for MMSA$_t$ (for circuits of depth $t$) gives a… ▽ More

    Submitted 7 July, 2023; v1 submitted 31 January, 2023; originally announced February 2023.

    Comments: APPROX 2023

  12. arXiv:2301.13397  [pdf, other

    cs.LG cs.CY cs.GT cs.MA

    Sequential Strategic Screening

    Authors: Lee Cohen, Saeed Sharifi-Malvajerdi, Kevin Stangl, Ali Vakilian, Juba Ziani

    Abstract: We initiate the study of strategic behavior in screening processes with multiple classifiers. We focus on two contrasting settings: a conjunctive setting in which an individual must satisfy all classifiers simultaneously, and a sequential setting in which an individual to succeed must satisfy classifiers one at a time. In other words, we introduce the combination of strategic classification with s… ▽ More

    Submitted 10 February, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    MSC Class: 91 ACM Class: I.2; J.4

  13. arXiv:2207.03600  [pdf, other

    cs.LG

    Individual Preference Stability for Clustering

    Authors: Saba Ahmadi, Pranjal Awasthi, Samir Khuller, Matthäus Kleindessner, Jamie Morgenstern, Pattara Sukprasert, Ali Vakilian

    Abstract: In this paper, we propose a natural notion of individual preference (IP) stability for clustering, which asks that every data point, on average, is closer to the points in its own cluster than to the points in any other cluster. Our notion can be motivated from several perspectives, including game theory and algorithmic fairness. We study several questions related to our proposed notion. We first… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

    Comments: Accepted to ICML'22. This is a full version of the ICML version as well as a substantially improved version of arXiv:2006.04960

  14. arXiv:2204.12055  [pdf, other

    cs.DS

    Faster Fundamental Graph Algorithms via Learned Predictions

    Authors: Justin Y. Chen, Sandeep Silwal, Ali Vakilian, Fred Zhang

    Abstract: We consider the question of speeding up classic graph algorithms with machine-learned predictions. In this model, algorithms are furnished with extra advice learned from past or similar instances. Given the additional information, we aim to improve upon the traditional worst-case run-time guarantees. Our contributions are the following: (i) We give a faster algorithm for minimum-weight bipartite… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

  15. arXiv:2203.07513  [pdf, ps, other

    cs.LG cs.AI

    Multi Stage Screening: Enforcing Fairness and Maximizing Efficiency in a Pre-Existing Pipeline

    Authors: Avrim Blum, Kevin Stangl, Ali Vakilian

    Abstract: Consider an actor making selection decisions using a series of classifiers, which we term a sequential screening process. The early stages filter out some applicants, and in the final stage an expensive but accurate test is applied to the individuals that make it to the final stage. Since the final stage is expensive, if there are multiple groups with different fractions of positives at the penult… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    MSC Class: 68T01 ACM Class: I.2.6

  16. arXiv:2202.01391  [pdf, ps, other

    cs.DS cs.LG

    Fair Representation Clustering with Several Protected Classes

    Authors: Zhen Dai, Yury Makarychev, Ali Vakilian

    Abstract: We study the problem of fair $k$-median where each cluster is required to have a fair representation of individuals from different groups. In the fair representation $k$-median problem, we are given a set of points $X$ in a metric space. Each point $x\in X$ belongs to one of $\ell$ groups. Further, we are given fair representation parameters $α_j$ and $β_j$ for each group $j\in [\ell]$. We say tha… ▽ More

    Submitted 2 February, 2022; originally announced February 2022.

  17. arXiv:2111.04804  [pdf, ps, other

    cs.DS cs.LG

    Approximating Fair Clustering with Cascaded Norm Objectives

    Authors: Eden Chlamtáč, Yury Makarychev, Ali Vakilian

    Abstract: We introduce the $(p,q)$-Fair Clustering problem. In this problem, we are given a set of points $P$ and a collection of different weight functions $W$. We would like to find a clustering which minimizes the $\ell_q$-norm of the vector over $W$ of the $\ell_p$-norms of the weighted distances of points in $P$ from the centers. This generalizes various clustering problems, including Socially Fair… ▽ More

    Submitted 8 November, 2021; originally announced November 2021.

    Comments: SODA 2022

  18. arXiv:2106.14043  [pdf, ps, other

    cs.DS cs.AI cs.CY cs.LG

    Improved Approximation Algorithms for Individually Fair Clustering

    Authors: Ali Vakilian, Mustafa Yalçıner

    Abstract: We consider the $k$-clustering problem with $\ell_p$-norm cost, which includes $k$-median, $k$-means and $k$-center, under an individual notion of fairness proposed by Jung et al. [2020]: given a set of points $P$ of size $n$, a set of $k$ centers induces a fair clustering if every point in $P$ has a center among its $n/k$ closest neighbors. Mahabadi and Vakilian [2020] presented a $(p^{O(p)},7)$-… ▽ More

    Submitted 1 March, 2022; v1 submitted 26 June, 2021; originally announced June 2021.

    Comments: AISTATS 2022

  19. arXiv:2103.02512  [pdf, ps, other

    cs.DS cs.LG stat.ML

    Approximation Algorithms for Socially Fair Clustering

    Authors: Yury Makarychev, Ali Vakilian

    Abstract: We present an $(e^{O(p)} \frac{\log \ell}{\log\log\ell})$-approximation algorithm for socially fair clustering with the $\ell_p$-objective. In this problem, we are given a set of points in a metric space. Each point belongs to one (or several) of $\ell$ groups. The goal is to find a $k$-medians, $k$-means, or, more generally, $\ell_p$-clustering that is simultaneously good for all of the groups. M… ▽ More

    Submitted 15 July, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

    Comments: COLT 2021

  20. arXiv:2007.09890  [pdf, ps, other

    cs.LG cs.DS math.NA stat.ML

    Learning the Positions in CountSketch

    Authors: Simin Liu, Tianrui Liu, Ali Vakilian, Yulin Wan, David P. Woodruff

    Abstract: We consider sketching algorithms which first quickly compress data by multiplication with a random sketch matrix, and then apply the sketch to quickly solve an optimization problem, e.g., low rank approximation. In the learning-based sketching paradigm proposed by Indyk et al. [2019], the sketch matrix is found by choosing a random sparse matrix, e.g., the CountSketch, and then updating the values… ▽ More

    Submitted 7 June, 2021; v1 submitted 20 July, 2020; originally announced July 2020.

  21. arXiv:2002.06742  [pdf, other

    cs.DS cs.LG stat.ML

    Individual Fairness for $k$-Clustering

    Authors: Sepideh Mahabadi, Ali Vakilian

    Abstract: We give a local search based algorithm for $k$-median and $k$-means (and more generally for any $k$-clustering with $\ell_p$ norm cost function) from the perspective of individual fairness. More precisely, for a point $x$ in a point set $P$ of size $n$, let $r(x)$ be the minimum radius such that the ball of radius $r(x)$ centered at $x$ has at least $n/k$ points from $P$. Intuitively, if a set of… ▽ More

    Submitted 21 September, 2020; v1 submitted 16 February, 2020; originally announced February 2020.

    Comments: ICML 2020

  22. arXiv:1910.14154  [pdf, ps, other

    cs.DS cs.DC

    Improved Local Computation Algorithm for Set Cover via Sparsification

    Authors: Christoph Grunau, Slobodan Mitrović, Ronitt Rubinfeld, Ali Vakilian

    Abstract: We design a Local Computation Algorithm (LCA) for the set cover problem. Given a set system where each set has size at most $s$ and each element is contained in at most $t$ sets, the algorithm reports whether a given set is in some fixed set cover whose expected size is $O(\log{s})$ times the minimum fractional set cover value. Our algorithm requires… ▽ More

    Submitted 5 November, 2019; v1 submitted 30 October, 2019; originally announced October 2019.

    Comments: To appear in ACM-SIAM Symposium on Discrete Algorithms (SODA 2020)

  23. arXiv:1910.13984  [pdf, other

    cs.LG cs.DS stat.ML

    Learning-Based Low-Rank Approximations

    Authors: Piotr Indyk, Ali Vakilian, Yang Yuan

    Abstract: We introduce a "learning-based" algorithm for the low-rank decomposition problem: given an $n \times d$ matrix $A$, and a parameter $k$, compute a rank-$k$ matrix $A'$ that minimizes the approximation loss $\|A-A'\|_F$. The algorithm uses a training set of input matrices in order to optimize its performance. Specifically, some of the most efficient approximate algorithms for computing low-rank app… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019

  24. arXiv:1910.07616  [pdf, ps, other

    cs.DS

    Node-Weighted Network Design in Planar and Minor-Closed Families of Graphs

    Authors: Chandra Chekuri, Alina Ene, Ali Vakilian

    Abstract: We consider node-weighted survivable network design (SNDP) in planar graphs and minor-closed families of graphs. The input consists of a node-weighted undirected graph $G=(V,E)$ and integer connectivity requirements $r(uv)$ for each unordered pair of nodes $uv$. The goal is to find a minimum weighted subgraph $H$ of $G$ such that $H$ contains $r(uv)$ disjoint paths between $u$ and $v$ for each nod… ▽ More

    Submitted 16 October, 2019; originally announced October 2019.

    Comments: This paper builds upon an earlier version with results on edge-connectivity that appeared in ICALP'12 and extends its result to the setting with element-connectivity requirements

  25. arXiv:1908.05198  [pdf, other

    cs.DS

    (Learned) Frequency Estimation Algorithms under Zipfian Distribution

    Authors: Anders Aamand, Piotr Indyk, Ali Vakilian

    Abstract: \begin{abstract} The frequencies of the elements in a data stream are an important statistical measure and the task of estimating them arises in many applications within data analysis and machine learning. Two of the most popular algorithms for this problem, Count-Min and Count-Sketch, are widely used in practice. In a recent work [Hsu et al., ICLR'19], it was shown empirically that augmenting C… ▽ More

    Submitted 11 August, 2020; v1 submitted 14 August, 2019; originally announced August 2019.

  26. arXiv:1906.00339  [pdf, other

    cs.DS cs.LG

    Sample-Optimal Low-Rank Approximation of Distance Matrices

    Authors: Piotr Indyk, Ali Vakilian, Tal Wagner, David Woodruff

    Abstract: A distance matrix $A \in \mathbb R^{n \times m}$ represents all pairwise distances, $A_{ij}=\mathrm{d}(x_i,y_j)$, between two point sets $x_1,...,x_n$ and $y_1,...,y_m$ in an arbitrary metric space $(\mathcal Z, \mathrm{d})$. Such matrices arise in various computational contexts such as learning image manifolds, handwriting recognition, and multi-dimensional unfolding. In this work we study algo… ▽ More

    Submitted 2 June, 2019; originally announced June 2019.

    Comments: COLT 2019

  27. arXiv:1902.08266  [pdf, other

    cs.DS cs.DM

    Local Computation Algorithms for Spanners

    Authors: Merav Parter, Ronitt Rubinfeld, Ali Vakilian, Anak Yodpinyanee

    Abstract: A graph spanner is a fundamental graph structure that faithfully preserves the pairwise distances in the input graph up to a small multiplicative stretch. The common objective in the computation of spanners is to achieve the best-known existential size-stretch trade-off efficiently. Classical models and algorithmic analysis of graph spanners essentially assume that the algorithm can read the inp… ▽ More

    Submitted 21 February, 2019; originally announced February 2019.

    Comments: An extended abstract appeared in the proceedings of ITCS 2019

  28. arXiv:1902.03534  [pdf, ps, other

    cs.DS cs.DM

    Set Cover in Sub-linear Time

    Authors: Piotr Indyk, Sepideh Mahabadi, Ronitt Rubinfeld, Ali Vakilian, Anak Yodpinyanee

    Abstract: We study the classic set cover problem from the perspective of sub-linear algorithms. Given access to a collection of $m$ sets over $n$ elements in the query model, we show that sub-linear algorithms derived from existing techniques have almost tight query complexities. On one hand, first we show an adaptation of the streaming algorithm presented in Har-Peled et al. [2016] to the sub-linear quer… ▽ More

    Submitted 9 February, 2019; originally announced February 2019.

  29. arXiv:1902.03519  [pdf, other

    cs.DS cs.LG

    Scalable Fair Clustering

    Authors: Arturs Backurs, Piotr Indyk, Krzysztof Onak, Baruch Schieber, Ali Vakilian, Tal Wagner

    Abstract: We study the fair variant of the classic $k$-median problem introduced by Chierichetti et al. [2017]. In the standard $k$-median problem, given an input pointset $P$, the goal is to find $k$ centers $C$ and assign each input point to one of the centers in $C$ such that the average distance of points to their cluster center is minimized. In the fair variant of $k$-median, the points are colored,… ▽ More

    Submitted 10 June, 2019; v1 submitted 9 February, 2019; originally announced February 2019.

    Comments: ICML 2019

  30. arXiv:1806.02771  [pdf, other

    cs.CC cs.DS math.CO

    Structural Rounding: Approximation Algorithms for Graphs Near an Algorithmically Tractable Class

    Authors: Erik D. Demaine, Timothy D. Goodrich, Kyle Kloster, Brian Lavallee, Quanquan C. Liu, Blair D. Sullivan, Ali Vakilian, Andrew van der Poel

    Abstract: We develop a new framework for generalizing approximation algorithms from the structural graph algorithm literature so that they apply to graphs somewhat close to that class (a scenario we expect is common when working with real-world networks) while still guaranteeing approximation ratios. The idea is to $\textit{edit}$ a given graph via vertex- or edge-deletions to put the graph into an algorith… ▽ More

    Submitted 9 December, 2018; v1 submitted 7 June, 2018; originally announced June 2018.

    Comments: 72 pages, 10 figures

    ACM Class: F.2.2

  31. Towards Tight Bounds for the Streaming Set Cover Problem

    Authors: Sariel Har-Peled, Piotr Indyk, Sepideh Mahabadi, Ali Vakilian

    Abstract: We consider the classic Set Cover problem in the data stream model. For $n$ elements and $m$ sets ($m\geq n$) we give a $O(1/δ)$-pass algorithm with a strongly sub-linear $\tilde{O}(mn^δ)$ space and logarithmic approximation factor. This yields a significant improvement over the earlier algorithm of Demaine et al. [DIMV14] that uses exponentially larger number of passes. We complement this result… ▽ More

    Submitted 2 May, 2016; v1 submitted 31 August, 2015; originally announced September 2015.

    Comments: A preliminary version of this paper is to appear in PODS 2016

  32. arXiv:1503.05656  [pdf, other

    cs.DB

    Cost-Effective Conceptual Design Using Taxonomies

    Authors: Ali Vakilian, Yodsawalai Chodpathumwan, Arash Termehchy, Amir Nayyeri

    Abstract: It is known that annotating named entities in unstructured and semi-structured data sets by their concepts improves the effectiveness of answering queries over these data sets. As every enterprise has a limited budget of time or computational resources, it has to annotate a subset of concepts in a given domain whose costs of annotation do not exceed the budget. We call such a subset of concepts a… ▽ More

    Submitted 6 January, 2018; v1 submitted 19 March, 2015; originally announced March 2015.

  33. arXiv:1305.4308  [pdf, ps, other

    cs.DS cs.DM

    Connected Domatic Packings in Node-capacitated Graphs

    Authors: Alina Ene, Nitish Korula, Ali Vakilian

    Abstract: A set of vertices in a graph is a dominating set if every vertex outside the set has a neighbor in the set. A dominating set is connected if the subgraph induced by its vertices is connected. The connected domatic partition problem asks for a partition of the nodes into connected dominating sets. The connected domatic number of a graph is the size of a largest connected domatic partition and it is… ▽ More

    Submitted 5 July, 2013; v1 submitted 18 May, 2013; originally announced May 2013.

    Comments: 12 pages