
Showing 1–23 of 23 results for author: Norouzi-Fard, A

Searching in archive cs.
  1. arXiv:2405.19977  [pdf, other]

    cs.DS cs.LG stat.ML

    Consistent Submodular Maximization

    Authors: Paul Dütting, Federico Fusco, Silvio Lattanzi, Ashkan Norouzi-Fard, Morteza Zadimoghaddam

    Abstract: Maximizing monotone submodular functions under cardinality constraints is a classic optimization task with several applications in data mining and machine learning. In this paper we study this problem in a dynamic environment with consistency constraints: elements arrive in a streaming fashion and the goal is maintaining a constant approximation to the optimal solution while having a stable soluti…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: To appear at ICML 24

  2. arXiv:2312.14299  [pdf, ps, other]

    cs.LG cs.CY cs.DM cs.DS math.CO math.OC

    Fairness in Submodular Maximization over a Matroid Constraint

    Authors: Marwa El Halabi, Jakub Tarnawski, Ashkan Norouzi-Fard, Thuy-Duong Vuong

    Abstract: Submodular maximization over a matroid constraint is a fundamental problem with various applications in machine learning. Some of these applications involve decision-making over datapoints with sensitive attributes such as gender or race. In such settings, it is crucial to guarantee that the selected solution is fairly distributed with respect to this attribute. Recently, fairness has been investi…

    Submitted 21 December, 2023; originally announced December 2023.

  3. arXiv:2305.19918  [pdf, ps, other]

    cs.DS cs.LG stat.ML

    Fully Dynamic Submodular Maximization over Matroids

    Authors: Paul Dütting, Federico Fusco, Silvio Lattanzi, Ashkan Norouzi-Fard, Morteza Zadimoghaddam

    Abstract: Maximizing monotone submodular functions under a matroid constraint is a classic algorithmic problem with multiple applications in data mining and machine learning. We study this classic problem in the fully dynamic setting, where elements can be both inserted and deleted in real-time. Our main result is a randomized algorithm that maintains an efficient data structure with an $\tilde{O}(k^2)$ amo…

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: Accepted at ICML 2023

  4. arXiv:2305.15118  [pdf, other]

    cs.LG cs.CY cs.DS

    Fairness in Streaming Submodular Maximization over a Matroid Constraint

    Authors: Marwa El Halabi, Federico Fusco, Ashkan Norouzi-Fard, Jakab Tardos, Jakub Tarnawski

    Abstract: Streaming submodular maximization is a natural model for the task of selecting a representative subset from a large-scale dataset. If datapoints have sensitive attributes such as gender or race, it becomes important to enforce fairness to avoid bias and discrimination. This has spurred significant interest in developing fair machine learning algorithms. Recently, such algorithms have been develope…

    Submitted 19 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted to ICML 23

  5. arXiv:2208.07582  [pdf, ps, other]

    cs.DS cs.LG stat.ML

    Deletion Robust Non-Monotone Submodular Maximization over Matroids

    Authors: Paul Dütting, Federico Fusco, Silvio Lattanzi, Ashkan Norouzi-Fard, Morteza Zadimoghaddam

    Abstract: Maximizing a submodular function is a fundamental task in machine learning and in this paper we study the deletion robust version of the problem under the classic matroids constraint. Here the goal is to extract a small size summary of the dataset that contains a high value independent set even after an adversary deleted some elements. We present constant-factor approximation algorithms, whose spa…

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: Preliminary versions of this work appeared as arXiv:2201.13128 and in ICML'22. The main difference with respect to these versions consists in extending our results to non-monotone submodular functions

  6. arXiv:2204.05154  [pdf, ps, other]

    cs.DS cs.DM

    Submodular Maximization Subject to Matroid Intersection on the Fly

    Authors: Moran Feldman, Ashkan Norouzi-Fard, Ola Svensson, Rico Zenklusen

    Abstract: Despite a surge of interest in submodular maximization in the data stream model, there remain significant gaps in our knowledge about what can be achieved in this setting, especially when dealing with multiple constraints. In this work, we nearly close several basic gaps in submodular maximization subject to $k$ matroid constraints in the data stream model. We present a new hardness result showing…

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: 41 pages, 1 figure. arXiv admin note: text overlap with arXiv:2107.07183

    MSC Class: 68R05 (Primary) 68W27; 68Q25; 90C27 (Secondary) ACM Class: F.2.2; G.2.1

  7. arXiv:2203.01440  [pdf, ps, other]

    cs.LG cs.CR cs.DS

    Near-Optimal Correlation Clustering with Privacy

    Authors: Vincent Cohen-Addad, Chenglin Fan, Silvio Lattanzi, Slobodan Mitrović, Ashkan Norouzi-Fard, Nikos Parotsidis, Jakub Tarnawski

    Abstract: Correlation clustering is a central problem in unsupervised learning, with applications spanning community detection, duplicate detection, automated labelling and many more. In the correlation clustering problem one receives as input a set of nodes and for each node a list of co-clustering preferences, and the goal is to output a clustering that minimizes the disagreement with the specified nodes'…

    Submitted 2 March, 2022; originally announced March 2022.

  8. arXiv:2201.13128  [pdf, other]

    cs.DS cs.LG stat.ML

    Deletion Robust Submodular Maximization over Matroids

    Authors: Paul Dütting, Federico Fusco, Silvio Lattanzi, Ashkan Norouzi-Fard, Morteza Zadimoghaddam

    Abstract: Maximizing a monotone submodular function is a fundamental task in machine learning. In this paper, we study the deletion robust version of the problem under the classic matroids constraint. Here the goal is to extract a small size summary of the dataset that contains a high value independent set even after an adversary deleted some elements. We present constant-factor approximation algorithms, wh…

    Submitted 31 January, 2022; originally announced January 2022.

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:5671-5693, 2022

  9. arXiv:2107.07183  [pdf, other]

    cs.DS cs.DM

    Streaming Submodular Maximization under Matroid Constraints

    Authors: Moran Feldman, Paul Liu, Ashkan Norouzi-Fard, Ola Svensson, Rico Zenklusen

    Abstract: Recent progress in (semi-)streaming algorithms for monotone submodular function maximization has led to tight results for a simple cardinality constraint. However, current techniques fail to give a similar understanding for natural generalizations, including matroid constraints. This paper aims at closing this gap. For a single matroid of rank $k$ (i.e., any solution has cardinality at most $k$),…

    Submitted 16 February, 2022; v1 submitted 15 July, 2021; originally announced July 2021.

    Comments: 44 pages

    MSC Class: 68W27 (Primary) 68R05; 68Q11 (Secondary) ACM Class: F.2.2; G.2.1

  10. arXiv:2106.08448  [pdf, other]

    cs.DS cs.DC cs.LG

    Correlation Clustering in Constant Many Parallel Rounds

    Authors: Vincent Cohen-Addad, Silvio Lattanzi, Slobodan Mitrović, Ashkan Norouzi-Fard, Nikos Parotsidis, Jakub Tarnawski

    Abstract: Correlation clustering is a central topic in unsupervised learning, with many applications in ML and data mining. In correlation clustering, one receives as input a signed graph and the goal is to partition it to minimize the number of disagreements. In this work we propose a massively parallel computation (MPC) algorithm for this problem that is considerably faster than prior work. In particular,…

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: ICML 2021 (long talk)

  11. arXiv:2106.04805  [pdf, other]

    stat.ML cs.LG cs.SI math.PR

    Streaming Belief Propagation for Community Detection

    Authors: Yuchen Wu, MohammadHossein Bateni, Andre Linhares, Filipe Miguel Goncalves de Almeida, Andrea Montanari, Ashkan Norouzi-Fard, Jakab Tardos

    Abstract: The community detection problem requires to cluster the nodes of a network into a small number of well-connected "communities". There has been substantial recent progress in characterizing the fundamental statistical limits of community detection under simple stochastic block models. However, in real-world applications, the network structure is typically dynamic, with nodes that join over time. In…

    Submitted 10 June, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: 36 pages, 13 figures

  12. arXiv:2012.11891  [pdf, ps, other]

    cs.LG cs.DS

    Fast and Accurate $k$-means++ via Rejection Sampling

    Authors: Vincent Cohen-Addad, Silvio Lattanzi, Ashkan Norouzi-Fard, Christian Sohler, Ola Svensson

    Abstract: $k$-means++ \cite{arthur2007k} is a widely used clustering algorithm that is easy to implement, has nice theoretical guarantees and strong empirical performance. Despite its wide adoption, $k…

    Submitted 22 December, 2020; originally announced December 2020.

  13. arXiv:2011.06888  [pdf, other]

    cs.DS

    Consistent k-Clustering for General Metrics

    Authors: Hendrik Fichtenberger, Silvio Lattanzi, Ashkan Norouzi-Fard, Ola Svensson

    Abstract: Given a stream of points in a metric space, is it possible to maintain a constant approximate clustering by changing the cluster centers only a small number of times during the entire execution of the algorithm? This question received attention in recent years in the machine learning literature and, before our work, the best known algorithm performs $\widetilde{O}(k^2)$ center swaps (the…

    Submitted 13 November, 2020; originally announced November 2020.

  14. arXiv:2010.07431  [pdf, other]

    cs.LG cs.DS

    Fairness in Streaming Submodular Maximization: Algorithms and Hardness

    Authors: Marwa El Halabi, Slobodan Mitrović, Ashkan Norouzi-Fard, Jakab Tardos, Jakub Tarnawski

    Abstract: Submodular maximization has become established as the method of choice for the task of selecting representative and diverse summaries of data. However, if datapoints have sensitive attributes such as gender or age, such machine learning algorithms, left unchecked, are known to exhibit bias: under- or over-representation of particular groups. This has made the design of fair machine learning algori…

    Submitted 18 October, 2020; v1 submitted 14 October, 2020; originally announced October 2020.

    Comments: Accepted to NeurIPS 2020

  15. Fully Dynamic Algorithm for Constrained Submodular Optimization

    Authors: Silvio Lattanzi, Slobodan Mitrović, Ashkan Norouzi-Fard, Jakub Tarnawski, Morteza Zadimoghaddam

    Abstract: The task of maximizing a monotone submodular function under a cardinality constraint is at the core of many machine learning and data mining applications, including data summarization, sparse regression and coverage problems. We study this classic problem in the fully dynamic setting, where elements can be both inserted and removed. Our main result is a randomized algorithm that maintains an effic…

    Submitted 24 May, 2023; v1 submitted 8 June, 2020; originally announced June 2020.

    Journal ref: NeurIPS 2020

  16. arXiv:2003.13459  [pdf, ps, other]

    cs.DS cs.DM

    The One-way Communication Complexity of Submodular Maximization with Applications to Streaming and Robustness

    Authors: Moran Feldman, Ashkan Norouzi-Fard, Ola Svensson, Rico Zenklusen

    Abstract: We consider the classical problem of maximizing a monotone submodular function subject to a cardinality constraint, which, due to its numerous applications, has recently been studied in various computational models. We consider a clean multi-player model that lies between the offline and streaming model, and study it under the aspect of one-way communication complexity. Our model captures the stre…

    Submitted 30 March, 2020; originally announced March 2020.

    Comments: 56 pages, no figures, to appear in STOC 2020 in the form of an extended abstract

    MSC Class: 68R05 (Primary) 68W27; 68Q25 (Secondary) ACM Class: F.2.2; G.2.1

  17. arXiv:1907.05725  [pdf, other]

    cs.DS

    Space Efficient Approximation to Maximum Matching Size from Uniform Edge Samples

    Authors: Michael Kapralov, Slobodan Mitrović, Ashkan Norouzi-Fard, Jakab Tardos

    Abstract: Given a source of iid samples of edges of an input graph $G$ with $n$ vertices and $m$ edges, how many samples does one need to compute a constant factor approximation to the maximum matching size in $G$? Moreover, is it possible to obtain such an estimate in a small amount of space? We show that, on the one hand, this problem cannot be solved using a nontrivially sublinear (in $m$) number of samp…

    Submitted 12 July, 2019; originally announced July 2019.

  18. arXiv:1808.01842  [pdf, other]

    cs.LG stat.ML

    Beyond $1/2$-Approximation for Submodular Maximization on Massive Data Streams

    Authors: Ashkan Norouzi-Fard, Jakub Tarnawski, Slobodan Mitrović, Amir Zandieh, Aida Mousavifar, Ola Svensson

    Abstract: Many tasks in machine learning and data mining, such as data diversification, non-parametric learning, kernel machines, clustering etc., require extracting a small but representative summary from a massive dataset. Often, such problems can be posed as maximizing a submodular set function subject to a cardinality constraint. We consider this question in the streaming setting, where elements arrive…

    Submitted 6 August, 2018; originally announced August 2018.

    Journal ref: Proc. of 35th International Conference on Machine Learning (ICML), 2018, pages 3829-3838

  19. arXiv:1711.02598  [pdf, other]

    cs.DS stat.ML

    Streaming Robust Submodular Maximization: A Partitioned Thresholding Approach

    Authors: Slobodan Mitrović, Ilija Bogunovic, Ashkan Norouzi-Fard, Jakub Tarnawski, Volkan Cevher

    Abstract: We study the classical problem of maximizing a monotone submodular function subject to a cardinality constraint k, with two additional twists: (i) elements arrive in a streaming fashion, and (ii) m items from the algorithm's memory are removed after the stream is finished. We develop a robust submodular algorithm STAR-T. It is based on a novel partitioning structure and an exponentially decreasing…

    Submitted 7 November, 2017; originally announced November 2017.

    Comments: To appear in NIPS 2017

    Journal ref: Proc. of 30th Advances in Neural Information Processing Systems (NIPS) 2017, pages 4558-4567

  20. arXiv:1612.07925  [pdf, ps, other]

    cs.DS

    Better Guarantees for k-Means and Euclidean k-Median by Primal-Dual Algorithms

    Authors: Sara Ahmadian, Ashkan Norouzi-Fard, Ola Svensson, Justin Ward

    Abstract: Clustering is a classic topic in optimization with $k$-means being one of the most fundamental such problems. In the absence of any restrictions on the input, the best known algorithm for $k$-means with a provable guarantee is a simple local search heuristic yielding an approximation guarantee of $9+ε$, a ratio that is known to be tight with respect to such methods. We overcome this barrier by p…

    Submitted 10 April, 2017; v1 submitted 23 December, 2016; originally announced December 2016.

  21. arXiv:1611.08574  [pdf, other]

    cs.DS

    An Efficient Streaming Algorithm for the Submodular Cover Problem

    Authors: Ashkan Norouzi-Fard, Abbas Bazzi, Marwa El Halabi, Ilija Bogunovic, Ya-Ping Hsieh, Volkan Cevher

    Abstract: We initiate the study of the classical Submodular Cover (SC) problem in the data streaming model which we refer to as the Streaming Submodular Cover (SSC). We show that any single pass streaming algorithm using sublinear memory in the size of the stream will fail to provide any non-trivial approximation guarantees for SSC. Hence, we consider a relaxed version of SSC, where we only seek to find a p…

    Submitted 25 November, 2016; originally announced November 2016.

    Comments: To appear in NIPS'16

  22. arXiv:1507.01906  [pdf, other]

    cs.CC

    Towards Tight Lower Bounds for Scheduling Problems

    Authors: Abbas Bazzi, Ashkan Norouzi-Fard

    Abstract: We show a close connection between structural hardness for $k$-partite graphs and tight inapproximability results for scheduling problems with precedence constraints. Assuming a natural but nontrivial generalisation of the bipartite structural hardness result of Bansal and Khot, we obtain a hardness of $2-ε$ for the problem of minimising the makespan for scheduling precedence-constrained jobs with…

    Submitted 7 July, 2015; originally announced July 2015.

    Comments: 25 pages, 3 figures, To appear in the Proceedings of the 23rd Annual European Symposium on Algorithms 2015

  23. arXiv:1411.4476  [pdf, other]

    cs.DS

    Dynamic Facility Location via Exponential Clocks

    Authors: Hyung-Chan An, Ashkan Norouzi-Fard, Ola Svensson

    Abstract: The \emph{dynamic facility location problem} is a generalization of the classic facility location problem proposed by Eisenstat, Mathieu, and Schabanel to model the dynamics of evolving social/infrastructure networks. The generalization lies in that the distance metric between clients and facilities changes over time. This leads to a trade-off between optimizing the classic objective function and…

    Submitted 17 November, 2014; originally announced November 2014.