Skip to main content

Showing 1–7 of 7 results for author: Knittel, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.12501  [pdf, other

    cs.LG cs.DS

    Fair Polylog-Approximate Low-Cost Hierarchical Clustering

    Authors: Marina Knittel, Max Springer, John Dickerson, MohammadTaghi Hajiaghayi

    Abstract: Research in fair machine learning, and particularly clustering, has been crucial in recent years given the many ethical controversies that modern intelligent systems have posed. Ahmadian et al. [2020] established the study of fairness in \textit{hierarchical} clustering, a stronger, more structured variant of its well-known flat counterpart, though their proposed algorithm that optimizes for Dasgu… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: Accepted to NeurIPS '23 (16 pages, 5 figures)

  2. arXiv:2205.14198  [pdf, other

    cs.LG cs.DS

    Generalized Reductions: Making any Hierarchical Clustering Fair and Balanced with Low Cost

    Authors: Marina Knittel, Max Springer, John P. Dickerson, MohammadTaghi Hajiaghayi

    Abstract: Clustering is a fundamental building block of modern statistical analysis pipelines. Fair clustering has seen much attention from the machine learning community in recent years. We are some of the first to study fairness in the context of hierarchical clustering, after the results of Ahmadian et al. from NeurIPS in 2020. We evaluate our results using Dasgupta's cost function, perhaps one of the mo… ▽ More

    Submitted 9 May, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

  3. arXiv:2205.14101  [pdf, ps, other

    cs.DS

    Adaptive Massively Parallel Algorithms for Cut Problems

    Authors: MohammadTaghi Hajiaghayi, Marina Knittel, Jan Olkowski, Hamed Saleh

    Abstract: We study the Weighted Min Cut problem in the Adaptive Massively Parallel Computation (AMPC) model. In 2019, Behnezhad et al. [3] introduced the AMPC model as an extension of the Massively Parallel Computation (MPC) model. In the past decade, research on highly scalable algorithms has had significant impact on many massive systems. The MPC model, introduced in 2010 by Karloff et al. [16], which is… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

  4. arXiv:2202.11095  [pdf, other

    cs.GT cs.AI cs.DS

    The Dichotomous Affiliate Stable Matching Problem: Approval-Based Matching with Applicant-Employer Relations

    Authors: Marina Knittel, Samuel Dooley, John P. Dickerson

    Abstract: While the stable marriage problem and its variants model a vast range of matching markets, they fail to capture complex agent relationships, such as the affiliation of applicants and employers in an interview marketplace. To model this problem, the existing literature on matching with externalities permits agents to provide complete and total rankings over matchings based off of both their own and… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

    Comments: 19 pages, 2 figures

  5. arXiv:2111.01904  [pdf, other

    cs.DS cs.DC

    Adaptive Massively Parallel Constant-round Tree Contraction

    Authors: MohammadTaghi Hajiaghayi, Marina Knittel, Hamed Saleh, Hsin-Hao Su

    Abstract: Miller and Reif's FOCS'85 classic and fundamental tree contraction algorithm is a broadly applicable technique for the parallel solution of a large number of tree problems. Additionally it is also used as an algorithmic design technique for a large number of parallel graph algorithms. In all previously explored models of computation, however, tree contractions have only been achieved in… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: 35 pages, 3 figures, to be published in Innovations in Theoretical Computer Science (ITCS)

  6. arXiv:2101.04818  [pdf, other

    cs.DS

    Improved Hierarchical Clustering on Massive Datasets with Broad Guarantees

    Authors: MohammadTaghi Hajiaghayi, Marina Knittel

    Abstract: Hierarchical clustering is a stronger extension of one of today's most influential unsupervised learning methods: clustering. The goal of this method is to create a hierarchy of clusters, thus constructing cluster evolutionary history and simultaneously finding clusterings at all resolutions. We propose four traits of interest for hierarchical clustering algorithms: (1) empirical performance, (2)… ▽ More

    Submitted 12 January, 2021; originally announced January 2021.

    Comments: 25 pages, 4 figures

  7. arXiv:2006.10221  [pdf, other

    cs.DS cs.LG stat.ML

    Fair Hierarchical Clustering

    Authors: Sara Ahmadian, Alessandro Epasto, Marina Knittel, Ravi Kumar, Mohammad Mahdian, Benjamin Moseley, Philip Pham, Sergei Vassilvitskii, Yuyan Wang

    Abstract: As machine learning has become more prevalent, researchers have begun to recognize the necessity of ensuring machine learning systems are fair. Recently, there has been an interest in defining a notion of fairness that mitigates over-representation in traditional clustering. In this paper we extend this notion to hierarchical clustering, where the goal is to recursively partition the data to opt… ▽ More

    Submitted 18 June, 2020; v1 submitted 17 June, 2020; originally announced June 2020.