Skip to main content

Showing 1–22 of 22 results for author: Chatziafratis, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.14332  [pdf, other

    cs.LG stat.ML

    From Large to Small Datasets: Size Generalization for Clustering Algorithm Selection

    Authors: Vaggos Chatziafratis, Ishani Karmarkar, Ellen Vitercik

    Abstract: In clustering algorithm selection, we are given a massive dataset and must efficiently select which clustering algorithm to use. We study this problem in a semi-supervised setting, with an unknown ground-truth clustering that we can only access through expensive oracle queries. Ideally, the clustering algorithm's output will be structurally close to the ground truth. We approach this problem by in… ▽ More

    Submitted 25 February, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  2. arXiv:2312.13490  [pdf, ps, other

    cs.DS

    Dimension-Accuracy Tradeoffs in Contrastive Embeddings for Triplets, Terminals & Top-k Nearest Neighbors

    Authors: Vaggos Chatziafratis, Piotr Indyk

    Abstract: Metric embeddings traditionally study how to map $n$ items to a target metric space such that distance lengths are not heavily distorted; but what if we only care to preserve the relative order of the distances (and not their length)? In this paper, we are motivated by the following basic question: given triplet comparisons of the form ``item $i$ is closer to item $j$ than to item $k$,'' can we fi… ▽ More

    Submitted 29 December, 2023; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: Abstract shortened for arxiv

  3. arXiv:2212.12765  [pdf, other

    cs.DS cs.CC

    Triplet Reconstruction and all other Phylogenetic CSPs are Approximation Resistant

    Authors: Vaggos Chatziafratis, Konstantin Makarychev

    Abstract: We study the natural problem of Triplet Reconstruction (also Rooted Triplets Consistency or Triplet Clustering), originally motivated in computational biology and relational databases (Aho, Sagiv, Szymanski, and Ullman, 1981): given $n$ points, we want to embed them onto the $n$ leaves of a rooted binary tree (a hierarchical clustering or ultrametric embedding) such that a given set of $m$ triplet… ▽ More

    Submitted 5 April, 2023; v1 submitted 24 December, 2022; originally announced December 2022.

    Comments: 47 pages, 12 figures, Abstract shortened for arxiv

  4. arXiv:2210.05212  [pdf, other

    cs.LG cs.AI

    On Scrambling Phenomena for Randomly Initialized Recurrent Networks

    Authors: Vaggos Chatziafratis, Ioannis Panageas, Clayton Sanford, Stelios Andrew Stavroulakis

    Abstract: Recurrent Neural Networks (RNNs) frequently exhibit complicated dynamics, and their sensitivity to the initialization process often renders them notoriously hard to train. Recent works have shed light on such phenomena analyzing when exploding or vanishing gradients may occur, either of which is detrimental for training dynamics. In this paper, we point to a formal connection between RNNs and chao… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted for publication, Neurips 2022

  5. arXiv:2208.02204  [pdf, ps, other

    cs.GT cs.LG cs.MA

    Efficiently Computing Nash Equilibria in Adversarial Team Markov Games

    Authors: Fivos Kalogiannis, Ioannis Anagnostides, Ioannis Panageas, Emmanouil-Vasileios Vlatakis-Gkaragkounis, Vaggos Chatziafratis, Stelios Stavroulakis

    Abstract: Computing Nash equilibrium policies is a central problem in multi-agent reinforcement learning that has received extensive attention both in theory and in practice. However, provable guarantees have been thus far either limited to fully competitive or cooperative scenarios or impose strong assumptions that are difficult to meet in most practical applications. In this work, we depart from those pri… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

  6. arXiv:2206.07554  [pdf, other

    cs.DS

    Hierarchical Clustering in Graph Streams: Single-Pass Algorithms and Space Lower Bounds

    Authors: Sepehr Assadi, Vaggos Chatziafratis, Jakub Łącki, Vahab Mirrokni, Chen Wang

    Abstract: The Hierarchical Clustering (HC) problem consists of building a hierarchy of clusters to represent a given dataset. Motivated by the modern large-scale applications, we study the problem in the \streaming model, in which the memory is heavily limited and only a single or very few passes over the input are allowed. Specifically, we investigate whether a good hierarchical clustering can be obtained,… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: Full version of the paper accepted to COLT 2022. 55 pages, 3 figures

  7. arXiv:2110.10295  [pdf, other

    cs.LG math.DS nlin.CD

    Expressivity of Neural Networks via Chaotic Itineraries beyond Sharkovsky's Theorem

    Authors: Clayton Sanford, Vaggos Chatziafratis

    Abstract: Given a target function $f$, how large must a neural network be in order to approximate $f$? Recent works examine this basic question on neural network \textit{expressivity} from the lens of dynamical systems and provide novel ``depth-vs-width'' tradeoffs for a large family of functions $f$. They suggest that such tradeoffs are governed by the existence of \textit{periodic} points or \emph{cycles}… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: 47 pages, 19 figures

  8. arXiv:2102.11548  [pdf, other

    cs.DS

    Maximizing Agreements for Ranking, Clustering and Hierarchical Clustering via MAX-CUT

    Authors: Vaggos Chatziafratis, Mohammad Mahdian, Sara Ahmadian

    Abstract: In this paper, we study a number of well-known combinatorial optimization problems that fit in the following paradigm: the input is a collection of (potentially inconsistent) local relationships between the elements of a ground set (e.g., pairwise comparisons, similar/dissimilar pairs, or ancestry structure of triples of points), and the goal is to aggregate this information into a global structur… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: AISTATS 2021 accepted paper

  9. arXiv:2101.10639  [pdf, other

    cs.DS

    Hierarchical Clustering via Sketches and Hierarchical Correlation Clustering

    Authors: Danny Vainstein, Vaggos Chatziafratis, Gui Citovsky, Anand Rajagopalan, Mohammad Mahdian, Yossi Azar

    Abstract: Recently, Hierarchical Clustering (HC) has been considered through the lens of optimization. In particular, two maximization objectives have been defined. Moseley and Wang defined the \emph{Revenue} objective to handle similarity information given by a weighted graph on the data points (w.l.o.g., $[0,1]$ weights), while Cohen-Addad et al. defined the \emph{Dissimilarity} objective to handle dissim… ▽ More

    Submitted 26 January, 2021; originally announced January 2021.

  10. arXiv:2010.01459  [pdf, other

    cs.DS cs.CC

    Inapproximability for Local Correlation Clustering and Dissimilarity Hierarchical Clustering

    Authors: Vaggos Chatziafratis, Neha Gupta, Euiwoong Lee

    Abstract: We present hardness of approximation results for Correlation Clustering with local objectives and for Hierarchical Clustering with dissimilarity information. For the former, we study the local objective of Puleo and Milenkovic (ICML '16) that prioritizes reducing the disagreements at data points that are worst off and for the latter we study the maximization version of Dasgupta's cost function (ST… ▽ More

    Submitted 3 October, 2020; originally announced October 2020.

  11. arXiv:2010.00402  [pdf, other

    cs.DS cs.LG stat.ML

    From Trees to Continuous Embeddings and Back: Hyperbolic Hierarchical Clustering

    Authors: Ines Chami, Albert Gu, Vaggos Chatziafratis, Christopher Ré

    Abstract: Similarity-based Hierarchical Clustering (HC) is a classical unsupervised machine learning algorithm that has traditionally been solved with heuristic algorithms like Average-Linkage. Recently, Dasgupta reframed HC as a discrete optimization problem by introducing a global cost function measuring the quality of a given tree. In this work, we provide the first continuous relaxation of Dasgupta's di… ▽ More

    Submitted 1 October, 2020; originally announced October 2020.

  12. arXiv:2003.00777  [pdf, other

    cs.LG math.DS stat.ML

    Better Depth-Width Trade-offs for Neural Networks through the lens of Dynamical Systems

    Authors: Vaggos Chatziafratis, Sai Ganesh Nagarajan, Ioannis Panageas

    Abstract: The expressivity of neural networks as a function of their depth, width and type of activation units has been an important question in deep learning theory. Recently, depth separation results for ReLU networks were obtained via a new connection with dynamical systems, using a generalized notion of fixed points of a continuous map $f$, called periodic points. In this work, we strengthen the connect… ▽ More

    Submitted 20 July, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: Appeared in ICML 2020

  13. arXiv:1912.06983  [pdf, other

    cs.DS

    Bisect and Conquer: Hierarchical Clustering via Max-Uncut Bisection

    Authors: Sara Ahmadian, Vaggos Chatziafratis, Alessandro Epasto, Euiwoong Lee, Mohammad Mahdian, Konstantin Makarychev, Grigory Yaroslavtsev

    Abstract: Hierarchical Clustering is an unsupervised data analysis method which has been widely used for decades. Despite its popularity, it had an underdeveloped analytical foundation and to address this, Dasgupta recently introduced an optimization viewpoint of hierarchical clustering with pairwise similarity information that spurred a line of work shedding light on old algorithms (e.g., Average-Linkage),… ▽ More

    Submitted 15 December, 2019; originally announced December 2019.

  14. arXiv:1912.04378  [pdf, ps, other

    cs.LG math.DS stat.ML

    Depth-Width Trade-offs for ReLU Networks via Sharkovsky's Theorem

    Authors: Vaggos Chatziafratis, Sai Ganesh Nagarajan, Ioannis Panageas, Xiao Wang

    Abstract: Understanding the representational power of Deep Neural Networks (DNNs) and how their structural properties (e.g., depth, width, type of activation unit) affect the functions they can compute, has been an important yet challenging question in deep learning and approximation theory. In a seminal paper, Telgarsky highlighted the benefits of depth by presenting a family of functions (based on simple… ▽ More

    Submitted 9 December, 2019; originally announced December 2019.

  15. arXiv:1911.13268  [pdf, other

    cs.DS cs.LG math.ST stat.ML

    Adversarially Robust Low Dimensional Representations

    Authors: Pranjal Awasthi, Vaggos Chatziafratis, Xue Chen, Aravindan Vijayaraghavan

    Abstract: Many machine learning systems are vulnerable to small perturbations made to inputs either at test time or at training time. This has received much recent interest on the empirical front due to applications where reliability and security are critical. However, theoretical understanding of algorithms that are robust to adversarial perturbations is limited. In this work we focus on Principal Compon… ▽ More

    Submitted 13 August, 2021; v1 submitted 29 November, 2019; originally announced November 2019.

    Comments: 68 pages including references

  16. arXiv:1812.10582  [pdf, other

    cs.DS

    Hierarchical Clustering for Euclidean Data

    Authors: Moses Charikar, Vaggos Chatziafratis, Rad Niazadeh, Grigory Yaroslavtsev

    Abstract: Recent works on Hierarchical Clustering (HC), a well-studied problem in exploratory data analysis, have focused on optimizing various objective functions for this problem under arbitrary similarity measures. In this paper we take the first step and give novel scalable algorithms for this problem tailored to Euclidean data in R^d and under vector-based similarity measures, a prevalent model in seve… ▽ More

    Submitted 26 December, 2018; originally announced December 2018.

  17. arXiv:1810.08414  [pdf, other

    cs.DS

    Bilu-Linial stability, certified algorithms and the Independent Set problem

    Authors: Haris Angelidakis, Pranjal Awasthi, Avrim Blum, Vaggos Chatziafratis, Chen Dan

    Abstract: We study the Maximum Independent Set (MIS) problem under the notion of stability introduced by Bilu and Linial (2010): a weighted instance of MIS is $γ$-stable if it has a unique optimal solution that remains the unique optimum under multiplicative perturbations of the weights by a factor of at most $γ\geq 1$. The goal then is to efficiently recover the unique optimal solution. In this work, we so… ▽ More

    Submitted 29 November, 2021; v1 submitted 19 October, 2018; originally announced October 2018.

    Comments: Funding and affiliation corrections. Full version of work that appeared in ESA 2019

  18. arXiv:1808.02227  [pdf, other

    cs.DS cs.GT cs.LG

    Hierarchical Clustering better than Average-Linkage

    Authors: Moses Charikar, Vaggos Chatziafratis, Rad Niazadeh

    Abstract: Hierarchical Clustering (HC) is a widely studied problem in exploratory data analysis, usually tackled by simple agglomerative procedures like average-linkage, single-linkage or complete-linkage. In this paper we focus on two objectives, introduced recently to give insight into the performance of average-linkage clustering: a similarity based HC objective proposed by [Moseley and Wang, 2017] and a… ▽ More

    Submitted 7 August, 2018; originally announced August 2018.

  19. arXiv:1807.01280  [pdf, other

    cs.LG stat.ML

    On the Computational Power of Online Gradient Descent

    Authors: Vaggos Chatziafratis, Tim Roughgarden, Joshua R. Wang

    Abstract: We prove that the evolution of weight vectors in online gradient descent can encode arbitrary polynomial-space computations, even in very simple learning settings. Our results imply that, under weak complexity-theoretic assumptions, it is impossible to reason efficiently about the fine-grained behavior of online gradient descent.

    Submitted 6 February, 2019; v1 submitted 3 July, 2018; originally announced July 2018.

    Comments: Added results, linear regression, neural nets. Fixed typos

  20. arXiv:1805.09476  [pdf, other

    cs.DS cs.AI cs.LG stat.ML

    Hierarchical Clustering with Structural Constraints

    Authors: Vaggos Chatziafratis, Rad Niazadeh, Moses Charikar

    Abstract: Hierarchical clustering is a popular unsupervised data analysis method. For many real-world applications, we would like to exploit prior information about the data that imposes constraints on the clustering hierarchy, and is not captured by the set of features available to the algorithm. This gives rise to the problem of "hierarchical clustering with structural constraints". Structural constraints… ▽ More

    Submitted 14 July, 2018; v1 submitted 23 May, 2018; originally announced May 2018.

    Comments: In Proc. 35th International Conference on Machine Learning (ICML 2018)

  21. arXiv:1705.00127  [pdf, ps, other

    cs.DS

    Stability and Recovery for Independence Systems

    Authors: Vaggos Chatziafratis, Tim Roughgarden, Jan Vondrak

    Abstract: Two genres of heuristics that are frequently reported to perform much better on "real-world" instances than in the worst case are greedy algorithms and local search algorithms. In this paper, we systematically study these two types of algorithms for the problem of maximizing a monotone submodular set function subject to downward-closed feasibility constraints. We consider perturbation-stable insta… ▽ More

    Submitted 30 June, 2017; v1 submitted 29 April, 2017; originally announced May 2017.

    Comments: version 3, after some reviews/fixes in pdf

  22. arXiv:1609.09548  [pdf, ps, other

    cs.DS

    Approximate Hierarchical Clustering via Sparsest Cut and Spreading Metrics

    Authors: Moses Charikar, Vaggos Chatziafratis

    Abstract: Dasgupta recently introduced a cost function for the hierarchical clustering of a set of points given pairwise similarities between them. He showed that this function is NP-hard to optimize, but a top-down recursive partitioning heuristic based on an alpha_n-approximation algorithm for uniform sparsest cut gives an approximation of O(alpha_n log n) (the current best algorithm has alpha_n=O(sqrt{lo… ▽ More

    Submitted 29 September, 2016; originally announced September 2016.