Showing 1–7 of 7 results for author: Dreveton, M

Searching in archive cs.
  1. arXiv:2406.03852  [pdf, other]

    cs.SI cs.LG math.PR

    Why the Metric Backbone Preserves Community Structure

    Authors: Maximilien Dreveton, Charbel Chucri, Matthias Grossglauser, Patrick Thiran

    Abstract: The metric backbone of a weighted graph is the union of all-pairs shortest paths. It is obtained by removing all edges $(u,v)$ that are not the shortest path between $u$ and $v$. In networks with well-separated communities, the metric backbone tends to preserve many inter-community edges, because these edges serve as bridges connecting two communities, but tends to delete many intra-community edge…

    Submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2402.15432  [pdf, ps, other]

    math.ST cs.LG stat.ML

    Universal Lower Bounds and Optimal Rates: Achieving Minimax Clustering Error in Sub-Exponential Mixture Models

    Authors: Maximilien Dreveton, Alperen Gözeten, Matthias Grossglauser, Patrick Thiran

    Abstract: Clustering is a pivotal challenge in unsupervised machine learning and is often investigated through the lens of mixture models. The optimal error rate for recovering cluster labels in Gaussian and sub-Gaussian mixture models involves ad hoc signal-to-noise ratios. Simple iterative algorithms, such as Lloyd's algorithm, attain this optimal error rate. In this paper, we first establish a universal…

    Submitted 23 February, 2024; originally announced February 2024.

    MSC Class: 62H30; 62F12; 62B10

  3. arXiv:2310.19854  [pdf, other]

    cs.SI cs.LG stat.ML

    Exact Recovery and Bregman Hard Clustering of Node-Attributed Stochastic Block Model

    Authors: Maximilien Dreveton, Felipe S. Fernandes, Daniel R. Figueiredo

    Abstract: Network clustering tackles the problem of identifying sets of nodes (communities) that have similar connection patterns. However, in many scenarios, nodes also have attributes that are correlated with the clustering structure. Thus, network information (edges) and node information (attributes) can be jointly leveraged to design high-performance clustering algorithms. Under a general model for the…

    Submitted 30 October, 2023; originally announced October 2023.

    MSC Class: 62H30; 62F12

    Journal ref: NeurIPS 2023

  4. arXiv:2306.00833  [pdf, other]

    cs.SI cs.LG math.ST stat.ME stat.ML

    When Does Bottom-up Beat Top-down in Hierarchical Community Detection?

    Authors: Maximilien Dreveton, Daichi Kuroda, Matthias Grossglauser, Patrick Thiran

    Abstract: Hierarchical clustering of networks consists in finding a tree of communities, such that lower levels of the hierarchy reveal finer-grained community structures. There are two main classes of algorithms tackling this problem. Divisive ($\textit{top-down}$) algorithms recursively partition the nodes into two communities, until a stopping rule indicates that no further split is needed. In contrast,…

    Submitted 1 June, 2023; originally announced June 2023.

  5. arXiv:2009.11353  [pdf, ps, other]

    cs.LG cs.SI math.PR math.SP stat.ML

    Higher-Order Spectral Clustering for Geometric Graphs

    Authors: Konstantin Avrachenkov, Andrei Bobu, Maximilien Dreveton

    Abstract: The present paper is devoted to clustering geometric graphs. While the standard spectral clustering is often not effective for geometric graphs, we present an effective generalization, which we call higher-order spectral clustering. It resembles in concept the classical spectral clustering method but uses for partitioning the eigenvector associated with a higher-order eigenvalue. We establish the…

    Submitted 15 March, 2021; v1 submitted 23 September, 2020; originally announced September 2020.

    Comments: 23 pages, 6 figures

    Journal ref: Journal of Fourier Analysis and Applications, 27:22, 2021

  6. arXiv:2008.04790  [pdf, ps, other]

    math.ST cs.LG math.PR

    Community recovery in non-binary and temporal stochastic block models

    Authors: Konstantin Avrachenkov, Maximilien Dreveton, Lasse Leskelä

    Abstract: This article studies the estimation of latent community memberships from pairwise interactions in a network of $N$ nodes, where the observed interactions can be of arbitrary type, including binary, categorical, and vector-valued, and not excluding even more general objects such as time series or spatial point patterns. As a generative model for such data, we introduce a stochastic block model with…

    Submitted 30 August, 2022; v1 submitted 11 August, 2020; originally announced August 2020.

    MSC Class: 62H30; 60J10; 90B15; 91D30

  7. arXiv:2007.14717  [pdf, ps, other]

    cs.LG math.ST stat.ML

    Almost exact recovery in noisy semi-supervised learning

    Authors: Konstantin Avrachenkov, Maximilien Dreveton

    Abstract: Graph-based semi-supervised learning methods combine the graph structure and labeled data to classify unlabeled data. In this work, we study the effect of a noisy oracle on classification. In particular, we derive the Maximum A Posteriori (MAP) estimator for clustering a Degree Corrected Stochastic Block Model (DC-SBM) when a noisy oracle reveals a fraction of the labels. We then propose an algori…

    Submitted 5 June, 2024; v1 submitted 29 July, 2020; originally announced July 2020.

    MSC Class: 62F12; 62H30; 68T10