Skip to main content

Showing 1–50 of 67 results for author: Lelarge, M

.
  1. arXiv:2406.15076  [pdf, other

    cs.LG

    Neural Incremental Data Assimilation

    Authors: Matthieu Blanke, Ronan Fablet, Marc Lelarge

    Abstract: Data assimilation is a central problem in many geophysical applications, such as weather forecasting. It aims to estimate the state of a potentially large system, such as the atmosphere, from sparse observations, supplemented by prior physical knowledge. The size of the systems involved and the complexity of the underlying physical equations make it a challenging task from a computational point of… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2312.09860  [pdf, other

    cs.LG stat.CO

    Automatic Rao-Blackwellization for Sequential Monte Carlo with Belief Propagation

    Authors: Waïss Azizian, Guillaume Baudart, Marc Lelarge

    Abstract: Exact Bayesian inference on state-space models~(SSM) is in general untractable, and unfortunately, basic Sequential Monte Carlo~(SMC) methods do not yield correct approximations for complex models. In this paper, we propose a mixed inference algorithm that computes closed-form solutions using belief propagation as much as possible, and falls back to sampling-based SMC methods when exact computatio… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  3. arXiv:2312.00477  [pdf, other

    cs.LG stat.ML

    Interpretable Meta-Learning of Physical Systems

    Authors: Matthieu Blanke, Marc Lelarge

    Abstract: Machine learning methods can be a valuable aid in the scientific process, but they need to face challenging settings where data come from inhomogeneous experimental conditions. Recent meta-learning methods have made significant progress in multi-task learning, but they rely on black-box neural networks, resulting in high computational costs and limited interpretability. Leveraging the structure of… ▽ More

    Submitted 20 March, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Journal ref: The Twelfth International Conference on Learning Representations, ICLR 2024

  4. arXiv:2304.13426  [pdf, other

    cs.LG stat.ML

    FLEX: an Adaptive Exploration Algorithm for Nonlinear Systems

    Authors: Matthieu Blanke, Marc Lelarge

    Abstract: Model-based reinforcement learning is a powerful tool, but collecting data to fit an accurate model of the system can be costly. Exploring an unknown environment in a sample-efficient manner is hence of great importance. However, the complexity of dynamics and the computational limitations of real systems make this task challenging. In this work, we introduce FLEX, an exploration algorithm for non… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: Accepted at ICML 2023

  5. arXiv:2301.08117  [pdf, other

    cs.LG

    Convergence beyond the over-parameterized regime using Rayleigh quotients

    Authors: David A. R. Robin, Kevin Scaman, Marc Lelarge

    Abstract: In this paper, we present a new strategy to prove the convergence of deep learning architectures to a zero training (or even testing) loss by gradient flow. Our analysis is centered on the notion of Rayleigh quotients in order to prove Kurdyka-Łojasiewicz inequalities for a broader set of neural network architectures and loss functions. We show that Rayleigh quotients provide a unified view for se… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: Published at the 36th conference on Neural Information Processing Systems (NeurIPS 2022)

  6. arXiv:2204.06375  [pdf, other

    stat.ML cs.LG eess.SY

    Online greedy identification of linear dynamical systems

    Authors: Matthieu Blanke, Marc Lelarge

    Abstract: This work addresses the problem of exploration in an unknown environment. For linear dynamical systems, we use an experimental design framework and introduce an online greedy policy where the control maximizes the information of the next step. In a setting with a limited number of experimental trials, our algorithm has low complexity and shows experimentally competitive performances compared to mo… ▽ More

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: 17 pages, 2 figures

  7. arXiv:2203.10107  [pdf, other

    cs.LG

    SiMCa: Sinkhorn Matrix Factorization with Capacity Constraints

    Authors: Eric Daoud, Luca Ganassali, Antoine Baker, Marc Lelarge

    Abstract: For a very broad range of problems, recommendation algorithms have been increasingly used over the past decade. In most of these algorithms, the predictions are built upon user-item affinity scores which are obtained from high-dimensional embeddings of items and users. In more complex scenarios, with geometrical or capacity constraints, prediction based on embeddings may not be sufficient and some… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: All comments are welcome

  8. arXiv:2107.07623  [pdf, other

    cs.DS cs.LG math.PR math.ST stat.ML

    Correlation detection in trees for planted graph alignment

    Authors: Luca Ganassali, Laurent Massoulié, Marc Lelarge

    Abstract: Motivated by alignment of correlated sparse random graphs, we introduce a hypothesis testing problem of deciding whether or not two random trees are correlated. We obtain sufficient conditions under which this testing is impossible or feasible. We propose MPAlign, a message-passing algorithm for graph alignment inspired by the tree correlation detection problem. We prove MPAlign to succeed in poly… ▽ More

    Submitted 5 December, 2022; v1 submitted 15 July, 2021; originally announced July 2021.

    Comments: 38 pages, 9 figures

  9. arXiv:2102.02685  [pdf, other

    stat.ML cs.LG math.PR math.ST

    Impossibility of Partial Recovery in the Graph Alignment Problem

    Authors: Luca Ganassali, Laurent Massoulié, Marc Lelarge

    Abstract: Random graph alignment refers to recovering the underlying vertex correspondence between two random graphs with correlated edges. This can be viewed as an average-case and noisy version of the well-known graph isomorphism problem. For the correlated Erdös-Rényi model, we prove an impossibility result for partial recovery in the sparse regime, with constant average degree and correlation, as well a… ▽ More

    Submitted 29 June, 2021; v1 submitted 4 February, 2021; originally announced February 2021.

    Comments: 23 pages, 8 figures. Accepted for publication at COLT21

    Journal ref: Proceedings of Thirty Fourth Conference on Learning Theory, PMLR 134:2080-2102, 2021

  10. arXiv:2011.02143  [pdf, other

    cs.CL cs.AI cs.LG

    Conditioned Text Generation with Transfer for Closed-Domain Dialogue Systems

    Authors: Stéphane d'Ascoli, Alice Coucke, Francesco Caltagirone, Alexandre Caulier, Marc Lelarge

    Abstract: Scarcity of training data for task-oriented dialogue systems is a well known problem that is usually tackled with costly and time-consuming manual data annotation. An alternative solution is to rely on automatic text generation which, although less accurate than human supervision, has the advantage of being cheap and fast. Our contribution is twofold. First we show how to optimally train and contr… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1911.03698

  11. arXiv:2006.15646  [pdf, other

    cs.LG cs.DM stat.ML

    Expressive Power of Invariant and Equivariant Graph Neural Networks

    Authors: Waïss Azizian, Marc Lelarge

    Abstract: Various classes of Graph Neural Networks (GNN) have been proposed and shown to be successful in a wide range of applications with graph structured data. In this paper, we propose a theoretical framework able to compare the expressive power of these GNN architectures. The current universality theorems only apply to intractable classes of GNNs. Here, we prove the first approximation guarantees for p… ▽ More

    Submitted 6 June, 2021; v1 submitted 28 June, 2020; originally announced June 2020.

    Comments: Appears in: Proceedings of the 9th International Conference on Learning Representations, ICLR 2021. 39 pages

    ACM Class: G.1.6; I.2.6

  12. Spectral Alignment of Correlated Gaussian matrices

    Authors: Luca Ganassali, Marc Lelarge, Laurent Massoulié

    Abstract: In this paper we analyze a simple spectral method (EIG1) for the problem of matrix alignment, consisting in aligning their leading eigenvectors: given two matrices $A$ and $B$, we compute $v_1$ and $v'_1$ two corresponding leading eigenvectors. The algorithm returns the permutation $\hatπ$ such that the rank of coordinate $\hatπ(i)$ in $v_1$ and that of coordinate $i$ in $v'_1$ (up to the sign of… ▽ More

    Submitted 11 May, 2021; v1 submitted 30 November, 2019; originally announced December 2019.

    Comments: 26 pages, 4 figures. Figures and paper organization updated, typos corrected. Remark 4.2. added

    Journal ref: Advances in Applied Probability (2022) 1-32

  13. arXiv:1911.03698  [pdf, other

    cs.CL cs.LG stat.ML

    Conditioned Query Generation for Task-Oriented Dialogue Systems

    Authors: Stéphane d'Ascoli, Alice Coucke, Francesco Caltagirone, Alexandre Caulier, Marc Lelarge

    Abstract: Scarcity of training data for task-oriented dialogue systems is a well known problem that is usually tackled with costly and time-consuming manual data annotation. An alternative solution is to rely on automatic text generation which, although less accurate than human supervision, has the advantage of being cheap and fast. In this paper we propose a novel controlled data generation method that cou… ▽ More

    Submitted 9 November, 2019; originally announced November 2019.

  14. arXiv:1907.03792  [pdf, other

    cs.LG math.ST stat.ML

    Asymptotic Bayes risk for Gaussian mixture in a semi-supervised setting

    Authors: Marc Lelarge, Leo Miolane

    Abstract: Semi-supervised learning (SSL) uses unlabeled data for training and has been shown to greatly improve performance when compared to a supervised approach on the labeled data available. This claim depends both on the amount of labeled data available and on the algorithm used. In this paper, we compute analytically the gap between the best fully-supervised approach using only labeled data and the b… ▽ More

    Submitted 28 September, 2019; v1 submitted 8 July, 2019; originally announced July 2019.

    Comments: 13 pages

  15. arXiv:1809.11115  [pdf, ps, other

    cs.LG stat.ML

    Weighted Spectral Embedding of Graphs

    Authors: Thomas Bonald, Alexandre Hollocou, Marc Lelarge

    Abstract: We present a novel spectral embedding of graphs that incorporates weights assigned to the nodes, quantifying their relative importance. This spectral embedding is based on the first eigenvectors of some properly normalized version of the Laplacian. We prove that these eigenvectors correspond to the configurations of lowest energy of an equivalent physical system, either mechanical or electrical, i… ▽ More

    Submitted 3 October, 2018; v1 submitted 28 September, 2018; originally announced September 2018.

  16. arXiv:1806.08240  [pdf, other

    cs.LG stat.ML

    InfoCatVAE: Representation Learning with Categorical Variational Autoencoders

    Authors: Edouard Pineau, Marc Lelarge

    Abstract: This paper describes InfoCatVAE, an extension of the variational autoencoder that enables unsupervised disentangled representation learning. InfoCatVAE uses multimodal distributions for the prior and the inference network and then maximizes the evidence lower bound objective (ELBO). We connect the new ELBO derived for our model with a natural soft clustering objective which explains the robustness… ▽ More

    Submitted 25 June, 2018; v1 submitted 20 June, 2018; originally announced June 2018.

    Comments: 9 pages, 3 appendix, 5 figures. arXiv admin note: text overlap with arXiv:1606.03657 by other authors

  17. arXiv:1803.09533  [pdf, other

    cs.CY cs.LG stat.ML

    Deep Representation for Patient Visits from Electronic Health Records

    Authors: Jean-Baptiste Escudié, Alaa Saade, Alice Coucke, Marc Lelarge

    Abstract: We show how to learn low-dimensional representations (embeddings) of patient visits from the corresponding electronic health record (EHR) where International Classification of Diseases (ICD) diagnosis codes are removed. We expect that these embeddings will be useful for the construction of predictive statistical models anticipated to drive personalized medicine and improve healthcare quality. Thes… ▽ More

    Submitted 26 March, 2018; originally announced March 2018.

  18. arXiv:1801.02889  [pdf, ps, other

    cs.PF

    Optimal Content Replication and Request Matching in Large Caching Systems

    Authors: Arpan Mukhopadhyay, Nidhi Hegde, Marc Lelarge

    Abstract: We consider models of content delivery networks in which the servers are constrained by two main resources: memory and bandwidth. In such systems, the throughput crucially depends on how contents are replicated across servers and how the requests of specific contents are matched to servers storing those contents. In this paper, we first formulate the problem of computing the optimal replication po… ▽ More

    Submitted 9 January, 2018; originally announced January 2018.

    Comments: INFOCOM 2018

  19. arXiv:1712.04337  [pdf, ps, other

    cs.LG cs.SI

    A Streaming Algorithm for Graph Clustering

    Authors: Alexandre Hollocou, Julien Maudet, Thomas Bonald, Marc Lelarge

    Abstract: We introduce a novel algorithm to perform graph clustering in the edge streaming setting. In this model, the graph is presented as a sequence of edges that can be processed strictly once. Our streaming algorithm has an extremely low memory footprint as it stores only three integers per node and does not keep any edge in memory. We provide a theoretical justification of the design of the algorithm… ▽ More

    Submitted 9 December, 2017; originally announced December 2017.

    Comments: NIPS Wokshop on Advances in Modeling and Learning Interactions from Complex Data, 2017. arXiv admin note: substantial text overlap with arXiv:1703.02955

  20. Replica Bounds by Combinatorial Interpolation for Diluted Spin Systems

    Authors: Marc Lelarge, Mendes Oulamara

    Abstract: In two papers Franz, Leone and Toninelli proved bounds for the free energy of diluted random constraints satisfaction problems, for a Poisson degree distribution [5] and a general distribution [6]. Panchenko and Talagrand [16] simplified the proof and generalized the result of [5] for the Poisson case. We provide a new proof for the general degree distribution case and as a corollary, we obtain ne… ▽ More

    Submitted 17 January, 2018; v1 submitted 8 August, 2017; originally announced August 2017.

    Comments: Accepted in Journal of Statistical Physics

  21. arXiv:1703.02955  [pdf, other

    cs.SI physics.soc-ph

    A linear streaming algorithm for community detection in very large networks

    Authors: Alexandre Hollocou, Julien Maudet, Thomas Bonald, Marc Lelarge

    Abstract: In this paper, we introduce a novel community detection algorithm in graphs, called SCoDA (Streaming Community Detection Algorithm), based on an edge streaming setting. This algorithm has an extremely low memory footprint and a lightning-fast execution time as it only stores two integers per node and processes each edge strictly once. The approach is based on the following simple observation: if w… ▽ More

    Submitted 8 March, 2017; originally announced March 2017.

    Comments: Currently under review by an international conference

  22. arXiv:1701.08010  [pdf, other

    math.ST cond-mat.dis-nn cs.IT

    Statistical and computational phase transitions in spiked tensor estimation

    Authors: Thibault Lesieur, Léo Miolane, Marc Lelarge, Florent Krzakala, Lenka Zdeborová

    Abstract: We consider tensor factorizations using a generative model and a Bayesian approach. We compute rigorously the mutual information, the Minimal Mean Squared Error (MMSE), and unveil information-theoretic phase transitions. In addition, we study the performance of Approximate Message Passing (AMP) and show that it achieves the MMSE for a large set of parameters, and that factorization is algorithmica… ▽ More

    Submitted 16 December, 2017; v1 submitted 27 January, 2017; originally announced January 2017.

    Comments: 17 pages, 3 figures, 1 table

    Journal ref: IEEE International Symposium on Information Theory (ISIT), pp. 511-515 (2017)

  23. arXiv:1611.03888  [pdf, other

    math.PR

    Fundamental limits of symmetric low-rank matrix estimation

    Authors: Marc Lelarge, Léo Miolane

    Abstract: We consider the high-dimensional inference problem where the signal is a low-rank symmetric matrix which is corrupted by an additive Gaussian noise. Given a probabilistic model for the low-rank matrix, we compute the limit in the large dimension setting for the mutual information between the signal and the observations, as well as the matrix minimum mean square error, while the rank of the signal… ▽ More

    Submitted 30 March, 2017; v1 submitted 11 November, 2016; originally announced November 2016.

  24. arXiv:1610.08722  [pdf, other

    cs.SI physics.soc-ph

    Improving PageRank for Local Community Detection

    Authors: Alexandre Hollocou, Thomas Bonald, Marc Lelarge

    Abstract: Community detection is a classical problem in the field of graph mining. While most algorithms work on the entire graph, it is often interesting in practice to recover only the community containing some given set of seed nodes. In this paper, we propose a novel approach to this problem, using some low-dimensional embedding of the graph based on random walks starting from the seed nodes. From this… ▽ More

    Submitted 7 November, 2016; v1 submitted 27 October, 2016; originally announced October 2016.

    Comments: Currently under review by an international conference

  25. arXiv:1610.03680  [pdf, other

    math.PR

    Recovering asymmetric communities in the stochastic block model

    Authors: Francesco Caltagirone, Marc Lelarge, Léo Miolane

    Abstract: We consider the sparse stochastic block model in the case where the degrees are uninformative. The case where the two communities have approximately the same size has been extensively studied and we concentrate here on the community detection problem in the case of unbalanced communities. In this setting, spectral algorithms based on the non-backtracking matrix are known to solve the community det… ▽ More

    Submitted 31 March, 2017; v1 submitted 12 October, 2016; originally announced October 2016.

  26. arXiv:1609.02487  [pdf, ps, other

    math.PR cs.LG cs.SI stat.ML

    Non-Backtracking Spectrum of Degree-Corrected Stochastic Block Models

    Authors: Lennart Gulikers, Marc Lelarge, Laurent Massoulié

    Abstract: Motivated by community detection, we characterise the spectrum of the non-backtracking matrix $B$ in the Degree-Corrected Stochastic Block Model. Specifically, we consider a random graph on $n$ vertices partitioned into two equal-sized clusters. The vertices have i.i.d. weights $\{ φ_u \}_{u=1}^n$ with second moment $Φ^{(2)}$. The intra-cluster connection probability for vertices $u$ and $v$ is… ▽ More

    Submitted 18 May, 2017; v1 submitted 8 September, 2016; originally announced September 2016.

  27. arXiv:1606.00858  [pdf, other

    math.PR

    Impact of Community Structure on Cascades

    Authors: Mehrdad Moharrami, Vijay Subramanian, Mingyan Liu, Marc Lelarge

    Abstract: We study cascades under the threshold model on sparse random graphs with community structure. In this model, individuals adopt the new behavior based on how many neighbors have already chosen it. Specifically, we consider the permanent adoption model wherein individuals that have adopted the new behavior (or opinion) cannot change their state. We present a differential-equation-based tight approxi… ▽ More

    Submitted 4 May, 2022; v1 submitted 2 June, 2016; originally announced June 2016.

    MSC Class: 05C80

  28. arXiv:1605.06422  [pdf, other

    cs.LG math.PR math.ST stat.ML

    Fast Randomized Semi-Supervised Clustering

    Authors: Alaa Saade, Florent Krzakala, Marc Lelarge, Lenka Zdeborová

    Abstract: We consider the problem of clustering partially labeled data from a minimal number of randomly chosen pairwise comparisons between the items. We introduce an efficient local algorithm based on a power iteration of the non-backtracking operator and study its performance on a simple model. For the case of two clusters, we give bounds on the classification error and show that a small error can be ach… ▽ More

    Submitted 9 October, 2016; v1 submitted 20 May, 2016; originally announced May 2016.

    Journal ref: Journal of Physics: Conf. Series 1036 (2018) 012015

  29. arXiv:1601.06683  [pdf, other

    cs.SI cond-mat.dis-nn cs.LG

    Clustering from Sparse Pairwise Measurements

    Authors: Alaa Saade, Marc Lelarge, Florent Krzakala, Lenka Zdeborová

    Abstract: We consider the problem of grou** items into clusters based on few random pairwise comparisons between the items. We introduce three closely related algorithms for this task: a belief propagation algorithm approximating the Bayes optimal solution, and two spectral algorithms based on the non-backtracking and Bethe Hessian operators. For the case of two symmetric clusters, we conjecture that thes… ▽ More

    Submitted 19 May, 2016; v1 submitted 25 January, 2016; originally announced January 2016.

    Journal ref: Proceedings of the 2016 IEEE International Symposium on Information Theory (ISIT) Pages: 780 - 784

  30. arXiv:1511.00546  [pdf, ps, other

    math.PR cs.LG cs.SI stat.ML

    An Impossibility Result for Reconstruction in a Degree-Corrected Planted-Partition Model

    Authors: Lennart Gulikers, Marc Lelarge, Laurent Massoulié

    Abstract: We consider the Degree-Corrected Stochastic Block Model (DC-SBM): a random graph on $n$ nodes, having i.i.d. weights $(φ_u)_{u=1}^n$ (possibly heavy-tailed), partitioned into $q \geq 2$ asymptotically equal-sized clusters. The model parameters are two constants $a,b > 0$ and the finite second moment of the weights $Φ^{(2)}$. Vertices $u$ and $v$ are connected by an edge with probability… ▽ More

    Submitted 24 November, 2018; v1 submitted 2 November, 2015; originally announced November 2015.

    Comments: Appeared in Annals of Applied Probability

    Journal ref: Annals of Applied Probability - Volume 28, Number 5 (2018), 3002-3027

  31. arXiv:1507.04739  [pdf, ps, other

    math.CO cs.DM math-ph

    Counting matchings in irregular bipartite graphs and random lifts

    Authors: Marc Lelarge

    Abstract: We give a sharp lower bound on the number of matchings of a given size in a bipartite graph. When specialized to regular bipartite graphs, our results imply Friedland's Lower Matching Conjecture and Schrijver's theorem proven by Gurvits and Csikvari. Indeed, our work extends the recent work of Csikvari done for regular and bi-regular bipartite graphs. Moreover, our lower bounds are order optimal a… ▽ More

    Submitted 5 November, 2015; v1 submitted 16 July, 2015; originally announced July 2015.

    Comments: 26 pages, extended version (results for random lifts and more related work)

  32. arXiv:1506.08621  [pdf, other

    math.PR cs.LG cs.SI stat.ML

    A spectral method for community detection in moderately-sparse degree-corrected stochastic block models

    Authors: Lennart Gulikers, Marc Lelarge, Laurent Massoulié

    Abstract: We consider community detection in Degree-Corrected Stochastic Block Models (DC-SBM). We propose a spectral clustering algorithm based on a suitably normalized adjacency matrix. We show that this algorithm consistently recovers the block-membership of all but a vanishing fraction of nodes, in the regime where the lowest degree is of order log$(n)$ or higher. Recovery succeeds even for very heterog… ▽ More

    Submitted 7 February, 2017; v1 submitted 29 June, 2015; originally announced June 2015.

  33. arXiv:1506.04158  [pdf, other

    stat.ML

    A Spectral Algorithm with Additive Clustering for the Recovery of Overlap** Communities in Networks

    Authors: Emilie Kaufmann, Thomas Bonald, Marc Lelarge

    Abstract: This paper presents a novel spectral algorithm with additive clustering designed to identify overlap** communities in networks. The algorithm is based on geometric properties of the spectrum of the expected adjacency matrix in a random graph model that we call stochastic blockmodel with overlap (SBMO). An adaptive version of the algorithm, that does not require the knowledge of the number of hi… ▽ More

    Submitted 6 November, 2017; v1 submitted 12 June, 2015; originally announced June 2015.

    Comments: Journal of Theoretical Computer Science (TCS), Elsevier, A Paraître

  34. arXiv:1504.03156  [pdf, ps, other

    math.SP stat.ML

    Streaming, Memory Limited Matrix Completion with Noise

    Authors: Se-Young Yun, Marc Lelarge, Alexandre Proutiere

    Abstract: In this paper, we consider the streaming memory-limited matrix completion problem when the observed entries are noisy versions of a small random fraction of the original entries. We are interested in scenarios where the matrix size is very large so the matrix is very hard to store and manipulate. Here, columns of the observed matrix are presented sequentially and the goal is to complete the missin… ▽ More

    Submitted 13 April, 2015; originally announced April 2015.

    Comments: 21 pages

  35. arXiv:1502.04631  [pdf, other

    stat.ML

    Clustering and Inference From Pairwise Comparisons

    Authors: Rui Wu, Jiaming Xu, R. Srikant, Laurent Massoulié, Marc Lelarge, Bruce Hajek

    Abstract: Given a set of pairwise comparisons, the classical ranking problem computes a single ranking that best represents the preferences of all users. In this paper, we study the problem of inferring individual preferences, arising in the context of making personalized recommendations. In particular, we assume that there are $n$ users of $r$ types; users of the same type provide similar pairwise comparis… ▽ More

    Submitted 17 December, 2015; v1 submitted 16 February, 2015; originally announced February 2015.

    Comments: Corrected typos in the abstract

  36. arXiv:1502.03475  [pdf, other

    cs.LG math.OC stat.ML

    Combinatorial Bandits Revisited

    Authors: Richard Combes, M. Sadegh Talebi, Alexandre Proutiere, Marc Lelarge

    Abstract: This paper investigates stochastic and adversarial combinatorial multi-armed bandit problems. In the stochastic setting under semi-bandit feedback, we derive a problem-specific regret lower bound, and discuss its scaling with the dimension of the decision space. We propose ESCB, an algorithm that efficiently exploits the structure of the problem and provide a finite-time analysis of its regret. ES… ▽ More

    Submitted 5 November, 2015; v1 submitted 11 February, 2015; originally announced February 2015.

    Comments: 30 pages, Advances in Neural Information Processing Systems 28 (NIPS 2015)

  37. arXiv:1502.03365  [pdf, other

    stat.ML

    Reconstruction in the Labeled Stochastic Block Model

    Authors: Marc Lelarge, Laurent Massoulié, Jiaming Xu

    Abstract: The labeled stochastic block model is a random graph model representing networks with community structure and interactions of multiple types. In its simplest form, it consists of two communities of approximately equal size, and the edges are drawn and labeled at random with probability depending on whether their two endpoints belong to the same community or not. It has been conjectured in \cite{… ▽ More

    Submitted 11 February, 2015; originally announced February 2015.

    Comments: A preliminary version of this paper appeared in the Proceedings of the 2013 Information Theory Workshop

  38. arXiv:1502.00163  [pdf, other

    cs.SI cond-mat.dis-nn cs.LG math.PR

    Spectral Detection in the Censored Block Model

    Authors: Alaa Saade, Florent Krzakala, Marc Lelarge, Lenka Zdeborová

    Abstract: We consider the problem of partially recovering hidden binary variables from the observation of (few) censored edge weights, a problem with applications in community detection, correlation clustering and synchronization. We describe two spectral algorithms for this task based on the non-backtracking and the Bethe Hessian operators. These algorithms are shown to be asymptotically optimal for the pa… ▽ More

    Submitted 10 June, 2015; v1 submitted 31 January, 2015; originally announced February 2015.

    Comments: ISIT 2015

    Journal ref: IEEE International Symposium on Information Theory (ISIT), pp.1184-1188 (2015)

  39. arXiv:1501.06087  [pdf, other

    math.PR cs.SI

    Non-backtracking spectrum of random graphs: community detection and non-regular Ramanujan graphs

    Authors: Charles Bordenave, Marc Lelarge, Laurent Massoulié

    Abstract: A non-backtracking walk on a graph is a directed path such that no edge is the inverse of its preceding edge. The non-backtracking matrix of a graph is indexed by its directed edges and can be used to count non-backtracking walks of a given length. It has been used recently in the context of community detection and has appeared previously in connection with the Ihara zeta function and in some gene… ▽ More

    Submitted 22 April, 2015; v1 submitted 24 January, 2015; originally announced January 2015.

    Comments: 59 pages

    MSC Class: 05C80; 05C50; 91D30

  40. arXiv:1412.1004  [pdf, other

    math.CO cond-mat.stat-mech math.PR

    On rigidity, orientability and cores of random graphs with sliders

    Authors: Julien Barré, Marc Lelarge, Dieter Mitsche

    Abstract: Suppose that you add rigid bars between points in the plane, and suppose that a constant fraction $q$ of the points moves freely in the whole plane; the remaining fraction is constrained to move on fixed lines called sliders. When does a giant rigid cluster emerge? Under a genericity condition, the answer only depends on the graph formed by the points (vertices) and the bars (edges). We find for t… ▽ More

    Submitted 20 February, 2015; v1 submitted 2 December, 2014; originally announced December 2014.

    Comments: 32 pages, 1 figure

  41. arXiv:1411.1279  [pdf, ps, other

    cs.SI cs.DS

    Streaming, Memory Limited Algorithms for Community Detection

    Authors: Se-Young Yun, Marc Lelarge, Alexandre Proutiere

    Abstract: In this paper, we consider sparse networks consisting of a finite number of non-overlap** communities, i.e. disjoint clusters, so that there is higher density within clusters than across clusters. Both the intra- and inter-cluster edge densities vanish when the size of the graph grows large, making the cluster reconstruction problem nosier and hence difficult to solve. We are interested in scena… ▽ More

    Submitted 3 November, 2014; originally announced November 2014.

    Comments: NIPS 2014

  42. arXiv:1406.6897  [pdf, other

    math.ST stat.ML

    Edge Label Inference in Generalized Stochastic Block Models: from Spectral Theory to Impossibility Results

    Authors: Jiaming Xu, Laurent Massoulié, Marc Lelarge

    Abstract: The classical setting of community detection consists of networks exhibiting a clustered structure. To more accurately model real systems we consider a class of networks (i) whose edges may carry labels and (ii) which may lack a clustered structure. Specifically we assume that nodes possess latent attributes drawn from a general compact space and edges between two nodes are randomly generated and… ▽ More

    Submitted 26 June, 2014; originally announced June 2014.

    Comments: 17 pages

  43. arXiv:1401.7923  [pdf, ps, other

    cs.DM cs.DS cs.IT math-ph math.PR

    Loopy annealing belief propagation for vertex cover and matching: convergence, LP relaxation, correctness and Bethe approximation

    Authors: Marc Lelarge

    Abstract: For the minimum cardinality vertex cover and maximum cardinality matching problems, the max-product form of belief propagation (BP) is known to perform poorly on general graphs. In this paper, we present an iterative loopy annealing BP (LABP) algorithm which is shown to converge and to solve a Linear Programming relaxation of the vertex cover or matching problem on general graphs. LABP finds (asym… ▽ More

    Submitted 7 July, 2014; v1 submitted 30 January, 2014; originally announced January 2014.

    Comments: revised version, 23 pages

  44. arXiv:1401.1770  [pdf, ps, other

    cs.NI

    Adaptive Replication in Distributed Content Delivery Networks

    Authors: Mathieu Leconte, Marc Lelarge, Laurent Massoulié

    Abstract: We address the problem of content replication in large distributed content delivery networks, composed of a data center assisted by many small servers with limited capabilities and located at the edge of the network. The objective is to optimize the placement of contents on the servers to offload as much as possible the data center. We model the system constituted by the small servers as a loss ne… ▽ More

    Submitted 8 January, 2014; originally announced January 2014.

    Comments: 10 pages, 5 figures

  45. arXiv:1303.4325  [pdf, other

    math.PR

    Contagions in Random Networks with Overlap** Communities

    Authors: Emilie Coupechoux, Marc Lelarge

    Abstract: We consider a threshold epidemic model on a clustered random graph with overlap** communities. In other words, our epidemic model is such that an individual becomes infected as soon as the proportion of her infected neighbors exceeds the threshold q of the epidemic. In our random graph model, each individual can belong to several communities. The distributions for the community sizes and the num… ▽ More

    Submitted 31 January, 2014; v1 submitted 18 March, 2013; originally announced March 2013.

    Comments: Minor modifications for the second version: added comments (end of Section 3.2, beginning of Section 5.3); moved remark (end of Section 3.1, beginning of Section 4.1); corrected typos; changed title

    MSC Class: 60C05; 05C80; 91D30

  46. arXiv:1302.6974  [pdf, ps, other

    cs.LG cs.NI math.OC

    Spectrum Bandit Optimization

    Authors: Marc Lelarge, Alexandre Proutiere, M. Sadegh Talebi

    Abstract: We consider the problem of allocating radio channels to links in a wireless network. Links interact through interference, modelled as a conflict graph (i.e., two interfering links cannot be simultaneously active on the same channel). We aim at identifying the channel allocation maximizing the total network throughput over a finite time horizon. Should we know the average radio conditions on each c… ▽ More

    Submitted 17 February, 2015; v1 submitted 27 February, 2013; originally announced February 2013.

    Comments: 21 pages

  47. arXiv:1210.4839  [pdf

    cs.LG stat.ML

    Leveraging Side Observations in Stochastic Bandits

    Authors: Stephane Caron, Branislav Kveton, Marc Lelarge, Smriti Bhagat

    Abstract: This paper considers stochastic bandits with side observations, a model that accounts for both the exploration/exploitation dilemma and relationships between arms. In this setting, after pulling an arm i, the decision maker also observes the rewards for some other actions related to i. We will see that this model is suited to content recommendation in social networks, where users' reactions may be… ▽ More

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-142-151

  48. arXiv:1209.2910  [pdf, other

    cs.SI cs.LG math.PR physics.soc-ph

    Community Detection in the Labelled Stochastic Block Model

    Authors: Simon Heimlicher, Marc Lelarge, Laurent Massoulié

    Abstract: We consider the problem of community detection from observed interactions between individuals, in the context where multiple types of interaction are possible. We use labelled stochastic block models to represent the observed data, where labels correspond to interaction types. Focusing on a two-community scenario, we conjecture a threshold for the problem of reconstructing the hidden communities i… ▽ More

    Submitted 13 September, 2012; originally announced September 2012.

    Comments: 9 pages

  49. arXiv:1208.3994  [pdf, ps, other

    cs.GT cs.NI cs.SI

    Coordination in Network Security Games: a Monotone Comparative Statics Approach

    Authors: Marc Lelarge

    Abstract: Malicious softwares or malwares for short have become a major security threat. While originating in criminal behavior, their impact are also influenced by the decisions of legitimate end users. Getting agents in the Internet, and in networks in general, to invest in and deploy security features and protocols is a challenge, in particular because of economic reasons arising from the presence of net… ▽ More

    Submitted 20 August, 2012; originally announced August 2012.

    Comments: 10 pages, to appear in IEEE JSAC

  50. arXiv:1208.3629  [pdf, ps, other

    cs.DS cs.DM math.PR

    Sublinear-Time Algorithms for Monomer-Dimer Systems on Bounded Degree Graphs

    Authors: Marc Lelarge, Hang Zhou

    Abstract: For a graph $G$, let $Z(G,λ)$ be the partition function of the monomer-dimer system defined by $\sum_k m_k(G)λ^k$, where $m_k(G)$ is the number of matchings of size $k$ in $G$. We consider graphs of bounded degree and develop a sublinear-time algorithm for estimating $\log Z(G,λ)$ at an arbitrary value $λ>0$ within additive error $εn$ with high probability. The query complexity of our algorithm do… ▽ More

    Submitted 4 September, 2013; v1 submitted 17 August, 2012; originally announced August 2012.