Skip to main content

Showing 1–22 of 22 results for author: Rubin-Delanchy, P

.
  1. arXiv:2405.19230  [pdf, other

    stat.ML cs.LG

    Valid Conformal Prediction for Dynamic GNNs

    Authors: Ed Davis, Ian Gallagher, Daniel John Lawson, Patrick Rubin-Delanchy

    Abstract: Graph neural networks (GNNs) are powerful black-box models which have shown impressive empirical performance. However, without any form of uncertainty quantification, it can be difficult to trust such models in high-risk scenarios. Conformal prediction aims to address this problem, however, an assumption of exchangeability is required for its validity which has limited its applicability to static… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 17 pages, 8 figures

    MSC Class: 62H30

  2. arXiv:2311.09251  [pdf, other

    cs.SI cs.LG stat.ML

    A Simple and Powerful Framework for Stable Dynamic Network Embedding

    Authors: Ed Davis, Ian Gallagher, Daniel John Lawson, Patrick Rubin-Delanchy

    Abstract: In this paper, we address the problem of dynamic network embedding, that is, representing the nodes of a dynamic network as evolving vectors within a low-dimensional space. While the field of static network embedding is wide and established, the field of dynamic network embedding is comparatively in its infancy. We propose that a wide class of established static network embedding methods can be us… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 33 pages, 9 figures

    MSC Class: 62H15 (Primary) 62H30; 62M10; 62G99 (Secondary)

  3. arXiv:2306.06155  [pdf, other

    cs.LG stat.ME stat.ML

    Intensity Profile Projection: A Framework for Continuous-Time Representation Learning for Dynamic Networks

    Authors: Alexander Modell, Ian Gallagher, Emma Ceccherini, Nick Whiteley, Patrick Rubin-Delanchy

    Abstract: We present a new representation learning framework, Intensity Profile Projection, for continuous-time dynamic network data. Given triples $(i,j,t)$, each representing a time-stamped ($t$) interaction between two entities ($i,j$), our procedure returns a continuous-time trajectory for each node, representing its behaviour over time. The framework consists of three stages: estimating pairwise intens… ▽ More

    Submitted 17 January, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: 38 pages, 10 figures

    MSC Class: 62H12 (primary); 62H30 (secondary)

  4. arXiv:2305.15022  [pdf, other

    stat.ML cs.LG

    Hierarchical clustering with dot products recovers hidden tree structure

    Authors: Annie Gray, Alexander Modell, Patrick Rubin-Delanchy, Nick Whiteley

    Abstract: In this paper we offer a new perspective on the well established agglomerative clustering algorithm, focusing on recovery of hierarchical structure. We recommend a simple variant of the standard algorithm, in which clusters are merged by maximum average dot product and not, for example, by minimum distance or within-cluster variance. We demonstrate that the tree output by this algorithm provides a… ▽ More

    Submitted 1 March, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

  5. arXiv:2210.15277  [pdf, other

    stat.ML cs.LG

    Implications of sparsity and high triangle density for graph representation learning

    Authors: Hannah Sansford, Alexander Modell, Nick Whiteley, Patrick Rubin-Delanchy

    Abstract: Recent work has shown that sparse graphs containing many triangles cannot be reproduced using a finite-dimensional representation of the nodes, in which link probabilities are inner products. Here, we show that such graphs can be reproduced using an infinite-dimensional inner product model, where the node representations lie on a low-dimensional manifold. Recovering a global representation of the… ▽ More

    Submitted 21 April, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

  6. arXiv:2208.11665  [pdf, other

    stat.ME cs.LG stat.ML

    Statistical exploration of the Manifold Hypothesis

    Authors: Nick Whiteley, Annie Gray, Patrick Rubin-Delanchy

    Abstract: The Manifold Hypothesis is a widely accepted tenet of Machine Learning which asserts that nominally high-dimensional data are in fact concentrated near a low-dimensional manifold, embedded in high-dimensional space. This phenomenon is observed empirically in many real world situations, has led to development of a wide range of statistical methods in the last few decades, and has been suggested as… ▽ More

    Submitted 9 February, 2024; v1 submitted 24 August, 2022; originally announced August 2022.

    MSC Class: 62R20; 62R40; 62G05; 62G20; 62R07; 62-08; 62H25; 62H30

  7. arXiv:2202.03945  [pdf, other

    stat.ME stat.ML

    Spectral embedding and the latent geometry of multipartite networks

    Authors: Alexander Modell, Ian Gallagher, Joshua Cape, Patrick Rubin-Delanchy

    Abstract: Spectral embedding finds vector representations of the nodes of a network, based on the eigenvectors of its adjacency or Laplacian matrix, and has found applications throughout the sciences. Many such networks are multipartite, meaning their nodes can be divided into groups and nodes of the same group are never connected. When the network is multipartite, this paper demonstrates that the node repr… ▽ More

    Submitted 14 September, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: 13 pages, 5 figures, 2 tables

    MSC Class: 62H30; 62H12

  8. arXiv:2106.01282  [pdf, other

    stat.ML cs.LG

    Spectral embedding for dynamic networks with stability guarantees

    Authors: Ian Gallagher, Andrew Jones, Patrick Rubin-Delanchy

    Abstract: We consider the problem of embedding a dynamic network, to obtain time-evolving vector representations of each node, which can then be used to describe changes in behaviour of individual nodes, communities, or the entire graph. Given this open-ended remit, we argue that two types of stability in the spatio-temporal positioning of nodes are desirable: to assign the same position, up to noise, to no… ▽ More

    Submitted 20 January, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021

    MSC Class: 62M10; 62H30; 62G99

  9. arXiv:2106.01260  [pdf, other

    stat.ML cs.LG

    Matrix factorisation and the interpretation of geodesic distance

    Authors: Nick Whiteley, Annie Gray, Patrick Rubin-Delanchy

    Abstract: Given a graph or similarity matrix, we consider the problem of recovering a notion of true distance between the nodes, and so their true positions. We show that this can be accomplished in two steps: matrix factorisation, followed by nonlinear dimension reduction. This combination is effective because the point cloud obtained in the first step lives close to a manifold in which latent distance is… ▽ More

    Submitted 22 September, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    MSC Class: 62G05; 62H20; 62H12; 62H30

  10. arXiv:2105.00987  [pdf, other

    stat.ME stat.ML

    Spectral clustering under degree heterogeneity: a case for the random walk Laplacian

    Authors: Alexander Modell, Patrick Rubin-Delanchy

    Abstract: This paper shows that graph spectral embedding using the random walk Laplacian produces vector representations which are completely corrected for node degree. Under a generalised random dot product graph, the embedding provides uniformly consistent estimates of degree-corrected latent positions, with asymptotically Gaussian error. In the special case of a degree-corrected stochastic block model, t… ▽ More

    Submitted 4 May, 2021; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: 22 pages, 10 figures

    MSC Class: 62H30 (primary); 62H12 (secondary)

  11. Spectral clustering on spherical coordinates under the degree-corrected stochastic blockmodel

    Authors: Francesco Sanna Passino, Nicholas A. Heard, Patrick Rubin-Delanchy

    Abstract: Spectral clustering is a popular method for community detection in network graphs: starting from a matrix representation of the graph, the nodes are clustered on a low dimensional projection obtained from a truncated spectral decomposition of the matrix. Estimating correctly the number of communities and the dimension of the reduced latent space is critical for good performance of spectral cluster… ▽ More

    Submitted 8 September, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

    Journal ref: Technometrics 64(3), 346-357 (2022)

  12. arXiv:2007.10455  [pdf, other

    stat.ML cs.LG

    The multilayer random dot product graph

    Authors: Andrew Jones, Patrick Rubin-Delanchy

    Abstract: We present a comprehensive extension of the latent position network model known as the random dot product graph to accommodate multiple graphs -- both undirected and directed -- which share a common subset of nodes, and propose a method for jointly embedding the associated adjacency matrices, or submatrices thereof, into a suitable latent space. Theoretical results concerning the asymptotic behavi… ▽ More

    Submitted 25 January, 2021; v1 submitted 20 July, 2020; originally announced July 2020.

    Comments: 45 pages, 15 figures

  13. arXiv:2006.05168  [pdf, other

    stat.ML cs.LG

    Manifold structure in graph embeddings

    Authors: Patrick Rubin-Delanchy

    Abstract: Statistical analysis of a graph often starts with embedding, the process of representing its nodes as points in space. How to choose the embedding dimension is a nuanced decision in practice, but in theory a notion of true dimension is often available. In spectral embedding, this dimension may be very high. However, this paper shows that existing random graph models, including graphon and other la… ▽ More

    Submitted 5 January, 2021; v1 submitted 9 June, 2020; originally announced June 2020.

  14. arXiv:1912.10238   

    math.ST math.AT

    Persistent Homology of Graph Embeddings

    Authors: Vinesh Solanki, Patrick Rubin-Delanchy, Ian Gallagher

    Abstract: Popular network models such as the mixed membership and standard stochastic block model are known to exhibit distinct geometric structure when embedded into $\mathbb{R}^{d}$ using spectral methods. The resulting point cloud concentrates around a simplex in the first model, whereas it separates into clusters in the second. By adopting the formalism of generalised random dot-product graphs, we demon… ▽ More

    Submitted 14 October, 2021; v1 submitted 21 December, 2019; originally announced December 2019.

    Comments: The first author wishes to withdraw their authorship. The remaining authors wish to respect that decision, but do not want to claim full authorship of work that is only partially theirs

    MSC Class: 62G05; 62G10; 62G20

  15. arXiv:1910.05534  [pdf, other

    stat.ML cs.LG

    Spectral embedding of weighted graphs

    Authors: Ian Gallagher, Andrew Jones, Anna Bertiger, Carey Priebe, Patrick Rubin-Delanchy

    Abstract: When analyzing weighted networks using spectral embedding, a judicious transformation of the edge weights may produce better results. To formalize this idea, we consider the asymptotic behavior of spectral embedding for different edge-weight representations, under a generic low rank model. We measure the quality of different embeddings -- which can be on entirely different scales -- by how easy it… ▽ More

    Submitted 19 January, 2023; v1 submitted 12 October, 2019; originally announced October 2019.

    Comments: 27 pages, 5 figures

  16. arXiv:1709.05506  [pdf, other

    stat.ML cs.LG

    A statistical interpretation of spectral embedding: the generalised random dot product graph

    Authors: Patrick Rubin-Delanchy, Joshua Cape, Minh Tang, Carey E. Priebe

    Abstract: Spectral embedding is a procedure which can be used to obtain vector representations of the nodes of a graph. This paper proposes a generalisation of the latent position network model known as the random dot product graph, to allow interpretation of those vector representations as latent position estimates. The generalisation is needed to model heterophilic connectivity (e.g., `opposites attract')… ▽ More

    Submitted 16 November, 2021; v1 submitted 16 September, 2017; originally announced September 2017.

    Comments: 34 pages; 12 figures

    MSC Class: 62H30; 62H12; 62E20;

  17. Choosing Between Methods of Combining p-values

    Authors: Nicholas Heard, Patrick Rubin-Delanchy

    Abstract: Combining p-values from independent statistical tests is a popular approach to meta-analysis, particularly when the data underlying the tests are either no longer available or are difficult to combine. A diverse range of p-value combination methods appear in the literature, each with different statistical properties. Yet all too often the final choice used in a meta-analysis can appear arbitrary,… ▽ More

    Submitted 14 December, 2017; v1 submitted 21 July, 2017; originally announced July 2017.

  18. arXiv:1705.04518  [pdf, other

    stat.ME

    Consistency of adjacency spectral embedding for the mixed membership stochastic blockmodel

    Authors: Patrick Rubin-Delanchy, Carey E. Priebe, Minh Tang

    Abstract: The mixed membership stochastic blockmodel is a statistical model for a graph, which extends the stochastic blockmodel by allowing every node to randomly choose a different community each time a decision of whether to form an edge is made. Whereas spectral analysis for the stochastic blockmodel is increasingly well established, theory for the mixed membership case is considerably less developed. H… ▽ More

    Submitted 12 May, 2017; originally announced May 2017.

    Comments: 12 pages, 6 figures

  19. arXiv:1505.05068  [pdf, other

    math.ST

    Meta-analysis of mid-p-values: some new results based on the convex order

    Authors: Patrick Rubin-Delanchy, Nicholas A. Heard, Daniel John Lawson

    Abstract: The mid-p-value is a proposed improvement on the ordinary p-value for the case where the test statistic is partially or completely discrete. In this case, the ordinary p-value is conservative, meaning that its null distribution is larger than a uniform distribution on the unit interval, in the usual stochastic order. The mid-p-value is not conservative. However, its null distribution is dominated… ▽ More

    Submitted 31 May, 2017; v1 submitted 19 May, 2015; originally announced May 2015.

    Comments: 12 pages, 3 figures

    MSC Class: 62F03; 62G10

  20. arXiv:1412.3442  [pdf, other

    math.ST

    Posterior predictive p-values and the convex order

    Authors: Patrick Rubin-Delanchy, Daniel John Lawson

    Abstract: Posterior predictive p-values are a common approach to Bayesian model-checking. This article analyses their frequency behaviour, that is, their distribution when the parameters and the data are drawn from the prior and the model respectively. We show that the family of possible distributions is exactly described as the distributions that are less variable than uniform on [0,1], in the convex order… ▽ More

    Submitted 29 March, 2015; v1 submitted 10 December, 2014; originally announced December 2014.

    Comments: 14 pages, 3 figures

    MSC Class: 62F15; 62H15; 62G10

  21. arXiv:1408.3845  [pdf, other

    stat.ME

    A test for dependence between two point processes on the real line

    Authors: Patrick Rubin-Delanchy, Nicholas A. Heard

    Abstract: Many scientific questions rely on determining whether two sequences of event times are associated. This article introduces a likelihood ratio test which can be parameterised in several ways to detect different forms of dependence. A common finite-sample distribution is derived, and shown to be asymptotically related to a weighted Kolmogorov-Smirnov test. Analysis leading to these results also moti… ▽ More

    Submitted 20 December, 2014; v1 submitted 17 August, 2014; originally announced August 2014.

    Comments: 13 pages, 4 figures

    MSC Class: 62N03; 62P35; 62P30; 62M10; 62F03; 62F05

  22. arXiv:1110.1248  [pdf, ps, other

    stat.CO math.ST

    An algorithm to compute the power of Monte Carlo tests with guaranteed precision

    Authors: Axel Gandy, Patrick Rubin-Delanchy

    Abstract: This article presents an algorithm that generates a conservative confidence interval of a specified length and coverage probability for the power of a Monte Carlo test (such as a bootstrap or permutation test). It is the first method that achieves this aim for almost any Monte Carlo test. Previous research has focused on obtaining as accurate a result as possible for a fixed computational effort,… ▽ More

    Submitted 12 March, 2013; v1 submitted 6 October, 2011; originally announced October 2011.

    Comments: Published in at http://dx.doi.org/10.1214/12-AOS1076 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1076

    Journal ref: Annals of Statistics 2013, Vol. 41, No. 1, 125-142