Skip to main content

Showing 1–23 of 23 results for author: Chamberlain, B P

.
  1. arXiv:2210.00513  [pdf, other

    cs.LG stat.ML

    Gradient Gating for Deep Multi-Rate Learning on Graphs

    Authors: T. Konstantin Rusch, Benjamin P. Chamberlain, Michael W. Mahoney, Michael M. Bronstein, Siddhartha Mishra

    Abstract: We present Gradient Gating (G$^2$), a novel framework for improving the performance of Graph Neural Networks (GNNs). Our framework is based on gating the output of GNN layers with a mechanism for multi-rate flow of message passing information across nodes of the underlying graph. Local gradients are harnessed to further modulate message passing updates. Our framework flexibly allows one to use any… ▽ More

    Submitted 15 March, 2023; v1 submitted 2 October, 2022; originally announced October 2022.

  2. arXiv:2209.15486  [pdf, other

    cs.LG cs.IR

    Graph Neural Networks for Link Prediction with Subgraph Sketching

    Authors: Benjamin Paul Chamberlain, Sergey Shirobokov, Emanuele Rossi, Fabrizio Frasca, Thomas Markovich, Nils Hammerla, Michael M. Bronstein, Max Hansmire

    Abstract: Many Graph Neural Networks (GNNs) perform poorly compared to simple heuristics on Link Prediction (LP) tasks. This is due to limitations in expressive power such as the inability to count triangles (the backbone of most LP heuristics) and because they can not distinguish automorphic nodes (those having identical structural roles). Both expressiveness issues can be alleviated by learning link (rath… ▽ More

    Submitted 2 May, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: 29 pages, 19 figures, 6 appendices

    Journal ref: The Eleventh International Conference on Learning Representations 2023 (oral - top 5%)

  3. arXiv:2206.10991  [pdf, other

    cs.LG stat.ML

    Understanding convolution on graphs via energies

    Authors: Francesco Di Giovanni, James Rowbottom, Benjamin P. Chamberlain, Thomas Markovich, Michael M. Bronstein

    Abstract: Graph Neural Networks (GNNs) typically operate by message-passing, where the state of a node is updated based on the information received from its neighbours. Most message-passing models act as graph convolutions, where features are mixed by a shared, linear transformation before being propagated over the edges. On node-classification tasks, graph convolutions have been shown to suffer from two li… ▽ More

    Submitted 6 September, 2023; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: Accepted at TMLR; First two authors equal contribution; 35 pages

  4. arXiv:2202.04579  [pdf, other

    cs.LG math.AT

    Neural Sheaf Diffusion: A Topological Perspective on Heterophily and Oversmoothing in GNNs

    Authors: Cristian Bodnar, Francesco Di Giovanni, Benjamin Paul Chamberlain, Pietro Liò, Michael M. Bronstein

    Abstract: Cellular sheaves equip graphs with a "geometrical" structure by assigning vector spaces and linear maps to nodes and edges. Graph Neural Networks (GNNs) implicitly assume a graph with a trivial underlying sheaf. This choice is reflected in the structure of the graph Laplacian operator, the properties of the associated diffusion equation, and the characteristics of the convolutional models that dis… ▽ More

    Submitted 6 January, 2023; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: Accepted to NeurIPS 2022. Contains 29 pages, 10 figures

  5. arXiv:2202.02296  [pdf, other

    cs.LG math.DS stat.ML

    Graph-Coupled Oscillator Networks

    Authors: T. Konstantin Rusch, Benjamin P. Chamberlain, James Rowbottom, Siddhartha Mishra, Michael M. Bronstein

    Abstract: We propose Graph-Coupled Oscillator Networks (GraphCON), a novel framework for deep learning on graphs. It is based on discretizations of a second-order system of ordinary differential equations (ODEs), which model a network of nonlinear controlled and damped oscillators, coupled via the adjacency structure of the underlying graph. The flexibility of our framework permits any basic GNN layer (e.g.… ▽ More

    Submitted 23 June, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: ICML 2022

  6. arXiv:2111.14522  [pdf, other

    stat.ML cs.LG

    Understanding over-squashing and bottlenecks on graphs via curvature

    Authors: Jake Top**, Francesco Di Giovanni, Benjamin Paul Chamberlain, Xiaowen Dong, Michael M. Bronstein

    Abstract: Most graph neural networks (GNNs) use the message passing paradigm, in which node features are propagated on the input graph. Recent works pointed to the distortion of information flowing from distant nodes as a factor limiting the efficiency of message passing for tasks relying on long-distance interactions. This phenomenon, referred to as 'over-squashing', has been heuristically attributed to gr… ▽ More

    Submitted 12 November, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: First two authors equal contribution; outstanding paper honorable mention at ICLR22; 30 pages. v2: corrected Corollary 3 and rephrased in terms of graph diameter. Any other result is unaffected

  7. arXiv:2111.12128  [pdf, other

    cs.LG

    On the Unreasonable Effectiveness of Feature propagation in Learning on Graphs with Missing Node Features

    Authors: Emanuele Rossi, Henry Kenlay, Maria I. Gorinova, Benjamin Paul Chamberlain, Xiaowen Dong, Michael Bronstein

    Abstract: While Graph Neural Networks (GNNs) have recently become the de facto standard for modeling relational data, they impose a strong assumption on the availability of the node or edge features of the graph. In many real-world applications, however, features are only partially available; for example, in social networks, age and gender are available only for a small subset of users. We present a general… ▽ More

    Submitted 23 May, 2022; v1 submitted 23 November, 2021; originally announced November 2021.

  8. arXiv:2110.09443  [pdf, other

    cs.LG cs.AI stat.ML

    Beltrami Flow and Neural Diffusion on Graphs

    Authors: Benjamin Paul Chamberlain, James Rowbottom, Davide Eynard, Francesco Di Giovanni, Xiaowen Dong, Michael M Bronstein

    Abstract: We propose a novel class of graph neural networks based on the discretised Beltrami flow, a non-Euclidean diffusion PDE. In our model, node features are supplemented with positional encodings derived from the graph topology and jointly evolved by the Beltrami flow, producing simultaneously continuous feature learning and topology evolution. The resulting model generalises many popular graph neural… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: 21 pages, 5 figures. Proceedings of the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS) 2021

  9. arXiv:2106.10934  [pdf, other

    cs.LG stat.ML

    GRAND: Graph Neural Diffusion

    Authors: Benjamin Paul Chamberlain, James Rowbottom, Maria Gorinova, Stefan Webb, Emanuele Rossi, Michael M. Bronstein

    Abstract: We present Graph Neural Diffusion (GRAND) that approaches deep learning on graphs as a continuous diffusion process and treats Graph Neural Networks (GNNs) as discretisations of an underlying PDE. In our model, the layer structure and topology correspond to the discretisation choices of temporal and spatial operators. Our approach allows a principled development of a broad new class of GNNs that a… ▽ More

    Submitted 22 September, 2021; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: 15 pages, 4 figures. Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021. Copyright 2021 by the author(s)

  10. arXiv:2009.12192  [pdf, other

    cs.IR cs.CL cs.LG

    Tuning Word2vec for Large Scale Recommendation Systems

    Authors: Benjamin P. Chamberlain, Emanuele Rossi, Dan Shiebler, Suvash Sedhain, Michael M. Bronstein

    Abstract: Word2vec is a powerful machine learning tool that emerged from Natural Lan-guage Processing (NLP) and is now applied in multiple domains, including recom-mender systems, forecasting, and network analysis. As Word2vec is often used offthe shelf, we address the question of whether the default hyperparameters are suit-able for recommender systems. The answer is emphatically no. In this paper, wefirst… ▽ More

    Submitted 24 September, 2020; originally announced September 2020.

    Comments: 11 pages, 4 figures, Fourteenth ACM Conference on Recommender Systems

    Journal ref: Fourteenth ACM Conference on Recommender Systems (RecSys '20), September 22--26, 2020, Virtual Event, Brazil

  11. arXiv:1909.03457  [pdf, other

    stat.ME stat.AP

    What is the value of experimentation & measurement?

    Authors: C. H. Bryan Liu, Benjamin Paul Chamberlain

    Abstract: Experimentation and Measurement (E&M) capabilities allow organizations to accurately assess the impact of new propositions and to experiment with many variants of existing products. However, until now, the question of measuring the measurer, or valuing the contribution of an E&M capability to organizational success has not been addressed. We tackle this problem by analyzing how, by decreasing esti… ▽ More

    Submitted 8 September, 2019; originally announced September 2019.

    Comments: Accepted into IEEE International Conference on Data Mining (ICDM) 2019. Main paper: 6 pages, 3 figures; Supplementary document: 7 pages, 2 figures. Code available on: https://github.com/liuchbryan/value_of_experimentation

  12. arXiv:1904.00741  [pdf, other

    cs.CV cs.LG stat.ML

    Fashion Outfit Generation for E-commerce

    Authors: Elaine M. Bettaney, Stephen R. Hardwick, Odysseas Zisimopoulos, Benjamin Paul Chamberlain

    Abstract: Combining items of clothing into an outfit is a major task in fashion retail. Recommending sets of items that are compatible with a particular seed item is useful for providing users with guidance and inspiration, but is currently a manual process that requires expert stylists and is therefore not scalable or easy to personalise. We use a multilayer neural network fed by visual and textual feature… ▽ More

    Submitted 18 March, 2019; originally announced April 2019.

    Comments: 9 pages, 9 figures, 4 tables

  13. arXiv:1902.08648  [pdf, other

    cs.IR cs.LG stat.ML

    Scalable Hyperbolic Recommender Systems

    Authors: Benjamin Paul Chamberlain, Stephen R. Hardwick, David R. Wardrope, Fabon Dzogang, Fabio Daolio, Saúl Vargas

    Abstract: We present a large scale hyperbolic recommender system. We discuss why hyperbolic geometry is a more suitable underlying geometry for many recommendation systems and cover the fundamental milestones and insights that we have gained from its development. In doing so, we demonstrate the viability of hyperbolic geometry for recommender systems, showing that they significantly outperform Euclidean mod… ▽ More

    Submitted 22 February, 2019; originally announced February 2019.

    Comments: 11 pages, 8 figures, 2 tables

  14. arXiv:1807.04098  [pdf, other

    cs.LG cs.CY cs.IR cs.NE stat.ML

    A Recurrent Neural Network Survival Model: Predicting Web User Return Time

    Authors: Georg L. Grob, Ângelo Cardoso, C. H. Bryan Liu, Duncan A. Little, Benjamin Paul Chamberlain

    Abstract: The size of a website's active user base directly affects its value. Thus, it is important to monitor and influence a user's likelihood to return to a site. Essential to this is predicting when a user will return. Current state of the art approaches to solve this problem come in two flavors: (1) Recurrent Neural Network (RNN) based solutions and (2) survival analysis methods. We observe that both… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

    Comments: Accepted into ECML PKDD 2018; 8 figures and 1 table

    Journal ref: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2018. Lecture Notes in Computer Science, vol 11053. pp 152-168

  15. arXiv:1806.02588  [pdf, other

    stat.ME cs.DM stat.AP

    Designing Experiments to Measure Incrementality on Facebook

    Authors: C. H. Bryan Liu, Elaine M. Bettaney, Benjamin Paul Chamberlain

    Abstract: The importance of Facebook advertising has risen dramatically in recent years, with the platform accounting for almost 20% of the global online ad spend in 2017. An important consideration in advertising is incrementality: how much of the change in an experimental metric is an advertising campaign responsible for. To measure incrementality, Facebook provide lift studies. As Facebook lift studies d… ▽ More

    Submitted 11 July, 2018; v1 submitted 7 June, 2018; originally announced June 2018.

    Comments: Accepted into 2018 AdKDD & TargetAd Workshop in conjunction with KDD 2018; 6 pages, 4 figures, and 2 tables

  16. arXiv:1804.04095  [pdf, other

    cs.CL cs.AI cs.SI

    Predicting Twitter User Socioeconomic Attributes with Network and Language Information

    Authors: Nikolaos Aletras, Benjamin Paul Chamberlain

    Abstract: Inferring socioeconomic attributes of social media users such as occupation and income is an important problem in computational social science. Automated inference of such characteristics has applications in personalised recommender systems, targeted computational advertising and online political campaigning. While previous work has shown that language features can reliably predict socioeconomic a… ▽ More

    Submitted 11 April, 2018; originally announced April 2018.

    Comments: Accepted at ACM HT 2018

  17. arXiv:1803.06258  [pdf, other

    stat.ME cs.DM stat.AP

    Online Controlled Experiments for Personalised e-Commerce Strategies: Design, Challenges, and Pitfalls

    Authors: C. H. Bryan Liu, Benjamin Paul Chamberlain

    Abstract: Online controlled experiments are the primary tool for measuring the causal impact of product changes in digital businesses. It is increasingly common for digital products and services to interact with customers in a personalised way. Using online controlled experiments to optimise personalised interaction strategies is challenging because the usual assumption of statistically equivalent user grou… ▽ More

    Submitted 1 July, 2021; v1 submitted 16 March, 2018; originally announced March 2018.

    Comments: Not peer-reviewed but retained for historic interest. Removed an erroneous statement on Welch's t-test assumptions in Section 3.2. 9 pages, 7 figures

  18. Speeding Up BigClam Implementation on SNAP

    Authors: C. H. Bryan Liu, Benjamin Paul Chamberlain

    Abstract: We perform a detailed analysis of the C++ implementation of the Cluster Affiliation Model for Big Networks (BigClam) on the Stanford Network Analysis Project (SNAP). BigClam is a popular graph mining algorithm that is capable of finding overlap** communities in networks containing millions of nodes. Our analysis shows a key stage of the algorithm - determining if a node belongs to a community -… ▽ More

    Submitted 4 September, 2018; v1 submitted 4 December, 2017; originally announced December 2017.

    Comments: To appear in 2018 Imperial College Computing Student Workshop (ICCSW'18); 12 pages, 4 figures, and 3 tables

    Journal ref: 2018 Imperial College Computing Student Workshop (ICCSW 2018). OpenAccess Series in Informatics (OASIcs), vol. 66, pp. 1:1-1:13

  19. arXiv:1706.09865  [pdf, other

    stat.ML cs.CY cs.LG

    Generalising Random Forest Parameter Optimisation to Include Stability and Cost

    Authors: C. H. Bryan Liu, Benjamin Paul Chamberlain, Duncan A. Little, Angelo Cardoso

    Abstract: Random forests are among the most popular classification and regression methods used in industrial applications. To be effective, the parameters of random forests must be carefully tuned. This is usually done by choosing values that minimize the prediction error on a held out dataset. We argue that error reduction is only one of several metrics that must be considered when optimizing random forest… ▽ More

    Submitted 13 July, 2017; v1 submitted 29 June, 2017; originally announced June 2017.

    Comments: To appear in ECML-PKDD 2017

    Journal ref: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2017. LNCS vol 10536, pp. 102-113 (2017)

  20. arXiv:1705.10359  [pdf, other

    stat.ML cs.LG

    Neural Embeddings of Graphs in Hyperbolic Space

    Authors: Benjamin Paul Chamberlain, James Clough, Marc Peter Deisenroth

    Abstract: Neural embeddings have been used with great success in Natural Language Processing (NLP). They provide compact representations that encapsulate word similarity and attain state-of-the-art performance in a range of linguistic tasks. The success of neural embeddings has prompted significant amounts of research into applications in domains other than language. One such domain is graph-structured data… ▽ More

    Submitted 29 May, 2017; originally announced May 2017.

    Comments: 7 pages, 5 figures

    Journal ref: 13th international workshop on mining and learning from graphs held in conjunction with KDD, 2017

  21. arXiv:1703.02596  [pdf, other

    cs.LG cs.CY cs.IR cs.NE stat.ML

    Customer Lifetime Value Prediction Using Embeddings

    Authors: Benjamin Paul Chamberlain, Angelo Cardoso, C. H. Bryan Liu, Roberto Pagliari, Marc Peter Deisenroth

    Abstract: We describe the Customer LifeTime Value (CLTV) prediction system deployed at ASOS.com, a global online fashion retailer. CLTV prediction is an important problem in e-commerce where an accurate estimate of future value allows retailers to effectively allocate marketing spend, identify and nurture high value customers and mitigate exposure to losses. The system at ASOS provides daily estimates of th… ▽ More

    Submitted 6 July, 2017; v1 submitted 7 March, 2017; originally announced March 2017.

    Comments: 10 pages, 11 figures

    Journal ref: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Pages 1753-1762, 2017

  22. arXiv:1601.04621  [pdf, other

    cs.SI stat.ML

    Probabilistic Inference of Twitter Users' Age based on What They Follow

    Authors: Benjamin Paul Chamberlain, Clive Humby, Marc Peter Deisenroth

    Abstract: Twitter provides an open and rich source of data for studying human behaviour at scale and is widely used in social and network sciences. However, a major criticism of Twitter data is that demographic information is largely absent. Enhancing Twitter data with user ages would advance our ability to study social network structures, information flows and the spread of contagions. Approaches toward ag… ▽ More

    Submitted 24 February, 2017; v1 submitted 18 January, 2016; originally announced January 2016.

    Comments: 9 pages, 9 figures

  23. arXiv:1601.03958  [pdf, other

    cs.SI stat.ML

    Real-Time Community Detection in Large Social Networks on a Laptop

    Authors: Benjamin Paul Chamberlain, Josh Levy-Kramer, Clive Humby, Marc Peter Deisenroth

    Abstract: For a broad range of research, governmental and commercial applications it is important to understand the allegiances, communities and structure of key players in society. One promising direction towards extracting this information is to exploit the rich relational data in digital social networks (the social graph). As social media data sets are very large, most approaches make use of distributed… ▽ More

    Submitted 4 September, 2016; v1 submitted 15 January, 2016; originally announced January 2016.