Skip to main content

Showing 1–31 of 31 results for author: Chamberlain, B

.
  1. arXiv:2308.16712  [pdf, other

    physics.comp-ph cs.DC

    Implementing scalable matrix-vector products for the exact diagonalization methods in quantum many-body physics

    Authors: Tom Westerhout, Bradford L. Chamberlain

    Abstract: Exact diagonalization is a well-established method for simulating small quantum systems. Its applicability is limited by the exponential growth of the so-called Hamiltonian matrix that needs to be diagonalized. Physical symmetries are usually utilized to reduce the matrix dimension, and distributed-memory parallelism is employed to explore larger systems. This paper focuses on the implementation t… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: 11 pages, 9 figures

  2. arXiv:2211.04279  [pdf, other

    cs.CV

    Detecting Shortcuts in Medical Images -- A Case Study in Chest X-rays

    Authors: Amelia Jiménez-Sánchez, Dovile Juodelyte, Bethany Chamberlain, Veronika Cheplygina

    Abstract: The availability of large public datasets and the increased amount of computing power have shifted the interest of the medical community to high-performance algorithms. However, little attention is paid to the quality of the data and their annotations. High performance on benchmark datasets may be reported without considering possible shortcuts or artifacts in the data, besides, models are not tes… ▽ More

    Submitted 9 November, 2022; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: Submitted to ISBI 2023

  3. arXiv:2210.01542  [pdf, other

    cs.LG cs.AI

    Hyperbolic Deep Reinforcement Learning

    Authors: Edoardo Cetin, Benjamin Chamberlain, Michael Bronstein, Jonathan J Hunt

    Abstract: We propose a new class of deep reinforcement learning (RL) algorithms that model latent representations in hyperbolic space. Sequential decision-making requires reasoning about the possible future consequences of current behavior. Consequently, capturing the relationship between key evolving features for a given task is conducive to recovering effective policies. To this end, hyperbolic geometry p… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: Preprint

  4. arXiv:2210.00513  [pdf, other

    cs.LG stat.ML

    Gradient Gating for Deep Multi-Rate Learning on Graphs

    Authors: T. Konstantin Rusch, Benjamin P. Chamberlain, Michael W. Mahoney, Michael M. Bronstein, Siddhartha Mishra

    Abstract: We present Gradient Gating (G$^2$), a novel framework for improving the performance of Graph Neural Networks (GNNs). Our framework is based on gating the output of GNN layers with a mechanism for multi-rate flow of message passing information across nodes of the underlying graph. Local gradients are harnessed to further modulate message passing updates. Our framework flexibly allows one to use any… ▽ More

    Submitted 15 March, 2023; v1 submitted 2 October, 2022; originally announced October 2022.

  5. arXiv:2209.15486  [pdf, other

    cs.LG cs.IR

    Graph Neural Networks for Link Prediction with Subgraph Sketching

    Authors: Benjamin Paul Chamberlain, Sergey Shirobokov, Emanuele Rossi, Fabrizio Frasca, Thomas Markovich, Nils Hammerla, Michael M. Bronstein, Max Hansmire

    Abstract: Many Graph Neural Networks (GNNs) perform poorly compared to simple heuristics on Link Prediction (LP) tasks. This is due to limitations in expressive power such as the inability to count triangles (the backbone of most LP heuristics) and because they can not distinguish automorphic nodes (those having identical structural roles). Both expressiveness issues can be alleviated by learning link (rath… ▽ More

    Submitted 2 May, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: 29 pages, 19 figures, 6 appendices

    Journal ref: The Eleventh International Conference on Learning Representations 2023 (oral - top 5%)

  6. arXiv:2206.10991  [pdf, other

    cs.LG stat.ML

    Understanding convolution on graphs via energies

    Authors: Francesco Di Giovanni, James Rowbottom, Benjamin P. Chamberlain, Thomas Markovich, Michael M. Bronstein

    Abstract: Graph Neural Networks (GNNs) typically operate by message-passing, where the state of a node is updated based on the information received from its neighbours. Most message-passing models act as graph convolutions, where features are mixed by a shared, linear transformation before being propagated over the edges. On node-classification tasks, graph convolutions have been shown to suffer from two li… ▽ More

    Submitted 6 September, 2023; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: Accepted at TMLR; First two authors equal contribution; 35 pages

  7. arXiv:2202.04579  [pdf, other

    cs.LG math.AT

    Neural Sheaf Diffusion: A Topological Perspective on Heterophily and Oversmoothing in GNNs

    Authors: Cristian Bodnar, Francesco Di Giovanni, Benjamin Paul Chamberlain, Pietro Liò, Michael M. Bronstein

    Abstract: Cellular sheaves equip graphs with a "geometrical" structure by assigning vector spaces and linear maps to nodes and edges. Graph Neural Networks (GNNs) implicitly assume a graph with a trivial underlying sheaf. This choice is reflected in the structure of the graph Laplacian operator, the properties of the associated diffusion equation, and the characteristics of the convolutional models that dis… ▽ More

    Submitted 6 January, 2023; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: Accepted to NeurIPS 2022. Contains 29 pages, 10 figures

  8. arXiv:2202.02296  [pdf, other

    cs.LG math.DS stat.ML

    Graph-Coupled Oscillator Networks

    Authors: T. Konstantin Rusch, Benjamin P. Chamberlain, James Rowbottom, Siddhartha Mishra, Michael M. Bronstein

    Abstract: We propose Graph-Coupled Oscillator Networks (GraphCON), a novel framework for deep learning on graphs. It is based on discretizations of a second-order system of ordinary differential equations (ODEs), which model a network of nonlinear controlled and damped oscillators, coupled via the adjacency structure of the underlying graph. The flexibility of our framework permits any basic GNN layer (e.g.… ▽ More

    Submitted 23 June, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: ICML 2022

  9. arXiv:2111.14522  [pdf, other

    stat.ML cs.LG

    Understanding over-squashing and bottlenecks on graphs via curvature

    Authors: Jake Top**, Francesco Di Giovanni, Benjamin Paul Chamberlain, Xiaowen Dong, Michael M. Bronstein

    Abstract: Most graph neural networks (GNNs) use the message passing paradigm, in which node features are propagated on the input graph. Recent works pointed to the distortion of information flowing from distant nodes as a factor limiting the efficiency of message passing for tasks relying on long-distance interactions. This phenomenon, referred to as 'over-squashing', has been heuristically attributed to gr… ▽ More

    Submitted 12 November, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: First two authors equal contribution; outstanding paper honorable mention at ICLR22; 30 pages. v2: corrected Corollary 3 and rephrased in terms of graph diameter. Any other result is unaffected

  10. arXiv:2111.12128  [pdf, other

    cs.LG

    On the Unreasonable Effectiveness of Feature propagation in Learning on Graphs with Missing Node Features

    Authors: Emanuele Rossi, Henry Kenlay, Maria I. Gorinova, Benjamin Paul Chamberlain, Xiaowen Dong, Michael Bronstein

    Abstract: While Graph Neural Networks (GNNs) have recently become the de facto standard for modeling relational data, they impose a strong assumption on the availability of the node or edge features of the graph. In many real-world applications, however, features are only partially available; for example, in social networks, age and gender are available only for a small subset of users. We present a general… ▽ More

    Submitted 23 May, 2022; v1 submitted 23 November, 2021; originally announced November 2021.

  11. arXiv:2110.09443  [pdf, other

    cs.LG cs.AI stat.ML

    Beltrami Flow and Neural Diffusion on Graphs

    Authors: Benjamin Paul Chamberlain, James Rowbottom, Davide Eynard, Francesco Di Giovanni, Xiaowen Dong, Michael M Bronstein

    Abstract: We propose a novel class of graph neural networks based on the discretised Beltrami flow, a non-Euclidean diffusion PDE. In our model, node features are supplemented with positional encodings derived from the graph topology and jointly evolved by the Beltrami flow, producing simultaneously continuous feature learning and topology evolution. The resulting model generalises many popular graph neural… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: 21 pages, 5 figures. Proceedings of the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS) 2021

  12. arXiv:2109.08245  [pdf, other

    cs.SI

    The 2021 RecSys Challenge Dataset: Fairness is not optional

    Authors: Luca Belli, Alykhan Tejani, Frank Portman, Alexandre Lung-Yut-Fong, Ben Chamberlain, Yuanpu Xie, Kristian Lum, Jonathan Hunt, Michael Bronstein, Vito Walter Anelli, Saikishore Kalloori, Bruce Ferwerda, Wenzhe Shi

    Abstract: After the success the RecSys 2020 Challenge, we are describing a novel and bigger dataset that was released in conjunction with the ACM RecSys Challenge 2021. This year's dataset is not only bigger (~ 1B data points, a 5 fold increase), but for the first time it take into consideration fairness aspects of the challenge. Unlike many static datsets, a lot of effort went into making sure that the dat… ▽ More

    Submitted 21 September, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

  13. arXiv:2106.10934  [pdf, other

    cs.LG stat.ML

    GRAND: Graph Neural Diffusion

    Authors: Benjamin Paul Chamberlain, James Rowbottom, Maria Gorinova, Stefan Webb, Emanuele Rossi, Michael M. Bronstein

    Abstract: We present Graph Neural Diffusion (GRAND) that approaches deep learning on graphs as a continuous diffusion process and treats Graph Neural Networks (GNNs) as discretisations of an underlying PDE. In our model, the layer structure and topology correspond to the discretisation choices of temporal and spatial operators. Our approach allows a principled development of a broad new class of GNNs that a… ▽ More

    Submitted 22 September, 2021; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: 15 pages, 4 figures. Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021. Copyright 2021 by the author(s)

  14. arXiv:2009.12192  [pdf, other

    cs.IR cs.CL cs.LG

    Tuning Word2vec for Large Scale Recommendation Systems

    Authors: Benjamin P. Chamberlain, Emanuele Rossi, Dan Shiebler, Suvash Sedhain, Michael M. Bronstein

    Abstract: Word2vec is a powerful machine learning tool that emerged from Natural Lan-guage Processing (NLP) and is now applied in multiple domains, including recom-mender systems, forecasting, and network analysis. As Word2vec is often used offthe shelf, we address the question of whether the default hyperparameters are suit-able for recommender systems. The answer is emphatically no. In this paper, wefirst… ▽ More

    Submitted 24 September, 2020; originally announced September 2020.

    Comments: 11 pages, 4 figures, Fourteenth ACM Conference on Recommender Systems

    Journal ref: Fourteenth ACM Conference on Recommender Systems (RecSys '20), September 22--26, 2020, Virtual Event, Brazil

  15. arXiv:2006.10637  [pdf, other

    cs.LG stat.ML

    Temporal Graph Networks for Deep Learning on Dynamic Graphs

    Authors: Emanuele Rossi, Ben Chamberlain, Fabrizio Frasca, Davide Eynard, Federico Monti, Michael Bronstein

    Abstract: Graph Neural Networks (GNNs) have recently become increasingly popular due to their ability to learn complex systems of relations or interactions arising in a broad spectrum of problems ranging from biology and particle physics to social networks and recommendation systems. Despite the plethora of different models for deep learning on graphs, few approaches have been proposed thus far for dealing… ▽ More

    Submitted 9 October, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

  16. arXiv:2004.11198  [pdf, other

    cs.LG stat.ML

    SIGN: Scalable Inception Graph Neural Networks

    Authors: Fabrizio Frasca, Emanuele Rossi, Davide Eynard, Ben Chamberlain, Michael Bronstein, Federico Monti

    Abstract: Graph representation learning has recently been applied to a broad spectrum of problems ranging from computer graphics and chemistry to high energy physics and social media. The popularity of graph neural networks has sparked interest, both in academia and in industry, in develo** methods that scale to very large graphs such as Facebook or Twitter social networks. In most of these approaches, th… ▽ More

    Submitted 3 November, 2020; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: Extended experiments to ogbn-papers100M

  17. arXiv:1909.03457  [pdf, other

    stat.ME stat.AP

    What is the value of experimentation & measurement?

    Authors: C. H. Bryan Liu, Benjamin Paul Chamberlain

    Abstract: Experimentation and Measurement (E&M) capabilities allow organizations to accurately assess the impact of new propositions and to experiment with many variants of existing products. However, until now, the question of measuring the measurer, or valuing the contribution of an E&M capability to organizational success has not been addressed. We tackle this problem by analyzing how, by decreasing esti… ▽ More

    Submitted 8 September, 2019; originally announced September 2019.

    Comments: Accepted into IEEE International Conference on Data Mining (ICDM) 2019. Main paper: 6 pages, 3 figures; Supplementary document: 7 pages, 2 figures. Code available on: https://github.com/liuchbryan/value_of_experimentation

  18. arXiv:1904.00741  [pdf, other

    cs.CV cs.LG stat.ML

    Fashion Outfit Generation for E-commerce

    Authors: Elaine M. Bettaney, Stephen R. Hardwick, Odysseas Zisimopoulos, Benjamin Paul Chamberlain

    Abstract: Combining items of clothing into an outfit is a major task in fashion retail. Recommending sets of items that are compatible with a particular seed item is useful for providing users with guidance and inspiration, but is currently a manual process that requires expert stylists and is therefore not scalable or easy to personalise. We use a multilayer neural network fed by visual and textual feature… ▽ More

    Submitted 18 March, 2019; originally announced April 2019.

    Comments: 9 pages, 9 figures, 4 tables

  19. arXiv:1902.08648  [pdf, other

    cs.IR cs.LG stat.ML

    Scalable Hyperbolic Recommender Systems

    Authors: Benjamin Paul Chamberlain, Stephen R. Hardwick, David R. Wardrope, Fabon Dzogang, Fabio Daolio, Saúl Vargas

    Abstract: We present a large scale hyperbolic recommender system. We discuss why hyperbolic geometry is a more suitable underlying geometry for many recommendation systems and cover the fundamental milestones and insights that we have gained from its development. In doing so, we demonstrate the viability of hyperbolic geometry for recommender systems, showing that they significantly outperform Euclidean mod… ▽ More

    Submitted 22 February, 2019; originally announced February 2019.

    Comments: 11 pages, 8 figures, 2 tables

  20. arXiv:1807.04098  [pdf, other

    cs.LG cs.CY cs.IR cs.NE stat.ML

    A Recurrent Neural Network Survival Model: Predicting Web User Return Time

    Authors: Georg L. Grob, Ângelo Cardoso, C. H. Bryan Liu, Duncan A. Little, Benjamin Paul Chamberlain

    Abstract: The size of a website's active user base directly affects its value. Thus, it is important to monitor and influence a user's likelihood to return to a site. Essential to this is predicting when a user will return. Current state of the art approaches to solve this problem come in two flavors: (1) Recurrent Neural Network (RNN) based solutions and (2) survival analysis methods. We observe that both… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

    Comments: Accepted into ECML PKDD 2018; 8 figures and 1 table

    Journal ref: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2018. Lecture Notes in Computer Science, vol 11053. pp 152-168

  21. arXiv:1806.02588  [pdf, other

    stat.ME cs.DM stat.AP

    Designing Experiments to Measure Incrementality on Facebook

    Authors: C. H. Bryan Liu, Elaine M. Bettaney, Benjamin Paul Chamberlain

    Abstract: The importance of Facebook advertising has risen dramatically in recent years, with the platform accounting for almost 20% of the global online ad spend in 2017. An important consideration in advertising is incrementality: how much of the change in an experimental metric is an advertising campaign responsible for. To measure incrementality, Facebook provide lift studies. As Facebook lift studies d… ▽ More

    Submitted 11 July, 2018; v1 submitted 7 June, 2018; originally announced June 2018.

    Comments: Accepted into 2018 AdKDD & TargetAd Workshop in conjunction with KDD 2018; 6 pages, 4 figures, and 2 tables

  22. arXiv:1804.04095  [pdf, other

    cs.CL cs.AI cs.SI

    Predicting Twitter User Socioeconomic Attributes with Network and Language Information

    Authors: Nikolaos Aletras, Benjamin Paul Chamberlain

    Abstract: Inferring socioeconomic attributes of social media users such as occupation and income is an important problem in computational social science. Automated inference of such characteristics has applications in personalised recommender systems, targeted computational advertising and online political campaigning. While previous work has shown that language features can reliably predict socioeconomic a… ▽ More

    Submitted 11 April, 2018; originally announced April 2018.

    Comments: Accepted at ACM HT 2018

  23. arXiv:1803.06258  [pdf, other

    stat.ME cs.DM stat.AP

    Online Controlled Experiments for Personalised e-Commerce Strategies: Design, Challenges, and Pitfalls

    Authors: C. H. Bryan Liu, Benjamin Paul Chamberlain

    Abstract: Online controlled experiments are the primary tool for measuring the causal impact of product changes in digital businesses. It is increasingly common for digital products and services to interact with customers in a personalised way. Using online controlled experiments to optimise personalised interaction strategies is challenging because the usual assumption of statistically equivalent user grou… ▽ More

    Submitted 1 July, 2021; v1 submitted 16 March, 2018; originally announced March 2018.

    Comments: Not peer-reviewed but retained for historic interest. Removed an erroneous statement on Welch's t-test assumptions in Section 3.2. 9 pages, 7 figures

  24. Speeding Up BigClam Implementation on SNAP

    Authors: C. H. Bryan Liu, Benjamin Paul Chamberlain

    Abstract: We perform a detailed analysis of the C++ implementation of the Cluster Affiliation Model for Big Networks (BigClam) on the Stanford Network Analysis Project (SNAP). BigClam is a popular graph mining algorithm that is capable of finding overlap** communities in networks containing millions of nodes. Our analysis shows a key stage of the algorithm - determining if a node belongs to a community -… ▽ More

    Submitted 4 September, 2018; v1 submitted 4 December, 2017; originally announced December 2017.

    Comments: To appear in 2018 Imperial College Computing Student Workshop (ICCSW'18); 12 pages, 4 figures, and 3 tables

    Journal ref: 2018 Imperial College Computing Student Workshop (ICCSW 2018). OpenAccess Series in Informatics (OASIcs), vol. 66, pp. 1:1-1:13

  25. arXiv:1706.09865  [pdf, other

    stat.ML cs.CY cs.LG

    Generalising Random Forest Parameter Optimisation to Include Stability and Cost

    Authors: C. H. Bryan Liu, Benjamin Paul Chamberlain, Duncan A. Little, Angelo Cardoso

    Abstract: Random forests are among the most popular classification and regression methods used in industrial applications. To be effective, the parameters of random forests must be carefully tuned. This is usually done by choosing values that minimize the prediction error on a held out dataset. We argue that error reduction is only one of several metrics that must be considered when optimizing random forest… ▽ More

    Submitted 13 July, 2017; v1 submitted 29 June, 2017; originally announced June 2017.

    Comments: To appear in ECML-PKDD 2017

    Journal ref: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2017. LNCS vol 10536, pp. 102-113 (2017)

  26. arXiv:1705.10359  [pdf, other

    stat.ML cs.LG

    Neural Embeddings of Graphs in Hyperbolic Space

    Authors: Benjamin Paul Chamberlain, James Clough, Marc Peter Deisenroth

    Abstract: Neural embeddings have been used with great success in Natural Language Processing (NLP). They provide compact representations that encapsulate word similarity and attain state-of-the-art performance in a range of linguistic tasks. The success of neural embeddings has prompted significant amounts of research into applications in domains other than language. One such domain is graph-structured data… ▽ More

    Submitted 29 May, 2017; originally announced May 2017.

    Comments: 7 pages, 5 figures

    Journal ref: 13th international workshop on mining and learning from graphs held in conjunction with KDD, 2017

  27. arXiv:1703.02596  [pdf, other

    cs.LG cs.CY cs.IR cs.NE stat.ML

    Customer Lifetime Value Prediction Using Embeddings

    Authors: Benjamin Paul Chamberlain, Angelo Cardoso, C. H. Bryan Liu, Roberto Pagliari, Marc Peter Deisenroth

    Abstract: We describe the Customer LifeTime Value (CLTV) prediction system deployed at ASOS.com, a global online fashion retailer. CLTV prediction is an important problem in e-commerce where an accurate estimate of future value allows retailers to effectively allocate marketing spend, identify and nurture high value customers and mitigate exposure to losses. The system at ASOS provides daily estimates of th… ▽ More

    Submitted 6 July, 2017; v1 submitted 7 March, 2017; originally announced March 2017.

    Comments: 10 pages, 11 figures

    Journal ref: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Pages 1753-1762, 2017

  28. arXiv:1601.04621  [pdf, other

    cs.SI stat.ML

    Probabilistic Inference of Twitter Users' Age based on What They Follow

    Authors: Benjamin Paul Chamberlain, Clive Humby, Marc Peter Deisenroth

    Abstract: Twitter provides an open and rich source of data for studying human behaviour at scale and is widely used in social and network sciences. However, a major criticism of Twitter data is that demographic information is largely absent. Enhancing Twitter data with user ages would advance our ability to study social network structures, information flows and the spread of contagions. Approaches toward ag… ▽ More

    Submitted 24 February, 2017; v1 submitted 18 January, 2016; originally announced January 2016.

    Comments: 9 pages, 9 figures

  29. arXiv:1601.03958  [pdf, other

    cs.SI stat.ML

    Real-Time Community Detection in Large Social Networks on a Laptop

    Authors: Benjamin Paul Chamberlain, Josh Levy-Kramer, Clive Humby, Marc Peter Deisenroth

    Abstract: For a broad range of research, governmental and commercial applications it is important to understand the allegiances, communities and structure of key players in society. One promising direction towards extracting this information is to exploit the rich relational data in digital social networks (the social graph). As social media data sets are very large, most approaches make use of distributed… ▽ More

    Submitted 4 September, 2016; v1 submitted 15 January, 2016; originally announced January 2016.

  30. arXiv:1304.1146  [pdf

    cs.AI

    Analysis in HUGIN of Data Conflict

    Authors: Bo Chamberlain, Finn Verner Jensen, Frank Jensen, Torsten Nordahl

    Abstract: After a brief introduction to causal probabilistic networks and the HUGIN approach, the problem of conflicting data is discussed. A measure of conflict is defined, and it is used in the medical diagnostic system MUNIN. Finally, it is discussed how to distinguish between conflicting data and a rare case.

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Sixth Conference on Uncertainty in Artificial Intelligence (UAI1990)

    Report number: UAI-P-1990-PG-546-554

  31. arXiv:1208.3538  [pdf, other

    math.DS math.ST

    Inverse Modeling of Dynamical Systems: Multi-Dimensional Extensions of a Stochastic Switching Problem

    Authors: Erik Bates, Blake Chamberlain, Rachel Gettinger

    Abstract: The Buridan's ass paradox is characterized by perpetual indecision between two states, which are never attained. When this problem is formulated as a dynamical system, indecision is modeled by a discrete-state Markov process determined by the system's unknown parameters. Interest lies in estimating these parameters from a limited number of observations. We compare estimation methods and examine ho… ▽ More

    Submitted 17 August, 2012; originally announced August 2012.

    Comments: 40 pages, 6 figures. Work was completed by the authors while attending the SURIEM REU at Michigan State University. The authors were awarded an MAA Best Presentation Award at MathFest 2012 (Madison, WI) relating this material

    MSC Class: 62M86