Skip to main content

Showing 1–46 of 46 results for author: Hamilton, W L

.
  1. arXiv:2405.01616  [pdf, other

    q-bio.BM cs.AI cs.LG

    Generative Active Learning for the Search of Small-molecule Protein Binders

    Authors: Maksym Korablyov, Cheng-Hao Liu, Moksh Jain, Almer M. van der Sloot, Eric Jolicoeur, Edward Ruediger, Andrei Cristian Nica, Emmanuel Bengio, Kostiantyn Lapchevskyi, Daniel St-Cyr, Doris Alexandra Schuetz, Victor Ion Butoi, Jarrid Rector-Brooks, Simon Blackburn, Leo Feng, Hadi Nekoei, SaiKrishna Gottipati, Priyesh Vijayan, Prateek Gupta, Ladislav Rampášek, Sasikanth Avancha, Pierre-Luc Bacon, William L. Hamilton, Brooks Paige, Sanchit Misra , et al. (9 additional authors not shown)

    Abstract: Despite substantial progress in machine learning for scientific discovery in recent years, truly de novo design of small molecules which exhibit a property of interest remains a significant challenge. We introduce LambdaZero, a generative active learning approach to search for synthesizable molecules. Powered by deep reinforcement learning, LambdaZero learns to search over the vast space of molecu… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  2. arXiv:2109.09432  [pdf, ps, other

    cs.LG stat.ML

    Edge-similarity-aware Graph Neural Networks

    Authors: Vincent Mallet, Carlos G. Oliver, William L. Hamilton

    Abstract: Graph are a ubiquitous data representation, as they represent a flexible and compact representation. For instance, the 3D structure of RNA can be efficiently represented as $\textit{2.5D graphs}$, graphs whose nodes are nucleotides and edges represent chemical interactions. In this setting, we have biological evidence of the similarity between the edge types, as some chemical interactions are more… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

  3. RNAglib: A Python Package for RNA 2.5D Graphs

    Authors: Vincent Mallet, Carlos Oliver, Jonathan Broadbent, William L. Hamilton, Jérôme Waldispühl

    Abstract: RNA 3D architectures are stabilized by sophisticated networks of (non-canonical) base pair interactions, which can be conveniently encoded as multi-relational graphs and efficiently exploited by graph theoretical approaches and recent progresses in machine learning techniques. RNAglib is a library that eases the use of this representation, by providing clean data, methods to load it in machine lea… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

  4. Ego-GNNs: Exploiting Ego Structures in Graph Neural Networks

    Authors: Dylan Sandfelder, Priyesh Vijayan, William L. Hamilton

    Abstract: Graph neural networks (GNNs) have achieved remarkable success as a framework for deep learning on graph-structured data. However, GNNs are fundamentally limited by their tree-structured inductive bias: the WL-subtree kernel formulation bounds the representational capacity of GNNs, and polynomial-time GNNs are provably incapable of recognizing triangles in a graph. In this work, we propose to augme… ▽ More

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: Submitted to a special session of IEEE-ICASSP 2021

    Journal ref: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, pp. 8523-8527

  5. arXiv:2106.12144  [pdf, other

    cs.CL cs.AI cs.LG

    NodePiece: Compositional and Parameter-Efficient Representations of Large Knowledge Graphs

    Authors: Mikhail Galkin, Etienne Denis, Jiapeng Wu, William L. Hamilton

    Abstract: Conventional representation learning algorithms for knowledge graphs (KG) map each entity to a unique embedding vector. Such a shallow lookup results in a linear growth of memory consumption for storing the embedding matrix and incurs high computational costs when working with real-world KGs. Drawing parallels with subword tokenization commonly used in NLP, we explore the landscape of more paramet… ▽ More

    Submitted 1 February, 2022; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: Accepted to ICLR 2022

  6. arXiv:2106.03893  [pdf, other

    cs.LG

    Rethinking Graph Transformers with Spectral Attention

    Authors: Devin Kreuzer, Dominique Beaini, William L. Hamilton, Vincent Létourneau, Prudencio Tossou

    Abstract: In recent years, the Transformer architecture has proven to be very successful in sequence processing, but its application to other data structures, such as graphs, has remained limited due to the difficulty of properly defining positions. Here, we present the $\textit{Spectral Attention Network}$ (SAN), which uses a learned positional encoding (LPE) that can take advantage of the full Laplacian s… ▽ More

    Submitted 27 October, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: Accepted in Proceedings of NeurIPS 2021

  7. arXiv:2105.10488  [pdf, other

    q-bio.BM cs.AI cs.LG

    Understanding the Performance of Knowledge Graph Embeddings in Drug Discovery

    Authors: Stephen Bonner, Ian P Barrett, Cheng Ye, Rowan Swiers, Ola Engkvist, Charles Tapley Hoyt, William L Hamilton

    Abstract: Knowledge Graphs (KG) and associated Knowledge Graph Embedding (KGE) models have recently begun to be explored in the context of drug discovery and have the potential to assist in key challenges such as target identification. In the drug discovery domain, KGs can be employed as part of a process which can result in lab-based experiments being performed, or impact on other decisions, incurring sign… ▽ More

    Submitted 23 May, 2022; v1 submitted 17 May, 2021; originally announced May 2021.

    Journal ref: Artificial Intelligence in the Life Sciences (2022): 100036

  8. arXiv:2103.02014  [pdf, other

    cs.LG cs.CR cs.DS

    Online Adversarial Attacks

    Authors: Andjela Mladenovic, Avishek Joey Bose, Hugo Berard, William L. Hamilton, Simon Lacoste-Julien, Pascal Vincent, Gauthier Gidel

    Abstract: Adversarial attacks expose important vulnerabilities of deep learning models, yet little attention has been paid to settings where data arrives as a stream. In this paper, we formalize the online adversarial attack problem, emphasizing two key elements found in real-world use-cases: attackers must operate under partial knowledge of the target model, and the decisions made by the attacker are irrev… ▽ More

    Submitted 22 March, 2022; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: ICLR 2022

  9. A Review of Biomedical Datasets Relating to Drug Discovery: A Knowledge Graph Perspective

    Authors: Stephen Bonner, Ian P Barrett, Cheng Ye, Rowan Swiers, Ola Engkvist, Andreas Bender, Charles Tapley Hoyt, William L Hamilton

    Abstract: Drug discovery and development is a complex and costly process. Machine learning approaches are being investigated to help improve the effectiveness and speed of multiple stages of the drug discovery pipeline. Of these, those that use Knowledge Graphs (KG) have promise in many tasks, including drug repurposing, drug toxicity prediction and target gene-disease prioritisation. In a drug discovery KG… ▽ More

    Submitted 26 November, 2021; v1 submitted 19 February, 2021; originally announced February 2021.

    Journal ref: Briefings in Bioinformatics, 2022

  10. arXiv:2102.03419  [pdf, other

    cs.AI cs.CL cs.IR cs.LG cs.SI

    Exploring the Limits of Few-Shot Link Prediction in Knowledge Graphs

    Authors: Dora Jambor, Komal Teru, Joelle Pineau, William L. Hamilton

    Abstract: Real-world knowledge graphs are often characterized by low-frequency relations - a challenge that has prompted an increasing interest in few-shot link prediction methods. These methods perform link prediction for a set of new relations, unseen during training, given only a few example facts of each relation at test time. In this work, we perform a systematic study on a spectrum of models derived b… ▽ More

    Submitted 5 February, 2021; originally announced February 2021.

    Comments: code available at https://github.com/dorajam/few-shot-link-prediction-paper

    Journal ref: European Chapter of the ACL (EACL), 2021

  11. arXiv:2102.00925  [pdf, other

    q-bio.BM cs.LG

    Neural representation and generation for RNA secondary structures

    Authors: Zichao Yan, William L. Hamilton, Mathieu Blanchette

    Abstract: Our work is concerned with the generation and targeted design of RNA, a type of genetic macromolecule that can adopt complex structures which influence their cellular activities and functions. The design of large scale and complex biological structures spurs dedicated graph-based deep generative modeling techniques, which represents a key but underappreciated aspect of computational drug discovery… ▽ More

    Submitted 1 February, 2021; originally announced February 2021.

  12. arXiv:2101.00408  [pdf, other

    cs.CL cs.AI

    End-to-End Training of Neural Retrievers for Open-Domain Question Answering

    Authors: Devendra Singh Sachan, Mostofa Patwary, Mohammad Shoeybi, Neel Kant, Wei **, William L Hamilton, Bryan Catanzaro

    Abstract: Recent work on training neural retrievers for open-domain question answering (OpenQA) has employed both supervised and unsupervised approaches. However, it remains unclear how unsupervised and supervised methods can be used most effectively for neural retrievers. In this work, we systematically study retriever pre-training. We first propose an approach of unsupervised pre-training with the Inverse… ▽ More

    Submitted 1 June, 2021; v1 submitted 2 January, 2021; originally announced January 2021.

    Comments: ACL 2021

  13. arXiv:2010.03526  [pdf, other

    cs.LG cs.AI cs.CL

    TeMP: Temporal Message Passing for Temporal Knowledge Graph Completion

    Authors: Jiapeng Wu, Meng Cao, Jackie Chi Kit Cheung, William L. Hamilton

    Abstract: Inferring missing facts in temporal knowledge graphs (TKGs) is a fundamental and challenging task. Previous works have approached this problem by augmenting methods for static knowledge graphs to leverage time-dependent representations. However, these methods do not explicitly leverage multi-hop structural information and temporal facts from recent time steps to enhance their predictions. Addition… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: 17 pages, 9 figures. EMNLP 2020 Long Paper

  14. arXiv:2010.02863  [pdf, other

    cs.LG cs.CG cs.SI

    Directional Graph Networks

    Authors: Dominique Beaini, Saro Passaro, Vincent Létourneau, William L. Hamilton, Gabriele Corso, Pietro Liò

    Abstract: The lack of anisotropic kernels in graph neural networks (GNNs) strongly limits their expressiveness, contributing to well-known issues such as over-smoothing. To overcome this limitation, we propose the first globally consistent anisotropic kernels for GNNs, allowing for graph convolutions that are defined according to topologicaly-derived directional flows. First, by defining a vector field in t… ▽ More

    Submitted 7 April, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: 11 pages, 10 pages appendix, 6 figures, subtitle: Anisotropic aggregation in graph neural networks via directional vector fields

  15. arXiv:2009.11355  [pdf, other

    cs.LG cs.CL stat.ML

    Structure Aware Negative Sampling in Knowledge Graphs

    Authors: Kian Ahrabian, Aarash Feizi, Yasmin Salehi, William L. Hamilton, Avishek Joey Bose

    Abstract: Learning low-dimensional representations for entities and relations in knowledge graphs using contrastive estimation represents a scalable and effective method for inferring connectivity patterns. A crucial aspect of contrastive learning approaches is the choice of corruption distribution that generates hard negative samples, which force the embedding model to learn discriminative representations… ▽ More

    Submitted 6 October, 2020; v1 submitted 23 September, 2020; originally announced September 2020.

    Comments: Accepted to EMNLP 2020. Camera-ready submission

  16. VeRNAl: Mining RNA Structures for Fuzzy Base Pairing Network Motifs

    Authors: Carlos Oliver, Vincent Mallet, Pericles Philippopoulos, William L. Hamilton, Jerome Waldispuhl

    Abstract: RNA 3D motifs are recurrent substructures, modelled as networks of base pair interactions, which are crucial for understanding structure-function relationships. The task of automatically identifying such motifs is computationally hard, and remains a key challenge in the field of RNA structural biology and network analysis. State of the art methods solve special cases of the motif problem by constr… ▽ More

    Submitted 18 October, 2021; v1 submitted 1 September, 2020; originally announced September 2020.

  17. arXiv:2007.00720  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Adversarial Example Games

    Authors: Avishek Joey Bose, Gauthier Gidel, Hugo Berard, Andre Cianflone, Pascal Vincent, Simon Lacoste-Julien, William L. Hamilton

    Abstract: The existence of adversarial examples capable of fooling trained neural network classifiers calls for a much better understanding of possible attacks to guide the development of safeguards against them. This includes attack methods in the challenging non-interactive blackbox setting, where adversarial attacks are generated without any access, including queries, to the target model. Prior attacks i… ▽ More

    Submitted 8 January, 2021; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: Appears in: Advances in Neural Information Processing Systems 33 (NeurIPS 2020)

  18. arXiv:2005.00583  [pdf, other

    cs.CL cs.LG

    Learning an Unreferenced Metric for Online Dialogue Evaluation

    Authors: Koustuv Sinha, Prasanna Parthasarathi, Jasmine Wang, Ryan Lowe, William L. Hamilton, Joelle Pineau

    Abstract: Evaluating the quality of a dialogue interaction between two agents is a difficult task, especially in open-domain chit-chat style dialogue. There have been recent efforts to develop automatic dialogue evaluation metrics, but most of them do not generalize to unseen datasets and/or need a human-generated reference response during inference, making it infeasible for online evaluation. Here, we prop… ▽ More

    Submitted 1 May, 2020; originally announced May 2020.

    Comments: Accepted at ACL 2020, 5 pages

  19. arXiv:2003.06560  [pdf, other

    cs.LG stat.ML

    Evaluating Logical Generalization in Graph Neural Networks

    Authors: Koustuv Sinha, Shagun Sodhani, Joelle Pineau, William L. Hamilton

    Abstract: Recent research has highlighted the role of relational inductive biases in building learning agents that can generalize and reason in a compositional manner. However, while relational learning algorithms such as graph neural networks (GNNs) show promise, we do not understand how effectively these approaches can adapt to new tasks. In this work, we study the task of logical generalization using GNN… ▽ More

    Submitted 14 March, 2020; originally announced March 2020.

  20. arXiv:2002.09127  [pdf, other

    cs.CL cs.LG

    Learning Dynamic Belief Graphs to Generalize on Text-Based Games

    Authors: Ashutosh Adhikari, Xingdi Yuan, Marc-Alexandre Côté, Mikuláš Zelinka, Marc-Antoine Rondeau, Romain Laroche, Pascal Poupart, Jian Tang, Adam Trischler, William L. Hamilton

    Abstract: Playing text-based games requires skills in processing natural language and sequential decision making. Achieving human-level performance on text-based games remains an open challenge, and prior research has largely relied on hand-crafted structured representations and heuristics. In this work, we investigate how an agent can plan and generalize in text-based games using graph-structured represent… ▽ More

    Submitted 11 May, 2021; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: Bug fixed in Table 1

  21. arXiv:2002.06336  [pdf, other

    cs.LG stat.ML

    Latent Variable Modelling with Hyperbolic Normalizing Flows

    Authors: Avishek Joey Bose, Ariella Smofsky, Renjie Liao, Prakash Panangaden, William L. Hamilton

    Abstract: The choice of approximate posterior distributions plays a central role in stochastic variational inference (SVI). One effective solution is the use of normalizing flows \cut{defined on Euclidean spaces} to construct flexible posterior distributions. However, one key limitation of existing normalizing flows is that they are restricted to the Euclidean space and are ill-equipped to model data with a… ▽ More

    Submitted 13 August, 2020; v1 submitted 15 February, 2020; originally announced February 2020.

    Comments: Preprint, work under review

  22. arXiv:2002.01335  [pdf, other

    cs.CL cs.AI cs.LG cs.MA stat.ML

    Structural Inductive Biases in Emergent Communication

    Authors: Agnieszka Słowik, Abhinav Gupta, William L. Hamilton, Mateja Jamnik, Sean B. Holden, Christopher Pal

    Abstract: In order to communicate, humans flatten a complex representation of ideas and their attributes into a single word or a sentence. We investigate the impact of representation learning in artificial agents by develo** graph referential games. We empirically show that agents parametrized by graph neural networks develop a more compositional language compared to bag-of-words and sequence models, whic… ▽ More

    Submitted 27 July, 2021; v1 submitted 4 February, 2020; originally announced February 2020.

    Comments: The first two authors contributed equally. Poster presented at CogSci 2021

  23. arXiv:2001.09063  [pdf, other

    cs.LG cs.AI cs.CL cs.MA stat.ML

    Towards Graph Representation Learning in Emergent Communication

    Authors: Agnieszka Słowik, Abhinav Gupta, William L. Hamilton, Mateja Jamnik, Sean B. Holden

    Abstract: Recent findings in neuroscience suggest that the human brain represents information in a geometric structure (for instance, through conceptual spaces). In order to communicate, we flatten the complex representation of entities and their attributes into a single word or a sentence. In this paper we use graph convolutional networks to support the evolution of language and cooperation in multi-agent… ▽ More

    Submitted 4 February, 2020; v1 submitted 24 January, 2020; originally announced January 2020.

    Comments: The first two authors contributed equally. Accepted at the Reinforcement Learning in Games workshop at AAAI 2020

  24. arXiv:1912.09867  [pdf, other

    cs.LG cs.SI stat.ML

    Meta-Graph: Few Shot Link Prediction via Meta Learning

    Authors: Avishek Joey Bose, Ankit Jain, Piero Molino, William L. Hamilton

    Abstract: We consider the task of few shot link prediction on graphs. The goal is to learn from a distribution over graphs so that a model is able to quickly infer missing edges in a new graph after a small amount of training. We show that current link prediction methods are generally ill-equipped to handle this task. They cannot effectively transfer learned knowledge from one graph to another and are unabl… ▽ More

    Submitted 1 March, 2020; v1 submitted 20 December, 2019; originally announced December 2019.

  25. arXiv:1911.06962  [pdf, other

    cs.LG cs.AI stat.ML

    Inductive Relation Prediction by Subgraph Reasoning

    Authors: Komal K. Teru, Etienne Denis, William L. Hamilton

    Abstract: The dominant paradigm for relation prediction in knowledge graphs involves learning and operating on latent representations (i.e., embeddings) of entities and relations. However, these embedding-based methods do not explicitly capture the compositional logical rules underlying the knowledge graph, and they are limited to the transductive setting, where the full set of entities must be known during… ▽ More

    Submitted 11 February, 2020; v1 submitted 16 November, 2019; originally announced November 2019.

  26. arXiv:1910.00760  [pdf, other

    cs.LG stat.ML

    Efficient Graph Generation with Graph Recurrent Attention Networks

    Authors: Renjie Liao, Yujia Li, Yang Song, Shenlong Wang, Charlie Nash, William L. Hamilton, David Duvenaud, Raquel Urtasun, Richard S. Zemel

    Abstract: We propose a new family of efficient and expressive deep generative models of graphs, called Graph Recurrent Attention Networks (GRANs). Our model generates graphs one block of nodes and associated edges at a time. The block size and sampling stride allow us to trade off sample quality for efficiency. Compared to previous RNN-based graph generative models, our framework better captures the auto-re… ▽ More

    Submitted 17 July, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: Neural Information Processing Systems (NeurIPS) 2019

  27. arXiv:1908.06177  [pdf, other

    cs.LG cs.CL cs.LO stat.ML

    CLUTRR: A Diagnostic Benchmark for Inductive Reasoning from Text

    Authors: Koustuv Sinha, Shagun Sodhani, ** Dong, Joelle Pineau, William L. Hamilton

    Abstract: The recent success of natural language understanding (NLU) systems has been troubled by results highlighting the failure of these models to generalize in a systematic and robust way. In this work, we introduce a diagnostic benchmark suite, named CLUTRR, to clarify some key issues related to the robustness and systematicity of NLU systems. Motivated by classic work on inductive logic programming, C… ▽ More

    Submitted 3 September, 2019; v1 submitted 16 August, 2019; originally announced August 2019.

    Comments: Accepted at EMNLP 2019, 9 page content + Appendix

  28. arXiv:1906.10199  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Neural Transfer Learning for Cry-based Diagnosis of Perinatal Asphyxia

    Authors: Charles C. Onu, Jonathan Lebensold, William L. Hamilton, Doina Precup

    Abstract: Despite continuing medical advances, the rate of newborn morbidity and mortality globally remains high, with over 6 million casualties every year. The prediction of pathologies affecting newborns based on their cry is thus of significant clinical interest, as it would facilitate the development of accessible, low-cost diagnostic tools\cut{ based on wearables and smartphones}. However, the inadequa… ▽ More

    Submitted 19 March, 2020; v1 submitted 24 June, 2019; originally announced June 2019.

    Comments: Accepted at INTERSPEECH 2019

  29. arXiv:1905.10864  [pdf, other

    cs.LG cs.CR stat.ML

    Generalizable Adversarial Attacks with Latent Variable Perturbation Modelling

    Authors: Avishek Joey Bose, Andre Cianflone, William L. Hamilton

    Abstract: Adversarial attacks on deep neural networks traditionally rely on a constrained optimization paradigm, where an optimization procedure is used to obtain a single adversarial perturbation for a given input example. In this work we frame the problem as learning a distribution of adversarial perturbations, enabling us to generate diverse adversarial distributions given an unperturbed input. We show t… ▽ More

    Submitted 20 January, 2020; v1 submitted 26 May, 2019; originally announced May 2019.

  30. arXiv:1905.10674  [pdf, other

    cs.LG cs.AI stat.ML

    Compositional Fairness Constraints for Graph Embeddings

    Authors: Avishek Joey Bose, William L. Hamilton

    Abstract: Learning high-quality node embeddings is a key building block for machine learning models that operate on graph data, such as social networks and recommender systems. However, existing graph embedding techniques are unable to cope with fairness constraints, e.g., ensuring that the learned representations do not correlate with certain attributes, such as age or gender. Here, we introduce an adversa… ▽ More

    Submitted 16 July, 2019; v1 submitted 25 May, 2019; originally announced May 2019.

    Comments: Proceedings of the 36th International Conference on Machine Learning, Long Beach, California, PMLR 97, 2019

  31. arXiv:1811.02959  [pdf, other

    cs.CL cs.AI

    Compositional Language Understanding with Text-based Relational Reasoning

    Authors: Koustuv Sinha, Shagun Sodhani, William L. Hamilton, Joelle Pineau

    Abstract: Neural networks for natural language reasoning have largely focused on extractive, fact-based question-answering (QA) and common-sense inference. However, it is also crucial to understand the extent to which neural networks can perform relational reasoning and combinatorial generalization from natural language---abilities that are often obscured by annotation artifacts and the dominance of languag… ▽ More

    Submitted 7 November, 2018; v1 submitted 7 November, 2018; originally announced November 2018.

    Comments: 4 pages of main content, to be presented at Relational Representation Learning Workshop, NIPS 2018, Montreal

  32. arXiv:1810.02244  [pdf, other

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks

    Authors: Christopher Morris, Martin Ritzert, Matthias Fey, William L. Hamilton, Jan Eric Lenssen, Gaurav Rattan, Martin Grohe

    Abstract: In recent years, graph neural networks (GNNs) have emerged as a powerful neural architecture to learn vector representations of nodes and graphs in a supervised, end-to-end fashion. Up to now, GNNs have only been evaluated empirically -- showing promising results. The following work investigates GNNs from a theoretical point of view and relates them to the $1$-dimensional Weisfeiler-Leman graph is… ▽ More

    Submitted 30 November, 2021; v1 submitted 4 October, 2018; originally announced October 2018.

    Comments: Extended version with proofs, accepted at AAAI 2019, added units of measurement of QM9 dataset into appendix, removed results from Wu et al., 2018 due to different units

  33. arXiv:1809.10341  [pdf, other

    stat.ML cs.IT cs.LG cs.SI

    Deep Graph Infomax

    Authors: Petar Veličković, William Fedus, William L. Hamilton, Pietro Liò, Yoshua Bengio, R Devon Hjelm

    Abstract: We present Deep Graph Infomax (DGI), a general approach for learning node representations within graph-structured data in an unsupervised manner. DGI relies on maximizing mutual information between patch representations and corresponding high-level summaries of graphs---both derived using established graph convolutional network architectures. The learnt patch representations summarize subgraphs ce… ▽ More

    Submitted 21 December, 2018; v1 submitted 27 September, 2018; originally announced September 2018.

    Comments: To appear at ICLR 2019. 17 pages, 8 figures

  34. arXiv:1806.08804  [pdf, other

    cs.LG cs.NE cs.SI stat.ML

    Hierarchical Graph Representation Learning with Differentiable Pooling

    Authors: Rex Ying, Jiaxuan You, Christopher Morris, Xiang Ren, William L. Hamilton, Jure Leskovec

    Abstract: Recently, graph neural networks (GNNs) have revolutionized the field of graph representation learning through effectively learned node embeddings, and achieved state-of-the-art results in tasks such as node classification and link prediction. However, current GNN methods are inherently flat and do not learn hierarchical representations of graphs---a limitation that is especially problematic for th… ▽ More

    Submitted 20 February, 2019; v1 submitted 22 June, 2018; originally announced June 2018.

  35. arXiv:1806.01973  [pdf, other

    cs.IR cs.LG stat.ML

    Graph Convolutional Neural Networks for Web-Scale Recommender Systems

    Authors: Rex Ying, Ruining He, Kaifeng Chen, Pong Eksombatchai, William L. Hamilton, Jure Leskovec

    Abstract: Recent advancements in deep neural networks for graph-structured data have led to state-of-the-art performance on recommender system benchmarks. However, making these methods practical and scalable to web-scale recommendation tasks with billions of items and hundreds of millions of users remains a challenge. Here we describe a large-scale deep recommendation engine that we developed and deployed a… ▽ More

    Submitted 5 June, 2018; originally announced June 2018.

    Comments: KDD 2018

  36. arXiv:1806.01445  [pdf, other

    cs.SI cs.LG stat.ML

    Embedding Logical Queries on Knowledge Graphs

    Authors: William L. Hamilton, Payal Bajaj, Marinka Zitnik, Dan Jurafsky, Jure Leskovec

    Abstract: Learning low-dimensional embeddings of knowledge graphs is a powerful approach used to predict unobserved or missing edges between entities. However, an open challenge in this area is develo** techniques that can go beyond simple edge prediction and handle more complex logical queries, which might involve multiple unobserved edges, entities, and variables. For instance, given an incomplete biolo… ▽ More

    Submitted 29 October, 2019; v1 submitted 4 June, 2018; originally announced June 2018.

    Comments: Published in NeurIPS 2018

  37. arXiv:1803.03697  [pdf, other

    cs.SI cs.CL cs.HC

    Community Interaction and Conflict on the Web

    Authors: Srijan Kumar, William L. Hamilton, Jure Leskovec, Dan Jurafsky

    Abstract: Users organize themselves into communities on web platforms. These communities can interact with one another, often leading to conflicts and toxic interactions. However, little is known about the mechanisms of interactions between communities and how they impact users. Here we study intercommunity interactions across 36,000 communities on Reddit, examining cases where users of one community are… ▽ More

    Submitted 9 March, 2018; originally announced March 2018.

    Comments: In WWW 2018: The Web Conference. Project website with data and code is https://snap.stanford.edu/conflict/

  38. arXiv:1802.08773  [pdf, other

    cs.LG cs.AI cs.SI

    GraphRNN: Generating Realistic Graphs with Deep Auto-regressive Models

    Authors: Jiaxuan You, Rex Ying, Xiang Ren, William L. Hamilton, Jure Leskovec

    Abstract: Modeling and generating graphs is fundamental for studying networks in biology, engineering, and social sciences. However, modeling complex distributions over graphs and then efficiently sampling from these distributions is challenging due to the non-unique, high-dimensional nature of graphs and the complex, non-local dependencies that exist between edges in a given graph. Here we propose GraphRNN… ▽ More

    Submitted 23 June, 2018; v1 submitted 23 February, 2018; originally announced February 2018.

    Comments: ICML 2018

    ACM Class: I.2.6

  39. arXiv:1709.05584  [pdf, other

    cs.SI cs.LG

    Representation Learning on Graphs: Methods and Applications

    Authors: William L. Hamilton, Rex Ying, Jure Leskovec

    Abstract: Machine learning on graphs is an important and ubiquitous task with applications ranging from drug design to friendship recommendation in social networks. The primary challenge in this domain is finding a way to represent, or encode, graph structure so that it can be easily exploited by machine learning models. Traditionally, machine learning approaches relied on user-defined heuristics to extract… ▽ More

    Submitted 10 April, 2018; v1 submitted 16 September, 2017; originally announced September 2017.

    Comments: Published in the IEEE Data Engineering Bulletin, September 2017; version with minor corrections

  40. arXiv:1706.02216  [pdf, other

    cs.SI cs.LG stat.ML

    Inductive Representation Learning on Large Graphs

    Authors: William L. Hamilton, Rex Ying, Jure Leskovec

    Abstract: Low-dimensional embeddings of nodes in large graphs have proved extremely useful in a variety of prediction tasks, from content recommendation to identifying protein functions. However, most existing approaches require that all nodes in the graph are present during training of the embeddings; these previous approaches are inherently transductive and do not naturally generalize to unseen nodes. Her… ▽ More

    Submitted 10 September, 2018; v1 submitted 7 June, 2017; originally announced June 2017.

    Comments: Published in NIPS 2017; version with full appendix and minor corrections

  41. arXiv:1705.09665  [pdf, other

    cs.SI cs.CL cs.CY physics.soc-ph

    Community Identity and User Engagement in a Multi-Community Landscape

    Authors: Justine Zhang, William L. Hamilton, Cristian Danescu-Niculescu-Mizil, Dan Jurafsky, Jure Leskovec

    Abstract: A community's identity defines and shapes its internal dynamics. Our current understanding of this interplay is mostly limited to glimpses gathered from isolated studies of individual communities. In this work we provide a systematic exploration of the nature of this relation across a wide variety of online communities. To this end we introduce a quantitative, language-based typology reflecting tw… ▽ More

    Submitted 26 May, 2017; originally announced May 2017.

    Comments: 10 page, 3 figures, To appear in the Proceedings of the 11th International Conference On Web And Social Media, ICWSM 2017; this version has subtle differences with the proceedings version, including an introductory quote

  42. arXiv:1703.03386  [pdf, other

    cs.SI cs.CL

    Loyalty in Online Communities

    Authors: William L. Hamilton, Justine Zhang, Cristian Danescu-Niculescu-Mizil, Dan Jurafsky, Jure Leskovec

    Abstract: Loyalty is an essential component of multi-community engagement. When users have the choice to engage with a variety of different communities, they often become loyal to just one, focusing on that community at the expense of others. However, it is unclear how loyalty is manifested in user behavior, or whether loyalty is encouraged by certain community characteristics. In this paper we operationa… ▽ More

    Submitted 24 May, 2017; v1 submitted 9 March, 2017; originally announced March 2017.

    Comments: Extended version of a paper appearing in the Proceedings of ICWSM 2017 (with the same title); please cite the official ICWSM version

  43. arXiv:1606.02821  [pdf, other

    cs.CL

    Cultural Shift or Linguistic Drift? Comparing Two Computational Measures of Semantic Change

    Authors: William L. Hamilton, Jure Leskovec, Dan Jurafsky

    Abstract: Words shift in meaning for many reasons, including cultural factors like new technologies and regular linguistic processes like subjectification. Understanding the evolution of language and culture requires disentangling these underlying causes. Here we show how two different distributional measures can be used to detect two different types of semantic change. The first measure, which has been use… ▽ More

    Submitted 23 September, 2016; v1 submitted 9 June, 2016; originally announced June 2016.

    Comments: 5 pages, 3 figures, EMNLP 2016

  44. arXiv:1606.02820  [pdf, other

    cs.CL

    Inducing Domain-Specific Sentiment Lexicons from Unlabeled Corpora

    Authors: William L. Hamilton, Kevin Clark, Jure Leskovec, Dan Jurafsky

    Abstract: A word's sentiment depends on the domain in which it is used. Computational social science research thus requires sentiment lexicons that are specific to the domains being studied. We combine domain-specific word embeddings with a label propagation framework to induce accurate domain-specific sentiment lexicons using small sets of seed words, achieving state-of-the-art performance competitive with… ▽ More

    Submitted 23 September, 2016; v1 submitted 9 June, 2016; originally announced June 2016.

    Comments: 11 pages, 5 figures, EMNLP 2016

  45. arXiv:1605.09096  [pdf, other

    cs.CL

    Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change

    Authors: William L. Hamilton, Jure Leskovec, Dan Jurafsky

    Abstract: Understanding how words change their meanings over time is key to models of language and cultural evolution, but historical data on meaning is scarce, making theories hard to develop and test. Word embeddings show promise as a diachronic tool, but have not been carefully evaluated. We develop a robust methodology for quantifying semantic change by evaluating word embeddings (PPMI, SVD, word2vec) a… ▽ More

    Submitted 25 October, 2018; v1 submitted 29 May, 2016; originally announced May 2016.

    Comments: Association for Computational Linguistics (ACL), 2016. Minor corrections; improved methodology for Section 3

  46. arXiv:1312.0286  [pdf, other

    cs.LG stat.ML

    Efficient Learning and Planning with Compressed Predictive States

    Authors: William L. Hamilton, Mahdi Milani Fard, Joelle Pineau

    Abstract: Predictive state representations (PSRs) offer an expressive framework for modelling partially observable systems. By compactly representing systems as functions of observable quantities, the PSR learning approach avoids using local-minima prone expectation-maximization and instead employs a globally optimal moment-based algorithm. Moreover, since PSRs do not require a predetermined latent state st… ▽ More

    Submitted 20 July, 2014; v1 submitted 1 December, 2013; originally announced December 2013.

    Comments: 45 pages, 10 figures, submitted to the Journal of Machine Learning Research