Graph Inductive Biases in Transformers without Message Passing

Ma, Liheng; Lin, Chen; Lim, Derek; Romero-Soriano, Adriana; Dokania, Puneet K.; Coates, Mark; Torr, Philip; Lim, Ser-Nam

Computer Science > Machine Learning

arXiv:2305.17589v1 (cs)

[Submitted on 27 May 2023]

Title:Graph Inductive Biases in Transformers without Message Passing

Authors:Liheng Ma, Chen Lin, Derek Lim, Adriana Romero-Soriano, Puneet K. Dokania, Mark Coates, Philip Torr, Ser-Nam Lim

View PDF

Abstract:Transformers for graph data are increasingly widely studied and successful in numerous learning tasks. Graph inductive biases are crucial for Graph Transformers, and previous works incorporate them using message-passing modules and/or positional encodings. However, Graph Transformers that use message-passing inherit known issues of message-passing, and differ significantly from Transformers used in other domains, thus making transfer of research advances more difficult. On the other hand, Graph Transformers without message-passing often perform poorly on smaller datasets, where inductive biases are more crucial. To bridge this gap, we propose the Graph Inductive bias Transformer (GRIT) -- a new Graph Transformer that incorporates graph inductive biases without using message passing. GRIT is based on several architectural changes that are each theoretically and empirically justified, including: learned relative positional encodings initialized with random walk probabilities, a flexible attention mechanism that updates node and node-pair representations, and injection of degree information in each layer. We prove that GRIT is expressive -- it can express shortest path distances and various graph propagation matrices. GRIT achieves state-of-the-art empirical performance across a variety of graph datasets, thus showing the power that Graph Transformers without message-passing can deliver.

Comments:	Published as a conference paper at ICML 2023; 17 pages
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2305.17589 [cs.LG]
	(or arXiv:2305.17589v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2305.17589
Journal reference:	PMLR 202 (2023) 23321-23337

Submission history

From: Liheng Ma [view email]
[v1] Sat, 27 May 2023 22:26:27 UTC (702 KB)

Computer Science > Machine Learning

Title:Graph Inductive Biases in Transformers without Message Passing

Submission history

Access Paper:

References & Citations

1 blog link

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Graph Inductive Biases in Transformers without Message Passing

Submission history

Access Paper:

References & Citations

1 blog link

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators