Generative Semi-supervised Graph Anomaly Detection

Qiao, Hezhe; Wen, Qingsong; Li, Xiaoli; Lim, Ee-Peng; Pang, Guansong

Computer Science > Machine Learning

arXiv:2402.11887 (cs)

[Submitted on 19 Feb 2024 (v1), last revised 28 May 2024 (this version, v4)]

Title:Generative Semi-supervised Graph Anomaly Detection

Authors:Hezhe Qiao, Qingsong Wen, Xiaoli Li, Ee-Peng Lim, Guansong Pang

View PDF HTML (experimental)

Abstract:This work considers a practical semi-supervised graph anomaly detection (GAD) scenario, where part of the nodes in a graph are known to be normal, contrasting to the extensively explored unsupervised setting with a fully unlabeled graph. We reveal that having access to the normal nodes, even just a small percentage of normal nodes, helps enhance the detection performance of existing unsupervised GAD methods when they are adapted to the semi-supervised setting. However, their utilization of these normal nodes is limited. In this paper, we propose a novel Generative GAD approach (namely GGAD) for the semi-supervised scenario to better exploit the normal nodes. The key idea is to generate pseudo anomaly nodes, referred to as 'outlier nodes', for providing effective negative node samples in training a discriminative one-class classifier. The main challenge here lies in the lack of ground truth information about real anomaly nodes. To address this challenge, GGAD is designed to leverage two important priors about the anomaly nodes -- asymmetric local affinity and egocentric closeness -- to generate reliable outlier nodes that assimilate anomaly nodes in both graph structure and feature representations. Comprehensive experiments on six real-world GAD datasets are performed to establish a benchmark for semi-supervised GAD and show that GGAD substantially outperforms state-of-the-art unsupervised and semi-supervised GAD methods with varying numbers of training normal nodes. Code will be made available at this https URL.

Comments:	20 pages, 11 figures
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2402.11887 [cs.LG]
	(or arXiv:2402.11887v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2402.11887

Submission history

From: Hezhe Qiao [view email]
[v1] Mon, 19 Feb 2024 06:55:50 UTC (3,821 KB)
[v2] Sun, 17 Mar 2024 12:08:33 UTC (3,821 KB)
[v3] Thu, 4 Apr 2024 10:08:25 UTC (3,821 KB)
[v4] Tue, 28 May 2024 08:31:28 UTC (3,494 KB)

Computer Science > Machine Learning

Title:Generative Semi-supervised Graph Anomaly Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Generative Semi-supervised Graph Anomaly Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators