PRADA: Practical Black-Box Adversarial Attacks against Neural Ranking Models

Wu, Chen; Zhang, Ruqing; Guo, Jiafeng; de Rijke, Maarten; Fan, Yixing; Cheng, Xueqi

Computer Science > Information Retrieval

arXiv:2204.01321 (cs)

[Submitted on 4 Apr 2022 (v1), last revised 8 Jun 2022 (this version, v3)]

Title:PRADA: Practical Black-Box Adversarial Attacks against Neural Ranking Models

Authors:Chen Wu, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng

View PDF

Abstract:Neural ranking models (NRMs) have shown remarkable success in recent years, especially with pre-trained language models. However, deep neural models are notorious for their vulnerability to adversarial examples. Adversarial attacks may become a new type of web spamming technique given our increased reliance on neural information retrieval models. Therefore, it is important to study potential adversarial attacks to identify vulnerabilities of NRMs before they are deployed. In this paper, we introduce the Word Substitution Ranking Attack (WSRA) task against NRMs, which aims to promote a target document in rankings by adding adversarial perturbations to its text. We focus on the decision-based black-box attack setting, where the attackers cannot directly get access to the model information, but can only query the target model to obtain the rank positions of the partial retrieved list. This attack setting is realistic in real-world search engines. We propose a novel Pseudo Relevance-based ADversarial ranking Attack method (PRADA) that learns a surrogate model based on Pseudo Relevance Feedback (PRF) to generate gradients for finding the adversarial perturbations. Experiments on two web search benchmark datasets show that PRADA can outperform existing attack strategies and successfully fool the NRM with small indiscernible perturbations of text.

Subjects:	Information Retrieval (cs.IR)
Cite as:	arXiv:2204.01321 [cs.IR]
	(or arXiv:2204.01321v3 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2204.01321

Submission history

From: Chen Wu [view email]
[v1] Mon, 4 Apr 2022 08:50:52 UTC (1,756 KB)
[v2] Sat, 21 May 2022 04:44:23 UTC (964 KB)
[v3] Wed, 8 Jun 2022 07:44:56 UTC (963 KB)

Computer Science > Information Retrieval

Title:PRADA: Practical Black-Box Adversarial Attacks against Neural Ranking Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:PRADA: Practical Black-Box Adversarial Attacks against Neural Ranking Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators