Stateful Defenses for Machine Learning Models Are Not Yet Secure Against Black-box Attacks

Feng, Ryan; Hooda, Ashish; Mangaokar, Neal; Fawaz, Kassem; Jha, Somesh; Prakash, Atul

doi:10.1145/3576915.3623116

Computer Science > Cryptography and Security

arXiv:2303.06280 (cs)

[Submitted on 11 Mar 2023 (v1), last revised 26 Sep 2023 (this version, v3)]

Title:Stateful Defenses for Machine Learning Models Are Not Yet Secure Against Black-box Attacks

Authors:Ryan Feng, Ashish Hooda, Neal Mangaokar, Kassem Fawaz, Somesh Jha, Atul Prakash

View PDF

Abstract:Recent work has proposed stateful defense models (SDMs) as a compelling strategy to defend against a black-box attacker who only has query access to the model, as is common for online machine learning platforms. Such stateful defenses aim to defend against black-box attacks by tracking the query history and detecting and rejecting queries that are "similar" and thus preventing black-box attacks from finding useful gradients and making progress towards finding adversarial attacks within a reasonable query budget. Recent SDMs (e.g., Blacklight and PIHA) have shown remarkable success in defending against state-of-the-art black-box attacks. In this paper, we show that SDMs are highly vulnerable to a new class of adaptive black-box attacks. We propose a novel adaptive black-box attack strategy called Oracle-guided Adaptive Rejection Sampling (OARS) that involves two stages: (1) use initial query patterns to infer key properties about an SDM's defense; and, (2) leverage those extracted properties to design subsequent query patterns to evade the SDM's defense while making progress towards finding adversarial inputs. OARS is broadly applicable as an enhancement to existing black-box attacks - we show how to apply the strategy to enhance six common black-box attacks to be more effective against current class of SDMs. For example, OARS-enhanced versions of black-box attacks improved attack success rate against recent stateful defenses from almost 0% to to almost 100% for multiple datasets within reasonable query budgets.

Comments:	ACM CCS 2023
Subjects:	Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:2303.06280 [cs.CR]
	(or arXiv:2303.06280v3 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2303.06280
Related DOI:	https://doi.org/10.1145/3576915.3623116

Submission history

From: Ryan Feng [view email]
[v1] Sat, 11 Mar 2023 02:10:21 UTC (1,189 KB)
[v2] Fri, 17 Mar 2023 00:43:03 UTC (1,189 KB)
[v3] Tue, 26 Sep 2023 04:36:30 UTC (603 KB)

Computer Science > Cryptography and Security

Title:Stateful Defenses for Machine Learning Models Are Not Yet Secure Against Black-box Attacks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:Stateful Defenses for Machine Learning Models Are Not Yet Secure Against Black-box Attacks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators