Benchmarking Robustness to Adversarial Image Obfuscations

Stimberg, Florian; Chakrabarti, Ayan; Lu, Chun-Ta; Hazimeh, Hussein; Stretcu, Otilia; Qiao, Wei; Liu, Yintao; Kaya, Merve; Rashtchian, Cyrus; Fuxman, Ariel; Tek, Mehmet; Gowal, Sven

arXiv:2301.12993 (cs)

[Submitted on 30 Jan 2023 (v1), last revised 29 Nov 2023 (this version, v2)]

Abstract: Automated content filtering and moderation is an important tool that allows online platforms to build thriving user communities that facilitate cooperation and prevent abuse. Unfortunately, resourceful actors try to bypass automated filters in a bid to post content that violates platform policies and codes of conduct. To reach this goal, these malicious actors may obfuscate policy-violating images (e.g., overlay harmful images with carefully selected benign images or visual patterns) to prevent machine learning models from reaching the correct decision. In this paper, we invite researchers to tackle this specific issue and present a new image benchmark. This benchmark, based on ImageNet, simulates the type of obfuscations created by malicious actors. It goes beyond ImageNet-$\textrm{C}$ and ImageNet-$\bar{\textrm{C}}$ by proposing general, drastic, adversarial modifications that preserve the original content intent. It aims to tackle a more common adversarial threat than the one considered by $\ell_p$-norm bounded adversaries. We evaluate 33 pretrained models on the benchmark and train models with different augmentations, architectures and training methods on subsets of the obfuscations to measure generalization. We hope this benchmark will encourage researchers to test their models and methods and try to find new approaches that are more robust to these obfuscations.
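
As an illustration of the overlay obfuscation mentioned in the abstract, the sketch below alpha-blends a benign pattern over an image. It is a minimal sketch under stated assumptions, not the benchmark's implementation: the function name `overlay`, the `alpha` parameter, and the grayscale nested-list image representation are all hypothetical choices made for this example.

```python
from typing import List

# Assumption: a grayscale image is a nested list of floats in [0, 1].
Image = List[List[float]]


def overlay(image: Image, pattern: Image, alpha: float = 0.5) -> Image:
    """Blend `pattern` over `image`; `alpha` is the opacity of the pattern.

    Hypothetical helper for illustration only. alpha=0 returns the image
    unchanged, alpha=1 returns the pattern.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    same_rows = len(image) == len(pattern)
    if not same_rows or any(len(r) != len(p) for r, p in zip(image, pattern)):
        raise ValueError("image and pattern must have the same shape")
    # Per-pixel convex combination of the two images.
    return [
        [(1.0 - alpha) * px + alpha * pt for px, pt in zip(row, prow)]
        for row, prow in zip(image, pattern)
    ]


if __name__ == "__main__":
    original = [[0.0, 1.0], [0.2, 0.8]]
    benign = [[1.0, 0.0], [0.6, 0.4]]
    print(overlay(original, benign, alpha=0.5))
```

A larger `alpha` hides more of the original content from a classifier, while a human viewer may still recognize it; that gap is the kind of threat the abstract describes.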

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
ACM classes:	I.2.10; I.4.0
Cite as:	arXiv:2301.12993 [cs.CV]
	(or arXiv:2301.12993v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2301.12993

Submission history

From: Chun-Ta Lu
[v1] Mon, 30 Jan 2023 15:36:44 UTC (2,626 KB)
[v2] Wed, 29 Nov 2023 18:33:43 UTC (2,906 KB)
