SHAPFUZZ: Efficient Fuzzing via Shapley-Guided Byte Selection

Zhang, Kunpeng; Zhu, Xiaogang; Xiao, Xi; Xue, Minhui; Zhang, Chao; Wen, Sheng

doi:10.14722/ndss.2024.23134

arXiv:2308.09239 (cs)

[Submitted on 18 Aug 2023 (v1), last revised 23 Oct 2023 (this version, v3)]

Abstract: Mutation-based fuzzing is popular and effective in discovering unseen code and exposing bugs. However, only a few studies have concentrated on quantifying the importance of input bytes, that is, the degree to which a byte contributes to the discovery of new code. They often focus on obtaining the relationship between input bytes and path constraints, ignoring the fact that not all constraint-related bytes can discover new code. In this paper, we conduct Shapley analysis to understand the effect of byte positions on fuzzing performance, and find that some byte positions contribute more than others and that this property often holds across seeds. Based on this observation, we propose a novel fuzzing solution, ShapFuzz, to guide byte selection and mutation. Specifically, ShapFuzz updates the Shapley values (importance) of bytes with low overhead each time an input is tested during fuzzing, and uses a contextual multi-armed bandit to trade off between mutating bytes with high Shapley values and bytes that have been chosen infrequently. We implement a prototype of ShapFuzz on top of AFL++. We evaluate ShapFuzz against ten state-of-the-art fuzzers, including five byte schedule-reinforced fuzzers and five commonly used fuzzers. Compared with byte schedule-reinforced fuzzers, ShapFuzz discovers more edges and exposes more bugs than the best baseline on three different sets of initial seeds. Compared with commonly used fuzzers, ShapFuzz exposes 20 more bugs than the best comparison fuzzer, and discovers 6 more CVEs than the best baseline on MAGMA. Furthermore, ShapFuzz discovers 11 new bugs on the latest versions of programs, and 3 of them are confirmed by vendors.
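
To make the two ideas in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It computes textbook Shapley values by exact enumeration (exponential in the number of positions, so it cannot be the low-overhead update the abstract describes) and uses a plain UCB1-style score as a stand-in for the contextual multi-armed bandit. The names `shapley_values`, `ucb_score`, and `toy_gain`, the toy gain function, and the selection counts are all hypothetical and chosen only for illustration.

```python
from itertools import combinations
from math import factorial, log, sqrt


def shapley_values(positions, gain):
    """Exact Shapley value of each byte position.

    `gain(subset)` is a hypothetical characteristic function: the number of
    new edges found when exactly the positions in `subset` are mutated.
    """
    n = len(positions)
    values = {}
    for p in positions:
        others = [q for q in positions if q != p]
        total = 0.0
        for k in range(n):
            # Standard Shapley weight for a coalition of size k.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = frozenset(subset)
                total += weight * (gain(s | {p}) - gain(s))
        values[p] = total
    return values


def ucb_score(value, times_chosen, total_rounds, c=1.0):
    """UCB1-style score: estimated value plus a bonus for rarely chosen bytes."""
    if times_chosen == 0:
        return float("inf")
    return value + c * sqrt(2 * log(total_rounds) / times_chosen)


def toy_gain(subset):
    """Made-up gain: byte 0 alone finds 1 edge; bytes 1 and 2 together find 2."""
    edges = 0
    if 0 in subset:
        edges += 1
    if 1 in subset and 2 in subset:
        edges += 2
    return edges


values = shapley_values([0, 1, 2, 3], toy_gain)

# Hypothetical selection counts: byte 3 has rarely been chosen so far.
counts = {0: 10, 1: 10, 2: 10, 3: 1}
rounds = sum(counts.values())
scores = {p: ucb_score(values[p], counts[p], rounds) for p in values}
next_byte = max(scores, key=scores.get)
```

In this toy setting bytes 0, 1, and 2 each receive a Shapley value of about 1 and byte 3 receives 0, yet the exploration bonus makes byte 3 the next position to mutate. That is the trade-off between high-value bytes and infrequently chosen bytes in miniature.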

Subjects:	Cryptography and Security (cs.CR)
Cite as:	arXiv:2308.09239 [cs.CR]
	(or arXiv:2308.09239v3 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2308.09239
Journal reference:	Network and Distributed System Security (NDSS) Symposium 2024, 26 February - 1 March 2024, San Diego, CA, USA
Related DOI:	https://doi.org/10.14722/ndss.2024.23134

Submission history

From: Kunpeng Zhang
[v1] Fri, 18 Aug 2023 01:59:12 UTC (13,808 KB)
[v2] Mon, 21 Aug 2023 07:10:35 UTC (13,808 KB)
[v3] Mon, 23 Oct 2023 02:07:48 UTC (13,808 KB)
