PiClick: Picking the desired mask in click-based interactive segmentation

Yan, Cilin; Wang, Haochen; Liu, Jie; Jiang, Xiaolong; Hu, Yao; Tang, Xu; Kang, Guoliang; Gavves, Efstratios

Computer Science > Computer Vision and Pattern Recognition

arXiv:2304.11609v1 (cs)

[Submitted on 23 Apr 2023 (this version), latest version 17 Jun 2024 (v5)]

Title:PiClick: Picking the desired mask in click-based interactive segmentation

Authors:Cilin Yan, Haochen Wang, Jie Liu, Xiaolong Jiang, Yao Hu, Xu Tang, Guoliang Kang, Efstratios Gavves

View PDF

Abstract:Click-based interactive segmentation enables productive pixel-level annotation and image editing with simple user clicks, whereas target ambiguity remains a problem hindering precise segmentation. That is, in scenes with rich context, one click may refer to multiple potential targets residing in corresponding masks, while most interactive segmentors can only generate one single mask and fail to capture the rich context. To resolve target ambiguity, we here propose PiClick to produce semantically diversified masks. PiClick leverages a transformer network design wherein mutually interactive mask queries are integrated to infuse target priors. Moreover, a Target Reasoning Module is designed in PiClick to automatically imply the best-matched mask from all proposals, significantly relieving target ambiguity as well as extra human intervention. Extensive experiments conducted on all 9 interactive segmentation datasets not only demonstrate the state-of-the-art segmentation performance of PiClick, but also reduces human interventions with multiple proposal generation and target reasoning. To promote direct usage and future endeavors, we release the source code of PiClick together with a plug-and-play annotation tool at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2304.11609 [cs.CV]
	(or arXiv:2304.11609v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2304.11609

Submission history

From: Cilin Yan [view email]
[v1] Sun, 23 Apr 2023 10:46:16 UTC (3,993 KB)
[v2] Sat, 19 Aug 2023 02:30:56 UTC (8,695 KB)
[v3] Mon, 28 Aug 2023 13:26:52 UTC (8,694 KB)
[v4] Mon, 29 Jan 2024 14:33:02 UTC (8,695 KB)
[v5] Mon, 17 Jun 2024 06:41:56 UTC (5,821 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:PiClick: Picking the desired mask in click-based interactive segmentation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:PiClick: Picking the desired mask in click-based interactive segmentation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators