CalibNet: Dual-branch Cross-modal Calibration for RGB-D Salient Instance Segmentation

Pei, Jialun; Jiang, Tao; Tang, He; Liu, Nian; **, Yueming; Fan, Deng-**; Heng, Pheng-Ann

Computer Science > Computer Vision and Pattern Recognition

arXiv:2307.08098 (cs)

[Submitted on 16 Jul 2023 (v1), last revised 11 Jun 2024 (this version, v2)]

Title:CalibNet: Dual-branch Cross-modal Calibration for RGB-D Salient Instance Segmentation

Authors:Jialun Pei, Tao Jiang, He Tang, Nian Liu, Yueming **, Deng-** Fan, Pheng-Ann Heng

View PDF HTML (experimental)

Abstract:We propose a novel approach for RGB-D salient instance segmentation using a dual-branch cross-modal feature calibration architecture called CalibNet. Our method simultaneously calibrates depth and RGB features in the kernel and mask branches to generate instance-aware kernels and mask features. CalibNet consists of three simple modules, a dynamic interactive kernel (DIK) and a weight-sharing fusion (WSF), which work together to generate effective instance-aware kernels and integrate cross-modal features. To improve the quality of depth features, we incorporate a depth similarity assessment (DSA) module prior to DIK and WSF. In addition, we further contribute a new DSIS dataset, which contains 1,940 images with elaborate instance-level annotations. Extensive experiments on three challenging benchmarks show that CalibNet yields a promising result, i.e., 58.0% AP with 320*480 input size on the COME15K-N test set, which significantly surpasses the alternative frameworks. Our code and dataset are available at: this https URL.

Comments:	This work has been accepted by TIP 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2307.08098 [cs.CV]
	(or arXiv:2307.08098v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2307.08098

Submission history

From: Jialun Pei [view email]
[v1] Sun, 16 Jul 2023 16:49:59 UTC (9,924 KB)
[v2] Tue, 11 Jun 2024 14:07:59 UTC (12,309 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:CalibNet: Dual-branch Cross-modal Calibration for RGB-D Salient Instance Segmentation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:CalibNet: Dual-branch Cross-modal Calibration for RGB-D Salient Instance Segmentation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators