DE-Net: Dynamic Text-guided Image Editing Adversarial Networks

Tao, Ming; Bao, Bing-Kun; Tang, Hao; Wu, Fei; Wei, Longhui; Tian, Qi

Computer Science > Computer Vision and Pattern Recognition

arXiv:2206.01160v1 (cs)

[Submitted on 2 Jun 2022 (this version), latest version 20 Aug 2022 (v2)]

Title:DE-Net: Dynamic Text-guided Image Editing Adversarial Networks

Authors:Ming Tao, Bing-Kun Bao, Hao Tang, Fei Wu, Longhui Wei, Qi Tian

View PDF

Abstract:Text-guided image editing models have shown remarkable results. However, there remain two problems. First, they employ fixed manipulation modules for various editing requirements (e.g., color changing, texture changing, content adding and removing), which result in over-editing or insufficient editing. Second, they do not clearly distinguish between text-required parts and text-irrelevant parts, which leads to inaccurate editing. To solve these limitations, we propose: (i) a Dynamic Editing Block (DEBlock) which combines spatial- and channel-wise manipulations dynamically for various editing requirements. (ii) a Combination Weights Predictor (CWP) which predicts the combination weights for DEBlock according to the inference on text and visual features. (iii) a Dynamic text-adaptive Convolution Block (DCBlock) which queries source image features to distinguish text-required parts and text-irrelevant parts. Extensive experiments demonstrate that our DE-Net achieves excellent performance and manipulates source images more effectively and accurately. Code is available at \url{this https URL}.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
Cite as:	arXiv:2206.01160 [cs.CV]
	(or arXiv:2206.01160v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2206.01160

Submission history

From: Ming Tao [view email]
[v1] Thu, 2 Jun 2022 17:20:52 UTC (9,248 KB)
[v2] Sat, 20 Aug 2022 15:46:46 UTC (7,018 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:DE-Net: Dynamic Text-guided Image Editing Adversarial Networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:DE-Net: Dynamic Text-guided Image Editing Adversarial Networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators