T-former: An Efficient Transformer for Image Inpainting

Deng, Ye; Hui, Siqi; Zhou, San**; Meng, Deyu; Wang, **jun

doi:10.1145/3503161.3548446

Computer Science > Computer Vision and Pattern Recognition

arXiv:2305.07239 (cs)

[Submitted on 12 May 2023 (v1), last revised 19 May 2023 (this version, v2)]

Title:T-former: An Efficient Transformer for Image Inpainting

Authors:Ye Deng, Siqi Hui, San** Zhou, Deyu Meng, **jun Wang

View PDF

Abstract:Benefiting from powerful convolutional neural networks (CNNs), learning-based image inpainting methods have made significant breakthroughs over the years. However, some nature of CNNs (e.g. local prior, spatially shared parameters) limit the performance in the face of broken images with diverse and complex forms. Recently, a class of attention-based network architectures, called transformer, has shown significant performance on natural language processing fields and high-level vision tasks. Compared with CNNs, attention operators are better at long-range modeling and have dynamic weights, but their computational complexity is quadratic in spatial resolution, and thus less suitable for applications involving higher resolution images, such as image inpainting. In this paper, we design a novel attention linearly related to the resolution according to Taylor expansion. And based on this attention, a network called $T$-former is designed for image inpainting. Experiments on several benchmark datasets demonstrate that our proposed method achieves state-of-the-art accuracy while maintaining a relatively low number of parameters and computational complexity. The code can be found at \href{this https URL}{this http URL\_image\_inpainting}

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2305.07239 [cs.CV]
	(or arXiv:2305.07239v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2305.07239
Journal reference:	ACM Multimedia 2022
Related DOI:	https://doi.org/10.1145/3503161.3548446

Submission history

From: Ye Deng [view email]
[v1] Fri, 12 May 2023 04:10:42 UTC (1,829 KB)
[v2] Fri, 19 May 2023 02:11:54 UTC (1,438 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:T-former: An Efficient Transformer for Image Inpainting

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:T-former: An Efficient Transformer for Image Inpainting

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators