OrthoNets: Orthogonal Channel Attention Networks

Salman, Hadi; Parks, Caleb; Swan, Matthew; Gauch, John

Computer Science > Computer Vision and Pattern Recognition

arXiv:2311.03071 (cs)

[Submitted on 6 Nov 2023 (v1), last revised 7 Nov 2023 (this version, v2)]

Title:OrthoNets: Orthogonal Channel Attention Networks

Authors:Hadi Salman, Caleb Parks, Matthew Swan, John Gauch

View PDF

Abstract:Designing an effective channel attention mechanism implores one to find a lossy-compression method allowing for optimal feature representation. Despite recent progress in the area, it remains an open problem. FcaNet, the current state-of-the-art channel attention mechanism, attempted to find such an information-rich compression using Discrete Cosine Transforms (DCTs). One drawback of FcaNet is that there is no natural choice of the DCT frequencies. To circumvent this issue, FcaNet experimented on ImageNet to find optimal frequencies. We hypothesize that the choice of frequency plays only a supporting role and the primary driving force for the effectiveness of their attention filters is the orthogonality of the DCT kernels. To test this hypothesis, we construct an attention mechanism using randomly initialized orthogonal filters. Integrating this mechanism into ResNet, we create OrthoNet. We compare OrthoNet to FcaNet (and other attention mechanisms) on Birds, MS-COCO, and Places356 and show superior performance. On the ImageNet dataset, our method competes with or surpasses the current state-of-the-art. Our results imply that an optimal choice of filter is elusive and generalization can be achieved with a sufficiently large number of orthogonal filters. We further investigate other general principles for implementing channel attention, such as its position in the network and channel grou**s. Our code is publicly available at this https URL

Comments:	IEEE BigData 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2311.03071 [cs.CV]
	(or arXiv:2311.03071v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2311.03071
Journal reference:	IEEE BigData 2023

Submission history

From: Hadi Salman [view email]
[v1] Mon, 6 Nov 2023 12:54:20 UTC (3,412 KB)
[v2] Tue, 7 Nov 2023 02:23:30 UTC (1,705 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:OrthoNets: Orthogonal Channel Attention Networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:OrthoNets: Orthogonal Channel Attention Networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators