Why Are Deep Representations Good Perceptual Quality Features?

Tariq, Taimoor; Tursun, Okan Tarhan; Kim, Munchurl; Didyk, Piotr

arXiv:1812.00412 (cs)

[Submitted on 2 Dec 2018 (v1), last revised 23 Jul 2020 (this version, v4)]

Abstract: Recently, intermediate feature maps of pre-trained convolutional neural networks have shown significant perceptual quality improvements when used in the loss function for training new networks. It is believed that these features are better at encoding perceptual quality and provide more efficient representations of input images than other perceptual metrics such as SSIM and PSNR. However, there have been no systematic studies to determine the underlying reason. Without such an analysis, it is not possible to evaluate the performance of a particular set of features or to further improve perceptual quality by carefully selecting a subset of features from a pre-trained CNN. This work shows that the capabilities of pre-trained deep CNN features in optimizing perceptual quality are correlated with their success in capturing basic human visual perception characteristics. In particular, we focus our analysis on fundamental aspects of human perception, such as contrast sensitivity and orientation selectivity. We introduce two new formulations to measure the frequency and orientation selectivity of the features learned by convolutional layers, and use them to evaluate deep features learned by widely used deep CNNs such as VGG-16. We demonstrate that the pre-trained CNN features which receive higher scores are better at predicting human quality judgments. Furthermore, we show that our method can be used to select deep features to form a new loss function, which improves image reconstruction quality for the well-known single-image super-resolution problem.

Comments:	To be presented at ECCV 2020
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1812.00412 [cs.CV]
	(or arXiv:1812.00412v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1812.00412

Submission history

From: Taimoor Tariq
[v1] Sun, 2 Dec 2018 15:54:29 UTC (4,834 KB)
[v2] Mon, 25 Feb 2019 07:57:16 UTC (4,784 KB)
[v3] Fri, 16 Aug 2019 17:36:46 UTC (617 KB)
[v4] Thu, 23 Jul 2020 13:20:17 UTC (7,353 KB)
