Search | arXiv e-print repository

3D Video Quality Metric for Mobile Applications

Authors: Amin Banitalebi-Dehkordi, Mahsa T. Pourazad, Panos Nasiopoulos

Abstract: In this paper, we propose a new full-reference quality metric for mobile 3D content. Our method is modeled around the Human Visual System, fusing the information of both left and right channels, considering color components, the cyclopean views of the two videos and disparity. Our method is assessing the quality of 3D videos displayed on a mobile 3DTV, taking into account the effect of resolution,… ▽ More In this paper, we propose a new full-reference quality metric for mobile 3D content. Our method is modeled around the Human Visual System, fusing the information of both left and right channels, considering color components, the cyclopean views of the two videos and disparity. Our method is assessing the quality of 3D videos displayed on a mobile 3DTV, taking into account the effect of resolution, distance from the viewers eyes, and dimensions of the mobile display. Performance evaluations showed that our mobile 3D quality metric monitors the degradation of quality caused by several representative types of distortion with 82 percent correlation with results of subjective tests, an accuracy much better than that of the state of the art mobile 3D quality metric. △ Less

Submitted 13 March, 2018; originally announced March 2018.

Comments: arXiv admin note: substantial text overlap with arXiv:1803.04624; text overlap with arXiv:1803.04832 and arXiv:1803.04836

Journal ref: ICASSP, 2013

arXiv:1803.05507 [pdf]

ISO/IEC JTC1/SC29/WG11 MPEG2014/ m34661: Quality Assessment of High Dynamic Range (HDR) Video Content Using Existing Full-Reference Metrics

Authors: Amin Banitalebi-Dehkordi, Maryam Azimi, Yuanyuan Dong, Mahsa T. Pourazad, Panos Nasiopoulos

Abstract: The main focus of this document is to evaluate the performance of the existing LDR and HDR metrics on HDR video content which in turn will allow for a better understanding of how well each of these metrics work and if they can be applied in capturing, compressing, transmitting process of HDR data. To this end a series of subjective tests is performed to evaluate the quality of DML-HDR video databa… ▽ More The main focus of this document is to evaluate the performance of the existing LDR and HDR metrics on HDR video content which in turn will allow for a better understanding of how well each of these metrics work and if they can be applied in capturing, compressing, transmitting process of HDR data. To this end a series of subjective tests is performed to evaluate the quality of DML-HDR video database [1], when several different representing types of artifacts are present using a HDR display. Then, the correlation between the results from the existing LDR and HDR quality metrics and those from subjective tests is measured to determine the most effective exiting quality metric for HDR. △ Less

Submitted 13 March, 2018; originally announced March 2018.

Comments: arXiv admin note: substantial text overlap with arXiv:1803.04815

Journal ref: MPEG, 2014

arXiv:1803.05506 [pdf]

ITU-T SG 16 WP 3 and ISO/IEC JTC 1/SC 29/WG 11 - JCT3V-C0032: A human visual system based 3D video quality metric

Authors: Amin Banitalebi-Dehkordi, Mahsa T. Pourazad, Panos Nasiopoulos

Abstract: This contribution proposes a full-reference Human-Visual-System based 3D video quality metric. In this report, the presented metric is used to evaluate the quality of compressed stereo pair formed from a decoded view and a synthesized view. The performance of the proposed metric is verified through a series of subjective tests and compared with that of PSNR, SSIM, MS-SSIM, VIFp, and VQM metrics. T… ▽ More This contribution proposes a full-reference Human-Visual-System based 3D video quality metric. In this report, the presented metric is used to evaluate the quality of compressed stereo pair formed from a decoded view and a synthesized view. The performance of the proposed metric is verified through a series of subjective tests and compared with that of PSNR, SSIM, MS-SSIM, VIFp, and VQM metrics. The experimental results show that HV3D has the highest correlation with Mean Opinion Scores (MOS) compared to other tested metrics. △ Less

Submitted 13 March, 2018; originally announced March 2018.

Comments: arXiv admin note: substantial text overlap with arXiv:1803.04624, arXiv:1803.04629

Journal ref: MPEG, 2013

arXiv:1803.05110 [pdf]

A Study on the Relationship Between Depth Map Quality and the Overall 3D Video Quality OF Experience

Authors: Amin Banitalebi-Dehkordi, Mahsa T. Pourazad, Panos Nasiopoulos

Abstract: The emergence of multiview displays has made the need for synthesizing virtual views more pronounced, since it is not practical to capture all of the possible views when filming multiview content. View synthesis is performed using the available views and depth maps. There is a correlation between the quality of the synthesized views and the quality of depth maps. In this paper we study the effect… ▽ More The emergence of multiview displays has made the need for synthesizing virtual views more pronounced, since it is not practical to capture all of the possible views when filming multiview content. View synthesis is performed using the available views and depth maps. There is a correlation between the quality of the synthesized views and the quality of depth maps. In this paper we study the effect of depth map quality on perceptual quality of synthesized view through subjective and objective analysis. Our evaluation results show that: 1) 3D video quality depends highly on the depth map quality and 2) the Visual Information Fidelity index computed between the reference and distorted depth maps has Pearson correlation ratio of 0.75 and Spearman rank order correlation coefficient of 0.67 with the subjective 3D video quality. △ Less

Submitted 13 March, 2018; originally announced March 2018.

Journal ref: 3DTV, 2013

arXiv:1803.04966 [pdf]

An Improvement Technique based on Structural Similarity Thresholding for Digital Watermarking

Authors: Amin Banitalebi-Dehkordi, Mehdi Banitalebi-Dehkordi, Jamshid Abouei, Said Nader-Esfahani

Abstract: Digital watermarking is extensively used in ownership authentication and copyright protection. In this paper, we propose an efficient thresholding scheme to improve the watermark embedding procedure in an image. For the proposed algorithm, watermark casting is performed separately in each block of an image, and embedding in each block continues until a certain structural similarity threshold is re… ▽ More Digital watermarking is extensively used in ownership authentication and copyright protection. In this paper, we propose an efficient thresholding scheme to improve the watermark embedding procedure in an image. For the proposed algorithm, watermark casting is performed separately in each block of an image, and embedding in each block continues until a certain structural similarity threshold is reached. Numerical evaluations demonstrate that our scheme improves the imperceptibility of the watermark when the capacity remains fix, and at the same time, robustness against attacks is assured. The proposed method is applicable to most image watermarking algorithms. We verify this issue on watermarking schemes in Discrete Cosine Transform (DCT), wavelet, and spatial domain. △ Less

Submitted 13 March, 2018; originally announced March 2018.

Journal ref: ACENG, 2014

arXiv:1803.04847 [pdf]

Introducing A Public Stereoscopic 3D High Dynamic Range (SHDR) Video Database

Authors: Amin Banitalebi-Dehkordi

Abstract: High Dynamic Range (HDR) displays and cameras are paving their ways through the consumer market at a rapid growth rate. Thanks to TV and camera manufacturers, HDR systems are now becoming available commercially to end users. This is taking place only a few years after the blooming of 3D video technologies. MPEG/ITU are also actively working towards the standardization of these technologies. Howeve… ▽ More High Dynamic Range (HDR) displays and cameras are paving their ways through the consumer market at a rapid growth rate. Thanks to TV and camera manufacturers, HDR systems are now becoming available commercially to end users. This is taking place only a few years after the blooming of 3D video technologies. MPEG/ITU are also actively working towards the standardization of these technologies. However, preliminary research efforts in these video technologies are hammered by the lack of sufficient experimental data. In this paper, we introduce a Stereoscopic 3D HDR (SHDR) database of videos that is made publicly available to the research community. We explain the procedure taken to capture, calibrate, and post-process the videos. In addition, we provide insights on potential use-cases, challenges, and research opportunities, implied by the combination of higher dynamic range of the HDR aspect, and depth impression of the 3D aspect. △ Less

Submitted 13 March, 2018; originally announced March 2018.

Journal ref: Journal of 3D Research, 2017

arXiv:1803.04845 [pdf]

Benchmark 3D eye-tracking dataset for visual saliency prediction on stereoscopic 3D video

Authors: Amin Banitalebi-Dehkordi, Eleni Nasiopoulos, Mahsa T. Pourazad, Panos Nasiopoulos

Abstract: Visual Attention Models (VAMs) predict the location of an image or video regions that are most likely to attract human attention. Although saliency detection is well explored for 2D image and video content, there are only few attempts made to design 3D saliency prediction models. Newly proposed 3D visual attention models have to be validated over large-scale video saliency prediction datasets, whi… ▽ More Visual Attention Models (VAMs) predict the location of an image or video regions that are most likely to attract human attention. Although saliency detection is well explored for 2D image and video content, there are only few attempts made to design 3D saliency prediction models. Newly proposed 3D visual attention models have to be validated over large-scale video saliency prediction datasets, which also contain results of eye-tracking information. There are several publicly available eye-tracking datasets for 2D image and video content. In the case of 3D, however, there is still a need for large-scale video saliency datasets for the research community for validating different 3D-VAMs. In this paper, we introduce a large-scale dataset containing eye-tracking data collected from 61 stereoscopic 3D videos (and also 2D versions of those) and 24 subjects participated in a free-viewing test. We evaluate the performance of the existing saliency detection methods over the proposed dataset. In addition, we created an online benchmark for validating the performance of the existing 2D and 3D visual attention models and facilitate addition of new VAMs to the benchmark. Our benchmark currently contains 50 different VAMs. △ Less

Submitted 13 March, 2018; originally announced March 2018.

Journal ref: SPIE, 2016

arXiv:1803.04832 [pdf]

An Efficient Human Visual System Based Quality Metric for 3D Video

Authors: Amin Banitalebi-Dehkordi, Mahsa T. Pourazad, Panos Nasiopoulos

Abstract: Stereoscopic video technologies have been introduced to the consumer market in the past few years. A key factor in designing a 3D system is to understand how different visual cues and distortions affect the perceptual quality of stereoscopic video. The ultimate way to assess 3D video quality is through subjective tests. However, subjective evaluation is time consuming, expensive, and in some cases… ▽ More Stereoscopic video technologies have been introduced to the consumer market in the past few years. A key factor in designing a 3D system is to understand how different visual cues and distortions affect the perceptual quality of stereoscopic video. The ultimate way to assess 3D video quality is through subjective tests. However, subjective evaluation is time consuming, expensive, and in some cases not possible. The other solution is develo** objective quality metrics, which attempt to model the Human Visual System (HVS) in order to assess perceptual quality. Although several 2D quality metrics have been proposed for still images and videos, in the case of 3D efforts are only at the initial stages. In this paper, we propose a new full-reference quality metric for 3D content. Our method mimics HVS by fusing information of both the left and right views to construct the cyclopean view, as well as taking to account the sensitivity of HVS to contrast and the disparity of the views. In addition, a temporal pooling strategy is utilized to address the effect of temporal variations of the quality in the video. Performance evaluations showed that our 3D quality metric quantifies quality degradation caused by several representative types of distortions very accurately, with Pearson correlation coefficient of 90.8 %, a competitive performance compared to the state-of-the-art 3D quality metrics. △ Less

Submitted 13 March, 2018; originally announced March 2018.

Journal ref: Multimedia Tools and Applications, 2015

arXiv:1803.04826 [pdf]

The Effect of Frame Rate on 3D Video Quality and Bitrate

Authors: Amin Banitalebi-Dehkordi, Mahsa T. Pourazad, Panos Nasiopoulos

Abstract: Increasing the frame rate of a 3D video generally results in improved Quality of Experience (QoE). However, higher frame rates involve a higher degree of complexity in capturing, transmission, storage, and display. The question that arises here is what frame rate guarantees high viewing quality of experience given the existing/required 3D devices and technologies (3D cameras, 3D TVs, compression,… ▽ More Increasing the frame rate of a 3D video generally results in improved Quality of Experience (QoE). However, higher frame rates involve a higher degree of complexity in capturing, transmission, storage, and display. The question that arises here is what frame rate guarantees high viewing quality of experience given the existing/required 3D devices and technologies (3D cameras, 3D TVs, compression, transmission bandwidth, and storage capacity). This question has already been addressed for the case of 2D video, but not for 3D. The objective of this paper is to study the relationship between 3D quality and bitrate at different frame rates. Our performance evaluations show that increasing the frame rate of 3D videos beyond 60 fps may not be visually distinguishable. In addition, our experiments show that when the available bandwidth is reduced, the highest possible 3D quality of experience can be achieved by adjusting (decreasing) the frame rate instead of increasing the compression ratio. The results of our study are of particular interest to network providers for rate adaptation in variable bitrate channels. △ Less

Submitted 13 March, 2018; originally announced March 2018.

Journal ref: Journal of 3D Research, 2015

arXiv:1803.04823 [pdf]

Compression of High Dynamic Range Video Using the HEVC and H.264/AVC Standards

Authors: Amin Banitalebi-Dehkordi, Maryam Azimi, Mahsa T. Pourazad, Panos Nasiopoulos

Abstract: The existing video coding standards such as H.264/AVC and High Efficiency Video Coding (HEVC) have been designed based on the statistical properties of Low Dynamic Range (LDR) videos and are not accustomed to the characteristics of High Dynamic Range (HDR) content. In this study, we investigate the performance of the latest LDR video compression standard, HEVC, as well as the recent widely commerc… ▽ More The existing video coding standards such as H.264/AVC and High Efficiency Video Coding (HEVC) have been designed based on the statistical properties of Low Dynamic Range (LDR) videos and are not accustomed to the characteristics of High Dynamic Range (HDR) content. In this study, we investigate the performance of the latest LDR video compression standard, HEVC, as well as the recent widely commercially used video compression standard, H.264/AVC, on HDR content. Subjective evaluations of results on an HDR display show that viewers clearly prefer the videos coded via an HEVC-based encoder to the ones encoded using an H.264/AVC encoder. In particular, HEVC outperforms H.264/AVC by an average of 10.18% in terms of mean opinion score and 25.08% in terms of bit rate savings. △ Less

Submitted 13 March, 2018; originally announced March 2018.

Journal ref: QSHINE, 2014

arXiv:1803.04815 [pdf]

Evaluating the Performance of Existing Full-Reference Quality Metrics on High Dynamic Range (HDR) Video Content

Authors: Maryam Azimi, Amin Banitalebi-Dehkordi, Yuanyuan Dong, Mahsa T. Pourazad, Panos Nasiopoulos

Abstract: While there exists a wide variety of Low Dynamic Range (LDR) quality metrics, only a limited number of metrics are designed specifically for the High Dynamic Range (HDR) content. With the introduction of HDR video compression standardization effort by international standardization bodies, the need for an efficient video quality metric for HDR applications has become more pronounced. The objective… ▽ More While there exists a wide variety of Low Dynamic Range (LDR) quality metrics, only a limited number of metrics are designed specifically for the High Dynamic Range (HDR) content. With the introduction of HDR video compression standardization effort by international standardization bodies, the need for an efficient video quality metric for HDR applications has become more pronounced. The objective of this study is to compare the performance of the existing full-reference LDR and HDR video quality metrics on HDR content and identify the most effective one for HDR applications. To this end, a new HDR video dataset is created, which consists of representative indoor and outdoor video sequences with different brightness, motion levels and different representing types of distortions. The quality of each distorted video in this dataset is evaluated both subjectively and objectively. The correlation between the subjective and objective results confirm that VIF quality metric outperforms all to ther tested metrics in the presence of the tested types of distortions. △ Less

Submitted 13 March, 2018; originally announced March 2018.

Journal ref: ICMSP, 2014

arXiv:1803.04653 [pdf]

Effect of High Frame Rates on 3D Video Quality of Experience

Authors: Amin Banitalebi-Dehkordi, Mahsa T. Pourazad, Panos Nasiopoulos

Abstract: In this paper, we study the effect of 3D videos with increased frame rates on the viewers quality of experience. We performed a series of subjective tests to seek the subjects preferences among videos of the same scene at four different frame rates: 24, 30, 48, and 60 frames per second (fps). Results revealed that subjects clearly prefer higher frame rates. In particular, Mean Opinion Score (MOS)… ▽ More In this paper, we study the effect of 3D videos with increased frame rates on the viewers quality of experience. We performed a series of subjective tests to seek the subjects preferences among videos of the same scene at four different frame rates: 24, 30, 48, and 60 frames per second (fps). Results revealed that subjects clearly prefer higher frame rates. In particular, Mean Opinion Score (MOS) values associated with the 60 fps 3D videos were 55% greater than MOS values of the 24 fps 3D videos. △ Less

Submitted 13 March, 2018; originally announced March 2018.

Journal ref: ICCE, 2014

arXiv:1803.04652 [pdf]

Music Genre Classification Using Spectral Analysis and Sparse Representation of the Signals

Authors: Mehdi Banitalebi-Dehkordi, Amin Banitalebi-Dehkordi

Abstract: In this paper, we proposed a robust music genre classification method based on a sparse FFT based feature extraction method which extracted with discriminating power of spectral analysis of non-stationary audio signals, and the capability of sparse representation based classifiers. Feature extraction method combines two sets of features namely short-term features (extracted from windowed signals)… ▽ More In this paper, we proposed a robust music genre classification method based on a sparse FFT based feature extraction method which extracted with discriminating power of spectral analysis of non-stationary audio signals, and the capability of sparse representation based classifiers. Feature extraction method combines two sets of features namely short-term features (extracted from windowed signals) and long-term features (extracted from combination of extracted short-time features). Experimental results demonstrate that the proposed feature extraction method leads to a sparse representation of audio signals. As a result, a significant reduction in the dimensionality of the signals is achieved. The extracted features are then fed into a sparse representation based classifier (SRC). Our experimental results on the GTZAN database demonstrate that the proposed method outperforms the other state of the art SRC approaches. Moreover, the computational efficiency of the proposed method is better than that of the other Compressive Sampling (CS)-based classifiers. △ Less

Submitted 13 March, 2018; originally announced March 2018.

Journal ref: Journal of Signal Processing Systems, 2014

arXiv:1803.04629 [pdf]

3D Video Quality Metric for 3D Video Compression

Authors: Amin Banitalebi-Dehkordi, Mahsa T. Pourazad, Panos Nasiopoulos

Abstract: As the evolution of multiview display technology is bringing glasses-free 3DTV closer to reality, MPEG and VCEG are preparing an extension to HEVC to encode multiview video content. View synthesis in the current version of the 3D video codec is performed using PSNR as a quality metric measure. In this paper, we propose a full- reference Human-Visual-System based 3D video quality metric to be used… ▽ More As the evolution of multiview display technology is bringing glasses-free 3DTV closer to reality, MPEG and VCEG are preparing an extension to HEVC to encode multiview video content. View synthesis in the current version of the 3D video codec is performed using PSNR as a quality metric measure. In this paper, we propose a full- reference Human-Visual-System based 3D video quality metric to be used in multiview encoding as an alternative to PSNR. Performance of our metric is tested in a 2-view case scenario. The quality of the compressed stereo pair, formed from a decoded view and a synthesized view, is evaluated at the encoder side. The performance is verified through a series of subjective tests and compared with that of PSNR, SSIM, MS-SSIM, VIFp, and VQM metrics. Experimental results showed that our 3D quality metric has the highest correlation with Mean Opinion Scores (MOS) compared to the other tested metrics. △ Less

Submitted 13 March, 2018; originally announced March 2018.

Journal ref: IVMSP, 2013

arXiv:1803.04627 [pdf]

A Random Matrix Approach to Wide Band Spectrum Sensing: Unknown Noise Variance Case

Authors: Sajjad Imani, Amin Banitalebi-Dehkordi, Mehdi Cheraghi

Abstract: In this paper three different scenarios in wide band spectrum sensing have been studied. While the signal and noise statistics are supposed to be unspecified, random matrixes have been utilized in order to estimate the noise variance. These scenarios are: 1- Number of subbands is specified and there is enough information regarding being used or being unused for each of them. 2- Number of subbands… ▽ More In this paper three different scenarios in wide band spectrum sensing have been studied. While the signal and noise statistics are supposed to be unspecified, random matrixes have been utilized in order to estimate the noise variance. These scenarios are: 1- Number of subbands is specified and there is enough information regarding being used or being unused for each of them. 2- Number of subbands is known but there is no information about usage distribution among them. 3- Number of subbands is unknown. Simulation results showed the superior performance of the proposed scheme. Regarding the number of samples, the proposed method requires less number of samples compared to the cyclo-stationary spectrum sensing algorithms and more samples compared to the energy detection based methods. But, regarding the detection probability, the proposed method is superior compared to both other spectrum sensing methods. △ Less

Submitted 13 March, 2018; originally announced March 2018.

Journal ref: ICEE, 2013

arXiv:1803.04624 [pdf]

A Human Visual System-Based 3D Video Quality Metric

Authors: Amin Banitalebi-Dehkordi, Mahsa T. Pourazad, Panos Nasiopoulos

Abstract: Although several 2D quality metrics have been proposed for images and videos, in the case of 3D efforts are only at the initial stages. In this paper, we propose a new full-reference quality metric for 3D content. Our method is modeled around the HVS, fusing the information of both left and right channels, considering color components, the cyclopean views of the two videos and disparity. Performan… ▽ More Although several 2D quality metrics have been proposed for images and videos, in the case of 3D efforts are only at the initial stages. In this paper, we propose a new full-reference quality metric for 3D content. Our method is modeled around the HVS, fusing the information of both left and right channels, considering color components, the cyclopean views of the two videos and disparity. Performance evaluations showed that our 3D quality metric successfully monitors the degradation of quality caused by several representative types of distortion and it has 86% correlation with the results of subjective evaluations. △ Less

Submitted 13 March, 2018; originally announced March 2018.

Comments: IC3D, 2012

arXiv:1803.04096 [pdf]

Saliency Inspired Quality Assessment of Stereoscopic 3D Video

Authors: Amin Banitalebi-Dehkordi, Panos Nasiopoulos

Abstract: To study the visual attentional behavior of Human Visual System (HVS) on 3D content, eye tracking experiments are performed and Visual Attention Models (VAMs) are designed. One of the main applications of these VAMs is in quality assessment of 3D video. The usage of 2D VAMs in designing 2D quality metrics is already well explored. This paper investigates the added value of incorporating 3D VAMs in… ▽ More To study the visual attentional behavior of Human Visual System (HVS) on 3D content, eye tracking experiments are performed and Visual Attention Models (VAMs) are designed. One of the main applications of these VAMs is in quality assessment of 3D video. The usage of 2D VAMs in designing 2D quality metrics is already well explored. This paper investigates the added value of incorporating 3D VAMs into Full-Reference (FR) and No-Reference (NR) quality assessment metrics for stereoscopic 3D video. To this end, state-of-the-art 3D VAMs are integrated to quality assessment pipeline of various existing FR and NR stereoscopic video quality metrics. Performance evaluations using a large scale database of stereoscopic videos with various types of distortions demonstrated that using saliency maps generally improves the performance of the quality assessment task for stereoscopic video. However, depending on the type of distortion, utilized metric, and VAM, the amount of improvement will change. △ Less

Submitted 11 March, 2018; originally announced March 2018.

Comments: Multimedia Tools and Applications, 2018

Showing 1–17 of 17 results for author: Banitalebi-Dehkordi, A