Skip to main content

Showing 1–41 of 41 results for author: Min, X

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.09356  [pdf, other

    cs.CV eess.IV

    CMC-Bench: Towards a New Paradigm of Visual Signal Compression

    Authors: Chunyi Li, Xiele Wu, Haoning Wu, Donghui Feng, Zicheng Zhang, Guo Lu, Xiongkuo Min, Xiaohong Liu, Guangtao Zhai, Weisi Lin

    Abstract: Ultra-low bitrate image compression is a challenging and demanding topic. With the development of Large Multimodal Models (LMMs), a Cross Modality Compression (CMC) paradigm of Image-Text-Image has emerged. Compared with traditional codecs, this semantic-level compression can reduce image data size to 0.1\% or even lower, which has strong potential applications. However, CMC has certain defects in… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2405.08745  [pdf, other

    eess.IV cs.CV cs.MM

    Enhancing Blind Video Quality Assessment with Rich Quality-aware Features

    Authors: Wei Sun, Haoning Wu, Zicheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai

    Abstract: In this paper, we present a simple but effective method to enhance blind video quality assessment (BVQA) models for social media videos. Motivated by previous researches that leverage pre-trained features extracted from various computer vision models as the feature representation for BVQA, we further explore rich quality-aware features from pre-trained blind image quality assessment (BIQA) and BVQ… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  3. arXiv:2405.06342  [pdf, other

    cs.CV eess.IV

    Compression-Realized Deep Structural Network for Video Quality Enhancement

    Authors: Hanchi Sun, Xiaohong Liu, Xinyang Jiang, Yifei Shen, Dongsheng Li, Xiongkuo Min, Guangtao Zhai

    Abstract: This paper focuses on the task of quality enhancement for compressed videos. Although deep network-based video restorers achieve impressive progress, most of the existing methods lack a structured design to optimally leverage the priors within compression codecs. Since the quality degradation of the video is primarily induced by the compression algorithm, a new paradigm is urgently needed for a mo… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  4. arXiv:2404.11313  [pdf, other

    eess.IV cs.AI

    NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results

    Authors: Xin Li, Kun Yuan, Ya**g Pei, Yiting Lu, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Wei Sun, Haoning Wu, Zicheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Jianhui Sun, Tianyi Wang, Lei Li, Han Kong, Wenxuan Wang, Bing Li, Cheng Luo , et al. (43 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 Challenge on Shortform UGC Video Quality Assessment (S-UGC VQA), where various excellent solutions are submitted and evaluated on the collected dataset KVQ from popular short-form video platform, i.e., Kuaishou/Kwai Platform. The KVQ database is divided into three parts, including 2926 videos for training, 420 videos for validation, and 854 videos for testing. The… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR2024 Workshop. The challenge report for CVPR NTIRE2024 Short-form UGC Video Quality Assessment Challenge

  5. arXiv:2404.09003  [pdf, other

    cs.CV eess.IV

    THQA: A Perceptual Quality Assessment Database for Talking Heads

    Authors: Yingjie Zhou, Zicheng Zhang, Wei Sun, Xiaohong Liu, Xiongkuo Min, Zhihua Wang, Xiao-** Zhang, Guangtao Zhai

    Abstract: In the realm of media technology, digital humans have gained prominence due to rapid advancements in computer technology. However, the manual modeling and control required for the majority of digital humans pose significant obstacles to efficient development. The speech-driven methods offer a novel avenue for manipulating the mouth shape and expressions of digital humans. Despite the proliferation… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

  6. arXiv:2404.01024  [pdf, other

    cs.CV eess.IV

    AIGCOIQA2024: Perceptual Quality Assessment of AI Generated Omnidirectional Images

    Authors: Liu Yang, Huiyu Duan, Long Teng, Yucheng Zhu, Xiaohong Liu, Menghan Hu, Xiongkuo Min, Guangtao Zhai, Patrick Le Callet

    Abstract: In recent years, the rapid advancement of Artificial Intelligence Generated Content (AIGC) has attracted widespread attention. Among the AIGC, AI generated omnidirectional images hold significant potential for Virtual Reality (VR) and Augmented Reality (AR) applications, hence omnidirectional AIGC techniques have also been widely studied. AI-generated omnidirectional images exhibit unique distorti… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  7. arXiv:2402.03413  [pdf, other

    cs.MM cs.CV eess.IV

    Perceptual Video Quality Assessment: A Survey

    Authors: Xiongkuo Min, Huiyu Duan, Wei Sun, Yucheng Zhu, Guangtao Zhai

    Abstract: Perceptual video quality assessment plays a vital role in the field of video processing due to the existence of quality degradations introduced in various stages of video signal acquisition, compression, transmission and display. With the advancement of internet communication and cloud service technology, video content and traffic are growing exponentially, which further emphasizes the requirement… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  8. arXiv:2401.01117  [pdf, other

    cs.CV eess.IV

    Q-Refine: A Perceptual Quality Refiner for AI-Generated Image

    Authors: Chunyi Li, Haoning Wu, Zicheng Zhang, Hongkun Hao, Kaiwei Zhang, Lei Bai, Xiaohong Liu, Xiongkuo Min, Weisi Lin, Guangtao Zhai

    Abstract: With the rapid evolution of the Text-to-Image (T2I) model in recent years, their unsatisfactory generation result has become a challenge. However, uniformly refining AI-Generated Images (AIGIs) of different qualities not only limited optimization capabilities for low-quality AIGIs but also brought negative optimization to high-quality AIGIs. To address this issue, a quality-award refiner named Q-R… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: 6 pages, 5 figures

  9. arXiv:2312.15659  [pdf, other

    eess.IV

    Perceptual Quality Assessment for Video Frame Interpolation

    Authors: **liang Han, Xiongkuo Min, Yixuan Gao, Jun Jia, Lei Sun, Zuowei Cao, Yonglin Luo, Guangtao Zhai

    Abstract: The quality of frames is significant for both research and application of video frame interpolation (VFI). In recent VFI studies, the methods of full-reference image quality assessment have generally been used to evaluate the quality of VFI frames. However, high frame rate reference videos, necessities for the full-reference methods, are difficult to obtain in most applications of VFI. To evaluate… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: 5 pages, 4 figures

    ACM Class: I.4.0

  10. arXiv:2311.18216  [pdf, other

    cs.CV cs.MM eess.IV

    FS-BAND: A Frequency-Sensitive Banding Detector

    Authors: Zijian Chen, Wei Sun, Zicheng Zhang, Ru Huang, Fangfang Lu, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang

    Abstract: Banding artifact, as known as staircase-like contour, is a common quality annoyance that happens in compression, transmission, etc. scenarios, which largely affects the user's quality of experience (QoE). The banding distortion typically appears as relatively small pixel-wise variations in smooth backgrounds, which is difficult to analyze in the spatial domain but easily reflected in the frequency… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2311.17752

  11. arXiv:2310.17147  [pdf, other

    cs.CV eess.IV

    Simple Baselines for Projection-based Full-reference and No-reference Point Cloud Quality Assessment

    Authors: Zicheng Zhang, Yingjie Zhou, Wei Sun, Xiongkuo Min, Guangtao Zhai

    Abstract: Point clouds are widely used in 3D content representation and have various applications in multimedia. However, compression and simplification processes inevitably result in the loss of quality-aware information under storage and bandwidth constraints. Therefore, there is an increasing need for effective methods to quantify the degree of distortion in point clouds. In this paper, we propose simple… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

  12. arXiv:2310.16732  [pdf, other

    cs.CV eess.IV

    A No-Reference Quality Assessment Method for Digital Human Head

    Authors: Yingjie Zhou, Zicheng Zhang, Wei Sun, Xiongkuo Min, Xianghe Ma, Guangtao Zhai

    Abstract: In recent years, digital humans have been widely applied in augmented/virtual reality (A/VR), where viewers are allowed to freely observe and interact with the volumetric content. However, the digital humans may be degraded with various distortions during the procedure of generation and transmission. Moreover, little effort has been put into the perceptual quality assessment of digital humans. The… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

  13. arXiv:2310.15984  [pdf, other

    cs.CV eess.IV

    Geometry-Aware Video Quality Assessment for Dynamic Digital Human

    Authors: Zicheng Zhang, Yingjie Zhou, Wei Sun, Xiongkuo Min, Guangtao Zhai

    Abstract: Dynamic Digital Humans (DDHs) are 3D digital models that are animated using predefined motions and are inevitably bothered by noise/shift during the generation process and compression distortion during the transmission process, which needs to be perceptually evaluated. Usually, DDHs are displayed as 2D rendered animation videos and it is natural to adapt video quality assessment (VQA) methods to D… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

  14. StableVQA: A Deep No-Reference Quality Assessment Model for Video Stability

    Authors: Tengchuan Kou, Xiaohong Liu, Wei Sun, Jun Jia, Xiongkuo Min, Guangtao Zhai, Ning Liu

    Abstract: Video shakiness is an unpleasant distortion of User Generated Content (UGC) videos, which is usually caused by the unstable hold of cameras. In recent years, many video stabilization algorithms have been proposed, yet no specific and accurate metric enables comprehensively evaluating the stability of videos. Indeed, most existing quality assessment models evaluate video quality as a whole without… ▽ More

    Submitted 27 October, 2023; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted by ACM MM'23

  15. arXiv:2307.13981  [pdf, other

    cs.CV cs.MM eess.IV

    Analysis of Video Quality Datasets via Design of Minimalistic Video Quality Models

    Authors: Wei Sun, Wen Wen, Xiongkuo Min, Long Lan, Guangtao Zhai, Kede Ma

    Abstract: Blind video quality assessment (BVQA) plays an indispensable role in monitoring and improving the end-users' viewing experience in various real-world video-enabled media applications. As an experimental field, the improvements of BVQA models have been measured primarily on a few human-rated VQA datasets. Thus, it is crucial to gain a better understanding of existing VQA datasets in order to proper… ▽ More

    Submitted 3 April, 2024; v1 submitted 26 July, 2023; originally announced July 2023.

  16. arXiv:2307.10813  [pdf, other

    cs.CV cs.SD eess.AS eess.IV

    Perceptual Quality Assessment of Omnidirectional Audio-visual Signals

    Authors: Xilei Zhu, Huiyu Duan, Yuqin Cao, Yuxin Zhu, Yucheng Zhu, **g Liu, Li Chen, Xiongkuo Min, Guangtao Zhai

    Abstract: Omnidirectional videos (ODVs) play an increasingly important role in the application fields of medical, education, advertising, tourism, etc. Assessing the quality of ODVs is significant for service-providers to improve the user's Quality of Experience (QoE). However, most existing quality assessment studies for ODVs only focus on the visual distortions of videos, while ignoring that the overall Q… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: 12 pages, 5 figures, to be published in CICAI2023

    ACM Class: I.4.0; I.5.4

  17. arXiv:2307.09729  [pdf, other

    cs.CV cs.MM eess.IV

    NTIRE 2023 Quality Assessment of Video Enhancement Challenge

    Authors: Xiaohong Liu, Xiongkuo Min, Wei Sun, Yulun Zhang, Kai Zhang, Radu Timofte, Guangtao Zhai, Yixuan Gao, Yuqin Cao, Tengchuan Kou, Yunlong Dong, Ziheng Jia, Yilin Li, Wei Wu, Shuming Hu, Sibin Deng, Pengxiang Xiao, Ying Chen, Kai Li, Kai Zhao, Kun Yuan, Ming Sun, Heng Cong, Hao Wang, Lingzhi Fu , et al. (47 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2023 Quality Assessment of Video Enhancement Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2023. This challenge is to address a major challenge in the field of video processing, namely, video quality assessment (VQA) for enhanced videos. The challenge uses the VQA Dataset for Perceptual… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  18. arXiv:2307.02808  [pdf, other

    eess.IV cs.CV cs.DB

    Advancing Zero-Shot Digital Human Quality Assessment through Text-Prompted Evaluation

    Authors: Zicheng Zhang, Wei Sun, Yingjie Zhou, Haoning Wu, Chunyi Li, Xiongkuo Min, Xiaohong Liu, Guangtao Zhai, Weisi Lin

    Abstract: Digital humans have witnessed extensive applications in various domains, necessitating related quality assessment studies. However, there is a lack of comprehensive digital human quality assessment (DHQA) databases. To address this gap, we propose SJTU-H3D, a subjective quality assessment database specifically designed for full-body digital humans. It comprises 40 high-quality reference digital hu… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

  19. arXiv:2307.00211  [pdf, other

    cs.CV eess.IV

    AIGCIQA2023: A Large-scale Image Quality Assessment Database for AI Generated Images: from the Perspectives of Quality, Authenticity and Correspondence

    Authors: Jiarui Wang, Huiyu Duan, **g Liu, Shi Chen, Xiongkuo Min, Guangtao Zhai

    Abstract: In this paper, in order to get a better understanding of the human visual preferences for AIGIs, a large-scale IQA database for AIGC is established, which is named as AIGCIQA2023. We first generate over 2000 images based on 6 state-of-the-art text-to-image generation models using 100 prompts. Based on these images, a well-organized subjective experiment is conducted to assess the human visual pref… ▽ More

    Submitted 15 July, 2023; v1 submitted 30 June, 2023; originally announced July 2023.

  20. arXiv:2306.05658  [pdf, other

    cs.CV eess.IV

    GMS-3DQA: Projection-based Grid Mini-patch Sampling for 3D Model Quality Assessment

    Authors: Zicheng Zhang, Wei Sun, Houning Wu, Yingjie Zhou, Chunyi Li, Xiongkuo Min, Guangtao Zhai, Weisi Lin

    Abstract: Nowadays, most 3D model quality assessment (3DQA) methods have been aimed at improving performance. However, little attention has been paid to the computational cost and inference time required for practical applications. Model-based 3DQA methods extract features directly from the 3D models, which are characterized by their high degree of complexity. As a result, many researchers are inclined towa… ▽ More

    Submitted 31 January, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

  21. arXiv:2306.04717  [pdf, other

    cs.CV cs.AI eess.IV

    AGIQA-3K: An Open Database for AI-Generated Image Quality Assessment

    Authors: Chunyi Li, Zicheng Zhang, Haoning Wu, Wei Sun, Xiongkuo Min, Xiaohong Liu, Guangtao Zhai, Weisi Lin

    Abstract: With the rapid advancements of the text-to-image generative model, AI-generated images (AGIs) have been widely applied to entertainment, education, social media, etc. However, considering the large quality variance among different AGIs, there is an urgent need for quality models that are consistent with human subjective ratings. To address this issue, we extensively consider various popular AGI mo… ▽ More

    Submitted 12 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 12 pages, 11 figures

  22. arXiv:2303.12618  [pdf, other

    cs.CV eess.IV

    A Perceptual Quality Assessment Exploration for AIGC Images

    Authors: Zicheng Zhang, Chunyi Li, Wei Sun, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai

    Abstract: \underline{AI} \underline{G}enerated \underline{C}ontent (\textbf{AIGC}) has gained widespread attention with the increasing efficiency of deep learning in content creation. AIGC, created with the assistance of artificial intelligence technology, includes various forms of content, among which the AI-generated images (AGIs) have brought significant impact to society and have been applied to various… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

  23. arXiv:2303.09290  [pdf, other

    eess.IV

    VDPVE: VQA Dataset for Perceptual Video Enhancement

    Authors: Yixuan Gao, Yuqin Cao, Tengchuan Kou, Wei Sun, Yunlong Dong, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai

    Abstract: Recently, many video enhancement methods have been proposed to improve video quality from different aspects such as color, brightness, contrast, and stability. Therefore, how to evaluate the quality of the enhanced video in a way consistent with human visual perception is an important research topic. However, most video quality assessment methods mainly calculate video quality by estimating the di… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

  24. arXiv:2303.08050  [pdf, other

    cs.CV eess.IV

    Subjective and Objective Quality Assessment for in-the-Wild Computer Graphics Images

    Authors: Zicheng Zhang, Wei Sun, Yingjie Zhou, Jun Jia, Zhichao Zhang, **g Liu, Xiongkuo Min, Guangtao Zhai

    Abstract: Computer graphics images (CGIs) are artificially generated by means of computer programs and are widely perceived under various scenarios, such as games, streaming media, etc. In practice, the quality of CGIs consistently suffers from poor rendering during production, inevitable compression artifacts during the transmission of multimedia applications, and low aesthetic quality resulting from poor… ▽ More

    Submitted 1 November, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

  25. Audio-Visual Quality Assessment for User Generated Content: Database and Method

    Authors: Yuqin Cao, Xiongkuo Min, Wei Sun, ** Zhang, Guangtao Zhai

    Abstract: With the explosive increase of User Generated Content (UGC), UGC video quality assessment (VQA) becomes more and more important for improving users' Quality of Experience (QoE). However, most existing UGC VQA studies only focus on the visual distortions of videos, ignoring that the user's QoE also depends on the accompanying audio signals. In this paper, we conduct the first study to address the p… ▽ More

    Submitted 27 December, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

  26. arXiv:2302.08715  [pdf, other

    cs.CV eess.IV

    EEP-3DQA: Efficient and Effective Projection-based 3D Model Quality Assessment

    Authors: Zicheng Zhang, Wei Sun, Yingjie Zhou, Wei Lu, Yucheng Zhu, Xiongkuo Min, Guangtao Zhai

    Abstract: Currently, great numbers of efforts have been put into improving the effectiveness of 3D model quality assessment (3DQA) methods. However, little attention has been paid to the computational costs and inference time, which is also important for practical applications. Unlike 2D media, 3D models are represented by more complicated and irregular digital formats, such as point cloud and mesh. Thus it… ▽ More

    Submitted 27 August, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

  27. arXiv:2210.00933  [pdf, other

    cs.CV eess.IV

    Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop

    Authors: Weixia Zhang, Dingquan Li, Xiongkuo Min, Guangtao Zhai, Guodong Guo, Xiaokang Yang, Kede Ma

    Abstract: No-reference image quality assessment (NR-IQA) aims to quantify how humans perceive visual distortions of digital images without access to their undistorted references. NR-IQA models are extensively studied in computational vision, and are widely used for performance evaluation and perceptual optimization of man-made vision systems. Here we make one of the first attempts to examine the perceptual… ▽ More

    Submitted 3 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  28. arXiv:2209.09489  [pdf, other

    cs.CV eess.IV

    Perceptual Quality Assessment for Digital Human Heads

    Authors: Zicheng Zhang, Yingjie Zhou, Wei Sun, Xiongkuo Min, Yuzhe Wu, Guangtao Zhai

    Abstract: Digital humans are attracting more and more research interest during the last decade, the generation, representation, rendering, and animation of which have been put into large amounts of effort. However, the quality assessment of digital humans has fallen behind. Therefore, to tackle the challenge of digital human quality assessment issues, we propose the first large-scale quality assessment data… ▽ More

    Submitted 28 February, 2023; v1 submitted 20 September, 2022; originally announced September 2022.

  29. arXiv:2209.00244  [pdf, other

    cs.CV eess.IV

    MM-PCQA: Multi-Modal Learning for No-reference Point Cloud Quality Assessment

    Authors: Zicheng Zhang, Wei Sun, Xiongkuo Min, Quan Zhou, Jun He, Qiyuan Wang, Guangtao Zhai

    Abstract: The visual quality of point clouds has been greatly emphasized since the ever-increasing 3D vision applications are expected to provide cost-effective and high-quality experiences for users. Looking back on the development of point cloud quality assessment (PCQA) methods, the visual quality is usually evaluated by utilizing single-modal information, i.e., either extracted from the 2D projections o… ▽ More

    Submitted 24 April, 2023; v1 submitted 1 September, 2022; originally announced September 2022.

  30. arXiv:2208.14085  [pdf, other

    cs.CV eess.IV

    Evaluating Point Cloud from Moving Camera Videos: A No-Reference Metric

    Authors: Zicheng Zhang, Wei Sun, Yucheng Zhu, Xiongkuo Min, Wei Wu, Ying Chen, Guangtao Zhai

    Abstract: Point cloud is one of the most widely used digital representation formats for three-dimensional (3D) contents, the visual quality of which may suffer from noise and geometric shift distortions during the production procedure as well as compression and downsampling distortions during the transmission process. To tackle the challenge of point cloud quality assessment (PCQA), many PCQA methods have b… ▽ More

    Submitted 6 December, 2023; v1 submitted 30 August, 2022; originally announced August 2022.

  31. arXiv:2206.05054  [pdf, other

    eess.IV cs.CV

    A No-reference Quality Assessment Metric for Point Cloud Based on Captured Video Sequences

    Authors: Yu Fan, Zicheng Zhang, Wei Sun, Xiongkuo Min, Wei Lu, Tao Wang, Ning Liu, Guangtao Zhai

    Abstract: Point cloud is one of the most widely used digital formats of 3D models, the visual quality of which is quite sensitive to distortions such as downsampling, noise, and compression. To tackle the challenge of point cloud quality assessment (PCQA) in scenarios where reference is not available, we propose a no-reference quality assessment metric for colored point cloud based on captured video sequenc… ▽ More

    Submitted 20 September, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: Accepted to IEEE 24th International Workshop on Multimedia Signal Processing, 2022

  32. arXiv:2206.04289  [pdf, other

    eess.IV cs.CV

    A No-Reference Deep Learning Quality Assessment Method for Super-resolution Images Based on Frequency Maps

    Authors: Zicheng Zhang, Wei Sun, Xiongkuo Min, Wenhan Zhu, Tao Wang, Wei Lu, Guangtao Zhai

    Abstract: To support the application scenarios where high-resolution (HR) images are urgently needed, various single image super-resolution (SISR) algorithms are developed. However, SISR is an ill-posed inverse problem, which may bring artifacts like texture shift, blur, etc. to the reconstructed images, thus it is necessary to evaluate the quality of super-resolution images (SRIs). Note that most existing… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

  33. arXiv:2204.14047  [pdf, other

    cs.CV cs.MM eess.IV

    A Deep Learning based No-reference Quality Assessment Model for UGC Videos

    Authors: Wei Sun, Xiongkuo Min, Wei Lu, Guangtao Zhai

    Abstract: Quality assessment for User Generated Content (UGC) videos plays an important role in ensuring the viewing experience of end-users. Previous UGC video quality assessment (VQA) studies either use the image recognition model or the image quality assessment (IQA) models to extract frame-level features of UGC videos for quality regression, which are regarded as the sub-optimal solutions because of the… ▽ More

    Submitted 20 October, 2022; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: Accepted by ACM MM 2022

    Journal ref: Proceedings of the 30th ACM International Conference on Multimedia (2022) 856-865

  34. Confusing Image Quality Assessment: Towards Better Augmented Reality Experience

    Authors: Huiyu Duan, Xiongkuo Min, Yucheng Zhu, Guangtao Zhai, Xiaokang Yang, Patrick Le Callet

    Abstract: With the development of multimedia technology, Augmented Reality (AR) has become a promising next-generation mobile platform. The primary value of AR is to promote the fusion of digital contents and real-world environments, however, studies on how this fusion will influence the Quality of Experience (QoE) of these two components are lacking. To achieve better QoE of AR, whose two layers are influe… ▽ More

    Submitted 31 October, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

  35. arXiv:2203.00926  [pdf, other

    eess.IV cs.CV

    Parameterized Image Quality Score Distribution Prediction

    Authors: Yixuan Gao, Xiongkuo Min, Wenhan Zhu, Xiao-** Zhang, Guangtao Zhai

    Abstract: Recently, image quality has been generally describedby a mean opinion score (MOS). However, we observe that thequality scores of an image given by a group of subjects are verysubjective and diverse. Thus it is not enough to use a MOS todescribe the image quality. In this paper, we propose to describeimage quality using a parameterized distribution rather thana MOS, and an objective method is also… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

  36. arXiv:2107.02041  [pdf, other

    cs.CV cs.GR eess.IV

    No-Reference Quality Assessment for 3D Colored Point Cloud and Mesh Models

    Authors: Zicheng Zhang, Wei Sun, Xiongkuo Min, Tao Wang, Wei Lu, Guangtao Zhai

    Abstract: To improve the viewer's Quality of Experience (QoE) and optimize computer graphics applications, 3D model quality assessment (3D-QA) has become an important task in the multimedia area. Point cloud and mesh are the two most widely used digital representation formats of 3D models, the visual quality of which is quite sensitive to lossy operations like simplification and compression. Therefore, many… ▽ More

    Submitted 2 May, 2022; v1 submitted 5 July, 2021; originally announced July 2021.

  37. arXiv:2106.08165  [pdf, ps, other

    cs.IT eess.SP

    QoE Driven VR 360 Video Massive MIMO Transmission

    Authors: Long Teng, Guangtao Zhai, Yongpeng Wu, Xiongkuo Min, Wenjun Zhang, Zhi Ding, Chengshang Xiao

    Abstract: Massive multiple-input and multiple-output (MIMO) enables ultra-high throughput and low latency for tile-based adaptive virtual reality (VR) 360 video transmission in wireless network. In this paper, we consider a massive MIMO system where multiple users in a single-cell theater watch an identical VR 360 video. Based on tile prediction, base station (BS) deliveries the tiles in predicted field of… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: Acceptede by IEEE transactions on wireless communications

  38. arXiv:2106.01111  [pdf, other

    eess.IV cs.CV cs.MM

    Deep Learning based Full-reference and No-reference Quality Assessment Models for Compressed UGC Videos

    Authors: Wei Sun, Tao Wang, Xiongkuo Min, Fuwang Yi, Guangtao Zhai

    Abstract: In this paper, we propose a deep learning based video quality assessment (VQA) framework to evaluate the quality of the compressed user's generated content (UGC) videos. The proposed VQA framework consists of three modules, the feature extraction module, the quality regression module, and the quality pooling module. For the feature extraction module, we fuse the features from intermediate layers o… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

  39. arXiv:2008.00195  [pdf, other

    eess.IV cs.CV

    Joint Generative Learning and Super-Resolution For Real-World Camera-Screen Degradation

    Authors: Guanghao Yin, Shouqian Sun, Chao Li, Xin Min

    Abstract: In real-world single image super-resolution (SISR) task, the low-resolution image suffers more complicated degradations, not only downsampled by unknown kernels. However, existing SISR methods are generally studied with the synthetic low-resolution generation such as bicubic interpolation (BI), which greatly limits their performance. Recently, some researchers investigate real-world SISR from the… ▽ More

    Submitted 14 September, 2020; v1 submitted 1 August, 2020; originally announced August 2020.

  40. arXiv:2003.01299  [pdf, other

    eess.IV cs.MM

    A multiple attributes image quality database for smartphone camera photo quality assessment

    Authors: Wenhan Zhu, Guangtao Zhai, Zongxi Han, Xiongkuo Min, Tao Wang, Zicheng Zhang, Xiaokang Yang

    Abstract: Smartphone is the superstar product in digital device market and the quality of smartphone camera photos (SCPs) is becoming one of the dominant considerations when consumers purchase smartphones. How to evaluate the quality of smartphone cameras and the taken photos is urgent issue to be solved. To bridge the gap between academic research accomplishment and industrial needs, in this paper, we esta… ▽ More

    Submitted 2 March, 2020; originally announced March 2020.

  41. arXiv:1912.05971  [pdf, other

    eess.IV cs.CV cs.HC

    Toward Better Understanding of Saliency Prediction in Augmented 360 Degree Videos

    Authors: Yucheng Zhu, Xiongkuo Min, DanDan Zhu, Ke Gu, Jiantao Zhou, Guangtao Zhai, Xiaokang Yang, Wenjun Zhang

    Abstract: Augmented reality (AR) overlays digital content onto the reality. In AR system, correct and precise estimations of user's visual fixations and head movements can enhance the quality of experience by allocating more computation resources on the areas of interest. However, there is inadequate research about understanding the visual exploration of users when using an AR system or modeling AR visual a… ▽ More

    Submitted 20 July, 2020; v1 submitted 12 December, 2019; originally announced December 2019.