Skip to main content

Showing 1–14 of 14 results for author: Ngan, K N

.
  1. arXiv:2405.07717  [pdf, other

    eess.IV

    On the Adversarial Robustness of Learning-based Image Compression Against Rate-Distortion Attacks

    Authors: Chenhao Wu, Qingbo Wu, Haoran Wei, Shuai Chen, Lei Wang, King Ngi Ngan, Fanman Meng, Hongliang Li

    Abstract: Despite demonstrating superior rate-distortion (RD) performance, learning-based image compression (LIC) algorithms have been found to be vulnerable to malicious perturbations in recent studies. However, the adversarial attacks considered in existing literature remain divergent from real-world scenarios, both in terms of the attack direction and bitrate. Additionally, existing methods focus solely… ▽ More

    Submitted 4 July, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  2. arXiv:2311.15846  [pdf, other

    cs.CV eess.IV

    Learning with Noisy Low-Cost MOS for Image Quality Assessment via Dual-Bias Calibration

    Authors: Lei Wang, Qingbo Wu, Desen Yuan, King Ngi Ngan, Hongliang Li, Fanman Meng, Linfeng Xu

    Abstract: Learning based image quality assessment (IQA) models have obtained impressive performance with the help of reliable subjective quality labels, where mean opinion score (MOS) is the most popular choice. However, in view of the subjective bias of individual annotators, the labor-abundant MOS (LA-MOS) typically requires a large collection of opinion scores from multiple annotators for each image, whi… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  3. arXiv:2209.07126  [pdf, other

    cs.CV cs.MM

    Forgetting to Remember: A Scalable Incremental Learning Framework for Cross-Task Blind Image Quality Assessment

    Authors: Rui Ma, Qingbo Wu, King Ngi Ngan, Hongliang Li, Fanman Meng, Linfeng Xu

    Abstract: Recent years have witnessed the great success of blind image quality assessment (BIQA) in various task-specific scenarios, which present invariable distortion types and evaluation criteria. However, due to the rigid structure and learning framework, they cannot apply to the cross-task BIQA scenario, where the distortion types and evaluation criteria keep changing in practical applications. This pa… ▽ More

    Submitted 6 February, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

  4. Non-Homogeneous Haze Removal via Artificial Scene Prior and Bidimensional Graph Reasoning

    Authors: Haoran Wei, Qingbo Wu, Hui Li, King Ngi Ngan, Hongliang Li, Fanman Meng, Linfeng Xu

    Abstract: Due to the lack of natural scene and haze prior information, it is greatly challenging to completely remove the haze from a single image without distorting its visual content. Fortunately, the real-world haze usually presents non-homogeneous distribution, which provides us with many valuable clues in partial well-preserved regions. In this paper, we propose a Non-Homogeneous Haze Removal Network (… ▽ More

    Submitted 15 November, 2022; v1 submitted 5 April, 2021; originally announced April 2021.

  5. arXiv:2103.15099  [pdf, other

    cs.CV

    BA^2M: A Batch Aware Attention Module for Image Classification

    Authors: Qishang Cheng, Hongliang Li, Qingbo Wu, King Ngi Ngan

    Abstract: The attention mechanisms have been employed in Convolutional Neural Network (CNN) to enhance the feature representation. However, existing attention mechanisms only concentrate on refining the features inside each sample and neglect the discrimination between different samples. In this paper, we propose a batch aware attention module (BA2M) for feature enrichment from a distinctive perspective. Mo… ▽ More

    Submitted 28 March, 2021; originally announced March 2021.

    Comments: 11 pages, 5 figures

  6. arXiv:2103.06549  [pdf, other

    eess.IV

    Advanced Geometry Surface Coding for Dynamic Point Cloud Compression

    Authors: Jian Xiong, Hao Gao, Miaohui Wang, Hongliang Li, King Ngi Ngan, Weisi Lin

    Abstract: In video-based dynamic point cloud compression (V-PCC), 3D point clouds are projected onto 2D images for compressing with the existing video codecs. However, the existing video codecs are originally designed for natural visual signals, and it fails to account for the characteristics of point clouds. Thus, there are still problems in the compression of geometry information generated from the point… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

  7. arXiv:1909.11983  [pdf, other

    eess.IV cs.CV

    Subjective and Objective De-raining Quality Assessment Towards Authentic Rain Image

    Authors: Qingbo Wu, Lei Wang, King N. Ngan, Hongliang Li, Fanman Meng, Linfeng Xu

    Abstract: Images acquired by outdoor vision systems easily suffer poor visibility and annoying interference due to the rainy weather, which brings great challenge for accurately understanding and describing the visual contents. Recent researches have devoted great efforts on the task of rain removal for improving the image visibility. However, there is very few exploration about the quality assessment of de… ▽ More

    Submitted 5 October, 2019; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: In this revision, we add the comparison with our previous exploration towards the de-raining quality assessment in Ref. [16]. Some typos in Tables III and IV are corrected, where the missed minus signs are added back for some OU metrics

  8. arXiv:1909.09839  [pdf, other

    cs.CV

    Class Activation Map generation by Multiple Level Class Grou** and Orthogonal Constraint

    Authors: Kaixu Huang, Fanman Meng, Hongliang Li, Shuai Chen, Qingbo Wu, King N. Ngan

    Abstract: Class activation map (CAM) highlights regions of classes based on classification network, which is widely used in weakly supervised tasks. However, it faces the problem that the class activation regions are usually small and local. Although several efforts paid to the second step (the CAM generation step) have partially enhanced the generation, we believe such problem is also caused by the first s… ▽ More

    Submitted 21 September, 2019; originally announced September 2019.

    Comments: International Conference on Digital Image Computing: Techniques and Applications(DICTA) 2019

  9. arXiv:1909.08754  [pdf, other

    cs.CV

    A New Few-shot Segmentation Network Based on Class Representation

    Authors: Yuwei Yang, Fanman Meng, Hongliang Li, King N. Ngan, Qingbo Wu

    Abstract: This paper studies few-shot segmentation, which is a task of predicting foreground mask of unseen classes by a few of annotations only, aided by a set of rich annotations already existed. The existing methods mainly focus the task on "\textit{how to transfer segmentation cues from support images (labeled images) to query images (unlabeled images)}", and try to learn efficient and general transfer… ▽ More

    Submitted 18 September, 2019; originally announced September 2019.

    Comments: accepted by VCIP2019

  10. arXiv:1905.02114  [pdf, other

    cs.CV

    Visibility Constrained Generative Model for Depth-based 3D Facial Pose Tracking

    Authors: Lu Sheng, Jianfei Cai, Tat-Jen Cham, Vladimir Pavlovic, King Ngi Ngan

    Abstract: In this paper, we propose a generative framework that unifies depth-based 3D facial pose tracking and face model adaptation on-the-fly, in the unconstrained scenarios with heavy occlusions and arbitrary facial expression variations. Specifically, we introduce a statistical 3D morphable model that flexibly describes the distribution of points on the surface of the face model, with an efficient swit… ▽ More

    Submitted 6 May, 2019; originally announced May 2019.

  11. arXiv:1904.04473  [pdf, other

    cs.CV

    MVF-Net: Multi-View 3D Face Morphable Model Regression

    Authors: Fanzi Wu, Linchao Bao, Ya**g Chen, Yonggen Ling, Yibing Song, Songnan Li, King Ngi Ngan, Wei Liu

    Abstract: We address the problem of recovering the 3D geometry of a human face from a set of facial images in multiple views. While recent studies have shown impressive progress in 3D Morphable Model (3DMM) based facial reconstruction, the settings are mostly restricted to a single view. There is an inherent drawback in the single-view setting: the lack of reliable 3D constraints can cause unresolvable ambi… ▽ More

    Submitted 9 April, 2019; originally announced April 2019.

    Comments: 2019 Conference on Computer Vision and Pattern Recognition

  12. arXiv:1901.03060  [pdf, ps, other

    cs.CV cs.LG

    Hierarchy Neighborhood Discriminative Hashing for An Unified View of Single-Label and Multi-Label Image retrieval

    Authors: Lei Ma, Hongliang Li, Qingbo Wu, Fanman Meng, King Ngi Ngan

    Abstract: Recently, deep supervised hashing methods have become popular for large-scale image retrieval task. To preserve the semantic similarity notion between examples, they typically utilize the pairwise supervision or the triplet supervised information for hash learning. However, these methods usually ignore the semantic class information which can help the improvement of the semantic discriminative abi… ▽ More

    Submitted 11 January, 2019; v1 submitted 10 January, 2019; originally announced January 2019.

  13. arXiv:1712.03491  [pdf, other

    cs.CV

    3D Facial Expression Reconstruction using Cascaded Regression

    Authors: Fanzi Wu, Songnan Li, Tianhao Zhao, King Ngi Ngan, Lv Sheng

    Abstract: This paper proposes a novel model fitting algorithm for 3D facial expression reconstruction from a single image. Face expression reconstruction from a single image is a challenging task in computer vision. Most state-of-the-art methods fit the input image to a 3D Morphable Model (3DMM). These methods need to solve a stochastic problem and cannot deal with expression and pose variations. To solve t… ▽ More

    Submitted 17 August, 2018; v1 submitted 10 December, 2017; originally announced December 2017.

  14. A Perceptually Weighted Rank Correlation Indicator for Objective Image Quality Assessment

    Authors: Qingbo Wu, Hongliang Li, Fanman Meng, King N. Ngan

    Abstract: In the field of objective image quality assessment (IQA), the Spearman's $ρ$ and Kendall's $τ$ are two most popular rank correlation indicators, which straightforwardly assign uniform weight to all quality levels and assume each pair of images are sortable. They are successful for measuring the average accuracy of an IQA metric in ranking multiple processed images. However, two important perceptua… ▽ More

    Submitted 15 May, 2017; originally announced May 2017.

    Comments: This paper has been submitted to IEEE Transactions on Image Processing

    ACM Class: I.4.0; I.4.3