Showing 1–4 of 4 results for author: Kwan, H M

  1. arXiv:2402.01596  [pdf, other]

    eess.IV cs.CV

    Immersive Video Compression using Implicit Neural Representations

    Authors: Ho Man Kwan, Fan Zhang, Andrew Gower, David Bull

    Abstract: Recent work on implicit neural representations (INRs) has evidenced their potential for efficiently representing and encoding conventional video content. In this paper we, for the first time, extend their application to immersive (multi-view) videos, by proposing MV-HiNeRV, a new INR-based immersive video codec. MV-HiNeRV is an enhanced version of a state-of-the-art INR-based video codec, HiNeRV,…

    Submitted 23 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  2. arXiv:2312.17029  [pdf, other]

    cs.LG

    FedSDD: Scalable and Diversity-enhanced Distillation for Model Aggregation in Federated Learning

    Authors: Ho Man Kwan, Shenghui Song

    Abstract: Recently, innovative model aggregation methods based on knowledge distillation (KD) have been proposed for federated learning (FL). These methods not only improved the robustness of model aggregation over heterogeneous learning environment, but also allowed training heterogeneous models on client devices. However, the scalability of existing methods is not satisfactory, because the training cost o…

    Submitted 28 December, 2023; originally announced December 2023.

  3. HiNeRV: Video Compression with Hierarchical Encoding-based Neural Representation

    Authors: Ho Man Kwan, Ge Gao, Fan Zhang, Andrew Gower, David Bull

    Abstract: Learning-based video compression is currently a popular research topic, offering the potential to compete with conventional standard video codecs. In this context, Implicit Neural Representations (INRs) have previously been used to represent and compress image and video content, demonstrating relatively high decoding speed compared to other methods. However, existing INR-based methods have failed…

    Submitted 26 January, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

  4. arXiv:2207.11511  [pdf, other]

    cs.CV cs.AI cs.LG

    SSBNet: Improving Visual Recognition Efficiency by Adaptive Sampling

    Authors: Ho Man Kwan, Shenghui Song

    Abstract: Downsampling is widely adopted to achieve a good trade-off between accuracy and latency for visual recognition. Unfortunately, the commonly used pooling layers are not learned, and thus cannot preserve important information. As another dimension reduction method, adaptive sampling weights and processes regions that are relevant to the task, and is thus able to better preserve useful information. H…

    Submitted 23 July, 2022; originally announced July 2022.