Skip to main content

Showing 1–4 of 4 results for author: Rehman, Y A U

Searching in archive eess. Search in all archives.
.
  1. arXiv:2404.13551  [pdf, other

    cs.SD eess.AS

    AudioRepInceptionNeXt: A lightweight single-stream architecture for efficient audio recognition

    Authors: Kin Wai Lau, Yasar Abbas Ur Rehman, Lai-Man Po

    Abstract: Recent research has successfully adapted vision-based convolutional neural network (CNN) architectures for audio recognition tasks using Mel-Spectrograms. However, these CNNs have high computational costs and memory requirements, limiting their deployment on low-end edge devices. Motivated by the success of efficient vision models like InceptionNeXt and ConvNeXt, we propose AudioRepInceptionNeXt,… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  2. arXiv:2402.02889  [pdf, other

    cs.SD cs.CV cs.LG eess.AS

    Exploring Federated Self-Supervised Learning for General Purpose Audio Understanding

    Authors: Yasar Abbas Ur Rehman, Kin Wai Lau, Yuyang Xie, Lan Ma, Jiajun Shen

    Abstract: The integration of Federated Learning (FL) and Self-supervised Learning (SSL) offers a unique and synergetic combination to exploit the audio data for general-purpose audio understanding, without compromising user data privacy. However, rare efforts have been made to investigate the SSL models in the FL regime for general-purpose audio understanding, especially when the training data is generated… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  3. arXiv:2307.07265  [pdf, other

    cs.SD cs.AI eess.AS

    AudioInceptionNeXt: TCL AI LAB Submission to EPIC-SOUND Audio-Based-Interaction-Recognition Challenge 2023

    Authors: Kin Wai Lau, Yasar Abbas Ur Rehman, Yuyang Xie, Lan Ma

    Abstract: This report presents the technical details of our submission to the 2023 Epic-Kitchen EPIC-SOUNDS Audio-Based Interaction Recognition Challenge. The task is to learn the map** from audio samples to their corresponding action labels. To achieve this goal, we propose a simple yet effective single-stream CNN-based architecture called AudioInceptionNeXt that operates on the time-frequency log-mel-sp… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

  4. arXiv:2008.07742  [pdf, other

    eess.IV cs.CV

    UDC 2020 Challenge on Image Restoration of Under-Display Camera: Methods and Results

    Authors: Yuqian Zhou, Michael Kwan, Kyle Tolentino, Neil Emerton, Sehoon Lim, Tim Large, Lijiang Fu, Zhihong Pan, Baopu Li, Qirui Yang, Yihao Liu, Jigang Tang, Tao Ku, Shibin Ma, Bingnan Hu, Jiarong Wang, Densen Puthussery, Hrishikesh P S, Melvin Kuriakose, Jiji C V, Varun Sundar, Sumanth Hegde, Divya Kothandaraman, Kaushik Mitra, Akashdeep Jassal , et al. (20 additional authors not shown)

    Abstract: This paper is the report of the first Under-Display Camera (UDC) image restoration challenge in conjunction with the RLQ workshop at ECCV 2020. The challenge is based on a newly-collected database of Under-Display Camera. The challenge tracks correspond to two types of display: a 4k Transparent OLED (T-OLED) and a phone Pentile OLED (P-OLED). Along with about 150 teams registered the challenge, ei… ▽ More

    Submitted 18 August, 2020; originally announced August 2020.

    Comments: 15 pages