Showing 1–9 of 9 results for author: Ulhaq, M

  1. arXiv:2406.13059  [pdf, other]

    eess.IV cs.CV

    Learned Compression of Encoding Distributions

    Authors: Mateen Ulhaq, Ivan V. Bajić

    Abstract: The entropy bottleneck introduced by Ballé et al. is a common component used in many learned compression models. It encodes a transformed latent representation using a static distribution whose parameters are learned during training. However, the actual distribution of the latent data may vary wildly across different inputs. The static distribution attempts to encompass all possible input distribu…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 7 pages, 5 figures, IEEE ICIP 2024

  2. arXiv:2402.12532  [pdf, other]

    cs.CV eess.IV

    Scalable Human-Machine Point Cloud Compression

    Authors: Mateen Ulhaq, Ivan V. Bajić

    Abstract: Due to the limited computational capabilities of edge devices, deep learning inference can be quite expensive. One remedy is to compress and transmit point cloud data over the network for server-side processing. Unfortunately, this approach can be sensitive to network factors, including available bitrate. Luckily, the bitrate requirements can be reduced without sacrificing inference accuracy by us…

    Submitted 23 February, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: 5 pages, 4 figures, 2024 Picture Coding Symposium (PCS)

  3. arXiv:2308.05959  [pdf, other]

    eess.IV cs.CV cs.LG

    Learned Point Cloud Compression for Classification

    Authors: Mateen Ulhaq, Ivan V. Bajić

    Abstract: Deep learning is increasingly being used to perform machine vision tasks such as classification, object detection, and segmentation on 3D point cloud data. However, deep learning inference is computationally expensive. The limited computational capabilities of end devices thus necessitate a codec for transmitting point cloud data over the network for server-side processing. Such a codec must be li…

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: 6 pages, 4 figures, IEEE MMSP 2023

  4. arXiv:2306.13982  [pdf, other]

    cs.LG cs.CV eess.IV

    Mobile-Cloud Inference for Collaborative Intelligence

    Authors: Mateen Ulhaq

    Abstract: As AI applications for mobile devices become more prevalent, there is an increasing need for faster execution and lower energy consumption for deep learning model inference. Historically, the models run on mobile devices have been smaller and simpler in comparison to large state-of-the-art research models, which can only run on the cloud. However, cloud-only inference has drawbacks such as increas…

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: 56 pages, 20 figures, Bachelor's Thesis, defended in 2020

  5. arXiv:2301.04183  [pdf, other]

    eess.IV

    Learned Disentangled Latent Representations for Scalable Image Coding for Humans and Machines

    Authors: Ezgi Ozyilkan, Mateen Ulhaq, Hyomin Choi, Fabien Racape

    Abstract: As an increasing amount of image and video content will be analyzed by machines, there is demand for a new codec paradigm that is capable of compressing visual input primarily for the purpose of computer vision inference, while secondarily supporting input reconstruction. In this work, we propose a learned compression architecture that can be used to build such a codec. We introduce a novel variat…

    Submitted 10 January, 2023; originally announced January 2023.

    Comments: accepted as a paper for DCC 2023

  6. arXiv:2301.01290  [pdf, other]

    eess.IV

    Frequency-aware Learned Image Compression for Quality Scalability

    Authors: Hyomin Choi, Fabien Racape, Shahab Hamidi-Rad, Mateen Ulhaq, Simon Feltman

    Abstract: Spatial frequency analysis and transforms serve a central role in most engineered image and video lossy codecs, but are rarely employed in neural network (NN)-based approaches. We propose a novel NN-based image coding framework that utilizes forward wavelet transforms to decompose the input signal by spatial frequency. Our encoder generates separate bitstreams for each latent representation of low…

    Submitted 3 January, 2023; originally announced January 2023.

    Comments: Presented at VCIP'22

  7. arXiv:2205.01874  [pdf, other]

    eess.IV cs.CV

    Joint Image Compression and Denoising via Latent-Space Scalability

    Authors: Saeed Ranjbar Alvar, Mateen Ulhaq, Hyomin Choi, Ivan V. Bajić

    Abstract: When it comes to image compression in digital cameras, denoising is traditionally performed prior to compression. However, there are applications where image noise may be necessary to demonstrate the trustworthiness of the image, such as court evidence and image forensics. This means that noise itself needs to be coded, in addition to the clean image itself. In this paper, we present a learning-ba…

    Submitted 4 September, 2022; v1 submitted 3 May, 2022; originally announced May 2022.

  8. arXiv:2102.04018  [pdf, other]

    cs.CV eess.IV

    Analysis of Latent-Space Motion for Collaborative Intelligence

    Authors: Mateen Ulhaq, Ivan V. Bajić

    Abstract: When the input to a deep neural network (DNN) is a video signal, a sequence of feature tensors is produced at the intermediate layers of the model. If neighboring frames of the input video are related through motion, a natural question is, "what is the relationship between the corresponding feature tensors?" By analyzing the effect of common DNN operations on optical flow, we show that the motion…

    Submitted 8 February, 2021; originally announced February 2021.

    Comments: 6 pages, 6 figures, extended version of an IEEE ICASSP 2021 paper

  9. arXiv:2002.00157  [pdf, other]

    cs.AI eess.IV

    Shared Mobile-Cloud Inference for Collaborative Intelligence

    Authors: Mateen Ulhaq, Ivan V. Bajić

    Abstract: As AI applications for mobile devices become more prevalent, there is an increasing need for faster execution and lower energy consumption for neural model inference. Historically, the models run on mobile devices have been smaller and simpler in comparison to large state-of-the-art research models, which can only run on the cloud. However, cloud-only inference has drawbacks such as increased netw…

    Submitted 1 February, 2020; originally announced February 2020.

    Comments: 5 pages, 3 figures