Skip to main content

Showing 1–9 of 9 results for author: Pakdaman, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.10936  [pdf

    eess.IV cs.CV cs.MM

    Channel-wise Feature Decorrelation for Enhanced Learned Image Compression

    Authors: Farhad Pakdaman, Moncef Gabbouj

    Abstract: The emerging Learned Compression (LC) replaces the traditional codec modules with Deep Neural Networks (DNN), which are trained end-to-end for rate-distortion performance. This approach is considered as the future of image/video compression, and major efforts have been dedicated to improving its compression efficiency. However, most proposed works target compression efficiency by employing more co… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  2. arXiv:2402.05582  [pdf

    eess.IV cs.CV cs.MM

    Joint End-to-End Image Compression and Denoising: Leveraging Contrastive Learning and Multi-Scale Self-ONNs

    Authors: Yuxin Xie, Li Yu, Farhad Pakdaman, Moncef Gabbouj

    Abstract: Noisy images are a challenge to image compression algorithms due to the inherent difficulty of compressing noise. As noise cannot easily be discerned from image details, such as high-frequency signals, its presence leads to extra bits needed for compression. Since the emerging learned image compression paradigm enables end-to-end optimization of codecs, recent efforts were made to integrate denois… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: Copyright 2024 IEEE - Submitted to IEEE ICIP 2024

  3. arXiv:2402.02936  [pdf, other

    eess.IV cs.CV cs.LG cs.MM

    Panoramic Image Inpainting With Gated Convolution And Contextual Reconstruction Loss

    Authors: Li Yu, Yanjun Gao, Farhad Pakdaman, Moncef Gabbouj

    Abstract: Deep learning-based methods have demonstrated encouraging results in tackling the task of panoramic image inpainting. However, it is challenging for existing methods to distinguish valid pixels from invalid pixels and find suitable references for corrupted areas, thus leading to artifacts in the inpainted results. In response to these challenges, we propose a panoramic image inpainting framework t… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Copyright 2024 IEEE - to appear in IEEE ICASSP 2024

    Journal ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024

  4. arXiv:2402.02922  [pdf, other

    cs.CV eess.IV

    Pixel-Wise Color Constancy via Smoothness Techniques in Multi-Illuminant Scenes

    Authors: Umut Cem Entok, Firas Laakom, Farhad Pakdaman, Moncef Gabbouj

    Abstract: Most scenes are illuminated by several light sources, where the traditional assumption of uniform illumination is invalid. This issue is ignored in most color constancy methods, primarily due to the complex spatial impact of multiple light sources on the image. Moreover, most existing multi-illuminant methods fail to preserve the smooth change of illumination, which stems from spatial dependencies… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Copyright 2024 IEEE - Submitted to IEEE ICIP 2024

  5. arXiv:2402.02836  [pdf

    eess.IV cs.CV cs.MM

    Perceptual Learned Image Compression via End-to-End JND-Based Optimization

    Authors: Farhad Pakdaman, Sanaz Nami, Moncef Gabbouj

    Abstract: Emerging Learned image Compression (LC) achieves significant improvements in coding efficiency by end-to-end training of neural networks for compression. An important benefit of this approach over traditional codecs is that any optimization criteria can be directly applied to the encoder-decoder networks during training. Perceptual optimization of LC to comply with the Human Visual System (HVS) is… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Copyright 2024 IEEE - Submitted to IEEE ICIP 2024

  6. Efficient Bitrate Ladder Construction using Transfer Learning and Spatio-Temporal Features

    Authors: Ali Falahati, Mohammad Karim Safavi, Ardavan Elahi, Farhad Pakdaman, Moncef Gabbouj

    Abstract: Providing high-quality video with efficient bitrate is a main challenge in video industry. The traditional one-size-fits-all scheme for bitrate ladders is inefficient and reaching the best content-aware decision computationally impractical due to extensive encodings required. To mitigate this, we propose a bitrate and complexity efficient bitrate ladder prediction method using transfer learning an… ▽ More

    Submitted 13 March, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: 7 pages, 9 figures, 7 tables, Copyright 2024 IEEE - Presented in IEEE MVIP 2024

    ACM Class: I.4.2

    Journal ref: Proc. 2024 13th Iranian/3rd Int. Conf. Mach. Vis. Image Process. (MVIP) (2024) 1-7

  7. Comprehensive Complexity Assessment of Emerging Learned Image Compression on CPU and GPU

    Authors: Farhad Pakdaman, Moncef Gabbouj

    Abstract: Learned Compression (LC) is the emerging technology for compressing image and video content, using deep neural networks. Despite being new, LC methods have already gained a compression efficiency comparable to state-of-the-art image compression, such as HEVC or even VVC. However, the existing solutions often require a huge computational complexity, which discourages their adoption in international… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

    Journal ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023

  8. arXiv:2201.07823  [pdf

    cs.MM cs.CV eess.IV

    BLINC: Lightweight Bimodal Learning for Low-Complexity VVC Intra Coding

    Authors: Farhad Pakdaman, Mohammad Ali Adelimanesh, Mahmoud Reza Hashemi

    Abstract: The latest video coding standard, Versatile Video Coding (VVC), achieves almost twice coding efficiency compared to its predecessor, the High Efficiency Video Coding (HEVC). However, achieving this efficiency (for intra coding) requires 31x computational complexity compared to HEVC, making it challenging for low power and real-time applications. This paper, proposes a novel machine learning approa… ▽ More

    Submitted 19 January, 2022; originally announced January 2022.

    Journal ref: Journal of Real-Time Image Processing (2022)

  9. Complexity Analysis Of Next-Generation VVC Encoding and Decoding

    Authors: Farhad Pakdaman, Mohammad Ali Adelimanesh, Moncef Gabbouj, Mahmoud Reza Hashemi

    Abstract: While the next generation video compression standard, Versatile Video Coding (VVC), provides a superior compression efficiency, its computational complexity dramatically increases. This paper thoroughly analyzes this complexity for both encoder and decoder of VVC Test Model 6, by quantifying the complexity break-down for each coding tool and measuring the complexity and memory requirements for VVC… ▽ More

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: IEEE ICIP 2020

    Journal ref: Proceedings of International Conference on Image Processing (ICIP), (2020) 3134-3138