Showing 1–29 of 29 results for author: Deforges, O

  1. arXiv:2403.11651

    eess.IV

    Overfitted image coding at reduced complexity

    Authors: Théophile Blard, Théo Ladune, Pierrick Philippe, Gordon Clare, Xiaoran Jiang, Olivier Déforges

    Abstract: Overfitted image codecs offer compelling compression performance and low decoder complexity, through the overfitting of a lightweight decoder for each image. Such codecs include Cool-chic, which presents image coding performance on par with VVC while requiring around 2000 multiplications per decoded pixel. This paper proposes to decrease Cool-chic encoding and decoding complexity. The encoding com…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 5 pages, submitted to European Signal Processing Conference (EUSIPCO) 2024

  2. arXiv:2402.03179

    eess.IV cs.LG

    Cool-chic video: Learned video coding with 800 parameters

    Authors: Thomas Leguay, Théo Ladune, Pierrick Philippe, Olivier Déforges

    Abstract: We propose a lightweight learned video codec with 900 multiplications per decoded pixel and 800 parameters overall. To the best of our knowledge, this is one of the neural video codecs with the lowest decoding complexity. It is built upon the overfitted image codec Cool-chic and supplements it with an inter coding module to leverage the video's temporal redundancies. The proposed model is able to…

    Submitted 6 February, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 10 pages, published in Data Compression Conference 2024

  3. arXiv:2310.05623

    eess.IV

    Efficient Predictive Coding of Intra Prediction Modes

    Authors: Kevin Reuzé, Wassim Hamidouche, Pierrick Philippe, Olivier Déforges

    Abstract: The high efficiency video coding (HEVC) standard and the joint exploration model (JEM) codec incorporate 35 and 67 intra prediction modes (IPMs) respectively, which are essential for efficient compression of Intra coded blocks. These IPMs are transmitted to the decoder through a coding scheme. In our paper, we present an innovative approach to construct a dedicated coding scheme for IPM based on c…

    Submitted 9 October, 2023; originally announced October 2023.

  4. arXiv:2307.09729

    cs.CV cs.MM eess.IV

    NTIRE 2023 Quality Assessment of Video Enhancement Challenge

    Authors: Xiaohong Liu, Xiongkuo Min, Wei Sun, Yulun Zhang, Kai Zhang, Radu Timofte, Guangtao Zhai, Yixuan Gao, Yuqin Cao, Tengchuan Kou, Yunlong Dong, Ziheng Jia, Yilin Li, Wei Wu, Shuming Hu, Sibin Deng, Pengxiang Xiao, Ying Chen, Kai Li, Kai Zhao, Kun Yuan, Ming Sun, Heng Cong, Hao Wang, Lingzhi Fu, et al. (47 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2023 Quality Assessment of Video Enhancement Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2023. This challenge is to address a major challenge in the field of video processing, namely, video quality assessment (VQA) for enhanced videos. The challenge uses the VQA Dataset for Perceptual…

    Submitted 18 July, 2023; originally announced July 2023.

  5. arXiv:2206.02131

    cs.LG cs.CR cs.CV

    Federated Adversarial Training with Transformers

    Authors: Ahmed Aldahdooh, Wassim Hamidouche, Olivier Déforges

    Abstract: Federated learning (FL) has emerged to enable global model training over distributed clients' data while preserving its privacy. However, the globally trained model is vulnerable to evasion attacks, especially adversarial examples (AEs), carefully crafted samples that yield false classification. Adversarial training (AT) is found to be the most promising approach against evasion attacks and it…

    Submitted 5 June, 2022; originally announced June 2022.

  6. arXiv:2202.00416

    eess.IV cs.CV eess.SP

    CAESR: Conditional Autoencoder and Super-Resolution for Learned Spatial Scalability

    Authors: Charles Bonnineau, Wassim Hamidouche, Jean-François Travers, Naty Sidaty, Jean-Yves Aubié, Olivier Deforges

    Abstract: In this paper, we present CAESR, a hybrid learning-based coding approach for spatial scalability based on the versatile video coding (VVC) standard. Our framework considers a low-resolution signal encoded with VVC intra-mode as a base-layer (BL), and a deep conditional autoencoder with hyperprior (AE-HP) as an enhancement-layer (EL) model. The EL encoder takes as inputs both the upscaled BL recon…

    Submitted 1 February, 2022; originally announced February 2022.

    Journal ref: 2021 International Conference on Visual Communications and Image Processing (VCIP)

  7. arXiv:2109.06555

    eess.IV

    Perceptual Quality Assessment of HEVC and VVC Standards for 8K Video

    Authors: Charles Bonnineau, Wassim Hamidouche, Jerome Fournier, Naty Sidaty, Jean-Francois Travers, Olivier Deforges

    Abstract: With the growing data consumption of emerging video applications and user requirements for higher resolutions, up to 8K, a huge effort has been made in video compression technologies. Recently, versatile video coding (VVC) has been standardized by the moving picture expert group (MPEG), providing a significant improvement in compression performance over its predecessor high efficiency video coding…

    Submitted 20 December, 2021; v1 submitted 14 September, 2021; originally announced September 2021.

    Comments: Paper under review

  8. arXiv:2107.11659

    eess.IV

    Lightweight Hardware Transform Design for the Versatile Video Coding 4K ASIC Decoders

    Authors: Ibrahim Farhat, Wassim Hamidouche, Adrien Grill, Daniel Ménard, Olivier Déforges

    Abstract: Versatile Video Coding (VVC) is the next generation video coding standard finalized in July 2020. VVC introduces new coding tools enhancing the coding efficiency compared to its predecessor High Efficiency Video Coding (HEVC). These new tools have a significant impact on the VVC software decoder complexity, estimated at 2 times the HEVC decoder complexity. In particular, the transform module includes i…

    Submitted 6 November, 2021; v1 submitted 24 July, 2021; originally announced July 2021.

  9. arXiv:2106.03734

    cs.CV

    Reveal of Vision Transformers Robustness against Adversarial Attacks

    Authors: Ahmed Aldahdooh, Wassim Hamidouche, Olivier Deforges

    Abstract: The major part of the vanilla vision transformer (ViT) is the attention block that brings the power of mimicking the global context of the input image. For better performance, ViT needs large-scale training data. To overcome this data hunger limitation, many ViT-based networks, or hybrid-ViT, have been proposed to include local context during the training. The robustness of ViTs and its variants a…

    Submitted 20 September, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

  10. arXiv:2105.11578

    cs.CV

    SHD360: A Benchmark Dataset for Salient Human Detection in 360° Videos

    Authors: Yi Zhang, Lu Zhang, Kang Wang, Wassim Hamidouche, Olivier Deforges

    Abstract: Salient human detection (SHD) in dynamic 360° immersive videos is of great importance for various applications such as robotics, inter-human and human-object interaction in augmented reality. However, 360° video SHD has been seldom discussed in the computer vision community due to a lack of datasets with large-scale omnidirectional videos and rich annotations. To this end, we propose SHD360, the f…

    Submitted 22 December, 2021; v1 submitted 24 May, 2021; originally announced May 2021.

    Comments: 21 pages, 13 figures, 5 tables; Project page: https://github.com/PanoAsh/SHD360; Technical report

  11. arXiv:2105.00949

    cs.CV

    CMA-Net: A Cascaded Mutual Attention Network for Light Field Salient Object Detection

    Authors: Yi Zhang, Lu Zhang, Wassim Hamidouche, Olivier Deforges

    Abstract: In the past few years, numerous deep learning methods have been proposed to address the task of segmenting salient objects from RGB images. However, these approaches, which depend on a single modality, fail to achieve state-of-the-art performance on widely used light field salient object detection (SOD) datasets, which collect large-scale natural images and provide multiple modalities such as multi-v…

    Submitted 7 December, 2021; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: 6 pages, 4 figures, 2 tables

  12. Adversarial Example Detection for DNN Models: A Review and Experimental Comparison

    Authors: Ahmed Aldahdooh, Wassim Hamidouche, Sid Ahmed Fezza, Olivier Deforges

    Abstract: Deep learning (DL) has shown great success in many human-related tasks, which has led to its adoption in many computer vision based applications, such as security surveillance systems, autonomous vehicles and healthcare. Such safety-critical applications have to draw their path to successful deployment once they have the capability to overcome safety-critical challenges. Among these challenges are th…

    Submitted 7 January, 2022; v1 submitted 1 May, 2021; originally announced May 2021.

    Comments: Accepted and published in Artificial Intelligence Review journal

  13. arXiv:2104.13916

    cs.CV

    Learning Synergistic Attention for Light Field Salient Object Detection

    Authors: Yi Zhang, Geng Chen, Qian Chen, Yujia Sun, Yong Xia, Olivier Deforges, Wassim Hamidouche, Lu Zhang

    Abstract: We propose a novel Synergistic Attention Network (SA-Net) to address the light field salient object detection by establishing a synergistic effect between multi-modal features with advanced attention mechanisms. Our SA-Net exploits the rich information of focal stacks via 3D convolutional neural networks, decodes the high-level features of multi-modal light field data with two cascaded synergistic…

    Submitted 22 October, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

    Comments: 20 pages, 12 figures; Project Page https://github.com/PanoAsh/SA-Net ; Accepted to BMVC-21

  14. arXiv:2104.09103

    cs.NE eess.IV eess.SP

    Conditional Coding and Variable Bitrate for Practical Learned Video Coding

    Authors: Théo Ladune, Pierrick Philippe, Wassim Hamidouche, Lu Zhang, Olivier Déforges

    Abstract: This paper introduces a practical learned video codec. Conditional coding and quantization gain vectors are used to provide flexibility to a single encoder/decoder pair, which is able to compress video sequences at a variable bitrate. The flexibility is leveraged at test time by choosing the rate and GOP structure to optimize a rate-distortion cost. Using the CLIC21 video test conditions, the prop…

    Submitted 20 April, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

    Journal ref: CLIC workshop, CVPR 2021, Jun 2021, Nashville, United States

  15. arXiv:2104.08319

    cs.CV eess.SP

    Multitask Learning for VVC Quality Enhancement and Super-Resolution

    Authors: Charles Bonnineau, Wassim Hamidouche, Jean-Francois Travers, Naty Sidaty, Olivier Deforges

    Abstract: The latest video coding standard, called versatile video coding (VVC), includes several novel and refined coding tools at different levels of the coding chain. These tools bring significant coding gains with respect to the previous standard, high efficiency video coding (HEVC). However, the encoder may still introduce visible coding artifacts, mainly caused by coding decisions applied to adjust th…

    Submitted 3 May, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: accepted as a conference paper to Picture Coding Symposium (PCS) 2021

  16. arXiv:2104.07930

    eess.IV eess.SP

    Conditional Coding for Flexible Learned Video Compression

    Authors: Théo Ladune, Pierrick Philippe, Wassim Hamidouche, Lu Zhang, Olivier Déforges

    Abstract: This paper introduces a novel framework for end-to-end learned video coding. Image compression is generalized through conditional coding to exploit information from reference frames, allowing to process intra and inter frames with the same coder. The system is trained through the minimization of a rate-distortion cost, with no pre-training or proxy loss. Its flexibility is assessed under three cod…

    Submitted 28 April, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: Neural Compression Workshop @ ICLR 2021

    Report number: hal-03192548

  17. arXiv:2103.05354

    cs.CR cs.CV cs.LG

    Revisiting Model's Uncertainty and Confidences for Adversarial Example Detection

    Authors: Ahmed Aldahdooh, Wassim Hamidouche, Olivier Déforges

    Abstract: Security-sensitive applications that rely on Deep Neural Networks (DNNs) are vulnerable to small perturbations that are crafted to generate Adversarial Examples (AEs). The AEs are imperceptible to humans and cause DNNs to misclassify them. Many defense and detection techniques have been proposed. Model's confidences and Dropout, as a popular way to estimate the model's uncertainty, have been used fo…

    Submitted 21 June, 2021; v1 submitted 9 March, 2021; originally announced March 2021.

    Comments: Under review

  18. arXiv:2103.04203

    cs.CR cs.MM eess.IV

    Selective Encryption of the Versatile Video Coding Standard

    Authors: Guillaume Gautier, Mousa FarajAllah, Wassim Hamidouche, Olivier Déforges, Safwan El Assad

    Abstract: Versatile video coding (VVC) is the next generation video coding standard developed by the joint video experts team (JVET) and released in July 2020. VVC introduces several new coding tools providing a significant coding gain over the high efficiency video coding (HEVC) standard. It is well known that increasing the coding efficiency adds more dependencies in the video bitstream making format-comp…

    Submitted 6 March, 2021; originally announced March 2021.

  19. arXiv:2103.04201

    eess.IV

    Light Field Image Coding Using VVC standard and View Synthesis based on Dual Discriminator GAN

    Authors: Nader Bakir, Wassim Hamidouche, Sid Ahmed Fezza, Khouloud Samrouth, Olivier Deforges

    Abstract: Light field (LF) technology is considered a promising way to provide high-quality virtual reality (VR) content. However, such an imaging technology produces a large amount of data requiring efficient LF image compression solutions. In this paper, we propose an LF image coding method based on view synthesis and view quality enhancement techniques. Instead of transmitting all the LF views,…

    Submitted 6 March, 2021; originally announced March 2021.

  20. arXiv:2008.02580

    eess.IV cs.CV cs.NE

    Optical Flow and Mode Selection for Learning-based Video Coding

    Authors: Théo Ladune, Pierrick Philippe, Wassim Hamidouche, Lu Zhang, Olivier Déforges

    Abstract: This paper introduces a new method for inter-frame coding based on two complementary autoencoders: MOFNet and CodecNet. MOFNet aims at computing and conveying the Optical Flow and a pixel-wise coding Mode selection. The optical flow is used to perform a prediction of the frame to code. The coding mode selection enables competition between direct copy of the prediction or transmission through Codec…

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: MMSP 2020, IEEE 22nd International Workshop on Multimedia Signal Processing, Sep 2020, Tampere, Finland

  21. arXiv:2007.02532

    cs.NE eess.SP

    ModeNet: Mode Selection Network For Learned Video Coding

    Authors: Théo Ladune, Pierrick Philippe, Wassim Hamidouche, Lu Zhang, Olivier Déforges

    Abstract: In this paper, a mode selection network (ModeNet) is proposed to enhance deep learning-based video compression. Inspired by traditional video coding, ModeNet's purpose is to enable competition among several coding modes. The proposed ModeNet learns and conveys a pixel-wise partitioning of the frame, used to assign each pixel to the most suited coding mode. ModeNet is trained alongside the different…

    Submitted 31 July, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

    Journal ref: Machine Learning for Signal Processing (MLSP) 2020, Sep 2020, Espoo, Finland

  22. arXiv:2003.12322

    eess.IV

    Light Field Image Coding Using Dual Discriminator Generative Adversarial Network and VVC Temporal Scalability

    Authors: Nader Bakir, Wassim Hamidouche, Sid Fezza, Khouloud Samrouth, Olivier Déforges

    Abstract: Light field technology represents a viable path for providing high-quality VR content. However, such an imaging system generates a high amount of data leading to an urgent need for LF image compression solutions. In this paper, we propose an efficient LF image coding scheme based on view synthesis. Instead of transmitting all the LF views, only some of them are coded and transmitted, while the re…

    Submitted 27 March, 2020; originally announced March 2020.

    Comments: IEEE International Conference on Multimedia and Expo (ICME), Jul 2020, London, United Kingdom

  23. arXiv:2002.09259

    eess.IV cs.LG cs.NE eess.SP

    Binary Probability Model for Learning Based Image Compression

    Authors: Théo Ladune, Pierrick Philippe, Wassim Hamidouche, Lu Zhang, Olivier Deforges

    Abstract: In this paper, we propose to enhance learned image compression systems with a richer probability model for the latent variables. Previous works model the latents with a Gaussian or a Laplace distribution. Inspired by binary arithmetic coding, we propose to signal the latents with three binary values and one integer, with different probability models. A relaxation method is designed to perform gra…

    Submitted 21 February, 2020; originally announced February 2020.

    Journal ref: International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2020

  24. arXiv:2002.09196

    eess.IV eess.SP

    Extending 2D Saliency Models for Head Movement Prediction in 360-degree Images using CNN-based Fusion

    Authors: Ibrahim Djemai, Sid Fezza, Wassim Hamidouche, Olivier Deforges

    Abstract: Saliency prediction can be of great benefit for 360-degree image/video applications, including compression, streaming, rendering and viewpoint guidance. It is therefore quite natural to adapt the 2D saliency prediction methods for 360-degree images. To achieve this, it is necessary to project the 360-degree image to a 2D plane. However, the existing projection techniques introduce different distort…

    Submitted 21 February, 2020; originally announced February 2020.

    Journal ref: IEEE International Symposium on Circuits and Systems (ISCAS), May 2020, Seville, Spain

  25. arXiv:2002.07461

    eess.IV eess.SP

    Lightweight hardware implementation of VVC transform block for ASIC decoder

    Authors: I. Farhat, W. Hamidouche, A Grill, D. Ménard, O. Deforges

    Abstract: Versatile Video Coding (VVC) is the next generation video coding standard expected by the end of 2020. Compared to its predecessor, VVC introduces new coding tools to make compression more efficient at the expense of higher computational complexity. This raises a need to design an efficient and optimised implementation especially for embedded platforms with limited memory and logic resources. One o…

    Submitted 18 February, 2020; originally announced February 2020.

    Journal ref: International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain

  26. arXiv:2002.06922

    eess.IV eess.SP

    Versatile video coding and super-resolution for efficient delivery of 8K video with 4K backward-compatibility

    Authors: Charles Bonnineau, Wassim Hamidouche, Jean-Francois Travers, Olivier Deforges

    Abstract: In this paper, we propose, through an objective study, to compare and evaluate the performance of different coding approaches allowing the delivery of an 8K video signal with 4K backward-compatibility on broadcast networks. Presented approaches include simulcast of 8K and 4K single-layer signals encoded using High-Efficiency Video Coding (HEVC) and Versatile Video Coding (VVC) standards, spatial s…

    Submitted 17 February, 2020; originally announced February 2020.

    Journal ref: International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2020, Barcelona, Spain

  27. arXiv:2001.07960

    cs.CV

    A Fixation-based 360° Benchmark Dataset for Salient Object Detection

    Authors: Yi Zhang, Lu Zhang, Wassim Hamidouche, Olivier Deforges

    Abstract: Fixation prediction (FP) in panoramic contents has been widely investigated along with the booming trend of virtual reality (VR) applications. However, another issue within the field of visual saliency, salient object detection (SOD), has been seldom explored in 360° (or omnidirectional) images due to the lack of datasets representative of real scenes with pixel-level annotations. Toward this end,…

    Submitted 19 May, 2020; v1 submitted 22 January, 2020; originally announced January 2020.

    Comments: 5 pages, 5 figures, accepted by ICIP2020

  28. arXiv:1911.07036

    eess.IV cs.CV cs.MM

    Quality Assessment of DIBR-synthesized views: An Overview

    Authors: Shishun Tian, Lu Zhang, Wenbin Zou, Xia Li, Ting Su, Luce Morin, Olivier Deforges

    Abstract: Depth-Image-Based-Rendering (DIBR) is one of the fundamental techniques to generate new views in 3D video applications, such as Multi-View Videos (MVV), Free-Viewpoint Videos (FVV) and Virtual Reality (VR). However, the quality assessment of DIBR-synthesized views is quite different from the traditional 2D images/videos. In recent years, several efforts have been made towards this topic, b…

    Submitted 27 April, 2021; v1 submitted 16 November, 2019; originally announced November 2019.

  29. arXiv:1906.00204

    cs.LG cs.CR cs.CV eess.IV stat.ML

    Perceptual Evaluation of Adversarial Attacks for CNN-based Image Classification

    Authors: Sid Ahmed Fezza, Yassine Bakhti, Wassim Hamidouche, Olivier Déforges

    Abstract: Deep neural networks (DNNs) have recently achieved state-of-the-art performance and provide significant progress in many machine learning tasks, such as image classification, speech processing, natural language processing, etc. However, recent studies have shown that DNNs are vulnerable to adversarial attacks. For instance, in the image classification domain, adding small imperceptible perturbatio…

    Submitted 1 June, 2019; originally announced June 2019.

    Comments: Eleventh International Conference on Quality of Multimedia Experience (QoMEX 2019)