
Showing 1–19 of 19 results for author: Déforges, O

  1. arXiv:2402.03179

    eess.IV cs.LG

    Cool-chic video: Learned video coding with 800 parameters

    Authors: Thomas Leguay, Théo Ladune, Pierrick Philippe, Olivier Déforges

    Abstract: We propose a lightweight learned video codec with 900 multiplications per decoded pixel and 800 parameters overall. To the best of our knowledge, this is one of the neural video codecs with the lowest decoding complexity. It is built upon the overfitted image codec Cool-chic and supplements it with an inter coding module to leverage the video's temporal redundancies. The proposed model is able to…

    Submitted 6 February, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 10 pages, published in Data Compression Conference 2024

  2. arXiv:2307.09729

    cs.CV cs.MM eess.IV

    NTIRE 2023 Quality Assessment of Video Enhancement Challenge

    Authors: Xiaohong Liu, Xiongkuo Min, Wei Sun, Yulun Zhang, Kai Zhang, Radu Timofte, Guangtao Zhai, Yixuan Gao, Yuqin Cao, Tengchuan Kou, Yunlong Dong, Ziheng Jia, Yilin Li, Wei Wu, Shuming Hu, Sibin Deng, Pengxiang Xiao, Ying Chen, Kai Li, Kai Zhao, Kun Yuan, Ming Sun, Heng Cong, Hao Wang, Lingzhi Fu, et al. (47 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2023 Quality Assessment of Video Enhancement Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2023. This challenge addresses a major problem in the field of video processing, namely, video quality assessment (VQA) for enhanced videos. The challenge uses the VQA Dataset for Perceptual…

    Submitted 18 July, 2023; originally announced July 2023.

  3. arXiv:2206.02131

    cs.LG cs.CR cs.CV

    Federated Adversarial Training with Transformers

    Authors: Ahmed Aldahdooh, Wassim Hamidouche, Olivier Déforges

    Abstract: Federated learning (FL) has emerged to enable global model training over distributed clients' data while preserving its privacy. However, the globally trained model is vulnerable to evasion attacks, especially adversarial examples (AEs), samples carefully crafted to yield false classifications. Adversarial training (AT) is found to be the most promising approach against evasion attacks and it…

    Submitted 5 June, 2022; originally announced June 2022.

  4. arXiv:2202.00416

    eess.IV cs.CV eess.SP

    CAESR: Conditional Autoencoder and Super-Resolution for Learned Spatial Scalability

    Authors: Charles Bonnineau, Wassim Hamidouche, Jean-François Travers, Naty Sidaty, Jean-Yves Aubié, Olivier Deforges

    Abstract: In this paper, we present CAESR, a hybrid learning-based coding approach for spatial scalability based on the versatile video coding (VVC) standard. Our framework considers a low-resolution signal encoded with VVC intra-mode as a base-layer (BL), and a deep conditional autoencoder with hyperprior (AE-HP) as an enhancement-layer (EL) model. The EL encoder takes as inputs both the upscaled BL recon…

    Submitted 1 February, 2022; originally announced February 2022.

    Journal ref: 2021 International Conference on Visual Communications and Image Processing (VCIP)

  5. arXiv:2106.03734

    cs.CV

    Reveal of Vision Transformers Robustness against Adversarial Attacks

    Authors: Ahmed Aldahdooh, Wassim Hamidouche, Olivier Deforges

    Abstract: The major part of the vanilla vision transformer (ViT) is the attention block that brings the power of mimicking the global context of the input image. For better performance, ViT needs large-scale training data. To overcome this data hunger limitation, many ViT-based networks, or hybrid-ViT, have been proposed to include local context during the training. The robustness of ViTs and their variants a…

    Submitted 20 September, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

  6. arXiv:2105.11578

    cs.CV

    SHD360: A Benchmark Dataset for Salient Human Detection in 360° Videos

    Authors: Yi Zhang, Lu Zhang, Kang Wang, Wassim Hamidouche, Olivier Deforges

    Abstract: Salient human detection (SHD) in dynamic 360° immersive videos is of great importance for various applications such as robotics, inter-human and human-object interaction in augmented reality. However, 360° video SHD has been seldom discussed in the computer vision community due to a lack of datasets with large-scale omnidirectional videos and rich annotations. To this end, we propose SHD360, the f…

    Submitted 22 December, 2021; v1 submitted 24 May, 2021; originally announced May 2021.

    Comments: 21 pages, 13 figures, 5 tables; Project page: https://github.com/PanoAsh/SHD360; Technical report

  7. arXiv:2105.00949

    cs.CV

    CMA-Net: A Cascaded Mutual Attention Network for Light Field Salient Object Detection

    Authors: Yi Zhang, Lu Zhang, Wassim Hamidouche, Olivier Deforges

    Abstract: In the past few years, numerous deep learning methods have been proposed to address the task of segmenting salient objects from RGB images. However, these approaches, which depend on a single modality, fail to achieve state-of-the-art performance on widely used light field salient object detection (SOD) datasets, which collect large-scale natural images and provide multiple modalities such as multi-v…

    Submitted 7 December, 2021; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: 6 pages, 4 figures, 2 tables

  8. Adversarial Example Detection for DNN Models: A Review and Experimental Comparison

    Authors: Ahmed Aldahdooh, Wassim Hamidouche, Sid Ahmed Fezza, Olivier Deforges

    Abstract: Deep learning (DL) has shown great success in many human-related tasks, which has led to its adoption in many computer vision based applications, such as security surveillance systems, autonomous vehicles and healthcare. Such safety-critical applications have to draw their path to successful deployment once they have the capability to overcome safety-critical challenges. Among these challenges are th…

    Submitted 7 January, 2022; v1 submitted 1 May, 2021; originally announced May 2021.

    Comments: Accepted and published in Artificial Intelligence Review journal

  9. arXiv:2104.13916

    cs.CV

    Learning Synergistic Attention for Light Field Salient Object Detection

    Authors: Yi Zhang, Geng Chen, Qian Chen, Yujia Sun, Yong Xia, Olivier Deforges, Wassim Hamidouche, Lu Zhang

    Abstract: We propose a novel Synergistic Attention Network (SA-Net) to address light field salient object detection by establishing a synergistic effect between multi-modal features with advanced attention mechanisms. Our SA-Net exploits the rich information of focal stacks via 3D convolutional neural networks, decodes the high-level features of multi-modal light field data with two cascaded synergistic…

    Submitted 22 October, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

    Comments: 20 pages, 12 figures; Project Page https://github.com/PanoAsh/SA-Net ; Accepted to BMVC-21

  10. arXiv:2104.09103

    cs.NE eess.IV eess.SP

    Conditional Coding and Variable Bitrate for Practical Learned Video Coding

    Authors: Théo Ladune, Pierrick Philippe, Wassim Hamidouche, Lu Zhang, Olivier Déforges

    Abstract: This paper introduces a practical learned video codec. Conditional coding and quantization gain vectors are used to provide flexibility to a single encoder/decoder pair, which is able to compress video sequences at a variable bitrate. The flexibility is leveraged at test time by choosing the rate and GOP structure to optimize a rate-distortion cost. Using the CLIC21 video test conditions, the prop…

    Submitted 20 April, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

    Journal ref: CLIC workshop, CVPR 2021, Jun 2021, Nashville, United States

  11. arXiv:2104.08319

    cs.CV eess.SP

    Multitask Learning for VVC Quality Enhancement and Super-Resolution

    Authors: Charles Bonnineau, Wassim Hamidouche, Jean-Francois Travers, Naty Sidaty, Olivier Deforges

    Abstract: The latest video coding standard, called versatile video coding (VVC), includes several novel and refined coding tools at different levels of the coding chain. These tools bring significant coding gains with respect to the previous standard, high efficiency video coding (HEVC). However, the encoder may still introduce visible coding artifacts, mainly caused by coding decisions applied to adjust th…

    Submitted 3 May, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: accepted as a conference paper to Picture Coding Symposium (PCS) 2021

  12. arXiv:2103.05354

    cs.CR cs.CV cs.LG

    Revisiting Model's Uncertainty and Confidences for Adversarial Example Detection

    Authors: Ahmed Aldahdooh, Wassim Hamidouche, Olivier Déforges

    Abstract: Security-sensitive applications that rely on Deep Neural Networks (DNNs) are vulnerable to small perturbations that are crafted to generate Adversarial Examples (AEs). The AEs are imperceptible to humans and cause the DNN to misclassify them. Many defense and detection techniques have been proposed. Model's confidences and Dropout, as a popular way to estimate the model's uncertainty, have been used fo…

    Submitted 21 June, 2021; v1 submitted 9 March, 2021; originally announced March 2021.

    Comments: Under review

  13. arXiv:2103.04203

    cs.CR cs.MM eess.IV

    Selective Encryption of the Versatile Video Coding Standard

    Authors: Guillaume Gautier, Mousa FarajAllah, Wassim Hamidouche, Olivier Déforges, Safwan El Assad

    Abstract: Versatile video coding (VVC) is the next generation video coding standard developed by the joint video experts team (JVET) and released in July 2020. VVC introduces several new coding tools providing a significant coding gain over the high efficiency video coding (HEVC) standard. It is well known that increasing the coding efficiency adds more dependencies in the video bitstream making format-comp…

    Submitted 6 March, 2021; originally announced March 2021.

  14. arXiv:2008.02580

    eess.IV cs.CV cs.NE

    Optical Flow and Mode Selection for Learning-based Video Coding

    Authors: Théo Ladune, Pierrick Philippe, Wassim Hamidouche, Lu Zhang, Olivier Déforges

    Abstract: This paper introduces a new method for inter-frame coding based on two complementary autoencoders: MOFNet and CodecNet. MOFNet aims at computing and conveying the Optical Flow and a pixel-wise coding Mode selection. The optical flow is used to perform a prediction of the frame to code. The coding mode selection enables competition between direct copy of the prediction or transmission through Codec…

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: MMSP 2020, IEEE 22nd International Workshop on Multimedia Signal Processing, Sep 2020, Tampere, Finland

  15. arXiv:2007.02532

    cs.NE eess.SP

    ModeNet: Mode Selection Network For Learned Video Coding

    Authors: Théo Ladune, Pierrick Philippe, Wassim Hamidouche, Lu Zhang, Olivier Déforges

    Abstract: In this paper, a mode selection network (ModeNet) is proposed to enhance deep learning-based video compression. Inspired by traditional video coding, ModeNet's purpose is to enable competition among several coding modes. The proposed ModeNet learns and conveys a pixel-wise partitioning of the frame, used to assign each pixel to the most suited coding mode. ModeNet is trained alongside the different…

    Submitted 31 July, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

    Journal ref: Machine Learning for Signal Processing (MLSP) 2020, Sep 2020, Espoo, Finland

  16. arXiv:2002.09259

    eess.IV cs.LG cs.NE eess.SP

    Binary Probability Model for Learning Based Image Compression

    Authors: Théo Ladune, Pierrick Philippe, Wassim Hamidouche, Lu Zhang, Olivier Deforges

    Abstract: In this paper, we propose to enhance learned image compression systems with a richer probability model for the latent variables. Previous works model the latents with a Gaussian or a Laplace distribution. Inspired by binary arithmetic coding, we propose to signal the latents with three binary values and one integer, with different probability models. A relaxation method is designed to perform gra…

    Submitted 21 February, 2020; originally announced February 2020.

    Journal ref: International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2020, 2020

  17. arXiv:2001.07960

    cs.CV

    A Fixation-based 360° Benchmark Dataset for Salient Object Detection

    Authors: Yi Zhang, Lu Zhang, Wassim Hamidouche, Olivier Deforges

    Abstract: Fixation prediction (FP) in panoramic contents has been widely investigated along with the booming trend of virtual reality (VR) applications. However, another issue within the field of visual saliency, salient object detection (SOD), has been seldom explored in 360° (or omnidirectional) images due to the lack of datasets representative of real scenes with pixel-level annotations. Toward this end,…

    Submitted 19 May, 2020; v1 submitted 22 January, 2020; originally announced January 2020.

    Comments: 5 pages, 5 figures, accepted by ICIP2020

  18. arXiv:1911.07036

    eess.IV cs.CV cs.MM

    Quality Assessment of DIBR-synthesized views: An Overview

    Authors: Shishun Tian, Lu Zhang, Wenbin Zou, Xia Li, Ting Su, Luce Morin, Olivier Deforges

    Abstract: Depth-Image-Based-Rendering (DIBR) is one of the fundamental techniques used to generate new views in 3D video applications, such as Multi-View Videos (MVV), Free-Viewpoint Videos (FVV) and Virtual Reality (VR). However, the quality assessment of DIBR-synthesized views is quite different from the traditional 2D images/videos. In recent years, several efforts have been made towards this topic, b…

    Submitted 27 April, 2021; v1 submitted 16 November, 2019; originally announced November 2019.

  19. arXiv:1906.00204

    cs.LG cs.CR cs.CV eess.IV stat.ML

    Perceptual Evaluation of Adversarial Attacks for CNN-based Image Classification

    Authors: Sid Ahmed Fezza, Yassine Bakhti, Wassim Hamidouche, Olivier Déforges

    Abstract: Deep neural networks (DNNs) have recently achieved state-of-the-art performance and provide significant progress in many machine learning tasks, such as image classification, speech processing, natural language processing, etc. However, recent studies have shown that DNNs are vulnerable to adversarial attacks. For instance, in the image classification domain, adding small imperceptible perturbatio…

    Submitted 1 June, 2019; originally announced June 2019.

    Comments: Eleventh International Conference on Quality of Multimedia Experience (QoMEX 2019)