Showing 1–10 of 10 results for author: Sautière, G

  1. arXiv:2403.17879  [pdf, other]

    cs.CV eess.IV

    Low-Latency Neural Stereo Streaming

    Authors: Qiqi Hou, Farzad Farhadzadeh, Amir Said, Guillaume Sautiere, Hoang Le

    Abstract: The rise of new video modalities like virtual reality or autonomous driving has increased the demand for efficient multi-view video compression methods, both in terms of rate-distortion (R-D) performance and in terms of delay and runtime. While most recent stereo video compression approaches have shown promising performance, they compress left and right views sequentially, leading to poor parallel…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR2024

  2. arXiv:2312.08128  [pdf, other]

    cs.CV

    Clockwork Diffusion: Efficient Generation With Model-Step Distillation

    Authors: Amirhossein Habibian, Amir Ghodrati, Noor Fathima, Guillaume Sautiere, Risheek Garrepalli, Fatih Porikli, Jens Petersen

    Abstract: This work aims to improve the efficiency of text-to-image diffusion models. While diffusion models use computationally expensive UNet-based denoising operations in every generation step, we identify that not all operations are equally relevant for the final output quality. In particular, we observe that UNet layers operating on high-res feature maps are relatively sensitive to small perturbations.…

    Submitted 20 February, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  3. arXiv:2310.01258  [pdf, other]

    eess.IV cs.CV cs.LG

    MobileNVC: Real-time 1080p Neural Video Compression on a Mobile Device

    Authors: Ties van Rozendaal, Tushar Singhal, Hoang Le, Guillaume Sautiere, Amir Said, Krishna Buska, Anjuman Raha, Dimitris Kalatzis, Hitarth Mehta, Frank Mayer, Liang Zhang, Markus Nagel, Auke Wiggers

    Abstract: Neural video codecs have recently become competitive with standard codecs such as HEVC in the low-delay setting. However, most neural codecs are large floating-point networks that use pixel-dense warping operations for temporal modeling, making them too computationally expensive for deployment on mobile devices. Recent work has demonstrated that running a neural decoder in real time on mobile is f…

    Submitted 15 November, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Matches version published at WACV 2024

  4. arXiv:2301.05489  [pdf, other]

    cs.CV eess.IV

    A Residual Diffusion Model for High Perceptual Quality Codec Augmentation

    Authors: Noor Fathima Ghouse, Jens Petersen, Auke Wiggers, Tianlin Xu, Guillaume Sautière

    Abstract: Diffusion probabilistic models have recently achieved remarkable success in generating high-quality image and video data. In this work, we build on this class of generative models and introduce a method for lossy compression of high-resolution images. The resulting codec, which we call DIffusion-based Residual Augmentation Codec (DIRAC), is the first neural codec to allow smooth traversal of the ra…

    Submitted 29 March, 2023; v1 submitted 13 January, 2023; originally announced January 2023.

    Comments: v1: 26 pages, 13 figures v2: corrected typo in first author name in arxiv metadata v3: major paper update to add base codecs and lpips loss

  5. arXiv:2208.04303  [pdf, other]

    eess.IV cs.CV cs.LG cs.MM

    Boosting neural video codecs by exploiting hierarchical redundancy

    Authors: Reza Pourreza, Hoang Le, Amir Said, Guillaume Sautiere, Auke Wiggers

    Abstract: In video compression, coding efficiency is improved by reusing pixels from previously decoded frames via motion and residual compensation. We define two levels of hierarchical redundancy in video frames: 1) first-order: redundancy in pixel space, i.e., similarities in pixel values across neighboring frames, which is effectively captured using motion and residual compensation, 2) second-order: redu…

    Submitted 16 September, 2022; v1 submitted 8 August, 2022; originally announced August 2022.

    Comments: WACV 2023

  6. arXiv:2207.08338  [pdf, other]

    cs.CV cs.MM eess.IV

    MobileCodec: Neural Inter-frame Video Compression on Mobile Devices

    Authors: Hoang Le, Liang Zhang, Amir Said, Guillaume Sautiere, Yang Yang, Pranav Shrestha, Fei Yin, Reza Pourreza, Auke Wiggers

    Abstract: Realizing the potential of neural video codecs on mobile devices is a big technological challenge due to the computational complexity of deep networks and the power-constrained mobile hardware. We demonstrate practical feasibility by leveraging Qualcomm's technology and innovation, bridging the gap from neural network-based codec simulations running on wall-powered workstations, to real-time opera…

    Submitted 17 July, 2022; originally announced July 2022.

    Comments: ACM MMSys 2022

  7. arXiv:2203.01978  [pdf, other]

    eess.IV cs.CV cs.LG

    Region-of-Interest Based Neural Video Compression

    Authors: Yura Perugachi-Diaz, Guillaume Sautière, Davide Abati, Yang Yang, Amirhossein Habibian, Taco S Cohen

    Abstract: Humans do not perceive all parts of a scene with the same resolution, but rather focus on a few regions of interest (ROIs). Traditional object-based codecs take advantage of this biological intuition, and are capable of non-uniform allocation of bits in favor of salient regions, at the expense of increased distortion in the remaining areas: such a strategy allows a boost in perceptual quality under low…

    Submitted 2 November, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: Updated arxiv version to the camera-ready version after acceptance at British Machine Vision Conference (BMVC) 2022

  8. arXiv:2005.04064  [pdf, other]

    cs.LG stat.ML

    Lossy Compression with Distortion Constrained Optimization

    Authors: Ties van Rozendaal, Guillaume Sautière, Taco S. Cohen

    Abstract: When training end-to-end learned models for lossy compression, one has to balance the rate and distortion losses. This is typically done by manually setting a tradeoff parameter $β$, an approach called $β$-VAE. Using this approach it is difficult to target a specific rate or distortion value, because the result can be very sensitive to $β$, and the appropriate value for $β$ depends on the model an…

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: Accepted as a CVPR 2020 workshop paper: Workshop and Challenge on Learned Image Compression (CLIC)

  9. arXiv:2004.04342  [pdf, other]

    cs.LG cs.CV stat.ML

    Feedback Recurrent Autoencoder for Video Compression

    Authors: Adam Golinski, Reza Pourreza, Yang Yang, Guillaume Sautiere, Taco S Cohen

    Abstract: Recent advances in deep generative modeling have enabled efficient modeling of high-dimensional data distributions and opened up a new horizon for solving data compression problems. Specifically, autoencoder-based learned image or video compression solutions are emerging as strong competitors to traditional approaches. In this work, we propose a new network architecture, based on common and well s…

    Submitted 8 April, 2020; originally announced April 2020.

  10. arXiv:1911.04018  [pdf, other]

    cs.LG cs.SD eess.AS stat.ML

    Feedback Recurrent AutoEncoder

    Authors: Yang Yang, Guillaume Sautière, J. Jon Ryu, Taco S Cohen

    Abstract: In this work, we propose a new recurrent autoencoder architecture, termed Feedback Recurrent AutoEncoder (FRAE), for online compression of sequential data with temporal dependency. The recurrent structure of FRAE is designed to efficiently extract the redundancy along the time dimension and allows a compact discrete representation of the data to be learned. We demonstrate its effectiveness in spee…

    Submitted 17 February, 2020; v1 submitted 10 November, 2019; originally announced November 2019.

    Journal ref: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)