Showing 1–9 of 9 results for author: Mandt, S

Searching in archive eess.
  1. arXiv:2304.06244  [pdf, other]

    eess.IV cs.CV cs.LG

    Computationally-Efficient Neural Image Compression with Shallow Decoders

    Authors: Yibo Yang, Stephan Mandt

    Abstract: Neural image compression methods have seen increasingly strong performance in recent years. However, they suffer orders of magnitude higher computational complexity compared to traditional codecs, which hinders their real-world deployment. This paper takes a step forward towards closing this gap in decoding complexity by using a shallow or even linear decoding transform resembling that of JPEG. To…

    Submitted 10 November, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

    Comments: Updated version of the ICCV 2023 paper. Previously titled "Asymmetrically-powered Neural Image Compression with Shallow Decoders" on arXiv

  2. arXiv:2209.06950  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Lossy Image Compression with Conditional Diffusion Models

    Authors: Ruihan Yang, Stephan Mandt

    Abstract: This paper outlines an end-to-end optimized lossy image compression framework using diffusion generative models. The approach relies on the transform coding paradigm, where an image is mapped into a latent space for entropy coding and, from there, mapped back to the data space for reconstruction. In contrast to VAE-based neural compression, where the (mean) decoder is a deterministic neural networ…

    Submitted 2 January, 2024; v1 submitted 14 September, 2022; originally announced September 2022.

  3. arXiv:2203.08875  [pdf, other]

    cs.LG cs.CV eess.IV

    SC2 Benchmark: Supervised Compression for Split Computing

    Authors: Yoshitomo Matsubara, Ruihan Yang, Marco Levorato, Stephan Mandt

    Abstract: With the increasing demand for deep learning models on mobile devices, splitting neural network computation between the device and a more powerful edge server has become an attractive solution. However, existing split computing approaches often underperform compared to a naive baseline of remote computation on compressed data. Recent studies propose learning compressed representations that contain…

    Submitted 14 June, 2023; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted at TMLR. Code and models are available at https://github.com/yoshitomo-matsubara/sc2-benchmark

  4. arXiv:2202.06533  [pdf, other]

    cs.LG cs.IT eess.IV

    An Introduction to Neural Data Compression

    Authors: Yibo Yang, Stephan Mandt, Lucas Theis

    Abstract: Neural compression is the application of neural networks and other machine learning methods to data compression. Recent advances in statistical machine learning have opened up new possibilities for data compression, allowing compression algorithms to be learned end-to-end from data using powerful generative models such as normalizing flows, variational autoencoders, diffusion probabilistic models,…

    Submitted 16 August, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Published in Foundations and Trends in Computer Graphics and Vision: Vol. 15, No. 2, pp 113-200. https://www.nowpublishers.com/article/Details/CGV-107

  5. arXiv:2107.13136  [pdf, other]

    eess.IV cs.CV cs.LG

    Insights from Generative Modeling for Neural Video Compression

    Authors: Ruihan Yang, Yibo Yang, Joseph Marino, Stephan Mandt

    Abstract: While recent machine learning research has revealed connections between deep generative models such as VAEs and rate-distortion losses used in learned compression, most of this work has focused on images. In a similar spirit, we view recently proposed neural video coding algorithms through the lens of deep autoregressive and latent variable modeling. We present these codecs as instances of a gener…

    Submitted 9 July, 2023; v1 submitted 27 July, 2021; originally announced July 2021.

    Comments: This work has been submitted to the IEEE for publication as an extension work of arXiv:2010.10258. Copyright may be transferred without notice, after which this version may no longer be accessible. arXiv admin note: text overlap with arXiv:2010.10258

  6. arXiv:2010.10258  [pdf, other]

    eess.IV cs.LG

    Hierarchical Autoregressive Modeling for Neural Video Compression

    Authors: Ruihan Yang, Yibo Yang, Joseph Marino, Stephan Mandt

    Abstract: Recent work by Marino et al. (2020) showed improved performance in sequential density estimation by combining masked autoregressive flows with hierarchical latent variable models. We draw a connection between such autoregressive generative models and the task of lossy video compression. Specifically, we view recent neural video compression methods (Lu et al., 2019; Yang et al., 2020b; Agustsson et…

    Submitted 19 December, 2023; v1 submitted 18 October, 2020; originally announced October 2020.

    Comments: Published as a conference paper at ICLR 2021

  7. arXiv:2006.04240  [pdf, other]

    eess.IV cs.LG stat.ML

    Improving Inference for Neural Image Compression

    Authors: Yibo Yang, Robert Bamler, Stephan Mandt

    Abstract: We consider the problem of lossy image compression with deep latent variable models. State-of-the-art methods build on hierarchical variational autoencoders (VAEs) and learn inference networks to predict a compressible latent representation of each data point. Drawing on the variational inference perspective on compression, we identify three approximation gaps which limit performance in the conven…

    Submitted 8 January, 2021; v1 submitted 7 June, 2020; originally announced June 2020.

    Comments: 9 pages + detailed supplement with additional results; various typos corrected. Camera-ready version paper at NeurIPS 2020

  8. arXiv:2002.08158  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Variational Bayesian Quantization

    Authors: Yibo Yang, Robert Bamler, Stephan Mandt

    Abstract: We propose a novel algorithm for quantizing continuous latent representations in trained models. Our approach applies to deep probabilistic models, such as variational autoencoders (VAEs), and enables both data and model compression. Unlike current end-to-end neural compression methods that cater the model to a fixed quantization scheme, our algorithm separates model design and training from quant…

    Submitted 7 September, 2020; v1 submitted 17 February, 2020; originally announced February 2020.

    Comments: 9 pages + detailed supplement with additional full resolution reconstructed images; ICML 2020 final camera-ready version, title changed to "Variational Bayesian Quantization" following reviewer feedback

  9. arXiv:1810.02845  [pdf, other]

    cs.CV cs.LG eess.IV stat.ML

    Deep Generative Video Compression

    Authors: Jun Han, Salvator Lombardo, Christopher Schroers, Stephan Mandt

    Abstract: The usage of deep generative models for image compression has led to impressive performance gains over classical codecs while neural video compression is still in its infancy. Here, we propose an end-to-end, deep generative modeling approach to compress temporal sequences with a focus on video. Our approach builds upon variational autoencoder (VAE) models for sequential data and combines them with…

    Submitted 1 November, 2019; v1 submitted 5 October, 2018; originally announced October 2018.

    Comments: Accepted at NeurIPS 2019, 15 pages, 8 figures