Skip to main content

Showing 1–3 of 3 results for author: van Rozendaal, T

Searching in archive eess. Search in all archives.
.
  1. arXiv:2310.01258  [pdf, other

    eess.IV cs.CV cs.LG

    MobileNVC: Real-time 1080p Neural Video Compression on a Mobile Device

    Authors: Ties van Rozendaal, Tushar Singhal, Hoang Le, Guillaume Sautiere, Amir Said, Krishna Buska, Anjuman Raha, Dimitris Kalatzis, Hitarth Mehta, Frank Mayer, Liang Zhang, Markus Nagel, Auke Wiggers

    Abstract: Neural video codecs have recently become competitive with standard codecs such as HEVC in the low-delay setting. However, most neural codecs are large floating-point networks that use pixel-dense war** operations for temporal modeling, making them too computationally expensive for deployment on mobile devices. Recent work has demonstrated that running a neural decoder in real time on mobile is f… ▽ More

    Submitted 15 November, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Matches version published at WACV 2024

  2. arXiv:2111.10302  [pdf, other

    eess.IV cs.CV cs.LG

    Instance-Adaptive Video Compression: Improving Neural Codecs by Training on the Test Set

    Authors: Ties van Rozendaal, Johann Brehmer, Yunfan Zhang, Reza Pourreza, Auke Wiggers, Taco S. Cohen

    Abstract: We introduce a video compression algorithm based on instance-adaptive learning. On each video sequence to be transmitted, we finetune a pretrained compression model. The optimal parameters are transmitted to the receiver along with the latent code. By entropy-coding the parameter updates under a suitable mixture model prior, we ensure that the network parameters can be encoded efficiently. This in… ▽ More

    Submitted 23 June, 2023; v1 submitted 19 November, 2021; originally announced November 2021.

    Comments: Matches version published in TMLR

  3. arXiv:1908.05717  [pdf, other

    eess.IV cs.LG stat.ML

    Video Compression With Rate-Distortion Autoencoders

    Authors: Amirhossein Habibian, Ties van Rozendaal, Jakub M. Tomczak, Taco S. Cohen

    Abstract: In this paper we present a a deep generative model for lossy video compression. We employ a model that consists of a 3D autoencoder with a discrete latent space and an autoregressive prior used for entropy coding. Both autoencoder and prior are trained jointly to minimize a rate-distortion loss, which is closely related to the ELBO used in variational autoencoders. Despite its simplicity, we find… ▽ More

    Submitted 13 November, 2019; v1 submitted 14 August, 2019; originally announced August 2019.

    Comments: Accepted to ICCV 2019