-
Towards image compression with perfect realism at ultra-low bitrates
Authors:
Marlène Careil,
Matthew J. Muckley,
Jakob Verbeek,
Stéphane Lathuilière
Abstract:
Image codecs are typically optimized to trade-off bitrate \vs distortion metrics. At low bitrates, this leads to compression artefacts which are easily perceptible, even when training with perceptual or adversarial losses. To improve image quality and remove dependency on the bitrate, we propose to decode with iterative diffusion models. We condition the decoding process on a vector-quantized imag…
▽ More
Image codecs are typically optimized to trade-off bitrate \vs distortion metrics. At low bitrates, this leads to compression artefacts which are easily perceptible, even when training with perceptual or adversarial losses. To improve image quality and remove dependency on the bitrate, we propose to decode with iterative diffusion models. We condition the decoding process on a vector-quantized image representation, as well as a global image description to provide additional context. We dub our model PerCo for 'perceptual compression', and compare it to state-of-the-art codecs at rates from 0.1 down to 0.003 bits per pixel. The latter rate is more than an order of magnitude smaller than those considered in most prior work, compressing a 512x768 Kodak image with less than 153 bytes. Despite this ultra-low bitrate, our approach maintains the ability to reconstruct realistic images. We find that our model leads to reconstructions with state-of-the-art visual quality as measured by FID and KID. As predicted by rate-distortion-perception theory, visual quality is less dependent on the bitrate than previous methods.
△ Less
Submitted 19 March, 2024; v1 submitted 16 October, 2023;
originally announced October 2023.
-
Improving Statistical Fidelity for Neural Image Compression with Implicit Local Likelihood Models
Authors:
Matthew J. Muckley,
Alaaeldin El-Nouby,
Karen Ullrich,
Hervé Jégou,
Jakob Verbeek
Abstract:
Lossy image compression aims to represent images in as few bits as possible while maintaining fidelity to the original. Theoretical results indicate that optimizing distortion metrics such as PSNR or MS-SSIM necessarily leads to a discrepancy in the statistics of original images from those of reconstructions, in particular at low bitrates, often manifested by the blurring of the compressed images.…
▽ More
Lossy image compression aims to represent images in as few bits as possible while maintaining fidelity to the original. Theoretical results indicate that optimizing distortion metrics such as PSNR or MS-SSIM necessarily leads to a discrepancy in the statistics of original images from those of reconstructions, in particular at low bitrates, often manifested by the blurring of the compressed images. Previous work has leveraged adversarial discriminators to improve statistical fidelity. Yet these binary discriminators adopted from generative modeling tasks may not be ideal for image compression. In this paper, we introduce a non-binary discriminator that is conditioned on quantized local image representations obtained via VQ-VAE autoencoders. Our evaluations on the CLIC2020, DIV2K and Kodak datasets show that our discriminator is more effective for jointly optimizing distortion (e.g., PSNR) and statistical fidelity (e.g., FID) than the PatchGAN of the state-of-the-art HiFiC model. On CLIC2020, we obtain the same FID as HiFiC with 30-40\% fewer bits.
△ Less
Submitted 10 August, 2023; v1 submitted 26 January, 2023;
originally announced January 2023.
-
Image Compression with Product Quantized Masked Image Modeling
Authors:
Alaaeldin El-Nouby,
Matthew J. Muckley,
Karen Ullrich,
Ivan Laptev,
Jakob Verbeek,
Hervé Jégou
Abstract:
Recent neural compression methods have been based on the popular hyperprior framework. It relies on Scalar Quantization and offers a very strong compression performance. This contrasts from recent advances in image generation and representation learning, where Vector Quantization is more commonly employed. In this work, we attempt to bring these lines of research closer by revisiting vector quanti…
▽ More
Recent neural compression methods have been based on the popular hyperprior framework. It relies on Scalar Quantization and offers a very strong compression performance. This contrasts from recent advances in image generation and representation learning, where Vector Quantization is more commonly employed. In this work, we attempt to bring these lines of research closer by revisiting vector quantization for image compression. We build upon the VQ-VAE framework and introduce several modifications. First, we replace the vanilla vector quantizer by a product quantizer. This intermediate solution between vector and scalar quantization allows for a much wider set of rate-distortion points: It implicitly defines high-quality quantizers that would otherwise require intractably large codebooks. Second, inspired by the success of Masked Image Modeling (MIM) in the context of self-supervised learning and generative image models, we propose a novel conditional entropy model which improves entropy coding by modelling the co-dependencies of the quantized latent codes. The resulting PQ-MIM model is surprisingly effective: its compression performance on par with recent hyperprior methods. It also outperforms HiFiC in terms of FID and KID metrics when optimized with perceptual losses (e.g. adversarial). Finally, since PQ-MIM is compatible with image generation frameworks, we show qualitatively that it can operate under a hybrid mode between compression and generation, with no further training or finetuning. As a result, we explore the extreme compression regime where an image is compressed into 200 bytes, i.e., less than a tweet.
△ Less
Submitted 6 November, 2023; v1 submitted 14 December, 2022;
originally announced December 2022.
-
On learning adaptive acquisition policies for undersampled multi-coil MRI reconstruction
Authors:
Tim Bakker,
Matthew Muckley,
Adriana Romero-Soriano,
Michal Drozdzal,
Luis Pineda
Abstract:
Most current approaches to undersampled multi-coil MRI reconstruction focus on learning the reconstruction model for a fixed, equidistant acquisition trajectory. In this paper, we study the problem of joint learning of the reconstruction model together with acquisition policies. To this end, we extend the End-to-End Variational Network with learnable acquisition policies that can adapt to differen…
▽ More
Most current approaches to undersampled multi-coil MRI reconstruction focus on learning the reconstruction model for a fixed, equidistant acquisition trajectory. In this paper, we study the problem of joint learning of the reconstruction model together with acquisition policies. To this end, we extend the End-to-End Variational Network with learnable acquisition policies that can adapt to different data points. We validate our model on a coil-compressed version of the large scale undersampled multi-coil fastMRI dataset using two undersampling factors: $4\times$ and $8\times$. Our experiments show on-par performance with the learnable non-adaptive and handcrafted equidistant strategies at $4\times$, and an observed improvement of more than $2\%$ in SSIM at $8\times$ acceleration, suggesting that potentially-adaptive $k$-space acquisition trajectories can improve reconstructed image quality for larger acceleration factors. However, and perhaps surprisingly, our best performing policies learn to be explicitly non-adaptive.
△ Less
Submitted 30 March, 2022;
originally announced March 2022.
-
Results of the 2020 fastMRI Challenge for Machine Learning MR Image Reconstruction
Authors:
Matthew J. Muckley,
Bruno Riemenschneider,
Alireza Radmanesh,
Sunwoo Kim,
Geunu Jeong,
**gyu Ko,
Yohan Jun,
Hyungseob Shin,
Dosik Hwang,
Mahmoud Mostapha,
Simon Arberet,
Dominik Nickel,
Zaccharie Ramzi,
Philippe Ciuciu,
Jean-Luc Starck,
Jonas Teuwen,
Dimitrios Karkalousos,
Chao** Zhang,
Anuroop Sriram,
Zhengnan Huang,
Nafissa Yakubova,
Yvonne Lui,
Florian Knoll
Abstract:
Accelerating MRI scans is one of the principal outstanding problems in the MRI research community. Towards this goal, we hosted the second fastMRI competition targeted towards reconstructing MR images with subsampled k-space data. We provided participants with data from 7,299 clinical brain scans (de-identified via a HIPAA-compliant procedure by NYU Langone Health), holding back the fully-sampled…
▽ More
Accelerating MRI scans is one of the principal outstanding problems in the MRI research community. Towards this goal, we hosted the second fastMRI competition targeted towards reconstructing MR images with subsampled k-space data. We provided participants with data from 7,299 clinical brain scans (de-identified via a HIPAA-compliant procedure by NYU Langone Health), holding back the fully-sampled data from 894 of these scans for challenge evaluation purposes. In contrast to the 2019 challenge, we focused our radiologist evaluations on pathological assessment in brain images. We also debuted a new Transfer track that required participants to submit models evaluated on MRI scanners from outside the training set. We received 19 submissions from eight different groups. Results showed one team scoring best in both SSIM scores and qualitative radiologist evaluations. We also performed analysis on alternative metrics to mitigate the effects of background noise and collected feedback from the participants to inform future challenges. Lastly, we identify common failure modes across the submissions, highlighting areas of need for future research in the MRI reconstruction community.
△ Less
Submitted 3 May, 2021; v1 submitted 9 December, 2020;
originally announced December 2020.
-
Advancing machine learning for MR image reconstruction with an open competition: Overview of the 2019 fastMRI challenge
Authors:
Florian Knoll,
Tullie Murrell,
Anuroop Sriram,
Nafissa Yakubova,
Jure Zbontar,
Michael Rabbat,
Aaron Defazio,
Matthew J. Muckley,
Daniel K. Sodickson,
C. Lawrence Zitnick,
Michael P. Recht
Abstract:
Purpose: To advance research in the field of machine learning for MR image reconstruction with an open challenge. Methods: We provided participants with a dataset of raw k-space data from 1,594 consecutive clinical exams of the knee. The goal of the challenge was to reconstruct images from these data. In order to strike a balance between realistic data and a shallow learning curve for those not al…
▽ More
Purpose: To advance research in the field of machine learning for MR image reconstruction with an open challenge. Methods: We provided participants with a dataset of raw k-space data from 1,594 consecutive clinical exams of the knee. The goal of the challenge was to reconstruct images from these data. In order to strike a balance between realistic data and a shallow learning curve for those not already familiar with MR image reconstruction, we ran multiple tracks for multi-coil and single-coil data. We performed a two-stage evaluation based on quantitative image metrics followed by evaluation by a panel of radiologists. The challenge ran from June to December of 2019. Results: We received a total of 33 challenge submissions. All participants chose to submit results from supervised machine learning approaches. Conclusion: The challenge led to new developments in machine learning for image reconstruction, provided insight into the current state of the art in the field, and highlighted remaining hurdles for clinical adoption.
△ Less
Submitted 6 January, 2020;
originally announced January 2020.
-
Training a Neural Network for Gibbs and Noise Removal in Diffusion MRI
Authors:
Matthew J. Muckley,
Benjamin Ades-Aron,
Antonios Papaioannou,
Gregory Lemberskiy,
Eddy Solomon,
Yvonne W. Lui,
Daniel K. Sodickson,
Els Fieremans,
Dmitry S. Novikov,
Florian Knoll
Abstract:
We develop and evaluate a neural network-based method for Gibbs artifact and noise removal. A convolutional neural network (CNN) was designed for artifact removal in diffusion-weighted imaging data. Two implementations were considered: one for magnitude images and one for complex images. Both models were based on the same encoder-decoder structure and were trained by simulating MRI acquisitions on…
▽ More
We develop and evaluate a neural network-based method for Gibbs artifact and noise removal. A convolutional neural network (CNN) was designed for artifact removal in diffusion-weighted imaging data. Two implementations were considered: one for magnitude images and one for complex images. Both models were based on the same encoder-decoder structure and were trained by simulating MRI acquisitions on synthetic non-MRI images. Both machine learning methods were able to mitigate artifacts in diffusion-weighted images and diffusion parameter maps. The CNN for complex images was also able to reduce artifacts in partial Fourier acquisitions. The proposed CNNs extend the ability of artifact correction in diffusion MRI. The machine learning method described here can be applied on each imaging slice independently, allowing it to be used flexibly in clinical applications.
△ Less
Submitted 15 May, 2019; v1 submitted 10 May, 2019;
originally announced May 2019.
-
fastMRI: An Open Dataset and Benchmarks for Accelerated MRI
Authors:
Jure Zbontar,
Florian Knoll,
Anuroop Sriram,
Tullie Murrell,
Zhengnan Huang,
Matthew J. Muckley,
Aaron Defazio,
Ruben Stern,
Patricia Johnson,
Mary Bruno,
Marc Parente,
Krzysztof J. Geras,
Joe Katsnelson,
Hersh Chandarana,
Zizhao Zhang,
Michal Drozdzal,
Adriana Romero,
Michael Rabbat,
Pascal Vincent,
Nafissa Yakubova,
James Pinkerton,
Duo Wang,
Erich Owens,
C. Lawrence Zitnick,
Michael P. Recht
, et al. (2 additional authors not shown)
Abstract:
Accelerating Magnetic Resonance Imaging (MRI) by taking fewer measurements has the potential to reduce medical costs, minimize stress to patients and make MRI possible in applications where it is currently prohibitively slow or expensive. We introduce the fastMRI dataset, a large-scale collection of both raw MR measurements and clinical MR images, that can be used for training and evaluation of ma…
▽ More
Accelerating Magnetic Resonance Imaging (MRI) by taking fewer measurements has the potential to reduce medical costs, minimize stress to patients and make MRI possible in applications where it is currently prohibitively slow or expensive. We introduce the fastMRI dataset, a large-scale collection of both raw MR measurements and clinical MR images, that can be used for training and evaluation of machine-learning approaches to MR image reconstruction. By introducing standardized evaluation criteria and a freely-accessible dataset, our goal is to help the community make rapid advances in the state of the art for MR image reconstruction. We also provide a self-contained introduction to MRI for machine learning researchers with no medical imaging background.
△ Less
Submitted 11 December, 2019; v1 submitted 21 November, 2018;
originally announced November 2018.