Showing 1–23 of 23 results for author: Theis, L

  1. arXiv:2403.04493  [pdf, ps, other]

    cs.LG stat.ML

    What makes an image realistic?

    Authors: Lucas Theis

    Abstract: The last decade has seen tremendous progress in our ability to generate realistic-looking data, be it images, text, audio, or video. Here, we discuss the closely related problem of quantifying realism, that is, designing functions that can reliably tell realistic data from unrealistic data. This problem turns out to be significantly harder to solve and remains poorly understood, despite its preval…

    Submitted 21 May, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, 2024

  2. arXiv:2312.02753  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    C3: High-performance and low-complexity neural compression from a single image or video

    Authors: Hyunjik Kim, Matthias Bauer, Lucas Theis, Jonathan Richard Schwarz, Emilien Dupont

    Abstract: Most neural compression models are trained on large datasets of images or videos in order to generalize to unseen data. Such generalization typically requires large and expressive architectures with a high decoding complexity. Here we introduce C3, a neural compression method with strong rate-distortion (RD) performance that instead overfits a small model to each image or video separately. The res…

    Submitted 5 December, 2023; originally announced December 2023.

  3. arXiv:2305.18231  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    High-Fidelity Image Compression with Score-based Generative Models

    Authors: Emiel Hoogeboom, Eirikur Agustsson, Fabian Mentzer, Luca Versari, George Toderici, Lucas Theis

    Abstract: Despite the tremendous success of diffusion generative models in text-to-image generation, replicating this success in the domain of image compression has proven difficult. In this paper, we demonstrate that diffusion can significantly improve perceptual quality at a given bit-rate, outperforming state-of-the-art approaches PO-ELIC and HiFiC as measured by FID score. This is achieved using a simpl…

    Submitted 7 March, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

  4. arXiv:2206.08889  [pdf, other]

    stat.ML cs.IT cs.LG

    Lossy Compression with Gaussian Diffusion

    Authors: Lucas Theis, Tim Salimans, Matthew D. Hoffman, Fabian Mentzer

    Abstract: We consider a novel lossy compression approach based on unconditional diffusion generative models, which we call DiffC. Unlike modern compression schemes which rely on transform coding and quantization to restrict the transmitted information, DiffC relies on the efficient communication of pixels corrupted by Gaussian noise. We implement a proof of concept and find that it works surprisingly well d…

    Submitted 31 December, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

  5. arXiv:2110.12805  [pdf, other]

    cs.IT stat.ML

    Algorithms for the Communication of Samples

    Authors: Lucas Theis, Noureldin Yosri

    Abstract: The efficient communication of noisy data has applications in several areas of machine learning, such as neural compression or differential privacy, and is also known as reverse channel coding or the channel simulation problem. Here we propose two new coding schemes with practical advantages over existing approaches. First, we introduce ordered random coding (ORC) which uses a simple trick to redu…

    Submitted 25 May, 2022; v1 submitted 25 October, 2021; originally announced October 2021.

    Comments: Proceedings of the 39th International Conference on Machine Learning, 2022

  6. arXiv:2104.13662  [pdf, ps, other]

    cs.IT stat.ML

    A coding theorem for the rate-distortion-perception function

    Authors: Lucas Theis, Aaron B. Wagner

    Abstract: The rate-distortion-perception function (RDPF; Blau and Michaeli, 2019) has emerged as a useful tool for thinking about realism and distortion of reconstructions in lossy compression. Unlike the rate-distortion function, however, it is unknown whether encoders and decoders exist that achieve the rate suggested by the RDPF. Building on results by Li and El Gamal (2018), we show that the RDPF can in…

    Submitted 28 April, 2021; originally announced April 2021.

    Journal ref: ICLR 2021 Neural Compression Workshop

  7. arXiv:2102.09270  [pdf, ps, other]

    cs.IT stat.ML

    On the advantages of stochastic encoders

    Authors: Lucas Theis, Eirikur Agustsson

    Abstract: Stochastic encoders have been used in rate-distortion theory and neural compression because they can be easier to handle. However, in performance comparisons with deterministic encoders they often do worse, suggesting that noise in the encoding process may generally be a bad idea. It is poorly understood if and when stochastic encoders do better than deterministic encoders. In this paper we provid…

    Submitted 29 April, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

    Journal ref: ICLR 2021 Neural Compression Workshop

  8. arXiv:2006.09952  [pdf, other]

    stat.ML cs.CV cs.IT cs.LG

    Universally Quantized Neural Compression

    Authors: Eirikur Agustsson, Lucas Theis

    Abstract: A popular approach to learning encoders for lossy compression is to use additive uniform noise during training as a differentiable approximation to test-time quantization. We demonstrate that a uniform noise channel can also be implemented at test time using universal quantization (Ziv, 1985). This allows us to eliminate the mismatch between training and test phases while maintaining a completely…

    Submitted 21 October, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Authors contributed equally

  9. arXiv:1909.01436  [pdf, other]

    stat.ML cs.IR cs.LG

    Discriminative Topic Modeling with Logistic LDA

    Authors: Iryna Korshunova, Hanchen Xiong, Mateusz Fedoryszak, Lucas Theis

    Abstract: Despite many years of research into latent Dirichlet allocation (LDA), applying LDA to collections of non-categorical items is still challenging. Yet many problems with much richer data share a similar structure and could benefit from the vast literature on LDA. We propose logistic LDA, a novel discriminative variant of latent Dirichlet allocation which is easy to apply to arbitrary inputs. In par…

    Submitted 7 January, 2020; v1 submitted 3 September, 2019; originally announced September 2019.

    Journal ref: Advances in Neural Information Processing Systems 32, 2019

  10. arXiv:1907.06558  [pdf, other]

    stat.ML cs.LG

    Addressing Delayed Feedback for Continuous Training with Neural Networks in CTR prediction

    Authors: Sofia Ira Ktena, Alykhan Tejani, Lucas Theis, Pranay Kumar Myana, Deepak Dilipkumar, Ferenc Huszar, Steven Yoo, Wenzhe Shi

    Abstract: One of the challenges in display advertising is that the distribution of features and click through rate (CTR) can exhibit large shifts over time due to seasonality, changes to ad campaigns and other factors. The predominant strategy to keep up with these shifts is to train predictive models continuously, on fresh data, in order to prevent them from becoming stale. However, in many ad systems posi…

    Submitted 23 April, 2021; v1 submitted 15 July, 2019; originally announced July 2019.

    Comments: Accepted at RecSys '19

  11. arXiv:1807.02175  [pdf]

    stat.AP eess.IV

    Adaptive Paired-Comparison Method for Subjective Video Quality Assessment on Mobile Devices

    Authors: Katherine Storrs, Sebastiaan Van Leuven, Steve Kojder, Lucas Theis, Ferenc Huszár

    Abstract: To effectively evaluate subjective visual quality in weakly-controlled environments, we propose an Adaptive Paired Comparison method based on particle filtering. As our approach requires each sample to be rated only once, the test time compared to regular paired comparison can be reduced. The method works with non-experts and improves reliability compared to MOS and DS-MOS methods.

    Submitted 5 July, 2018; originally announced July 2018.

    Journal ref: Picture Coding Symposium, 2018

  12. arXiv:1801.05787  [pdf, other]

    cs.CV stat.ML

    Faster gaze prediction with dense networks and Fisher pruning

    Authors: Lucas Theis, Iryna Korshunova, Alykhan Tejani, Ferenc Huszár

    Abstract: Predicting human fixations from images has recently seen large improvements by leveraging deep representations which were pretrained for object recognition. However, as we show in this paper, these networks are highly overparameterized for the task of fixation prediction. We first present a simple yet principled greedy pruning method which we call Fisher pruning. Through a combination of knowledge…

    Submitted 9 July, 2018; v1 submitted 17 January, 2018; originally announced January 2018.

  13. arXiv:1703.00395  [pdf, other]

    stat.ML cs.CV

    Lossy Image Compression with Compressive Autoencoders

    Authors: Lucas Theis, Wenzhe Shi, Andrew Cunningham, Ferenc Huszár

    Abstract: We propose a new approach to the problem of optimizing autoencoders for lossy image compression. New media formats, changing hardware technology, as well as diverse requirements and content types create a need for compression algorithms which are more flexible than existing codecs. Autoencoders have the potential to address this need, but are difficult to optimize directly due to the inherent non-…

    Submitted 1 March, 2017; originally announced March 2017.

  14. arXiv:1610.04490  [pdf, other]

    cs.CV cs.LG stat.ML

    Amortised MAP Inference for Image Super-resolution

    Authors: Casper Kaae Sønderby, Jose Caballero, Lucas Theis, Wenzhe Shi, Ferenc Huszár

    Abstract: Image super-resolution (SR) is an underdetermined inverse problem, where a large number of plausible high-resolution images can explain the same downsampled image. Most current single image SR methods use empirical risk minimisation, often with a pixel-wise mean squared error (MSE) loss. However, the outputs from such methods tend to be blurry, over-smoothed and generally appear implausible. A mor…

    Submitted 21 February, 2017; v1 submitted 14 October, 2016; originally announced October 2016.

  15. arXiv:1609.04802  [pdf, other]

    cs.CV stat.ML

    Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network

    Authors: Christian Ledig, Lucas Theis, Ferenc Huszar, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew Aitken, Alykhan Tejani, Johannes Totz, Zehan Wang, Wenzhe Shi

    Abstract: Despite the breakthroughs in accuracy and speed of single image super-resolution using faster and deeper convolutional neural networks, one central problem remains largely unsolved: how do we recover the finer texture details when we super-resolve at large upscaling factors? The behavior of optimization-based super-resolution methods is principally driven by the choice of the objective function. R…

    Submitted 25 May, 2017; v1 submitted 15 September, 2016; originally announced September 2016.

    Comments: 19 pages, 15 figures, 2 tables, accepted for oral presentation at CVPR, main paper + some supplementary material

  16. arXiv:1511.01844  [pdf, other]

    stat.ML cs.LG

    A note on the evaluation of generative models

    Authors: Lucas Theis, Aäron van den Oord, Matthias Bethge

    Abstract: Probabilistic generative models can be used for compression, denoising, inpainting, texture synthesis, semi-supervised learning, unsupervised feature learning, and other tasks. Given this wide range of applications, it is not surprising that a lot of heterogeneity exists in the way these models are formulated, trained, and evaluated. As a consequence, direct comparison between models is often diff…

    Submitted 24 April, 2016; v1 submitted 5 November, 2015; originally announced November 2015.

  17. arXiv:1506.03478  [pdf, other]

    stat.ML cs.CV cs.LG

    Generative Image Modeling Using Spatial LSTMs

    Authors: Lucas Theis, Matthias Bethge

    Abstract: Modeling the distribution of natural images is challenging, partly because of strong statistical dependencies which can extend over hundreds of pixels. Recurrent neural networks have been successful in capturing long-range dependencies in a number of problems but only recently have found their way into generative image models. We here introduce a recurrent image model based on multi-dimensional lo…

    Submitted 18 September, 2015; v1 submitted 10 June, 2015; originally announced June 2015.

  18. arXiv:1505.07649  [pdf, other]

    stat.ML stat.AP

    A trust-region method for stochastic variational inference with applications to streaming data

    Authors: Lucas Theis, Matthew D. Hoffman

    Abstract: Stochastic variational inference allows for fast posterior inference in complex Bayesian models. However, the algorithm is prone to local optima which can make the quality of the posterior approximation sensitive to the choice of hyperparameters and initialization. We address this problem by replacing the natural gradient step of stochastic variational inference with a trust-region update. We show…

    Submitted 28 May, 2015; originally announced May 2015.

    Comments: in Proceedings of the 32nd International Conference on Machine Learning, 2015

  19. arXiv:1503.00135  [pdf]

    stat.ML stat.AP

    Supervised learning sets benchmark for robust spike detection from calcium imaging signals

    Authors: Lucas Theis, Philipp Berens, Emmanouil Froudarakis, Jacob Reimer, Miroslav Román Rosón, Tom Baden, Thomas Euler, Andreas Tolias, Matthias Bethge

    Abstract: A fundamental challenge in calcium imaging has been to infer the timing of action potentials from the measured noisy calcium fluorescence traces. We systematically evaluate a range of spike inference algorithms on a large benchmark dataset recorded from varying neural tissue (V1 and retina) using different calcium indicators (OGB-1 and GCamp6). We show that a new algorithm based on supervised lear…

    Submitted 28 February, 2015; originally announced March 2015.

  20. arXiv:1411.1045  [pdf, other]

    cs.CV q-bio.NC stat.AP

    Deep Gaze I: Boosting Saliency Prediction with Feature Maps Trained on ImageNet

    Authors: Matthias Kümmerer, Lucas Theis, Matthias Bethge

    Abstract: Recent results suggest that state-of-the-art saliency models perform far from optimal in predicting fixations. This lack in performance has been attributed to an inability to model the influence of high-level image features such as objects. Recent seminal advances in applying deep neural networks to tasks like object recognition suggest that they are able to capture this kind of structure. Howeve…

    Submitted 9 April, 2015; v1 submitted 4 November, 2014; originally announced November 2014.

  21. arXiv:1410.4812  [pdf, other]

    stat.CO math.OC stat.ML

    Inference and Mixture Modeling with the Elliptical Gamma Distribution

    Authors: Reshad Hosseini, Suvrit Sra, Lucas Theis, Matthias Bethge

    Abstract: We study modeling and inference with the Elliptical Gamma Distribution (EGD). We consider maximum likelihood (ML) estimation for EGD scatter matrices, a task for which we develop new fixed-point algorithms. Our algorithms are efficient and converge to global optima despite nonconvexity. Moreover, they turn out to be much faster than both a well-known iterative algorithm of Kent & Tyler (1991) and…

    Submitted 20 December, 2015; v1 submitted 17 October, 2014; originally announced October 2014.

    Comments: 23 pages, 11 figures

    Journal ref: Computational Statistics & Data Analysis 2016, Vol. 101, 29-43

  22. Mixtures of conditional Gaussian scale mixtures applied to multiscale image representations

    Authors: Lucas Theis, Reshad Hosseini, Matthias Bethge

    Abstract: We present a probabilistic model for natural images which is based on Gaussian scale mixtures and a simple multiscale representation. In contrast to the dominant approach to modeling whole images focusing on Markov random fields, we formulate our model in terms of a directed graphical model. We show that it is able to generate images with interesting higher-order correlations when trained on natur…

    Submitted 20 September, 2011; originally announced September 2011.

  23. arXiv:1011.6086  [pdf, other]

    stat.ML cs.LG

    In All Likelihood, Deep Belief Is Not Enough

    Authors: Lucas Theis, Sebastian Gerwinn, Fabian Sinz, Matthias Bethge

    Abstract: Statistical models of natural stimuli provide an important tool for researchers in the fields of machine learning and computational neuroscience. A canonical way to quantitatively assess and compare the performance of statistical models is given by the likelihood. One class of statistical models which has recently gained increasing popularity and has been applied to a variety of complex data are d…

    Submitted 28 November, 2010; originally announced November 2010.

    Journal ref: Journal of Machine Learning Research 12, 3071-3096, 2011