Skip to main content

Showing 1–16 of 16 results for author: Frolov, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.17670  [pdf, other

    eess.IV cs.AI cs.CV cs.ET cs.LG

    Federated Learning for Blind Image Super-Resolution

    Authors: Brian B. Moser, Ahmed Anwar, Federico Raue, Stanislav Frolov, Andreas Dengel

    Abstract: Traditional blind image SR methods need to model real-world degradations precisely. Consequently, current research struggles with this dilemma by assuming idealized degradations, which leads to limited applicability to actual user data. Moreover, the ideal scenario - training models on data from the targeted user base - presents significant privacy concerns. To address both challenges, we propose… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  2. arXiv:2404.07564  [pdf, other

    cs.CV

    ObjBlur: A Curriculum Learning Approach With Progressive Object-Level Blurring for Improved Layout-to-Image Generation

    Authors: Stanislav Frolov, Brian B. Moser, Sebastian Palacio, Andreas Dengel

    Abstract: We present ObjBlur, a novel curriculum learning approach to improve layout-to-image generation models, where the task is to produce realistic images from layouts composed of boxes and labels. Our method is based on progressive object-level blurring, which effectively stabilizes training and enhances the quality of generated images. This curriculum learning strategy systematically applies varying d… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  3. arXiv:2403.03881  [pdf, other

    cs.CV cs.AI cs.LG

    Latent Dataset Distillation with Diffusion Models

    Authors: Brian B. Moser, Federico Raue, Sebastian Palacio, Stanislav Frolov, Andreas Dengel

    Abstract: The efficacy of machine learning has traditionally relied on the availability of increasingly larger datasets. However, large datasets pose storage challenges and contain non-influential samples, which could be ignored during training without impacting the final accuracy of the model. In response to these limitations, the concept of distilling the information on a dataset into a condensed set of (… ▽ More

    Submitted 24 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  4. arXiv:2401.00736  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    Diffusion Models, Image Super-Resolution And Everything: A Survey

    Authors: Brian B. Moser, Arundhati S. Shanbhag, Federico Raue, Stanislav Frolov, Sebastian Palacio, Andreas Dengel

    Abstract: Diffusion Models (DMs) have disrupted the image Super-Resolution (SR) field and further closed the gap between image quality and human perceptual preferences. They are easy to train and can produce very high-quality samples that exceed the realism of those produced by previous generative methods. Despite their promising results, they also come with new challenges that need further research: high c… ▽ More

    Submitted 23 June, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

  5. arXiv:2312.04724  [pdf, other

    cs.CR cs.LG

    Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models

    Authors: Manish Bhatt, Sahana Chennabasappa, Cyrus Nikolaidis, Shengye Wan, Ivan Evtimov, Dominik Gabi, Daniel Song, Faizan Ahmad, Cornelius Aschermann, Lorenzo Fontana, Sasha Frolov, Ravi Prakash Giri, Dhaval Kapil, Yiannis Kozyrakis, David LeBlanc, James Milazzo, Aleksandar Straumann, Gabriel Synnaeve, Varun Vontimitta, Spencer Whitman, Joshua Saxe

    Abstract: This paper presents CyberSecEval, a comprehensive benchmark developed to help bolster the cybersecurity of Large Language Models (LLMs) employed as coding assistants. As what we believe to be the most extensive unified cybersecurity safety benchmark to date, CyberSecEval provides a thorough evaluation of LLMs in two crucial security domains: their propensity to generate insecure code and their lev… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  6. arXiv:2308.07977  [pdf, other

    cs.CV cs.AI cs.LG

    Dynamic Attention-Guided Diffusion for Image Super-Resolution

    Authors: Brian B. Moser, Stanislav Frolov, Federico Raue, Sebastian Palacio, Andreas Dengel

    Abstract: Diffusion models in image Super-Resolution (SR) treat all image regions with uniform intensity, which risks compromising the overall image quality. To address this, we introduce "You Only Diffuse Areas" (YODA), a dynamic attention-guided diffusion method for image SR. YODA selectively focuses on spatial regions using attention maps derived from the low-resolution image and the current time step in… ▽ More

    Submitted 7 March, 2024; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: Brian B. Moser and Stanislav Frolov contributed equally

  7. arXiv:2307.04593  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    DWA: Differential Wavelet Amplifier for Image Super-Resolution

    Authors: Brian B. Moser, Stanislav Frolov, Federico Raue, Sebastian Palacio, Andreas Dengel

    Abstract: This work introduces Differential Wavelet Amplifier (DWA), a drop-in module for wavelet-based image Super-Resolution (SR). DWA invigorates an approach recently receiving less attention, namely Discrete Wavelet Transformation (DWT). DWT enables an efficient image representation for SR and reduces the spatial area of its input by a factor of 4, the overall model size, and computation cost, framing i… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

  8. arXiv:2304.04518  [pdf, other

    cs.CV cs.AI cs.LG

    Are Visual Recognition Models Robust to Image Compression?

    Authors: João Maria Janeiro, Stanislav Frolov, Alaaeldin El-Nouby, Jakob Verbeek

    Abstract: Reducing the data footprint of visual content via image compression is essential to reduce storage requirements, but also to reduce the bandwidth and latency requirements for transmission. In particular, the use of compressed images allows for faster transfer of data, and faster response times for visual recognition in edge devices that rely on cloud-based services. In this paper, we first analyze… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

  9. arXiv:2304.01994  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Waving Goodbye to Low-Res: A Diffusion-Wavelet Approach for Image Super-Resolution

    Authors: Brian Moser, Stanislav Frolov, Federico Raue, Sebastian Palacio, Andreas Dengel

    Abstract: This paper presents a novel Diffusion-Wavelet (DiWa) approach for Single-Image Super-Resolution (SISR). It leverages the strengths of Denoising Diffusion Probabilistic Models (DDPMs) and Discrete Wavelet Transformation (DWT). By enabling DDPMs to operate in the DWT domain, our DDPM models effectively hallucinate high-frequency information for super-resolved images on the wavelet spectrum, resultin… ▽ More

    Submitted 5 April, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

  10. arXiv:2209.13131  [pdf, other

    cs.CV cs.LG eess.IV

    Hitchhiker's Guide to Super-Resolution: Introduction and Recent Advances

    Authors: Brian Moser, Federico Raue, Stanislav Frolov, Jörn Hees, Sebastian Palacio, Andreas Dengel

    Abstract: With the advent of Deep Learning (DL), Super-Resolution (SR) has also become a thriving research area. However, despite promising results, the field still faces challenges that require further research e.g., allowing flexible upsampling, more effective loss functions, and better evaluation metrics. We review the domain of SR in light of recent advances, and examine state-of-the-art models such as… ▽ More

    Submitted 14 February, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023

  11. arXiv:2204.02035  [pdf, other

    cs.CV

    DT2I: Dense Text-to-Image Generation from Region Descriptions

    Authors: Stanislav Frolov, Prateek Bansal, Jörn Hees, Andreas Dengel

    Abstract: Despite astonishing progress, generating realistic images of complex scenes remains a challenging problem. Recently, layout-to-image synthesis approaches have attracted much interest by conditioning the generator on a list of bounding boxes and corresponding class labels. However, previous approaches are very restrictive because the set of labels is fixed a priori. Meanwhile, text-to-image synthes… ▽ More

    Submitted 5 April, 2022; originally announced April 2022.

  12. arXiv:2105.10189  [pdf, other

    cs.CV

    Combining Transformer Generators with Convolutional Discriminators

    Authors: Ricard Durall, Stanislav Frolov, Jörn Hees, Federico Raue, Franz-Josef Pfreundt, Andreas Dengel, Janis Keupe

    Abstract: Transformer models have recently attracted much interest from computer vision researchers and have since been successfully employed for several problems traditionally addressed with convolutional neural networks. At the same time, image synthesis using generative adversarial networks (GANs) has drastically improved over the last few years. The recently proposed TransGAN is the first GAN using only… ▽ More

    Submitted 10 July, 2021; v1 submitted 21 May, 2021; originally announced May 2021.

  13. arXiv:2103.13722  [pdf, other

    cs.CV

    AttrLostGAN: Attribute Controlled Image Synthesis from Reconfigurable Layout and Style

    Authors: Stanislav Frolov, Avneesh Sharma, Jörn Hees, Tushar Karayil, Federico Raue, Andreas Dengel

    Abstract: Conditional image synthesis from layout has recently attracted much interest. Previous approaches condition the generator on object locations as well as class labels but lack fine-grained control over the diverse appearance aspects of individual objects. Gaining control over the image generation process is fundamental to build practical applications with a user-friendly interface. In this paper, w… ▽ More

    Submitted 26 August, 2021; v1 submitted 25 March, 2021; originally announced March 2021.

    Comments: Accepted to GCPR 2021. Link to code: https://github.com/stanifrolov/AttrLostGAN

  14. Adversarial Text-to-Image Synthesis: A Review

    Authors: Stanislav Frolov, Tobias Hinz, Federico Raue, Jörn Hees, Andreas Dengel

    Abstract: With the advent of generative adversarial networks, synthesizing images from textual descriptions has recently become an active research area. It is a flexible and intuitive way for conditional image generation with significant progress in the last years regarding visual realism, diversity, and semantic alignment. However, the field still faces several challenges that require further research effo… ▽ More

    Submitted 6 October, 2021; v1 submitted 25 January, 2021; originally announced January 2021.

    Comments: Published at Neural Networks Journal, available at https://www.sciencedirect.com/science/article/pii/S0893608021002823

    Journal ref: Neural Networks, 2021

  15. arXiv:2010.14953  [pdf, other

    cs.CV

    Leveraging Visual Question Answering to Improve Text-to-Image Synthesis

    Authors: Stanislav Frolov, Shailza Jolly, Jörn Hees, Andreas Dengel

    Abstract: Generating images from textual descriptions has recently attracted a lot of interest. While current models can generate photo-realistic images of individual objects such as birds and human faces, synthesising images with multiple objects is still very difficult. In this paper, we propose an effective way to combine Text-to-Image (T2I) synthesis with Visual Question Answering (VQA) to improve the i… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: Accepted to the LANTERN workshop at COLING 2020

  16. arXiv:2010.05069  [pdf, other

    cs.CV

    Hybrid-S2S: Video Object Segmentation with Recurrent Networks and Correspondence Matching

    Authors: Fatemeh Azimi, Stanislav Frolov, Federico Raue, Joern Hees, Andreas Dengel

    Abstract: One-shot Video Object Segmentation~(VOS) is the task of pixel-wise tracking an object of interest within a video sequence, where the segmentation mask of the first frame is given at inference time. In recent years, Recurrent Neural Networks~(RNNs) have been widely used for VOS tasks, but they often suffer from limitations such as drift and error propagation. In this work, we study an RNN-based arc… ▽ More

    Submitted 7 November, 2020; v1 submitted 10 October, 2020; originally announced October 2020.