Skip to main content

Showing 1–16 of 16 results for author: Schroers, C

.
  1. arXiv:2404.14967  [pdf, other

    cs.CV cs.AI cs.GR

    CoARF: Controllable 3D Artistic Style Transfer for Radiance Fields

    Authors: Deheng Zhang, Clara Fernandez-Labrador, Christopher Schroers

    Abstract: Creating artistic 3D scenes can be time-consuming and requires specialized knowledge. To address this, recent works such as ARF, use a radiance field-based approach with style constraints to generate 3D scenes that resemble a style image provided by the user. However, these methods lack fine-grained control over the resulting scenes. In this paper, we introduce Controllable Artistic Radiance Field… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: International Conference on 3D Vision 2024

  2. arXiv:2404.08580  [pdf, other

    eess.IV cs.CV

    Lossy Image Compression with Foundation Diffusion Models

    Authors: Lucas Relic, Roberto Azevedo, Markus Gross, Christopher Schroers

    Abstract: Incorporating diffusion models in the image compression domain has the potential to produce realistic and detailed reconstructions, especially at extremely low bitrates. Previous methods focus on using diffusion models as expressive decoders robust to quantization errors in the conditioning signals, yet achieving competitive results in this manner requires costly training of the diffusion model an… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  3. arXiv:2310.19535  [pdf, other

    cs.CV

    Revitalizing Legacy Video Content: Deinterlacing with Bidirectional Information Propagation

    Authors: Zhaowei Gao, Mingyang Song, Christopher Schroers, Yang Zhang

    Abstract: Due to old CRT display technology and limited transmission bandwidth, early film and TV broadcasts commonly used interlaced scanning. This meant each field contained only half of the information. Since modern displays require full frames, this has spurred research into deinterlacing, i.e. restoring the missing information in legacy video content. In this paper, we present a deep-learning-based met… ▽ More

    Submitted 5 December, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

  4. arXiv:2304.07627  [pdf

    cond-mat.mtrl-sci

    A Framework for Ductility in Metallic Glasses

    Authors: Sungwoo Sohn, Naijia Liu, Geun Hee Yoo, Aya Ochiai, Jade Chen, Callie Levitt, Guannan Liu, Samuel Charles Schroers, Ethen Lund, Eun Soo Park, Jan Schroers

    Abstract: The understanding and quantification of ductility in crystalline metals, which has led to their widespread and effective usage as a structural material, is lacking in metallic glasses (MGs). Here, we introduce such a framework for ductility. This very practical framework is based on a MGs ability to support stable shear band growth, quantified in a stress gradient, gradSDB, which we measure and ca… ▽ More

    Submitted 15 April, 2023; originally announced April 2023.

  5. arXiv:2303.13006  [pdf, other

    cs.CV cs.GR cs.LG

    Controllable Inversion of Black-Box Face Recognition Models via Diffusion

    Authors: Manuel Kansy, Anton Raël, Graziana Mignone, Jacek Naruniec, Christopher Schroers, Markus Gross, Romann M. Weber

    Abstract: Face recognition models embed a face image into a low-dimensional identity vector containing abstract encodings of identity-specific facial features that allow individuals to be distinguished from one another. We tackle the challenging task of inverting the latent space of pre-trained face recognition models without full model access (i.e. black-box setting). A variety of methods have been propose… ▽ More

    Submitted 30 September, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: 8 pages main paper + 23 pages supplementary material. Moderate revisions from v1 (different template, added user study, wording). Presented at AMFG workshop at ICCV 2023. Project page: https://studios.disneyresearch.com/2023/10/02/controllable-inversion-of-black-box-face-recognition-models-via-diffusion/

    ACM Class: I.2; I.3.3; I.4

  6. arXiv:2303.09199  [pdf, other

    cs.CV eess.IV

    A Generative Model for Digital Camera Noise Synthesis

    Authors: Mingyang Song, Yang Zhang, Tunç O. Aydın, Elham Amin Mansour, Christopher Schroers

    Abstract: Noise synthesis is a challenging low-level vision task aiming to generate realistic noise given a clean image along with the camera settings. To this end, we propose an effective generative model which utilizes clean features as guidance followed by noise injections into the network. Specifically, our generator follows a UNet-like structure with skip connections but without downsampling and upsamp… ▽ More

    Submitted 13 June, 2024; v1 submitted 16 March, 2023; originally announced March 2023.

  7. arXiv:2201.02624  [pdf, other

    eess.IV cs.CV

    Microdosing: Knowledge Distillation for GAN based Compression

    Authors: Leonhard Helminger, Roberto Azevedo, Abdelaziz Djelouah, Markus Gross, Christopher Schroers

    Abstract: Recently, significant progress has been made in learned image and video compression. In particular the usage of Generative Adversarial Networks has lead to impressive results in the low bit rate regime. However, the model size remains an important issue in current state-of-the-art proposals and existing solutions require significant computation effort on the decoding side. This limits their usage… ▽ More

    Submitted 7 January, 2022; originally announced January 2022.

    Comments: BMVC 2021

  8. arXiv:2009.04583  [pdf, other

    eess.IV cs.CV

    Blind Image Restoration with Flow Based Priors

    Authors: Leonhard Helminger, Michael Bernasconi, Abdelaziz Djelouah, Markus Gross, Christopher Schroers

    Abstract: Image restoration has seen great progress in the last years thanks to the advances in deep neural networks. Most of these existing techniques are trained using full supervision with suitable image pairs to tackle a specific degradation. However, in a blind setting with unknown degradations this is not possible and a good prior remains crucial. Recently, neural network based approaches have been pr… ▽ More

    Submitted 9 September, 2020; originally announced September 2020.

  9. arXiv:2008.10486  [pdf, other

    cs.CV

    Lossy Image Compression with Normalizing Flows

    Authors: Leonhard Helminger, Abdelaziz Djelouah, Markus Gross, Christopher Schroers

    Abstract: Deep learning based image compression has recently witnessed exciting progress and in some cases even managed to surpass transform coding based approaches that have been established and refined over many decades. However, state-of-the-art solutions for deep image compression typically employ autoencoders which map the input to a lower dimensional latent space and thus irreversibly discard informat… ▽ More

    Submitted 24 August, 2020; originally announced August 2020.

  10. arXiv:1906.01223  [pdf, other

    cs.CV eess.IV

    Content Adaptive Optimization for Neural Image Compression

    Authors: Joaquim Campos, Simon Meierhans, Abdelaziz Djelouah, Christopher Schroers

    Abstract: The field of neural image compression has witnessed exciting progress as recently proposed architectures already surpass the established transform coding based approaches. While, so far, research has mainly focused on architecture and model improvements, in this work we explore content adaptive optimization. To this end, we introduce an iterative procedure which adapts the latent representation to… ▽ More

    Submitted 5 June, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: CVPR Workshop and Challenge on Learned Image Compression (2019)

  11. arXiv:1810.02845  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    Deep Generative Video Compression

    Authors: Jun Han, Salvator Lombardo, Christopher Schroers, Stephan Mandt

    Abstract: The usage of deep generative models for image compression has led to impressive performance gains over classical codecs while neural video compression is still in its infancy. Here, we propose an end-to-end, deep generative modeling approach to compress temporal sequences with a focus on video. Our approach builds upon variational autoencoder (VAE) models for sequential data and combines them with… ▽ More

    Submitted 1 November, 2019; v1 submitted 5 October, 2018; originally announced October 2018.

    Comments: Accepted at NeurIPS 2019, 15 pages, 8 figures

  12. arXiv:1808.03232  [pdf, other

    cs.CV

    Deep Video Color Propagation

    Authors: Simone Meyer, Victor Cornillère, Abdelaziz Djelouah, Christopher Schroers, Markus Gross

    Abstract: Traditional approaches for color propagation in videos rely on some form of matching between consecutive video frames. Using appearance descriptors, colors are then propagated both spatially and temporally. These methods, however, are computationally expensive and do not take advantage of semantic information of the scene. In this work we propose a deep learning framework for color propagation tha… ▽ More

    Submitted 9 August, 2018; originally announced August 2018.

    Comments: BMVC 2018

  13. arXiv:1804.02900  [pdf, other

    cs.CV

    A Fully Progressive Approach to Single-Image Super-Resolution

    Authors: Yifan Wang, Federico Perazzi, Brian McWilliams, Alexander Sorkine-Hornung, Olga Sorkine-Hornung, Christopher Schroers

    Abstract: Recent deep learning approaches to single image super-resolution have achieved impressive results in terms of traditional error measures and perceptual quality. However, in each case it remains challenging to achieve high quality results for large upsampling factors. To this end, we propose a method (ProSR) that is progressive both in architecture and training: the network upsamples an image in in… ▽ More

    Submitted 10 April, 2018; v1 submitted 9 April, 2018; originally announced April 2018.

  14. arXiv:1804.01346  [pdf, other

    cs.CV

    Normalized Cut Loss for Weakly-supervised CNN Segmentation

    Authors: Meng Tang, Abdelaziz Djelouah, Federico Perazzi, Yuri Boykov, Christopher Schroers

    Abstract: Most recent semantic segmentation methods train deep convolutional neural networks with fully annotated masks requiring pixel-accuracy for good quality training. Common weakly-supervised approaches generate full masks from partial input (e.g. scribbles or seeds) using standard interactive segmentation methods as preprocessing. But, errors in such masks result in poorer training since standard loss… ▽ More

    Submitted 4 April, 2018; originally announced April 2018.

    Comments: Accepted at CVPR 2018

  15. arXiv:1804.00884  [pdf, other

    cs.CV

    PhaseNet for Video Frame Interpolation

    Authors: Simone Meyer, Abdelaziz Djelouah, Brian McWilliams, Alexander Sorkine-Hornung, Markus Gross, Christopher Schroers

    Abstract: Most approaches for video frame interpolation require accurate dense correspondences to synthesize an in-between frame. Therefore, they do not perform well in challenging scenarios with e.g. lighting changes or motion blur. Recent deep learning approaches that rely on kernels to represent motion can only alleviate these problems to some extent. In those cases, methods that use a per-pixel phase-ba… ▽ More

    Submitted 3 April, 2018; originally announced April 2018.

    Comments: CVPR 2018

  16. arXiv:1803.09569  [pdf, other

    cs.CV

    On Regularized Losses for Weakly-supervised CNN Segmentation

    Authors: Meng Tang, Federico Perazzi, Abdelaziz Djelouah, Ismail Ben Ayed, Christopher Schroers, Yuri Boykov

    Abstract: Minimization of regularized losses is a principled approach to weak supervision well-established in deep learning, in general. However, it is largely overlooked in semantic segmentation currently dominated by methods mimicking full supervision via "fake" fully-labeled training masks (proposals) generated from available partial input. To obtain such full masks the typical methods explicitly use sta… ▽ More

    Submitted 10 April, 2018; v1 submitted 26 March, 2018; originally announced March 2018.