Showing 1–10 of 10 results for author: Schaub-Meyer, S

  1. arXiv:2405.20469  [pdf, other]

    cs.CV

    Is Synthetic Data all We Need? Benchmarking the Robustness of Models Trained with Synthetic Images

    Authors: Krishnakant Singh, Thanush Navaratnam, Jannik Holmer, Simone Schaub-Meyer, Stefan Roth

    Abstract: A long-standing challenge in developing machine learning approaches has been the lack of high-quality labeled data. Recently, models trained with purely synthetic data, here termed synthetic clones, generated using large-scale pre-trained diffusion models have shown promising results in overcoming this annotation bottleneck. As these synthetic clone models progress, they are likely to be deployed…

    Submitted 30 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted at CVPR 2024 Workshop: SyntaGen-Harnessing Generative Models for Synthetic Visual Datasets. Project page at https://synbenchmark.github.io/SynCloneBenchmark (revision: fix typo in Fig. 1)

  2. arXiv:2404.16818  [pdf, other]

    cs.CV

    Boosting Unsupervised Semantic Segmentation with Principal Mask Proposals

    Authors: Oliver Hahn, Nikita Araslanov, Simone Schaub-Meyer, Stefan Roth

    Abstract: Unsupervised semantic segmentation aims to automatically partition images into semantically meaningful regions by identifying global categories within an image corpus without any form of annotation. Building upon recent advances in self-supervised representation learning, we focus on how to leverage these large pre-trained models for the downstream task of unsupervised segmentation. We present Pri…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Code: https://github.com/visinf/primaps

  3. arXiv:2403.17128  [pdf, other]

    cs.CV

    Benchmarking Video Frame Interpolation

    Authors: Simon Kiefhaber, Simon Niklaus, Feng Liu, Simone Schaub-Meyer

    Abstract: Video frame interpolation, the task of synthesizing new frames in between two or more given ones, is becoming an increasingly popular research target. However, the current evaluation of frame interpolation techniques is not ideal. Due to the plethora of test datasets available and inconsistent computation of error metrics, a coherent and fair comparison across papers is very challenging. Furthermo…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: http://sniklaus.com/vfibench

  4. arXiv:2308.06248  [pdf, other]

    cs.CV cs.LG

    FunnyBirds: A Synthetic Vision Dataset for a Part-Based Analysis of Explainable AI Methods

    Authors: Robin Hesse, Simone Schaub-Meyer, Stefan Roth

    Abstract: The field of explainable artificial intelligence (XAI) aims to uncover the inner workings of complex deep neural models. While being crucial for safety-critical domains, XAI inherently lacks ground-truth explanations, making its automatic evaluation an unsolved problem. We address this challenge by proposing a novel synthetic vision dataset, named FunnyBirds, and accompanying automatic evaluation…

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: Accepted at ICCV 2023. Code: https://github.com/visinf/funnybirds

  5. arXiv:2305.09504  [pdf, other]

    cs.CV cs.LG

    Content-Adaptive Downsampling in Convolutional Neural Networks

    Authors: Robin Hesse, Simone Schaub-Meyer, Stefan Roth

    Abstract: Many convolutional neural networks (CNNs) rely on progressive downsampling of their feature maps to increase the network's receptive field and decrease computational cost. However, this comes at the price of losing granularity in the feature maps, limiting the ability to correctly understand images or recover fine detail in dense prediction tasks. To address this, common practice is to replace the…

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: Accepted at CVPR 2023 Workshop on Efficient Deep Learning for Computer Vision (ECV). Code: https://github.com/visinf/cad

  6. arXiv:2211.14005  [pdf, other]

    cs.CV

    Efficient Feature Extraction for High-resolution Video Frame Interpolation

    Authors: Moritz Nottebaum, Stefan Roth, Simone Schaub-Meyer

    Abstract: Most deep learning methods for video frame interpolation consist of three main components: feature extraction, motion estimation, and image synthesis. Existing approaches are mainly distinguishable in terms of how these modules are designed. However, when interpolating high-resolution images, e.g. at 4K, the design choices for achieving high accuracy within reasonable memory requirements are limit…

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: Accepted to BMVC 2022. Code: https://github.com/visinf/fldr-vfi

  7. arXiv:2211.12209  [pdf, other]

    cs.CV

    $S^2$-Flow: Joint Semantic and Style Editing of Facial Images

    Authors: Krishnakant Singh, Simone Schaub-Meyer, Stefan Roth

    Abstract: The high-quality images yielded by generative adversarial networks (GANs) have motivated investigations into their application for image editing. However, GANs are often limited in the control they provide for performing specific edits. One of the principal challenges is the entangled latent space of GANs, which is not directly suitable for performing independent and detailed edits. Recent editing…

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: Accepted to BMVC 2022

  8. arXiv:2209.15404  [pdf, other]

    cs.CV cs.LG

    Entropy-driven Unsupervised Keypoint Representation Learning in Videos

    Authors: Ali Younes, Simone Schaub-Meyer, Georgia Chalvatzaki

    Abstract: Extracting informative representations from videos is fundamental for effectively learning various downstream tasks. We present a novel approach for unsupervised learning of meaningful representations from videos, leveraging the concept of image spatial entropy (ISE) that quantifies the per-pixel information in an image. We argue that local entropy of pixel neighborhoods and their tempora…

    Submitted 6 June, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: 29 pages, 14 figures, Accepted at ICML 2023

  9. arXiv:2111.07668  [pdf, other]

    cs.LG cs.CV

    Fast Axiomatic Attribution for Neural Networks

    Authors: Robin Hesse, Simone Schaub-Meyer, Stefan Roth

    Abstract: Mitigating the dependence on spurious correlations present in the training dataset is a quickly emerging and important topic of deep learning. Recent approaches include priors on the feature attribution of a deep neural network (DNN) into the training process to reduce the dependence on unwanted features. However, until now one needed to trade off high-quality attributions, satisfying desirable ax…

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: To appear at NeurIPS*2021. Project page and code: https://visinf.github.io/fast-axiomatic-attribution

  10. arXiv:2111.06265  [pdf, other]

    cs.CV cs.LG

    Dense Unsupervised Learning for Video Segmentation

    Authors: Nikita Araslanov, Simone Schaub-Meyer, Stefan Roth

    Abstract: We present a novel approach to unsupervised learning for video object segmentation (VOS). Unlike previous work, our formulation allows to learn dense feature representations directly in a fully convolutional regime. We rely on uniform grid sampling to extract a set of anchors and train our model to disambiguate between them on both inter- and intra-video levels. However, a naive scheme to train su…

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: To appear at NeurIPS*2021. Code: https://github.com/visinf/dense-ulearn-vos