Skip to main content

Showing 1–5 of 5 results for author: Sick, L

.
  1. arXiv:2403.11821  [pdf, other

    cs.CV cs.AI cs.GR

    Evaluating Text-to-Image Synthesis: Survey and Taxonomy of Image Quality Metrics

    Authors: Sebastian Hartwig, Dominik Engel, Leon Sick, Hannah Kniesel, Tristan Payer, Poonam Poonam, Michael Glöckler, Alex Bäuerle, Timo Ropinski

    Abstract: Recent advances in text-to-image synthesis enabled through a combination of language and vision foundation models have led to a proliferation of the tools available and an increased attention to the field. When conducting text-to-image synthesis, a central goal is to ensure that the content between text and image is aligned. As such, there exist numerous evaluation metrics that aim to mimic human… ▽ More

    Submitted 15 April, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: preprint, 20 pages, 2 figures, 1 table

  2. arXiv:2403.11691  [pdf, other

    cs.CV

    TTT-KD: Test-Time Training for 3D Semantic Segmentation through Knowledge Distillation from Foundation Models

    Authors: Lisa Weijler, Muhammad Jehanzeb Mirza, Leon Sick, Can Ekkazan, Pedro Hermosilla

    Abstract: Test-Time Training (TTT) proposes to adapt a pre-trained network to changing data distributions on-the-fly. In this work, we propose the first TTT method for 3D semantic segmentation, TTT-KD, which models Knowledge Distillation (KD) from foundation models (e.g. DINOv2) as a self-supervised objective for adaptation to distribution shifts at test-time. Given access to paired image-pointcloud (2D-3D)… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  3. arXiv:2402.15172  [pdf, other

    cs.CV cs.LG

    Attention-Guided Masked Autoencoders For Learning Image Representations

    Authors: Leon Sick, Dominik Engel, Pedro Hermosilla, Timo Ropinski

    Abstract: Masked autoencoders (MAEs) have established themselves as a powerful method for unsupervised pre-training for computer vision tasks. While vanilla MAEs put equal emphasis on reconstructing the individual parts of the image, we propose to inform the reconstruction process through an attention-guided loss function. By leveraging advances in unsupervised object discovery, we obtain an attention map o… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  4. arXiv:2309.12378  [pdf, other

    cs.CV

    Unsupervised Semantic Segmentation Through Depth-Guided Feature Correlation and Sampling

    Authors: Leon Sick, Dominik Engel, Pedro Hermosilla, Timo Ropinski

    Abstract: Traditionally, training neural networks to perform semantic segmentation required expensive human-made annotations. But more recently, advances in the field of unsupervised learning have made significant progress on this issue and towards closing the gap to supervised algorithms. To achieve this, semantic knowledge is distilled by learning to correlate randomly sampled features from images across… ▽ More

    Submitted 26 March, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: Accepted at CVPR 2024

  5. Leveraging Self-Supervised Vision Transformers for Segmentation-based Transfer Function Design

    Authors: Dominik Engel, Leon Sick, Timo Ropinski

    Abstract: In volume rendering, transfer functions are used to classify structures of interest, and to assign optical properties such as color and opacity. They are commonly defined as 1D or 2D functions that map simple features to these optical properties. As the process of designing a transfer function is typically tedious and unintuitive, several approaches have been proposed for their interactive specifi… ▽ More

    Submitted 14 May, 2024; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: accepted at TVCG 2024