Skip to main content

Showing 1–3 of 3 results for author: Fischer, J S

.
  1. arXiv:2407.00783  [pdf, other

    cs.CV cs.AI

    Diffusion Models and Representation Learning: A Survey

    Authors: Michael Fuest, **chuan Ma, Ming Gui, Johannes S. Fischer, Vincent Tao Hu, Bjorn Ommer

    Abstract: Diffusion Models are popular generative modeling methods in various vision tasks, attracting significant attention. They can be considered a unique instance of self-supervised learning methods due to their independence from label annotation. This survey explores the interplay between diffusion models and representation learning. It provides an overview of diffusion models' essential aspects, inclu… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: Github Repo: https://github.com/dongzhuoyao/Diffusion-Representation-Learning-Survey-Taxonomy

  2. arXiv:2403.13788  [pdf, other

    cs.CV

    DepthFM: Fast Monocular Depth Estimation with Flow Matching

    Authors: Ming Gui, Johannes S. Fischer, Ulrich Prestel, **chuan Ma, Dmytro Kotovenko, Olga Grebenkova, Stefan Andreas Baumann, Vincent Tao Hu, Björn Ommer

    Abstract: Monocular depth estimation is crucial for numerous downstream vision tasks and applications. Current discriminative approaches to this problem are limited due to blurry artifacts, while state-of-the-art generative methods suffer from slow sampling due to their SDE nature. Rather than starting from noise, we seek a direct map** from input image to depth map. We observe that this can be effectivel… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  3. arXiv:2312.07360  [pdf, other

    cs.CV

    Boosting Latent Diffusion with Flow Matching

    Authors: Johannes S. Fischer, Ming Gui, **chuan Ma, Nick Stracke, Stefan A. Baumann, Björn Ommer

    Abstract: Recently, there has been tremendous progress in visual synthesis and the underlying generative models. Here, diffusion models (DMs) stand out particularly, but lately, flow matching (FM) has also garnered considerable interest. While DMs excel in providing diverse images, they suffer from long training and slow generation. With latent diffusion, these issues are only partially alleviated. Converse… ▽ More

    Submitted 28 March, 2024; v1 submitted 12 December, 2023; originally announced December 2023.