Showing 1–3 of 3 results for author: Stracke, N

  1. arXiv:2405.07913  [pdf, other]

    cs.CV

    CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models

    Authors: Nick Stracke, Stefan Andreas Baumann, Joshua M. Susskind, Miguel Angel Bautista, Björn Ommer

    Abstract: Text-to-image generative models have become a prominent and powerful tool that excels at generating high-resolution realistic images. However, guiding the generative process of these models to consider detailed forms of conditioning reflecting style and/or structure information remains an open problem. In this paper, we present LoRAdapter, an approach that unifies both style and structure conditio…

    Submitted 13 May, 2024; originally announced May 2024.

  2. arXiv:2403.17064  [pdf, other]

    cs.CV cs.AI cs.LG

    Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions

    Authors: Stefan Andreas Baumann, Felix Krause, Michael Neumayr, Nick Stracke, Vincent Tao Hu, Björn Ommer

    Abstract: In recent years, advances in text-to-image (T2I) diffusion models have substantially elevated the quality of their generated images. However, achieving fine-grained control over attributes remains a challenge due to the limitations of natural language prompts (such as no continuous set of intermediate descriptions existing between "person" and "old person"). Even though many methods were intro…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Project page: https://compvis.github.io/attribute-control

  3. arXiv:2312.07360  [pdf, other]

    cs.CV

    Boosting Latent Diffusion with Flow Matching

    Authors: Johannes S. Fischer, Ming Gui, Pingchuan Ma, Nick Stracke, Stefan A. Baumann, Björn Ommer

    Abstract: Recently, there has been tremendous progress in visual synthesis and the underlying generative models. Here, diffusion models (DMs) stand out particularly, but lately, flow matching (FM) has also garnered considerable interest. While DMs excel in providing diverse images, they suffer from long training and slow generation. With latent diffusion, these issues are only partially alleviated. Converse…

    Submitted 28 March, 2024; v1 submitted 12 December, 2023; originally announced December 2023.