Showing 1–2 of 2 results for author: Rampas, D

  1. arXiv:2306.00637  [pdf, other]

    cs.CV

    Wuerstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models

    Authors: Pablo Pernias, Dominic Rampas, Mats L. Richter, Christopher J. Pal, Marc Aubreville

    Abstract: We introduce Würstchen, a novel architecture for text-to-image synthesis that combines competitive performance with unprecedented cost-effectiveness for large-scale text-to-image diffusion models. A key contribution of our work is to develop a latent diffusion technique in which we learn a detailed but extremely compact semantic image representation used to guide the diffusion process. This highly…

    Submitted 29 September, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Corresponding to "Würstchen v2"

    Journal ref: The Twelfth International Conference on Learning Representations (ICLR), 2024

  2. arXiv:2211.07292  [pdf, other]

    cs.CV cs.LG

    A Novel Sampling Scheme for Text- and Image-Conditional Image Synthesis in Quantized Latent Spaces

    Authors: Dominic Rampas, Pablo Pernias, Marc Aubreville

    Abstract: Recent advancements in the domain of text-to-image synthesis have culminated in a multitude of enhancements pertaining to quality, fidelity, and diversity. Contemporary techniques enable the generation of highly intricate visuals which rapidly approach near-photorealistic quality. Nevertheless, as progress is achieved, the complexity of these methodologies increases, consequently intensifying the…

    Submitted 23 May, 2023; v1 submitted 14 November, 2022; originally announced November 2022.