Showing 1–2 of 2 results for author: Tzun, T T

  1. arXiv:2403.06381  [pdf, other]

    cs.CV

    Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models

    Authors: Yang Zhang, Teoh Tze Tzun, Lim Wei Hern, Tiviatis Sim, Kenji Kawaguchi

    Abstract: Recent advancements in diffusion models have notably improved the perceptual quality of generated images in text-to-image synthesis tasks. However, diffusion models often struggle to produce images that accurately reflect the intended semantics of the associated text prompts. We examine cross-attention layers in diffusion models and observe a propensity for these layers to disproportionately focus…

    Submitted 10 March, 2024; originally announced March 2024.

  2. arXiv:2311.12803  [pdf, other]

    cs.MM cs.AI cs.GR

    On Copyright Risks of Text-to-Image Diffusion Models

    Authors: Yang Zhang, Teoh Tze Tzun, Lim Wei Hern, Haonan Wang, Kenji Kawaguchi

    Abstract: Diffusion models excel in many generative modeling tasks, notably in creating images from text prompts, a task referred to as text-to-image (T2I) generation. Despite the ability to generate high-quality images, these models often replicate elements from their training data, leading to increasing copyright concerns in real applications in recent years. In response to this rising concern about copy…

    Submitted 18 February, 2024; v1 submitted 14 September, 2023; originally announced November 2023.

    Comments: 16 pages including appendix