Skip to main content

Showing 1–1 of 1 results for author: Redden, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.10853  [pdf, other

    cs.CV

    LDM3D: Latent Diffusion Model for 3D

    Authors: Gabriela Ben Melech Stan, Diana Wofk, Scottie Fox, Alex Redden, Will Saxton, Jean Yu, Estelle Aflalo, Shao-Yen Tseng, Fabio Nonato, Matthias Muller, Vasudev Lal

    Abstract: This research paper proposes a Latent Diffusion Model for 3D (LDM3D) that generates both image and depth map data from a given text prompt, allowing users to generate RGBD images from text prompts. The LDM3D model is fine-tuned on a dataset of tuples containing an RGB image, depth map and caption, and validated through extensive experiments. We also develop an application called DepthFusion, which… ▽ More

    Submitted 21 May, 2023; v1 submitted 18 May, 2023; originally announced May 2023.