Skip to main content

Showing 1–3 of 3 results for author: Montoya, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.06968  [pdf, other

    cs.CV cs.LG

    ObjectComposer: Consistent Generation of Multiple Objects Without Fine-tuning

    Authors: Alec Helbling, Evan Montoya, Duen Horng Chau

    Abstract: Recent text-to-image generative models can generate high-fidelity images from text prompts. However, these models struggle to consistently generate the same objects in different contexts with the same appearance. Consistent object generation is important to many downstream tasks like generating comic book illustrations with consistent characters and setting. Numerous approaches attempt to solve th… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  2. arXiv:2210.14896  [pdf, other

    cs.CV cs.AI cs.HC cs.LG

    DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models

    Authors: Zijie J. Wang, Evan Montoya, David Munechika, Haoyang Yang, Benjamin Hoover, Duen Horng Chau

    Abstract: With recent advancements in diffusion models, users can generate high-quality images by writing text prompts in natural language. However, generating images with desired details requires proper prompts, and it is often unclear how a model reacts to different prompts or what the best prompts are. To help researchers tackle these critical challenges, we introduce DiffusionDB, the first large-scale t… ▽ More

    Submitted 6 July, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: Accepted to ACL 2023 (nominated for best paper, top 1.6% of submissions, oral presentation). 17 pages, 11 figures. The dataset is available at https://huggingface.co/datasets/poloclub/diffusiondb. The code is at https://github.com/poloclub/diffusiondb. The interactive visualization demo is at https://poloclub.github.io/diffusiondb/explorer/

  3. arXiv:2210.13510  [pdf, other

    cs.HC

    Evaluation of Argo Scholar with Observational Study

    Authors: Kevin Li, Haoyang Yang, Evan Montoya, Anish Upadhayay, Zhiyan Zhou, Jon Saad-Falcon, Duen Horng Chau

    Abstract: Discovering and making sense of relevant literature is fundamental in any scientific field. Node-link diagram-based visualization tools can aid this process; however, existing tools have been evaluated only on small scales. This paper evaluates Argo Scholar, an open-source visualization tool designed for interactive exploration of literature and easy sharing of exploration results. A large-scale u… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: VIS IEEE 22