Skip to main content

Showing 1–4 of 4 results for author: Burgert, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.20095  [pdf, other

    cs.RO cs.AI cs.CL cs.CV cs.LG

    LLaRA: Supercharging Robot Learning Data for Vision-Language Policy

    Authors: Xiang Li, Cristina Mata, Jongwoo Park, Kumara Kahatapitiya, Yoo Sung Jang, **ghuan Shang, Kanchana Ranasinghe, Ryan Burgert, Mu Cai, Yong Jae Lee, Michael S. Ryoo

    Abstract: Large Language Models (LLMs) equipped with extensive world knowledge and strong reasoning skills can tackle diverse tasks across domains, often by posing them as conversation-style instruction-response pairs. In this paper, we propose LLaRA: Large Language and Robotics Assistant, a framework which formulates robot action policy as conversations, and provides improved responses when trained with au… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2312.03817  [pdf, other

    cs.CV

    Diffusion Illusions: Hiding Images in Plain Sight

    Authors: Ryan Burgert, Xiang Li, Abe Leite, Kanchana Ranasinghe, Michael S. Ryoo

    Abstract: We explore the problem of computationally generating special `prime' images that produce optical illusions when physically arranged and viewed in a certain way. First, we propose a formal definition for this problem. Next, we introduce Diffusion Illusions, the first comprehensive pipeline designed to automatically generate a wide range of these illusions. Specifically, we both adapt the existing `… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  3. arXiv:2211.13224  [pdf, other

    cs.CV cs.CL cs.LG

    Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors

    Authors: Ryan Burgert, Kanchana Ranasinghe, Xiang Li, Michael S. Ryoo

    Abstract: Recently, text-to-image diffusion models have shown remarkable capabilities in creating realistic images from natural language prompts. However, few works have explored using these models for semantic localization or grounding. In this work, we explore how an off-the-shelf text-to-image diffusion model, trained without exposure to localization information, can ground various semantic phrases witho… ▽ More

    Submitted 21 June, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: 19 pages; contains appendix

  4. arXiv:2206.13500  [pdf, other

    cs.CV cs.GR cs.LG cs.RO

    Neural Neural Textures Make Sim2Real Consistent

    Authors: Ryan Burgert, **ghuan Shang, Xiang Li, Michael Ryoo

    Abstract: Unpaired image translation algorithms can be used for sim2real tasks, but many fail to generate temporally consistent results. We present a new approach that combines differentiable rendering with image translation to achieve temporal consistency over indefinite timescales, using surface consistency losses and \emph{neural neural textures}. We call this algorithm TRITON (Texture Recovering Image T… ▽ More

    Submitted 15 December, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: 9 pages, 10 figures (without references or appendix); 16 pages, 16 figures (with appendix)