Skip to main content

Showing 1–7 of 7 results for author: Kulal, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.03206  [pdf, other

    cs.CV

    Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

    Authors: Patrick Esser, Sumith Kulal, Andreas Blattmann, Rahim Entezari, Jonas Müller, Harry Saini, Yam Levi, Dominik Lorenz, Axel Sauer, Frederic Boesel, Dustin Podell, Tim Dockhorn, Zion English, Kyle Lacey, Alex Goodwin, Yannik Marek, Robin Rombach

    Abstract: Diffusion models create data from noise by inverting the forward paths of data towards noise and have emerged as a powerful generative modeling technique for high-dimensional, perceptual data such as images and videos. Rectified flow is a recent generative model formulation that connects data and noise in a straight line. Despite its better theoretical properties and conceptual simplicity, it is n… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  2. arXiv:2311.15127  [pdf, other

    cs.CV

    Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets

    Authors: Andreas Blattmann, Tim Dockhorn, Sumith Kulal, Daniel Mendelevitch, Maciej Kilian, Dominik Lorenz, Yam Levi, Zion English, Vikram Voleti, Adam Letts, Varun Jampani, Robin Rombach

    Abstract: We present Stable Video Diffusion - a latent video diffusion model for high-resolution, state-of-the-art text-to-video and image-to-video generation. Recently, latent diffusion models trained for 2D image synthesis have been turned into generative video models by inserting temporal layers and finetuning them on small, high-quality video datasets. However, training methods in the literature vary wi… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

  3. arXiv:2304.14406  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Putting People in Their Place: Affordance-Aware Human Insertion into Scenes

    Authors: Sumith Kulal, Tim Brooks, Alex Aiken, Jiajun Wu, Jimei Yang, **gwan Lu, Alexei A. Efros, Krishna Kumar Singh

    Abstract: We study the problem of inferring scene affordances by presenting a method for realistically inserting people into scenes. Given a scene image with a marked region and an image of a person, we insert the person into the scene while respecting the scene affordances. Our model can infer the set of realistic poses given the scene context, re-pose the reference person, and harmonize the composition. W… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: CVPR 2023. Project page with code: https://sumith1896.github.io/affordance-insertion/

  4. arXiv:2206.13502  [pdf, other

    cs.CV cs.AI cs.GR cs.LG stat.ML

    Programmatic Concept Learning for Human Motion Description and Synthesis

    Authors: Sumith Kulal, Jiayuan Mao, Alex Aiken, Jiajun Wu

    Abstract: We introduce Programmatic Motion Concepts, a hierarchical motion representation for human actions that captures both low-level motion and high-level description as motion concepts. This representation enables human motion description, interactive editing, and controlled synthesis of novel video sequences within a single framework. We present an architecture that learns this concept representation… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: CVPR 2022. Project page: https://sumith1896.github.io/motion-concepts/

  5. arXiv:2104.11216  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Hierarchical Motion Understanding via Motion Programs

    Authors: Sumith Kulal, Jiayuan Mao, Alex Aiken, Jiajun Wu

    Abstract: Current approaches to video analysis of human motion focus on raw pixels or keypoints as the basic units of reasoning. We posit that adding higher-level motion primitives, which can capture natural coarser units of motion such as backswing or follow-through, can be used to improve downstream analysis tasks. This higher level of abstraction can also capture key features, such as loops of repeated p… ▽ More

    Submitted 22 April, 2021; originally announced April 2021.

    Comments: CVPR 2021. First two authors contributed equally. Project page: https://sumith1896.github.io/motion2prog/

  6. arXiv:1906.04908  [pdf, other

    cs.LG cs.CL cs.PL stat.ML

    SPoC: Search-based Pseudocode to Code

    Authors: Sumith Kulal, Panupong Pasupat, Kartik Chandra, Mina Lee, Oded Padon, Alex Aiken, Percy Liang

    Abstract: We consider the task of map** pseudocode to long programs that are functionally correct. Given test cases as a mechanism to validate programs, we search over the space of possible translations of the pseudocode to find a program that passes the validation. However, without proper credit assignment to localize the sources of program failures, it is difficult to guide search toward more promising… ▽ More

    Submitted 11 June, 2019; originally announced June 2019.

    Comments: Under submission to NeurIPS 2019

  7. arXiv:1804.05507  [pdf, ps, other

    cs.LO

    What's hard about Boolean Functional Synthesis

    Authors: S. Akshay, Supratik Chakraborty, Shubham Goel, Sumith Kulal, Shetal Shah

    Abstract: Given a relational specification between Boolean inputs and outputs, the goal of Boolean functional synthesis is to synthesize each output as a function of the inputs such that the specification is met. In this paper, we first show that unless some hard conjectures in complexity theory are falsified, Boolean functional synthesis must necessarily generate exponential-sized Skolem functions, thereby… ▽ More

    Submitted 18 May, 2018; v1 submitted 16 April, 2018; originally announced April 2018.

    Comments: Full version of a conference paper to appear in CAV 2018