Skip to main content

Showing 1–13 of 13 results for author: Suwajanakorn, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.11994  [pdf, other

    cs.CV

    Optimizing Diffusion Noise Can Serve As Universal Motion Priors

    Authors: Korrawe Karunratanakul, Konpat Preechakul, Emre Aksan, Thabo Beeler, Supasorn Suwajanakorn, Siyu Tang

    Abstract: We propose Diffusion Noise Optimization (DNO), a new method that effectively leverages existing motion diffusion models as motion priors for a wide range of motion-related tasks. Instead of training a task-specific diffusion model for each new task, DNO operates by optimizing the diffusion latent noise of an existing pre-trained text-to-motion model. Given the corresponding latent noise of a human… ▽ More

    Submitted 3 April, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: CVPR 2024. Project page: https://korrawe.github.io/dno-project/

  2. arXiv:2312.09168  [pdf, other

    cs.CV cs.GR cs.LG

    DiffusionLight: Light Probes for Free by Painting a Chrome Ball

    Authors: Pakkapon Phongthawee, Worameth Chinchuthakun, Nontaphat Sinsunthithet, Amit Raj, Varun Jampani, Pramook Khungurn, Supasorn Suwajanakorn

    Abstract: We present a simple yet effective technique to estimate lighting in a single input image. Current techniques rely heavily on HDR panorama datasets to train neural networks to regress an input with limited field-of-view to a full environment map. However, these approaches often struggle with real-world, uncontrolled settings due to the limited diversity and size of their datasets. To address this p… ▽ More

    Submitted 9 April, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: CVPR 2024 Oral. For more information and code, please visit our website https://diffusionlight.github.io/

    ACM Class: I.3.3; I.4.8

  3. arXiv:2307.11118  [pdf, other

    cs.CV

    Diffusion Sampling with Momentum for Mitigating Divergence Artifacts

    Authors: Suttisak Wizadwongsa, Worameth Chinchuthakun, Pramook Khungurn, Amit Raj, Supasorn Suwajanakorn

    Abstract: Despite the remarkable success of diffusion models in image generation, slow sampling remains a persistent issue. To accelerate the sampling process, prior studies have reformulated diffusion sampling as an ODE/SDE and introduced higher-order numerical methods. However, these methods often produce divergence artifacts, especially with a low number of sampling steps, which limits the achievable acc… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: Project page: https://github.com/sWizad/momentum-diffusion

  4. arXiv:2305.12577  [pdf, other

    cs.CV

    Guided Motion Diffusion for Controllable Human Motion Synthesis

    Authors: Korrawe Karunratanakul, Konpat Preechakul, Supasorn Suwajanakorn, Siyu Tang

    Abstract: Denoising diffusion models have shown great promise in human motion synthesis conditioned on natural language descriptions. However, integrating spatial constraints, such as pre-defined motion trajectories and obstacles, remains a challenge despite being essential for bridging the gap between isolated human motion and its surrounding environment. To address this issue, we propose Guided Motion Dif… ▽ More

    Submitted 29 October, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: ICCV23. Project page: https://korrawe.github.io/gmd-project/

  5. arXiv:2304.09479  [pdf, other

    cs.CV cs.GR cs.LG

    DiFaReli: Diffusion Face Relighting

    Authors: Puntawat Ponglertnapakorn, Nontawat Tritrong, Supasorn Suwajanakorn

    Abstract: We present a novel approach to single-view face relighting in the wild. Handling non-diffuse effects, such as global illumination or cast shadows, has long been a challenge in face relighting. Prior work often assumes Lambertian surfaces, simplified lighting models or involves estimating 3D shape, albedo, or a shadow map. This estimation, however, is error-prone and requires many training examples… ▽ More

    Submitted 7 September, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: ICCV 2023

  6. arXiv:2304.02744  [pdf, other

    cs.CV

    StyleGAN Salon: Multi-View Latent Optimization for Pose-Invariant Hairstyle Transfer

    Authors: Sasikarn Khwanmuang, Pakkapon Phongthawee, Patsorn Sangkloy, Supasorn Suwajanakorn

    Abstract: Our paper seeks to transfer the hairstyle of a reference image to an input photo for virtual hair try-on. We target a variety of challenges scenarios, such as transforming a long hairstyle with bangs to a pixie cut, which requires removing the existing hair and inferring how the forehead would look, or transferring partially visible hair from a hat-wearing person in a different pose. Past solution… ▽ More

    Submitted 2 June, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: Accepted to CVPR2023

  7. arXiv:2303.13396  [pdf, other

    cs.CV

    Zero-guidance Segmentation Using Zero Segment Labels

    Authors: Pitchaporn Rewatbowornwong, Nattanat Chatthee, Ekapol Chuangsuwanich, Supasorn Suwajanakorn

    Abstract: CLIP has enabled new and exciting joint vision-language applications, one of which is open-vocabulary segmentation, which can locate any segment given an arbitrary text query. In our research, we ask whether it is possible to discover semantic segments without any user guidance in the form of text queries or predefined classes, and label them using natural language automatically? We propose a nove… ▽ More

    Submitted 4 September, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

  8. arXiv:2301.11558  [pdf, other

    cs.CV

    Accelerating Guided Diffusion Sampling with Splitting Numerical Methods

    Authors: Suttisak Wizadwongsa, Supasorn Suwajanakorn

    Abstract: Guided diffusion is a technique for conditioning the output of a diffusion model at sampling time without retraining the network for each specific task. One drawback of diffusion models, however, is their slow sampling process. Recent techniques can accelerate unguided sampling by applying high-order numerical methods to the sampling process when viewed as differential equations. On the contrary,… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: Code now available at https://github.com/sWizad/split-diffusion

  9. arXiv:2111.15640  [pdf, other

    cs.CV cs.LG

    Diffusion Autoencoders: Toward a Meaningful and Decodable Representation

    Authors: Konpat Preechakul, Nattanat Chatthee, Suttisak Wizadwongsa, Supasorn Suwajanakorn

    Abstract: Diffusion probabilistic models (DPMs) have achieved remarkable quality in image generation that rivals GANs'. But unlike GANs, DPMs use a set of latent variables that lack semantic meaning and cannot serve as a useful representation for other tasks. This paper explores the possibility of using DPMs for representation learning and seeks to extract a meaningful and decodable representation of an inp… ▽ More

    Submitted 9 March, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

    Comments: Please visit our project page: https://Diff-AE.github.io/

  10. arXiv:2103.05606  [pdf, other

    cs.CV cs.GR cs.LG

    NeX: Real-time View Synthesis with Neural Basis Expansion

    Authors: Suttisak Wizadwongsa, Pakkapon Phongthawee, Jiraphon Yenphraphai, Supasorn Suwajanakorn

    Abstract: We present NeX, a new approach to novel view synthesis based on enhancements of multiplane image (MPI) that can reproduce next-level view-dependent effects -- in real time. Unlike traditional MPI that uses a set of simple RGB$α$ planes, our technique models view-dependent effects by instead parameterizing each pixel as a linear combination of basis functions learned from a neural network. Moreover… ▽ More

    Submitted 12 April, 2021; v1 submitted 9 March, 2021; originally announced March 2021.

    Comments: CVPR 2021 (Oral)

  11. arXiv:2103.04379  [pdf, other

    cs.CV cs.LG

    Repurposing GANs for One-shot Semantic Part Segmentation

    Authors: Nontawat Tritrong, Pitchaporn Rewatbowornwong, Supasorn Suwajanakorn

    Abstract: While GANs have shown success in realistic image generation, the idea of using GANs for other tasks unrelated to synthesis is underexplored. Do GANs learn meaningful structural parts of objects during their attempt to reproduce those objects? In this work, we test this hypothesis and propose a simple and effective approach based on GANs for semantic part segmentation that requires as few as one la… ▽ More

    Submitted 5 July, 2021; v1 submitted 7 March, 2021; originally announced March 2021.

    Comments: CVPR 2021 (Oral)

  12. arXiv:1807.03146  [pdf, other

    cs.CV cs.LG stat.ML

    Discovery of Latent 3D Keypoints via End-to-end Geometric Reasoning

    Authors: Supasorn Suwajanakorn, Noah Snavely, Jonathan Tompson, Mohammad Norouzi

    Abstract: This paper presents KeypointNet, an end-to-end geometric reasoning framework to learn an optimal set of category-specific 3D keypoints, along with their detectors. Given a single image, KeypointNet extracts 3D keypoints that are optimized for a downstream task. We demonstrate this framework on 3D pose estimation by proposing a differentiable objective that seeks the optimal set of keypoints for re… ▽ More

    Submitted 23 November, 2018; v1 submitted 5 July, 2018; originally announced July 2018.

  13. arXiv:1506.00752  [pdf, other

    cs.CV

    What Makes Kevin Spacey Look Like Kevin Spacey

    Authors: Supasorn Suwajanakorn, Ira Kemelmacher-Shlizerman, Steve Seitz

    Abstract: We reconstruct a controllable model of a person from a large photo collection that captures his or her {\em persona}, i.e., physical appearance and behavior. The ability to operate on unstructured photo collections enables modeling a huge number of people, including celebrities and other well photographed people without requiring them to be scanned. Moreover, we show the ability to drive or {\em p… ▽ More

    Submitted 2 June, 2015; originally announced June 2015.