Showing 1–1 of 1 results for author: Ryoo, W

  1. arXiv:2309.04509  [pdf, other]

    cs.SD cs.CV cs.GR eess.AS

    The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion

    Authors: Yujin Jeong, Wonjeong Ryoo, Seunghyun Lee, Dabin Seo, Wonmin Byeon, Sangpil Kim, Jinkyu Kim

    Abstract: In recent years, video generation has become a prominent generative tool and has drawn significant attention. However, there is little consideration in audio-to-video generation, though audio contains unique qualities like temporal semantics and magnitude. Hence, we propose The Power of Sound (TPoS) model to incorporate audio input that includes both changeable temporal semantics and magnitude. To…

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: ICCV2023