Showing 1–3 of 3 results for author: Sigal, L

Searching in archive eess.
  1. arXiv:2104.02606  [pdf, other]

    cs.CV cs.SD eess.AS eess.IV

    Weakly-supervised Audio-visual Sound Source Detection and Separation

    Authors: Tanzila Rahman, Leonid Sigal

    Abstract: Learning how to localize and separate individual object sounds in the audio channel of a video is a difficult task. Current state-of-the-art methods predict audio masks from artificially mixed spectrograms, an approach known as the Mix-and-Separate framework. We propose an audio-visual co-segmentation, where the network learns both what individual objects look and sound like, from videos labeled with only objec…

    Submitted 25 March, 2021; originally announced April 2021.

    Comments: 4 figures, 6 pages

    Journal ref: IEEE International Conference on Multimedia and Expo (ICME) 2021

  2. arXiv:1912.02401  [pdf, other]

    cs.CV cs.LG eess.IV

    Generating Videos of Zero-Shot Compositions of Actions and Objects

    Authors: Megha Nawhal, Mengyao Zhai, Andreas Lehrmann, Leonid Sigal, Greg Mori

    Abstract: Human activity videos involve rich, varied interactions between people and objects. In this paper we develop methods for generating such videos -- making progress toward addressing the important, open problem of video generation in complex scenes. In particular, we introduce the task of generating human-object interaction videos in a zero-shot compositional setting, i.e., generating videos for act…

    Submitted 17 July, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: Accepted at ECCV'20; Project Page: https://www.sfu.ca/~mnawhal/projects/zs_hoi_generation.html

  3. arXiv:1811.11389  [pdf, other]

    cs.CV eess.IV

    Image Generation from Layout

    Authors: Bo Zhao, Lili Meng, Weidong Yin, Leonid Sigal

    Abstract: Despite significant recent progress on generative models, controlled generation of images depicting multiple and complex object layouts is still a difficult problem. Among the core challenges are the diversity of appearance a given object may possess and, as a result, the exponential set of images consistent with a specified layout. To address these challenges, we propose a novel approach for layout-b…

    Submitted 14 October, 2019; v1 submitted 28 November, 2018; originally announced November 2018.

    Comments: Accepted to CVPR 2019 (Oral)