Skip to main content

Showing 1–1 of 1 results for author: Alghamdi, M M

Searching in archive cs. Search in all archives.
.
  1. Talking Head from Speech Audio using a Pre-trained Image Generator

    Authors: Mohammed M. Alghamdi, He Wang, Andrew J. Bulpitt, David C. Hogg

    Abstract: We propose a novel method for generating high-resolution videos of talking-heads from speech audio and a single 'identity' image. Our method is based on a convolutional neural network model that incorporates a pre-trained StyleGAN generator. We model each frame as a point in the latent space of StyleGAN so that a video corresponds to a trajectory through the latent space. Training the network is i… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

    Comments: Accepted at ACM Multimedia 2022. The Project webpage can found at https://mohammedalghamdi.github.io/talking-heads-acm-mm