Skip to main content

Showing 1–1 of 1 results for author: Tubau, M

.
  1. arXiv:1903.10195  [pdf, other

    cs.MM cs.CV

    Wav2Pix: Speech-conditioned Face Generation using Generative Adversarial Networks

    Authors: Amanda Duarte, Francisco Roldan, Miquel Tubau, Janna Escur, Santiago Pascual, Amaia Salvador, Eva Mohedano, Kevin McGuinness, Jordi Torres, Xavier Giro-i-Nieto

    Abstract: Speech is a rich biometric signal that contains information about the identity, gender and emotional state of the speaker. In this work, we explore its potential to generate face images of a speaker by conditioning a Generative Adversarial Network (GAN) with raw speech input. We propose a deep neural network that is trained from scratch in an end-to-end fashion, generating a face directly from the… ▽ More

    Submitted 25 March, 2019; originally announced March 2019.

    Comments: ICASSP 2019. Projevct website at https://imatge-upc.github.io/wav2pix/