Showing 1–2 of 2 results for author: Hasegawa, D

  1. arXiv:2007.09170

    cs.CV cs.GR cs.HC cs.LG

    Moving fast and slow: Analysis of representations and post-processing in speech-driven automatic gesture generation

    Authors: Taras Kucherenko, Dai Hasegawa, Naoshi Kaneko, Gustav Eje Henter, Hedvig Kjellström

    Abstract: This paper presents a novel framework for speech-driven gesture production, applicable to virtual agents to enhance human-computer interaction. Specifically, we extend recent deep-learning-based, data-driven methods for speech-driven gesture generation by incorporating representation learning. Our model takes speech as input and produces gestures as output, in the form of a sequence of 3D coordina…

    Submitted 28 January, 2021; v1 submitted 16 July, 2020; originally announced July 2020.

    Comments: Extension of our IVA'19 paper. Accepted at the International Journal of Human-Computer Interaction. See more at https://svito-zar.github.io/audio2gestures/. arXiv admin note: substantial text overlap with arXiv:1903.03369

    ACM Class: I.2.7; I.2.6; I.3.7

    Journal ref: Int. J. Hum. Comput. Interact. (2021)

  2. Analyzing Input and Output Representations for Speech-Driven Gesture Generation

    Authors: Taras Kucherenko, Dai Hasegawa, Gustav Eje Henter, Naoshi Kaneko, Hedvig Kjellström

    Abstract: This paper presents a novel framework for automatic speech-driven gesture generation, applicable to human-agent interaction including both virtual agents and robots. Specifically, we extend recent deep-learning-based, data-driven methods for speech-driven gesture generation by incorporating representation learning. Our model takes speech as input and produces gestures as output, in the form of a s…

    Submitted 11 June, 2019; v1 submitted 8 March, 2019; originally announced March 2019.

    Comments: Accepted at IVA '19. Shorter version published at AAMAS '19. The code is available at https://github.com/GestureGeneration/Speech_driven_gesture_generation_with_autoencoder

    ACM Class: I.2.6; I.5.1; J.4