Skip to main content

Showing 1–29 of 29 results for author: Kucherenko, T

.
  1. arXiv:2407.03184  [pdf, ps, other

    math.DS

    Realization of Anosov Diffeomorphisms on the Torus

    Authors: Tamara Kucherenko, Anthony Quas

    Abstract: We study area preserving Anosov maps on the two-dimensional torus within a fixed homotopy class. We show that the set of pressure functions for Anosov diffeomorphisms with respect to the geometric potential is equal to the set of pressure functions for the linear Anosov automorphism with respect to Hölder potentials. We use this result to provide a negative answer to the $C^{1+α}$ version of the q… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    MSC Class: 37D35; 37B10; 37A60; 37C15; 37D20

  2. arXiv:2310.18855  [pdf, ps, other

    math.DS

    Ergodic theory on coded shift spaces

    Authors: Tamara Kucherenko, Martin Schmoll, Christian Wolf

    Abstract: We study ergodic-theoretic properties of coded shift spaces. A coded shift space is defined as a closure of all bi-infinite concatenations of words from a fixed countable generating set. We derive sufficient conditions for the uniqueness of measures of maximal entropy and equilibrium states of Hoelder continuous potentials based on the partition of the coded shift into its concatenation set (seque… ▽ More

    Submitted 9 July, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: 42 pages

    MSC Class: 37A35; 37B10; 37B40; 37D35

  3. arXiv:2308.12646  [pdf, other

    cs.HC cs.GR cs.LG

    The GENEA Challenge 2023: A large scale evaluation of gesture generation models in monadic and dyadic settings

    Authors: Taras Kucherenko, Rajmund Nagy, Youngwoo Yoon, Jieyeon Woo, Teodor Nikolov, Mihail Tsakov, Gustav Eje Henter

    Abstract: This paper reports on the GENEA Challenge 2023, in which participating teams built speech-driven gesture-generation systems using the same speech and motion dataset, followed by a joint evaluation. This year's challenge provided data on both sides of a dyadic interaction, allowing teams to generate full-body motion for an agent given its speech (text and audio) and the speech and motion of the int… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: The first three authors made equal contributions. Accepted for publication at the ACM International Conference on Multimodal Interaction (ICMI)

    ACM Class: I.3; I.2

  4. arXiv:2303.08737  [pdf, other

    cs.HC cs.LG cs.MM

    Evaluating gesture generation in a large-scale open challenge: The GENEA Challenge 2022

    Authors: Taras Kucherenko, Pieter Wolfert, Youngwoo Yoon, Carla Viegas, Teodor Nikolov, Mihail Tsakov, Gustav Eje Henter

    Abstract: This paper reports on the second GENEA Challenge to benchmark data-driven automatic co-speech gesture generation. Participating teams used the same speech and motion dataset to build gesture-generation systems. Motion generated by all these systems was rendered to video using a standardised visualisation pipeline and evaluated in several large, crowdsourced user studies. Unlike when comparing diff… ▽ More

    Submitted 28 March, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: The first three authors made equal contributions and share joint first authorship. Accepted for publication in the ACM Transactions on Graphics (TOG).Please see https://youngwoo-yoon.github.io/GENEAchallenge2022/ for all challenge materials. arXiv admin note: text overlap with arXiv:2208.10441

    ACM Class: I.3; I.2

  5. arXiv:2302.14839  [pdf, ps, other

    math.DS

    Asymptotic behavior of the pressure function for Hölder potentials

    Authors: Tamara Kucherenko, Anthony Quas

    Abstract: We study the behavior of the pressure function for Hölder continuous potentials on mixing subshifts of finite type. The classical theory of thermodynamic formalism shows that such pressure functions are convex, analytic and have slant asymptotes. We provide a sharp exponential lower bound on how fast the pressure function approaches its asymptotes. As a counterpart, we also show that there is no c… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

    MSC Class: 37D35; 37B10; 37A60

  6. arXiv:2301.05339  [pdf, other

    cs.GR cs.CV cs.HC cs.LG

    A Comprehensive Review of Data-Driven Co-Speech Gesture Generation

    Authors: Simbarashe Nyatsanga, Taras Kucherenko, Chaitanya Ahuja, Gustav Eje Henter, Michael Neff

    Abstract: Gestures that accompany speech are an essential part of natural and efficient embodied human communication. The automatic generation of such co-speech gestures is a long-standing problem in computer animation and is considered an enabling technology in film, games, virtual social spaces, and for interaction with social robots. The problem is made challenging by the idiosyncratic and non-periodic n… ▽ More

    Submitted 10 April, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: Accepted for EUROGRAPHICS 2023

    ACM Class: I.3.7

  7. Evaluating Data-Driven Co-Speech Gestures of Embodied Conversational Agents through Real-Time Interaction

    Authors: Yuan He, André Pereira, Taras Kucherenko

    Abstract: Embodied Conversational Agents that make use of co-speech gestures can enhance human-machine interactions in many ways. In recent years, data-driven gesture generation approaches for ECAs have attracted considerable research attention, and related methods have continuously improved. Real-time interaction is typically used when researchers evaluate ECA systems that generate rule-based gestures. How… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: Published at the International Conference on Intelligent Virtual Agents

  8. arXiv:2208.10441  [pdf, other

    cs.HC cs.GR cs.LG cs.MM cs.SD eess.AS

    The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation

    Authors: Youngwoo Yoon, Pieter Wolfert, Taras Kucherenko, Carla Viegas, Teodor Nikolov, Mihail Tsakov, Gustav Eje Henter

    Abstract: This paper reports on the second GENEA Challenge to benchmark data-driven automatic co-speech gesture generation. Participating teams used the same speech and motion dataset to build gesture-generation systems. Motion generated by all these systems was rendered to video using a standardised visualisation pipeline and evaluated in several large, crowdsourced user studies. Unlike when comparing diff… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: 12 pages, 5 figures; final version for ACM ICMI 2022

    ACM Class: I.3; I.2

  9. arXiv:2108.05762  [pdf, other

    cs.HC cs.LG cs.MM

    Multimodal analysis of the predictability of hand-gesture properties

    Authors: Taras Kucherenko, Rajmund Nagy, Michael Neff, Hedvig Kjellström, Gustav Eje Henter

    Abstract: Embodied conversational agents benefit from being able to accompany their speech with gestures. Although many data-driven approaches to gesture generation have been proposed in recent years, it is still unclear whether such systems can consistently generate gestures that convey meaning. We investigate which gesture properties (phase, category, and semantics) can be predicted from speech text and/o… ▽ More

    Submitted 14 January, 2022; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: Accepted at the International Conference on Autonomous Agents and Multiagent Systems (AAMAS) 2022

  10. arXiv:2108.05709  [pdf, other

    cs.HC

    To Rate or Not To Rate: Investigating Evaluation Methods for Generated Co-Speech Gestures

    Authors: Pieter Wolfert, Jeffrey M. Girard, Taras Kucherenko, Tony Belpaeme

    Abstract: While automatic performance metrics are crucial for machine learning of artificial human-like behaviour, the gold standard for evaluation remains human judgement. The subjective evaluation of artificial human-like behaviour in embodied conversational agents is however expensive and little is known about the quality of the data it returns. Two approaches to subjective evaluation can be largely dist… ▽ More

    Submitted 13 August, 2021; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: accepted for publication at International Conference for Multimodal Interaction (ICMI'21)

  11. Flexibility of the Pressure Function

    Authors: Tamara Kucherenko, Anthony Quas

    Abstract: We study the flexibility of the pressure function of a continuous potential (observable) with respect to a parameter regarded as the inverse temperature. The points of non-differentiability of this function are of particular interest in statistical physics, since they correspond to phase transitions. It is well known that the pressure function is convex, Lipschitz, and has an asymptote at infinity… ▽ More

    Submitted 28 February, 2023; v1 submitted 1 August, 2021; originally announced August 2021.

    MSC Class: 37A60; 37B10; 37D35

  12. arXiv:2106.14736  [pdf, other

    cs.HC cs.CV cs.GR cs.LG

    Speech2Properties2Gestures: Gesture-Property Prediction as a Tool for Generating Representational Gestures from Speech

    Authors: Taras Kucherenko, Rajmund Nagy, Patrik Jonell, Michael Neff, Hedvig Kjellström, Gustav Eje Henter

    Abstract: We propose a new framework for gesture generation, aiming to allow data-driven approaches to produce more semantically rich gestures. Our approach first predicts whether to gesture, followed by a prediction of the gesture properties. Those properties are then used as conditioning for a modern probabilistic gesture-generation model capable of high-quality output. This empowers the approach to gener… ▽ More

    Submitted 13 August, 2021; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: Accepted for publication at the ACM International Conference on Intelligent Virtual Agents (IVA 2021)

    ACM Class: I.2.7; I.2.6; I.3.7

    Journal ref: International Conference on Intelligent Virtual Agents 2021

  13. arXiv:2102.12302  [pdf, other

    cs.HC cs.GR cs.LG

    A Framework for Integrating Gesture Generation Models into Interactive Conversational Agents

    Authors: Rajmund Nagy, Taras Kucherenko, Birger Moell, André Pereira, Hedvig Kjellström, Ulysses Bernardet

    Abstract: Embodied conversational agents (ECAs) benefit from non-verbal behavior for natural and efficient interaction with users. Gesticulation - hand and arm movements accompanying speech - is an essential part of non-verbal behavior. Gesture generation models have been developed for several decades: starting with rule-based and ending with mainly data-driven methods. To date, recent end-to-end gesture ge… ▽ More

    Submitted 24 February, 2021; originally announced February 2021.

    Comments: Rajmund Nagy and Taras Kucherenko contributed equally to this work. To be published in the Proceedings of the 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2021), Online, May 3-7, 2021, IFAA-MAS, 3 pages, 1 figure

  14. arXiv:2102.11617  [pdf, other

    cs.HC cs.GR cs.MM

    A large, crowdsourced evaluation of gesture generation systems on common data: The GENEA Challenge 2020

    Authors: Taras Kucherenko, Patrik Jonell, Youngwoo Yoon, Pieter Wolfert, Gustav Eje Henter

    Abstract: Co-speech gestures, gestures that accompany speech, play an important role in human communication. Automatic co-speech gesture generation is thus a key enabling technology for embodied conversational agents (ECAs), since humans expect ECAs to be capable of multi-modal communication. Research into gesture generation is rapidly gravitating towards data-driven methods. Unfortunately, individual resea… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: Accepted for publication at the 26th International Conference on Intelligent User Interfaces (IUI'21). 11 pages, 5 figures

    ACM Class: I.3; I.2

  15. HEMVIP: Human Evaluation of Multiple Videos in Parallel

    Authors: Patrik Jonell, Youngwoo Yoon, Pieter Wolfert, Taras Kucherenko, Gustav Eje Henter

    Abstract: In many research areas, for example motion and gesture generation, objective measures alone do not provide an accurate impression of key stimulus traits such as perceived quality or appropriateness. The gold standard is instead to evaluate these aspects through user studies, especially subjective evaluations of video stimuli. Common evaluation paradigms either present individual stimuli to be scor… ▽ More

    Submitted 20 October, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

    Comments: 6 pages, 1 figures. Proceedings of the 22th ACM International Conference on Multimodal Interaction. 2021. Montreal, Canada

  16. arXiv:2101.05684  [pdf, other

    cs.LG cs.GR cs.SD eess.AS

    Generating coherent spontaneous speech and gesture from text

    Authors: Simon Alexanderson, Éva Székely, Gustav Eje Henter, Taras Kucherenko, Jonas Beskow

    Abstract: Embodied human communication encompasses both verbal (speech) and non-verbal information (e.g., gesture and head movements). Recent advances in machine learning have substantially improved the technologies for generating synthetic versions of both of these types of data: On the speech side, text-to-speech systems are now able to generate highly convincing, spontaneous-sounding speech using unscrip… ▽ More

    Submitted 14 January, 2021; originally announced January 2021.

    Comments: 3 pages, 2 figures, published at the ACM International Conference on Intelligent Virtual Agents (IVA) 2020

    MSC Class: 68T07 ACM Class: I.2.6; J.4; I.3.7; I.2.9

    Journal ref: Proceedings of the 20th ACM International Conference on Intelligent Virtual Agents (IVA '20), 2020, 3 pages

  17. Can we trust online crowdworkers? Comparing online and offline participants in a preference test of virtual agents

    Authors: Patrik Jonell, Taras Kucherenko, Ilaria Torre, Jonas Beskow

    Abstract: Conducting user studies is a crucial component in many scientific fields. While some studies require participants to be physically present, other studies can be conducted both physically (e.g. in-lab) and online (e.g. via crowdsourcing). Inviting participants to the lab can be a time-consuming and logistically difficult endeavor, not to mention that sometimes research groups might not be able to r… ▽ More

    Submitted 23 October, 2020; v1 submitted 22 September, 2020; originally announced September 2020.

    Comments: Patrik Jonell and Taras Kucherenko contributed equally to this work. Published at the Proceedings of the 20th ACM International Conference on Intelligent Virtual Agent. 8 pages, 7 figures

  18. arXiv:2007.09170  [pdf, other

    cs.CV cs.GR cs.HC cs.LG

    Moving fast and slow: Analysis of representations and post-processing in speech-driven automatic gesture generation

    Authors: Taras Kucherenko, Dai Hasegawa, Naoshi Kaneko, Gustav Eje Henter, Hedvig Kjellström

    Abstract: This paper presents a novel framework for speech-driven gesture production, applicable to virtual agents to enhance human-computer interaction. Specifically, we extend recent deep-learning-based, data-driven methods for speech-driven gesture generation by incorporating representation learning. Our model takes speech as input and produces gestures as output, in the form of a sequence of 3D coordina… ▽ More

    Submitted 28 January, 2021; v1 submitted 16 July, 2020; originally announced July 2020.

    Comments: Extension of our IVA'19 paper. Accepted at the International Journal of Human-Computer Interaction. See more at https://svito-zar.github.io/audio2gestures/. arXiv admin note: substantial text overlap with arXiv:1903.03369

    ACM Class: I.2.7; I.2.6; I.3.7

    Journal ref: Int. J. Hum. Comput.Interact.(2021)

  19. arXiv:2006.13988  [pdf, ps, other

    math.DS

    Multiple phase transitions on compact symbolic systems

    Authors: Tamara Kucherenko, Anthony Quas, Christian Wolf

    Abstract: Let $φ:X\to \mathbb R$ be a continuous potential associated with a symbolic dynamical system $T:X\to X$ over a finite alphabet. Introducing a parameter $β>0$ (interpreted as the inverse temperature) we study the regularity of the pressure function $β\mapsto P_{\rm top}(βφ)$ on an interval $[α,\infty)$ with $α>0$. We say that $φ$ has a phase transition at $β_0$ if the pressure function… ▽ More

    Submitted 5 September, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

    Comments: In this update we present a revised version of the main theorem which now only deals with phase transitions in an interval $[α,\infty)$ for some fixed $α>0.$

    MSC Class: 37A60; 37B10; 37D35

  20. arXiv:2006.09888  [pdf, other

    cs.CV cs.HC cs.LG cs.SD eess.AS eess.IV stat.ML

    Let's Face It: Probabilistic Multi-modal Interlocutor-aware Generation of Facial Gestures in Dyadic Settings

    Authors: Patrik Jonell, Taras Kucherenko, Gustav Eje Henter, Jonas Beskow

    Abstract: To enable more natural face-to-face interactions, conversational agents need to adapt their behavior to their interlocutors. One key aspect of this is generation of appropriate non-verbal behavior for the agent, for example facial gestures, here defined as facial expressions and head movements. Most existing gesture-generating systems do not utilize multi-modal cues from the interlocutor when synt… ▽ More

    Submitted 22 October, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Best Paper Award. 8 pages, 4 figures, IVA '20: Proceedings of the 20th ACM International Conference on Intelligent Virtual Agent

  21. arXiv:2001.09326  [pdf, other

    cs.HC cs.LG eess.AS

    Gesticulator: A framework for semantically-aware speech-driven gesture generation

    Authors: Taras Kucherenko, Patrik Jonell, Sanne van Waveren, Gustav Eje Henter, Simon Alexanderson, Iolanda Leite, Hedvig Kjellström

    Abstract: During speech, people spontaneously gesticulate, which plays a key role in conveying information. Similarly, realistic co-speech gestures are crucial to enable natural and smooth interactions with social agents. Current end-to-end co-speech gesture generation systems use a single modality for representing speech: either audio or text. These systems are therefore confined to producing either acoust… ▽ More

    Submitted 14 January, 2021; v1 submitted 25 January, 2020; originally announced January 2020.

    Comments: ICMI 2020 Best Paper Award. Code is available. 9 pages, 6 figures

    ACM Class: I.2.7; I.2.6; I.3.7

    Journal ref: Proceedings of the 2020 International Conference on Multimodal Interaction (ICMI '20)

  22. arXiv:1909.07317  [pdf, ps, other

    math.DS

    Measures of maximal entropy on subsystems of topological suspension semi-flows

    Authors: Tamara Kucherenko, Daniel J. Thompson

    Abstract: Given a compact topological dynamical system (X, f) with positive entropy and upper semi-continuous entropy map, and any closed invariant subset $Y \subset X$ with positive entropy, we show that there exists a continuous roof function such that the set of measures of maximal entropy for the suspension semi-flow over (X,f) consists precisely of the lifts of measures which maximize entropy on Y. Thi… ▽ More

    Submitted 27 January, 2021; v1 submitted 16 September, 2019; originally announced September 2019.

    Comments: v3: 10 pages. Corrected some typos. To appear in Studia Mathematica

  23. Analyzing Input and Output Representations for Speech-Driven Gesture Generation

    Authors: Taras Kucherenko, Dai Hasegawa, Gustav Eje Henter, Naoshi Kaneko, Hedvig Kjellström

    Abstract: This paper presents a novel framework for automatic speech-driven gesture generation, applicable to human-agent interaction including both virtual agents and robots. Specifically, we extend recent deep-learning-based, data-driven methods for speech-driven gesture generation by incorporating representation learning. Our model takes speech as input and produces gestures as output, in the form of a s… ▽ More

    Submitted 11 June, 2019; v1 submitted 8 March, 2019; originally announced March 2019.

    Comments: Accepted at IVA '19. Shorter version published at AAMAS '19. The code is available at https://github.com/GestureGeneration/Speech_driven_gesture_generation_with_autoencoder

    ACM Class: I.2.6; I.5.1; J.4

  24. arXiv:1803.02665  [pdf, other

    cs.LG

    A Neural Network Approach to Missing Marker Reconstruction in Human Motion Capture

    Authors: Taras Kucherenko, Jonas Beskow, Hedvig Kjellström

    Abstract: Optical motion capture systems have become a widely used technology in various fields, such as augmented reality, robotics, movie production, etc. Such systems use a large number of cameras to triangulate the position of optical markers.The marker positions are estimated with high accuracy. However, especially when tracking articulated bodies, a fraction of the markers in each timestep is missing… ▽ More

    Submitted 25 September, 2018; v1 submitted 7 March, 2018; originally announced March 2018.

    Comments: 7 pages, 6 figures

    MSC Class: 68T05

  25. arXiv:1709.01613  [pdf, other

    cs.HC cs.AI cs.CY

    Machine Learning and Social Robotics for Detecting Early Signs of Dementia

    Authors: Patrik Jonell, Joseph Mendelson, Thomas Storskog, Goran Hagman, Per Ostberg, Iolanda Leite, Taras Kucherenko, Olga Mikheeva, Ulrika Akenine, Vesna Jelic, Alina Solomon, Jonas Beskow, Joakim Gustafson, Miia Kivipelto, Hedvig Kjellstrom

    Abstract: This paper presents the EACare project, an ambitious multi-disciplinary collaboration with the aim to develop an embodied system, capable of carrying out neuropsychological tests to detect early signs of dementia, e.g., due to Alzheimer's disease. The system will use methods from Machine Learning and Social Robotics, and be trained with examples of recorded clinician-patient interactions. The inte… ▽ More

    Submitted 5 September, 2017; originally announced September 2017.

  26. arXiv:1708.00550  [pdf, ps, other

    math.DS

    Measures of maximal entropy for suspension flows over the full shift

    Authors: Tamara Kucherenko, Daniel J. Thompson

    Abstract: We consider suspension flows with continuous roof function over the full shift $Σ$ on a finite alphabet. For any positive entropy subshift of finite type $Y \subset Σ$, we explictly construct a roof function such that the measure(s) of maximal entropy for the suspension flow over $Σ$ are exactly the lifts of the measure(s) of maximal entropy for $Y$. In the case when $Y$ is transitive, this gives… ▽ More

    Submitted 12 March, 2019; v1 submitted 1 August, 2017; originally announced August 2017.

    Comments: 13 pages, v3: minor revisions. To appear in Mathematische Zeitschrift

    MSC Class: 37D35; 37B10; 37A35

  27. arXiv:1604.06512  [pdf, ps, other

    math.DS

    Ground States and Zero-Temperature Measures at the Boundary of Rotation Sets

    Authors: Tamara Kucherenko, Christian Wolf

    Abstract: We consider a continuous dynamical system $f:X\to X$ on a compact metric space $X$ equipped with an $m$-dimensional continuous potential $Φ=(φ_1,\cdots,φ_m):X\to \bR^m$. We study the set of ground states $ GS(α)$ of the potential $α\cdot Φ$ as a function of the direction vector $α\in S^{m-1}$. %We also study the corresponding rotation vectors $\rv(GS(α))$. We show that the structure of the ground… ▽ More

    Submitted 21 April, 2016; originally announced April 2016.

    Comments: 26 pages

    MSC Class: 37D35; 37E45 (Primary); 37B10; 37E45; 37L40 (Secondary)

  28. arXiv:1310.4030  [pdf, ps, other

    math.DS

    Localized topological pressure and equilibrium states

    Authors: Tamara Kucherenko, Christian Wolf

    Abstract: We introduce the notion of localized topological pressure for continuous maps on compact metric spaces. The localized pressure of a continuous potential $\varphi$ is computed by considering only those $(n,ε)$-separated sets whose statistical sums with respect to an $m$-dimensional potential $Φ$ are "close" to a given value $w\in \bR^m$. We then establish for several classes of systems and potentia… ▽ More

    Submitted 15 October, 2013; originally announced October 2013.

    MSC Class: 37C40; 37D35; 37A60

  29. arXiv:1210.0135  [pdf, ps, other

    math.DS

    Geometry and entropy of generalized rotation sets

    Authors: Tamara Kucherenko, Christian Wolf

    Abstract: For a continuous map $f$ on a compact metric space we study the geometry and entropy of the generalized rotation set $\R(Φ)$. Here $Φ=(φ_1,...,φ_m)$ is a $m$-dimensional continuous potential and $\R(Φ)$ is the set of all $μ$-integrals of $Φ$ and $μ$ runs over all $f$-invariant probability measures. It is easy to see that the rotation set is a compact and convex subset of $\bR^m$. We study the ques… ▽ More

    Submitted 29 September, 2012; originally announced October 2012.