Showing 1–13 of 13 results for author: Ringeval, F

Searching in archive cs.
  1. arXiv:2405.06728

    cs.HC

    THERADIA WoZ: An Ecological Corpus for Appraisal-based Affect Research in Healthcare

    Authors: Hippolyte Fournier, Sina Alisamir, Safaa Azzakhnini, Hanna Chainay, Olivier Koenig, Isabella Zsoldos, Eléonore Trân, Gérard Bailly, Frédéric Elisei, Béatrice Bouchot, Brice Varini, Patrick Constant, Joan Fruitet, Franck Tarpin-Bernard, Solange Rossato, François Portet, Fabien Ringeval

    Abstract: We present THERADIA WoZ, an ecological corpus designed for audiovisual research on affect in healthcare. Two groups of senior individuals, consisting of 52 healthy participants and 9 individuals with Mild Cognitive Impairment (MCI), performed Computerised Cognitive Training (CCT) exercises while receiving support from a virtual assistant, tele-operated by a human in the role of a Wizard-of-Oz (WoZ…

    Submitted 10 May, 2024; originally announced May 2024.

  2. arXiv:2401.05166

    cs.CV

    REACT 2024: the Second Multiple Appropriate Facial Reaction Generation Challenge

    Authors: Siyang Song, Micol Spitale, Cheng Luo, Cristina Palmero, German Barquero, Hengde Zhu, Sergio Escalera, Michel Valstar, Tobias Baur, Fabien Ringeval, Elisabeth Andre, Hatice Gunes

    Abstract: In dyadic interactions, humans communicate their intentions and state of mind using verbal and non-verbal cues, where multiple different facial reactions might be appropriate in response to a specific speaker behaviour. Then, how to develop a machine learning (ML) model that can automatically generate multiple appropriate, diverse, realistic and synchronised human facial reactions from a previous…

    Submitted 10 January, 2024; originally announced January 2024.

    MSC Class: 68T40

  3. arXiv:2310.16810

    cs.CL cs.AI

    Can GPT models Follow Human Summarization Guidelines? Evaluating ChatGPT and GPT-4 for Dialogue Summarization

    Authors: Yongxin Zhou, Fabien Ringeval, François Portet

    Abstract: This study explores the capabilities of prompt-driven Large Language Models (LLMs) like ChatGPT and GPT-4 in adhering to human guidelines for dialogue summarization. Experiments employed DialogSum (English social conversations) and DECODA (French call center interactions), testing various prompts: including prompts from existing literature and those from human summarization guidelines, as well as…

    Submitted 25 October, 2023; originally announced October 2023.

  4. arXiv:2309.05472

    cs.CL cs.AI cs.SD eess.AS

    LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech

    Authors: Titouan Parcollet, Ha Nguyen, Solene Evain, Marcely Zanon Boito, Adrien Pupier, Salima Mdhaffar, Hang Le, Sina Alisamir, Natalia Tomashenko, Marco Dinarelli, Shucong Zhang, Alexandre Allauzen, Maximin Coavoux, Yannick Esteve, Mickael Rouvier, Jerome Goulian, Benjamin Lecouteux, Francois Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier

    Abstract: Self-supervised learning (SSL) is at the origin of unprecedented improvements in many different domains including computer vision and natural language processing. Speech processing drastically benefitted from SSL as most of the current domain-related tasks are now being approached with pre-trained models. This work introduces LeBenchmark 2.0, an open-source framework for assessing and building SSL-…

    Submitted 18 March, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: Published in Computer Science and Language. Preprint allowed

  5. arXiv:2307.12371

    cs.CL

    PSentScore: Evaluating Sentiment Polarity in Dialogue Summarization

    Authors: Yongxin Zhou, Fabien Ringeval, François Portet

    Abstract: Automatic dialogue summarization is a well-established task with the goal of distilling the most crucial information from human conversations into concise textual summaries. However, most existing research has predominantly focused on summarizing factual information, neglecting the affective content, which can hold valuable insights for analyzing, monitoring, or facilitating human interactions. In…

    Submitted 3 May, 2024; v1 submitted 23 July, 2023; originally announced July 2023.

    Comments: LREC-COLING 2024, Torino (Italia), 20-25 May, 2024

  6. arXiv:2306.06583

    cs.CV

    REACT2023: the first Multi-modal Multiple Appropriate Facial Reaction Generation Challenge

    Authors: Siyang Song, Micol Spitale, Cheng Luo, German Barquero, Cristina Palmero, Sergio Escalera, Michel Valstar, Tobias Baur, Fabien Ringeval, Elisabeth Andre, Hatice Gunes

    Abstract: The Multi-modal Multiple Appropriate Facial Reaction Generation Challenge (REACT2023) is the first competition event focused on evaluating multimedia processing and machine learning techniques for generating human-appropriate facial reactions in various dyadic interaction scenarios, with all participants competing strictly under the same conditions. The goal of the challenge is to provide the firs…

    Submitted 11 June, 2023; originally announced June 2023.

    MSC Class: 68T40

  7. arXiv:2209.11061

    eess.AS cs.HC cs.LG

    Cross-domain Voice Activity Detection with Self-Supervised Representations

    Authors: Sina Alisamir, Fabien Ringeval, Francois Portet

    Abstract: Voice Activity Detection (VAD) aims at detecting speech segments on an audio signal, which is a necessary first step for many of today's speech-based applications. Current state-of-the-art methods focus on training a neural network exploiting features directly contained in the acoustics, such as Mel Filter Banks (MFBs). Such methods therefore require an extra normalisation step to adapt to a new doma…

    Submitted 22 September, 2022; originally announced September 2022.

  8. arXiv:2209.10223

    cs.SD cs.AI cs.HC eess.AS

    Dynamic Time-Alignment of Dimensional Annotations of Emotion using Recurrent Neural Networks

    Authors: Sina Alisamir, Fabien Ringeval, Francois Portet

    Abstract: Most automatic emotion recognition systems exploit time-continuous annotations of emotion to provide fine-grained descriptions of spontaneous expressions as observed in real-life interactions. As emotion is rather subjective, its annotation is usually performed by several annotators who provide a trace for a given dimension, i.e. a time-continuous series describing a dimension such as arousal or v…

    Submitted 21 September, 2022; originally announced September 2022.

  9. arXiv:2207.08305

    cs.CL cs.AI

    Effectiveness of French Language Models on Abstractive Dialogue Summarization Task

    Authors: Yongxin Zhou, François Portet, Fabien Ringeval

    Abstract: Pre-trained language models have established the state-of-the-art on various natural language processing tasks, including dialogue summarization, which allows the reader to quickly access key information from long conversations in meetings, interviews or phone calls. However, such dialogues are still difficult to handle with current models because the spontaneity of the language involves expressio…

    Submitted 17 July, 2022; originally announced July 2022.

    Comments: LREC 2022, Marseille, France, 21-23 June 2022

  10. LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech

    Authors: Solene Evain, Ha Nguyen, Hang Le, Marcely Zanon Boito, Salima Mdhaffar, Sina Alisamir, Ziyi Tong, Natalia Tomashenko, Marco Dinarelli, Titouan Parcollet, Alexandre Allauzen, Yannick Esteve, Benjamin Lecouteux, Francois Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier

    Abstract: Self-Supervised Learning (SSL) using huge unlabeled data has been successfully explored for image and natural language processing. Recent works also investigated SSL from speech. They were notably successful in improving performance on downstream tasks such as automatic speech recognition (ASR). While these works suggest it is possible to reduce dependence on labeled data for building efficient spee…

    Submitted 10 June, 2021; v1 submitted 23 April, 2021; originally announced April 2021.

    Comments: Will be presented at Interspeech 2021

    Journal ref: Proc. Interspeech 2021

  11. arXiv:1907.11510

    cs.HC cs.CV cs.IR cs.LG stat.ML

    AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression with AI, and Cross-Cultural Affect Recognition

    Authors: Fabien Ringeval, Björn Schuller, Michel Valstar, Nicholas Cummins, Roddy Cowie, Leili Tavabi, Maximilian Schmitt, Sina Alisamir, Shahin Amiriparian, Eva-Maria Messner, Siyang Song, Shuo Liu, Ziping Zhao, Adria Mallol-Ragolta, Zhao Ren, Mohammad Soleymani, Maja Pantic

    Abstract: The Audio/Visual Emotion Challenge and Workshop (AVEC 2019) "State-of-Mind, Detecting Depression with AI, and Cross-cultural Affect Recognition" is the ninth competition event aimed at the comparison of multimedia processing and machine learning methods for automatic audiovisual health and emotion analysis, with all participants competing strictly under the same conditions. The goal of the Challen…

    Submitted 10 July, 2019; originally announced July 2019.

  12. SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild

    Authors: Jean Kossaifi, Robert Walecki, Yannis Panagakis, Jie Shen, Maximilian Schmitt, Fabien Ringeval, Jing Han, Vedhas Pandit, Antoine Toisoul, Bjorn Schuller, Kam Star, Elnar Hajiyev, Maja Pantic

    Abstract: Natural human-computer interaction and audio-visual human behaviour sensing systems, which would achieve robust performance in-the-wild, are more needed than ever as digital devices are increasingly becoming an indispensable part of our life. Accurately annotated real-world data are the crux in devising such systems. However, existing databases usually consider controlled settings, low demographic…

    Submitted 18 November, 2019; v1 submitted 9 January, 2019; originally announced January 2019.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019

  13. arXiv:1605.01600

    cs.CV cs.HC cs.MM

    AVEC 2016 - Depression, Mood, and Emotion Recognition Workshop and Challenge

    Authors: Michel Valstar, Jonathan Gratch, Bjorn Schuller, Fabien Ringeval, Denis Lalanne, Mercedes Torres Torres, Stefan Scherer, Guiota Stratou, Roddy Cowie, Maja Pantic

    Abstract: The Audio/Visual Emotion Challenge and Workshop (AVEC 2016) "Depression, Mood and Emotion" will be the sixth competition event aimed at comparison of multimedia processing and machine learning methods for automatic audio, visual and physiological depression and emotion analysis, with all participants competing under strictly the same conditions. The goal of the Challenge is to provide a common ben…

    Submitted 22 November, 2016; v1 submitted 5 May, 2016; originally announced May 2016.

    Comments: Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge, AVEC'16, co-located with the 24th ACM International Conference on Multimedia, MM 2016, pages 3-10, Amsterdam, The Netherlands, October 2016. ACM