Skip to main content

Showing 1–4 of 4 results for author: Padi, S

.
  1. arXiv:2312.03756  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    LineConGraphs: Line Conversation Graphs for Effective Emotion Recognition using Graph Neural Networks

    Authors: Gokul S Krishnan, Sarala Padi, Craig S. Greenberg, Balaraman Ravindran, Dinesh Manoch, Ram D. Sriram

    Abstract: Emotion Recognition in Conversations (ERC) is a critical aspect of affective computing, and it has many practical applications in healthcare, education, chatbots, and social media platforms. Earlier approaches for ERC analysis involved modeling both speaker and long-term contextual information using graph neural network architectures. However, it is ideal to deploy speaker-independent models for r… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 13 pages, 6 figures

  2. arXiv:2202.08974  [pdf, other

    cs.SD cs.HC cs.LG cs.RO eess.AS

    Multimodal Emotion Recognition using Transfer Learning from Speaker Recognition and BERT-based models

    Authors: Sarala Padi, Seyed Omid Sadjadi, Dinesh Manocha, Ram D. Sriram

    Abstract: Automatic emotion recognition plays a key role in computer-human interaction as it has the potential to enrich the next-generation artificial intelligence with emotional intelligence. It finds applications in customer and/or representative behavior analysis in call centers, gaming, personal assistants, and social robots, to mention a few. Therefore, there has been an increasing demand to develop r… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2108.02510

  3. arXiv:2108.02510  [pdf, other

    cs.SD cs.AI cs.HC eess.AS

    Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation

    Authors: Sarala Padi, Seyed Omid Sadjadi, Dinesh Manocha, Ram D. Sriram

    Abstract: Automatic speech emotion recognition (SER) is a challenging task that plays a crucial role in natural human-computer interaction. One of the main challenges in SER is data scarcity, i.e., insufficient amounts of carefully labeled data to build and fully explore complex deep learning models for emotion classification. This paper aims to address this challenge using a transfer learning strategy comb… ▽ More

    Submitted 16 August, 2021; v1 submitted 5 August, 2021; originally announced August 2021.

    Comments: Accepted at ACM/SIGCHI ICMI'21

  4. arXiv:2010.09895  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    Multi-Window Data Augmentation Approach for Speech Emotion Recognition

    Authors: Sarala Padi, Dinesh Manocha, Ram D. Sriram

    Abstract: We present a Multi-Window Data Augmentation (MWA-SER) approach for speech emotion recognition. MWA-SER is a unimodal approach that focuses on two key concepts; designing the speech augmentation method and building the deep learning model to recognize the underlying emotion of an audio signal. Our proposed multi-window augmentation approach generates additional data samples from the speech signal b… ▽ More

    Submitted 15 February, 2022; v1 submitted 19 October, 2020; originally announced October 2020.