Showing 1–24 of 24 results for author: Provost, E M

  1. arXiv:2312.10518  [pdf, other]

    cs.SD cs.AI eess.AS

    Seq2seq for Automatic Paraphasia Detection in Aphasic Speech

    Authors: Matthew Perez, Duc Le, Amrit Romana, Elise Jones, Keli Licata, Emily Mower Provost

    Abstract: Paraphasias are speech errors that are often characteristic of aphasia and they represent an important signal in assessing disease severity and subtype. Traditionally, clinicians manually identify paraphasias by transcribing and analyzing speech-language samples, which can be a time-consuming and burdensome process. Identifying paraphasias automatically can greatly help clinicians with the transcr…

    Submitted 16 December, 2023; originally announced December 2023.

  2. arXiv:2311.00867  [pdf, other]

    eess.AS cs.CL

    Automatic Disfluency Detection from Untranscribed Speech

    Authors: Amrit Romana, Kazuhito Koishida, Emily Mower Provost

    Abstract: Speech disfluencies, such as filled pauses or repetitions, are disruptions in the typical flow of speech. Stuttering is a speech disorder characterized by a high rate of disfluencies, but all individuals speak with some disfluencies and the rates of disfluencies may be increased by factors such as cognitive load. Clinically, automatic disfluency detection may help in treatment planning for individ…

    Submitted 1 November, 2023; originally announced November 2023.

  3. Articulatory Coordination for Speech Motor Tracking in Huntington Disease

    Authors: Matthew Perez, Amrit Romana, Angela Roberts, Noelle Carlozzi, Jennifer Ann Miner, Praveen Dayalu, Emily Mower Provost

    Abstract: Huntington Disease (HD) is a progressive disorder which often manifests in motor impairment. Motor severity (captured via motor score) is a key component in assessing overall HD severity. However, motor score evaluation involves in-clinic visits with a trained medical professional, which are expensive and not always accessible. Speech analysis provides an attractive avenue for tracking HD severity…

    Submitted 28 September, 2021; originally announced September 2021.

  4. arXiv:2109.04316  [pdf, other]

    cs.LG cs.AI cs.HC

    Accounting for Variations in Speech Emotion Recognition with Nonparametric Hierarchical Neural Network

    Authors: Lance Ying, Amrit Romana, Emily Mower Provost

    Abstract: In recent years, deep-learning-based speech emotion recognition models have outperformed classical machine learning models. Previously, neural network designs, such as Multitask Learning, have accounted for variations in emotional expressions due to demographic and contextual factors. However, existing models face a few constraints: 1) they rely on a clear definition of domains (e.g. gender, noise…

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: 9 pages, manuscript under peer review

  5. arXiv:2104.08806  [pdf, other]

    cs.SD cs.LG eess.AS

    Best Practices for Noise-Based Augmentation to Improve the Performance of Deployable Speech-Based Emotion Recognition Systems

    Authors: Mimansa Jaiswal, Emily Mower Provost

    Abstract: Speech emotion recognition is an important component of any human centered system. But speech characteristics produced and perceived by a person can be influenced by a multitude of reasons, both desirable such as emotion, and undesirable such as noise. To train robust emotion recognition models, we need a large, yet realistic data distribution, but emotion datasets are often small and hence are au…

    Submitted 31 August, 2023; v1 submitted 18 April, 2021; originally announced April 2021.

  6. arXiv:2104.08792  [pdf, other]

    cs.CL cs.HC

    Human-Imitating Metrics for Training and Evaluating Privacy Preserving Emotion Recognition Models Using Sociolinguistic Knowledge

    Authors: Mimansa Jaiswal, Emily Mower Provost

    Abstract: Privacy preservation is a crucial component of any real-world application. But, in applications relying on machine learning backends, privacy is challenging because models often capture more than what the model was initially trained for, resulting in the potential leakage of sensitive information. In this paper, we propose an automatic and quantifiable metric that allows us to evaluate humans' per…

    Submitted 4 October, 2021; v1 submitted 18 April, 2021; originally announced April 2021.

  7. arXiv:2010.11226  [pdf, ps, other]

    cs.SD cs.LG eess.AS

    Dynamic Layer Customization for Noise Robust Speech Emotion Recognition in Heterogeneous Condition Training

    Authors: Alex Wilf, Emily Mower Provost

    Abstract: Robustness to environmental noise is important to creating automatic speech emotion recognition systems that are deployable in the real world. Prior work on noise robustness has assumed that systems would not make use of sample-by-sample training noise conditions, or that they would have access to unlabelled testing data to generalize across noise conditions. We avoid these assumptions and introdu…

    Submitted 21 October, 2020; originally announced October 2020.

  8. arXiv:2010.08503  [pdf, other]

    eess.AS cs.SD

    Classification of Manifest Huntington Disease using Vowel Distortion Measures

    Authors: Amrit Romana, John Bandon, Noelle Carlozzi, Angela Roberts, Emily Mower Provost

    Abstract: Huntington disease (HD) is a fatal autosomal dominant neurocognitive disorder that causes cognitive disturbances, neuropsychiatric symptoms, and impaired motor abilities (e.g., gait, speech, voice). Due to its progressive nature, HD treatment requires ongoing clinical monitoring of symptoms. Individuals with the gene mutation which causes HD may exhibit a range of speech symptoms as they progress…

    Submitted 19 October, 2020; v1 submitted 16 October, 2020; originally announced October 2020.

  9. arXiv:2009.04008  [pdf, other]

    cs.CL cs.SI

    Quantifying the Effects of COVID-19 on Mental Health Support Forums

    Authors: Laura Biester, Katie Matton, Janarthanan Rajendran, Emily Mower Provost, Rada Mihalcea

    Abstract: The COVID-19 pandemic, like many of the disease outbreaks that have preceded it, is likely to have a profound effect on mental health. Understanding its impact can inform strategies for mitigating negative consequences. In this work, we seek to better understand the effects of COVID-19 on mental health by examining discussions within mental health support communities on Reddit. First, we quantify…

    Submitted 8 September, 2020; originally announced September 2020.

  10. Aphasic Speech Recognition using a Mixture of Speech Intelligibility Experts

    Authors: Matthew Perez, Zakaria Aldeneh, Emily Mower Provost

    Abstract: Robust speech recognition is a key prerequisite for semantic feature extraction in automatic aphasic speech analysis. However, standard one-size-fits-all automatic speech recognition models perform poorly when applied to aphasic speech. One reason for this is the wide range of speech intelligibility due to different levels of severity (i.e., higher severity lends itself to less intelligible speech…

    Submitted 24 August, 2020; originally announced August 2020.

    Comments: 4 pages

  11. Classification of Huntington Disease using Acoustic and Lexical Features

    Authors: Matthew Perez, Wenyu **, Duc Le, Noelle Carlozzi, Praveen Dayalu, Angela Roberts, Emily Mower Provost

    Abstract: Speech is a critical biomarker for Huntington Disease (HD), with changes in speech increasing in severity as the disease progresses. Speech analyses are currently conducted using either transcriptions created manually by trained professionals or using global rating scales. Manual transcription is both expensive and time-consuming and global rating scales may lack sufficient sensitivity and fidelit…

    Submitted 7 August, 2020; originally announced August 2020.

    Comments: 4 pages

  12. arXiv:1910.13212  [pdf, other]

    cs.LG cs.CL cs.SD eess.AS stat.ML

    Privacy Enhanced Multimodal Neural Representations for Emotion Recognition

    Authors: Mimansa Jaiswal, Emily Mower Provost

    Abstract: Many mobile applications and virtual conversational agents now aim to recognize and adapt to emotions. To enable this, data are transmitted from users' devices and stored on central servers. Yet, these data contain sensitive information that could be used by mobile applications without user's consent or, maliciously, by an eavesdropping adversary. In this work, we show how multimodal representatio…

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: 8 pages

  13. arXiv:1910.05115  [pdf, ps, other]

    eess.AS cs.SD q-bio.NC

    Identifying Mood Episodes Using Dialogue Features from Clinical Interviews

    Authors: Zakaria Aldeneh, Mimansa Jaiswal, Michael Picheny, Melvin McInnis, Emily Mower Provost

    Abstract: Bipolar disorder, a severe chronic mental illness characterized by pathological mood swings from depression to mania, requires ongoing symptom severity tracking to both guide and measure treatments that are critical for maintaining long-term health. Mental health professionals assess symptom severity through semi-structured clinical interviews. During these interviews, they observe their patients'…

    Submitted 24 March, 2022; v1 submitted 28 September, 2019; originally announced October 2019.

  14. arXiv:1909.11248  [pdf]

    cs.LG cs.HC stat.ML

    When to Intervene: Detecting Abnormal Mood using Everyday Smartphone Conversations

    Authors: John Gideon, Katie Matton, Steve Anderau, Melvin G McInnis, Emily Mower Provost

    Abstract: Bipolar disorder (BPD) is a chronic mental illness characterized by extreme mood and energy changes from mania to depression. These changes drive behaviors that often lead to devastating personal or social consequences. BPD is managed clinically with regular interactions with care providers, who assess mood, energy levels, and the form and content of speech. Recent work has proposed smartphones fo…

    Submitted 2 October, 2019; v1 submitted 24 September, 2019; originally announced September 2019.

    Comments: Submitted to IEEE Transactions on Affective Computing

  15. arXiv:1909.00360  [pdf, other]

    cs.HC cs.AI cs.LG

    The Ambiguous World of Emotion Representation

    Authors: Vidhyasaharan Sethu, Emily Mower Provost, Julien Epps, Carlos Busso, Nicholas Cummins, Shrikanth Narayanan

    Abstract: Artificial intelligence and machine learning systems have demonstrated huge improvements and human-level parity in a range of activities, including speech recognition, face recognition and speaker verification. However, these diverse tasks share a key commonality that is not true in affective computing: the ground truth information that is inferred can be unambiguously represented. This observatio…

    Submitted 1 September, 2019; originally announced September 2019.

  16. arXiv:1908.08979  [pdf, other]

    cs.LG cs.CL cs.SD eess.AS stat.ML

    Controlling for Confounders in Multimodal Emotion Classification via Adversarial Learning

    Authors: Mimansa Jaiswal, Zakaria Aldeneh, Emily Mower Provost

    Abstract: Various psychological factors affect how individuals express emotions. Yet, when we collect data intended for use in building emotion recognition systems, we often try to do so by creating paradigms that are designed just with a focus on eliciting emotional behavior. Algorithms trained with these types of data are unlikely to function outside of controlled environments because our emotions natural…

    Submitted 23 August, 2019; originally announced August 2019.

    Comments: 10 pages, ICMI 2019

  17. arXiv:1907.03050  [pdf, other]

    cs.LG cs.HC eess.AS stat.ML

    Jointly Aligning and Predicting Continuous Emotion Annotations

    Authors: Soheil Khorram, Melvin G McInnis, Emily Mower Provost

    Abstract: Time-continuous dimensional descriptions of emotions (e.g., arousal, valence) allow researchers to characterize short-time changes and to capture long-term trends in emotion expression. However, continuous emotion labels are generally not synchronized with the input speech signal due to delays caused by reaction-time, which is inherent in human evaluations. To deal with this challenge, we introduc…

    Submitted 18 July, 2019; v1 submitted 5 July, 2019; originally announced July 2019.

    Comments: IEEE Transactions on Affective Computing

  18. arXiv:1903.12094  [pdf]

    cs.LG cs.SD eess.AS stat.ML

    Improving Cross-Corpus Speech Emotion Recognition with Adversarial Discriminative Domain Generalization (ADDoG)

    Authors: John Gideon, Melvin G McInnis, Emily Mower Provost

    Abstract: Automatic speech emotion recognition provides computers with critical context to enable user understanding. While methods trained and tested within the same dataset have been shown successful, they often fail when applied to unseen datasets. To address this, recent work has focused on adversarial methods to find more generalized representations of emotional speech. However, many of these methods h…

    Submitted 3 November, 2019; v1 submitted 28 March, 2019; originally announced March 2019.

  19. arXiv:1903.11672  [pdf, other]

    cs.SD cs.HC cs.LG eess.AS

    MuSE-ing on the Impact of Utterance Ordering On Crowdsourced Emotion Annotations

    Authors: Mimansa Jaiswal, Zakaria Aldeneh, Cristian-Paul Bara, Yuanhang Luo, Mihai Burzo, Rada Mihalcea, Emily Mower Provost

    Abstract: Emotion recognition algorithms rely on data annotated with high quality labels. However, emotion expression and perception are inherently subjective. There is generally not a single annotation that can be unambiguously declared "correct". As a result, annotations are colored by the manner in which they were collected. In this paper, we conduct crowdsourcing experiments to investigate this impact o…

    Submitted 27 March, 2019; originally announced March 2019.

    Comments: 5 pages, ICASSP 2019

  20. arXiv:1903.09245  [pdf, other]

    cs.LG cs.AI cs.CC stat.ML

    Trainable Time Warping: Aligning Time-Series in the Continuous-Time Domain

    Authors: Soheil Khorram, Melvin G McInnis, Emily Mower Provost

    Abstract: DTW calculates the similarity or alignment between two signals, subject to temporal warping. However, its computational complexity grows exponentially with the number of time-series. Although there have been algorithms developed that are linear in the number of time-series, they are generally quadratic in time-series length. The exception is generalized time warping (GTW), which has linear computa…

    Submitted 21 March, 2019; originally announced March 2019.

    Comments: ICASSP 2019

  21. arXiv:1806.10658  [pdf, other]

    cs.HC

    The PRIORI Emotion Dataset: Linking Mood to Emotion Detected In-the-Wild

    Authors: Soheil Khorram, Mimansa Jaiswal, John Gideon, Melvin McInnis, Emily Mower Provost

    Abstract: Bipolar Disorder is a chronic psychiatric illness characterized by pathological mood swings associated with severe disruptions in emotion regulation. Clinical monitoring of mood is key to the care of these dynamic and incapacitating mood states. Frequent and detailed monitoring improves clinical sensitivity to detect mood state changes, but typically requires costly and limited resources. Speech c…

    Submitted 19 June, 2018; originally announced June 2018.

    Comments: Interspeech 2018

  22. arXiv:1805.06511  [pdf, ps, other]

    cs.CL cs.AI

    Improving End-of-turn Detection in Spoken Dialogues by Detecting Speaker Intentions as a Secondary Task

    Authors: Zakaria Aldeneh, Dimitrios Dimitriadis, Emily Mower Provost

    Abstract: This work focuses on the use of acoustic cues for modeling turn-taking in dyadic spoken dialogues. Previous work has shown that speaker intentions (e.g., asking a question, uttering a backchannel, etc.) can influence turn-taking behavior and are good predictors of turn-transitions in spoken dialogues. However, speaker intentions are not readily available for use by automated systems at run-time; m…

    Submitted 9 May, 2018; originally announced May 2018.

    Comments: ICASSP 2018

  23. arXiv:1708.07050  [pdf, other]

    cs.SD cs.AI

    Capturing Long-term Temporal Dependencies with Convolutional Networks for Continuous Emotion Recognition

    Authors: Soheil Khorram, Zakaria Aldeneh, Dimitrios Dimitriadis, Melvin McInnis, Emily Mower Provost

    Abstract: The goal of continuous emotion recognition is to assign an emotion value to every frame in a sequence of acoustic features. We show that incorporating long-term temporal dependencies is critical for continuous emotion recognition tasks. To this end, we first investigate architectures that use dilated convolutions. We show that even though such architectures outperform previously reported systems,…

    Submitted 23 August, 2017; originally announced August 2017.

    Comments: 5 pages, 5 figures, 2 tables, Interspeech 2017

  24. arXiv:1706.03256  [pdf, other]

    cs.LG

    Progressive Neural Networks for Transfer Learning in Emotion Recognition

    Authors: John Gideon, Soheil Khorram, Zakaria Aldeneh, Dimitrios Dimitriadis, Emily Mower Provost

    Abstract: Many paralinguistic tasks are closely related and thus representations learned in one domain can be leveraged for another. In this paper, we investigate how knowledge can be transferred between three paralinguistic tasks: speaker, emotion, and gender recognition. Further, we extend this problem to cross-dataset tasks, asking how knowledge captured in one emotion dataset can be transferred to anoth…

    Submitted 10 June, 2017; originally announced June 2017.

    Comments: 5 pages, 4 figures, to appear in the proceedings of Interspeech 2017