Showing 1–9 of 9 results for author: Clemmensen, L H

  1. arXiv:2306.01538  [pdf, other]

    eess.AS

    On Crowdsourcing-design with Comparison Category Rating for Evaluating Speech Enhancement Algorithms

    Authors: Angélica S. Z. Suárez, Clément Laroche, Line H. Clemmensen, Sneha Das

    Abstract: Speech enhancement techniques improve the quality or the intelligibility of an audio signal by removing unwanted noise. It is used as preprocessing in numerous applications such as speech recognition, hearing aids, broadcasting and telephony. The evaluation of such algorithms often relies on reference-based objective metrics that are shown to correlate poorly with human perception. In order to eva…

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: Published at ICASSP 2023

  2. arXiv:2204.11550  [pdf, other]

    cs.CL cs.SD eess.AS

    Speech Detection For Child-Clinician Conversations In Danish For Low-Resource In-The-Wild Conditions: A Case Study

    Authors: Sneha Das, Nicole Nadine Lønfeldt, Anne Katrine Pagsberg, Line. H. Clemmensen

    Abstract: Use of speech models for automatic speech processing tasks can improve efficiency in the screening, analysis, diagnosis and treatment in medicine and psychiatry. However, the performance of pre-processing speech tasks like segmentation and diarization can drop considerably on in-the-wild clinical data, specifically when the target dataset comprises of atypical speech. In this paper we study the pe…

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: 5 pages. Submitted to Interspeech 2022

  3. arXiv:2203.14867  [pdf, other]

    eess.AS cs.SD

    Continuous Metric Learning For Transferable Speech Emotion Recognition and Embedding Across Low-resource Languages

    Authors: Sneha Das, Nicklas Leander Lund, Nicole Nadine Lønfeldt, Anne Katrine Pagsberg, Line H. Clemmensen

    Abstract: Speech emotion recognition (SER) refers to the technique of inferring the emotional state of an individual from speech signals. SERs continue to garner interest due to their wide applicability. Although the domain is mainly founded on signal processing, machine learning, and deep learning, generalizing over languages continues to remain a challenge. However, developing generalizable and transferab…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: Preprint of paper accepted to be presented at the Northern Lights Deep Learning Conference (NLDL), 2022. The labels are available at: https://bit.ly/3rg6VsA

  4. arXiv:2203.14865  [pdf, other]

    eess.AS cs.SD

    Towards Transferable Speech Emotion Representation: On loss functions for cross-lingual latent representations

    Authors: Sneha Das, Nicole Nadine Lønfeldt, Anne Katrine Pagsberg, Line H. Clemmensen

    Abstract: In recent years, speech emotion recognition (SER) has been used in wide ranging applications, from healthcare to the commercial sector. In addition to signal processing approaches, methods for SER now also use deep learning techniques which provide transfer learning possibilities. However, generalizing over languages, corpora and recording conditions is still an open challenge. In this work we add…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: Preprint of paper accepted to be presented at the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022. Source code at https://bit.ly/34CgkSZ. arXiv admin note: text overlap with arXiv:2105.02055

  5. arXiv:2203.07033  [pdf, other]

    cs.LG cs.AI cs.CV eess.IV

    Compressing CNN Kernels for Videos Using Tucker Decompositions: Towards Lightweight CNN Applications

    Authors: Tobias Engelhardt Rasmussen, Line H Clemmensen, Andreas Baum

    Abstract: Convolutional Neural Networks (CNN) are the state-of-the-art in the field of visual computing. However, a major problem with CNNs is the large number of floating point operations (FLOPs) required to perform convolutions for large inputs. When considering the application of CNNs to video data, convolutional filters become even more complex due to the extra temporal dimension. This leads to problems…

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: Presented at the Northern Lights Deep Learning Conference 2022 in Tromsø, Norway

  6. arXiv:2203.04706  [pdf, other]

    stat.ML cs.AI cs.LG

    Data Representativity for Machine Learning and AI Systems

    Authors: Line H. Clemmensen, Rune D. Kjærsgaard

    Abstract: Data representativity is crucial when drawing inference from data through machine learning models. Scholars have increased focus on unraveling the bias and fairness in models, also in relation to inherent biases in the input data. However, limited work exists on the representativity of samples (datasets) for appropriate inference in AI systems. This paper reviews definitions and notions of a repre…

    Submitted 3 February, 2023; v1 submitted 9 March, 2022; originally announced March 2022.

  7. arXiv:2105.02055  [pdf, other]

    eess.AS cs.AI cs.SD

    Towards Interpretable and Transferable Speech Emotion Recognition: Latent Representation Based Analysis of Features, Methods and Corpora

    Authors: Sneha Das, Nicole Nadine Lønfeldt, Anne Katrine Pagsberg, Line H. Clemmensen

    Abstract: In recent years, speech emotion recognition (SER) has been used in wide ranging applications, from healthcare to the commercial sector. In addition to signal processing approaches, methods for SER now also use deep learning techniques. However, generalizing over languages, corpora and recording conditions is still an open challenge in the field. Furthermore, due to the black-box nature of deep lea…

    Submitted 5 May, 2021; originally announced May 2021.

  8. arXiv:2006.01671  [pdf, other]

    stat.ML cs.LG stat.CO

    A generalized linear joint trained framework for semi-supervised learning of sparse features

    Authors: Juan C. Laria, Line H. Clemmensen, Bjarne K. Ersbøll

    Abstract: The elastic-net is among the most widely used types of regularization algorithms, commonly associated with the problem of supervised generalized linear model estimation via penalized maximum likelihood. Its nice properties originate from a combination of $\ell_1$ and $\ell_2$ norms, which endow this method with the ability to select variables taking into account the correlations between them. In t…

    Submitted 2 October, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

  9. arXiv:1605.09196  [pdf, other]

    stat.ML cs.LG

    Forest Floor Visualizations of Random Forests

    Authors: Soeren H. Welling, Hanne H. F. Refsgaard, Per B. Brockhoff, Line H. Clemmensen

    Abstract: We propose a novel methodology, forest floor, to visualize and interpret random forest (RF) models. RF is a popular and useful tool for non-linear multi-variate classification and regression, which yields a good trade-off between robustness (low variance) and adaptiveness (low bias). Direct interpretation of a RF model is difficult, as the explicit ensemble model of hundreds of deep trees is compl…

    Submitted 4 July, 2016; v1 submitted 30 May, 2016; originally announced May 2016.

    Comments: 25 pages, 12 figures, supplementary materials. v2->v3: minor proofing, moderated comments on ICE-plots, replaced ψ-operator with the subset named H in equation 13 and 14 to improve simplicity