Search | arXiv e-print repository

doi 10.5281/zenodo.5625680

Cross-cultural Mood Perception in Pop Songs and its Alignment with Mood Detection Algorithms

Authors: Harin Lee, Frank Hoeger, Marc Schoenwiesner, Minsu Park, Nori Jacoby

Abstract: Do people from different cultural backgrounds perceive the mood in music the same way? How closely do human ratings across different cultures approximate automatic mood detection algorithms that are often trained on corpora of predominantly Western popular music? Analyzing 166 participants' responses from Brazil, South Korea, and the US, we examined the similarity between the ratings of nine categories of perceived moods in music and estimated their alignment with four popular mood detection algorithms. We created a dataset of 360 recent pop songs drawn from major music charts of the countries and constructed semantically identical mood descriptors across English, Korean, and Portuguese languages. Multiple participants from the three countries rated their familiarity, preference, and perceived moods for a given song. Ratings were highly similar within and across cultures for basic mood attributes such as sad, cheerful, and energetic. However, we found significant cross-cultural differences for more complex characteristics such as dreamy and love. To our surprise, the results of mood detection algorithms were uniformly correlated across human ratings from all three countries and did not show a detectable bias towards any particular culture. Our study thus suggests that the mood detection algorithms can be considered as an objective measure at least within the popular music context.

Submitted 2 August, 2021; originally announced August 2021.

Comments: 8 pages, 5 figures, to be included as proceedings for the 22nd International Society of Music Information Retrieval (ISMIR)

Journal ref: Proceedings of the 22nd International Society for Music Information Retrieval Conference, Nov. 2021, pp. 366-373

arXiv:1912.02260 [pdf, other]

doi 10.32470/CCN.2019.1300-0

The effect of task and training on intermediate representations in convolutional neural networks revealed with modified RV similarity analysis

Authors: Jessica A. F. Thompson, Yoshua Bengio, Marc Schoenwiesner

Abstract: Centered Kernel Alignment (CKA) was recently proposed as a similarity metric for comparing activation patterns in deep networks. Here we experiment with the modified RV-coefficient (RV2), which has very similar properties as CKA while being less sensitive to dataset size. We compare the representations of networks that received varying amounts of training on different layers: a standard trained network (all parameters updated at every step), a freeze trained network (layers gradually frozen during training), random networks (only some layers trained), and a completely untrained network. We found that RV2 was able to recover expected similarity patterns and provide interpretable similarity matrices that suggested hypotheses about how representations are affected by different training recipes. We propose that the superior performance achieved by freeze training can be attributed to representational differences in the penultimate layer. Our comparisons of random networks suggest that the inputs and targets serve as anchors on the representations in the lowest and highest layers.

Submitted 4 December, 2019; originally announced December 2019.

Comments: 4 pages, 4 figures, Conference on Cognitive Computational Neuroscience 2019
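
The abstract above names two similarity metrics, CKA and the modified RV-coefficient (RV2), without showing how they are computed. The following is a minimal sketch, not the authors' implementation. It assumes the linear-kernel form of CKA and assumes that RV2 differs from it only by zeroing the diagonal of each centered Gram matrix before comparison. The function names `linear_cka` and `rv2` are hypothetical, and activations are assumed to be arrays of shape (examples, units).

```python
import numpy as np


def _centered_gram(x):
    """Return X X^T after centering each column (unit) of x across examples."""
    x = np.asarray(x, dtype=float)
    x = x - x.mean(axis=0, keepdims=True)
    return x @ x.T


def _normalized_inner(gx, gy):
    """Frobenius inner product of two Gram matrices, normalized by their norms."""
    return np.sum(gx * gy) / np.sqrt(np.sum(gx * gx) * np.sum(gy * gy))


def linear_cka(x, y):
    """Linear CKA between two activation matrices with the same number of rows."""
    return _normalized_inner(_centered_gram(x), _centered_gram(y))


def rv2(x, y):
    """Modified RV coefficient (assumed form): as above, but with the
    diagonals of the Gram matrices set to zero before comparison."""
    gx, gy = _centered_gram(x), _centered_gram(y)
    np.fill_diagonal(gx, 0.0)
    np.fill_diagonal(gy, 0.0)
    return _normalized_inner(gx, gy)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    layer_a = rng.normal(size=(50, 8))   # 50 examples, 8 units
    layer_b = rng.normal(size=(50, 12))  # same examples, different layer width
    print("linear CKA:", linear_cka(layer_a, layer_b))
    print("RV2:", rv2(layer_a, layer_b))
```

Both functions compare Gram matrices over examples, so the two layers may have different numbers of units as long as they were evaluated on the same examples. Under these assumptions, either score is 1 for a layer compared with itself or with any rotated and rescaled copy of itself.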

arXiv:1810.08651 [pdf, ps, other]

How can deep learning advance computational modeling of sensory information processing?

Authors: Jessica A. F. Thompson, Yoshua Bengio, Elia Formisano, Marc Schönwiesner

Abstract: Deep learning, computational neuroscience, and cognitive science have overlapping goals related to understanding intelligence such that perception and behaviour can be simulated in computational systems. In neuroimaging, machine learning methods have been used to test computational models of sensory information processing. Recently, these model comparison techniques have been used to evaluate deep neural networks (DNNs) as models of sensory information processing. However, the interpretation of such model evaluations is muddied by imprecise statistical conclusions. Here, we make explicit the types of conclusions that can be drawn from these existing model comparison techniques and how these conclusions change when the model in question is a DNN. We discuss how DNNs are amenable to new model comparison techniques that allow for stronger conclusions to be made about the computational mechanisms underlying sensory information processing.

Submitted 25 September, 2018; originally announced October 2018.

Comments: Presented at MLINI-2016 workshop, 2016 (arXiv:1701.01437)

Report number: MLINI/2016/04

Showing 1–3 of 3 results for author: Schönwiesner, M