Skip to main content

Showing 1–5 of 5 results for author: Sulun, S

.
  1. arXiv:2307.14783  [pdf, other

    eess.AS cs.AI cs.CL cs.LG cs.MM

    Emotion4MIDI: a Lyrics-based Emotion-Labeled Symbolic Music Dataset

    Authors: Serkan Sulun, Pedro Oliveira, Paula Viana

    Abstract: We present a new large-scale emotion-labeled symbolic music dataset consisting of 12k MIDI songs. To create this dataset, we first trained emotion classification models on the GoEmotions dataset, achieving state-of-the-art results with a model half the size of the baseline. We then applied these models to lyrics from two large-scale MIDI datasets. Our dataset covers a wide range of fine-grained em… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: Accepted to 22nd EPIA Conference on Artificial Intelligence (2023)

  2. arXiv:2203.16165  [pdf, other

    eess.AS cs.AI cs.MM

    Symbolic music generation conditioned on continuous-valued emotions

    Authors: Serkan Sulun, Matthew E. P. Davies, Paula Viana

    Abstract: In this paper we present a new approach for the generation of multi-instrument symbolic music driven by musical emotion. The principal novelty of our approach centres on conditioning a state-of-the-art transformer based on continuous-valued valence and arousal labels. In addition, we provide a new large-scale dataset of symbolic music paired with emotion labels in terms of valence and arousal. We… ▽ More

    Submitted 4 May, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: Published in IEEE Access

    Journal ref: volume:10, year:2022, pages:44617-44626

  3. arXiv:2011.07274  [pdf, other

    eess.AS cs.AI cs.LG cs.SD

    On Filter Generalization for Music Bandwidth Extension Using Deep Neural Networks

    Authors: Serkan Sulun, Matthew E. P. Davies

    Abstract: In this paper, we address a sub-topic of the broad domain of audio enhancement, namely musical audio bandwidth extension. We formulate the bandwidth extension problem using deep neural networks, where a band-limited signal is provided as input to the network, with the goal of reconstructing a full-bandwidth output. Our main contribution centers on the impact of the choice of low pass filter when t… ▽ More

    Submitted 6 January, 2021; v1 submitted 14 November, 2020; originally announced November 2020.

    Comments: Qualitative examples on https://serkansulun.com/bwe. Source code on https://github.com/serkansulun/deep-music-enhancer

  4. arXiv:2007.08922  [pdf, other

    eess.IV cs.CV cs.LG

    Can Learned Frame-Prediction Compete with Block-Motion Compensation for Video Coding?

    Authors: Serkan Sulun, A. Murat Tekalp

    Abstract: Given recent advances in learned video prediction, we investigate whether a simple video codec using a pre-trained deep model for next frame prediction based on previously encoded/decoded frames without sending any motion side information can compete with standard video codecs based on block-motion compensation. Frame differences given learned frame predictions are encoded by a standard still-imag… ▽ More

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: Accepted for publication in Springer Journal of Signal, Image and Video Processing

  5. Deep Learned Frame Prediction for Video Compression

    Authors: Serkan Sulun

    Abstract: Motion compensation is one of the most essential methods for any video compression algorithm. Video frame prediction is a task analogous to motion compensation. In recent years, the task of frame prediction is undertaken by deep neural networks (DNNs). In this thesis we create a DNN to perform learned frame prediction and additionally implement a codec that contains our DNN. We train our network u… ▽ More

    Submitted 27 November, 2018; originally announced November 2018.