Skip to main content

Showing 1–6 of 6 results for author: Jonason, N

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.12666  [pdf, other

    cs.SD cs.LG eess.AS

    SYMPLEX: Controllable Symbolic Music Generation using Simplex Diffusion with Vocabulary Priors

    Authors: Nicolas Jonason, Luca Casini, Bob L. T. Sturm

    Abstract: We present a new approach for fast and controllable generation of symbolic music based on the simplex diffusion, which is essentially a diffusion process operating on probabilities rather than the signal space. This objective has been applied in domains such as natural language processing but here we apply it to generating 4-bar multi-instrument music loops using an orderless representation. We sh… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  2. arXiv:2311.10384  [pdf, other

    cs.SD eess.AS

    Retrieval Augmented Generation of Symbolic Music with LLMs

    Authors: Nicolas Jonason, Luca Casini, Carl Thomé, Bob L. T. Sturm

    Abstract: We explore the use of large language models (LLMs) for music generation using a retrieval system to select relevant examples. We find promising initial results for music generation in a dialogue with the user, especially considering the ease with which such a system can be implemented. The code is available online.

    Submitted 28 December, 2023; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: LBD @ ISMIR 2023

  3. arXiv:2309.07658  [pdf, other

    cs.SD eess.AS

    DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input

    Authors: Nicolas Jonason, Xin Wang, Erica Cooper, Lauri Juvela, Bob L. T. Sturm, Junichi Yamagishi

    Abstract: We explore the use of neural synthesis for acoustic guitar from string-wise MIDI input. We propose four different systems and compare them with both objective metrics and subjective evaluation against natural audio and a sample-based baseline. We iteratively develop these four systems by making various considerations on the architecture and intermediate tasks, such as predicting pitch and loudness… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  4. arXiv:2305.03530  [pdf, other

    cs.SD cs.LG eess.AS

    Exploring Softly Masked Language Modelling for Controllable Symbolic Music Generation

    Authors: Nicolas Jonason, Bob L. T. Sturm

    Abstract: This document presents some early explorations of applying Softly Masked Language Modelling (SMLM) to symbolic music generation. SMLM can be seen as a generalisation of masked language modelling (MLM), where instead of each element of the input set being either known or unknown, each element can be known, unknown or partly known. We demonstrate some results of applying SMLM to constrained symbolic… ▽ More

    Submitted 11 May, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: Version 1.1

  5. arXiv:2212.02610  [pdf, other

    cs.SD cs.LG eess.AS

    Audio Latent Space Cartography

    Authors: Nicolas Jonason, Bob L. T. Sturm

    Abstract: We explore the generation of visualisations of audio latent spaces using an audio-to-image generation pipeline. We believe this can help with the interpretability of audio latent spaces. We demonstrate a variety of results on the NSynth dataset. A web demo is available.

    Submitted 7 December, 2022; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: Late Breaking / Demo, ISMIR 2022 (https://ismir2022program.ismir.net/lbd_413.html)

    ACM Class: J.5

  6. arXiv:2211.11225  [pdf, other

    cs.SD cs.LG eess.AS

    TimbreCLIP: Connecting Timbre to Text and Images

    Authors: Nicolas Jonason, Bob L. T. Sturm

    Abstract: We present work in progress on TimbreCLIP, an audio-text cross modal embedding trained on single instrument notes. We evaluate the models with a cross-modal retrieval task on synth patches. Finally, we demonstrate the application of TimbreCLIP on two tasks: text-driven audio equalization and timbre to image generation.

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: Submitted to AAAI workshop on creative AI across modalities

    ACM Class: J.5