Showing 1–2 of 2 results for author: Cherep, M

  1. arXiv:2406.05923 [pdf, other]

    cs.SD cs.LG eess.AS

    Contrastive Learning from Synthetic Audio Doppelgangers

    Authors: Manuel Cherep, Nikhil Singh

    Abstract: Learning robust audio representations currently demands extensive datasets of real-world sound recordings. By applying artificial transformations to these recordings, models can learn to recognize similarities despite subtle variations through techniques like contrastive learning. However, these transformations are only approximations of the true diversity found in real-world sounds, which are gen…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 17 pages, 6 figures

  2. arXiv:2406.00294 [pdf, other]

    cs.SD cs.LG eess.AS

    Creative Text-to-Audio Generation via Synthesizer Programming

    Authors: Manuel Cherep, Nikhil Singh, Jessica Shand

    Abstract: Neural audio synthesis methods now allow specifying ideas in natural language. However, these methods produce results that cannot be easily tweaked, as they are based on large latent spaces and up to billions of uninterpretable parameters. We propose a text-to-audio generation method that leverages a virtual modular sound synthesizer with only 78 parameters. Synthesizers have long been used by ski…

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2024