
Showing 1–8 of 8 results for author: Bitton, A

  1. arXiv:2008.01393  [pdf, other]

    cs.SD cs.LG eess.AS

    Neural Granular Sound Synthesis

    Authors: Adrien Bitton, Philippe Esling, Tatsuya Harada

    Abstract: Granular sound synthesis is a popular audio generation technique based on rearranging sequences of small waveform windows. In order to control the synthesis, all grains in a given corpus are analyzed through a set of acoustic descriptors. This provides a representation reflecting some form of local similarities across the grains. However, the quality of this grain space is bound by that of the des…

    Submitted 3 July, 2021; v1 submitted 4 August, 2020; originally announced August 2020.

    Comments: Presented at ICMC 2021 (postponed from 2020)

  2. arXiv:2008.01370  [pdf]

    cs.SD cs.LG eess.AS

    Timbre latent space: exploration and creative aspects

    Authors: Antoine Caillon, Adrien Bitton, Brice Gatinet, Philippe Esling

    Abstract: Recent studies show the ability of unsupervised models to learn invertible audio representations using Auto-Encoders. They enable high-quality sound synthesis but a limited control since the latent spaces do not disentangle timbre properties. The emergence of disentangled representations was studied in Variational Auto-Encoders (VAEs), and has been applied to audio. Using an additional perceptual…

    Submitted 17 August, 2020; v1 submitted 4 August, 2020; originally announced August 2020.

  3. arXiv:2007.16187  [pdf, other]

    cs.LG cs.IR cs.MM cs.SD eess.AS stat.ML

    Ultra-light deep MIR by trimming lottery tickets

    Authors: Philippe Esling, Theis Bazin, Adrien Bitton, Tristan Carsault, Ninon Devis

    Abstract: Current state-of-the-art results in Music Information Retrieval are largely dominated by deep learning approaches. These provide unprecedented accuracy across all tasks. However, the consistently overlooked downside of these models is their stunningly massive complexity, which seems concomitantly crucial to their success. In this paper, we address this issue by proposing a model pruning method bas…

    Submitted 31 July, 2020; originally announced July 2020.

    Comments: 8 pages, 2 figures. 21st International Society for Music Information Retrieval Conference 11-15 October 2020, Montreal, Canada

  4. arXiv:2007.16170  [pdf, other]

    cs.LG cs.MM cs.SD eess.AS stat.ML

    Diet deep generative audio models with structured lottery

    Authors: Philippe Esling, Ninon Devis, Adrien Bitton, Antoine Caillon, Axel Chemla--Romeu-Santos, Constance Douwes

    Abstract: Deep learning models have provided extremely successful solutions in most audio application fields. However, the high accuracy of these models comes at the expense of a tremendous computation cost. This aspect is almost always overlooked in evaluating the quality of proposed models. However, models should not be evaluated without taking into account their complexity. This aspect is especially crit…

    Submitted 31 July, 2020; originally announced July 2020.

    Comments: 8 pages, 5 figures. Proceedings of the 23rd International Conference on Digital Audio Effects (DAFx-20), Vienna, Austria, September 8-12, 2020

  5. arXiv:2007.06349  [pdf, other]

    eess.AS cs.LG

    Vector-Quantized Timbre Representation

    Authors: Adrien Bitton, Philippe Esling, Tatsuya Harada

    Abstract: Timbre is a set of perceptual attributes that identifies different types of sound sources. Although its definition is usually elusive, it can be seen from a signal processing viewpoint as all the spectral features that are perceived independently from pitch and loudness. Some works have studied high-level timbre synthesis by analyzing the feature relationships of different instruments, but acousti…

    Submitted 13 July, 2020; originally announced July 2020.

  6. arXiv:1904.06215  [pdf, other]

    cs.SD cs.LG eess.AS

    Assisted Sound Sample Generation with Musical Conditioning in Adversarial Auto-Encoders

    Authors: Adrien Bitton, Philippe Esling, Antoine Caillon, Martin Fouilleul

    Abstract: Generative models have thrived in computer vision, enabling unprecedented image processes. Yet the results in audio remain less advanced. Our project targets real-time sound synthesis from a reduced set of high-level parameters, including semantic controls that can be adapted to different sound libraries and specific tags. These generative variables should allow expressive modulations of target mu…

    Submitted 22 June, 2019; v1 submitted 12 April, 2019; originally announced April 2019.

    Comments: This article has been accepted for presentation at the 22nd International Conference on Digital Audio Effects (DAFx 2019); we provide additional content in this companion repository: https://github.com/acids-ircam/Expressive_WAE_FADER

  7. arXiv:1810.00222  [pdf, other]

    cs.SD eess.AS

    Modulated Variational auto-Encoders for many-to-many musical timbre transfer

    Authors: Adrien Bitton, Philippe Esling, Axel Chemla-Romeu-Santos

    Abstract: Generative models have been successfully applied to image style transfer and domain translation. However, there is still a wide gap in the quality of results when learning such tasks on musical audio. Furthermore, most translation models only enable one-to-one or one-to-many transfer by relying on separate encoders or decoders and complex, computationally-heavy models. In this paper, we introduce…

    Submitted 29 September, 2018; originally announced October 2018.

  8. arXiv:1805.08501  [pdf, other]

    cs.SD eess.AS

    Generative timbre spaces: regularizing variational auto-encoders with perceptual metrics

    Authors: Philippe Esling, Axel Chemla--Romeu-Santos, Adrien Bitton

    Abstract: Timbre spaces have been used in music perception to study the perceptual relationships between instruments based on dissimilarity ratings. However, these spaces do not generalize to novel examples and do not provide an invertible mapping, preventing audio synthesis. In parallel, generative models have aimed to provide methods for synthesizing novel timbres. However, these systems do not provide an…

    Submitted 1 October, 2018; v1 submitted 22 May, 2018; originally announced May 2018.

    Comments: Digital Audio Conference (DaFX 2018)