Skip to main content

Showing 1–8 of 8 results for author: Ramírez, M A M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2202.01664  [pdf, other

    eess.AS cs.LG cs.SD

    Distortion Audio Effects: Learning How to Recover the Clean Signal

    Authors: Johannes Imort, Giorgio Fabbro, Marco A. Martínez Ramírez, Stefan Uhlich, Yuichiro Koyama, Yuki Mitsufuji

    Abstract: Given the recent advances in music source separation and automatic mixing, removing audio effects in music tracks is a meaningful step toward develo** an automated remixing system. This paper focuses on removing distortion audio effects applied to guitar tracks in music production. We explore whether effect removal can be solved by neural networks designed for source separation and audio effect… ▽ More

    Submitted 13 September, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

    Comments: Audio examples available at https://joimort.github.io/distortionremoval/

  2. arXiv:2110.06525  [pdf, other

    cs.SD cs.LG eess.AS

    Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks

    Authors: Bo-Yu Chen, Wei-Han Hsu, Wei-Hsiang Liao, Marco A. Martínez Ramírez, Yuki Mitsufuji, Yi-Hsuan Yang

    Abstract: A central task of a Disc Jockey (DJ) is to create a mixset of mu-sic with seamless transitions between adjacent tracks. In this paper, we explore a data-driven approach that uses a generative adversarial network to create the song transition by learning from real-world DJ mixes. In particular, the generator of the model uses two differentiable digital signal processing components, an equalizer (EQ… ▽ More

    Submitted 17 February, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: To be published at ICASSP 2022

  3. arXiv:2105.04752  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    Differentiable Signal Processing With Black-Box Audio Effects

    Authors: Marco A. Martínez Ramírez, Oliver Wang, Paris Smaragdis, Nicholas J. Bryan

    Abstract: We present a data-driven approach to automate audio signal processing by incorporating stateful third-party, audio effects as layers within a deep neural network. We then train a deep encoder to analyze input audio and control effect parameters to perform the desired signal manipulation, requiring only input-target paired audio data as supervision. To train our network with non-differentiable blac… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

    Comments: Presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), June 2021. Source code, demo and audio examples: https://mchijmma.github.io/DeepAFx/

  4. arXiv:2104.13553  [pdf, other

    eess.AS cs.LG cs.SD

    AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries

    Authors: Woosung Choi, Minseok Kim, Marco A. Martínez Ramírez, Jaehwa Chung, Soonyoung Jung

    Abstract: This paper proposes a neural network that performs audio transformations to user-specified sources (e.g., vocals) of a given audio track according to a given description while preserving other sources not mentioned in the description. Audio Manipulation on a Specific Source (AMSS) is challenging because a sound object (i.e., a waveform sample or frequency bin) is `transparent'; it usually carries… ▽ More

    Submitted 27 April, 2021; originally announced April 2021.

    Comments: 10 pages, 8 figures, 3 tables, under reviewing of ACMMM 21

  5. arXiv:1910.10105  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    Modeling plate and spring reverberation using a DSP-informed deep neural network

    Authors: Marco A. Martínez Ramírez, Emmanouil Benetos, Joshua D. Reiss

    Abstract: Plate and spring reverberators are electromechanical systems first used and researched as means to substitute real room reverberation. Nowadays they are often used in music production for aesthetic reasons due to their particular sonic characteristics. The modeling of these audio processors and their perceptual qualities is difficult since they use mechanical elements together with analog electron… ▽ More

    Submitted 17 April, 2020; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: Presented at the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, May 2020. Source code, dataset, audio examples and more detailed diagrams: https://mchijmma.github.io/modeling-plate-spring-reverb/

  6. arXiv:1905.06148  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    A general-purpose deep learning approach to model time-varying audio effects

    Authors: Marco A. Martínez Ramírez, Emmanouil Benetos, Joshua D. Reiss

    Abstract: Audio processors whose parameters are modified periodically over time are often referred as time-varying or modulation based audio effects. Most existing methods for modeling these type of effect units are often optimized to a very specific circuit and cannot be efficiently generalized to other time-varying effects. Based on convolutional and recurrent neural networks, we propose a deep learning a… ▽ More

    Submitted 21 June, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

    Comments: audio files: https://mchijmma.github.io/modeling-time-varying/

  7. arXiv:1904.04589  [pdf, other

    eess.AS cs.SD

    Ensemble Models for Spoofing Detection in Automatic Speaker Verification

    Authors: Bhusan Chettri, Daniel Stoller, Veronica Morfi, Marco A. Martínez Ramírez, Emmanouil Benetos, Bob L. Sturm

    Abstract: Detecting spoofing attempts of automatic speaker verification (ASV) systems is challenging, especially when using only one modeling approach. For robustness, we use both deep neural networks and traditional machine learning models and combine them as ensemble models through logistic regression. They are trained to detect logical access (LA) and physical access (PA) attacks on the dataset released… ▽ More

    Submitted 4 July, 2019; v1 submitted 9 April, 2019; originally announced April 2019.

    Comments: Accepted at Interspeech 2019, Graz, Austria

  8. arXiv:1810.06603  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    Modeling of nonlinear audio effects with end-to-end deep neural networks

    Authors: Marco A. Martínez Ramirez, Joshua D. Reiss

    Abstract: In the context of music production, distortion effects are mainly used for aesthetic reasons and are usually applied to electric musical instruments. Most existing methods for nonlinear modeling are often either simplified or optimized to a very specific circuit. In this work, we investigate deep learning architectures for audio processing and we aim to find a general purpose end-to-end deep neura… ▽ More

    Submitted 6 March, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: Presented at the 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Brighton, UK, May 2019