Skip to main content

Showing 1–13 of 13 results for author: Holighaus, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2202.12380  [pdf, other

    math.NA cs.IT cs.MS eess.SP

    Fast Matching Pursuit with Multi-Gabor Dictionaries

    Authors: Zdeněk Průša, Nicki Holighaus, Peter Balazs

    Abstract: Finding the best K-sparse approximation of a signal in a redundant dictionary is an NP-hard problem. Suboptimal greedy matching pursuit (MP) algorithms are generally used for this task. In this work, we present an acceleration technique and an implementation of the matching pursuit algorithm acting on a multi-Gabor dictionary, i.e., a concatenation of several Gabor-type time-frequency dictionaries… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

  2. arXiv:2202.07498  [pdf, other

    cs.SD cs.MS eess.AS eess.SP

    Non-iterative Filter Bank Phase (Re)Construction

    Authors: Zdeněk Průša, Nicki Holighaus

    Abstract: Signal reconstruction from magnitude-only measurements presents a long-standing problem in signal processing. In this contribution, we propose a phase (re)construction method for filter banks with uniform decimation and controlled frequency variation. The suggested procedure extends the recently introduced phase-gradient heap integration and relies on a phase-magnitude relationship for filter bank… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

  3. arXiv:2202.07484  [pdf, other

    cs.SD eess.AS eess.SP

    Phase-Based Signal Representations for Scattering

    Authors: Daniel Haider, Peter Balazs, Nicki Holighaus

    Abstract: The scattering transform is a non-linear signal representation method based on cascaded wavelet transform magnitudes. In this paper we introduce phase scattering, a novel approach where we use phase derivatives in a scattering procedure. We first revisit phase-related concepts for representing time-frequency information of audio signals, in particular, the partial derivatives of the phase in the t… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

  4. arXiv:2202.07479  [pdf, ps, other

    cs.SD eess.AS

    Audio Inpainting via $\ell_1$-Minimization and Dictionary Learning

    Authors: Shristi Rajbamshi, Georg Tauböck, Peter Balazs, Nicki Holighaus

    Abstract: Audio inpainting refers to signal processing techniques that aim at restoring missing or corrupted consecutive samples in audio signals. Prior works have shown that $\ell_1$- minimization with appropriate weighting is capable of solving audio inpainting problems, both for the analysis and the synthesis models. These models assume that audio signals are sparse with respect to some redundant diction… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

  5. arXiv:2202.07382  [pdf, other

    cs.SD cs.MS eess.AS

    Phase Vocoder Done Right

    Authors: Zdenek Prusa, Nicki Holighaus

    Abstract: The phase vocoder (PV) is a widely spread technique for processing audio signals. It employs a short-time Fourier transform (STFT) analysis-modify-synthesis loop and is typically used for time-scaling of signals by means of using different time steps for STFT analysis and synthesis. The main challenge of PV used for that purpose is the correction of the STFT phase. In this paper, we introduce a no… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

  6. arXiv:2106.05148  [pdf, other

    eess.SP cs.SD eess.AS

    Time-Frequency Phase Retrieval for Audio -- The Effect of Transform Parameters

    Authors: Andrés Marafioti, Nicki Holighaus, Piotr Majdak

    Abstract: In audio processing applications, phase retrieval (PR) is often performed from the magnitude of short-time Fourier transform (STFT) coefficients. Although PR performance has been observed to depend on the considered STFT parameters and audio data, the extent of this dependence has not been systematically evaluated yet. To address this, we studied the performance of three PR algorithms for various… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: Accepted for publication as a regular paper in the IEEE Transactions on Signal Processing

  7. arXiv:2005.05032  [pdf, other

    cs.SD eess.AS stat.ML

    GACELA -- A generative adversarial context encoder for long audio inpainting

    Authors: Andres Marafioti, Piotr Majdak, Nicki Holighaus, Nathanaël Perraudin

    Abstract: We introduce GACELA, a generative adversarial network (GAN) designed to restore missing musical audio data with a duration ranging between hundreds of milliseconds to a few seconds, i.e., to perform long-gap audio inpainting. While previous work either addressed shorter gaps or relied on exemplars by copying available information from other signal parts, GACELA addresses the inpainting of long gap… ▽ More

    Submitted 11 May, 2020; originally announced May 2020.

    Journal ref: IEEE Journal of Selected Topics in Signal Processing, vol. 15, no. 1, pp. 120-131, Jan. 2021

  8. arXiv:1902.04072  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Adversarial Generation of Time-Frequency Features with application in audio synthesis

    Authors: Andrés Marafioti, Nicki Holighaus, Nathanaël Perraudin, Piotr Majdak

    Abstract: Time-frequency (TF) representations provide powerful and intuitive features for the analysis of time series such as audio. But still, generative modeling of audio in the TF domain is a subtle matter. Consequently, neural audio synthesis widely relies on directly modeling the waveform and previous attempts at unconditionally synthesizing audio from neurally generated invertible TF features still st… ▽ More

    Submitted 16 May, 2019; v1 submitted 11 February, 2019; originally announced February 2019.

    Comments: Accepted for publication at ICML 2019

  9. arXiv:1810.12138  [pdf, other

    cs.SD cs.LG eess.AS

    Audio inpainting of music by means of neural networks

    Authors: Andrés Marafioti, Nicki Holighaus, Piotr Majdak, Nathanaël Perraudin

    Abstract: We studied the ability of deep neural networks (DNNs) to restore missing audio content based on its context, a process usually referred to as audio inpainting. We focused on gaps in the range of tens of milliseconds. The proposed DNN structure was trained on audio signals containing music and musical instruments, separately, with 64-ms long gaps. The input to the DNN was the context, i.e., the sig… ▽ More

    Submitted 18 February, 2022; v1 submitted 29 October, 2018; originally announced October 2018.

    Comments: Presented at the 146th AES Convention [arXiv:1810.12138v2]. For the journal version, published in published in IEEE TASLP, see [arXiv:1810.12138v2]

  10. Frame Theory for Signal Processing in Psychoacoustics

    Authors: Peter Balazs, Nicki Holighaus, Thibaud Necciari, Diana Stoeva

    Abstract: This review chapter aims to strengthen the link between frame theory and signal processing tasks in psychoacoustics. On the one side, the basic concepts of frame theory are presented and some proofs are provided to explain those concepts in some detail. The goal is to reveal to hearing scientists how this mathematical theory could be relevant for their research. In particular, we focus on frame th… ▽ More

    Submitted 3 November, 2016; originally announced November 2016.

    Journal ref: In: Balan R., Benedetto J., Czaja W., Dellatorre M., Okoudjou K. (eds) Excursions in Harmonic Analysis, Vol. 5. Applied and Numerical Harmonic Analysis. Birkhäuser, Cham, 2017, 225-268

  11. arXiv:1607.06667  [pdf, other

    cs.SD cs.AI cs.MM cs.SE

    Inpainting of long audio segments with similarity graphs

    Authors: Nathanael Perraudin, Nicki Holighaus, Piotr Majdak, Peter Balazs

    Abstract: We present a novel method for the compensation of long duration data loss in audio signals, in particular music. The concealment of such signal defects is based on a graph that encodes signal structure in terms of time-persistent spectral similarity. A suitable candidate segment for the substitution of the lost content is proposed by an intuitive optimization scheme and smoothly inserted into the… ▽ More

    Submitted 23 February, 2018; v1 submitted 22 July, 2016; originally announced July 2016.

  12. arXiv:1601.06652  [pdf, ps, other

    cs.SD

    A Perceptually Motivated Filter Bank with Perfect Reconstruction for Audio Signal Processing

    Authors: Thibaud Necciari, Nicki Holighaus, Peter Balazs, Zdenek Prusa

    Abstract: Many audio applications rely on filter banks (FBs) to analyze, process, and re-synthesize sounds. To approximate the auditory frequency resolution in the signal chain, some applications rely on perceptually motivated FBs, the gammatone FB being a popular example. However, most perceptually motivated FBs only allow partial signal reconstruction at high redundancies and/or do not have good resistanc… ▽ More

    Submitted 25 January, 2016; originally announced January 2016.

  13. arXiv:1311.0897  [pdf, other

    math.FA cs.IT cs.SI

    Spectrum-Adapted Tight Graph Wavelet and Vertex-Frequency Frames

    Authors: David I Shuman, Christoph Wiesmeyr, Nicki Holighaus, Pierre Vandergheynst

    Abstract: We consider the problem of designing spectral graph filters for the construction of dictionaries of atoms that can be used to efficiently represent signals residing on weighted graphs. While the filters used in previous spectral graph wavelet constructions are only adapted to the length of the spectrum, the filters proposed in this paper are adapted to the distribution of graph Laplacian eigenvalu… ▽ More

    Submitted 4 November, 2013; originally announced November 2013.