
Showing 1–7 of 7 results for author: De Sena, E

  1. arXiv:2404.00082  [pdf, other]

    eess.AS cs.LG cs.SD

    Data-Driven Room Acoustic Modeling Via Differentiable Feedback Delay Networks With Learnable Delay Lines

    Authors: Alessandro Ilic Mezza, Riccardo Giampiccolo, Enzo De Sena, Alberto Bernardini

    Abstract: Over the past few decades, extensive research has been devoted to the design of artificial reverberation algorithms aimed at emulating the room acoustics of physical environments. Despite significant advancements, automatic parameter tuning of delay-network models remains an open challenge. We introduce a novel method for finding the parameters of a Feedback Delay Network (FDN) such that its outpu…

    Submitted 17 May, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

    Comments: The article has been submitted to EURASIP Journal on Audio, Speech, and Music Processing on Jan 02, 2024 and is currently under review

  2. arXiv:2312.14658  [pdf, other]

    cs.SD eess.AS

    Room Acoustic Rendering Networks with Control of Scattering and Early Reflections

    Authors: Matteo Scerbo, Lauri Savioja, Enzo De Sena

    Abstract: Room acoustic synthesis can be used in Virtual Reality (VR), Augmented Reality (AR) and gaming applications to enhance listeners' sense of immersion, realism and externalisation. A common approach is to use Geometrical Acoustics (GA) models to compute impulse responses at interactive speed, and fast convolution methods to apply said responses in real time. Alternatively, delay-network-based models…

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: Submitted to IEEE/ACM Transactions on Audio, Speech, and Language Processing. 12 pages, 12 figures, 2 tables

    MSC Class: 76Q05 (Primary) 93C43; 94A12 (Secondary)

  3. arXiv:2306.08514  [pdf, other]

    eess.AS eess.SP

    Low-Complexity Steered Response Power Mapping based on Low-Rank and Sparse Interpolation

    Authors: Thomas Dietzen, Enzo De Sena, Toon van Waterschoot

    Abstract: For acoustic source localization, a map of the acoustic scene as obtained by the steered response power (SRP) approach can be employed. In SRP, the frequency-weighted output power of a beamformer steered towards a set of candidate locations is obtained from generalized cross-correlations (GCCs). Due to the dense grid of candidate locations, conventional SRP exhibits a high computational complexity…

    Submitted 14 June, 2023; originally announced June 2023.

  4. Low-Complexity Steered Response Power Mapping based on Nyquist-Shannon Sampling

    Authors: Thomas Dietzen, Enzo De Sena, Toon van Waterschoot

    Abstract: The steered response power (SRP) approach to acoustic source localization computes a map of the acoustic scene from the frequency-weighted output power of a beamformer steered towards a set of candidate locations. Equivalently, SRP may be expressed in terms of time-domain generalized cross-correlations (GCCs) at lags equal to the candidate locations' time-differences of arrival (TDOAs). Due to the…

    Submitted 22 July, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

  5. arXiv:2009.12143  [pdf, other]

    math.NA

    On the Convergence of the Multipole Expansion Method

    Authors: Brian Fitzpatrick, Enzo De Sena, Toon van Waterschoot

    Abstract: The multipole expansion method (MEM) is a spatial discretization technique that is widely used in applications that feature scattering of waves from circular cylinders. Moreover, it also serves as a key component in several other numerical methods in which scattering computations involving arbitrarily shaped objects are accelerated by enclosing the objects in artificial cylinders. A fundamental qu…

    Submitted 3 June, 2021; v1 submitted 25 September, 2020; originally announced September 2020.

    Comments: 21 pages, 2 figures; Corrected a scaling error that occurred when plotting the third columns of Figs 1,2,3, some very minor grammatical edits to the intro/conclusion to improve clarity and conciseness, included funding info in first page; updated intro with historical info; reformatted several sections to reduce no. of pages; changed title, shortened abstract; fixed typo in proof of Thm 1.1

    MSC Class: 31A10; 42B10; 65N12; 65N15; 65R20; 70F10; 78M15; 78M16

  6. Localization Uncertainty in Time-Amplitude Stereophonic Reproduction

    Authors: Enzo De Sena, Zoran Cvetkovic, Huseyin Hacihabiboglu, Marc Moonen, Toon van Waterschoot

    Abstract: This article studies the effects of inter-channel time and level differences in stereophonic reproduction on perceived localization uncertainty, which is defined as how difficult it is for a listener to tell where a sound source is located. Towards this end, a computational model of localization uncertainty is proposed first. The model calculates inter-aural time and level difference cues, and com…

    Submitted 6 September, 2020; v1 submitted 26 July, 2019; originally announced July 2019.

    Journal ref: IEEE/ACM Trans. Audio, Speech and Language Process. vol 28, pp. 1000 - 1015, Feb. 2020

  7. Efficient Synthesis of Room Acoustics via Scattering Delay Networks

    Authors: Enzo De Sena, Huseyin Hacihabiboglu, Zoran Cvetkovic, Julius O. Smith III

    Abstract: An acoustic reverberator consisting of a network of delay lines connected via scattering junctions is proposed. All parameters of the reverberator are derived from physical properties of the enclosure it simulates. It allows for simulation of unequal and frequency-dependent wall absorption, as well as directional sources and microphones. The reverberator renders the first-order reflections exactly…

    Submitted 9 July, 2015; v1 submitted 19 February, 2015; originally announced February 2015.

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 23, No. 9, September 2015