L3DAS21 Challenge: Machine Learning for 3D Audio Signal Processing
Authors:
Eric Guizzo,
Riccardo F. Gramaccioni,
Saeid Jamili,
Christian Marinoni,
Edoardo Massaro,
Claudia Medaglia,
Giuseppe Nachira,
Leonardo Nucciarelli,
Ludovica Paglialunga,
Marco Pennese,
Sveva Pepe,
Enrico Rocchi,
Aurelio Uncini,
Danilo Comminiello
Abstract:
The L3DAS21 Challenge aims to encourage and foster collaborative research on machine learning for 3D audio signal processing, with a particular focus on 3D speech enhancement (SE) and 3D sound localization and detection (SELD). Alongside the challenge, we release the L3DAS21 dataset, a 65-hour 3D audio corpus, accompanied by a Python API that facilitates data usage and the results submission stage. Usually, machine learning approaches to 3D audio tasks are based on single-perspective Ambisonics recordings or on arrays of single-capsule microphones. We propose, instead, a novel multichannel audio configuration based on multiple-source and multiple-perspective Ambisonics recordings, performed with an array of two first-order Ambisonics microphones. To the best of our knowledge, this is the first time that a dual-mic Ambisonics configuration has been used for these tasks. We provide baseline models and results for both tasks, obtained with state-of-the-art architectures: FaSNet for SE and SELDNet for SELD. This report provides all the information needed to participate in the L3DAS21 Challenge, describing the L3DAS21 dataset, the challenge tasks, and the baseline models.
Submitted 29 April, 2021; v1 submitted 12 April, 2021;
originally announced April 2021.
Discrete Fourier Transform Method for Discrimination of Digital Scintillation Pulses in Mixed Neutron-Gamma Fields
Authors:
M. J. Safari,
F. Abbasi Davani,
H. Afarideh,
S. Jamili,
E. Bayat
Abstract:
A Discrete Fourier Transform Method (DFTM) for discriminating between neutron and gamma-ray signals in organic scintillation detectors is presented. The method is based on transforming the signals into the frequency domain using the sine and cosine Fourier transforms in combination with the discrete Fourier transform. The method benefits largely from the considerable differences that usually exist between the zero-frequency components of the sine and cosine transforms and the norm of the DFT amplitude for neutron and gamma-ray signals. Moreover, working in the frequency domain naturally results in considerable suppression of the unwanted effects of various noise sources that are expected to affect time-domain methods. The proposed method can also be regarded as a generalized nonlinear weighting method that could lead to a new class of pulse shape discrimination methods, beyond the definition of the DFT. A comparison with the traditional Charge Integration Method (CIM), as well as the Frequency Gradient Analysis Method (FGAM) and the Wavelet Packet Transform Method (WPTM), is presented to demonstrate the applicability and efficiency of the method for real-world applications. In general, the method shows better discrimination Figures of Merit (FoMs) both at low light outputs and on average over the studied energy domain. A noise analysis has been performed for all of the above-mentioned methods. It reveals that the frequency-domain methods (FGAM and DFTM) are less sensitive to noise effects.
Submitted 1 November, 2016;
originally announced November 2016.
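Illustrative note on the Figure of Merit mentioned in the abstract above: the sketch below is not the authors' DFTM. It only shows how a discrimination FoM is commonly computed, using a simple tail-to-total charge-integration feature as a stand-in for CIM. The pulse model, decay constants, slow-component fractions, noise level, and all function names are assumptions chosen for illustration, and the FoM uses the widely used definition (peak separation divided by the sum of the two FWHMs) under a Gaussian approximation.

```python
import numpy as np

def make_pulse(t, slow_fraction, rng, noise=0.01):
    """Synthetic pulse: fast plus slow exponential decay (assumed model, illustrative only)."""
    fast = np.exp(-t / 5.0)
    slow = np.exp(-t / 50.0)
    pulse = (1.0 - slow_fraction) * fast + slow_fraction * slow
    return pulse + rng.normal(0.0, noise, size=t.shape)

def tail_to_total(pulse, tail_start):
    """Charge-integration feature: integrated tail divided by the integrated pulse."""
    return pulse[tail_start:].sum() / pulse.sum()

def figure_of_merit(a, b):
    """FoM = peak separation / (FWHM_a + FWHM_b), assuming Gaussian-shaped distributions."""
    k = 2.0 * np.sqrt(2.0 * np.log(2.0))  # FWHM of a Gaussian equals k * sigma
    return abs(a.mean() - b.mean()) / (k * (a.std() + b.std()))

rng = np.random.default_rng(0)
t = np.arange(200, dtype=float)  # sample index, arbitrary time units

# Assumption: neutron pulses carry a larger slow component than gamma-ray pulses.
gamma = np.array([tail_to_total(make_pulse(t, 0.05, rng), 20) for _ in range(500)])
neutron = np.array([tail_to_total(make_pulse(t, 0.20, rng), 20) for _ in range(500)])

fom = figure_of_merit(neutron, gamma)
print(f"FoM = {fom:.2f}")
```

A larger FoM means the two feature distributions are better separated. Any other per-pulse feature, including a frequency-domain one, could be passed to `figure_of_merit` in the same way.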