Skip to main content

Showing 1–11 of 11 results for author: Uncini, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.07024  [pdf, other

    cs.LG eess.SP

    Demystifying the Hypercomplex: Inductive Biases in Hypercomplex Deep Learning

    Authors: Danilo Comminiello, Eleonora Grassucci, Danilo P. Mandic, Aurelio Uncini

    Abstract: Hypercomplex algebras have recently been gaining prominence in the field of deep learning owing to the advantages of their division algebras over real vector spaces and their superior results when dealing with multidimensional signals in real-world 3D and 4D paradigms. This paper provides a foundational framework that serves as a roadmap for understanding why hypercomplex deep learning methods are… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: Accepted for Publication in IEEE Signal Processing Magazine

  2. arXiv:2402.09245  [pdf, other

    eess.AS cs.LG eess.SP

    Overview of the L3DAS23 Challenge on Audio-Visual Extended Reality

    Authors: Christian Marinoni, Riccardo Fosco Gramaccioni, Changan Chen, Aurelio Uncini, Danilo Comminiello

    Abstract: The primary goal of the L3DAS23 Signal Processing Grand Challenge at ICASSP 2023 is to promote and support collaborative research on machine learning for 3D audio signal processing, with a specific emphasis on 3D speech enhancement and 3D Sound Event Localization and Detection in Extended Reality applications. As part of our latest competition, we provide a brand-new dataset, which maintains the s… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Accepted to 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)

  3. arXiv:2310.10224  [pdf, other

    eess.IV cs.CV cs.LG

    Generalizing Medical Image Representations via Quaternion Wavelet Networks

    Authors: Luigi Sigillo, Eleonora Grassucci, Aurelio Uncini, Danilo Comminiello

    Abstract: Neural network generalizability is becoming a broad research field due to the increasing availability of datasets from different sources and for various tasks. This issue is even wider when processing medical data, where a lack of methodological standards causes large variations being provided by different imaging centers or acquired with various devices and cofactors. To overcome these limitation… ▽ More

    Submitted 17 January, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: This paper is currently under review

  4. arXiv:2204.01851  [pdf, other

    eess.AS cs.LG cs.SD

    Dual Quaternion Ambisonics Array for Six-Degree-of-Freedom Acoustic Representation

    Authors: Eleonora Grassucci, Gioia Mancini, Christian Brignone, Aurelio Uncini, Danilo Comminiello

    Abstract: Spatial audio methods are gaining a growing interest due to the spread of immersive audio experiences and applications, such as virtual and augmented reality. For these purposes, 3D audio signals are often acquired through arrays of Ambisonics microphones, each comprising four capsules that decompose the sound field in spherical harmonics. In this paper, we propose a dual quaternion representation… ▽ More

    Submitted 14 December, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

    Comments: Paper accepted for publication in Elsevier Pattern Recognition Letters

  5. arXiv:2202.10372  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    L3DAS22 Challenge: Learning 3D Audio Sources in a Real Office Environment

    Authors: Eric Guizzo, Christian Marinoni, Marco Pennese, Xinlei Ren, Xiguang Zheng, Chen Zhang, Bruno Masiero, Aurelio Uncini, Danilo Comminiello

    Abstract: The L3DAS22 Challenge is aimed at encouraging the development of machine learning strategies for 3D speech enhancement and 3D sound localization and detection in office-like environments. This challenge improves and extends the tasks of the L3DAS21 edition. We generated a new dataset, which maintains the same general characteristics of L3DAS21 datasets, but with an extended number of data points a… ▽ More

    Submitted 21 February, 2022; originally announced February 2022.

    Comments: Accepted to 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022). arXiv admin note: substantial text overlap with arXiv:2104.05499

    Journal ref: 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, pp. 9186-9190

  6. arXiv:2104.09641  [pdf, ps, other

    cs.LG cs.SD eess.AS eess.SP eess.SY

    A New Class of Efficient Adaptive Filters for Online Nonlinear Modeling

    Authors: Danilo Comminiello, Alireza Nezamdoust, Simone Scardapane, Michele Scarpiniti, Amir Hussain, Aurelio Uncini

    Abstract: Nonlinear models are known to provide excellent performance in real-world applications that often operate in non-ideal conditions. However, such applications often require online processing to be performed with limited computational resources. To address this problem, we propose a new class of efficient nonlinear models for online applications. The proposed algorithms are based on linear-in-the-pa… ▽ More

    Submitted 26 August, 2022; v1 submitted 19 April, 2021; originally announced April 2021.

    Comments: This work has been accepted for publication in IEEE Transactions on Systems, Man, and Cybernetics: Systems. Copyright may be transferred without notice, after which this version may no longer be accessible

  7. arXiv:2104.05499  [pdf, ps, other

    eess.AS cs.LG cs.SD eess.SP

    L3DAS21 Challenge: Machine Learning for 3D Audio Signal Processing

    Authors: Eric Guizzo, Riccardo F. Gramaccioni, Saeid Jamili, Christian Marinoni, Edoardo Massaro, Claudia Medaglia, Giuseppe Nachira, Leonardo Nucciarelli, Ludovica Paglialunga, Marco Pennese, Sveva Pepe, Enrico Rocchi, Aurelio Uncini, Danilo Comminiello

    Abstract: The L3DAS21 Challenge is aimed at encouraging and fostering collaborative research on machine learning for 3D audio signal processing, with particular focus on 3D speech enhancement (SE) and 3D sound localization and detection (SELD). Alongside with the challenge, we release the L3DAS21 dataset, a 65 hours 3D audio corpus, accompanied with a Python API that facilitates the data usage and results s… ▽ More

    Submitted 29 April, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: Documentation paper for the L3DAS21 Challenge for IEEE MLSP 2021. Further information on www.l3das.com/mlsp2021

    Journal ref: 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021, pp. 1-6

  8. A Quaternion-Valued Variational Autoencoder

    Authors: Eleonora Grassucci, Danilo Comminiello, Aurelio Uncini

    Abstract: Deep probabilistic generative models have achieved incredible success in many fields of application. Among such models, variational autoencoders (VAEs) have proved their ability in modeling a generative process by learning a latent representation of the input. In this paper, we propose a novel VAE defined in the quaternion domain, which exploits the properties of quaternion algebra to improve perf… ▽ More

    Submitted 22 April, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: Accepted for publication at the 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

    Journal ref: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, pp. 3310-3314

  9. Combined Sparse Regularization for Nonlinear Adaptive Filters

    Authors: Danilo Comminiello, Michele Scarpiniti, Simone Scardapane, Luis A. Azpicueta-Ruiz, Aurelio Uncini

    Abstract: Nonlinear adaptive filters often show some sparse behavior due to the fact that not all the coefficients are equally useful for the modeling of any nonlinearity. Recently, a class of proportionate algorithms has been proposed for nonlinear filters to leverage sparsity of their coefficients. However, the choice of the norm penalty of the cost function may be not always appropriate depending on the… ▽ More

    Submitted 24 July, 2020; originally announced July 2020.

    Comments: This is a corrected version of the paper presented at EUSIPCO 2018 and published on IEEE https://ieeexplore.ieee.org/document/8552955

    Journal ref: 2018 26th European Signal Processing Conference (EUSIPCO), Sep. 2018

  10. A Multimodal Deep Network for the Reconstruction of T2W MR Images

    Authors: Antonio Falvo, Danilo Comminiello, Simone Scardapane, Michele Scarpiniti, Aurelio Uncini

    Abstract: Multiple sclerosis is one of the most common chronic neurological diseases affecting the central nervous system. Lesions produced by the MS can be observed through two modalities of magnetic resonance (MR), known as T2W and FLAIR sequences, both providing useful information for formulating a diagnosis. However, long acquisition time makes the acquired MR image vulnerable to motion artifacts. This… ▽ More

    Submitted 24 February, 2020; v1 submitted 8 August, 2019; originally announced August 2019.

    Comments: 29th Italian Neural Networks Workshop (WIRN 2019)

    Journal ref: Progresses in Artificial Intelligence and Neural Systems. Smart Innovation, Systems and Technologies, vol 184. Springer, Singapore, Jul. 2020

  11. arXiv:1812.06811  [pdf, ps, other

    eess.AS cs.LG cs.SD

    Quaternion Convolutional Neural Networks for Detection and Localization of 3D Sound Events

    Authors: Danilo Comminiello, Marco Lella, Simone Scardapane, Aurelio Uncini

    Abstract: Learning from data in the quaternion domain enables us to exploit internal dependencies of 4D signals and treating them as a single entity. One of the models that perfectly suits with quaternion-valued data processing is represented by 3D acoustic signals in their spherical harmonics decomposition. In this paper, we address the problem of localizing and detecting sound events in the spatial sound… ▽ More

    Submitted 17 December, 2018; originally announced December 2018.

    Comments: Submitted to ICASSP 2019

    Journal ref: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019, pp. 8533-8537