Showing 1–41 of 41 results for author: Comminiello, D

Searching in archive cs.
  1. arXiv:2405.09976  [pdf, other]

    cs.CV eess.SP

    Language-Oriented Semantic Latent Representation for Image Transmission

    Authors: Giordano Cicchetti, Eleonora Grassucci, Jihong Park, Jinho Choi, Sergio Barbarossa, Danilo Comminiello

    Abstract: In the new paradigm of semantic communication (SC), the focus is on delivering meanings behind bits by extracting semantic information from raw data. Recent advances in data-to-text models facilitate language-oriented SC, particularly for text-transformed image communication via image-to-text (I2T) encoding and text-to-image (T2I) decoding. However, although semantically aligned, the text is too c…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: Under review at IEEE International Workshop on Machine Learning for Signal Processing (MLSP) 2024

  2. arXiv:2405.09866  [pdf, other]

    eess.SP cs.LG

    Rethinking Multi-User Semantic Communications with Deep Generative Models

    Authors: Eleonora Grassucci, Jinho Choi, Jihong Park, Riccardo F. Gramaccioni, Giordano Cicchetti, Danilo Comminiello

    Abstract: In recent years, novel communication strategies have emerged to face the challenges that the increased number of connected devices and the higher quality of transmitted information are posing. Among them, semantic communication obtained promising results especially when combined with state-of-the-art deep generative models, such as large language or diffusion models, able to regenerate content fro…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: Under review in IEEE Journal on Selected Areas in Communications

  3. arXiv:2405.07024  [pdf, other]

    cs.LG eess.SP

    Demystifying the Hypercomplex: Inductive Biases in Hypercomplex Deep Learning

    Authors: Danilo Comminiello, Eleonora Grassucci, Danilo P. Mandic, Aurelio Uncini

    Abstract: Hypercomplex algebras have recently been gaining prominence in the field of deep learning owing to the advantages of their division algebras over real vector spaces and their superior results when dealing with multidimensional signals in real-world 3D and 4D paradigms. This paper provides a foundational framework that serves as a roadmap for understanding why hypercomplex deep learning methods are…

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: Accepted for Publication in IEEE Signal Processing Magazine

  4. arXiv:2405.05015  [pdf, other]

    cs.LG

    Concrete Dense Network for Long-Sequence Time Series Clustering

    Authors: Redemptor Jr Laceda Taloma, Patrizio Pisani, Danilo Comminiello

    Abstract: Time series clustering is fundamental in data analysis for discovering temporal patterns. Despite recent advancements, learning cluster-friendly representations is still challenging, particularly with long and complex time series. Deep temporal clustering methods have been trying to integrate the canonical k-means into end-to-end training of neural networks but fall back on surrogate losses due to…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Under review in IEEE Transactions on Pattern Analysis and Machine Intelligence

  5. arXiv:2405.02961  [pdf, other]

    cs.CV eess.IV

    JOSENet: A Joint Stream Embedding Network for Violence Detection in Surveillance Videos

    Authors: Pietro Nardelli, Danilo Comminiello

    Abstract: Due to the ever-increasing availability of video surveillance cameras and the growing need for crime prevention, the violence detection task is attracting greater attention from the research community. With respect to other action recognition tasks, violence detection in surveillance videos shows additional issues, such as the presence of a significant variety of real fight scenes. Unfortunately,…

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: Submitted to the International Journal of Computer Vision

  6. arXiv:2404.05669  [pdf, other]

    cs.CV

    NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement

    Authors: Giordano Cicchetti, Danilo Comminiello

    Abstract: Real-world documents may suffer various forms of degradation, often resulting in lower accuracy in optical character recognition (OCR) systems. Therefore, a crucial preprocessing step is essential to eliminate noise while preserving text and key features of documents. In this paper, we propose NAF-DPM, a novel generative framework based on a diffusion probabilistic model (DPM) designed to restore…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Under review at IEEE Transactions on Pattern Analysis and Machine Intelligence

  7. arXiv:2403.18370  [pdf, other]

    cs.CV cs.LG

    Ship in Sight: Diffusion Models for Ship-Image Super Resolution

    Authors: Luigi Sigillo, Riccardo Fosco Gramaccioni, Alessandro Nicolosi, Danilo Comminiello

    Abstract: In recent years, remarkable advancements have been achieved in the field of image generation, primarily driven by the escalating demand for high-quality outcomes across various image generation subtasks, such as inpainting, denoising, and super resolution. A major effort is devoted to exploring the application of super-resolution techniques to enhance the quality of low-resolution images. In this…

    Submitted 21 May, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted at 2024 International Joint Conference on Neural Networks (IJCNN)

  8. arXiv:2403.17929  [pdf, other]

    cs.CV

    Towards Explaining Hypercomplex Neural Networks

    Authors: Eleonora Lopez, Eleonora Grassucci, Debora Capriotti, Danilo Comminiello

    Abstract: Hypercomplex neural networks are gaining increasing interest in the deep learning community. The attention directed towards hypercomplex models originates from several aspects, spanning from purely theoretical and mathematical characteristics to the practical advantage of lightweight models over conventional networks, and their unique properties to capture both global and local relations. In parti…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: The paper has been accepted at IEEE WCCI 2024

  9. arXiv:2403.15649  [pdf, other]

    cs.IT

    Semantic Communication Challenges: Understanding Dos and Avoiding Don'ts

    Authors: Jinho Choi, Jihong Park, Eleonora Grassucci, Danilo Comminiello

    Abstract: Semantic communication, emerging as a promising paradigm for data transmission, offers an innovative departure from the constraints of Shannon theory, heralding significant advancements in future communication technologies. Despite the proliferation of proposed approaches, there are still numerous challenges. In this paper, we review current semantic communication methodologies and shed light on p…

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 5 pages, IEEE vehicular technology conference, Spring 2023

  10. arXiv:2402.09245  [pdf, other]

    eess.AS cs.LG eess.SP

    Overview of the L3DAS23 Challenge on Audio-Visual Extended Reality

    Authors: Christian Marinoni, Riccardo Fosco Gramaccioni, Changan Chen, Aurelio Uncini, Danilo Comminiello

    Abstract: The primary goal of the L3DAS23 Signal Processing Grand Challenge at ICASSP 2023 is to promote and support collaborative research on machine learning for 3D audio signal processing, with a specific emphasis on 3D speech enhancement and 3D Sound Event Localization and Detection in Extended Reality applications. As part of our latest competition, we provide a brand-new dataset, which maintains the s…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Accepted to 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)

  11. arXiv:2401.06803  [pdf, other]

    cs.CL cs.LG

    Generative AI Meets Semantic Communication: Evolution and Revolution of Communication Tasks

    Authors: Eleonora Grassucci, Jihong Park, Sergio Barbarossa, Seong-Lyun Kim, Jinho Choi, Danilo Comminiello

    Abstract: While deep generative models are showing exciting abilities in computer vision and natural language processing, their adoption in communication frameworks is still far underestimated. These methods are demonstrated to evolve solutions to classic communication problems such as denoising, restoration, or compression. Nevertheless, generative models can unveil their real potential in semantic communi…

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: Under consideration in IEEE Network Special Issue "The Interplay Between Generative AI and 5G-Advanced toward 6G"

  12. arXiv:2311.00635  [pdf, other]

    cs.IR

    GATSY: Graph Attention Network for Music Artist Similarity

    Authors: Andrea Giuseppe Di Francesco, Giuliano Giampietro, Indro Spinelli, Danilo Comminiello

    Abstract: The artist similarity quest has become a crucial subject in social and scientific contexts. Modern research solutions facilitate music discovery according to user tastes. However, defining similarity among artists may involve several aspects, even related to a subjective perspective, and it often affects a recommendation. This paper presents GATSY, a recommendation system built upon graph attentio…

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 6 pages, Submitted to MLSP 2023

  13. arXiv:2310.15247  [pdf, other]

    cs.SD cs.CV cs.LG cs.MM eess.AS

    SyncFusion: Multimodal Onset-synchronized Video-to-Audio Foley Synthesis

    Authors: Marco Comunità, Riccardo F. Gramaccioni, Emilian Postolache, Emanuele Rodolà, Danilo Comminiello, Joshua D. Reiss

    Abstract: Sound design involves creatively selecting, recording, and editing sound effects for various media like cinema, video games, and virtual/augmented reality. One of the most time-consuming steps when designing sound is synchronizing audio with video. In some cases, environmental recordings from video shoots are available, which can aid in the process. However, in video games and animations, no refer…

    Submitted 23 October, 2023; originally announced October 2023.

  14. arXiv:2310.10224  [pdf, other]

    eess.IV cs.CV cs.LG

    Generalizing Medical Image Representations via Quaternion Wavelet Networks

    Authors: Luigi Sigillo, Eleonora Grassucci, Aurelio Uncini, Danilo Comminiello

    Abstract: Neural network generalizability is becoming a broad research field due to the increasing availability of datasets from different sources and for various tasks. This issue is even wider when processing medical data, where a lack of methodological standards causes large variations being provided by different imaging centers or acquired with various devices and cofactors. To overcome these limitation…

    Submitted 17 January, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: This paper is currently under review

  15. arXiv:2310.07648  [pdf, other]

    cs.HC cs.LG eess.SP

    Hypercomplex Multimodal Emotion Recognition from EEG and Peripheral Physiological Signals

    Authors: Eleonora Lopez, Eleonora Chiarantano, Eleonora Grassucci, Danilo Comminiello

    Abstract: Multimodal emotion recognition from physiological signals is receiving an increasing amount of attention due to the impossibility to control them at will unlike behavioral reactions, thus providing more reliable information. Existing deep learning-based methods still rely on extracted handcrafted features, not taking full advantage of the learning ability of neural networks, and often adopt a sing…

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: Published at IEEE ICASSP workshops 2023

  16. arXiv:2310.07633  [pdf, other]

    eess.IV cs.CV

    Attention-Map Augmentation for Hypercomplex Breast Cancer Classification

    Authors: Eleonora Lopez, Filippo Betello, Federico Carmignani, Eleonora Grassucci, Danilo Comminiello

    Abstract: Breast cancer is the most widespread neoplasm among women and early detection of this disease is critical. Deep learning techniques have become of great interest to improve diagnostic performance. However, distinguishing between malignant and benign masses in whole mammograms poses a challenge, as they appear nearly identical to an untrained eye, and the region of interest (ROI) constitutes only a…

    Submitted 23 April, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: Published in Elsevier Pattern Recognition Letters

  17. arXiv:2310.07623  [pdf, other]

    cs.AI cs.CV

    Dual Quaternion Rotational and Translational Equivariance in 3D Rigid Motion Modelling

    Authors: Guilherme Vieira, Eleonora Grassucci, Marcos Eduardo Valle, Danilo Comminiello

    Abstract: Objects' rigid motions in 3D space are described by rotations and translations of a highly-correlated set of points, each with associated $x,y,z$ coordinates that real-valued networks consider as separate entities, losing information. Previous works exploit quaternion algebra and their ability to model rotations in 3D space. However, these algebras do not properly encode translations, leading to s…

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: Accepted at IEEE MLSP 2023 (Honorable Mention Top 10% Outstanding Paper)

  18. arXiv:2310.07612  [pdf, other]

    cs.LG cs.AI cs.ET

    PHYDI: Initializing Parameterized Hypercomplex Neural Networks as Identity Functions

    Authors: Matteo Mancanelli, Eleonora Grassucci, Aurelio Uncini, Danilo Comminiello

    Abstract: Neural models based on hypercomplex algebra systems are growing and proliferating for a plethora of applications, ranging from computer vision to natural language processing. Hand in hand with their adoption, parameterized hypercomplex neural networks (PHNNs) are growing in size and no techniques have been adopted so far to control their convergence at a large scale. In this paper, we study PHNNs…

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: Accepted at IEEE MLSP 2023 (Honorable Mention TOP 5% Outstanding Papers)

  19. arXiv:2309.07195  [pdf, other]

    cs.SD cs.ET eess.AS

    Diffusion models for audio semantic communication

    Authors: Eleonora Grassucci, Christian Marinoni, Andrea Rodriguez, Danilo Comminiello

    Abstract: Directly sending audio signals from a transmitter to a receiver across a noisy channel may absorb consistent bandwidth and be prone to errors when trying to recover the transmitted bits. On the contrary, the recent semantic communication approach proposes to send the semantics and then regenerate semantically consistent content at the receiver without exactly recovering the bitstream. In this pape…

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: Submitted to IEEE ICASSP 2024

  20. arXiv:2309.02478  [pdf, other]

    cs.LG cs.AI eess.SP

    Enhancing Semantic Communication with Deep Generative Models -- An ICASSP Special Session Overview

    Authors: Eleonora Grassucci, Yuki Mitsufuji, Ping Zhang, Danilo Comminiello

    Abstract: Semantic communication is poised to play a pivotal role in shaping the landscape of future AI-driven communication systems. Its challenge of extracting semantic information from the original complex content and regenerating semantically consistent data at the receiver, possibly being robust to channel corruptions, can be addressed with deep generative models. This ICASSP special session overview p…

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Submitted to IEEE ICASSP

  21. arXiv:2306.04321  [pdf, other]

    cs.AI cs.MM

    Generative Semantic Communication: Diffusion Models Beyond Bit Recovery

    Authors: Eleonora Grassucci, Sergio Barbarossa, Danilo Comminiello

    Abstract: Semantic communication is expected to be one of the cores of next-generation AI-based communications. One of the possibilities offered by semantic communication is the capability to regenerate, at the destination side, images or videos semantically equivalent to the transmitted ones, without necessarily recovering the transmitted sequence of bits. The current solutions still lack the ability to bu…

    Submitted 7 June, 2023; originally announced June 2023.

  22. arXiv:2305.10882  [pdf, other]

    cs.CV cs.LG eess.IV

    StawGAN: Structural-Aware Generative Adversarial Networks for Infrared Image Translation

    Authors: Luigi Sigillo, Eleonora Grassucci, Danilo Comminiello

    Abstract: This paper addresses the problem of translating night-time thermal infrared images, which are the most adopted image modalities to analyze night-time scenes, to daytime color images (NTIT2DC), which provide better perceptions of objects. We introduce a novel model that focuses on enhancing the quality of the target generation without merely colorizing it. The proposed structural aware (StawGAN) en…

    Submitted 18 May, 2023; originally announced May 2023.

    Journal ref: 2023 IEEE International Symposium on Circuits and Systems (ISCAS)

  23. Hypercomplex Image-to-Image Translation

    Authors: Eleonora Grassucci, Luigi Sigillo, Aurelio Uncini, Danilo Comminiello

    Abstract: Image-to-image translation (I2I) aims at transferring the content representation from an input domain to an output one, bouncing along different target domains. Recent I2I generative models, which gain outstanding results in this task, comprise a set of diverse deep networks each with tens of million parameters. Moreover, images are usually three-dimensional being composed of RGB channels and comm…

    Submitted 4 May, 2022; originally announced May 2022.

    Journal ref: 2022 International Joint Conference on Neural Networks (IJCNN)

  24. arXiv:2204.05798  [pdf, other]

    cs.CV cs.AI cs.LG

    Multi-View Hypercomplex Learning for Breast Cancer Screening

    Authors: Eleonora Lopez, Eleonora Grassucci, Martina Valleriani, Danilo Comminiello

    Abstract: Traditionally, deep learning methods for breast cancer classification perform a single-view analysis. However, radiologists simultaneously analyze all four views that compose a mammography exam, owing to the correlations contained in mammography views, which present crucial information for identifying tumors. In light of this, some studies have started to propose multi-view methods. Nevertheless,…

    Submitted 4 March, 2024; v1 submitted 12 April, 2022; originally announced April 2022.

    Comments: This paper has been submitted to IEEE Transactions on Pattern Analysis and Machine Intelligence

  25. arXiv:2204.02385  [pdf, other]

    eess.AS cs.LG cs.SD

    Learning Speech Emotion Representations in the Quaternion Domain

    Authors: Eric Guizzo, Tillman Weyde, Simone Scardapane, Danilo Comminiello

    Abstract: The modeling of human emotion expression in speech signals is an important, yet challenging task. The high resource demand of speech emotion recognition models, combined with the general scarcity of emotion-labelled data are obstacles to the development and application of effective solutions in this field. In this paper, we present an approach to jointly circumvent these difficulties. Our meth…

    Submitted 3 March, 2023; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: Accepted for Publication in IEEE/ACM Transactions on Audio, Speech and Language Processing

  26. arXiv:2204.01851  [pdf, other]

    eess.AS cs.LG cs.SD

    Dual Quaternion Ambisonics Array for Six-Degree-of-Freedom Acoustic Representation

    Authors: Eleonora Grassucci, Gioia Mancini, Christian Brignone, Aurelio Uncini, Danilo Comminiello

    Abstract: Spatial audio methods are gaining a growing interest due to the spread of immersive audio experiences and applications, such as virtual and augmented reality. For these purposes, 3D audio signals are often acquired through arrays of Ambisonics microphones, each comprising four capsules that decompose the sound field in spherical harmonics. In this paper, we propose a dual quaternion representation…

    Submitted 14 December, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

    Comments: Paper accepted for publication in Elsevier Pattern Recognition Letters

  27. arXiv:2202.10372  [pdf, other]

    eess.AS cs.LG cs.SD eess.SP

    L3DAS22 Challenge: Learning 3D Audio Sources in a Real Office Environment

    Authors: Eric Guizzo, Christian Marinoni, Marco Pennese, Xinlei Ren, Xiguang Zheng, Chen Zhang, Bruno Masiero, Aurelio Uncini, Danilo Comminiello

    Abstract: The L3DAS22 Challenge is aimed at encouraging the development of machine learning strategies for 3D speech enhancement and 3D sound localization and detection in office-like environments. This challenge improves and extends the tasks of the L3DAS21 edition. We generated a new dataset, which maintains the same general characteristics of L3DAS21 datasets, but with an extended number of data points a…

    Submitted 21 February, 2022; originally announced February 2022.

    Comments: Accepted to 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022). arXiv admin note: substantial text overlap with arXiv:2104.05499

    Journal ref: 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, pp. 9186-9190

  28. PHNNs: Lightweight Neural Networks via Parameterized Hypercomplex Convolutions

    Authors: Eleonora Grassucci, Aston Zhang, Danilo Comminiello

    Abstract: Hypercomplex neural networks have proven to reduce the overall number of parameters while ensuring valuable performance by leveraging the properties of Clifford algebras. Recently, hypercomplex linear layers have been further improved by involving efficient parameterized Kronecker products. In this paper, we define the parameterization of hypercomplex convolutional layers and introduce the family…

    Submitted 19 September, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: Submitted to IEEE Transactions on Neural Networks and Learning Systems

  29. arXiv:2104.09641  [pdf, ps, other]

    cs.LG cs.SD eess.AS eess.SP eess.SY

    A New Class of Efficient Adaptive Filters for Online Nonlinear Modeling

    Authors: Danilo Comminiello, Alireza Nezamdoust, Simone Scardapane, Michele Scarpiniti, Amir Hussain, Aurelio Uncini

    Abstract: Nonlinear models are known to provide excellent performance in real-world applications that often operate in non-ideal conditions. However, such applications often require online processing to be performed with limited computational resources. To address this problem, we propose a new class of efficient nonlinear models for online applications. The proposed algorithms are based on linear-in-the-pa…

    Submitted 26 August, 2022; v1 submitted 19 April, 2021; originally announced April 2021.

    Comments: This work has been accepted for publication in IEEE Transactions on Systems, Man, and Cybernetics: Systems. Copyright may be transferred without notice, after which this version may no longer be accessible

  30. arXiv:2104.09630  [pdf, other]

    cs.LG cs.AI cs.CV eess.IV

    Quaternion Generative Adversarial Networks

    Authors: Eleonora Grassucci, Edoardo Cicero, Danilo Comminiello

    Abstract: Latest Generative Adversarial Networks (GANs) are gathering outstanding results through a large-scale training, thus employing models composed of millions of parameters requiring extensive computational capabilities. Building such huge models undermines their replicability and increases the training instability. Moreover, multi-channel data, such as images or audio, are usually processed by realva…

    Submitted 27 July, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

    Comments: Accepted as a Chapter for the SPRINGER book "Generative Adversarial Learning: Architectures and Applications"

    Journal ref: Generative Adversarial Learning: Architectures and Applications. Intelligent Systems Reference Library, vol 217. Springer, Cham, Feb. 2022

  31. arXiv:2104.05499  [pdf, ps, other]

    eess.AS cs.LG cs.SD eess.SP

    L3DAS21 Challenge: Machine Learning for 3D Audio Signal Processing

    Authors: Eric Guizzo, Riccardo F. Gramaccioni, Saeid Jamili, Christian Marinoni, Edoardo Massaro, Claudia Medaglia, Giuseppe Nachira, Leonardo Nucciarelli, Ludovica Paglialunga, Marco Pennese, Sveva Pepe, Enrico Rocchi, Aurelio Uncini, Danilo Comminiello

    Abstract: The L3DAS21 Challenge is aimed at encouraging and fostering collaborative research on machine learning for 3D audio signal processing, with particular focus on 3D speech enhancement (SE) and 3D sound localization and detection (SELD). Alongside the challenge, we release the L3DAS21 dataset, a 65-hour 3D audio corpus, accompanied by a Python API that facilitates the data usage and results s…

    Submitted 29 April, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: Documentation paper for the L3DAS21 Challenge for IEEE MLSP 2021. Further information on www.l3das.com/mlsp2021

    Journal ref: 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021, pp. 1-6

  32. A Quaternion-Valued Variational Autoencoder

    Authors: Eleonora Grassucci, Danilo Comminiello, Aurelio Uncini

    Abstract: Deep probabilistic generative models have achieved incredible success in many fields of application. Among such models, variational autoencoders (VAEs) have proved their ability in modeling a generative process by learning a latent representation of the input. In this paper, we propose a novel VAE defined in the quaternion domain, which exploits the properties of quaternion algebra to improve perf…

    Submitted 22 April, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: Accepted for publication at the 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

    Journal ref: 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, pp. 3310-3314

  33. A Multimodal Deep Network for the Reconstruction of T2W MR Images

    Authors: Antonio Falvo, Danilo Comminiello, Simone Scardapane, Michele Scarpiniti, Aurelio Uncini

    Abstract: Multiple sclerosis is one of the most common chronic neurological diseases affecting the central nervous system. Lesions produced by the MS can be observed through two modalities of magnetic resonance (MR), known as T2W and FLAIR sequences, both providing useful information for formulating a diagnosis. However, long acquisition time makes the acquired MR image vulnerable to motion artifacts. This…

    Submitted 24 February, 2020; v1 submitted 8 August, 2019; originally announced August 2019.

    Comments: 29th Italian Neural Networks Workshop (WIRN 2019)

    Journal ref: Progresses in Artificial Intelligence and Neural Systems. Smart Innovation, Systems and Technologies, vol 184. Springer, Singapore, Jul. 2020

  34. Compressing deep quaternion neural networks with targeted regularization

    Authors: Riccardo Vecchi, Simone Scardapane, Danilo Comminiello, Aurelio Uncini

    Abstract: In recent years, hyper-complex deep networks (such as complex-valued and quaternion-valued neural networks) have received a renewed interest in the literature. They find applications in multiple fields, ranging from image reconstruction to 3D audio processing. Similar to their real-valued counterparts, quaternion neural networks (QVNNs) require custom regularization strategies to avoid overfitting…

    Submitted 13 July, 2020; v1 submitted 26 July, 2019; originally announced July 2019.

    Comments: Published on CAAI Transactions on Intelligence Technology, https://digital-library.theiet.org/content/journals/10.1049/trit.2020.0020

  35. arXiv:1902.02085  [pdf, other]

    cs.NE

    Widely Linear Kernels for Complex-Valued Kernel Activation Functions

    Authors: Simone Scardapane, Steven Van Vaerenbergh, Danilo Comminiello, Aurelio Uncini

    Abstract: Complex-valued neural networks (CVNNs) have been shown to be powerful nonlinear approximators when the input data can be properly modeled in the complex domain. One of the major challenges in scaling up CVNNs in practice is the design of complex activation functions. Recently, we proposed a novel framework for learning these activation functions neuron-wise in a data-dependent fashion, based on a…

    Submitted 6 February, 2019; originally announced February 2019.

    Comments: Accepted at ICASSP 2019

  36. arXiv:1812.06811  [pdf, ps, other]

    eess.AS cs.LG cs.SD

    Quaternion Convolutional Neural Networks for Detection and Localization of 3D Sound Events

    Authors: Danilo Comminiello, Marco Lella, Simone Scardapane, Aurelio Uncini

    Abstract: Learning from data in the quaternion domain enables us to exploit internal dependencies of 4D signals and treat them as a single entity. One of the models that perfectly suits quaternion-valued data processing is represented by 3D acoustic signals in their spherical harmonics decomposition. In this paper, we address the problem of localizing and detecting sound events in the spatial sound…

    Submitted 17 December, 2018; originally announced December 2018.

    Comments: Submitted to ICASSP 2019

    Journal ref: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019, pp. 8533-8537

  37. arXiv:1807.04065  [pdf, other]

    cs.NE cs.LG stat.ML

    Recurrent Neural Networks with Flexible Gates using Kernel Activation Functions

    Authors: Simone Scardapane, Steven Van Vaerenbergh, Danilo Comminiello, Simone Totaro, Aurelio Uncini

    Abstract: Gated recurrent neural networks have achieved remarkable results in the analysis of sequential data. Inside these networks, gates are used to control the flow of information, allowing to model even very long-term dependencies in the data. In this paper, we investigate whether the original gate equation (a linear projection followed by an element-wise sigmoid) can be improved. In particular, we des…

    Submitted 11 July, 2018; originally announced July 2018.

    Comments: Accepted for presentation at 2018 IEEE International Workshop on Machine Learning for Signal Processing (MLSP)

  38. arXiv:1802.09405  [pdf, other]

    cs.NE cs.LG stat.ML

    Improving Graph Convolutional Networks with Non-Parametric Activation Functions

    Authors: Simone Scardapane, Steven Van Vaerenbergh, Danilo Comminiello, Aurelio Uncini

    Abstract: Graph neural networks (GNNs) are a class of neural networks that allow to efficiently perform inference on data that is associated to a graph structure, such as, e.g., citation networks or knowledge graphs. While several variants of GNNs have been proposed, they only consider simple nonlinear activation functions in their layers, such as rectifiers or squashing functions. In this paper, we investi…

    Submitted 26 February, 2018; originally announced February 2018.

    Comments: Submitted to EUSIPCO 2018

  39. Group Sparse Regularization for Deep Neural Networks

    Authors: Simone Scardapane, Danilo Comminiello, Amir Hussain, Aurelio Uncini

    Abstract: In this paper, we consider the joint task of simultaneously optimizing (i) the weights of a deep neural network, (ii) the number of neurons for each hidden layer, and (iii) the subset of active input features (i.e., feature selection). While these problems are generally dealt with separately, we present a simple regularized formulation allowing to solve all three of them in parallel, using standar…

    Submitted 2 July, 2016; originally announced July 2016.

  40. arXiv:1605.07833  [pdf, other]

    cs.LG

    Effective Blind Source Separation Based on the Adam Algorithm

    Authors: Michele Scarpiniti, Simone Scardapane, Danilo Comminiello, Raffaele Parisi, Aurelio Uncini

    Abstract: In this paper, we derive a modified InfoMax algorithm for the solution of Blind Signal Separation (BSS) problems by using advanced stochastic methods. The proposed approach is based on a novel stochastic optimization approach known as the Adaptive Moment Estimation (Adam) algorithm. The proposed BSS solution can benefit from the excellent properties of the Adam approach. In order to derive the new…

    Submitted 26 September, 2016; v1 submitted 25 May, 2016; originally announced May 2016.

    Comments: Revised version after review process. This paper has been presented at the 26th Italian Workshop on Neural Networks (WIRN2016) May 18-20, Vietri sul Mare, Salerno, Italy. It will be published soon as a chapter in a book of the Springer Smart Innovation, Systems and Technologies series

  41. arXiv:1605.05509  [pdf, other]

    stat.ML cs.LG cs.NE

    Learning activation functions from data using cubic spline interpolation

    Authors: Simone Scardapane, Michele Scarpiniti, Danilo Comminiello, Aurelio Uncini

    Abstract: Neural networks require a careful design in order to perform properly on a given task. In particular, selecting a good activation function (possibly in a data-dependent fashion) is a crucial step, which remains an open problem in the research community. Despite a large amount of investigations, most current implementations simply select one fixed function from a small set of candidates, which is n…

    Submitted 11 May, 2017; v1 submitted 18 May, 2016; originally announced May 2016.

    Comments: Submitted to the 27th Italian Workshop on Neural Networks (WIRN 2017)

    Journal ref: Neural Advances in Processing Nonlinear Dynamic Signals, 2017