Skip to main content

Showing 1–50 of 67 results for author: Scardapane, S

.
  1. arXiv:2406.11430  [pdf, other

    cs.CL cs.AI

    A Simple and Effective $L_2$ Norm-Based Strategy for KV Cache Compression

    Authors: Alessio Devoto, Yu Zhao, Simone Scardapane, Pasquale Minervini

    Abstract: The deployment of large language models (LLMs) is often hindered by the extensive memory requirements of the Key-Value (KV) cache, especially as context lengths increase. Existing approaches to reduce the KV cache size involve either fine-tuning the model to learn a compression strategy or leveraging attention scores to reduce the sequence length. We analyse the attention distributions in decoder-… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2406.06642  [pdf, other

    cs.LG

    TopoBenchmarkX: A Framework for Benchmarking Topological Deep Learning

    Authors: Lev Telyatnikov, Guillermo Bernardez, Marco Montagna, Pavlo Vasylenko, Ghada Zamzmi, Mustafa Hajij, Michael T Schaub, Nina Miolane, Simone Scardapane, Theodore Papamarkou

    Abstract: This work introduces TopoBenchmarkX, a modular open-source library designed to standardize benchmarking and accelerate research in Topological Deep Learning (TDL). TopoBenchmarkX maps the TDL pipeline into a sequence of independent and modular components for data loading and processing, as well as model training, optimization, and evaluation. This modular organization provides flexibility for modi… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  3. arXiv:2405.12222  [pdf, other

    eess.IV cs.AI cs.CV

    Influence based explainability of brain tumors segmentation in multimodal Magnetic Resonance Imaging

    Authors: Tommaso Torda, Andrea Ciardiello, Simona Gargiulo, Greta Grillo, Simone Scardapane, Cecilia Voena, Stefano Giagu

    Abstract: In recent years Artificial Intelligence has emerged as a fundamental tool in medical applications. Despite this rapid development, deep neural networks remain black boxes that are difficult to explain, and this represents a major limitation for their use in clinical practice. We focus on the segmentation of medical images task, where most explainability methods proposed so far provide a visual exp… ▽ More

    Submitted 5 April, 2024; originally announced May 2024.

    Comments: 15 pages, 7 figures

  4. arXiv:2405.02330  [pdf, other

    cs.IT cs.AI cs.LG

    Adaptive Semantic Token Selection for AI-native Goal-oriented Communications

    Authors: Alessio Devoto, Simone Petruzzi, Jary Pomponi, Paolo Di Lorenzo, Simone Scardapane

    Abstract: In this paper, we propose a novel design for AI-native goal-oriented communications, exploiting transformer neural networks under dynamic inference constraints on bandwidth and computation. Transformers have become the standard architecture for pretraining large-scale vision and text models, and preliminary results have shown promising performance also in deep joint source-channel coding (JSCC). H… ▽ More

    Submitted 25 April, 2024; originally announced May 2024.

    Comments: 5 pages

    MSC Class: 94A40

  5. arXiv:2404.17625  [pdf, other

    cs.LG cs.AI

    Alice's Adventures in a Differentiable Wonderland -- Volume I, A Tour of the Land

    Authors: Simone Scardapane

    Abstract: Neural networks surround us, in the form of large language models, speech transcription systems, molecular discovery algorithms, robotics, and much more. Stripped of anything else, neural networks are compositions of differentiable primitives, and studying them means learning how to program and how to interact with these models, a particular example of what is called differentiable programming.… ▽ More

    Submitted 4 July, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Comments: Companion website for additional chapters: https://www.sscardapane.it/alice-book

  6. arXiv:2403.07965  [pdf, other

    cs.LG cs.AI

    Conditional computation in neural networks: principles and research trends

    Authors: Simone Scardapane, Alessandro Baiocchi, Alessio Devoto, Valerio Marsocci, Pasquale Minervini, Jary Pomponi

    Abstract: This article summarizes principles and ideas from the emerging area of applying \textit{conditional computation} methods to the design of neural networks. In particular, we focus on neural networks that can dynamically activate or de-activate parts of their computational graph conditionally on their input. Examples include the dynamic selection of, e.g., input tokens, layers (or sets of layers), a… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Under review at Intelligenza Artificiale (IOS Press)

  7. arXiv:2402.08871  [pdf, other

    cs.LG stat.ML

    Position: Topological Deep Learning is the New Frontier for Relational Learning

    Authors: Theodore Papamarkou, Tolga Birdal, Michael Bronstein, Gunnar Carlsson, Justin Curry, Yue Gao, Mustafa Hajij, Roland Kwitt, Pietro Liò, Paolo Di Lorenzo, Vasileios Maroulas, Nina Miolane, Farzana Nasrin, Karthikeyan Natesan Ramamurthy, Bastian Rieck, Simone Scardapane, Michael T. Schaub, Petar Veličković, Bei Wang, Yusu Wang, Guo-Wei Wei, Ghada Zamzmi

    Abstract: Topological deep learning (TDL) is a rapidly evolving field that uses topological features to understand and design deep learning models. This paper posits that TDL is the new frontier for relational learning. TDL may complement graph representation learning and geometric deep learning by incorporating topological concepts, and can thus provide a natural choice for various machine learning setting… ▽ More

    Submitted 30 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  8. arXiv:2402.07573  [pdf, other

    eess.SP

    Goal-Oriented and Semantic Communication in 6G AI-Native Networks: The 6G-GOALS Approach

    Authors: Emilio Calvanese Strinati, Paolo Di Lorenzo, Vincenzo Sciancalepore, Adnan Aijaz, Marios Kountouris, Deniz Gündüz, Petar Popovski, Mohamed Sana, Photios A. Stavrou, Beatriz Soret, Nicola Cordeschi, Simone Scardapane, Mattia Merluzzi, Lanfranco Zanzi, Mauro Boldi Renato, Tony Quek, Nicola di Pietro, Olivier Forceville, Francesca Costanzo, Peizheng Li

    Abstract: Recent advances in AI technologies have notably expanded device intelligence, fostering federation and cooperation among distributed AI agents. These advancements impose new requirements on future 6G mobile network architectures. To meet these demands, it is essential to transcend classical boundaries and integrate communication, computation, control, and intelligence. This paper presents the 6G-G… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  9. arXiv:2402.02441  [pdf, other

    cs.LG cs.AI cs.MS stat.CO

    TopoX: A Suite of Python Packages for Machine Learning on Topological Domains

    Authors: Mustafa Hajij, Mathilde Papillon, Florian Frantzen, Jens Agerberg, Ibrahem AlJabea, Ruben Ballester, Claudio Battiloro, Guillermo Bernárdez, Tolga Birdal, Aiden Brent, Peter Chin, Sergio Escalera, Simone Fiorellino, Odin Hoff Gardaa, Gurusankar Gopalakrishnan, Devendra Govil, Josef Hoppe, Maneel Reddy Karri, Jude Khouja, Manuel Lecha, Neal Livesay, Jan Meißner, Soham Mukherjee, Alexander Nikitin, Theodore Papamarkou , et al. (18 additional authors not shown)

    Abstract: We introduce TopoX, a Python software suite that provides reliable and user-friendly building blocks for computing and machine learning on topological domains that extend graphs: hypergraphs, simplicial, cellular, path and combinatorial complexes. TopoX consists of three packages: TopoNetX facilitates constructing and computing on these domains, including working with nodes, edges and higher-order… ▽ More

    Submitted 17 February, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

  10. arXiv:2402.01262  [pdf, other

    cs.LG cs.CV

    Class incremental learning with probability dampening and cascaded gated classifier

    Authors: Jary Pomponi, Alessio Devoto, Simone Scardapane

    Abstract: Humans are capable of acquiring new knowledge and transferring learned knowledge into different domains, incurring a small forgetting. The same ability, called Continual Learning, is challenging to achieve when operating with neural networks due to the forgetting affecting past learned tasks when learning new ones. This forgetting can be mitigated by replaying stored samples from past tasks, but a… ▽ More

    Submitted 23 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Previously called "Cascaded Scaling Classifier: class incremental learning with probability scaling ". The official code is available https://github.com/jaryP/CIL-Margin-Dampening-Gated-Classifier

  11. arXiv:2401.14845  [pdf, other

    cs.CV cs.LG

    Adaptive Point Transformer

    Authors: Alessandro Baiocchi, Indro Spinelli, Alessandro Nicolosi, Simone Scardapane

    Abstract: The recent surge in 3D data acquisition has spurred the development of geometric deep learning models for point cloud processing, boosted by the remarkable success of transformers in natural language processing. While point cloud transformers (PTs) have achieved impressive results recently, their quadratic scaling with respect to the point cloud size poses a significant scalability challenge for r… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 26 pages, 8 figures, submitted to Neural Networs

  12. arXiv:2401.13330  [pdf, other

    cs.LG cs.CV

    NACHOS: Neural Architecture Search for Hardware Constrained Early Exit Neural Networks

    Authors: Matteo Gambella, Jary Pomponi, Simone Scardapane, Manuel Roveri

    Abstract: Early Exit Neural Networks (EENNs) endow astandard Deep Neural Network (DNN) with Early Exit Classifiers (EECs), to provide predictions at intermediate points of the processing when enough confidence in classification is achieved. This leads to many benefits in terms of effectiveness and efficiency. Currently, the design of EENNs is carried out manually by experts, a complex and time-consuming tas… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  13. arXiv:2312.10193  [pdf, other

    cs.LG

    Adaptive Computation Modules: Granular Conditional Computation For Efficient Inference

    Authors: Bartosz Wójcik, Alessio Devoto, Karol Pustelnik, Pasquale Minervini, Simone Scardapane

    Abstract: The computational cost of transformer models makes them inefficient in low-latency or low-power applications. While techniques such as quantization or linear attention can reduce the computational load, they may incur a reduction in accuracy. In addition, globally reducing the cost for all inputs may be sub-optimal. We observe that for each layer, the full width of the layer may be needed only for… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  14. arXiv:2310.07684  [pdf, other

    cs.AI cs.SI

    Hypergraph Neural Networks through the Lens of Message Passing: A Common Perspective to Homophily and Architecture Design

    Authors: Lev Telyatnikov, Maria Sofia Bucarelli, Guillermo Bernardez, Olga Zaghen, Simone Scardapane, Pietro Lio

    Abstract: Most of the current hypergraph learning methodologies and benchmarking datasets in the hypergraph realm are obtained by lifting procedures from their graph analogs, leading to overshadowing specific characteristics of hypergraphs. This paper attempts to confront some pending questions in that regard: Q1 Can the concept of homophily play a crucial role in Hypergraph Neural Networks (HNNs)? Q2 Is th… ▽ More

    Submitted 5 February, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  15. arXiv:2310.04361  [pdf, other

    cs.LG

    Exploiting Activation Sparsity with Dense to Dynamic-k Mixture-of-Experts Conversion

    Authors: Filip Szatkowski, Bartosz Wójcik, Mikołaj Piórczyński, Simone Scardapane

    Abstract: Transformer models can face practical limitations due to their high computational requirements. At the same time, such models exhibit significant activation sparsity, which can be leveraged to reduce the inference cost by converting parts of the network into equivalent Mixture-of-Experts (MoE) layers. Despite the crucial role played by activation sparsity, its impact on this process remains unexpl… ▽ More

    Submitted 7 June, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

  16. ICML 2023 Topological Deep Learning Challenge : Design and Results

    Authors: Mathilde Papillon, Mustafa Hajij, Helen Jenne, Johan Mathe, Audun Myers, Theodore Papamarkou, Tolga Birdal, Tamal Dey, Tim Doster, Tegan Emerson, Gurusankar Gopalakrishnan, Devendra Govil, Aldo Guzmán-Sáenz, Henry Kvinge, Neal Livesay, Soham Mukherjee, Shreyas N. Samaga, Karthikeyan Natesan Ramamurthy, Maneel Reddy Karri, Paul Rosen, Sophia Sanborn, Robin Walters, Jens Agerberg, Sadrodin Barikbin, Claudio Battiloro , et al. (31 additional authors not shown)

    Abstract: This paper presents the computational challenge on topological deep learning that was hosted within the ICML 2023 Workshop on Topology and Geometry in Machine Learning. The competition asked participants to provide open-source implementations of topological neural networks from the literature by contributing to the python packages TopoNetX (data processing) and TopoModelX (deep learning). The chal… ▽ More

    Submitted 18 January, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

  17. arXiv:2308.12844  [pdf, other

    cs.LG

    Probabilistic load forecasting with Reservoir Computing

    Authors: Michele Guerra, Simone Scardapane, Filippo Maria Bianchi

    Abstract: Some applications of deep learning require not only to provide accurate results but also to quantify the amount of confidence in their prediction. The management of an electric power grid is one of these cases: to avoid risky scenarios, decision-makers need both precise and reliable forecasts of, for example, power loads. For this reason, point forecasts are not enough hence it is necessary to ado… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  18. arXiv:2305.16174  [pdf, other

    cs.LG cs.AI cs.NE

    From Latent Graph to Latent Topology Inference: Differentiable Cell Complex Module

    Authors: Claudio Battiloro, Indro Spinelli, Lev Telyatnikov, Michael Bronstein, Simone Scardapane, Paolo Di Lorenzo

    Abstract: Latent Graph Inference (LGI) relaxed the reliance of Graph Neural Networks (GNNs) on a given graph topology by dynamically learning it. However, most of LGI methods assume to have a (noisy, incomplete, improvable, ...) input graph to rewire and can solely learn regular graph topologies. In the wake of the success of Topological Deep Learning (TDL), we study Latent Topology Inference (LTI) for lear… ▽ More

    Submitted 3 August, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Under review. 17 pages, 5 figures

  19. arXiv:2304.07750  [pdf, other

    cs.CV

    GeoMultiTaskNet: remote sensing unsupervised domain adaptation using geographical coordinates

    Authors: Valerio Marsocci, Nicolas Gonthier, Anatol Garioud, Simone Scardapane, Clément Mallet

    Abstract: Land cover maps are a pivotal element in a wide range of Earth Observation (EO) applications. However, annotating large datasets to develop supervised systems for remote sensing (RS) semantic segmentation is costly and time-consuming. Unsupervised Domain Adaption (UDA) could tackle these issues by adapting a model trained on a source domain, where labels are available, to a target domain, without… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

  20. arXiv:2304.07152  [pdf, other

    cs.LG

    Combining Stochastic Explainers and Subgraph Neural Networks can Increase Expressivity and Interpretability

    Authors: Indro Spinelli, Michele Guerra, Filippo Maria Bianchi, Simone Scardapane

    Abstract: Subgraph-enhanced graph neural networks (SGNN) can increase the expressive power of the standard message-passing framework. This model family represents each graph as a collection of subgraphs, generally extracted by random sampling or with hand-crafted heuristics. Our key observation is that by selecting "meaningful" subgraphs, besides improving the expressivity of a GNN, it is also possible to o… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

  21. arXiv:2302.11479  [pdf, other

    cs.LG stat.ML

    Drop Edges and Adapt: a Fairness Enforcing Fine-tuning for Graph Neural Networks

    Authors: Indro Spinelli, Riccardo Bianchini, Simone Scardapane

    Abstract: The rise of graph representation learning as the primary solution for many different network science tasks led to a surge of interest in the fairness of this family of methods. Link prediction, in particular, has a substantial social impact. However, link prediction algorithms tend to increase the segregation in social networks by disfavoring the links between individuals in specific demographic g… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

  22. arXiv:2210.10446  [pdf, other

    cs.LG cs.AI

    EGG-GAE: scalable graph neural networks for tabular data imputation

    Authors: Lev Telyatnikov, Simone Scardapane

    Abstract: Missing data imputation (MDI) is crucial when dealing with tabular datasets across various domains. Autoencoders can be trained to reconstruct missing values, and graph autoencoders (GAE) can additionally consider similar patterns in the dataset when imputing new values for a given instance. However, previously proposed GAEs suffer from scalability issues, requiring the user to define a similarity… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

  23. Explainability in subgraphs-enhanced Graph Neural Networks

    Authors: Michele Guerra, Indro Spinelli, Simone Scardapane, Filippo Maria Bianchi

    Abstract: Recently, subgraphs-enhanced Graph Neural Networks (SGNNs) have been introduced to enhance the expressive power of Graph Neural Networks (GNNs), which was proved to be not higher than the 1-dimensional Weisfeiler-Leman isomorphism test. The new paradigm suggests using subgraphs extracted from the input graph to improve the model's expressiveness, but the additional complexity exacerbates an alread… ▽ More

    Submitted 19 January, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: The source code implementing our workflow is publicly available online at https://github.com/MicheleUIT/Explaining_SGNN

  24. arXiv:2208.02048  [pdf, other

    cs.LG stat.ML

    Centroids Matching: an efficient Continual Learning approach operating in the embedding space

    Authors: Jary Pomponi, Simone Scardapane, Aurelio Uncini

    Abstract: Catastrophic forgetting (CF) occurs when a neural network loses the information previously learned while training on a set of samples from a different distribution, i.e., a new task. Existing approaches have achieved remarkable results in mitigating CF, especially in a scenario called task incremental learning. However, this scenario is not realistic, and limited work has been done to achieve good… ▽ More

    Submitted 10 September, 2022; v1 submitted 3 August, 2022; originally announced August 2022.

    Comments: Submitted to Transactions on Machine Learning Research (TMLR)

  25. arXiv:2205.15903  [pdf, other

    eess.IV cs.CV

    Inferring 3D change detection from bitemporal optical images

    Authors: Valerio Marsocci, Virginia Coletta, Roberta Ravanelli, Simone Scardapane, Mattia Crespi

    Abstract: Change detection is one of the most active research areas in Remote Sensing (RS). Most of the recently developed change detection methods are based on deep learning (DL) algorithms. This kind of algorithms is generally focused on generating two-dimensional (2D) change maps, thus only identifying planimetric changes in land use/land cover (LULC) and not considering nor returning any information on… ▽ More

    Submitted 16 January, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: https://doi.org/10.1016/j.isprsjprs.2022.12.009

    Journal ref: ISPRS Journal of Photogrammetry and Remote Sensing 196 (2023) 325-339

  26. arXiv:2205.11319  [pdf, other

    cs.CV

    Continual Barlow Twins: continual self-supervised learning for remote sensing semantic segmentation

    Authors: Valerio Marsocci, Simone Scardapane

    Abstract: In the field of Earth Observation (EO), Continual Learning (CL) algorithms have been proposed to deal with large datasets by decomposing them into several subsets and processing them incrementally. The majority of these algorithms assume that data is (a) coming from a single source, and (b) fully labeled. Real-world EO datasets are instead characterized by a large heterogeneity (e.g., coming from… ▽ More

    Submitted 9 January, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

  27. arXiv:2204.04020  [pdf, other

    cs.CV cs.LG

    Engagement Detection with Multi-Task Training in E-Learning Environments

    Authors: Onur Copur, Mert Nakıp, Simone Scardapane, Jürgen Slowack

    Abstract: Recognition of user interaction, in particular engagement detection, became highly crucial for online working and learning environments, especially during the COVID-19 outbreak. Such recognition and detection systems significantly improve the user experience and efficiency by providing valuable feedback. In this paper, we propose a novel Engagement Detection with Multi-Task Training (ED-MTT) syste… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

  28. arXiv:2204.02385  [pdf, other

    eess.AS cs.LG cs.SD

    Learning Speech Emotion Representations in the Quaternion Domain

    Authors: Eric Guizzo, Tillman Weyde, Simone Scardapane, Danilo Comminiello

    Abstract: The modeling of human emotion expression in speech signals is an important, yet challenging task. The high resource demand of speech emotion recognition models, combined with the the general scarcity of emotion-labelled data are obstacles to the development and application of effective solutions in this field. In this paper, we present an approach to jointly circumvent these difficulties. Our meth… ▽ More

    Submitted 3 March, 2023; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: Accepted for Publication in IEEE/ACM Transactions on Audio, Speech and Language Processing

  29. arXiv:2203.10974  [pdf, other

    cs.CV

    Towards Self-Supervised Gaze Estimation

    Authors: Arya Farkhondeh, Cristina Palmero, Simone Scardapane, Sergio Escalera

    Abstract: Recent joint embedding-based self-supervised methods have surpassed standard supervised approaches on various image recognition tasks such as image classification. These self-supervised methods aim at maximizing agreement between features extracted from two differently transformed views of the same image, which results in learning an invariant representation with respect to appearance and geometri… ▽ More

    Submitted 23 November, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

    Comments: BMVC 2022. For code and pre-trained models, visit https://github.com/aryafarkhondeh/SwAT

  30. arXiv:2202.05694  [pdf, other

    cs.LG stat.ML

    Continual Learning with Invertible Generative Models

    Authors: Jary Pomponi, Simone Scardapane, Aurelio Uncini

    Abstract: Catastrophic forgetting (CF) happens whenever a neural network overwrites past knowledge while being trained on new tasks. Common techniques to handle CF include regularization of the weights (using, e.g., their importance on past tasks), and rehearsal strategies, where the network is constantly re-trained on past data. Generative models have also been applied for the latter, in order to have endl… ▽ More

    Submitted 27 December, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2007.02443

  31. Pixle: a fast and effective black-box attack based on rearranging pixels

    Authors: Jary Pomponi, Simone Scardapane, Aurelio Uncini

    Abstract: Recent research has found that neural networks are vulnerable to several types of adversarial attacks, where the input samples are modified in such a way that the model produces a wrong prediction that misclassifies the adversarial sample. In this paper we focus on black-box adversarial attacks, that can be performed without knowing the inner structure of the attacked model, nor the training proce… ▽ More

    Submitted 4 February, 2022; originally announced February 2022.

  32. A Meta-Learning Approach for Training Explainable Graph Neural Networks

    Authors: Indro Spinelli, Simone Scardapane, Aurelio Uncini

    Abstract: In this paper, we investigate the degree of explainability of graph neural networks (GNNs). Existing explainers work by finding global/local subgraphs to explain a prediction, but they are applied after a GNN has already been trained. Here, we propose a meta-learning framework for improving the level of explainability of a GNN directly at training time, by steering the optimization procedure towar… ▽ More

    Submitted 20 December, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

  33. Structured Ensembles: an Approach to Reduce the Memory Footprint of Ensemble Methods

    Authors: Jary Pomponi, Simone Scardapane, Aurelio Uncini

    Abstract: In this paper, we propose a novel ensembling technique for deep neural networks, which is able to drastically reduce the required memory compared to alternative approaches. In particular, we propose to extract multiple sub-networks from a single, untrained neural network by solving an end-to-end optimization task combining differentiable scaling over the original architecture, with multiple regula… ▽ More

    Submitted 17 September, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: Article accepted at Neural Networks

  34. FairDrop: Biased Edge Dropout for Enhancing Fairness in Graph Representation Learning

    Authors: Indro Spinelli, Simone Scardapane, Amir Hussain, Aurelio Uncini

    Abstract: Graph representation learning has become a ubiquitous component in many scenarios, ranging from social network analysis to energy forecasting in smart grids. In several applications, ensuring the fairness of the node (or graph) representations with respect to some protected attributes is crucial for their correct deployment. Yet, fairness in graph deep learning remains under-explored, with few sol… ▽ More

    Submitted 27 December, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

    Comments: Submitted to a journal for the peer-review process

  35. arXiv:2104.09641  [pdf, ps, other

    cs.LG cs.SD eess.AS eess.SP eess.SY

    A New Class of Efficient Adaptive Filters for Online Nonlinear Modeling

    Authors: Danilo Comminiello, Alireza Nezamdoust, Simone Scardapane, Michele Scarpiniti, Amir Hussain, Aurelio Uncini

    Abstract: Nonlinear models are known to provide excellent performance in real-world applications that often operate in non-ideal conditions. However, such applications often require online processing to be performed with limited computational resources. To address this problem, we propose a new class of efficient nonlinear models for online applications. The proposed algorithms are based on linear-in-the-pa… ▽ More

    Submitted 26 August, 2022; v1 submitted 19 April, 2021; originally announced April 2021.

    Comments: This work has been accepted for publication in IEEE Transactions on Systems, Man, and Cybernetics: Systems. Copyright may be transferred without notice, after which this version may no longer be accessible

  36. arXiv:2104.00405  [pdf, other

    cs.LG cs.AI cs.CV

    Avalanche: an End-to-End Library for Continual Learning

    Authors: Vincenzo Lomonaco, Lorenzo Pellegrini, Andrea Cossu, Antonio Carta, Gabriele Graffieti, Tyler L. Hayes, Matthias De Lange, Marc Masana, Jary Pomponi, Gido van de Ven, Martin Mundt, Qi She, Keiland Cooper, Jeremy Forest, Eden Belouadah, Simone Calderara, German I. Parisi, Fabio Cuzzolin, Andreas Tolias, Simone Scardapane, Luca Antiga, Subutai Amhad, Adrian Popescu, Christopher Kanan, Joost van de Weijer , et al. (3 additional authors not shown)

    Abstract: Learning continually from non-stationary data streams is a long-standing goal and a challenging problem in machine learning. Recently, we have witnessed a renewed and fast-growing interest in continual learning, especially within the deep learning community. However, algorithmic solutions are often difficult to re-implement, evaluate and port across different settings, where even results on standa… ▽ More

    Submitted 1 April, 2021; originally announced April 2021.

    Comments: Official Website: https://avalanche.continualai.org

  37. Combined Sparse Regularization for Nonlinear Adaptive Filters

    Authors: Danilo Comminiello, Michele Scarpiniti, Simone Scardapane, Luis A. Azpicueta-Ruiz, Aurelio Uncini

    Abstract: Nonlinear adaptive filters often show some sparse behavior due to the fact that not all the coefficients are equally useful for the modeling of any nonlinearity. Recently, a class of proportionate algorithms has been proposed for nonlinear filters to leverage sparsity of their coefficients. However, the choice of the norm penalty of the cost function may be not always appropriate depending on the… ▽ More

    Submitted 24 July, 2020; originally announced July 2020.

    Comments: This is a corrected version of the paper presented at EUSIPCO 2018 and published on IEEE https://ieeexplore.ieee.org/document/8552955

    Journal ref: 2018 26th European Signal Processing Conference (EUSIPCO), Sep. 2018

  38. arXiv:2007.06281  [pdf, other

    cs.LG cs.NE stat.ML

    Distributed Training of Graph Convolutional Networks

    Authors: Simone Scardapane, Indro Spinelli, Paolo Di Lorenzo

    Abstract: The aim of this work is to develop a fully-distributed algorithmic framework for training graph convolutional networks (GCNs). The proposed method is able to exploit the meaningful relational structure of the input data, which are collected by a set of agents that communicate over a sparse network topology. After formulating the centralized GCN training problem, we first show how to make inference… ▽ More

    Submitted 7 January, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: Published on IEEE Transactions on Signal and Information Processing over Networks

    Journal ref: IEEE Transactions on Signal and Information Processing over Networks, vol. 7, pp. 87-100, 2021

  39. arXiv:2007.02443  [pdf, other

    stat.ML cs.CV cs.LG

    Pseudo-Rehearsal for Continual Learning with Normalizing Flows

    Authors: Jary Pomponi, Simone Scardapane, Aurelio Uncini

    Abstract: Catastrophic forgetting (CF) happens whenever a neural network overwrites past knowledge while being trained on new tasks. Common techniques to handle CF include regularization of the weights (using, e.g., their importance on past tasks), and rehearsal strategies, where the network is constantly re-trained on past data. Generative models have also been applied for the latter, in order to have endl… ▽ More

    Submitted 5 August, 2021; v1 submitted 5 July, 2020; originally announced July 2020.

    Comments: A preliminary unpublished version of this work was presented in the LifelongML workshop, at ICML 2020

  40. Distributed Stochastic Nonconvex Optimization and Learning based on Successive Convex Approximation

    Authors: Paolo Di Lorenzo, Simone Scardapane

    Abstract: We study distributed stochastic nonconvex optimization in multi-agent networks. We introduce a novel algorithmic framework for the distributed minimization of the sum of the expected value of a smooth (possibly nonconvex) function (the agents' sum-utility) plus a convex (possibly nonsmooth) regularizer. The proposed method hinges on successive convex approximation (SCA) techniques, leveraging dyna… ▽ More

    Submitted 12 May, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

    Comments: Proceedings of 2019 Asilomar Conference on Signals, Systems, and Computers

  41. arXiv:2004.12814  [pdf, other

    cs.NE cs.LG stat.ML

    Why should we add early exits to neural networks?

    Authors: Simone Scardapane, Michele Scarpiniti, Enzo Baccarelli, Aurelio Uncini

    Abstract: Deep neural networks are generally designed as a stack of differentiable layers, in which a prediction is obtained only after running the full stack. Recently, some contributions have proposed techniques to endow the networks with early exits, allowing to obtain predictions at intermediate points of the stack. These multi-output networks have a number of advantages, including: (i) significant redu… ▽ More

    Submitted 23 June, 2020; v1 submitted 27 April, 2020; originally announced April 2020.

    Comments: Published in Cognitive Computation

    Journal ref: Cognitive Computation, 2020

  42. Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

    Authors: Jary Pomponi, Simone Scardapane, Aurelio Uncini

    Abstract: Bayesian Neural Networks (BNNs) are trained to optimize an entire distribution over their weights instead of a single set, having significant advantages in terms of, e.g., interpretability, multi-task learning, and calibration. Because of the intractability of the resulting optimization problem, most BNNs are either sampled through Monte Carlo methods, or trained by minimizing a suitable Evidence… ▽ More

    Submitted 30 September, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

  43. arXiv:2002.12287  [pdf, other

    cs.LG cs.NE stat.ML

    Deep Randomized Neural Networks

    Authors: Claudio Gallicchio, Simone Scardapane

    Abstract: Randomized Neural Networks explore the behavior of neural systems where the majority of connections are fixed, either in a stochastic or a deterministic fashion. Typical examples of such systems consist of multi-layered neural network architectures where the connections to the hidden layer(s) are left untrained after initialization. Limiting the training algorithms to operate on a reduced set of w… ▽ More

    Submitted 2 February, 2021; v1 submitted 27 February, 2020; originally announced February 2020.

  44. Adaptive Propagation Graph Convolutional Network

    Authors: Indro Spinelli, Simone Scardapane, Aurelio Uncini

    Abstract: Graph convolutional networks (GCNs) are a family of neural network models that perform inference on graph data by interleaving vertex-wise operations and message-passing exchanges across nodes. Concerning the latter, two key questions arise: (i) how to design a differentiable exchange protocol (e.g., a 1-hop Laplacian smoothing in the original GCN), and (ii) how to characterize the trade-off in co… ▽ More

    Submitted 28 September, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: Published in IEEE Transaction on Neural Networks and Learning Systems

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems, 2020

  45. Efficient Continual Learning in Neural Networks with Embedding Regularization

    Authors: Jary Pomponi, Simone Scardapane, Vincenzo Lomonaco, Aurelio Uncini

    Abstract: Continual learning of deep neural networks is a key requirement for scaling them up to more complex applicative scenarios and for achieving real lifelong learning of these architectures. Previous approaches to the problem have considered either the progressive increase in the size of the networks, or have tried to regularize the network behavior to equalize it with respect to previously observed t… ▽ More

    Submitted 11 February, 2020; v1 submitted 9 September, 2019; originally announced September 2019.

    Journal ref: Neurocomputing, 397, pp. 139-148, 2020

  46. A Multimodal Deep Network for the Reconstruction of T2W MR Images

    Authors: Antonio Falvo, Danilo Comminiello, Simone Scardapane, Michele Scarpiniti, Aurelio Uncini

    Abstract: Multiple sclerosis is one of the most common chronic neurological diseases affecting the central nervous system. Lesions produced by the MS can be observed through two modalities of magnetic resonance (MR), known as T2W and FLAIR sequences, both providing useful information for formulating a diagnosis. However, long acquisition time makes the acquired MR image vulnerable to motion artifacts. This… ▽ More

    Submitted 24 February, 2020; v1 submitted 8 August, 2019; originally announced August 2019.

    Comments: 29th Italian Neural Networks Workshop (WIRN 2019)

    Journal ref: Progresses in Artificial Intelligence and Neural Systems. Smart Innovation, Systems and Technologies, vol 184. Springer, Singapore, Jul. 2020

  47. Compressing deep quaternion neural networks with targeted regularization

    Authors: Riccardo Vecchi, Simone Scardapane, Danilo Comminiello, Aurelio Uncini

    Abstract: In recent years, hyper-complex deep networks (such as complex-valued and quaternion-valued neural networks) have received a renewed interest in the literature. They find applications in multiple fields, ranging from image reconstruction to 3D audio processing. Similar to their real-valued counterparts, quaternion neural networks (QVNNs) require custom regularization strategies to avoid overfitting… ▽ More

    Submitted 13 July, 2020; v1 submitted 26 July, 2019; originally announced July 2019.

    Comments: Published on CAAI Transactions on Intelligence Technology, https://digital-library.theiet.org/content/journals/10.1049/trit.2020.0020

  48. arXiv:1906.08502  [pdf, other

    stat.ML cs.LG

    Efficient data augmentation using graph imputation neural networks

    Authors: Indro Spinelli, Simone Scardapane, Michele Scarpiniti, Aurelio Uncini

    Abstract: Recently, data augmentation in the semi-supervised regime, where unlabeled data vastly outnumbers labeled data, has received a considerable attention. In this paper, we describe an efficient technique for this task, exploiting a recent framework we proposed for missing data imputation called graph imputation neural network (GINN). The key idea is to leverage both supervised and unsupervised data t… ▽ More

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: Presented at the 2019 Italian Workshop on Neural Networks (WIRN'19)

  49. Missing Data Imputation with Adversarially-trained Graph Convolutional Networks

    Authors: Indro Spinelli, Simone Scardapane, Aurelio Uncini

    Abstract: Missing data imputation (MDI) is a fundamental problem in many scientific disciplines. Popular methods for MDI use global statistics computed from the entire data set (e.g., the feature-wise medians), or build predictive models operating independently on every instance. In this paper we propose a more general framework for MDI, leveraging recent work in the field of graph neural networks (GNNs). W… ▽ More

    Submitted 24 June, 2020; v1 submitted 6 May, 2019; originally announced May 2019.

    Comments: Published in Neural Networks (2020)

    Journal ref: Neural Networks, 129, pp. 249-260, 2020

  50. arXiv:1903.11990  [pdf, other

    stat.ML cs.LG

    On the Stability and Generalization of Learning with Kernel Activation Functions

    Authors: Michele Cirillo, Simone Scardapane, Steven Van Vaerenbergh, Aurelio Uncini

    Abstract: In this brief we investigate the generalization properties of a recently-proposed class of non-parametric activation functions, the kernel activation functions (KAFs). KAFs introduce additional parameters in the learning process in order to adapt nonlinearities individually on a per-neuron basis, exploiting a cheap kernel expansion of every activation value. While this increase in flexibility has… ▽ More

    Submitted 28 March, 2019; originally announced March 2019.

    Comments: Submitted as a brief paper to IEEE TNNLS