Skip to main content

Showing 1–31 of 31 results for author: Scardapane, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2402.08871  [pdf, other

    cs.LG stat.ML

    Position: Topological Deep Learning is the New Frontier for Relational Learning

    Authors: Theodore Papamarkou, Tolga Birdal, Michael Bronstein, Gunnar Carlsson, Justin Curry, Yue Gao, Mustafa Hajij, Roland Kwitt, Pietro Liò, Paolo Di Lorenzo, Vasileios Maroulas, Nina Miolane, Farzana Nasrin, Karthikeyan Natesan Ramamurthy, Bastian Rieck, Simone Scardapane, Michael T. Schaub, Petar Veličković, Bei Wang, Yusu Wang, Guo-Wei Wei, Ghada Zamzmi

    Abstract: Topological deep learning (TDL) is a rapidly evolving field that uses topological features to understand and design deep learning models. This paper posits that TDL is the new frontier for relational learning. TDL may complement graph representation learning and geometric deep learning by incorporating topological concepts, and can thus provide a natural choice for various machine learning setting… ▽ More

    Submitted 30 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  2. arXiv:2402.02441  [pdf, other

    cs.LG cs.AI cs.MS stat.CO

    TopoX: A Suite of Python Packages for Machine Learning on Topological Domains

    Authors: Mustafa Hajij, Mathilde Papillon, Florian Frantzen, Jens Agerberg, Ibrahem AlJabea, Ruben Ballester, Claudio Battiloro, Guillermo Bernárdez, Tolga Birdal, Aiden Brent, Peter Chin, Sergio Escalera, Simone Fiorellino, Odin Hoff Gardaa, Gurusankar Gopalakrishnan, Devendra Govil, Josef Hoppe, Maneel Reddy Karri, Jude Khouja, Manuel Lecha, Neal Livesay, Jan Meißner, Soham Mukherjee, Alexander Nikitin, Theodore Papamarkou , et al. (18 additional authors not shown)

    Abstract: We introduce TopoX, a Python software suite that provides reliable and user-friendly building blocks for computing and machine learning on topological domains that extend graphs: hypergraphs, simplicial, cellular, path and combinatorial complexes. TopoX consists of three packages: TopoNetX facilitates constructing and computing on these domains, including working with nodes, edges and higher-order… ▽ More

    Submitted 17 February, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

  3. arXiv:2302.11479  [pdf, other

    cs.LG stat.ML

    Drop Edges and Adapt: a Fairness Enforcing Fine-tuning for Graph Neural Networks

    Authors: Indro Spinelli, Riccardo Bianchini, Simone Scardapane

    Abstract: The rise of graph representation learning as the primary solution for many different network science tasks led to a surge of interest in the fairness of this family of methods. Link prediction, in particular, has a substantial social impact. However, link prediction algorithms tend to increase the segregation in social networks by disfavoring the links between individuals in specific demographic g… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

  4. arXiv:2208.02048  [pdf, other

    cs.LG stat.ML

    Centroids Matching: an efficient Continual Learning approach operating in the embedding space

    Authors: Jary Pomponi, Simone Scardapane, Aurelio Uncini

    Abstract: Catastrophic forgetting (CF) occurs when a neural network loses the information previously learned while training on a set of samples from a different distribution, i.e., a new task. Existing approaches have achieved remarkable results in mitigating CF, especially in a scenario called task incremental learning. However, this scenario is not realistic, and limited work has been done to achieve good… ▽ More

    Submitted 10 September, 2022; v1 submitted 3 August, 2022; originally announced August 2022.

    Comments: Submitted to Transactions on Machine Learning Research (TMLR)

  5. arXiv:2202.05694  [pdf, other

    cs.LG stat.ML

    Continual Learning with Invertible Generative Models

    Authors: Jary Pomponi, Simone Scardapane, Aurelio Uncini

    Abstract: Catastrophic forgetting (CF) happens whenever a neural network overwrites past knowledge while being trained on new tasks. Common techniques to handle CF include regularization of the weights (using, e.g., their importance on past tasks), and rehearsal strategies, where the network is constantly re-trained on past data. Generative models have also been applied for the latter, in order to have endl… ▽ More

    Submitted 27 December, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2007.02443

  6. Pixle: a fast and effective black-box attack based on rearranging pixels

    Authors: Jary Pomponi, Simone Scardapane, Aurelio Uncini

    Abstract: Recent research has found that neural networks are vulnerable to several types of adversarial attacks, where the input samples are modified in such a way that the model produces a wrong prediction that misclassifies the adversarial sample. In this paper we focus on black-box adversarial attacks, that can be performed without knowing the inner structure of the attacked model, nor the training proce… ▽ More

    Submitted 4 February, 2022; originally announced February 2022.

  7. A Meta-Learning Approach for Training Explainable Graph Neural Networks

    Authors: Indro Spinelli, Simone Scardapane, Aurelio Uncini

    Abstract: In this paper, we investigate the degree of explainability of graph neural networks (GNNs). Existing explainers work by finding global/local subgraphs to explain a prediction, but they are applied after a GNN has already been trained. Here, we propose a meta-learning framework for improving the level of explainability of a GNN directly at training time, by steering the optimization procedure towar… ▽ More

    Submitted 20 December, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

  8. Structured Ensembles: an Approach to Reduce the Memory Footprint of Ensemble Methods

    Authors: Jary Pomponi, Simone Scardapane, Aurelio Uncini

    Abstract: In this paper, we propose a novel ensembling technique for deep neural networks, which is able to drastically reduce the required memory compared to alternative approaches. In particular, we propose to extract multiple sub-networks from a single, untrained neural network by solving an end-to-end optimization task combining differentiable scaling over the original architecture, with multiple regula… ▽ More

    Submitted 17 September, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: Article accepted at Neural Networks

  9. FairDrop: Biased Edge Dropout for Enhancing Fairness in Graph Representation Learning

    Authors: Indro Spinelli, Simone Scardapane, Amir Hussain, Aurelio Uncini

    Abstract: Graph representation learning has become a ubiquitous component in many scenarios, ranging from social network analysis to energy forecasting in smart grids. In several applications, ensuring the fairness of the node (or graph) representations with respect to some protected attributes is crucial for their correct deployment. Yet, fairness in graph deep learning remains under-explored, with few sol… ▽ More

    Submitted 27 December, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

    Comments: Submitted to a journal for the peer-review process

  10. arXiv:2007.06281  [pdf, other

    cs.LG cs.NE stat.ML

    Distributed Training of Graph Convolutional Networks

    Authors: Simone Scardapane, Indro Spinelli, Paolo Di Lorenzo

    Abstract: The aim of this work is to develop a fully-distributed algorithmic framework for training graph convolutional networks (GCNs). The proposed method is able to exploit the meaningful relational structure of the input data, which are collected by a set of agents that communicate over a sparse network topology. After formulating the centralized GCN training problem, we first show how to make inference… ▽ More

    Submitted 7 January, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: Published on IEEE Transactions on Signal and Information Processing over Networks

    Journal ref: IEEE Transactions on Signal and Information Processing over Networks, vol. 7, pp. 87-100, 2021

  11. arXiv:2007.02443  [pdf, other

    stat.ML cs.CV cs.LG

    Pseudo-Rehearsal for Continual Learning with Normalizing Flows

    Authors: Jary Pomponi, Simone Scardapane, Aurelio Uncini

    Abstract: Catastrophic forgetting (CF) happens whenever a neural network overwrites past knowledge while being trained on new tasks. Common techniques to handle CF include regularization of the weights (using, e.g., their importance on past tasks), and rehearsal strategies, where the network is constantly re-trained on past data. Generative models have also been applied for the latter, in order to have endl… ▽ More

    Submitted 5 August, 2021; v1 submitted 5 July, 2020; originally announced July 2020.

    Comments: A preliminary unpublished version of this work was presented in the LifelongML workshop, at ICML 2020

  12. arXiv:2004.12814  [pdf, other

    cs.NE cs.LG stat.ML

    Why should we add early exits to neural networks?

    Authors: Simone Scardapane, Michele Scarpiniti, Enzo Baccarelli, Aurelio Uncini

    Abstract: Deep neural networks are generally designed as a stack of differentiable layers, in which a prediction is obtained only after running the full stack. Recently, some contributions have proposed techniques to endow the networks with early exits, allowing to obtain predictions at intermediate points of the stack. These multi-output networks have a number of advantages, including: (i) significant redu… ▽ More

    Submitted 23 June, 2020; v1 submitted 27 April, 2020; originally announced April 2020.

    Comments: Published in Cognitive Computation

    Journal ref: Cognitive Computation, 2020

  13. Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

    Authors: Jary Pomponi, Simone Scardapane, Aurelio Uncini

    Abstract: Bayesian Neural Networks (BNNs) are trained to optimize an entire distribution over their weights instead of a single set, having significant advantages in terms of, e.g., interpretability, multi-task learning, and calibration. Because of the intractability of the resulting optimization problem, most BNNs are either sampled through Monte Carlo methods, or trained by minimizing a suitable Evidence… ▽ More

    Submitted 30 September, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

  14. arXiv:2002.12287  [pdf, other

    cs.LG cs.NE stat.ML

    Deep Randomized Neural Networks

    Authors: Claudio Gallicchio, Simone Scardapane

    Abstract: Randomized Neural Networks explore the behavior of neural systems where the majority of connections are fixed, either in a stochastic or a deterministic fashion. Typical examples of such systems consist of multi-layered neural network architectures where the connections to the hidden layer(s) are left untrained after initialization. Limiting the training algorithms to operate on a reduced set of w… ▽ More

    Submitted 2 February, 2021; v1 submitted 27 February, 2020; originally announced February 2020.

  15. Adaptive Propagation Graph Convolutional Network

    Authors: Indro Spinelli, Simone Scardapane, Aurelio Uncini

    Abstract: Graph convolutional networks (GCNs) are a family of neural network models that perform inference on graph data by interleaving vertex-wise operations and message-passing exchanges across nodes. Concerning the latter, two key questions arise: (i) how to design a differentiable exchange protocol (e.g., a 1-hop Laplacian smoothing in the original GCN), and (ii) how to characterize the trade-off in co… ▽ More

    Submitted 28 September, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: Published in IEEE Transaction on Neural Networks and Learning Systems

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems, 2020

  16. Efficient Continual Learning in Neural Networks with Embedding Regularization

    Authors: Jary Pomponi, Simone Scardapane, Vincenzo Lomonaco, Aurelio Uncini

    Abstract: Continual learning of deep neural networks is a key requirement for scaling them up to more complex applicative scenarios and for achieving real lifelong learning of these architectures. Previous approaches to the problem have considered either the progressive increase in the size of the networks, or have tried to regularize the network behavior to equalize it with respect to previously observed t… ▽ More

    Submitted 11 February, 2020; v1 submitted 9 September, 2019; originally announced September 2019.

    Journal ref: Neurocomputing, 397, pp. 139-148, 2020

  17. Compressing deep quaternion neural networks with targeted regularization

    Authors: Riccardo Vecchi, Simone Scardapane, Danilo Comminiello, Aurelio Uncini

    Abstract: In recent years, hyper-complex deep networks (such as complex-valued and quaternion-valued neural networks) have received a renewed interest in the literature. They find applications in multiple fields, ranging from image reconstruction to 3D audio processing. Similar to their real-valued counterparts, quaternion neural networks (QVNNs) require custom regularization strategies to avoid overfitting… ▽ More

    Submitted 13 July, 2020; v1 submitted 26 July, 2019; originally announced July 2019.

    Comments: Published on CAAI Transactions on Intelligence Technology, https://digital-library.theiet.org/content/journals/10.1049/trit.2020.0020

  18. arXiv:1906.08502  [pdf, other

    stat.ML cs.LG

    Efficient data augmentation using graph imputation neural networks

    Authors: Indro Spinelli, Simone Scardapane, Michele Scarpiniti, Aurelio Uncini

    Abstract: Recently, data augmentation in the semi-supervised regime, where unlabeled data vastly outnumbers labeled data, has received a considerable attention. In this paper, we describe an efficient technique for this task, exploiting a recent framework we proposed for missing data imputation called graph imputation neural network (GINN). The key idea is to leverage both supervised and unsupervised data t… ▽ More

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: Presented at the 2019 Italian Workshop on Neural Networks (WIRN'19)

  19. Missing Data Imputation with Adversarially-trained Graph Convolutional Networks

    Authors: Indro Spinelli, Simone Scardapane, Aurelio Uncini

    Abstract: Missing data imputation (MDI) is a fundamental problem in many scientific disciplines. Popular methods for MDI use global statistics computed from the entire data set (e.g., the feature-wise medians), or build predictive models operating independently on every instance. In this paper we propose a more general framework for MDI, leveraging recent work in the field of graph neural networks (GNNs). W… ▽ More

    Submitted 24 June, 2020; v1 submitted 6 May, 2019; originally announced May 2019.

    Comments: Published in Neural Networks (2020)

    Journal ref: Neural Networks, 129, pp. 249-260, 2020

  20. arXiv:1903.11990  [pdf, other

    stat.ML cs.LG

    On the Stability and Generalization of Learning with Kernel Activation Functions

    Authors: Michele Cirillo, Simone Scardapane, Steven Van Vaerenbergh, Aurelio Uncini

    Abstract: In this brief we investigate the generalization properties of a recently-proposed class of non-parametric activation functions, the kernel activation functions (KAFs). KAFs introduce additional parameters in the learning process in order to adapt nonlinearities individually on a per-neuron basis, exploiting a cheap kernel expansion of every activation value. While this increase in flexibility has… ▽ More

    Submitted 28 March, 2019; originally announced March 2019.

    Comments: Submitted as a brief paper to IEEE TNNLS

  21. arXiv:1901.10232  [pdf, other

    cs.LG stat.ML

    Multikernel activation functions: formulation and a case study

    Authors: Simone Scardapane, Elena Nieddu, Donatella Firmani, Paolo Merialdo

    Abstract: The design of activation functions is a growing research area in the field of neural networks. In particular, instead of using fixed point-wise functions (e.g., the rectified linear unit), several authors have proposed ways of learning these functions directly from the data in a non-parametric fashion. In this paper we focus on the kernel activation function (KAF), a recently proposed framework wh… ▽ More

    Submitted 29 January, 2019; originally announced January 2019.

    Comments: Accepted for presentation at INNS BDDL 2019 (https://innsbddl2019.org)

  22. arXiv:1807.04065  [pdf, other

    cs.NE cs.LG stat.ML

    Recurrent Neural Networks with Flexible Gates using Kernel Activation Functions

    Authors: Simone Scardapane, Steven Van Vaerenbergh, Danilo Comminiello, Simone Totaro, Aurelio Uncini

    Abstract: Gated recurrent neural networks have achieved remarkable results in the analysis of sequential data. Inside these networks, gates are used to control the flow of information, allowing to model even very long-term dependencies in the data. In this paper, we investigate whether the original gate equation (a linear projection followed by an element-wise sigmoid) can be improved. In particular, we des… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

    Comments: Accepted for presentation at 2018 IEEE International Workshop on Machine Learning for Signal Processing (MLSP)

  23. arXiv:1802.09405  [pdf, other

    cs.NE cs.LG stat.ML

    Improving Graph Convolutional Networks with Non-Parametric Activation Functions

    Authors: Simone Scardapane, Steven Van Vaerenbergh, Danilo Comminiello, Aurelio Uncini

    Abstract: Graph neural networks (GNNs) are a class of neural networks that allow to efficiently perform inference on data that is associated to a graph structure, such as, e.g., citation networks or knowledge graphs. While several variants of GNNs have been proposed, they only consider simple nonlinear activation functions in their layers, such as rectifiers or squashing functions. In this paper, we investi… ▽ More

    Submitted 26 February, 2018; originally announced February 2018.

    Comments: Submitted to EUSIPCO 2018

  24. arXiv:1707.04035  [pdf, other

    stat.ML cs.AI cs.LG cs.NE

    Kafnets: kernel-based non-parametric activation functions for neural networks

    Authors: Simone Scardapane, Steven Van Vaerenbergh, Simone Totaro, Aurelio Uncini

    Abstract: Neural networks are generally built by interleaving (adaptable) linear layers with (fixed) nonlinear activation functions. To increase their flexibility, several authors have proposed methods for adapting the activation functions themselves, endowing them with varying degrees of flexibility. None of these approaches, however, have gained wide acceptance in practice, and research in this topic rema… ▽ More

    Submitted 23 November, 2017; v1 submitted 13 July, 2017; originally announced July 2017.

    Comments: Preprint submitted to Neural Networks (Elsevier)

  25. arXiv:1706.04769  [pdf, other

    stat.ML cs.LG

    Stochastic Training of Neural Networks via Successive Convex Approximations

    Authors: Simone Scardapane, Paolo Di Lorenzo

    Abstract: This paper proposes a new family of algorithms for training neural networks (NNs). These are based on recent developments in the field of non-convex optimization, going under the general name of successive convex approximation (SCA) techniques. The basic idea is to iteratively replace the original (non-convex, highly dimensional) learning problem with a sequence of (strongly convex) approximations… ▽ More

    Submitted 15 June, 2017; originally announced June 2017.

    Comments: Preprint submitted to IEEE Transactions on Neural Networks and Learning Systems

  26. arXiv:1706.03533  [pdf, other

    stat.ML cs.LG

    Recursive Multikernel Filters Exploiting Nonlinear Temporal Structure

    Authors: Steven Van Vaerenbergh, Simone Scardapane, Ignacio Santamaria

    Abstract: In kernel methods, temporal information on the data is commonly included by using time-delayed embeddings as inputs. Recently, an alternative formulation was proposed by defining a gamma-filter explicitly in a reproducing kernel Hilbert space, giving rise to a complex model where multiple kernels operate on different temporal combinations of the input signal. In the original formulation, the kerne… ▽ More

    Submitted 12 June, 2017; originally announced June 2017.

    Comments: Eusipco 2017

  27. arXiv:1704.08913  [pdf, other

    stat.ML cs.LG

    Adaptation and learning over networks for nonlinear system modeling

    Authors: Simone Scardapane, Jie Chen, Cédric Richard

    Abstract: In this chapter, we analyze nonlinear filtering problems in distributed environments, e.g., sensor networks or peer-to-peer protocols. In these scenarios, the agents in the environment receive measurements in a streaming fashion, and they are required to estimate a common (nonlinear) model by alternating local computations and communications with their neighbors. We focus on the important distinct… ▽ More

    Submitted 28 April, 2017; originally announced April 2017.

    Comments: To be published as a chapter in `Adaptive Learning Methods for Nonlinear System Modeling', Elsevier Publishing, Eds. D. Comminiello and J.C. Principe (2018)

  28. A Framework for Parallel and Distributed Training of Neural Networks

    Authors: Simone Scardapane, Paolo Di Lorenzo

    Abstract: The aim of this paper is to develop a general framework for training neural networks (NNs) in a distributed environment, where training data is partitioned over a set of agents that communicate with each other through a sparse, possibly time-varying, connectivity pattern. In such distributed scenario, the training problem can be formulated as the (regularized) optimization of a non-convex social c… ▽ More

    Submitted 20 April, 2017; v1 submitted 24 October, 2016; originally announced October 2016.

    Comments: Published on Neural Networks (Elsevier), in press

  29. arXiv:1607.06364  [pdf, other

    stat.ML cs.LG

    Distributed Supervised Learning using Neural Networks

    Authors: Simone Scardapane

    Abstract: Distributed learning is the problem of inferring a function in the case where training data is distributed among multiple geographically separated sources. Particularly, the focus is on designing learning strategies with low computational requirements, in which communication is restricted only to neighboring agents, with no reliance on a centralized authority. In this thesis, we analyze multiple d… ▽ More

    Submitted 21 July, 2016; originally announced July 2016.

    Comments: Author's Ph.D. thesis (DIET Dept., Sapienza University of Rome, May 2016). Supervisor: Prof. Aurelio Uncini

  30. Group Sparse Regularization for Deep Neural Networks

    Authors: Simone Scardapane, Danilo Comminiello, Amir Hussain, Aurelio Uncini

    Abstract: In this paper, we consider the joint task of simultaneously optimizing (i) the weights of a deep neural network, (ii) the number of neurons for each hidden layer, and (iii) the subset of active input features (i.e., feature selection). While these problems are generally dealt with separately, we present a simple regularized formulation allowing to solve all three of them in parallel, using standar… ▽ More

    Submitted 2 July, 2016; originally announced July 2016.

  31. arXiv:1605.05509  [pdf, other

    stat.ML cs.LG cs.NE

    Learning activation functions from data using cubic spline interpolation

    Authors: Simone Scardapane, Michele Scarpiniti, Danilo Comminiello, Aurelio Uncini

    Abstract: Neural networks require a careful design in order to perform properly on a given task. In particular, selecting a good activation function (possibly in a data-dependent fashion) is a crucial step, which remains an open problem in the research community. Despite a large amount of investigations, most current implementations simply select one fixed function from a small set of candidates, which is n… ▽ More

    Submitted 11 May, 2017; v1 submitted 18 May, 2016; originally announced May 2016.

    Comments: Submitted to the 27th Italian Workshop on Neural Networks (WIRN 2017)

    Journal ref: Neural Advances in Processing Nonlinear Dynamic Signals, 2017