Showing 1–5 of 5 results for author: Comminiello, D

  1. Compressing deep quaternion neural networks with targeted regularization

    Authors: Riccardo Vecchi, Simone Scardapane, Danilo Comminiello, Aurelio Uncini

    Abstract: In recent years, hyper-complex deep networks (such as complex-valued and quaternion-valued neural networks) have received a renewed interest in the literature. They find applications in multiple fields, ranging from image reconstruction to 3D audio processing. Similar to their real-valued counterparts, quaternion neural networks (QVNNs) require custom regularization strategies to avoid overfitting…

    Submitted 13 July, 2020; v1 submitted 26 July, 2019; originally announced July 2019.

    Comments: Published in CAAI Transactions on Intelligence Technology, https://digital-library.theiet.org/content/journals/10.1049/trit.2020.0020

  2. arXiv:1807.04065  [pdf, other]

    cs.NE cs.LG stat.ML

    Recurrent Neural Networks with Flexible Gates using Kernel Activation Functions

    Authors: Simone Scardapane, Steven Van Vaerenbergh, Danilo Comminiello, Simone Totaro, Aurelio Uncini

    Abstract: Gated recurrent neural networks have achieved remarkable results in the analysis of sequential data. Inside these networks, gates are used to control the flow of information, allowing to model even very long-term dependencies in the data. In this paper, we investigate whether the original gate equation (a linear projection followed by an element-wise sigmoid) can be improved. In particular, we des…

    Submitted 11 July, 2018; originally announced July 2018.

    Comments: Accepted for presentation at 2018 IEEE International Workshop on Machine Learning for Signal Processing (MLSP)

  3. arXiv:1802.09405  [pdf, other]

    cs.NE cs.LG stat.ML

    Improving Graph Convolutional Networks with Non-Parametric Activation Functions

    Authors: Simone Scardapane, Steven Van Vaerenbergh, Danilo Comminiello, Aurelio Uncini

    Abstract: Graph neural networks (GNNs) are a class of neural networks that allow to efficiently perform inference on data that is associated to a graph structure, such as, e.g., citation networks or knowledge graphs. While several variants of GNNs have been proposed, they only consider simple nonlinear activation functions in their layers, such as rectifiers or squashing functions. In this paper, we investi…

    Submitted 26 February, 2018; originally announced February 2018.

    Comments: Submitted to EUSIPCO 2018

  4. Group Sparse Regularization for Deep Neural Networks

    Authors: Simone Scardapane, Danilo Comminiello, Amir Hussain, Aurelio Uncini

    Abstract: In this paper, we consider the joint task of simultaneously optimizing (i) the weights of a deep neural network, (ii) the number of neurons for each hidden layer, and (iii) the subset of active input features (i.e., feature selection). While these problems are generally dealt with separately, we present a simple regularized formulation allowing to solve all three of them in parallel, using standar…

    Submitted 2 July, 2016; originally announced July 2016.

  5. arXiv:1605.05509  [pdf, other]

    stat.ML cs.LG cs.NE

    Learning activation functions from data using cubic spline interpolation

    Authors: Simone Scardapane, Michele Scarpiniti, Danilo Comminiello, Aurelio Uncini

    Abstract: Neural networks require a careful design in order to perform properly on a given task. In particular, selecting a good activation function (possibly in a data-dependent fashion) is a crucial step, which remains an open problem in the research community. Despite a large amount of investigations, most current implementations simply select one fixed function from a small set of candidates, which is n…

    Submitted 11 May, 2017; v1 submitted 18 May, 2016; originally announced May 2016.

    Comments: Submitted to the 27th Italian Workshop on Neural Networks (WIRN 2017)

    Journal ref: Neural Advances in Processing Nonlinear Dynamic Signals, 2017