Skip to main content

Showing 1–11 of 11 results for author: Van Vaerenbergh, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:1903.11990  [pdf, other

    stat.ML cs.LG

    On the Stability and Generalization of Learning with Kernel Activation Functions

    Authors: Michele Cirillo, Simone Scardapane, Steven Van Vaerenbergh, Aurelio Uncini

    Abstract: In this brief we investigate the generalization properties of a recently-proposed class of non-parametric activation functions, the kernel activation functions (KAFs). KAFs introduce additional parameters in the learning process in order to adapt nonlinearities individually on a per-neuron basis, exploiting a cheap kernel expansion of every activation value. While this increase in flexibility has… ▽ More

    Submitted 28 March, 2019; originally announced March 2019.

    Comments: Submitted as a brief paper to IEEE TNNLS

  2. arXiv:1807.04065  [pdf, other

    cs.NE cs.LG stat.ML

    Recurrent Neural Networks with Flexible Gates using Kernel Activation Functions

    Authors: Simone Scardapane, Steven Van Vaerenbergh, Danilo Comminiello, Simone Totaro, Aurelio Uncini

    Abstract: Gated recurrent neural networks have achieved remarkable results in the analysis of sequential data. Inside these networks, gates are used to control the flow of information, allowing to model even very long-term dependencies in the data. In this paper, we investigate whether the original gate equation (a linear projection followed by an element-wise sigmoid) can be improved. In particular, we des… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

    Comments: Accepted for presentation at 2018 IEEE International Workshop on Machine Learning for Signal Processing (MLSP)

  3. arXiv:1802.09405  [pdf, other

    cs.NE cs.LG stat.ML

    Improving Graph Convolutional Networks with Non-Parametric Activation Functions

    Authors: Simone Scardapane, Steven Van Vaerenbergh, Danilo Comminiello, Aurelio Uncini

    Abstract: Graph neural networks (GNNs) are a class of neural networks that allow to efficiently perform inference on data that is associated to a graph structure, such as, e.g., citation networks or knowledge graphs. While several variants of GNNs have been proposed, they only consider simple nonlinear activation functions in their layers, such as rectifiers or squashing functions. In this paper, we investi… ▽ More

    Submitted 26 February, 2018; originally announced February 2018.

    Comments: Submitted to EUSIPCO 2018

  4. arXiv:1802.05910  [pdf, other

    cs.LG stat.ML

    Pattern Localization in Time Series through Signal-To-Model Alignment in Latent Space

    Authors: Steven Van Vaerenbergh, Ignacio Santamaria, Victor Elvira, Matteo Salvatori

    Abstract: In this paper, we study the problem of locating a predefined sequence of patterns in a time series. In particular, the studied scenario assumes a theoretical model is available that contains the expected locations of the patterns. This problem is found in several contexts, and it is commonly solved by first synthesizing a time series from the model, and then aligning it to the true time series thr… ▽ More

    Submitted 19 February, 2018; v1 submitted 16 February, 2018; originally announced February 2018.

    Comments: IEEE ICASSP 2018

  5. arXiv:1707.04035  [pdf, other

    stat.ML cs.AI cs.LG cs.NE

    Kafnets: kernel-based non-parametric activation functions for neural networks

    Authors: Simone Scardapane, Steven Van Vaerenbergh, Simone Totaro, Aurelio Uncini

    Abstract: Neural networks are generally built by interleaving (adaptable) linear layers with (fixed) nonlinear activation functions. To increase their flexibility, several authors have proposed methods for adapting the activation functions themselves, endowing them with varying degrees of flexibility. None of these approaches, however, have gained wide acceptance in practice, and research in this topic rema… ▽ More

    Submitted 23 November, 2017; v1 submitted 13 July, 2017; originally announced July 2017.

    Comments: Preprint submitted to Neural Networks (Elsevier)

  6. arXiv:1706.03533  [pdf, other

    stat.ML cs.LG

    Recursive Multikernel Filters Exploiting Nonlinear Temporal Structure

    Authors: Steven Van Vaerenbergh, Simone Scardapane, Ignacio Santamaria

    Abstract: In kernel methods, temporal information on the data is commonly included by using time-delayed embeddings as inputs. Recently, an alternative formulation was proposed by defining a gamma-filter explicitly in a reproducing kernel Hilbert space, giving rise to a complex model where multiple kernels operate on different temporal combinations of the input signal. In the original formulation, the kerne… ▽ More

    Submitted 12 June, 2017; originally announced June 2017.

    Comments: Eusipco 2017

  7. arXiv:1609.03164  [pdf, ps, other

    stat.ML cs.IT cs.LG

    On the Relationship between Online Gaussian Process Regression and Kernel Least Mean Squares Algorithms

    Authors: Steven Van Vaerenbergh, Jesus Fernandez-Bes, Víctor Elvira

    Abstract: We study the relationship between online Gaussian process (GP) regression and kernel least mean squares (KLMS) algorithms. While the latter have no capacity of storing the entire posterior distribution during online learning, we discover that their operation corresponds to the assumption of a fixed posterior covariance that follows a simple parametric model. Interestingly, several well-known KLMS… ▽ More

    Submitted 11 September, 2016; originally announced September 2016.

    Comments: Accepted for publication in 2016 IEEE International Workshop on Machine Learning for Signal Processing

  8. arXiv:1501.06929  [pdf, ps, other

    stat.ML eess.SY stat.AP

    A Probabilistic Least-Mean-Squares Filter

    Authors: Jesus Fernandez-Bes, Víctor Elvira, Steven Van Vaerenbergh

    Abstract: We introduce a probabilistic approach to the LMS filter. By means of an efficient approximation, this approach provides an adaptable step-size LMS algorithm together with a measure of uncertainty about the estimation. In addition, the proposed approximation preserves the linear complexity of the standard LMS. Numerical results show the improved performance of the algorithm with respect to standard… ▽ More

    Submitted 27 January, 2015; originally announced January 2015.

  9. arXiv:1310.5347  [pdf, other

    stat.ML cs.LG

    Bayesian Extensions of Kernel Least Mean Squares

    Authors: Il Memming Park, Sohan Seth, Steven Van Vaerenbergh

    Abstract: The kernel least mean squares (KLMS) algorithm is a computationally efficient nonlinear adaptive filtering method that "kernelizes" the celebrated (linear) least mean squares algorithm. We demonstrate that the least mean squares algorithm is closely related to the Kalman filtering, and thus, the KLMS can be interpreted as an approximate Bayesian filtering method. This allows us to systematically d… ▽ More

    Submitted 20 October, 2013; originally announced October 2013.

    Comments: 7 pages, 4 fiures

  10. arXiv:1303.2823  [pdf, other

    cs.LG cs.IT stat.ML

    Gaussian Processes for Nonlinear Signal Processing

    Authors: Fernando Pérez-Cruz, Steven Van Vaerenbergh, Juan José Murillo-Fuentes, Miguel Lázaro-Gredilla, Ignacio Santamaria

    Abstract: Gaussian processes (GPs) are versatile tools that have been successfully employed to solve nonlinear estimation problems in machine learning, but that are rarely used in signal processing. In this tutorial, we present GPs for regression as a natural nonlinear extension to optimal Wiener filtering. After establishing their basic formulation, we discuss several important aspects and extensions, incl… ▽ More

    Submitted 27 September, 2013; v1 submitted 12 March, 2013; originally announced March 2013.

    Journal ref: IEEE Signal Processing Magazine, vol.30, no.4, pp.40-50, July 2013

  11. arXiv:1108.3372  [pdf, ps, other

    stat.ML cs.AI cs.LG

    Overlap** Mixtures of Gaussian Processes for the Data Association Problem

    Authors: Miguel Lázaro-Gredilla, Steven Van Vaerenbergh, Neil Lawrence

    Abstract: In this work we introduce a mixture of GPs to address the data association problem, i.e. to label a group of observations according to the sources that generated them. Unlike several previously proposed GP mixtures, the novel mixture has the distinct characteristic of using no gating function to determine the association of samples and mixture components. Instead, all the GPs in the mixture are gl… ▽ More

    Submitted 16 August, 2011; originally announced August 2011.