Skip to main content

Showing 1–15 of 15 results for author: Løkse, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.17951  [pdf, other

    cs.LG cs.IT stat.ML

    Cauchy-Schwarz Divergence Information Bottleneck for Regression

    Authors: Shujian Yu, Xi Yu, Sigurd Løkse, Robert Jenssen, Jose C. Principe

    Abstract: The information bottleneck (IB) approach is popular to improve the generalization, robustness and explainability of deep neural networks. Essentially, it aims to find a minimum sufficient representation $\mathbf{t}$ by striking a trade-off between a compression term $I(\mathbf{x};\mathbf{t})$ and a prediction term $I(y;\mathbf{t})$, where $I(\cdot;\cdot)$ refers to the mutual information (MI). MI… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: accepted by ICLR-24, project page: \url{https://github.com/SJYuCNEL/Cauchy-Schwarz-Information-Bottleneck}

  2. arXiv:2303.09877  [pdf, other

    stat.ML cs.CV cs.LG

    On the Effects of Self-supervision and Contrastive Alignment in Deep Multi-view Clustering

    Authors: Daniel J. Trosten, Sigurd Løkse, Robert Jenssen, Michael C. Kampffmeyer

    Abstract: Self-supervised learning is a central component in recent approaches to deep multi-view clustering (MVC). However, we find large variations in the development of self-supervision-based methods for deep MVC, potentially slowing the progress of the field. To address this, we present DeepMVC, a unified framework for deep MVC that includes many recent methods as instances. We leverage our framework to… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: CVPR 2023. Code available at https://github.com/DanielTrosten/DeepMVC

  3. arXiv:2303.09352  [pdf, other

    cs.CV

    Hubs and Hyperspheres: Reducing Hubness and Improving Transductive Few-shot Learning with Hyperspherical Embeddings

    Authors: Daniel J. Trosten, Rwiddhi Chakraborty, Sigurd Løkse, Kristoffer Knutsen Wickstrøm, Robert Jenssen, Michael C. Kampffmeyer

    Abstract: Distance-based classification is frequently used in transductive few-shot learning (FSL). However, due to the high-dimensionality of image representations, FSL classifiers are prone to suffer from the hubness problem, where a few points (hubs) occur frequently in multiple nearest neighbour lists of other points. Hubness negatively impacts distance-based classification when hubs from one class appe… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: CVPR 2023

  4. arXiv:2301.08970  [pdf, other

    cs.LG cs.IT stat.ML

    The Conditional Cauchy-Schwarz Divergence with Applications to Time-Series Data and Sequential Decision Making

    Authors: Shujian Yu, Hongming Li, Sigurd Løkse, Robert Jenssen, José C. Príncipe

    Abstract: The Cauchy-Schwarz (CS) divergence was developed by Príncipe et al. in 2000. In this paper, we extend the classic CS divergence to quantify the closeness between two conditional distributions and show that the developed conditional CS divergence can be simply estimated by a kernel density estimator from given samples. We illustrate the advantages (e.g., rigorous faithfulness guarantee, lower compu… ▽ More

    Submitted 26 April, 2024; v1 submitted 21 January, 2023; originally announced January 2023.

    Comments: 27 pages, 10 figures, under 2nd round review

  5. arXiv:2205.08864  [pdf, ps, other

    stat.ML cs.LG math.ST

    The Kernelized Taylor Diagram

    Authors: Kristoffer Wickstrøm, J. Emmanuel Johnson, Sigurd Løkse, Gustau Camps-Valls, Karl Øyvind Mikalsen, Michael Kampffmeyer, Robert Jenssen

    Abstract: This paper presents the kernelized Taylor diagram, a graphical framework for visualizing similarities between data populations. The kernelized Taylor diagram builds on the widely used Taylor diagram, which is used to visualize similarities between populations. However, the Taylor diagram has several limitations such as not capturing non-linear relationships and sensitivity to outliers. To address… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: Accepted at the Norwegian Artificial Intelligence Symposium 2022. Code available at: https://github.com/Wickstrom/KernelizedTaylorDiagram

  6. arXiv:2112.10161  [pdf, other

    stat.ML cs.LG

    RELAX: Representation Learning Explainability

    Authors: Kristoffer K. Wickstrøm, Daniel J. Trosten, Sigurd Løkse, Ahcène Boubekki, Karl Øyvind Mikalsen, Michael C. Kampffmeyer, Robert Jenssen

    Abstract: Despite the significant improvements that representation learning via self-supervision has led to when learning from unlabeled data, no methods exist that explain what influences the learned representation. We address this need through our proposed approach, RELAX, which is the first approach for attribution-based explanations of representations. Our approach can also model the uncertainty in its… ▽ More

    Submitted 21 February, 2022; v1 submitted 19 December, 2021; originally announced December 2021.

  7. arXiv:2103.07738  [pdf, other

    cs.CV cs.LG

    Reconsidering Representation Alignment for Multi-view Clustering

    Authors: Daniel J. Trosten, Sigurd Løkse, Robert Jenssen, Michael Kampffmeyer

    Abstract: Aligning distributions of view representations is a core component of today's state of the art models for deep multi-view clustering. However, we identify several drawbacks with naïvely aligning representation distributions. We demonstrate that these drawbacks both lead to less separable clusters in the representation space, and inhibit the model's ability to prioritize views. Based on these obser… ▽ More

    Submitted 13 March, 2021; originally announced March 2021.

    Comments: To appear in CVPR 2021. Code available at https://github.com/DanielTrosten/mvc

  8. arXiv:2001.07026  [pdf

    stat.ML cs.CV cs.LG

    Leveraging tensor kernels to reduce objective function mismatch in deep clustering

    Authors: Daniel J. Trosten, Sigurd Løkse, Robert Jenssen, Michael Kampffmeyer

    Abstract: Objective Function Mismatch (OFM) occurs when the optimization of one objective has a negative impact on the optimization of another objective. In this work we study OFM in deep clustering, and find that the popular autoencoder-based approach to deep clustering can lead to both reduced clustering performance, and a significant amount of OFM between the reconstruction and clustering objectives. To… ▽ More

    Submitted 13 February, 2024; v1 submitted 20 January, 2020; originally announced January 2020.

  9. arXiv:1909.11396  [pdf, other

    stat.ML cs.LG

    Information Plane Analysis of Deep Neural Networks via Matrix-Based Renyi's Entropy and Tensor Kernels

    Authors: Kristoffer Wickstrøm, Sigurd Løkse, Michael Kampffmeyer, Shujian Yu, Jose Principe, Robert Jenssen

    Abstract: Analyzing deep neural networks (DNNs) via information plane (IP) theory has gained tremendous attention recently as a tool to gain insight into, among others, their generalization ability. However, it is by no means obvious how to estimate mutual information (MI) between each hidden layer and the input/desired output, to construct the IP. For instance, hidden layers with many neurons require MI es… ▽ More

    Submitted 25 September, 2019; originally announced September 2019.

    Comments: 15 pages, 8 figures

  10. Deep Divergence-Based Approach to Clustering

    Authors: Michael Kampffmeyer, Sigurd Løkse, Filippo M. Bianchi, Lorenzo Livi, Arnt-Børre Salberg, Robert Jenssen

    Abstract: A promising direction in deep learning research consists in learning representations and simultaneously discovering cluster structure in unlabeled data by optimizing a discriminative loss function. As opposed to supervised deep learning, this line of research is in its infancy, and how to design and optimize suitable loss functions to train deep neural networks for clustering is still an open ques… ▽ More

    Submitted 13 February, 2019; originally announced February 2019.

  11. arXiv:1807.07868  [pdf, other

    stat.ML cs.LG cs.NE

    The Deep Kernelized Autoencoder

    Authors: Michael Kampffmeyer, Sigurd Løkse, Filippo M. Bianchi, Robert Jenssen, Lorenzo Livi

    Abstract: Autoencoders learn data representations (codes) in such a way that the input is reproduced at the output of the network. However, it is not always clear what kind of properties of the input data need to be captured by the codes. Kernel machines have experienced great success by operating via inner-products in a theoretically well-defined reproducing kernel Hilbert space, hence capturing topologica… ▽ More

    Submitted 23 July, 2018; v1 submitted 19 July, 2018; originally announced July 2018.

    Comments: This work extends the preliminary (conference) version of this paper (arXiv:1702.02526), Applied Soft Computing, Elsevier, 2018

  12. arXiv:1803.07870  [pdf, other

    cs.NE

    Reservoir computing approaches for representation and classification of multivariate time series

    Authors: Filippo Maria Bianchi, Simone Scardapane, Sigurd Løkse, Robert Jenssen

    Abstract: Classification of multivariate time series (MTS) has been tackled with a large variety of methodologies and applied to a wide range of scenarios. Reservoir Computing (RC) provides efficient tools to generate a vectorial, fixed-size representation of the MTS that can be further processed by standard classifiers. Despite their unrivaled training speed, MTS classifiers based on a standard RC architec… ▽ More

    Submitted 7 June, 2020; v1 submitted 21 March, 2018; originally announced March 2018.

  13. arXiv:1711.06509  [pdf, other

    cs.NE

    Bidirectional deep-readout echo state networks

    Authors: Filippo Maria Bianchi, Simone Scardapane, Sigurd Løkse, Robert Jenssen

    Abstract: We propose a deep architecture for the classification of multivariate time series. By means of a recurrent and untrained reservoir we generate a vectorial representation that embeds temporal relationships in the data. To improve the memorization capability, we implement a bidirectional reservoir, whose last state captures also past dependencies in the input. We apply dimensionality reduction to th… ▽ More

    Submitted 13 February, 2018; v1 submitted 17 November, 2017; originally announced November 2017.

  14. arXiv:1702.02526  [pdf, other

    stat.ML cs.LG cs.NE

    Deep Kernelized Autoencoders

    Authors: Michael Kampffmeyer, Sigurd Løkse, Filippo Maria Bianchi, Robert Jenssen, Lorenzo Livi

    Abstract: In this paper we introduce the deep kernelized autoencoder, a neural network model that allows an explicit approximation of (i) the map** from an input space to an arbitrary, user-specified kernel space and (ii) the back-projection from such a kernel space to input space. The proposed method is based on traditional autoencoders and is trained through a new unsupervised loss function. During trai… ▽ More

    Submitted 8 February, 2017; originally announced February 2017.

  15. Training Echo State Networks with Regularization through Dimensionality Reduction

    Authors: Sigurd Løkse, Filippo Maria Bianchi, Robert Jenssen

    Abstract: In this paper we introduce a new framework to train an Echo State Network to predict real valued time-series. The method consists in projecting the output of the internal layer of the network on a space with lower dimensionality, before training the output layer to learn the target task. Notably, we enforce a regularization constraint that leads to better generalization capabilities. We evaluate t… ▽ More

    Submitted 16 August, 2016; originally announced August 2016.