Showing 1–6 of 6 results for author: Gerken, J E

  1. arXiv:2406.06504  [pdf, other]

    cs.LG

    Equivariant Neural Tangent Kernels

    Authors: Philipp Misof, Pan Kessel, Jan E. Gerken

    Abstract: Equivariant neural networks have in recent years become an important technique for guiding architecture selection for neural networks with many applications in domains ranging from medical image analysis to quantum chemistry. In particular, as the most general linear equivariant layers with respect to the regular representation, group convolutions have been highly impactful in numerous application…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 13 pages + 5 pages appendices

  2. arXiv:2403.03103  [pdf, other]

    cs.LG

    Emergent Equivariance in Deep Ensembles

    Authors: Jan E. Gerken, Pan Kessel

    Abstract: We show that deep ensembles become equivariant for all inputs and at all training times by simply using data augmentation. Crucially, equivariance holds off-manifold and for any architecture in the infinite width limit. The equivariance is emergent in the sense that predictions of individual ensemble members are not equivariant but their collective prediction is. Neural tangent kernel theory is us…

    Submitted 15 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: 11 pages + 17 pages appendices

  3. arXiv:2307.07313  [pdf, other]

    cs.CV cs.LG

    HEAL-SWIN: A Vision Transformer On The Sphere

    Authors: Oscar Carlsson, Jan E. Gerken, Hampus Linander, Heiner Spieß, Fredrik Ohlsson, Christoffer Petersson, Daniel Persson

    Abstract: High-resolution wide-angle fisheye images are becoming more and more important for robotics applications such as autonomous driving. However, using ordinary convolutional neural networks or vision transformers on this data is problematic due to projection and distortion losses introduced when projecting to a rectangular grid on the plane. We introduce the HEAL-SWIN transformer, which combines the…

    Submitted 8 May, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: Accepted as poster to CVPR 2024. Main body: 10 pages, 7 figures. Appendices: 9 pages, 6 figures

  4. arXiv:2206.05075  [pdf, other]

    cs.LG cs.AI

    Diffeomorphic Counterfactuals with Generative Models

    Authors: Ann-Kathrin Dombrowski, Jan E. Gerken, Klaus-Robert Müller, Pan Kessel

    Abstract: Counterfactuals can explain classification decisions of neural networks in a human interpretable way. We propose a simple but effective method to generate such counterfactuals. More specifically, we perform a suitable diffeomorphic coordinate transformation and then perform gradient ascent in these coordinates to find counterfactuals which are classified with great confidence as a specified target…

    Submitted 16 June, 2022; v1 submitted 10 June, 2022; originally announced June 2022.

  5. arXiv:2202.03990  [pdf, other]

    cs.LG cs.CV

    Equivariance versus Augmentation for Spherical Images

    Authors: Jan E. Gerken, Oscar Carlsson, Hampus Linander, Fredrik Ohlsson, Christoffer Petersson, Daniel Persson

    Abstract: We analyze the role of rotational equivariance in convolutional neural networks (CNNs) applied to spherical images. We compare the performance of the group equivariant networks known as S2CNNs and standard non-equivariant CNNs trained with an increasing amount of data augmentation. The chosen architectures can be considered baseline references for the respective design paradigms. Our models are tr…

    Submitted 12 July, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: Accepted to ICML 2022, updated according to ICML reviewer comments, 18 pages of which 9 in main body, 16 figures

  6. arXiv:2105.13926  [pdf, other]

    cs.LG cs.CV hep-th

    Geometric Deep Learning and Equivariant Neural Networks

    Authors: Jan E. Gerken, Jimmy Aronsson, Oscar Carlsson, Hampus Linander, Fredrik Ohlsson, Christoffer Petersson, Daniel Persson

    Abstract: We survey the mathematical foundations of geometric deep learning, focusing on group equivariant and gauge equivariant neural networks. We develop gauge equivariant convolutional neural networks on arbitrary manifolds $\mathcal{M}$ using principal bundles with structure group $K$ and equivariant maps between sections of associated vector bundles. We also discuss group equivariant neural networks f…

    Submitted 28 May, 2021; originally announced May 2021.

    Comments: 57 pages