Showing 1–3 of 3 results for author: Sommerlade, E

  1. arXiv:2111.15667

    cs.CV

    Adaptive Token Sampling For Efficient Vision Transformers

    Authors: Mohsen Fayyaz, Soroush Abbasi Koohpayegani, Farnoush Rezaei Jafari, Sunando Sengupta, Hamid Reza Vaezi Joze, Eric Sommerlade, Hamed Pirsiavash, Juergen Gall

    Abstract: While state-of-the-art vision transformer models achieve promising results in image classification, they are computationally expensive and require many GFLOPs. Although the GFLOPs of a vision transformer can be decreased by reducing the number of tokens in the network, there is no setting that is optimal for all input images. In this work, we therefore introduce a differentiable parameter-free Ada…

    Submitted 26 July, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

    Comments: ECCV 2022

  2. arXiv:2108.06401

    cs.SD eess.AS

    Cross-modal Spectrum Transformation Network For Acoustic Scene classification

    Authors: Yang Liu, Alexandros Neophytou, Sunando Sengupta, Eric Sommerlade

    Abstract: Convolutional neural networks (CNNs) with log-mel spectrum features have shown promising results for acoustic scene classification tasks. However, the performance of these CNN based classifiers is still lacking as they do not generalise well for unknown environments. To address this issue, we introduce an acoustic spectrum transformation network where traditional log-mel spectrums are transformed…

    Submitted 13 August, 2021; originally announced August 2021.

    Journal ref: ICASSP 2021

  3. arXiv:2012.06444

    cs.CV

    Relighting Images in the Wild with a Self-Supervised Siamese Auto-Encoder

    Authors: Yang Liu, Alexandros Neophytou, Sunando Sengupta, Eric Sommerlade

    Abstract: We propose a self-supervised method for image relighting of single view images in the wild. The method is based on an auto-encoder which deconstructs an image into two separate encodings, relating to the scene illumination and content, respectively. In order to disentangle this embedding information without supervision, we exploit the assumption that some augmentation operations do not affect the…

    Submitted 23 August, 2021; v1 submitted 11 December, 2020; originally announced December 2020.