Skip to main content

Showing 1–4 of 4 results for author: Kittler, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2211.13189  [pdf, other

    cs.SD cs.CV eess.AS

    ASiT: Local-Global Audio Spectrogram vIsion Transformer for Event Classification

    Authors: Sara Atito, Muhammad Awais, Wenwu Wang, Mark D Plumbley, Josef Kittler

    Abstract: Transformers, which were originally developed for natural language processing, have recently generated significant interest in the computer vision and audio communities due to their flexibility in learning long-range relationships. Constrained by the data hungry nature of transformers and the limited amount of labelled data, most transformer-based models for audio tasks are finetuned from ImageNet… ▽ More

    Submitted 10 March, 2024; v1 submitted 23 November, 2022; originally announced November 2022.

  2. UMFA: A photorealistic style transfer method based on U-Net and multi-layer feature aggregation

    Authors: D. Y. Rao, X. J. Wu, H. Li, J. Kittler, T. Y. Xu

    Abstract: In this paper, we propose a photorealistic style transfer network to emphasize the natural effect of photorealistic image stylization. In general, distortion of the image content and lacking of details are two typical issues in the style transfer field. To this end, we design a novel framework employing the U-Net structure to maintain the rich spatial clues, with a multi-layer feature aggregation… ▽ More

    Submitted 13 August, 2021; originally announced August 2021.

  3. arXiv:2102.10526  [pdf, other

    cs.CV eess.IV

    Deep Decomposition Network for Image Processing: A Case Study for Visible and Infrared Image Fusion

    Authors: Yu Fu, Xiao-Jun Wu, Josef Kittler

    Abstract: Image decomposition is a crucial subject in the field of image processing. It can extract salient features from the source image. We propose a new image decomposition method based on convolutional neural network. This method can be applied to many image processing tasks. In this paper, we apply the image decomposition network to the image fusion task. We input infrared image and visible light imag… ▽ More

    Submitted 3 August, 2022; v1 submitted 21 February, 2021; originally announced February 2021.

  4. arXiv:1909.07273  [pdf, other

    cs.CV eess.IV

    More About Covariance Descriptors for Image Set Coding: Log-Euclidean Framework based Kernel Matrix Representation

    Authors: Kai-Xuan Chen, Xiao-Jun Wu, Jie-Yi Ren, Rui Wang, Josef Kittler

    Abstract: We consider a family of structural descriptors for visual data, namely covariance descriptors (CovDs) that lie on a non-linear symmetric positive definite (SPD) manifold, a special type of Riemannian manifolds. We propose an improved version of CovDs for image set coding by extending the traditional CovDs from Euclidean space to the SPD manifold. Specifically, the manifold of SPD matrices is a com… ▽ More

    Submitted 26 September, 2019; v1 submitted 16 September, 2019; originally announced September 2019.

    Comments: 10 pages