Showing 1–3 of 3 results for author: Guzhov, A

  1. arXiv:2106.13043

    cs.SD cs.CV eess.AS

    AudioCLIP: Extending CLIP to Image, Text and Audio

    Authors: Andrey Guzhov, Federico Raue, Jörn Hees, Andreas Dengel

    Abstract: In the past, the rapidly evolving field of sound classification greatly benefited from the application of methods from other domains. Today, we observe the trend to fuse domain-specific tasks and approaches together, which provides the community with new outstanding models. In this work, we present an extension of the CLIP model that handles audio in addition to text and images. Our proposed mod…

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: submitted to GCPR 2021

  2. arXiv:2104.11587

    cs.SD eess.AS

    ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio

    Authors: Andrey Guzhov, Federico Raue, Jörn Hees, Andreas Dengel

    Abstract: Environmental Sound Classification (ESC) is a rapidly evolving field that recently demonstrated the advantages of applying visual domain techniques to audio-related tasks. Previous studies indicate that the domain-specific modification of cross-domain approaches shows promise in pushing the whole area of ESC forward. In this paper, we present a new time-frequency transformation layer…

    Submitted 23 April, 2021; originally announced April 2021.

    Comments: submitted to IJCNN 2021

  3. arXiv:2004.07301

    cs.CV cs.LG cs.SD eess.AS

    ESResNet: Environmental Sound Classification Based on Visual Domain Models

    Authors: Andrey Guzhov, Federico Raue, Jörn Hees, Andreas Dengel

    Abstract: Environmental Sound Classification (ESC) is an active research area in the audio domain and has seen a lot of progress in the past years. However, many of the existing approaches achieve high accuracy by relying on domain-specific features and architectures, making it harder to benefit from advances in other fields (e.g., the image domain). Additionally, some of the past successes have been attrib…

    Submitted 15 April, 2020; originally announced April 2020.

    Comments: 8 pages, 4 figures; submitted to ICPR 2020