Showing 1–6 of 6 results for author: Abeßer, J

Searching in archive eess.
  1. arXiv:2405.00384

    cs.CV cs.MM cs.SD eess.AS

    Visual and audio scene classification for detecting discrepancies in video: a baseline method and experimental protocol

    Authors: Konstantinos Apostolidis, Jakob Abesser, Luca Cuccovillo, Vasileios Mezaris

    Abstract: This paper presents a baseline approach and an experimental protocol for a specific content verification problem: detecting discrepancies between the audio and video modalities in multimedia content. We first design and optimize an audio-visual scene classifier, to compare with existing classification baselines that use both modalities. Then, by applying this classifier separately to the audio and…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Accepted for publication, 3rd ACM Int. Workshop on Multimedia AI against Disinformation (MAD'24) at ACM ICMR'24, June 10, 2024, Phuket, Thailand. This is the "accepted version"

  2. arXiv:2111.01710

    eess.AS cs.SD

    Multi-input Architecture and Disentangled Representation Learning for Multi-dimensional Modeling of Music Similarity

    Authors: Sebastian Ribecky, Jakob Abeßer, Hanna Lukashevich

    Abstract: In the context of music information retrieval, similarity-based approaches are useful for a variety of tasks that benefit from a query-by-example scenario. Music, however, naturally decomposes into a set of semantically meaningful factors of variation. Current representation learning strategies pursue the disentanglement of such factors from deep representations, resulting in highly interpretable m…

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: Submitted to ICASSP 2022

  3. arXiv:2110.13586

    eess.AS cs.SD

    Towards Audio Domain Adaptation for Acoustic Scene Classification using Disentanglement Learning

    Authors: Jakob Abeßer, Meinard Müller

    Abstract: The deployment of machine listening algorithms in real-life applications is often impeded by a domain shift caused, for instance, by different microphone characteristics. In this paper, we propose a novel domain adaptation strategy based on disentanglement learning. The goal is to disentangle task-specific and domain-specific characteristics in the analyzed audio recordings. In particular, we combin…

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: Submitted to ICASSP 2022

  4. arXiv:2105.02592

    eess.AS cs.SD

    USM-SED - A Dataset for Polyphonic Sound Event Detection in Urban Sound Monitoring Scenarios

    Authors: Jakob Abeßer

    Abstract: This paper introduces a novel dataset for polyphonic sound event detection in urban sound monitoring use-cases. Based on isolated sounds taken from the FSD50k dataset, 20,000 polyphonic soundscapes are synthesized with sounds being randomly positioned in the stereo panorama using different loudness levels. The paper gives a detailed discussion of possible application scenarios, explains the datase…

    Submitted 6 May, 2021; originally announced May 2021.

  5. arXiv:2104.13620

    eess.AS cs.AI cs.SD

    IDMT-Traffic: An Open Benchmark Dataset for Acoustic Traffic Monitoring Research

    Authors: Jakob Abeßer, Saichand Gourishetti, András Kátai, Tobias Clauß, Prachi Sharma, Judith Liebetrau

    Abstract: In many urban areas, traffic load and noise pollution are constantly increasing. Automated systems for traffic monitoring are promising countermeasures, which make it possible to systematically quantify and predict local traffic flow in order to support municipal traffic planning decisions. In this paper, we present a novel open benchmark dataset, containing 2.5 hours of stereo audio recordings of 4718 ve…

    Submitted 28 April, 2021; originally announced April 2021.

  6. arXiv:2102.08833

    cs.SD cs.DC cs.LG eess.AS

    DESED-FL and URBAN-FL: Federated Learning Datasets for Sound Event Detection

    Authors: David S. Johnson, Wolfgang Lorenz, Michael Taenzer, Stylianos Mimilakis, Sascha Grollmisch, Jakob Abeßer, Hanna Lukashevich

    Abstract: Research on sound event detection (SED) in environmental settings has seen increased attention in recent years. The large amounts of (private) domestic or urban audio data needed raise significant logistical and privacy concerns. The inherently distributed nature of these tasks makes federated learning (FL) a promising approach to take advantage of large-scale data while mitigating privacy issues.…

    Submitted 31 May, 2021; v1 submitted 17 February, 2021; originally announced February 2021.

    Comments: To be published in EUSIPCO 2021