Exploiting Parallel Audio Recordings to Enforce Device Invariance in CNN-based Acoustic Scene Classification

Primus, Paul; Eghbal-zadeh, Hamid; Eitelsebner, David; Koutini, Khaled; Arzt, Andreas; Widmer, Gerhard

Electrical Engineering and Systems Science > Audio and Speech Processing

arXiv:1909.02869 (eess)

[Submitted on 4 Sep 2019]

Title:Exploiting Parallel Audio Recordings to Enforce Device Invariance in CNN-based Acoustic Scene Classification

Authors:Paul Primus, Hamid Eghbal-zadeh, David Eitelsebner, Khaled Koutini, Andreas Arzt, Gerhard Widmer

View PDF

Abstract:Distribution mismatches between the data seen at training and at application time remain a major challenge in all application areas of machine learning. We study this problem in the context of machine listening (Task 1b of the DCASE 2019 Challenge). We propose a novel approach to learn domain-invariant classifiers in an end-to-end fashion by enforcing equal hidden layer representations for domain-parallel samples, i.e. time-aligned recordings from different recording devices. No classification labels are needed for our domain adaptation (DA) method, which makes the data collection process cheaper.

Comments:	Published at the Workshop on Detection and Classification of Acoustic Scenes and Events, 25-26 October 2019, New York, USA
Subjects:	Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
Cite as:	arXiv:1909.02869 [eess.AS]
	(or arXiv:1909.02869v1 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.1909.02869

Submission history

From: Paul Primus [view email]
[v1] Wed, 4 Sep 2019 16:19:50 UTC (636 KB)

Full-text links:

Access Paper:

view license

Current browse context:

eess.AS

< prev | next >

new | recent | 2019-09

Change to browse by:

cs
cs.LG
cs.SD
eess
stat
stat.ML

References & Citations

export BibTeX citation

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Exploiting Parallel Audio Recordings to Enforce Device Invariance in CNN-based Acoustic Scene Classification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Exploiting Parallel Audio Recordings to Enforce Device Invariance in CNN-based Acoustic Scene Classification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators