-
arXiv:2406.15516 [pdf, ps, other]
System Description for the Displace Speaker Diarization Challenge 2023
Abstract: This paper describes our solution for the Diarization of Speaker and Language in Conversational Environments Challenge (Displace 2023). We used a combination of VAD for finding segfments with speech, Resnet architecture based CNN for feature extraction from these segments, and spectral clustering for features clustering. Even though it was not trained with using Hindi, the described algorithm achi… ▽ More
Submitted 20 June, 2024; originally announced June 2024.