Skip to main content

Showing 1–1 of 1 results for author: Kollman, P

Searching in archive eess. Search in all archives.
.
  1. arXiv:2207.04156  [pdf, other

    cs.SD cs.CL cs.IR eess.AS

    Automated Audio Captioning and Language-Based Audio Retrieval

    Authors: Clive Gomes, Hye** Park, Patrick Kollman, Yi Song, Iffanice Houndayi, Ankit Shah

    Abstract: This project involved participation in the DCASE 2022 Competition (Task 6) which had two subtasks: (1) Automated Audio Captioning and (2) Language-Based Audio Retrieval. The first subtask involved the generation of a textual description for audio samples, while the goal of the second was to find audio samples within a fixed dataset that match a given description. For both subtasks, the Clotho data… ▽ More

    Submitted 15 May, 2023; v1 submitted 8 July, 2022; originally announced July 2022.

    Comments: DCASE 2022 Competition (Task 6)