Skip to main content

Showing 1–1 of 1 results for author: Shaulov, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2309.03884  [pdf, other

    cs.SD cs.CL eess.AS

    Zero-Shot Audio Captioning via Audibility Guidance

    Authors: Tal Shaharabany, Ariel Shaulov, Lior Wolf

    Abstract: The task of audio captioning is similar in essence to tasks such as image and video captioning. However, it has received much less attention. We propose three desiderata for captioning audio -- (i) fluency of the generated text, (ii) faithfulness of the generated text to the input audio, and the somewhat related (iii) audibility, which is the quality of being able to be perceived based only on aud… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.