We gratefully acknowledge support from
the Simons Foundation and member institutions.

Yuma Koizumi is qualified to endorse.

Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval

Yuma Koizumi: Is registered as an author of this paper.
Can endorse for cs.CL, cs.LG, cs.SD, eess.AS, eess.SP, stat.ML. (why?)

Yasunori Ohishi, Daisuke Niizumi, Daiki Takeuchi and Masahiro Yasuda are not registered as owners of this paper. (why?)