Chiori Hori Ph.D., Jue Wang and Irfan Essa are qualified to endorse.

End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features

Chiori Hori Ph.D.: Is registered as an author of this paper.
Can endorse for cs.CL, cs.CV, cs.MM, cs.SD.
Jue Wang: Is registered as an author of this paper.
Can endorse for cs.CV.
Irfan Essa: Is registered as an author of this paper.
Can endorse for cs.AI, cs.CL, cs.CV, cs.LG, cs.RO, cs.SD.

Huda Alamri, Gordon Wichern, Takaaki Hori, Anoop Cherian, Tim K. Marks, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Dhruv Batra and Devi Parikh are not registered as owners of this paper.