Skip to main content

Showing 1–1 of 1 results for author: Kondratyuk, D

Searching in archive eess. Search in all archives.
.
  1. arXiv:2305.06324  [pdf, other

    cs.CV cs.AI cs.LG cs.MM eess.IV

    Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception

    Authors: Hassan Akbari, Dan Kondratyuk, Yin Cui, Rachel Hornung, Huisheng Wang, Hartwig Adam

    Abstract: We present Integrated Multimodal Perception (IMP), a simple and scalable multimodal multi-task training and modeling approach. IMP integrates multimodal inputs including image, video, text, and audio into a single Transformer encoder with minimal modality-specific components. IMP makes use of a novel design that combines Alternating Gradient Descent (AGD) and Mixture-of-Experts (MoE) for efficient… ▽ More

    Submitted 11 December, 2023; v1 submitted 10 May, 2023; originally announced May 2023.