Skip to main content

Showing 1–1 of 1 results for author: Pan, K W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19593  [pdf, other

    cs.CL cs.CV

    SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs

    Authors: Xin Su, Man Luo, Kris W Pan, Tien Pei Chou, Vasudev Lal, Phillip Howard

    Abstract: Synthetic data generation has gained significant attention recently for its utility in training large vision and language models. However, the application of synthetic data to the training of multimodal context-augmented generation systems has been relatively unexplored. This gap in existing work is important because existing vision and language models (VLMs) are not trained specifically for conte… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.