Showing 1–1 of 1 results for author: Purohit, A

  1. arXiv:2204.00598  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language

    Authors: Andy Zeng, Maria Attarian, Brian Ichter, Krzysztof Choromanski, Adrian Wong, Stefan Welker, Federico Tombari, Aveek Purohit, Michael Ryoo, Vikas Sindhwani, Johnny Lee, Vincent Vanhoucke, Pete Florence

    Abstract: Large pretrained (e.g., "foundation") models exhibit distinct capabilities depending on the domain of data they are trained on. While these domains are generic, they may only barely overlap. For example, visual-language models (VLMs) are trained on Internet-scale image captions, but large language models (LMs) are further trained on Internet-scale text with no images (e.g., spreadsheets, SAT quest…

    Submitted 27 May, 2022; v1 submitted 1 April, 2022; originally announced April 2022.

    Comments: https://socraticmodels.github.io/