Skip to main content

Showing 1–4 of 4 results for author: Go, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.00060  [pdf, other

    cs.CL cs.LG

    Cascade-Aware Training of Language Models

    Authors: Congchao Wang, Sean Augenstein, Keith Rush, Wittawat Jitkrittum, Harikrishna Narasimhan, Ankit Singh Rawat, Aditya Krishna Menon, Alec Go

    Abstract: Reducing serving cost and latency is a fundamental concern for the deployment of language models (LMs) in business applications. To address this, cascades of LMs offer an effective solution that conditionally employ smaller models for simpler queries. Cascaded systems are typically built with independently trained models, neglecting the advantages of considering inference-time interactions of the… ▽ More

    Submitted 29 May, 2024; originally announced June 2024.

    Comments: 22 pages, 13 figures

  2. arXiv:2405.15668  [pdf, other

    cs.CV

    What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models

    Authors: Abdelrahman Abdelhamed, Mahmoud Afifi, Alec Go

    Abstract: Large language models (LLMs) has been effectively used for many computer vision tasks, including image classification. In this paper, we present a simple yet effective approach for zero-shot image classification using multimodal LLMs. By employing multimodal LLMs, we generate comprehensive textual representations from input images. These textual representations are then utilized to generate fixed-… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  3. arXiv:2010.04904  [pdf, other

    cs.CV cs.AI cs.LG

    Multi-path Neural Networks for On-device Multi-domain Visual Classification

    Authors: Qifei Wang, Junjie Ke, Joshua Greaves, Grace Chu, Gabriel Bender, Luciano Sbaiz, Alec Go, Andrew Howard, Feng Yang, Ming-Hsuan Yang, Jeff Gilbert, Peyman Milanfar

    Abstract: Learning multiple domains/tasks with a single model is important for improving data efficiency and lowering inference cost for numerous vision tasks, especially on resource-constrained mobile devices. However, hand-crafting a multi-domain/task model can be both tedious and challenging. This paper proposes a novel approach to automatically learn a multi-path network for multi-domain visual classifi… ▽ More

    Submitted 8 January, 2021; v1 submitted 10 October, 2020; originally announced October 2020.

    Comments: WACV 2021

  4. arXiv:1804.03230  [pdf, other

    cs.CV

    NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applications

    Authors: Tien-Ju Yang, Andrew Howard, Bo Chen, Xiao Zhang, Alec Go, Mark Sandler, Vivienne Sze, Hartwig Adam

    Abstract: This work proposes an algorithm, called NetAdapt, that automatically adapts a pre-trained deep neural network to a mobile platform given a resource budget. While many existing algorithms simplify networks based on the number of MACs or weights, optimizing those indirect metrics may not necessarily reduce the direct metrics, such as latency and energy consumption. To solve this problem, NetAdapt in… ▽ More

    Submitted 28 September, 2018; v1 submitted 9 April, 2018; originally announced April 2018.

    Comments: Accepted by ECCV 2018