
Showing 1–1 of 1 results for author: Ananthanarayanan, G

  1. arXiv:2405.19213  [pdf, other]

    eess.SY cs.AI cs.LG cs.NI

    HawkVision: Low-Latency Modeless Edge AI Serving

    Authors: ChonLam Lao, Jiaqi Gao, Ganesh Ananthanarayanan, Aditya Akella, Minlan Yu

    Abstract: Modeless ML inference is growing in popularity because it hides the complexity of model inference from users and caters to diverse user and application accuracy requirements. Previous work mostly focuses on modeless inference in data centers. To provide low-latency inference, we promote modeless inference at the edge in this paper. The edge environment introduces additional c…

    Submitted 29 May, 2024; originally announced May 2024.