Showing 1–1 of 1 results for author: Tremel, E

  1. arXiv:2311.17329

    cs.OS cs.AI

    Cascade: A Platform for Delay-Sensitive Edge Intelligence

    Authors: Weijia Song, Thiago Garrett, Yuting Yang, Mingzhao Liu, Edward Tremel, Lorenzo Rosa, Andrea Merlina, Roman Vitenberg, Ken Birman

    Abstract: Interactive intelligent computing applications are increasingly prevalent, creating a need for AI/ML platforms optimized to reduce per-event latency while maintaining high throughput and efficient resource management. Yet many intelligent applications run on AI/ML platforms that optimize for high throughput even at the cost of high tail-latency. Cascade is a new AI/ML hosting platform intended to…

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 14 pages, 12 Figures