Lorenzo Rosa

Cascade: A Platform for Delay-Sensitive Edge Intelligence

Nov 29, 2023

Weijia Song, Thiago Garrett, Yuting Yang, Mingzhao Liu, Edward Tremel, Lorenzo Rosa, Andrea Merlina, Roman Vitenberg, Ken Birman

Abstract: Interactive intelligent computing applications are increasingly prevalent, creating a need for AI/ML platforms optimized to reduce per-event latency while maintaining high throughput and efficient resource management. Yet many intelligent applications run on AI/ML platforms that optimize for high throughput even at the cost of high tail-latency. Cascade is a new AI/ML hosting platform intended to untangle this puzzle. Innovations include a legacy-friendly storage layer that moves data with minimal copying and a "fast path" that collocates data and computation to maximize responsiveness. Our evaluation shows that Cascade reduces latency by orders of magnitude with no loss of throughput.

* 14 pages, 12 Figures
