Showing 1–14 of 14 results for author: Levis, P

  1. arXiv:2403.04311

    cs.AI cs.CL cs.DC cs.IR

    ALTO: An Efficient Network Orchestrator for Compound AI Systems

    Authors: Keshav Santhanam, Deepti Raghavan, Muhammad Shahir Rahman, Thejas Venkatesh, Neha Kunjal, Pratiksha Thaker, Philip Levis, Matei Zaharia

    Abstract: We present ALTO, a network orchestrator for efficiently serving compound AI systems such as pipelines of language models. ALTO achieves high throughput and low latency by taking advantage of an optimization opportunity specific to generative language models: streaming intermediate outputs. As language models produce outputs token by token, ALTO exposes opportunities to stream intermediate outputs…

    Submitted 7 March, 2024; originally announced March 2024.

  2. arXiv:2108.12720

    cs.NI cs.GR cs.MM

    Towards Retina-Quality VR Video Streaming: 15ms Could Save You 80% of Your Bandwidth

    Authors: Luke Hsiao, Brooke Krajancich, Philip Levis, Gordon Wetzstein, Keith Winstein

    Abstract: Virtual reality systems today cannot yet stream immersive, retina-quality virtual reality video over a network. One of the greatest challenges to this goal is the sheer data rates required to transmit retina-quality video frames at high resolutions and frame rates. Recent work has leveraged the decay of visual acuity in human perception in novel gaze-contingent video compression techniques. In thi…

    Submitted 20 September, 2021; v1 submitted 28 August, 2021; originally announced August 2021.

    Comments: 6 pages, 7 figures; added additional discussion and clarifications; updated acknowledgements

  3. Approximate Partition Selection for Big-Data Workloads using Summary Statistics

    Authors: Kexin Rong, Yao Lu, Peter Bailis, Srikanth Kandula, Philip Levis

    Abstract: Many big-data clusters store data in large partitions that support access at a coarse, partition-level granularity. As a result, approximate query processing via row-level sampling is inefficient, often requiring reads of many partitions. In this work, we seek to answer queries quickly and approximately by reading a subset of the data partitions and combining partial answers in a weighted manner w…

    Submitted 24 August, 2020; originally announced August 2020.

  4. arXiv:2007.13828

    cs.AR

    GRIP: A Graph Neural Network Accelerator Architecture

    Authors: Kevin Kiningham, Christopher Re, Philip Levis

    Abstract: We present GRIP, a graph neural network accelerator architecture designed for low-latency inference. Accelerating GNNs is challenging because they combine two distinct types of computation: arithmetic-intensive vertex-centric operations and memory-intensive edge-centric operations. GRIP splits GNN inference into a fixed set of edge- and vertex-centric execution phases that can be implemented in har…

    Submitted 30 July, 2020; v1 submitted 27 July, 2020; originally announced July 2020.

  5. arXiv:1906.01113

    cs.NI

    Learning in situ: a randomized experiment in video streaming

    Authors: Francis Y. Yan, Hudson Ayers, Chenzhi Zhu, Sadjad Fouladi, James Hong, Keyi Zhang, Philip Levis, Keith Winstein

    Abstract: We describe the results of a randomized controlled trial of video-streaming algorithms for bitrate selection and network prediction. Over the last eight months, we have streamed 14.2 years of video to 56,000 users across the Internet. Sessions are randomized in blinded fashion among algorithms, and client telemetry is recorded for analysis. We found that in this real-world setting, it is difficu…

    Submitted 23 September, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

    Journal ref: USENIX NSDI (2020) 495-511

  6. arXiv:1806.10751

    cs.NI

    Design Considerations for Low Power Internet Protocols

    Authors: Hudson Ayers, Paul Crews, Hubert Teo, Conor McAvity, Amit Levy, Philip Levis

    Abstract: Over the past 10 years, low-power wireless networks have transitioned to supporting IPv6 connectivity through 6LoWPAN, a set of standards which specify how to aggressively compress IPv6 packets over low-power wireless links such as 802.15.4. We find that different low-power IPv6 stacks are unable to communicate using 6LoWPAN, and therefore IP, due to design tradeoffs between code size and energy…

    Submitted 21 January, 2020; v1 submitted 27 June, 2018; originally announced June 2018.

  7. Smart Contracts for Machine-to-Machine Communication: Possibilities and Limitations

    Authors: Yuichi Hanada, Luke Hsiao, Philip Levis

    Abstract: Blockchain technologies, such as smart contracts, present a unique interface for machine-to-machine communication that provides a secure, append-only record that can be shared without trust and without a central administrator. We study the possibilities and limitations of using smart contracts for machine-to-machine communication by designing, implementing, and evaluating AGasP, an application for…

    Submitted 7 January, 2019; v1 submitted 1 June, 2018; originally announced June 2018.

  8. arXiv:1803.09835

    cs.DB

    Locality-Sensitive Hashing for Earthquake Detection: A Case Study of Scaling Data-Driven Science

    Authors: Kexin Rong, Clara E. Yoon, Karianne J. Bergen, Hashem Elezabi, Peter Bailis, Philip Levis, Gregory C. Beroza

    Abstract: In this work, we report on a novel application of Locality Sensitive Hashing (LSH) to seismic data at scale. Based on the high waveform similarity between reoccurring earthquakes, our application identifies potential earthquakes by searching for similar time series segments via LSH. However, a straightforward implementation of this LSH-enabled application has difficulty scaling beyond 3 months of…

    Submitted 23 July, 2018; v1 submitted 26 March, 2018; originally announced March 2018.

  9. arXiv:1705.01662

    cs.DC

    Execution Templates: Caching Control Plane Decisions for Strong Scaling of Data Analytics

    Authors: Omid Mashayekhi, Hang Qu, Chinmayee Shah, Philip Levis

    Abstract: Control planes of cloud frameworks trade off between scheduling granularity and performance. Centralized systems schedule at task granularity, but only schedule a few thousand tasks per second. Distributed systems schedule hundreds of thousands of tasks per second but changing the schedule is costly. We present execution templates, a control plane abstraction that can schedule hundreds of thousa…

    Submitted 3 May, 2017; originally announced May 2017.

    Comments: To appear at USENIX ATC 2017

  10. Fonduer: Knowledge Base Construction from Richly Formatted Data

    Authors: Sen Wu, Luke Hsiao, Xiao Cheng, Braden Hancock, Theodoros Rekatsinas, Philip Levis, Christopher Ré

    Abstract: We focus on knowledge base construction (KBC) from richly formatted data. In contrast to KBC from text or tabular data, KBC from richly formatted data aims to extract relations conveyed jointly via textual, structural, tabular, and visual expressions. We introduce Fonduer, a machine-learning-based KBC system for richly formatted data. Fonduer presents a new data model that accounts for three chall…

    Submitted 1 March, 2018; v1 submitted 15 March, 2017; originally announced March 2017.

    Journal ref: SIGMOD 2018

  11. arXiv:1606.01972

    cs.DC

    Scalable, Fast Cloud Computing with Execution Templates

    Authors: Omid Mashayekhi, Hang Qu, Chinmayee Shah, Philip Levis

    Abstract: Large scale cloud data analytics applications are often CPU bound. Most of these cycles are wasted: benchmarks written in C++ run 10-51 times faster than frameworks such as Naiad and Spark. However, calling faster implementations from those frameworks only sees moderate (3-5x) speedups because their control planes cannot schedule work fast enough. This paper presents execution templates, a contr…

    Submitted 6 June, 2016; originally announced June 2016.

  12. arXiv:1606.01966

    cs.DC

    Distributed Graphical Simulation in the Cloud

    Authors: Omid Mashayekhi, Chinmayee Shah, Hang Qu, Andrew Lim, Philip Levis

    Abstract: Graphical simulations are a cornerstone of modern media and films. But existing software packages are designed to run on HPC nodes, and perform poorly in the computing cloud. These simulations have complex data access patterns over complex data structures, and mutate data arbitrarily, and so are a poor fit for existing cloud computing systems. We describe a software architecture for running graphi…

    Submitted 6 June, 2016; originally announced June 2016.

  13. arXiv:1602.01412   

    cs.DC

    Canary: A Scheduling Architecture for High Performance Cloud Computing

    Authors: Hang Qu, Omid Mashayekhi, David Terei, Philip Levis

    Abstract: We present Canary, a scheduling architecture that allows high performance analytics workloads to scale out to run on thousands of cores. Canary is motivated by the observation that a central scheduler is a bottleneck for high performance codes: a handful of multicore workers can execute tasks faster than a controller can schedule them. The key insight in Canary is to reverse the responsibilities…

    Submitted 14 April, 2016; v1 submitted 3 February, 2016; originally announced February 2016.

    Comments: We have some presentation issues with the paper

  14. arXiv:1506.07577

    cs.GR

    Ebb: A DSL for Physical Simulation on CPUs and GPUs

    Authors: Gilbert Louis Bernstein, Chinmayee Shah, Crystal Lemire, Zachary DeVito, Matthew Fisher, Philip Levis, Pat Hanrahan

    Abstract: Designing programming environments for physical simulation is challenging because simulations rely on diverse algorithms and geometric domains. These challenges are compounded when we try to run efficiently on heterogeneous parallel architectures. We present Ebb, a domain-specific language (DSL) for simulation, that runs efficiently on both CPUs and GPUs. Unlike previous DSLs, Ebb uses a three-lay…

    Submitted 24 February, 2016; v1 submitted 24 June, 2015; originally announced June 2015.