Skip to main content

Showing 1–4 of 4 results for author: Sparks, E R

Searching in archive cs. Search in all archives.
.
  1. arXiv:1610.09451  [pdf, other

    cs.LG cs.DC

    KeystoneML: Optimizing Pipelines for Large-Scale Advanced Analytics

    Authors: Evan R. Sparks, Shivaram Venkataraman, Tomer Kaftan, Michael J. Franklin, Benjamin Recht

    Abstract: Modern advanced analytics applications make use of machine learning techniques and contain multiple steps of domain-specific and general-purpose processing with high resource requirements. We present KeystoneML, a system that captures and optimizes the end-to-end large-scale machine learning applications for high-throughput training in a distributed environment with a high-level API. This approach… ▽ More

    Submitted 29 October, 2016; originally announced October 2016.

  2. arXiv:1603.03336  [pdf, other

    cs.LG stat.ME

    Scalable Linear Causal Inference for Irregularly Sampled Time Series with Long Range Dependencies

    Authors: Francois W. Belletti, Evan R. Sparks, Michael J. Franklin, Alexandre M. Bayen, Joseph E. Gonzalez

    Abstract: Linear causal analysis is central to a wide range of important application spanning finance, the physical sciences, and engineering. Much of the existing literature in linear causal analysis operates in the time domain. Unfortunately, the direct application of time domain linear causal analysis to many real-world time series presents three critical challenges: irregular temporal sampling, long ran… ▽ More

    Submitted 10 March, 2016; originally announced March 2016.

  3. arXiv:1502.00068  [pdf, other

    cs.DB cs.DC cs.LG

    TuPAQ: An Efficient Planner for Large-scale Predictive Analytic Queries

    Authors: Evan R. Sparks, Ameet Talwalkar, Michael J. Franklin, Michael I. Jordan, Tim Kraska

    Abstract: The proliferation of massive datasets combined with the development of sophisticated analytical techniques have enabled a wide variety of novel applications such as improved product recommendations, automatic image tagging, and improved speech-driven interfaces. These and many other applications can be supported by Predictive Analytic Queries (PAQs). A major obstacle to supporting PAQs is the chal… ▽ More

    Submitted 8 March, 2015; v1 submitted 30 January, 2015; originally announced February 2015.

  4. arXiv:1310.5426  [pdf, other

    cs.LG cs.DC stat.ML

    MLI: An API for Distributed Machine Learning

    Authors: Evan R. Sparks, Ameet Talwalkar, Virginia Smith, Jey Kottalam, Xinghao Pan, Joseph Gonzalez, Michael J. Franklin, Michael I. Jordan, Tim Kraska

    Abstract: MLI is an Application Programming Interface designed to address the challenges of building Machine Learn- ing algorithms in a distributed setting based on data-centric computing. Its primary goal is to simplify the development of high-performance, scalable, distributed algorithms. Our initial results show that, relative to existing systems, this interface can be used to build distributed implement… ▽ More

    Submitted 25 October, 2013; v1 submitted 21 October, 2013; originally announced October 2013.