Skip to main content

Showing 1–2 of 2 results for author: Yavuz, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:1509.02256  [pdf, other

    cs.DC

    Matrix Computations and Optimization in Apache Spark

    Authors: Reza Bosagh Zadeh, Xiangrui Meng, Aaron Staple, Burak Yavuz, Li Pu, Shivaram Venkataraman, Evan Sparks, Alexander Ulanov, Matei Zaharia

    Abstract: We describe matrix computations available in the cluster programming framework, Apache Spark. Out of the box, Spark provides abstractions and implementations for distributed matrices and optimization routines using these matrices. When translating single-node algorithms to run on a distributed cluster, we observe that often a simple idea is enough: separating matrix operations from vector operatio… ▽ More

    Submitted 12 July, 2016; v1 submitted 8 September, 2015; originally announced September 2015.

  2. arXiv:1505.06807  [pdf, other

    cs.LG cs.DC cs.MS stat.ML

    MLlib: Machine Learning in Apache Spark

    Authors: Xiangrui Meng, Joseph Bradley, Burak Yavuz, Evan Sparks, Shivaram Venkataraman, Davies Liu, Jeremy Freeman, DB Tsai, Manish Amde, Sean Owen, Doris Xin, Reynold Xin, Michael J. Franklin, Reza Zadeh, Matei Zaharia, Ameet Talwalkar

    Abstract: Apache Spark is a popular open-source platform for large-scale data processing that is well-suited for iterative machine learning tasks. In this paper we present MLlib, Spark's open-source distributed machine learning library. MLlib provides efficient functionality for a wide range of learning settings and includes several underlying statistical, optimization, and linear algebra primitives. Shippe… ▽ More

    Submitted 26 May, 2015; originally announced May 2015.