Skip to main content

Showing 1–5 of 5 results for author: Condie, T

.
  1. arXiv:1807.02957  [pdf, other

    cs.DB cs.LO cs.PL

    Scaling-Up Reasoning and Advanced Analytics on BigData

    Authors: Tyson Condie, Ariyam Das, Matteo Interlandi, Alexander Shkapsky, Mohan Yang, Carlo Zaniolo

    Abstract: BigDatalog is an extension of Datalog that achieves performance and scalability on both Apache Spark and multicore systems to the point that its graph analytics outperform those written in GraphX. Looking back, we see how this realizes the ambitious goal pursued by deductive database researchers beginning forty years ago: this is the goal of combining the rigor and power of logic in expressing que… ▽ More

    Submitted 9 July, 2018; originally announced July 2018.

    Comments: Under consideration in Theory and Practice of Logic Programming (TPLP)

  2. arXiv:1707.05681  [pdf, other

    cs.DB

    Fixpoint Semantics and Optimization of Recursive Datalog Programs with Aggregates

    Authors: Carlo Zaniolo, Mohan Yang, Matteo Interlandi, Ariyam Das, Alexander Shkapsky, Tyson Condie

    Abstract: A very desirable Datalog extension investigated by many researchers in the last thirty years consists in allowing the use of the basic SQL aggregates min, max, count and sum in recursive rules. In this paper, we propose a simple comprehensive solution that extends the declarative least-fixpoint semantics of Horn Clauses, along with the optimization techniques used in the bottom-up implementation a… ▽ More

    Submitted 21 July, 2017; v1 submitted 18 July, 2017; originally announced July 2017.

    Comments: Paper presented at the 33nd International Conference on Logic Programming (ICLP 2017), Melbourne, Australia, August 28 to September 1, 2017. 16 pages, LaTeX (arXiv:1707.05681)

  3. arXiv:1407.0455  [pdf, other

    cs.DB

    Pregelix: Big(ger) Graph Analytics on A Dataflow Engine

    Authors: Yingyi Bu, Vinayak Borkar, Jianfeng Jia, Michael J. Carey, Tyson Condie

    Abstract: There is a growing need for distributed graph processing systems that are capable of gracefully scaling to very large graph datasets. Unfortunately, this challenge has not been easily met due to the intense memory pressure imposed by process-centric, message passing designs that many graph processing systems follow. Pregelix is a new open source distributed graph processing system that is based on… ▽ More

    Submitted 2 July, 2014; originally announced July 2014.

  4. arXiv:1303.3517  [pdf, other

    cs.DC cs.DB cs.LG

    Iterative MapReduce for Large Scale Machine Learning

    Authors: Joshua Rosen, Neoklis Polyzotis, Vinayak Borkar, Yingyi Bu, Michael J. Carey, Markus Weimer, Tyson Condie, Raghu Ramakrishnan

    Abstract: Large datasets ("Big Data") are becoming ubiquitous because the potential value in deriving insights from data, across a wide range of business and scientific applications, is increasingly recognized. In particular, machine learning - one of the foundational disciplines for data analysis, summarization and inference - on Big Data has become routine at most organizations that operate large clouds,… ▽ More

    Submitted 13 March, 2013; originally announced March 2013.

  5. arXiv:1203.0160  [pdf, other

    cs.DB cs.LG cs.PF

    Scaling Datalog for Machine Learning on Big Data

    Authors: Yingyi Bu, Vinayak Borkar, Michael J. Carey, Joshua Rosen, Neoklis Polyzotis, Tyson Condie, Markus Weimer, Raghu Ramakrishnan

    Abstract: In this paper, we present the case for a declarative foundation for data-intensive machine learning systems. Instead of creating a new system for each specific flavor of machine learning task, or hardcoding new optimizations, we argue for the use of recursive queries to program a variety of machine learning systems. By taking this approach, database query optimization techniques can be utilized to… ▽ More

    Submitted 2 March, 2012; v1 submitted 1 March, 2012; originally announced March 2012.