Skip to main content

Showing 1–5 of 5 results for author: Abadi, D J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.08901  [pdf, other

    cs.DB cs.LG

    Bullion: A Column Store for Machine Learning

    Authors: Gang Liao, Ye Liu, Jianjun Chen, Daniel J. Abadi

    Abstract: The past two decades have witnessed columnar storage revolutionizing data warehousing and analytics. However, the rapid growth of machine learning poses new challenges to this domain. This paper presents Bullion, a columnar storage system tailored for machine learning workloads. Bullion addresses the complexities of data compliance, optimizes the encoding of long sequence sparse features, efficien… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

  2. arXiv:2312.16735  [pdf, other

    cs.DB cs.DC

    Flock: A Low-Cost Streaming Query Engine on FaaS Platforms

    Authors: Gang Liao, Amol Deshpande, Daniel J. Abadi

    Abstract: Existing serverless data analytics systems rely on external storage services like S3 for data shuffling and communication between cloud functions. While this approach provides the elasticity benefits of serverless computing, it incurs additional latency and cost overheads. We present Flock, a novel cloud-native streaming query engine that leverages the on-demand scalability of FaaS platforms for r… ▽ More

    Submitted 21 April, 2024; v1 submitted 27 December, 2023; originally announced December 2023.

  3. arXiv:2207.02746  [pdf, other

    cs.DB

    C5: Cloned Concurrency Control that Always Keeps Up

    Authors: Jeffrey Helt, Abhinav Sharma, Daniel J. Abadi, Wyatt Lloyd, Jose M. Faleiro

    Abstract: Asynchronously replicated primary-backup databases are commonly deployed to improve availability and offload read-only transactions. To both apply replicated writes from the primary and serve read-only transactions, the backups implement a cloned concurrency control protocol. The protocol ensures read-only transactions always return a snapshot of state that previously existed on the primary. This… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: 14 pages, 12 figures

  4. arXiv:1512.06168  [pdf, ps, other

    cs.DB cs.DC

    Design Principles for Scaling Multi-core OLTP Under High Contention

    Authors: Kun Ren, Jose M. Faleiro, Daniel J. Abadi

    Abstract: Although significant recent progress has been made in improving the multi-core scalability of high throughput transactional database systems, modern systems still fail to achieve scalable throughput for workloads involving frequent access to highly contended data. Most of this inability to achieve high throughput is explained by the fundamental constraints involved in guaranteeing ACID --- the add… ▽ More

    Submitted 5 January, 2016; v1 submitted 18 December, 2015; originally announced December 2015.

  5. arXiv:1412.2324  [pdf, ps, other

    cs.DB

    Rethinking serializable multiversion concurrency control

    Authors: Jose M. Faleiro, Daniel J. Abadi

    Abstract: Multi-versioned database systems have the potential to significantly increase the amount of concurrency in transaction processing because they can avoid read-write conflicts. Unfortunately, the increase in concurrency usually comes at the cost of transaction serializability. If a database user requests full serializability, modern multi-versioned systems significantly constrain read-write concurre… ▽ More

    Submitted 2 December, 2015; v1 submitted 7 December, 2014; originally announced December 2014.