Skip to main content

Showing 1–3 of 3 results for author: Tatemura, J

.
  1. arXiv:1304.1838  [pdf, other

    cs.DB cs.DC cs.PF

    Towards a Workload for Evolutionary Analytics

    Authors: Jeff LeFevre, Jagan Sankaranarayanan, Hakan Hacigumus, Junichi Tatemura, Neoklis Polyzotis

    Abstract: Emerging data analysis involves the ingestion and exploration of new data sets, application of complex functions, and frequent query revisions based on observing prior query answers. We call this new type of analysis evolutionary analytics and identify its properties. This type of analysis is not well represented by current benchmark workloads. In this paper, we present a workload and identify sev… ▽ More

    Submitted 27 June, 2013; v1 submitted 5 April, 2013; originally announced April 2013.

    Comments: 10 pages

    Journal ref: DanaC: Workshop on Data analytics in the Cloud, June 2013, New York, NY

  2. arXiv:1303.6609  [pdf, other

    cs.DB cs.DC cs.DS

    Exploiting Opportunistic Physical Design in Large-scale Data Analytics

    Authors: Jeff LeFevre, Jagan Sankaranarayanan, Hakan Hacigumus, Junichi Tatemura, Neoklis Polyzotis, Michael J. Carey

    Abstract: Large-scale systems, such as MapReduce and Hadoop, perform aggressive materialization of intermediate job results in order to support fault tolerance. When jobs correspond to exploratory queries submitted by data analysts, these materializations yield a large set of materialized views that typically capture common computation among successive queries from the same analyst, or even across queries o… ▽ More

    Submitted 10 December, 2013; v1 submitted 26 March, 2013; originally announced March 2013.

    Comments: 15 pages

  3. arXiv:1201.0226  [pdf, other

    cs.DB

    Towards Cost-Effective Storage Provisioning for DBMSs

    Authors: Ning Zhang, Junichi Tatemura, Jignesh M. Patel, Hakan Hacıgümüş

    Abstract: Data center operators face a bewildering set of choices when considering how to provision resources on machines with complex I/O subsystems. Modern I/O subsystems often have a rich mix of fast, high performing, but expensive SSDs sitting alongside with cheaper but relatively slower (for random accesses) traditional hard disk drives. The data center operators need to determine how to provision the… ▽ More

    Submitted 31 December, 2011; originally announced January 2012.

    Comments: VLDB2012

    Journal ref: Proceedings of the VLDB Endowment (PVLDB), Vol. 5, No. 4, pp. 274-285 (2011)