Skip to main content

Showing 1–6 of 6 results for author: Koziol, Q

.
  1. arXiv:2206.11992  [pdf

    cs.DC

    The LBNL Superfacility Project Report

    Authors: Deborah Bard, Cory Snavely, Lisa Gerhardt, Jason Lee, Becci Totzke, Katie Antypas, William Arndt, Johannes Blaschke, Suren Byna, Ravi Cheema, Shreyas Cholia, Mark Day, Bjoern Enders, Aditi Gaur, Annette Greiner, Taylor Groves, Mariam Kiran, Quincey Koziol, Tom Lehman, Kelly Rowland, Chris Samuel, Ashwin Selvarajan, Alex Sim, David Skinner, Laurie Stephey , et al. (2 additional authors not shown)

    Abstract: The Superfacility model is designed to leverage HPC for experimental science. It is more than simply a model of connected experiment, network, and HPC facilities; it encompasses the full ecosystem of infrastructure, software, tools, and expertise needed to make connected facilities easy to use. The three-year Lawrence Berkeley National Laboratory (LBNL) Superfacility project was initiated in 2019… ▽ More

    Submitted 27 June, 2022; v1 submitted 23 June, 2022; originally announced June 2022.

    Comments: 85 pages, 23 figures

    Report number: UCPMS ID: 3815358 UCPMS ID: 3815358 UCPMS ID: 3815358 UCPMS ID: 3815358UCPMS ID: 3815358 UCPMS ID: 3815358

  2. arXiv:2205.01168  [pdf, other

    cs.DC

    A Case Study on Parallel HDF5 Dataset Concatenation for High Energy Physics Data Analysis

    Authors: Sunwoo Lee, Kai-yuan Hou, Kewei Wang, Saba Sehrish, Marc Paterno, James Kowalkowski, Quincey Koziol, Robert Ross, Ankit Agrawal, Alok Choudhary, Wei-keng Liao

    Abstract: In High Energy Physics (HEP), experimentalists generate large volumes of data that, when analyzed, helps us better understand the fundamental particles and their interactions. This data is often captured in many files of small size, creating a data management challenge for scientists. In order to better facilitate data management, transfer, and analysis on large scale platforms, it is advantageous… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

  3. arXiv:2105.12929  [pdf, other

    cs.DC

    Characterizing Impacts of Storage Faults on HPC Applications: A Methodology and Insights

    Authors: Bo Fang, Daoce Wang, Sian **, Quincey Koziol, Zhao Zhang, Qiang Guan, Suren Byna, Sriram Krishnamoorthy, Dingwen Tao

    Abstract: In recent years, the increasing complexity in scientific simulations and emerging demands for training heavy artificial intelligence models require massive and fast data accesses, which urges high-performance computing (HPC) platforms to equip with more advanced storage infrastructures such as solid-state disks (SSDs). While SSDs offer high-performance I/O, the reliability challenges faced by the… ▽ More

    Submitted 2 August, 2021; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: 12 pages, 9 figures, 4 tables, accepted by IEEE Cluster'21

  4. arXiv:2007.01789  [pdf, other

    cs.DC

    Map** Datasets to Object Storage System

    Authors: Xiaowei, Chu, Jeff LeFevre, Aldrin Montana, Dana Robinson, Quincey Koziol, Peter Alvaro, Carlos Maltzahn

    Abstract: Access libraries such as ROOT and HDF5 allow users to interact with datasets using high level abstractions, like coordinate systems and associated slicing operations. Unfortunately, the implementations of access libraries are based on outdated assumptions about storage systems interfaces and are generally unable to fully benefit from modern fast storage devices. The situation is getting worse with… ▽ More

    Submitted 3 July, 2020; originally announced July 2020.

    Journal ref: In 24th International Conference on Computing in High Energy & Nuclear Physics, Adelaide, Australia, November 4-8 2019

  5. arXiv:1712.00423  [pdf, other

    cs.DC

    DAOS for Extreme-scale Systems in Scientific Applications

    Authors: M. Scot Breitenfeld, Neil Fortner, Jordan Henderson, Jerome Soumagne, Mohamad Chaarawi, Johann Lombardi, Quincey Koziol

    Abstract: Exascale I/O initiatives will require new and fully integrated I/O models which are capable of providing straightforward functionality, fault tolerance and efficiency. One solution is the Distributed Asynchronous Object Storage (DAOS) technology, which is primarily designed to handle the next generation NVRAM and NVMe technologies envisioned for providing a high bandwidth/IOPS storage tier close t… ▽ More

    Submitted 1 December, 2017; originally announced December 2017.

    Comments: Submitted to HiPC-2017 on Jun 30 2017, accepted for publication on Sep 8 2017, withdrawn on Oct 20 2017 b/c no author was able to present

  6. arXiv:1510.02135  [pdf, other

    cs.DC

    A Remote Procedure Call Approach for Extreme-scale Services

    Authors: Jerome Soumagne, Philip H. Carns, Dries Kimpe, Quincey Koziol, Robert B. Ross

    Abstract: When working at exascale, the various constraints imposed by the extreme scale of the system bring new challenges for application users and software/middleware developers. In that context, and to provide best performance, resiliency and energy efficiency, software may be provided as a service oriented approach, adjusting resource utilization to best meet facility and user requirements. Remote proc… ▽ More

    Submitted 5 October, 2015; originally announced October 2015.

    Comments: CSESSP 2015