Showing 1–5 of 5 results for author: Wilkinson, S R

  1. arXiv:2308.09004

    cs.DC cs.AI cs.DB cs.LG

    Towards Lightweight Data Integration using Multi-workflow Provenance and Data Observability

    Authors: Renan Souza, Tyler J. Skluzacek, Sean R. Wilkinson, Maxim Ziatdinov, Rafael Ferreira da Silva

    Abstract: Modern large-scale scientific discovery requires multidisciplinary collaboration across diverse computing facilities, including High Performance Computing (HPC) machines and the Edge-to-Cloud continuum. Integrated data analysis plays a crucial role in scientific discovery, especially in the current AI era, by enabling Responsible AI development, FAIR, Reproducibility, and User Steering. However, t…

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 10 pages, 5 figures, 2 Listings, 42 references, Paper accepted at IEEE eScience'23

    MSC Class: 65Y05; 68P15 ACM Class: I.2; H.2; C.4; J.2

    Journal ref: 19th IEEE International Conference on e-Science (eScience) 2023 - Limassol, Cyprus

  2. Pseudonymization at Scale: OLCF's Summit Usage Data Case Study

    Authors: Ketan Maheshwari, Sean R. Wilkinson, Alex May, Tyler Skluzacek, Olga A. Kuchar, Rafael Ferreira da Silva

    Abstract: The analysis of vast amounts of data and the processing of complex computational jobs have traditionally relied upon high performance computing (HPC) systems. Understanding these analyses' needs is paramount for designing solutions that can lead to better science, and similarly, understanding the characteristics of the user behavior on those systems is important for improving user experiences on H…

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: 9 pages, 5 figures, accepted to BTSD 2022 workshop (see https://sites.google.com/view/btsd2022 for more information), to be published in the proceedings of IEEE Big Data 2022

  3. WfBench: Automated Generation of Scientific Workflow Benchmarks

    Authors: Tainã Coleman, Henri Casanova, Ketan Maheshwari, Loïc Pottier, Sean R. Wilkinson, Justin Wozniak, Frédéric Suter, Mallikarjun Shankar, Rafael Ferreira da Silva

    Abstract: The prevalence of scientific workflows with high computational demands calls for their execution on various distributed computing platforms, including large-scale leadership-class high-performance computing (HPC) clusters. To handle the deployment, monitoring, and optimization of workflow executions, many workflow systems have been developed over the past decade. There is a need for workflow bench…

    Submitted 6 October, 2022; originally announced October 2022.

  4. F*** workflows: when parts of FAIR are missing

    Authors: Sean R. Wilkinson, Greg Eisenhauer, Anuj J. Kapadia, Kathryn Knight, Jeremy Logan, Patrick Widener, Matthew Wolf

    Abstract: The FAIR principles for scientific data (Findable, Accessible, Interoperable, Reusable) are also relevant to other digital objects such as research software and scientific workflows that operate on scientific data. The FAIR principles can be applied to the data being handled by a scientific workflow as well as the processes, software, and other infrastructure which are necessary to specify and exe…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: 6 pages, 0 figures, accepted to ERROR 2022 workshop (see https://error-workshop.org/ for more information), to be published in proceedings of IEEE eScience 2022

  5. Unveiling User Behavior on Summit Login Nodes as a User

    Authors: Sean R. Wilkinson, Ketan Maheshwari, Rafael Ferreira da Silva

    Abstract: We observe and analyze usage of the login nodes of the leadership class Summit supercomputer from the perspective of an ordinary user -- not a system administrator -- by periodically sampling user activities (job queues, running processes, etc.) for two full years (2020-2021). Our findings unveil key usage patterns that evidence misuse of the system, including gaming the policies, impairing I/O pe…

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: International Conference on Computational Science (ICCS), 2022