Skip to main content

Showing 1–16 of 16 results for author: Kindratenko, V

Searching in archive cs. Search in all archives.
.
  1. Training Next Generation AI Users and Developers at NCSA

    Authors: Daniel S. Katz, Volodymyr Kindratenko, Olena Kindratenko, Priyam Mazumdar

    Abstract: This article focuses on training work carried out in artificial intelligence (AI) at the National Center for Supercomputing Applications (NCSA) at the University of Illinois Urbana-Champaign via a research experience for undergraduates (REU) program named FoDOMMaT. It also describes why we are interested in AI, and concludes by discussing what we've learned from running this program and its predec… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2402.12271  [pdf, other

    cs.DC cs.LG

    Secure Federated Learning Across Heterogeneous Cloud and High-Performance Computing Resources -- A Case Study on Federated Fine-tuning of LLaMA 2

    Authors: Zilinghan Li, Shilan He, Pranshu Chaturvedi, Volodymyr Kindratenko, Eliu A Huerta, Kibaek Kim, Ravi Madduri

    Abstract: Federated learning enables multiple data owners to collaboratively train robust machine learning models without transferring large or sensitive local datasets by only sharing the parameters of the locally trained models. In this paper, we elaborate on the design of our Advanced Privacy-Preserving Federated Learning (APPFL) framework, which streamlines end-to-end secure and reliable federated learn… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  3. arXiv:2312.07711  [pdf, other

    cs.AI

    Leveraging Large Language Models to Build and Execute Computational Workflows

    Authors: Alejandro Duque, Abdullah Syed, Kastan V. Day, Matthew J. Berry, Daniel S. Katz, Volodymyr V. Kindratenko

    Abstract: The recent development of large language models (LLMs) with multi-billion parameters, coupled with the creation of user-friendly application programming interfaces (APIs), has paved the way for automatically generating and executing code in response to straightforward human queries. This paper explores how these emerging capabilities can be harnessed to facilitate complex scientific workflows, eli… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  4. arXiv:2309.14675  [pdf, other

    cs.LG cs.DC

    FedCompass: Efficient Cross-Silo Federated Learning on Heterogeneous Client Devices using a Computing Power Aware Scheduler

    Authors: Zilinghan Li, Pranshu Chaturvedi, Shilan He, Han Chen, Gagandeep Singh, Volodymyr Kindratenko, E. A. Huerta, Kibaek Kim, Ravi Madduri

    Abstract: Cross-silo federated learning offers a promising solution to collaboratively train robust and generalized AI models without compromising the privacy of local datasets, e.g., healthcare, financial, as well as scientific projects that lack a centralized data facility. Nonetheless, because of the disparity of computing resources among different clients (i.e., device heterogeneity), synchronous federa… ▽ More

    Submitted 11 March, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: Accepted as poster at The Twelfth International Conference on Learning Representations (ICLR 2024)

  5. arXiv:2309.03367  [pdf, other

    cs.CV cs.AI

    Self-Supervised Masked Digital Elevation Models Encoding for Low-Resource Downstream Tasks

    Authors: Priyam Mazumdar, Aiman Soliman, Volodymyr Kindratenko, Luigi Marini, Kenton McHenry

    Abstract: The lack of quality labeled data is one of the main bottlenecks for training Deep Learning models. As the task increases in complexity, there is a higher penalty for overfitting and unstable learning. The typical paradigm employed today is Self-Supervised learning, where the model attempts to learn from a large corpus of unstructured and unlabeled data and then transfer that knowledge to the requi… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  6. arXiv:2308.08786  [pdf, other

    cs.LG

    APPFLx: Providing Privacy-Preserving Cross-Silo Federated Learning as a Service

    Authors: Zilinghan Li, Shilan He, Pranshu Chaturvedi, Trung-Hieu Hoang, Minseok Ryu, E. A. Huerta, Volodymyr Kindratenko, Jordan Fuhrman, Maryellen Giger, Ryan Chard, Kibaek Kim, Ravi Madduri

    Abstract: Cross-silo privacy-preserving federated learning (PPFL) is a powerful tool to collaboratively train robust and generalized machine learning (ML) models without sharing sensitive (e.g., healthcare of financial) local data. To ease and accelerate the adoption of PPFL, we introduce APPFLx, a ready-to-use platform that provides privacy-preserving cross-silo federated learning as a service. APPFLx empl… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

  7. arXiv:2301.08403  [pdf, other

    cs.LG cs.CR stat.AP stat.ML

    One-shot Generative Data Augmentation with Bounded Divergence for UAV Identification in Limited RF Environments

    Authors: Amir Kazemi, Salar Basiri, Volodymyr Kindratenko, Srinivasa Salapaka

    Abstract: This work addresses the pressing need for cybersecurity in Unmanned Aerial Vehicles (UAVs), particularly focusing on the challenges of identifying UAVs using radiofrequency (RF) fingerprinting in constrained environments. The complexity and variability of RF signals, influenced by environmental interference and hardware imperfections, often render traditional RF-based identification methods ineffe… ▽ More

    Submitted 14 May, 2024; v1 submitted 19 January, 2023; originally announced January 2023.

    Comments: 12 pages, 7 figures, 4 tables

    MSC Class: 49Q22; 68T37 ACM Class: I.2.6; I.5.1

  8. arXiv:2212.05081  [pdf, other

    hep-ex cs.LG physics.comp-ph

    FAIR AI Models in High Energy Physics

    Authors: Javier Duarte, Haoyang Li, Avik Roy, Ruike Zhu, E. A. Huerta, Daniel Diaz, Philip Harris, Raghav Kansal, Daniel S. Katz, Ishaan H. Kavoori, Volodymyr V. Kindratenko, Farouk Mokhtar, Mark S. Neubauer, Sang Eon Park, Melissa Quinnan, Roger Rusack, Zhizhen Zhao

    Abstract: The findable, accessible, interoperable, and reusable (FAIR) data principles provide a framework for examining, evaluating, and improving how data is shared to facilitate scientific discovery. Generalizing these principles to research software and other digital products is an active area of research. Machine learning (ML) models -- algorithms that have been trained on data without being explicitly… ▽ More

    Submitted 29 December, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: 34 pages, 9 figures, 10 tables

    Journal ref: Mach. Learn.: Sci. Technol. 4 (2023) 045062

  9. arXiv:2210.08973  [pdf, ps, other

    cs.CY cs.HC cs.LG hep-ex

    FAIR for AI: An interdisciplinary and international community building perspective

    Authors: E. A. Huerta, Ben Blaiszik, L. Catherine Brinson, Kristofer E. Bouchard, Daniel Diaz, Caterina Doglioni, Javier M. Duarte, Murali Emani, Ian Foster, Geoffrey Fox, Philip Harris, Lukas Heinrich, Shantenu Jha, Daniel S. Katz, Volodymyr Kindratenko, Christine R. Kirkpatrick, Kati Lassila-Perini, Ravi K. Madduri, Mark S. Neubauer, Fotis E. Psomopoulos, Avik Roy, Oliver Rübel, Zhizhen Zhao, Ruike Zhu

    Abstract: A foundational set of findable, accessible, interoperable, and reusable (FAIR) principles were proposed in 2016 as prerequisites for proper data management and stewardship, with the goal of enabling the reusability of scholarly data. The principles were also meant to apply to other digital assets, at a high level, and over time, the FAIR guiding principles have been re-interpreted or extended to i… ▽ More

    Submitted 1 August, 2023; v1 submitted 30 September, 2022; originally announced October 2022.

    Comments: 10 pages, comments welcome!; v2: 12 pages, accepted to Scientific Data

    ACM Class: I.2.0; E.0

    Journal ref: Scientific Data 10, 487 (2023)

  10. arXiv:2209.11477  [pdf, other

    cs.CV

    Weakly Supervised Two-Stage Training Scheme for Deep Video Fight Detection Model

    Authors: Zhenting Qi, Ruike Zhu, Zheyu Fu, Wenhao Chai, Volodymyr Kindratenko

    Abstract: Fight detection in videos is an emerging deep learning application with today's prevalence of surveillance systems and streaming media. Previous work has largely relied on action recognition techniques to tackle this problem. In this paper, we propose a simple but effective method that solves the task from a new perspective: we design the fight detection model as a composition of an action-aware f… ▽ More

    Submitted 23 September, 2022; originally announced September 2022.

    Comments: Accepted by ICTAI 2022

  11. arXiv:2108.07749  [pdf, other

    astro-ph.GA astro-ph.HE cs.LG

    AGNet: Weighing Black Holes with Deep Learning

    Authors: Joshua Yao-Yu Lin, Sneh Pandya, Devanshi Pratap, Xin Liu, Matias Carrasco Kind, Volodymyr Kindratenko

    Abstract: Supermassive black holes (SMBHs) are ubiquitously found at the centers of most massive galaxies. Measuring SMBH mass is important for understanding the origin and evolution of SMBHs. However, traditional methods require spectroscopic data which is expensive to gather. We present an algorithm that weighs SMBHs using quasar light time series, circumventing the need for expensive spectra. We train, v… ▽ More

    Submitted 21 November, 2022; v1 submitted 17 August, 2021; originally announced August 2021.

    Comments: 8 pages, 7 figures, 1 table, Accepted by MNRAS

    Journal ref: Monthly Notices of the Royal Astronomical Society, 2022;, stac3339

  12. arXiv:2108.02214  [pdf, other

    hep-ex cs.AI cs.DB hep-ph

    A FAIR and AI-ready Higgs boson decay dataset

    Authors: Yifan Chen, E. A. Huerta, Javier Duarte, Philip Harris, Daniel S. Katz, Mark S. Neubauer, Daniel Diaz, Farouk Mokhtar, Raghav Kansal, Sang Eon Park, Volodymyr V. Kindratenko, Zhizhen Zhao, Roger Rusack

    Abstract: To enable the reusability of massive scientific datasets by humans and machines, researchers aim to adhere to the principles of findability, accessibility, interoperability, and reusability (FAIR) for data and artificial intelligence (AI) models. This article provides a domain-agnostic, step-by-step assessment guide to evaluate whether or not a given dataset meets these principles. We demonstrate… ▽ More

    Submitted 16 February, 2022; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: 13 pages, 3 figures. v2: Accepted to Nature Scientific Data. Learn about the FAIR4HEP project at https://fair4hep.github.io. See our invited Behind the Paper Blog in Springer Nature Research Data Community at https://go.nature.com/3oMVYxo

    ACM Class: I.2; J.2

    Journal ref: Scientific Data volume 9, Article number: 31 (2022)

  13. arXiv:2012.08545  [pdf, other

    gr-qc astro-ph.IM cs.AI cs.DC

    Accelerated, Scalable and Reproducible AI-driven Gravitational Wave Detection

    Authors: E. A. Huerta, Asad Khan, Xiaobo Huang, Minyang Tian, Maksim Levental, Ryan Chard, Wei Wei, Maeve Heflin, Daniel S. Katz, Volodymyr Kindratenko, Dawei Mu, Ben Blaiszik, Ian Foster

    Abstract: The development of reusable artificial intelligence (AI) models for wider use and rigorous validation by the community promises to unlock new opportunities in multi-messenger astrophysics. Here we develop a workflow that connects the Data and Learning Hub for Science, a repository for publishing AI models, with the Hardware Accelerated Learning (HAL) cluster, using funcX as a universal distributed… ▽ More

    Submitted 9 July, 2021; v1 submitted 15 December, 2020; originally announced December 2020.

    Comments: 17 pages, 5 figures; v2: 12 pages, 6 figures. Accepted to Nature Astronomy. See also the Behind the Paper blog in Nature Astronomy "https://astronomycommunity.nature.com/posts/from-disruption-to-sustained-innovation-artificial-intelligence-for-gravitational-wave-astrophysics"

    MSC Class: 68T01; 68T35; 83C35; 83C57

    Journal ref: Nat Astron 5, 1062-1068 (2021)

  14. arXiv:2003.08394  [pdf, other

    physics.comp-ph astro-ph.IM cs.LG gr-qc

    Convergence of Artificial Intelligence and High Performance Computing on NSF-supported Cyberinfrastructure

    Authors: E. A. Huerta, Asad Khan, Edward Davis, Colleen Bushell, William D. Gropp, Daniel S. Katz, Volodymyr Kindratenko, Seid Koric, William T. C. Kramer, Brendan McGinty, Kenton McHenry, Aaron Saxton

    Abstract: Significant investments to upgrade and construct large-scale scientific facilities demand commensurate investments in R&D to design algorithms and computing approaches to enable scientific and engineering breakthroughs in the big data era. Innovative Artificial Intelligence (AI) applications have powered transformational solutions for big data challenges in industry and technology that now drive a… ▽ More

    Submitted 19 October, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: White paper accepted to the NSF Workshop on Smart Cyberinfrastructure, February 25-27, 2020 http://smartci.sci.utah.edu/. v2: Survey paper accepted to Journal of Big Data

    MSC Class: 68T35; 68M14; 68N15; 68N30 ACM Class: I.2; I.6

    Journal ref: Journal of Big Data volume 7, Article number: 88 (2020)

  15. arXiv:1911.11779  [pdf, other

    gr-qc astro-ph.HE astro-ph.IM cs.LG

    Enabling real-time multi-messenger astrophysics discoveries with deep learning

    Authors: E. A. Huerta, Gabrielle Allen, Igor Andreoni, Javier M. Antelis, Etienne Bachelet, Bruce Berriman, Federica Bianco, Rahul Biswas, Matias Carrasco, Kyle Chard, Minsik Cho, Philip S. Cowperthwaite, Zachariah B. Etienne, Maya Fishbach, Francisco Förster, Daniel George, Tom Gibbs, Matthew Graham, William Gropp, Robert Gruendl, Anushri Gupta, Roland Haas, Sarah Habib, Elise Jennings, Margaret W. G. Johnson , et al. (35 additional authors not shown)

    Abstract: Multi-messenger astrophysics is a fast-growing, interdisciplinary field that combines data, which vary in volume and speed of data processing, from many different instruments that probe the Universe using different cosmic messengers: electromagnetic waves, cosmic rays, gravitational waves and neutrinos. In this Expert Recommendation, we review the key challenges of real-time observations of gravit… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: Invited Expert Recommendation for Nature Reviews Physics. The art work produced by E. A. Huerta and Shawn Rosofsky for this article was used by Carl Conway to design the cover of the October 2019 issue of Nature Reviews Physics

    Journal ref: Nature Reviews Physics volume 1, pages 600-608 (2019)

  16. arXiv:1902.00522  [pdf, ps, other

    astro-ph.IM astro-ph.HE cs.LG gr-qc

    Deep Learning for Multi-Messenger Astrophysics: A Gateway for Discovery in the Big Data Era

    Authors: Gabrielle Allen, Igor Andreoni, Etienne Bachelet, G. Bruce Berriman, Federica B. Bianco, Rahul Biswas, Matias Carrasco Kind, Kyle Chard, Minsik Cho, Philip S. Cowperthwaite, Zachariah B. Etienne, Daniel George, Tom Gibbs, Matthew Graham, William Gropp, Anushri Gupta, Roland Haas, E. A. Huerta, Elise Jennings, Daniel S. Katz, Asad Khan, Volodymyr Kindratenko, William T. C. Kramer, Xin Liu, Ashish Mahabal , et al. (23 additional authors not shown)

    Abstract: This report provides an overview of recent work that harnesses the Big Data Revolution and Large Scale Computing to address grand computational challenges in Multi-Messenger Astrophysics, with a particular emphasis on real-time discovery campaigns. Acknowledging the transdisciplinary nature of Multi-Messenger Astrophysics, this document has been prepared by members of the physics, astronomy, compu… ▽ More

    Submitted 1 February, 2019; originally announced February 2019.

    Comments: 15 pages, no figures. White paper based on the "Deep Learning for Multi-Messenger Astrophysics: Real-time Discovery at Scale" workshop, hosted at NCSA, October 17-19, 2018 http://www.ncsa.illinois.edu/Conferences/DeepLearningLSST/