Skip to main content

Showing 1–1 of 1 results for author: Nolte, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00110  [pdf, other

    cs.DC cs.AI

    Chat AI: A Seamless Slurm-Native Solution for HPC-Based Services

    Authors: Ali Doosthosseini, Jonathan Decker, Hendrik Nolte, Julian M. Kunkel

    Abstract: The increasing adoption of large language models (LLMs) has created a pressing need for an efficient, secure and private serving infrastructure, which allows researchers to run open-source or custom fine-tuned LLMs and ensures users that their data remains private and is not stored without their consent. While high-performance computing (HPC) systems equipped with state-of-the-art GPUs are well-su… ▽ More

    Submitted 27 June, 2024; originally announced July 2024.

    Comments: 27 pages, 5 figures, 2 tables