Skip to main content

Showing 1–10 of 10 results for author: Beranek, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.12612  [pdf, other

    cs.AR

    A System Development Kit for Big Data Applications on FPGA-based Clusters: The EVEREST Approach

    Authors: Christian Pilato, Subhadeep Banik, Jakub Beranek, Fabien Brocheton, Jeronimo Castrillon, Riccardo Cevasco, Radim Cmar, Serena Curzel, Fabrizio Ferrandi, Karl F. A. Friebel, Antonella Galizia, Matteo Grasso, Paulo Silva, Jan Martinovic, Gianluca Palermo, Michele Paolino, Andrea Parodi, Antonio Parodi, Fabio Pintus, Raphael Polig, David Poulet, Francesco Regazzoni, Burkhard Ringlein, Roberto Rocco, Katerina Slaninova , et al. (6 additional authors not shown)

    Abstract: Modern big data workflows are characterized by computationally intensive kernels. The simulated results are often combined with knowledge extracted from AI models to ultimately support decision-making. These energy-hungry workflows are increasingly executed in data centers with energy-efficient hardware accelerators since FPGAs are well-suited for this task due to their inherent parallelism. We pr… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: Accepted for presentation at DATE 2024 (multi-partner project session)

  2. Tunable and Portable Extreme-Scale Drug Discovery Platform at Exascale: the LIGATE Approach

    Authors: Gianluca Palermo, Gianmarco Accordi, Davide Gadioli, Emanuele Vitali, Cristina Silvano, Bruno Guindani, Danilo Ardagna, Andrea R. Beccari, Domenico Bonanni, Carmine Talarico, Filippo Lunghini, Jan Martinovic, Paulo Silva, Ada Bohm, Jakub Beranek, Jan Krenek, Branislav Jansik, Luigi Crisci, Biagio, Cosenza, Peter Thoman, Philip Salzmann, Thomas Fahringer, Leila Alexander, Gerardo Tauriello , et al. (10 additional authors not shown)

    Abstract: Today digital revolution is having a dramatic impact on the pharmaceutical industry and the entire healthcare system. The implementation of machine learning, extreme-scale computer simulations, and big data analytics in the drug design and development process offers an excellent opportunity to lower the risk of investment and reduce the time to the patient. Within the LIGATE project, we aim to i… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: Paper Accepted to the 20th ACM International Conference on Computing Frontiers (CF'23)

  3. Analysis of Workflow Schedulers in Simulated Distributed Environments

    Authors: Jakub Beránek, Stanislav Böhm, Vojtěch Cima

    Abstract: Task graphs provide a simple way to describe scientific workflows (sets of tasks with dependencies) that can be executed on both HPC clusters and in the cloud. An important aspect of executing such graphs is the used scheduling algorithm. Many scheduling heuristics have been proposed in existing works; nevertheless, they are often tested in oversimplified environments. We provide an extensible sim… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

  4. arXiv:2104.07582  [pdf, other

    cs.AR cs.DC cs.DS cs.PF

    SISA: Set-Centric Instruction Set Architecture for Graph Mining on Processing-in-Memory Systems

    Authors: Maciej Besta, Raghavendra Kanakagiri, Grzegorz Kwasniewski, Rachata Ausavarungnirun, Jakub Beránek, Konstantinos Kanellopoulos, Kacper Janda, Zur Vonarburg-Shmaria, Lukas Gianinazzi, Ioana Stefan, Juan Gómez Luna, Marcin Copik, Lukas Kapp-Schwoerer, Salvatore Di Girolamo, Marek Konieczny, Nils Blach, Onur Mutlu, Torsten Hoefler

    Abstract: Simple graph algorithms such as PageRank have been the target of numerous hardware accelerators. Yet, there also exist much more complex graph mining algorithms for problems such as clustering or maximal clique listing. These algorithms are memory-bound and thus could be accelerated by hardware techniques such as Processing-in-Memory (PIM). However, they also come with nonstraightforward paralleli… ▽ More

    Submitted 25 October, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: Proceedings of the 54th IEEE/ACM International Symposium on Microarchitecture (MICRO'21), 2021

  5. arXiv:2103.03653  [pdf, other

    cs.DC cs.CV cs.DS cs.MS cs.PF

    GraphMineSuite: Enabling High-Performance and Programmable Graph Mining Algorithms with Set Algebra

    Authors: Maciej Besta, Zur Vonarburg-Shmaria, Yannick Schaffner, Leonardo Schwarz, Grzegorz Kwasniewski, Lukas Gianinazzi, Jakub Beranek, Kacper Janda, Tobias Holenstein, Sebastian Leisinger, Peter Tatkowski, Esref Ozdemir, Adrian Balla, Marcin Copik, Philipp Lindenberger, Pavel Kalvoda, Marek Konieczny, Onur Mutlu, Torsten Hoefler

    Abstract: We propose GraphMineSuite (GMS): the first benchmarking suite for graph mining that facilitates evaluating and constructing high-performance graph mining algorithms. First, GMS comes with a benchmark specification based on extensive literature review, prescribing representative problems, algorithms, and datasets. Second, GMS offers a carefully designed software platform for seamless testing of dif… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

    Journal ref: International Conference on Very Large Data Bases (VLDB), 2021

  6. Runtime vs Scheduler: Analyzing Dask's Overheads

    Authors: Stanislav Böhm, Jakub Beránek

    Abstract: Dask is a distributed task framework which is commonly used by data scientists to parallelize Python code on computing clusters with little programming effort. It uses a sophisticated work-stealing scheduler which has been hand-tuned to execute task graphs as efficiently as possible. But is scheduler optimization a worthwhile effort for Dask? Our paper shows on many real world task graphs that eve… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

  7. arXiv:2010.03536  [pdf, other

    cs.NI cs.DC

    PsPIN: A high-performance low-power architecture for flexible in-network compute

    Authors: Salvatore Di Girolamo, Andreas Kurth, Alexandru Calotoiu, Thomas Benz, Timo Schneider, Jakub Beránek, Luca Benini, Torsten Hoefler

    Abstract: The capacity of offloading data and control tasks to the network is becoming increasingly important, especially if we consider the faster growth of network speed when compared to CPU frequencies. In-network compute alleviates the host CPU load by running tasks directly in the network, enabling additional computation/communication overlap and potentially improving overall application performance. H… ▽ More

    Submitted 1 June, 2021; v1 submitted 7 October, 2020; originally announced October 2020.

  8. Haydi: Rapid Prototy** and Combinatorial Objects

    Authors: Stanislav Böhm, Jakub Beránek, Martin Šurkovský

    Abstract: Haydi (http://haydi.readthedocs.io) is a framework for generating discrete structures. It provides a way to define a structure from basic building blocks and then enumerate all elements, all non-isomorphic elements, or generate random elements in the structure. Haydi is designed as a tool for rapid prototy**. It is implemented as a pure Python package and supports execution in distributed enviro… ▽ More

    Submitted 27 September, 2019; originally announced September 2019.

    Journal ref: Foundations of Information and Knowledge Systems - 10th International Symposium, FoIKS 2018, Budapest, Hungary, May 14-18, 2018, Proceedings. Lecture Notes in Computer Science 10833, Springer 2018, ISBN 978-3-319-90049-0

  9. Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware

    Authors: Tiziano De Matteis, Johannes de Fine Licht, Jakub Beránek, Torsten Hoefler

    Abstract: Distributed memory programming is the established paradigm used in high-performance computing (HPC) systems, requiring explicit communication between nodes and devices. When FPGAs are deployed in distributed settings, communication is typically handled either by going through the host machine, sacrificing performance, or by streaming across fixed device-to-device connections, sacrificing flexibili… ▽ More

    Submitted 7 September, 2019; originally announced September 2019.

  10. Network-Accelerated Non-Contiguous Memory Transfers

    Authors: Salvatore Di Girolamo, Konstantin Taranov, Andreas Kurth, Michael Schaffner, Timo Schneider, Jakub Beránek, Maciej Besta, Luca Benini, Duncan Roweth, Torsten Hoefler

    Abstract: Applications often communicate data that is non-contiguous in the send- or the receive-buffer, e.g., when exchanging a column of a matrix stored in row-major order. While non-contiguous transfers are well supported in HPC (e.g., MPI derived datatypes), they can still be up to 5x slower than contiguous transfers of the same size. As we enter the era of network acceleration, we need to investigate w… ▽ More

    Submitted 22 August, 2019; originally announced August 2019.

    Comments: In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), Nov. 2019