Skip to main content

Showing 1–8 of 8 results for author: Podstawski, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.05085  [pdf, other

    cs.CL cs.AI cs.IR

    Multi-Head RAG: Solving Multi-Aspect Problems with LLMs

    Authors: Maciej Besta, Ales Kubicek, Roman Niggli, Robert Gerstenberger, Lucas Weitzendorf, Mingyuan Chi, Patrick Iff, Joanna Gajda, Piotr Nyczyk, Jürgen Müller, Hubert Niewiadomski, Marcin Chrapek, Michał Podstawski, Torsten Hoefler

    Abstract: Retrieval Augmented Generation (RAG) enhances the abilities of Large Language Models (LLMs) by enabling the retrieval of documents into the LLM context to provide more accurate and relevant responses. Existing RAG solutions do not focus on queries that may require fetching multiple documents with substantially different contents. Such queries occur frequently, but are challenging because the embed… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  2. Graph of Thoughts: Solving Elaborate Problems with Large Language Models

    Authors: Maciej Besta, Nils Blach, Ales Kubicek, Robert Gerstenberger, Michal Podstawski, Lukas Gianinazzi, Joanna Gajda, Tomasz Lehmann, Hubert Niewiadomski, Piotr Nyczyk, Torsten Hoefler

    Abstract: We introduce Graph of Thoughts (GoT): a framework that advances prompting capabilities in large language models (LLMs) beyond those offered by paradigms such as Chain-of-Thought or Tree of Thoughts (ToT). The key idea and primary advantage of GoT is the ability to model the information generated by an LLM as an arbitrary graph, where units of information ("LLM thoughts") are vertices, and edges co… ▽ More

    Submitted 6 February, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence 2024 (AAAI'24)

  3. The Graph Database Interface: Scaling Online Transactional and Analytical Graph Workloads to Hundreds of Thousands of Cores

    Authors: Maciej Besta, Robert Gerstenberger, Marc Fischer, Michał Podstawski, Nils Blach, Berke Egeli, Georgy Mitenkov, Wojciech Chlapek, Marek Michalewicz, Hubert Niewiadomski, Jürgen Müller, Torsten Hoefler

    Abstract: Graph databases (GDBs) are crucial in academic and industry applications. The key challenges in develo** GDBs are achieving high performance, scalability, programmability, and portability. To tackle these challenges, we harness established practices from the HPC landscape to build a system that outperforms all past GDBs presented in the literature by orders of magnitude, for both OLTP and OLAP w… ▽ More

    Submitted 20 November, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Best Paper Finalist at ACM Supercomputing '23 (SC '23)

    Journal ref: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, 2023 (SC '23)

  4. arXiv:2209.09732  [pdf, other

    cs.LG cs.DB

    Neural Graph Databases

    Authors: Maciej Besta, Patrick Iff, Florian Scheidl, Kazuki Osawa, Nikoli Dryden, Michal Podstawski, Tiancheng Chen, Torsten Hoefler

    Abstract: Graph databases (GDBs) enable processing and analysis of unstructured, complex, rich, and usually vast graph datasets. Despite the large significance of GDBs in both academia and industry, little effort has been made into integrating them with the predictive power of graph neural networks (GNNs). In this work, we show how to seamlessly combine nearly any GNN model with the computational capabiliti… ▽ More

    Submitted 24 November, 2022; v1 submitted 20 September, 2022; originally announced September 2022.

    Journal ref: Learning on Graphs (LOG) 2022

  5. arXiv:2208.11469  [pdf, other

    cs.DC cs.DS

    ProbGraph: High-Performance and High-Accuracy Graph Mining with Probabilistic Set Representations

    Authors: Maciej Besta, Cesare Miglioli, Paolo Sylos Labini, Jakub Tětek, Patrick Iff, Raghavendra Kanakagiri, Saleh Ashkboos, Kacper Janda, Michal Podstawski, Grzegorz Kwasniewski, Niels Gleinig, Flavio Vella, Onur Mutlu, Torsten Hoefler

    Abstract: Important graph mining problems such as Clustering are computationally demanding. To significantly accelerate these problems, we propose ProbGraph: a graph representation that enables simple and fast approximate parallel graph mining with strong theoretical guarantees on work, depth, and result accuracy. The key idea is to represent sets of vertices using probabilistic set representations such as… ▽ More

    Submitted 21 November, 2022; v1 submitted 24 August, 2022; originally announced August 2022.

    Comments: Best Paper Award at ACM/IEEE Supercomputing'22 (SC22)

    Journal ref: Proceedings of the ACM/IEEE International Conference on High Performance Computing, Networking, Storage and Analysis, November 2022

  6. arXiv:2012.14132  [pdf, other

    cs.DC

    SeBS: A Serverless Benchmark Suite for Function-as-a-Service Computing

    Authors: Marcin Copik, Grzegorz Kwasniewski, Maciej Besta, Michal Podstawski, Torsten Hoefler

    Abstract: Function-as-a-Service (FaaS) is one of the most promising directions for the future of cloud services, and serverless functions have immediately become a new middleware for building scalable and cost-efficient microservices and applications. However, the quickly moving technology hinders reproducibility, and the lack of a standardized benchmarking suite leads to ad-hoc solutions and microbenchmark… ▽ More

    Submitted 1 July, 2021; v1 submitted 28 December, 2020; originally announced December 2020.

    Comments: Extended version of the paper accepted at Middleware 2021

  7. arXiv:2010.16012  [pdf, other

    cs.DC cs.DS

    To Push or To Pull: On Reducing Communication and Synchronization in Graph Computations

    Authors: Maciej Besta, Michal Podstawski, Linus Groner, Edgar Solomonik, Torsten Hoefler

    Abstract: We reduce the cost of communication and synchronization in graph processing by analyzing the fastest way to process graphs: pushing the updates to a shared state or pulling the updates to a private state.We investigate the applicability of this push-pull dichotomy to various algorithms and its impact on complexity, performance, and the amount of used locks, atomics, and reads/writes. We consider 1… ▽ More

    Submitted 29 October, 2020; originally announced October 2020.

    Journal ref: Proceedings of the 26th ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC'17), 2017

  8. arXiv:1910.09017  [pdf, other

    cs.DB cs.DC

    Demystifying Graph Databases: Analysis and Taxonomy of Data Organization, System Designs, and Graph Queries

    Authors: Maciej Besta, Robert Gerstenberger, Emanuel Peter, Marc Fischer, Michał Podstawski, Claude Barthels, Gustavo Alonso, Torsten Hoefler

    Abstract: Graph processing has become an important part of multiple areas of computer science, such as machine learning, computational sciences, medical applications, social network analysis, and many others. Numerous graphs such as web or social networks may contain up to trillions of edges. Often, these graphs are also dynamic (their structure changes over time) and have domain-specific rich data associat… ▽ More

    Submitted 30 August, 2023; v1 submitted 20 October, 2019; originally announced October 2019.

    Comments: A shorter version of the survey was published in the journal ACM Computing Surveys