Showing 1–9 of 9 results for author: Fragkoulis, M

  1. arXiv:2403.13629

    cs.DC cs.DB

    CheckMate: Evaluating Checkpointing Protocols for Streaming Dataflows

    Authors: George Siachamis, Kyriakos Psarakis, Marios Fragkoulis, Arie van Deursen, Paris Carbone, Asterios Katsifodimos

    Abstract: Stream processing in the last decade has seen broad adoption in both commercial and research settings. One key element for this success is the ability of modern stream processors to handle failures while ensuring exactly-once processing guarantees. At the moment of writing, virtually all stream processors that guarantee exactly-once processing implement a variant of Apache Flink's coordinated chec…

    Submitted 20 March, 2024; originally announced March 2024.

  2. arXiv:2402.01339

    cs.IR

    Improving Sequential Recommendations with LLMs

    Authors: Artun Boz, Wouter Zorgdrager, Zoe Kotti, Jesse Harte, Panos Louridas, Dietmar Jannach, Marios Fragkoulis

    Abstract: The sequential recommendation problem has attracted considerable research attention in the past few years, leading to the rise of numerous recommendation models. In this work, we explore how Large Language Models (LLMs), which are nowadays introducing disruptive effects in many AI-based applications, can be used to build or improve sequential recommendation approaches. Specifically, we design thre…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 33 pages, 12 figures, 7 tables

  3. arXiv:2312.06893

    cs.DC cs.DB

    Styx: Transactional Stateful Functions on Streaming Dataflows

    Authors: Kyriakos Psarakis, George Siachamis, George Christodoulou, Marios Fragkoulis, Asterios Katsifodimos

    Abstract: Developing stateful cloud applications, such as high-throughput/low-latency workflows and microservices with strict consistency requirements, remains arduous for programmers. The Stateful-Functions-as-a-Service (SFaaS) paradigm aims to serve these use cases. However, existing approaches either provide serializable transactional guarantees at the level of individual functions or separate applicat…

    Submitted 4 March, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  4. Leveraging Large Language Models for Sequential Recommendation

    Authors: Jesse Harte, Wouter Zorgdrager, Panos Louridas, Asterios Katsifodimos, Dietmar Jannach, Marios Fragkoulis

    Abstract: Sequential recommendation problems have received increasing attention in research during the past few years, leading to the inception of a large variety of algorithmic approaches. In this work, we explore how large language models (LLMs), which are nowadays introducing disruptive effects in many AI-based applications, can be used to build or improve sequential recommendation approaches. Specifical…

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: 9 pages

    Report number: In Seventeenth ACM Conference on Recommender Systems (RecSys '23), September 18--22, 2023, Singapore, Singapore. ACM, New York, NY, USA

  5. arXiv:2206.12733

    cs.DB

    SiMa: Effective and Efficient Matching Across Data Silos Using Graph Neural Networks

    Authors: Christos Koutras, Rihan Hai, Kyriakos Psarakis, Marios Fragkoulis, Asterios Katsifodimos

    Abstract: How can we leverage existing column relationships within silos, to predict similar ones across silos? Can we do this efficiently and effectively? Existing matching approaches do not exploit prior knowledge, relying on prohibitively expensive similarity computations. In this paper we present the first technique for matching columns across data silos, called SiMa, which leverages Graph Neural Networ…

    Submitted 3 March, 2024; v1 submitted 25 June, 2022; originally announced June 2022.

  6. arXiv:2112.00710

    cs.DC cs.DB

    Stateful Entities: Object-oriented Cloud Applications as Distributed Dataflows

    Authors: Kyriakos Psarakis, Wouter Zorgdrager, Marios Fragkoulis, Guido Salvaneschi, Asterios Katsifodimos

    Abstract: Although the cloud has reached a state of robustness, the burden of using its resources falls on the shoulders of programmers who struggle to keep up with ever-growing cloud infrastructure services and abstractions. As a result, state management, scaling, operation, and failure management of scalable cloud applications, require disproportionately more effort than developing the applications' actua…

    Submitted 3 September, 2023; v1 submitted 17 November, 2021; originally announced December 2021.

  7. arXiv:2103.10169

    cs.DC cs.DB

    Hazelcast Jet: Low-latency Stream Processing at the 99.99th Percentile

    Authors: Can Gencer, Marko Topolnik, Viliam Ďurina, Emin Demirci, Ensar B. Kahveci, Ali Gürbüz, Ondřej Lukáš, József Bartók, Grzegorz Gierlach, František Hartman, Ufuk Yılmaz, Mehmet Doğan, Mohamed Mandouh, Marios Fragkoulis, Asterios Katsifodimos

    Abstract: Jet is an open-source, high-performance, distributed stream processor built at Hazelcast during the last five years. Jet was engineered with millisecond latency on the 99.99th percentile as its primary design goal. Originally Jet's purpose was to be an execution engine that performs complex business logic on top of streams generated by Hazelcast's In-memory Data Grid (IMDG): a set of high-performa…

    Submitted 18 March, 2021; originally announced March 2021.

  8. arXiv:2010.07386

    cs.DB

    Valentine: Evaluating Matching Techniques for Dataset Discovery

    Authors: Christos Koutras, George Siachamis, Andra Ionescu, Kyriakos Psarakis, Jerry Brons, Marios Fragkoulis, Christoph Lofi, Angela Bonifati, Asterios Katsifodimos

    Abstract: Data scientists today search large data lakes to discover and integrate datasets. In order to bring together disparate data sources, dataset discovery methods rely on some form of schema matching: the process of establishing correspondences between datasets. Traditionally, schema matching has been used to find matching pairs of columns between a source and a target schema. However, the use of sche…

    Submitted 13 February, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

  9. arXiv:2008.00842

    cs.DC cs.CL cs.DB cs.PF

    A Survey on the Evolution of Stream Processing Systems

    Authors: Marios Fragkoulis, Paris Carbone, Vasiliki Kalavri, Asterios Katsifodimos

    Abstract: Stream processing has been an active research field for more than 20 years, but it is now witnessing its prime time due to recent successful efforts by the research community and numerous worldwide open-source communities. This survey provides a comprehensive overview of fundamental aspects of stream processing systems and their evolution in the functional areas of out-of-order data management, st…

    Submitted 14 January, 2023; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: 30 pages, 10 figures, 6 tables