Skip to main content

Showing 1–17 of 17 results for author: Rellermeyer, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.18355  [pdf

    cs.IR cs.DB cs.DS

    COPR -- Efficient, large-scale log storage and retrieval

    Authors: Julian Reichinger, Thomas Krismayer, Jan Rellermeyer

    Abstract: Modern, large scale monitoring systems have to process and store vast amounts of log data in near real-time. At query time the systems have to find relevant logs based on the content of the log message using support structures that can scale to these amounts of data while still being efficient to use. We present our novel Compressed Probabilistic Retrieval algorithm (COPR), capable of answering Mu… ▽ More

    Submitted 27 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: 14 pages, 8 figures

    ACM Class: H.3.1

  2. arXiv:2401.14093  [pdf, other

    cs.SE cs.LG

    McUDI: Model-Centric Unsupervised Degradation Indicator for Failure Prediction AIOps Solutions

    Authors: Lorena Poenaru-Olaru, Luis Cruz, Jan Rellermeyer, Arie van Deursen

    Abstract: Due to the continuous change in operational data, AIOps solutions suffer from performance degradation over time. Although periodic retraining is the state-of-the-art technique to preserve the failure prediction AIOps models' performance over time, this technique requires a considerable amount of labeled data to retrain. In AIOps obtaining label data is expensive since it requires the availability… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  3. Is Your Anomaly Detector Ready for Change? Adapting AIOps Solutions to the Real World

    Authors: Lorena Poenaru-Olaru, Natalia Karpova, Luis Cruz, Jan Rellermeyer, Arie van Deursen

    Abstract: Anomaly detection techniques are essential in automating the monitoring of IT systems and operations. These techniques imply that machine learning algorithms are trained on operational data corresponding to a specific period of time and that they are continuously evaluated on newly emerging data. Operational data is constantly changing over time, which affects the performance of deployed anomaly d… ▽ More

    Submitted 11 April, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

  4. Log Parsing Evaluation in the Era of Modern Software Systems

    Authors: Stefan Petrescu, Floris den Hengst, Alexandru Uta, Jan S. Rellermeyer

    Abstract: Due to the complexity and size of modern software systems, the amount of logs generated is tremendous. Hence, it is infeasible to manually investigate these data in a reasonable time, thereby requiring automating log analysis to derive insights about the functioning of the systems. Motivated by an industry use-case, we zoom-in on one integral part of automated log analysis, log parsing, which is t… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

  5. arXiv:2211.13098  [pdf, other

    cs.LG cs.AI

    Are Concept Drift Detectors Reliable Alarming Systems? -- A Comparative Study

    Authors: Lorena Poenaru-Olaru, Luis Cruz, Arie van Deursen, Jan S. Rellermeyer

    Abstract: As machine learning models increasingly replace traditional business logic in the production system, their lifecycle management is becoming a significant concern. Once deployed into production, the machine learning models are constantly evaluated on new streaming data. Given the continuous data flow, shifting data, also known as concept drift, is ubiquitous in such settings. Concept drift usually… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

  6. arXiv:2206.03259  [pdf

    cs.CY

    Future Computer Systems and Networking Research in the Netherlands: A Manifesto

    Authors: Alexandru Iosup, Fernando Kuipers, Ana Lucia Varbanescu, Paola Grosso, Animesh Trivedi, Jan Rellermeyer, Lin Wang, Alexandru Uta, Francesco Regazzoni

    Abstract: Our modern society and competitive economy depend on a strong digital foundation and, in turn, on sustained research and innovation in computer systems and networks (CompSys). With this manifesto, we draw attention to CompSys as a vital part of ICT. Among ICT technologies, CompSys covers all the hardware and all the operational software layers that enable applications; only application-specific de… ▽ More

    Submitted 26 May, 2022; originally announced June 2022.

    Comments: Position paper: 7 foundational research themes in computer science and networking research, 4 advances with outstanding impact on society, 10 recommendations, 50 pages. Co-signatories from (alphabetical order): ASTRON, CWI, Gaia-X NL, NIKHEF, RU Groningen, SIDN Labs, Solvinity, SURF, TNO, TU/e, TU Delft, UvA, U. Leiden, U. Twente, VU Amsterdam

    ACM Class: A.1; A.m; C.0; D.4; J.0; K.3; K.4; K.6

  7. arXiv:2112.06280  [pdf, other

    cs.DC

    In-Memory Indexed Caching for Distributed Data Processing

    Authors: Alexandru Uta, Bogdan Ghit, Ankur Dave, Jan Rellermeyer, Peter Boncz

    Abstract: Powerful abstractions such as dataframes are only as efficient as their underlying runtime system. The de-facto distributed data processing framework, Apache Spark, is poorly suited for the modern cloud-based data-science workloads due to its outdated assumptions: static datasets analyzed using coarse-grained transformations. In this paper, we introduce the Indexed DataFrame, an in-memory cache th… ▽ More

    Submitted 8 February, 2022; v1 submitted 12 December, 2021; originally announced December 2021.

    Comments: Accepted for publication at IEEE IPDPS 2022

  8. arXiv:2112.00616  [pdf, other

    cs.DC cs.AI

    Roadmap for Edge AI: A Dagstuhl Perspective

    Authors: Aaron Yi Ding, Ella Peltonen, Tobias Meuser, Atakan Aral, Christian Becker, Schahram Dustdar, Thomas Hiessl, Dieter Kranzlmuller, Madhusanka Liyanage, Setareh Magshudi, Nitinder Mohan, Joerg Ott, Jan S. Rellermeyer, Stefan Schulte, Henning Schulzrinne, Gurkan Solmaz, Sasu Tarkoma, Blesson Varghese, Lars Wolf

    Abstract: Based on the collective input of Dagstuhl Seminar (21342), this paper presents a comprehensive discussion on AI methods and capabilities in the context of edge computing, referred as Edge AI. In a nutshell, we envision Edge AI to provide adaptation for data-driven applications, enhance network and radio access, and allow the creation, optimization, and deployment of distributed AI/ML pipelines wit… ▽ More

    Submitted 27 November, 2021; originally announced December 2021.

    Comments: for ACM SIGCOMM CCR

    ACM Class: I.2.11

  9. A Fresh Look at the Architecture and Performance of Contemporary Isolation Platforms

    Authors: Vincent van Rijn, Jan S. Rellermeyer

    Abstract: With the ever-increasing pervasiveness of the cloud computing paradigm, strong isolation guarantees and low performance overhead from isolation platforms are paramount. An ideal isolation platform offers both: an impermeable isolation boundary while imposing a negligible performance overhead. In this paper, we examine various isolation platforms (containers, secure containers, hypervisors, unikern… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

    Journal ref: Proceedings of the 22nd ACM/IFIP International Middleware Conference, 2022

  10. arXiv:2106.06972  [pdf, other

    cs.AI

    RCURRENCY: Live Digital Asset Trading Using a Recurrent Neural Network-based Forecasting System

    Authors: Yapeng Jasper Hu, Ralph van Gurp, Ashay Somai, Hugo Kooijman, Jan S. Rellermeyer

    Abstract: Consistent alpha generation, i.e., maintaining an edge over the market, underpins the ability of asset traders to reliably generate profits. Technical indicators and trading strategies are commonly used tools to determine when to buy/hold/sell assets, yet these are limited by the fact that they operate on known values. Over the past decades, multiple studies have investigated the potential of arti… ▽ More

    Submitted 13 June, 2021; originally announced June 2021.

  11. arXiv:2103.10248  [pdf, other

    cs.LG cs.AI cs.SE

    Systematic Map** Study on the Machine Learning Lifecycle

    Authors: Yuanhao Xie, Luís Cruz, Petra Heck, Jan S. Rellermeyer

    Abstract: The development of artificial intelligence (AI) has made various industries eager to explore the benefits of AI. There is an increasing amount of research surrounding AI, most of which is centred on the development of new AI algorithms and techniques. However, the advent of AI is bringing an increasing set of practical problems related to AI model lifecycle management that need to be investigated.… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

    Comments: Accepted at WAIN21: 1st Workshop on AI Engineering - Software Engineering for AI

    MSC Class: 68T01 (Primary) ACM Class: D.2.9; I.2.5

  12. arXiv:2012.03550  [pdf, other

    cs.DC

    SGD_Tucker: A Novel Stochastic Optimization Strategy for Parallel Sparse Tucker Decomposition

    Authors: Hao Li, Zixuan Li, Kenli Li, Jan S. Rellermeyer, Lydia Y. Chen, Keqin Li

    Abstract: Sparse Tucker Decomposition (STD) algorithms learn a core tensor and a group of factor matrices to obtain an optimal low-rank representation feature for the \underline{H}igh-\underline{O}rder, \underline{H}igh-\underline{D}imension, and \underline{S}parse \underline{T}ensor (HOHDST). However, existing STD algorithms face the problem of intermediate variables explosion which results from the fact t… ▽ More

    Submitted 8 December, 2020; v1 submitted 7 December, 2020; originally announced December 2020.

  13. arXiv:1912.09789  [pdf

    cs.LG cs.DC stat.ML

    A Survey on Distributed Machine Learning

    Authors: Joost Verbraeken, Matthijs Wolting, Jonathan Katzy, Jeroen Kloppenburg, Tim Verbelen, Jan S. Rellermeyer

    Abstract: The demand for artificial intelligence has grown significantly over the last decade and this growth has been fueled by advances in machine learning techniques and the ability to leverage hardware acceleration. However, in order to increase the quality of predictions and render machine learning solutions feasible for more complex applications, a substantial amount of training data is required. Alth… ▽ More

    Submitted 20 December, 2019; originally announced December 2019.

  14. arXiv:1912.09256  [pdf, other

    cs.PF cs.DC

    Is Big Data Performance Reproducible in Modern Cloud Networks?

    Authors: Alexandru Uta, Alexandru Custura, Dmitry Duplyakin, Ivo Jimenez, Jan Rellermeyer, Carlos Maltzahn, Robert Ricci, Alexandru Iosup

    Abstract: Performance variability has been acknowledged as a problem for over a decade by cloud practitioners and performance engineers. Yet, our survey of top systems conferences reveals that the research community regularly disregards variability when running experiments in the cloud. Focusing on networks, we assess the impact of variability on cloud-based big-data workloads by gathering traces from mains… ▽ More

    Submitted 19 December, 2019; originally announced December 2019.

    Comments: 12 pages paper, 3 pages references

  15. arXiv:1906.10496  [pdf

    cs.DC

    The Coming Age of Pervasive Data Processing

    Authors: Jan S. Rellermeyer, Sobhan Omranian Khorasani, Dan Graur, Apourva Parthasarathy

    Abstract: Emerging Big Data analytics and machine learning applications require a significant amount of computational power. While there exists a plethora of large-scale data processing frameworks which thrive in handling the various complexities of data-intensive workloads, the ever-increasing demand of applications have made us reconsider the traditional ways of scaling (e.g., scale-out) and seek new oppo… ▽ More

    Submitted 21 June, 2019; originally announced June 2019.

    Comments: ISPDC 2019

  16. Container Density Improvements with Dynamic Memory Extension using NAND Flash

    Authors: Jan S. Rellermeyer, Maher Amer, Richard Smutzer, Karthick Rajamani

    Abstract: While containers efficiently implement the idea of operating-system-level application virtualization, they are often insufficient to increase the server utilization to a desirable level. The reason is that in practice many containerized applications experience a limited amount of load while there are few containers with a high load. In such a scenario, the virtual memory management system can beco… ▽ More

    Submitted 24 June, 2019; originally announced June 2019.

    Comments: APSys 2018

  17. arXiv:1802.06270  [pdf, other

    cs.DC

    MAVIS: Managing Datacenters using Smartphones

    Authors: Raghav Shankar, Benjamin Kobin, Saurabh Bagchi, Michael Kistler, Jan Rellermeyer

    Abstract: Distributed monitoring plays a crucial role in managing the activities of cloud-based datacenters. System administrators have long relied on monitoring systems such as Nagios and Ganglia to obtain status alerts on their desktop-class machines. However, the popularity of mobile devices is pushing the community to develop datacenter monitoring solutions for smartphone-class devices. Here we lay out… ▽ More

    Submitted 17 February, 2018; originally announced February 2018.

    Comments: ACM Classification (2012): Data center networks; System management; Ubiquitous and mobile computing systems and tools