Increasing Efficiency and Result Reliability of Continuous Benchmarking for FaaS Applications

Authors: Tim C. Rese, Nils Japke, Sebastian Koch, Tobias Pfandzelter, David Bermbach

Abstract: In a continuous deployment setting, Function-as-a-Service (FaaS) applications frequently receive updated releases, each of which can cause a performance regression. While continuous benchmarking, i.e., comparing benchmark results of the updated and the previous version, can detect such regressions, performance variability of FaaS platforms necessitates thousands of function calls, thus making continuous benchmarking time-intensive and expensive. In this paper, we propose DuetFaaS, an approach which adapts duet benchmarking to FaaS applications. With DuetFaaS, we deploy two versions of a FaaS function in a single cloud function instance and execute them in parallel to reduce the impact of platform variability. We evaluate our approach against state-of-the-art approaches, running on AWS Lambda. Overall, DuetFaaS requires fewer invocations to accurately detect performance regressions than other state-of-the-art approaches. In 99.65% of evaluated cases, our approach provides smaller confidence interval sizes than the compared approaches, and can reduce the size by up to 98.23%.

Submitted 24 May, 2024; originally announced May 2024.

arXiv:2405.14413 [pdf, other]

GeoFaaS: An Edge-to-Cloud FaaS Platform

Authors: Mohammadreza Malekabbasi, Tobias Pfandzelter, Trever Schirmer, David Bermbach

Abstract: The massive growth of mobile and IoT devices demands geographically distributed computing systems for optimal performance, privacy, and scalability. However, existing edge-to-cloud serverless platforms lack location awareness, resulting in inefficient network usage and increased latency. In this paper, we propose GeoFaaS, a novel edge-to-cloud Function-as-a-Service (FaaS) platform that leverages real-time client location information for transparent request execution on the nearest available FaaS node. If needed, GeoFaaS transparently offloads requests to the cloud when edge resources are overloaded, thus ensuring consistent execution without user intervention. GeoFaaS has a modular and decentralized architecture: building on the single-node FaaS system tinyFaaS, GeoFaaS works as a stand-alone edge-to-cloud FaaS platform but can also integrate and act as a routing layer for existing FaaS services, e.g., in the cloud. To evaluate our approach, we implemented an open-source proof-of-concept prototype and studied performance and fault-tolerance behavior in experiments.

Submitted 23 May, 2024; originally announced May 2024.

Comments: 6 pages, 6 figures

arXiv:2405.13594 [pdf, other]

GeoFF: Federated Serverless Workflows with Data Pre-Fetching

Authors: Valentin Carl, Trever Schirmer, Tobias Pfandzelter, David Bermbach

Abstract: Function-as-a-Service (FaaS) is a popular cloud computing model in which applications are implemented as workflows of multiple independent functions. While cloud providers usually offer composition services for such workflows, they do not support cross-platform workflows, forcing developers to hardcode the composition logic. Furthermore, FaaS workflows tend to be slow due to cascading cold starts, inter-function latency, and data download latency on the critical path. In this paper, we propose GeoFF, a serverless choreography middleware that executes FaaS workflows across different public and private FaaS platforms, including ad-hoc workflow recomposition. Furthermore, GeoFF supports function pre-warming and data pre-fetching. This minimizes end-to-end workflow latency by taking cold starts and data download latency off the critical path. In experiments with our proof-of-concept prototype and a realistic application, we were able to reduce end-to-end latency by more than 50%.

Submitted 22 May, 2024; originally announced May 2024.

arXiv:2405.13528 [pdf, other]

ElastiBench: Scalable Continuous Benchmarking on Cloud FaaS Platforms

Authors: Trever Schirmer, Tobias Pfandzelter, David Bermbach

Abstract: Running microbenchmark suites often and early in the development process enables developers to identify performance issues in their application. Microbenchmark suites of complex applications can comprise hundreds of individual benchmarks and take multiple hours to evaluate meaningfully, making running those benchmarks as part of CI/CD pipelines infeasible. In this paper, we reduce the total execution time of microbenchmark suites by leveraging the massive scalability and elasticity of FaaS (Function-as-a-Service) platforms. While using FaaS enables users to quickly scale up to thousands of parallel function instances to speed up microbenchmarking, the performance variation and low control over the underlying computing resources complicate reliable benchmarking. We demonstrate an architecture for executing microbenchmark suites on cloud FaaS platforms and evaluate it on code changes from an open-source time series database. Our evaluation shows that our prototype can produce reliable results (~95% of performance changes accurately detected) in a quarter of the time (<=15min vs. ~4h) and at lower cost ($0.49 vs. ~$1.18) compared to cloud-based virtual machines.

Submitted 22 May, 2024; originally announced May 2024.

arXiv:2403.16869 [pdf, other]

Lessons Learned from Building Edge Software System Testbeds

Authors: Tobias Pfandzelter, David Bermbach

Abstract: Edge computing requires the complex software interaction of geo-distributed, heterogeneous components. The growing research and industry interest in edge computing software systems has necessitated exploring ways of testing and evaluating edge software at scale without relying on physical infrastructure. Beyond simulation, virtual testbeds that emulate edge infrastructure can provide a cost-efficient yet realistic environment to evaluate edge software. In this experience paper, we share lessons learned from building a total of five edge software testbeds. We describe pitfalls in architecture and development as well as experiences from having students use our testbed tooling in distributed systems prototyping classes. While we remain confident that building custom testbed tooling is the right approach for edge computing researchers and practitioners alike, we hope this paper allows others to avoid common mistakes and benefit from our experience.

Submitted 25 March, 2024; originally announced March 2024.

arXiv:2403.00515 [pdf, other]

Are Unikernels Ready for Serverless on the Edge?

Authors: Felix Moebius, Tobias Pfandzelter, David Bermbach

Abstract: Function-as-a-Service (FaaS) is a promising edge computing execution model but requires secure sandboxing mechanisms to isolate workloads from multiple tenants on constrained infrastructure. Although Docker containers are lightweight and popular in open-source FaaS platforms, they are generally considered insufficient for executing untrusted code and providing sandbox isolation. Commercial cloud FaaS platforms thus rely on Linux microVMs or hardened container runtimes, which are secure but come with a higher resource footprint. Unikernels combine application code and limited operating system primitives into a single-purpose appliance, reducing the footprint of an application and its sandbox while providing full Linux compatibility. In this paper, we study the suitability of unikernels as an edge FaaS execution environment using the Nanos and OSv unikernel tool chains. We compare performance along several metrics such as cold start overhead and idle footprint against sandboxes such as Firecracker Linux microVMs, Docker containers, and secure gVisor containers. We find that unikernels exhibit desirable cold start performance, yet lag behind Linux microVMs in stability. Nevertheless, we show that unikernels are a promising candidate for further research on Linux-compatible FaaS isolation.

Submitted 1 March, 2024; originally announced March 2024.

arXiv:2311.09745 [pdf, other]

Application-Centric Benchmarking of Distributed FaaS Platforms using BeFaaS

Authors: Martin Grambow, Tobias Pfandzelter, David Bermbach

Abstract: Due to the popularity of the FaaS programming model, there is now a wide variety of commercial and open-source FaaS systems. Hence, for comparison of different FaaS systems and their configuration options, FaaS application developers rely on FaaS benchmarking frameworks. Existing frameworks, however, tend to evaluate only single isolated aspects; a more holistic application-centric benchmarking framework is still missing. In previous work, we proposed BeFaaS, an extensible application-centric benchmarking framework for FaaS environments that focuses on the evaluation of FaaS platforms through realistic and typical examples of FaaS applications. In this extended paper, we (i) enhance our benchmarking framework with additional features for distributed FaaS setups, (ii) design application benchmarks reflecting typical FaaS use cases, and (iii) use them to run extensive experiments with commercial cloud FaaS platforms (AWS Lambda, Azure Functions, Google Cloud Functions) and the tinyFaaS edge serverless platform. BeFaaS now includes four FaaS application-centric benchmarks, is extensible for additional workload profiles and platforms, and supports federated benchmark runs in which the benchmark application is distributed over multiple FaaS systems while collecting fine-grained measurement results for drill-down analysis.
Our experiment results show that (i) network transmission is a major contributor to response latency for function chains, (ii) this effect is exacerbated in hybrid edge-cloud deployments, (iii) the trigger delay between a published event and the start of the triggered function ranges from about 100ms for AWS Lambda to 800ms for Google Cloud Functions, and (iv) Azure Functions shows the best cold start behavior for our workloads.

Submitted 26 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

Comments: arXiv admin note: substantial text overlap with arXiv:2102.12770

arXiv:2311.04875 [pdf, other]

Fusionize++: Improving Serverless Application Performance Using Dynamic Task Inlining and Infrastructure Optimization

Authors: Trever Schirmer, Joel Scheuner, Tobias Pfandzelter, David Bermbach

Abstract: The Function-as-a-Service (FaaS) execution model increases developer productivity by removing operational concerns such as managing hardware or software runtimes. Developers, however, still need to partition their applications into FaaS functions, which is error-prone and complex: Encapsulating only the smallest logical unit of an application as a FaaS function maximizes flexibility and reusability. Yet, it also leads to invocation overheads, additional cold starts, and may increase cost due to double billing during synchronous invocations. Conversely, deploying an entire application as a single FaaS function avoids these overheads but decreases flexibility. In this paper we present Fusionize, a framework that automates optimizing for this trade-off by automatically fusing application code into an optimized multi-function composition. Developers only need to write fine-grained application code following the serverless model, while Fusionize automatically fuses different parts of the application into FaaS functions, manages their interactions, and configures the underlying infrastructure. At runtime, it monitors application performance and adapts it to minimize request-response latency and costs. Real-world use cases show that Fusionize can improve the deployment artifacts of the application, reducing both median request-response latency and cost of an example IoT application by more than 35%.

Submitted 8 November, 2023; originally announced November 2023.

Comments: arXiv admin note: text overlap with arXiv:2204.11533

arXiv:2311.04108 [pdf, other]

doi 10.1145/3603166.3632128

The Early Microbenchmark Catches the Bug -- Studying Performance Issues Using Micro- and Application Benchmarks

Authors: Nils Japke, Christoph Witzko, Martin Grambow, David Bermbach

Abstract: An application's performance regressions can be detected by both application benchmarks and microbenchmarks. While application benchmarks stress the system under test by sending synthetic but realistic requests which, e.g., simulate real user traffic, microbenchmarks evaluate the performance on a subroutine level by calling the function under test repeatedly. In this paper, we use a testbed microservice application which includes three performance issues to study the detection capabilities of both approaches. In extensive benchmarking experiments, we increase the severity of each performance issue stepwise, run both an application benchmark and the microbenchmark suite, and check at which point each benchmark detects the performance issue. Our results show that microbenchmarks detect all three issues earlier, some even at the lowest severity level. Application benchmarks, however, raised false positive alarms, wrongly detected performance improvements, and detected the performance issues later.

Submitted 7 November, 2023; originally announced November 2023.

Comments: Accepted for publication in 2023 IEEE/ACM 16th International Conference on Utility and Cloud Computing

arXiv:2311.03889 [pdf, other]

doi 10.1145/3631295.3631395

Efficiently Detecting Performance Changes in FaaS Application Releases

Authors: Martin Grambow, Tim Dockenfuß, Trever Schirmer, Nils Japke, David Bermbach

Abstract: The source code of Function as a Service (FaaS) applications is constantly being refined. To detect if a source code change introduces a significant performance regression, the traditional benchmarking approach evaluates both the old and new function version separately using numerous artificial requests. In this paper, we describe a wrapper approach that enables the Randomized Multiple Interleaved Trials (RMIT) benchmark execution methodology in FaaS environments and use bootstrapping percentile intervals to derive more accurate confidence intervals of detected performance changes. We evaluate our approach using two public FaaS providers, an artificial performance issue, and several benchmark configuration parameters. We conclude that RMIT can shrink the width of confidence intervals in the results from 10.65% using the traditional approach to 0.37% using RMIT and thus enables a more fine-grained performance change detection.

Submitted 7 November, 2023; originally announced November 2023.

Comments: Accepted for publication in 9th International Workshop on Serverless Computing (WoSC '23), ACM 2023

arXiv:2310.11957 [pdf, other]

Supporting UAVs with Edge Computing: A Review of Opportunities and Challenges

Authors: Malte Janßen, Tobias Pfandzelter, Minghe Wang, David Bermbach

Abstract: Over the last years, Unmanned Aerial Vehicles (UAVs) have seen significant advancements in sensor capabilities and computational abilities, allowing for efficient autonomous navigation and visual tracking applications. However, the demand for computationally complex tasks has increased faster than advances in battery technology. This opens up possibilities for improvements using edge computing. In edge computing, edge servers can achieve lower latency responses compared to traditional cloud servers through strategic geographic deployments. Furthermore, these servers can maintain superior computational performance compared to UAVs, as they are not limited by battery constraints. Combining these technologies by aiding UAVs with edge servers, research finds measurable improvements in task completion speed, energy efficiency, and reliability across multiple applications and industries. This systematic literature review aims to analyze the current state of research and collect, select, and extract the key areas where UAV activities can be supported and improved through edge computing.

Submitted 18 October, 2023; originally announced October 2023.

Report number: MCC.2023.3

arXiv:2309.15471 [pdf, other]

doi 10.1145/3631295.3631393

ProFaaStinate: Delaying Serverless Function Calls to Optimize Platform Performance

Authors: Trever Schirmer, Valentin Carl, Tobias Pfandzelter, David Bermbach

Abstract: Function-as-a-Service (FaaS) enables developers to run serverless applications without managing operational tasks. In current FaaS platforms, both synchronous and asynchronous calls are executed immediately. In this paper, we present ProFaaStinate, which extends serverless platforms to enable delayed execution of asynchronous function calls. This allows platforms to execute calls at convenient times with higher resource availability or lower load. ProFaaStinate is able to optimize performance without requiring deep integration into the rest of the platform, or a complex systems model. In our evaluation, our prototype built on top of Nuclio can reduce request response latency and workflow duration while also preventing the system from being overloaded during load peaks. Using a document preparation use case, we show a 54% reduction in average request response latency. This reduction in resource usage benefits both platforms and users as cost savings.

Submitted 31 October, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

Comments: Accepted for publication in Proc. of 9th International Workshop on Serverless Computing (WoSC 23)

arXiv:2309.03584 [pdf, other]

doi 10.1145/3630180.3631203

Enoki: Stateful Distributed FaaS from Edge to Cloud

Authors: Tobias Pfandzelter, David Bermbach

Abstract: Function-as-a-Service (FaaS) is a promising paradigm for applications distributed across the edge-cloud continuum. FaaS functions are stateless by nature, leading to high elasticity and transparent invocation. Supporting stateful applications, however, requires integrating data storage in FaaS, which is not trivial in an edge-cloud environment. We propose Enoki, an architecture for stateful FaaS computing replicated across the edge-cloud continuum. Enoki integrates a replicated key-value store with single-node FaaS systems at edge and cloud nodes in order to provide low-latency local data access for functions without breaking the abstraction of the FaaS programming model. We evaluate Enoki with microbenchmarks on an open-source prototype and demonstrate building a stateful FaaS application with multiple functions distributed over edge and cloud.

Submitted 7 September, 2023; originally announced September 2023.

arXiv:2306.10321 [pdf, other]

On the Applicability of Network Coordinate Systems for Fog Computing

Authors: Simon Huber, Tobias Pfandzelter, David Bermbach

Abstract: Identifying the closest fog node is crucial for mobile clients to benefit from fog computing. Relying on geographical location alone is insufficient for this as it ignores real observed client access latency. In this paper, we analyze the performance of the Meridian and Vivaldi network coordinate systems in identifying nearest fog nodes. To that end, we simulate a dense fog environment with mobile clients. We find that while network coordinate systems really find fog nodes in close network proximity, a purely latency-oriented identification approach ignores the larger problem of balancing load across fog nodes.

Submitted 25 July, 2023; v1 submitted 17 June, 2023; originally announced June 2023.

Report number: MCC.2023.2

arXiv:2306.09756 [pdf, other]

doi 10.1145/3570361.3614239

Can Orbital Servers Provide Mars-Wide Edge Computing?

Authors: Tobias Pfandzelter, David Bermbach

Abstract: Human landing, exploration and settlement on Mars will require local compute resources at the Mars edge. Landing such resources on Mars is an expensive endeavor. Instead, in this paper we lay out how concepts from low-Earth orbit edge computing may be applied to Mars edge computing. This could lower launching costs of compute resources for Mars while also providing Mars-wide networking and compute coverage. We propose a possible Mars compute constellation, discuss applications, analyze feasibility, and raise research questions for future work.

Submitted 18 August, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

Comments: 1st ACM MobiCom Workshop on Satellite Networking and Computing (SatCom '23)

arXiv:2306.01595 [pdf, other]

Fog System Configuration Management with CRDTs

Authors: Nick Stender, Tobias Pfandzelter, David Bermbach

Abstract: Current fog systems rely on centralized and strongly consistent services for configuration management originally designed for cloud systems. In the geo-distributed fog, such systems can exhibit high communication latency or become unavailable in case of network partition. In this paper, we examine the drawbacks of strong consistency for fog configuration management and propose an alternative based on CRDTs. We prototypically implement our approach for the FReD fog data management platform. Early results show reductions of server response times of up to 50%.

Submitted 25 July, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

Report number: MCC.2023.1

arXiv:2306.00575 [pdf, other]

Predicting Temporal Aspects of Movement for Predictive Replication in Fog Environments

Authors: Emil Balitzki, Tobias Pfandzelter, David Bermbach

Abstract: To fully exploit the benefits of the fog environment, efficient management of data locality is crucial. Blind or reactive data replication falls short in harnessing the potential of fog computing, necessitating more advanced techniques for predicting where and when clients will connect. While spatial prediction has received considerable attention, temporal prediction remains understudied. Our paper addresses this gap by examining the advantages of incorporating temporal prediction into existing spatial prediction models. We also provide a comprehensive analysis of spatio-temporal prediction models, such as Deep Neural Networks and Markov models, in the context of predictive replication. We propose a novel model using Holt-Winter's Exponential Smoothing for temporal prediction, leveraging sequential and periodical user movement patterns. In a fog network simulation with real user trajectories, our model achieves a 15% reduction in excess data with a marginal 1% decrease in data availability.

Submitted 19 February, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

Report number: MCC.2023.4

arXiv:2305.13385 [pdf, other]

doi 10.1145/3603166.3632550

A Hybrid Communication Approach for Metadata Exchange in Geo-Distributed Fog Environments

Authors: Marvin Kruber, Tobias Pfandzelter, David Bermbach

Abstract: Metadata exchange is crucial for efficient geo-distributed fog computing. Existing solutions for metadata exchange overlook geo-awareness or lack adequate failure tolerance. We propose HFCS, a novel hybrid communication system that combines hierarchical and peer-to-peer elements, along with edge pools. HFCS utilizes a gossip protocol for dynamic metadata exchange. In simulation, we investigate the impact of node density and edge pool size on HFCS performance. We observe a performance improvement for clustered node distributions, aligning well with real-world scenarios. HFCS outperforms a hierarchical and a P2P approach in task fulfillment at a slight cost to failure detection.

Submitted 26 February, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

arXiv:2305.01712 [pdf, other]

A Crowdsensing Approach for Deriving Surface Quality of Cycling Infrastructure

Authors: Ahmet-Serdar Karakaya, Leonard Thomas, Denis Koljada, David Bermbach

Abstract: Cities worldwide are trying to increase the modal share of bicycle traffic to address traffic and carbon emission problems. Aside from safety, a key factor for this is the cycling comfort, including the surface quality of cycle paths. In this paper, we propose a novel edge-based crowdsensing method for analyzing the surface quality of bicycle paths using smartphone sensor data: Cyclists record their rides, which are preprocessed on their phones before being uploaded to a private cloud backend. There, additional analysis modules aggregate data from all available rides to derive surface quality information which can then be used for surface quality-aware routing and planning of infrastructure maintenance.

Submitted 2 August, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

Comments: Accepted for the 11th IEEE International Conference on Cloud Engineering (IC2E 2023)

arXiv:2304.09026 [pdf, other]

Towards a Benchmark for Fog Data Processing

Authors: Tobias Pfandzelter, David Bermbach

Abstract: Fog data processing systems provide key abstractions to manage data and event processing in the geo-distributed and heterogeneous fog environment. The lack of standardized benchmarks for such systems, however, hinders their development and deployment, as different approaches cannot be compared quantitatively. Existing cloud data benchmarks are inadequate for fog computing, as their focus on workload specification ignores the tight integration of application and infrastructure inherent in fog computing. In this paper, we outline an approach to a fog-native data processing benchmark that combines workload specifications with infrastructure specifications. This holistic approach allows researchers and engineers to quantify how a software approach performs for a given workload on given infrastructure. Further, by basing our benchmark in a realistic IoT sensor network scenario, we can combine paradigms such as low-latency event processing, machine learning inference, and offline data analytics, and analyze the performance impact of their interplay in a fog data processing system.

Submitted 25 July, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

Comments: 11th IEEE International Conference on Cloud Engineering (IC2E 2023), short paper

arXiv:2304.07177 [pdf, other]

doi 10.1145/3592533.3592808

The Night Shift: Understanding Performance Variability of Cloud Serverless Platforms

Authors: Trever Schirmer, Nils Japke, Sofia Greten, Tobias Pfandzelter, David Bermbach

Abstract: Function-as-a-Service is a popular cloud programming model that supports developers by abstracting away most operational concerns with automatic deployment and scaling of applications. Due to the high level of abstraction, developers rely on the cloud platform to offer a consistent service level, as decreased performance leads to higher latency and higher cost given the pay-per-use model. In this paper, we measure performance variability of Google Cloud Functions over multiple months. Our results show that diurnal patterns can lead to performance differences of up to 15%, and that the frequency of unexpected cold starts increases threefold during the start of the week. This behavior can negatively impact researchers that conduct performance studies on cloud platforms and practitioners that run cloud applications.

Submitted 14 April, 2023; originally announced April 2023.

Comments: Author's copy of paper accepted at the 1st Workshop on SErverless Systems, Applications and MEthodologies (SESAME '23)

arXiv:2303.07779 [pdf, other]

Lotus: Serverless In-Transit Data Processing for Edge-based Pub/Sub

Authors: Minghe Wang, Trever Schirmer, Tobias Pfandzelter, David Bermbach

Abstract: Publish-subscribe systems are a popular approach for edge-based IoT use cases: Heterogeneous, constrained edge devices can be integrated easily, with message routing logic offloaded to edge message brokers. Message processing, however, is still done on constrained edge devices. Complex content-based filtering, the transformation between data representations, or message extraction place a considerable load on these systems, and resulting superfluous message transfers strain the network. In this paper, we propose Lotus, adding in-transit data processing to an edge publish-subscribe middleware in order to offload basic message processing from edge devices to brokers. Specifically, we leverage the Function-as-a-Service paradigm, which offers support for efficient multi-tenancy, scale-to-zero, and real-time processing. With a proof-of-concept prototype of Lotus, we validate its feasibility and demonstrate how it can be used to offload sensor data transformation to the publish-subscribe messaging middleware.

Submitted 14 March, 2023; originally announced March 2023.

arXiv:2303.05256 [pdf, other]

doi 10.1002/spe.3237

Managing Data Replication and Distribution in the Fog with FReD

Authors: Tobias Pfandzelter, Nils Japke, Trever Schirmer, Jonathan Hasenburg, David Bermbach

Abstract: The heterogeneous, geographically distributed infrastructure of fog computing poses challenges in data replication, data distribution, and data mobility for fog applications. Fog computing is still missing the necessary abstractions to manage application data, and fog application developers need to re-implement data management for every new piece of software. Proposed solutions are limited to certain application domains, such as the IoT, are not flexible in regard to network topology, or do not provide the means for applications to control the movement of their data. In this paper, we present FReD, a data replication middleware for the fog. FReD serves as a building block for configurable fog data distribution and enables low-latency, high-bandwidth, and privacy-sensitive applications. FReD is a common data access interface across heterogeneous infrastructure and network topologies, provides transparent and controllable data distribution, and can be integrated with applications from different domains. To evaluate our approach, we present a prototype implementation of FReD and show the benefits of developing with FReD using three case studies of fog computing applications.

Submitted 11 July, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

arXiv:2302.08952 [pdf, other]

doi 10.1145/3614204.3616106

Edge Computing in Low-Earth Orbit -- What Could Possibly Go Wrong?

Authors: Tobias Pfandzelter, David Bermbach

Abstract: Large low-Earth orbit (LEO) satellite networks are being built to provide low-latency broadband Internet access to a global subscriber base. In addition to network transmissions, researchers have proposed embedding compute resources in satellites to support LEO edge computing. To make software systems ready for the LEO edge, they need to be adapted for its unique execution environment, e.g., to support handovers in face of satellite mobility. So far, research around LEO edge software systems has focused on the predictable behavior of satellite networks, such as orbital movements. Additionally, we must also consider failure patterns, e.g., effects of radiation on compute hardware in space. In this paper, we present a taxonomy of failures that may occur in LEO edge computing and how they could affect software systems. From there, we derive considerations for LEO edge software systems and lay out avenues for future work.

Submitted 14 August, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

Comments: 1st Workshop on Low Earth Orbit Networking and Communication (LEO-NET '23)

arXiv:2212.09515 [pdf, other]

doi 10.1109/TCC.2022.3217947

Using Microbenchmark Suites to Detect Application Performance Changes

Authors: Martin Grambow, Denis Kovalev, Christoph Laaber, Philipp Leitner, David Bermbach

Abstract: Software performance changes are costly and often hard to detect pre-release. Similar to software testing frameworks, either application benchmarks or microbenchmarks can be integrated into quality assurance pipelines to detect performance changes before releasing a new application version. Unfortunately, extensive benchmarking studies usually take several hours which is problematic when examining dozens of daily code changes in detail; hence, trade-offs have to be made. Optimized microbenchmark suites, which only include a small subset of the full suite, are a potential solution for this problem, given that they still reliably detect the majority of the application performance changes such as an increased request latency. It is, however, unclear whether microbenchmarks and application benchmarks detect the same performance problems and one can be a proxy for the other. In this paper, we explore whether microbenchmark suites can detect the same application performance changes as an application benchmark. For this, we run extensive benchmark experiments with both the complete and the optimized microbenchmark suites of the two time-series database systems InfluxDB and VictoriaMetrics and compare their results to the results of corresponding application benchmarks. We do this for 70 and 110 commits, respectively. Our results show that it is possible to detect application performance changes using an optimized microbenchmark suite if frequent false-positive alarms can be tolerated.

Submitted 19 December, 2022; originally announced December 2022.

Comments: Accepted for publication in IEEE Transactions on Cloud Computing

arXiv:2208.05862 [pdf, other]

Network Emulation in Large-Scale Virtual Edge Testbeds: A Note of Caution and the Way Forward

Authors: Soeren Becker, Tobias Pfandzelter, Nils Japke, David Bermbach, Odej Kao

Abstract: The growing research and industry interest in the Internet of Things and the edge computing paradigm has increased the need for cost-efficient virtual testbeds for large-scale distributed applications. Researchers, students, and practitioners need to test and evaluate the interplay of hundreds or thousands of real software components and services connected with a realistic edge network without access to physical infrastructure. While advances in virtualization technologies have enabled parts of this, network emulation as a crucial part in the development of edge testbeds is lagging behind: As we show in this paper, NetEm, the current state-of-the-art network emulation tooling included in the Linux kernel, imposes prohibitive scalability limits. We quantify these limits, investigate possible causes, and present a way forward for network emulation in large-scale virtual edge testbeds based on eBPFs.

Submitted 11 August, 2022; originally announced August 2022.

Comments: Accepted for 2nd International Workshop on Testing Distributed Internet of Things Systems (TDIS 2022)

arXiv:2205.04538 [pdf, other]

doi 10.1109/MedComNet55087.2022.9810439

A Realistic Cyclist Model for SUMO Based on the SimRa Dataset

Authors: Ahmet-Serdar Karakaya, Konstantin Köhler, Julian Heinovski, Falko Dressler, David Bermbach

Abstract: Increasing the modal share of bicycle traffic to reduce carbon emissions, reduce urban car traffic, and to improve the health of citizens, requires a shift away from car-centric city planning. For this, traffic planners often rely on simulation tools such as SUMO which allow them to study the effects of construction changes before implementing them. Similarly, studies of vulnerable road users, here cyclists, also use such models to assess the performance of communication-based road traffic safety systems. The cyclist model in SUMO, however, is very imprecise as SUMO cyclists behave either like slow cars or fast pedestrians, thus, casting doubt on simulation results for bicycle traffic. In this paper, we analyze acceleration, velocity, and intersection left-turn behavior of cyclists in a large dataset of real world cycle tracks. We use the results to derive an improved cyclist model and implement it in SUMO.

Submitted 5 May, 2022; originally announced May 2022.

Comments: Accepted for the 20th Mediterranean Communication and Computer Networking Conference (MedComNet 2022)

Journal ref: 20th IEEE Mediterranean Communication and Computer Networking Conference (MedComNet 2022), Paphos, Cyprus, June 2022, pp. 166-173

arXiv:2204.11533 [pdf, other]

Fusionize: Improving Serverless Application Performance through Feedback-Driven Function Fusion

Authors: Trever Schirmer, Joel Scheuner, Tobias Pfandzelter, David Bermbach

Abstract: Serverless computing increases developer productivity by removing operational concerns such as managing hardware or software runtimes. Developers, however, still need to partition their application into functions, which can be error-prone and adds complexity: Using a small function size where only the smallest logical unit of an application is inside a function maximizes flexibility and reusability. Yet, having small functions leads to invocation overheads, additional cold starts, and may increase cost due to busy waiting. In this paper we present Fusionize, a framework that removes these concerns from developers by automatically fusing the application code into a multi-function orchestration with varying function size. Developers only need to write the application code following a lightweight programming model and do not need to worry how the application is turned into functions. Our framework automatically fuses different parts of the application into functions and manages their interactions. Leveraging monitoring data, the framework optimizes the distribution of application parts to functions to optimize deployment goals such as end-to-end latency and cost. Using two example applications, we show that Fusionize can automatically and iteratively improve the deployment artifacts of the application.

Submitted 15 August, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

Comments: Accepted for publication in Proc. of IEEE International Conference on Cloud Engineering 2022 (IC2E'22)

arXiv:2204.11509 [pdf, other]

Streaming vs. Functions: A Cost Perspective on Cloud Event Processing

Authors: Tobias Pfandzelter, Sören Henning, Trever Schirmer, Wilhelm Hasselbring, David Bermbach

Abstract: In cloud event processing, data generated at the edge is processed in real-time by cloud resources. Both distributed stream processing (DSP) and Function-as-a-Service (FaaS) have been proposed to implement such event processing applications. FaaS emphasizes fast development and easy operation, while DSP emphasizes efficient handling of large data volumes. Despite their architectural differences, both can be used to model and implement loosely-coupled job graphs. In this paper, we consider the selection of FaaS and DSP from a cost perspective. We implement stateless and stateful workflows from the Theodolite benchmarking suite using cloud FaaS and DSP. In an extensive evaluation, we show how application type, cloud service provider, and runtime environment can influence the cost of application deployments and derive decision guidelines for cloud engineers.

Submitted 12 August, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

Comments: Accepted for Publication at the 10th IEEE International Conference on Cloud Engineering (IC2E 2022)

arXiv:2204.10416 [pdf, other]

doi 10.1016/j.pmcj.2023.101779

CycleSense: Detecting Near Miss Incidents in Bicycle Traffic from Mobile Motion Sensors

Authors: Ahmet-Serdar Karakaya, Thomas Ritter, Felix Biessmann, David Bermbach

Abstract: In cities worldwide, cars cause health and traffic problems which could be partly mitigated through an increased modal share of bicycles. Many people, however, avoid cycling due to a lack of perceived safety. For city planners, addressing this is hard as they lack insights into where cyclists feel safe and where they do not. To gain such insights, we have in previous work proposed the crowdsourcing platform SimRa, which allows cyclists to record their rides and report near miss incidents via a smartphone app. In this paper, we present CycleSense, a combination of signal processing and Machine Learning techniques, which partially automates the detection of near miss incidents, thus making the reporting of near miss incidents easier. Using the SimRa data set, we evaluate CycleSense by comparing it to a baseline method used by SimRa and show that it significantly improves incident detection.

Submitted 14 March, 2023; v1 submitted 21 April, 2022; originally announced April 2022.

arXiv:2204.06282 [pdf, other]

doi 10.1145/3528535.3531517

Celestial: Virtual Software System Testbeds for the LEO Edge

Authors: Tobias Pfandzelter, David Bermbach

Abstract: As private space companies such as SpaceX and Telesat are building large LEO satellite constellations to provide global broadband Internet access, researchers have proposed to embed compute services within satellite constellations to provide computing services on the LEO edge. While the LEO edge is merely theoretical at the moment, providers are expected to rapidly develop their satellite technologies to keep the upper hand in the new space race. In this paper, we answer the question of how researchers can explore the possibilities of LEO edge computing and evaluate arbitrary software systems in an accurate runtime environment and with cost-efficient scalability. To that end, we present Celestial, a virtual testbed for the LEO edge based on microVMs. Celestial can efficiently emulate individual satellites and their movement as well as ground station servers with realistic network conditions and in an application-agnostic manner, which we show empirically. Additionally, we explore opportunities and implications of deploying a real-time remote sensing application on LEO edge infrastructure in a case study on Celestial.

Submitted 14 April, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

Comments: Accepted for publication at Middleware 2022

arXiv:2203.07934 [pdf, other]

Towards Distributed Coordination for Fog Platforms

Authors: Tobias Pfandzelter, Trever Schirmer, David Bermbach

Abstract: Distributed fog and edge applications communicate over unreliable networks and are subject to high communication delays. This makes using existing distributed coordination technologies from cloud applications infeasible, as they are built on the assumption of a highly reliable, low-latency datacenter network to achieve strict consistency with low overheads. To help implement configuration and state management for fog platforms and applications, we propose a novel decentralized approach that lets systems specify coordination strategies and membership for different sets of coordination data.

Submitted 15 March, 2022; originally announced March 2022.

Comments: Accepted for publication at the 22nd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGrid 2022) Poster Track

arXiv:2201.05872 [pdf, other]

QoS-Aware Resource Placement for LEO Satellite Edge Computing

Authors: Tobias Pfandzelter, David Bermbach

Abstract: With the advent of large LEO satellite communication networks to provide global broadband Internet access, interest in providing edge computing resources within LEO networks has emerged. The LEO Edge promises low-latency, high-bandwidth access to compute and storage resources for a global base of clients and IoT devices regardless of their geographical location. Current proposals assume compute resources or service replicas at every LEO satellite, which requires high upfront investments and can lead to over-provisioning. To implement and use the LEO Edge efficiently, methods for server and service placement are required that help select an optimal subset of satellites as server or service replica locations. In this paper, we show how the existing research on resource placement on a 2D torus can be applied to this problem by leveraging the unique topology of LEO satellite networks. Further, we extend the existing discrete resource placement methods to allow placement with QoS constraints. In simulation of proposed LEO satellite communication networks, we show how QoS depends on orbital parameters and that our proposed method can take these effects into account where the existing approach cannot.

Submitted 25 February, 2022; v1 submitted 15 January, 2022; originally announced January 2022.

Comments: 6th IEEE International Conference on Fog and Edge Computing (ICFEC 2022)

arXiv:2201.03462 [pdf, other]

Towards Grassroots Peering at the Edge

Authors: David Bermbach, Sergio Lucia, Vlado Handziski, Adam Wolisz

Abstract: Fog Computing allows applications to address their latency and privacy requirements while coping with bandwidth limitations of Internet service providers (ISPs). Existing research on fog systems has so far mostly taken a very high-level view on the actual fog infrastructure. In this position paper, we identify and discuss the problem of having multiple ISPs in edge-to-edge communication. As a possible solution we propose that edge operators create direct edge-to-edge links in a grassroots fashion and discuss different implementation options. Based on this, we highlight some important open research challenges that result from this.

Submitted 20 October, 2021; originally announced January 2022.

Comments: accepted for publication in ACM M4IoT 2021

arXiv:2111.03395 [pdf, other]

doi 10.1145/3492323.3495595

Predictive Replica Placement for Mobile Users in Distributed Fog Data Stores with Client-Side Markov Models

Authors: Malte Bellmann, Tobias Pfandzelter, David Bermbach

Abstract: Mobile clients that consume and produce data are abundant in fog environments and low latency access to this data can only be achieved by storing it in their close physical proximity. To adapt data replication in fog data stores in an efficient manner and make client data available at the fog node that is closest to the client, the systems need to predict both client movement and pauses in data consumption. In this paper, we present variations of Markov model algorithms that can run on clients to increase the data availability while minimizing excess data. In a simulation, we find the availability of data at the closest node can be improved by 35% without incurring the storage and communication overheads of global replication.

Submitted 5 November, 2021; originally announced November 2021.

Comments: Accepted for publication at 1st Workshop on Distributed Machine Learning for the Intelligent Computing Continuum (DML-ICC) (2021 IEEE/ACM 14th International Conference on Utility and Cloud Computing (UCC '21) Companion)

arXiv:2108.13222 [pdf, other]

doi 10.1002/spe.3058

AuctionWhisk: Using an Auction-Inspired Approach for Function Placement in Serverless Fog Platforms

Authors: David Bermbach, Jonathan Bader, Jonathan Hasenburg, Tobias Pfandzelter, Lauritz Thamsen

Abstract: The Function-as-a-Service (FaaS) paradigm has a lot of potential as a computing model for fog environments comprising both cloud and edge nodes, as compute requests can be scheduled across the entire fog continuum in a fine-grained manner. When the request rate exceeds capacity limits at the resource-constrained edge, some functions need to be offloaded towards the cloud. In this paper, we present an auction-inspired approach in which application developers bid on resources while fog nodes decide locally which functions to execute and which to offload in order to maximize revenue. Unlike many current approaches to function placement in the fog, our approach can work in an online and decentralized manner. We also present our proof-of-concept prototype AuctionWhisk that illustrates how such an approach can be implemented in a real FaaS platform. Through a number of simulation runs and system experiments, we show that revenue for overloaded nodes can be maximized without dropping function requests.

Submitted 23 November, 2021; v1 submitted 30 August, 2021; originally announced August 2021.

Comments: Wiley - Software: Practice and Experience

arXiv:2108.10628 [pdf, other]

Towards Predictive Replica Placement for Distributed Data Stores in Fog Environments

Authors: Tobias Pfandzelter, David Bermbach

Abstract: Mobile clients that consume and produce data are abundant in fog environments. Low latency access to this data can only be achieved by storing it in close physical proximity to the clients. Current data store systems fall short as they do not replicate data based on client movement. We propose an approach to predictive replica placement that autonomously and proactively replicates data close to likely client locations.

Submitted 24 August, 2021; originally announced August 2021.

Comments: accepted for publication at the 9th IEEE International Conference on Cloud Engineering (IC2E 2021)

arXiv:2108.08685 [pdf, other]

On the Future of Cloud Engineering

Authors: David Bermbach, Abhishek Chandra, Chandra Krintz, Aniruddha Gokhale, Aleksander Slominski, Lauritz Thamsen, Everton Cavalcante, Tian Guo, Ivona Brandic, Rich Wolski

Abstract: Ever since the commercial offerings of the Cloud started appearing in 2006, the landscape of cloud computing has been undergoing remarkable changes with the emergence of many different types of service offerings, developer productivity enhancement tools, and new application classes as well as the manifestation of cloud functionality closer to the user at the edge. The notion of utility computing, however, has remained constant throughout its evolution, which means that cloud users always seek to save costs of leasing cloud resources while maximizing their use. On the other hand, cloud providers try to maximize their profits while assuring service-level objectives of the cloud-hosted applications and keeping operational costs low. All these outcomes require systematic and sound cloud engineering principles. The aim of this paper is to highlight the importance of cloud engineering, survey the landscape of best practices in cloud engineering and its evolution, discuss many of the existing cloud engineering advances, and identify both the inherent technical challenges and research opportunities for the future of cloud computing in general and cloud engineering in particular.

Submitted 19 August, 2021; originally announced August 2021.

Comments: author copy/preprint of a paper published in the IEEE International Conference on Cloud Engineering (IC2E 2021)

arXiv:2104.02396 [pdf, other]

doi 10.1145/3434770.3459736

Towards a Computing Platform for the LEO Edge

Authors: Tobias Pfandzelter, Jonathan Hasenburg, David Bermbach

Abstract: The new space race is heating up as private companies such as SpaceX and Amazon are building large satellite constellations in low-earth orbit (LEO) to provide global broadband internet access. As the number of subscribers connected to this access network grows, it becomes necessary to investigate if and how edge computing concepts can be applied to LEO satellite networks. In this paper, we discuss the unique characteristics of the LEO edge and analyze the suitability of three organization paradigms for applications considering developer requirements. We conclude that the serverless approach is the most promising solution, opening up the field for future research.

Submitted 6 April, 2021; originally announced April 2021.

Comments: Accepted for Publication at 4th International Workshop on Edge Systems, Analytics and Networking (EdgeSys '21), April 26, 2021, Online, United Kingdom

arXiv:2102.12770 [pdf, other]

BeFaaS: An Application-Centric Benchmarking Framework for FaaS Platforms

Authors: Martin Grambow, Tobias Pfandzelter, Luk Burchard, Carsten Schubert, Max Zhao, David Bermbach

Abstract: Following the increasing interest and adoption of FaaS systems, benchmarking frameworks for determining non-functional properties have also emerged. While existing (microbenchmark) frameworks only evaluate single aspects of FaaS platforms, a more holistic, application-driven approach is still missing. In this paper, we design and present BeFaaS, an extensible application-centric benchmarking framework for FaaS environments that focuses on the evaluation of FaaS platforms through realistic and typical examples of FaaS applications. BeFaaS includes a built-in e-commerce benchmark, is extensible for new workload profiles and new platforms, supports federated benchmark runs in which the benchmark application is distributed over multiple providers, and supports a fine-grained result analysis. Our evaluation compares three major FaaS providers in single cloud provider setups and shows that BeFaaS is capable of running each benchmark automatically with minimal configuration effort and providing detailed insights for each interaction.

Submitted 1 November, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

Comments: Accepted for publication in Proc. of IEEE International Conference on Cloud Engineering 2021 (IC2E'21)

arXiv:2012.08979 [pdf, other]

Edge (of the Earth) Replication: Optimizing Content Delivery in Large LEO Satellite Communication Networks

Authors: Tobias Pfandzelter, David Bermbach

Abstract: Large low earth orbit (LEO) satellite networks such as SpaceX's Starlink constellation promise to deliver low-latency, high-bandwidth Internet access with global coverage. As an alternative to terrestrial fiber as a global Internet backbone, they could potentially serve billions of Internet-connected devices. Currently, operators of CDNs exploit the hierarchical topology of the Internet to place points-of-presence near users, yet this approach is no longer possible when the topology changes to a single, wide-area, converged access and backhaul network. In this paper, we explore the opportunities of points-of-presence for CDNs within the satellite network itself, as it could provide better access latency for users while reducing operational costs for the satellite Internet service providers. We propose four strategies for selecting points-of-presence in satellite constellations that we evaluate through extensive simulation. In one case, we find that replicating web content within satellites can reduce bandwidth use in the constellation by 93% over an approach without replication in the network, while storing just 0.01% of all content in individual satellites.

Submitted 4 March, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

Comments: Accepted for the 21st IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGrid '21)

arXiv:2009.10579 [pdf, other]

doi 10.1109/TCC.2021.3074988

MockFog 2.0: Automated Execution of Fog Application Experiments in the Cloud

Authors: Jonathan Hasenburg, Martin Grambow, David Bermbach

Abstract: Fog computing is an emerging computing paradigm that uses processing and storage capabilities located at the edge, in the cloud, and possibly in between. Testing and benchmarking fog applications, however, is hard since runtime infrastructure will typically be in use or may not exist yet. While approaches for the emulation of infrastructure testbeds do exist, their focus is typically the emulation of edge devices. Other approaches also emulate infrastructure within the core network or the cloud, but they lack support for automated experiment orchestration. In this paper, we propose to evaluate fog applications on an emulated infrastructure testbed created in the cloud which can be manipulated based on a pre-defined orchestration schedule. Developers can freely design the infrastructure, configure performance characteristics, manage application components, and orchestrate their experiments. We also present our proof-of-concept implementation MockFog 2.0. We use MockFog 2.0 to evaluate a fog-based smart factory application and showcase how its features can be used to study the impact of infrastructure changes and workload variations. With these experiments, we also show that MockFog can achieve good experiment reproducibility, even in a public cloud environment.

Submitted 11 May, 2021; v1 submitted 22 September, 2020; originally announced September 2020.

Comments: Accepted for publication in IEEE Transactions on Cloud Computing

arXiv:2008.07891 [pdf, other]

doi 10.1002/spe.3003

From Zero to Fog: Efficient Engineering of Fog-Based Internet of Things Applications

Authors: Tobias Pfandzelter, Jonathan Hasenburg, David Bermbach

Abstract: In IoT data processing, cloud computing alone does not suffice due to latency constraints, bandwidth limitations, and privacy concerns. By introducing intermediary nodes closer to the edge of the network that offer compute services in proximity to IoT devices, fog computing can reduce network strain and high access latency to application services. While this is the only viable approach to enable efficient IoT applications, the issue of component placement among cloud and intermediary nodes in the fog adds a new dimension to system design. State-of-the-art solutions to this issue rely on either simulation or solving a formalized assignment problem through heuristics, which are both inaccurate and fail to scale with a solution space that grows exponentially. In this paper, we present a three-step process for designing practical fog-based IoT applications that uses best practices, simulation, and testbed analysis to converge towards an efficient system architecture. We then apply this process in a smart factory case study. By deploying filtered options to a physical testbed, we show that each step of our process converges towards more efficient application designs.

Submitted 2 July, 2021; v1 submitted 18 August, 2020; originally announced August 2020.

Comments: This work has been published in Wiley - Software: Practice and Experience

Journal ref: Software: Practice and Experience, vol. 51, no. 8, pp. 1798-1821, Aug. 2021

arXiv:2006.08481 [pdf, other]

doi 10.1016/j.pmcj.2020.101197

SimRa: Using Crowdsourcing to Identify Near Miss Hotspots in Bicycle Traffic

Authors: Ahmet-Serdar Karakaya, Jonathan Hasenburg, David Bermbach

Abstract: An increased modal share of bicycle traffic is a key mechanism to reduce emissions and solve traffic-related problems. However, a lack of (perceived) safety keeps people from using their bikes more frequently. To improve safety in bicycle traffic, city planners need an overview of accidents, near miss incidents, and bike routes. Such information, however, is currently not available. In this paper, we describe SimRa, a platform for collecting data on bicycle routes and near miss incidents using smartphone-based crowdsourcing. We also describe how we identify dangerous near miss hotspots based on the collected data and propose a scoring model.

Submitted 1 July, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

Comments: Accepted for publication in Elsevier Pervasive and Mobile Computing

arXiv:2004.11725 [pdf, other]

A Survey on Edge Performance Benchmarking

Authors: Blesson Varghese, Nan Wang, David Bermbach, Cheol-Ho Hong, Eyal de Lara, Weisong Shi, Christopher Stewart

Abstract: Edge computing is the next Internet frontier that will leverage computing resources located near users, sensors, and data stores to provide more responsive services. Therefore, it is envisioned that a large-scale, geographically dispersed, and resource-rich distributed system will emerge and play a key role in the future Internet. However, given the loosely coupled nature of such complex systems, their operational conditions are expected to change significantly over time. In this context, the performance characteristics of such systems will need to be captured rapidly, which is referred to as performance benchmarking, for application deployment, resource orchestration, and adaptive decision-making. Edge performance benchmarking is a nascent research avenue that has started gaining momentum over the past five years. This article first reviews articles published over the past three decades to trace the history of performance benchmarking from tightly coupled to loosely coupled systems. It then systematically classifies previous research to identify the system under test, techniques analyzed, and benchmark runtime in edge performance benchmarking.

Submitted 16 December, 2020; v1 submitted 24 April, 2020; originally announced April 2020.

Comments: Accepted by ACM Computing Surveys, 16 December 2020

arXiv:2001.01603 [pdf, other]

doi 10.1016/j.comcom.2020.01.015

GeoBroker: Leveraging Geo-Contexts for IoT Data Distribution

Authors: Jonathan Hasenburg, David Bermbach

Abstract: In the Internet of Things, the relevance of data often depends on the geographic context of data producers and consumers. Today's data distribution services, however, mostly focus on data content and not on geo-context, which could help to reduce the dissemination of excess data in many IoT scenarios. In this paper, we propose to use the geo-context information associated with devices to control data distribution. We define what geo-context dimensions exist and compare our definition with concepts from related work. Furthermore, we designed GeoBroker, a data distribution service that uses the location of things, as well as geofences for messages and subscriptions, to control data distribution. This way, we enable new IoT application scenarios while also increasing overall system efficiency for scenarios where geo-contexts matter by delivering only relevant messages. We evaluate our approach based on a proof-of-concept prototype and several experiments.

Submitted 20 January, 2020; v1 submitted 6 January, 2020; originally announced January 2020.

Comments: Accepted for publication in Elsevier Computer Communications

Journal ref: Computer Communications 151 (2020) 473-484

arXiv:1912.06096 [pdf, other]

Towards Auction-Based Function Placement in Serverless Fog Platforms

Authors: David Bermbach, Setareh Maghsudi, Jonathan Hasenburg, Tobias Pfandzelter

Abstract: The Function-as-a-Service (FaaS) paradigm has a lot of potential as a computing model for fog environments comprising both cloud and edge nodes. When the request rate exceeds capacity limits at the edge, some functions need to be offloaded from the edge towards the cloud. In this position paper, we propose an auction-based approach in which application developers bid on resources. This allows fog nodes to make a local decision about which functions to offload while maximizing revenue. For a first evaluation of our approach, we use simulation.

Submitted 4 March, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

Comments: preprint, authors' camera-ready version of a conference paper accepted in the 2020 IEEE International Conference on Fog Computing (ICFC 2020)

arXiv:1912.03107 [pdf, other]

FBase: A Replication Service for Data-Intensive Fog Applications

Authors: Jonathan Hasenburg, Martin Grambow, David Bermbach

Abstract: The combination of edge and cloud in the fog computing paradigm enables a new breed of data-intensive applications. These applications, however, have to face a number of fog-specific challenges which developers have to repetitively address for every single application. In this paper, we propose a replication service specifically tailored to the needs of data-intensive fog applications that aims to ease or eliminate challenges caused by the highly distributed and heterogeneous environment fog applications operate in. Furthermore, we present our prototypical proof-of-concept implementation FBase that we have made available as open source.

Submitted 29 January, 2020; v1 submitted 6 December, 2019; originally announced December 2019.

Comments: Technical Report; extended version

Report number: MCC.2019.1

arXiv:1910.04032 [pdf]

doi 10.1109/MIC.2020.2979161

Fog Computing as Privacy Enabler

Authors: Frank Pallas, Philip Raschke, David Bermbach

Abstract: Despite broad discussions on privacy challenges arising from fog computing, the authors argue that privacy and security requirements might actually drive the adoption of fog computing. They present four patterns of fog computing fostering data privacy and the security of business secrets, complementing existing cryptographic approaches. Their practical application is illuminated on the basis of three case studies.

Submitted 13 March, 2020; v1 submitted 9 October, 2019; originally announced October 2019.

Comments: Preprint, accepted for publication in IEEE Internet Computing. This is the authors' own version before final copy-editing by IEEE

arXiv:1903.07712 [pdf, other]

Benchmarking Web API Quality -- Revisited

Authors: David Bermbach, Erik Wittern

Abstract: Modern applications increasingly interact with web APIs -- reusable components, deployed and operated outside the application, and accessed over the network. Their existence, arguably, spurs application innovations, making it easy to integrate data or functionalities. While previous work has analyzed the ecosystem of web APIs and their design, little is known about web API quality at runtime. This gap is critical, as qualities including availability, latency, or provider security preferences can severely impact applications and user experience. In this paper, we revisit a 3-month, geo-distributed benchmark of popular web APIs, originally performed in 2015. We repeat this benchmark in 2018 and compare results from these two benchmarks regarding availability and latency. We furthermore introduce new results from assessing provider security preferences, collected both in 2015 and 2018, and results from our attempts to reach out to API providers with the results from our 2015 experiments. Our extensive experiments show that web API qualities vary 1.) based on the geo-distribution of clients, 2.) during our individual experiments, and 3.) between the two experiments. Our findings provide evidence to foster the discussion around web API quality, and can act as a basis for the creation of tools and approaches to mitigate quality issues.

Submitted 3 July, 2020; v1 submitted 18 March, 2019; originally announced March 2019.

Comments: Accepted for publication in the Rivers Journal of Web Engineering. The paper is text-wise identical to the camera-ready version but uses a different template
