-
A Learning-Based Caching Mechanism for Edge Content Delivery
Authors:
Hoda Torabi,
Hamzeh Khazaei,
Marin Litoiu
Abstract:
With the advent of 5G networks and the rise of the Internet of Things (IoT), Content Delivery Networks (CDNs) are increasingly extending into the network edge. This shift introduces unique challenges, particularly due to the limited cache storage and the diverse request patterns at the edge. These edge environments can host traffic classes characterized by varied object-size distributions and object-access patterns. Such complexity makes it difficult for traditional caching strategies, which often rely on metrics like request frequency or time intervals, to be effective. Despite these complexities, the optimization of edge caching is crucial. Improved byte hit rates at the edge not only alleviate the load on the network backbone but also minimize operational costs and expedite content delivery to end-users.
In this paper, we introduce HR-Cache, a comprehensive learning-based caching framework grounded in the principles of Hazard Rate (HR) ordering, a rule originally formulated to compute an upper bound on cache performance. HR-Cache leverages this rule to guide future object eviction decisions. It employs a lightweight machine learning model to learn from caching decisions made based on HR ordering, subsequently predicting the "cache-friendliness" of incoming requests. Objects deemed "cache-averse" are placed into cache as priority candidates for eviction. Through extensive experimentation, we demonstrate that HR-Cache not only consistently enhances byte hit rates compared to existing state-of-the-art methods but also achieves this with minimal prediction overhead.
Our experimental results, using three real-world traces and one synthetic trace, indicate that HR-Cache consistently achieves 2.2-14.6% greater WAN traffic savings than LRU. It outperforms not only heuristic caching strategies but also the state-of-the-art learning-based algorithm.
Submitted 3 April, 2024; v1 submitted 5 February, 2024;
originally announced February 2024.
-
A Market Mechanism for Trading Flexibility Between Interconnected Electricity Markets
Authors:
Hossein Khazaei,
Ceyhun Eksin,
Roohallah Khatami,
Alfredo Garcia
Abstract:
Electricity markets differ in their ability to meet power imbalances on short notice in a controlled fashion. Relatively flexible markets have the ability to ramp up (or down) power flows across interties without compromising their ability to reliably meet internal demand. In this paper, a market mechanism to enable flexibility trading amongst market operators is introduced. In the proposed market mechanism, market operators exchange information regarding optimal terms of trade (nodal prices and flows) along interconnection lines at every trading round. Equipped with this information, each market operator then independently solves its own internal chance-constrained economic dispatch problem and broadcasts the updated optimal terms of trade for flows across markets. We show the proposed decentralized market mechanism for flexibility trading converges to a Nash equilibrium of the intraday market coupling game, i.e. a combination of internal market clearing solutions (one for each participating market) and flows and prices along interconnection lines so that no individual market operator has an incentive to modify its own internal solution and/or the terms of trade along interties. For a specific class of chance constraints, we show that the limiting equilibrium outcome is efficient, i.e. it corresponds to the solution of the single market clearing problem for all participating markets. The proposed market mechanism is illustrated with an application to the three-area IEEE Reliability Test System.
Submitted 28 February, 2022;
originally announced February 2022.
-
Performance Modeling of Metric-Based Serverless Computing Platforms
Authors:
Nima Mahmoudi,
Hamzeh Khazaei
Abstract:
Analytical performance models are very effective in ensuring the quality of service and cost of service deployment remain desirable under different conditions and workloads. While various analytical performance models have been proposed for previous paradigms in cloud computing, serverless computing lacks such models that can provide developers with performance guarantees. Besides, most serverless computing platforms still require developers' input to specify the configuration for their deployment that could affect both the performance and cost of their deployment, without providing them with any direct and immediate feedback. In previous studies, we built such performance models for steady-state and transient analysis of scale-per-request serverless computing platforms (e.g., AWS Lambda, Azure Functions, Google Cloud Functions) that could give developers immediate feedback about the quality of service and cost of their deployments. In this work, we aim to develop analytical performance models for the latest trend in serverless computing platforms that use concurrency value and the rate of requests per second for autoscaling decisions. Examples of such serverless computing platforms are Knative and Google Cloud Run (a managed Knative service by Google). The proposed performance model can help developers and providers predict the performance and cost of deployments with different configurations, which could help them tune the configuration toward the best outcome. We validate the applicability and accuracy of the proposed performance model by extensive real-world experimentation on Knative and show that our performance model is able to accurately predict the steady-state characteristics of a given workload with a minimal amount of data collection.
Submitted 22 February, 2022;
originally announced February 2022.
-
MLProxy: SLA-Aware Reverse Proxy for Machine Learning Inference Serving on Serverless Computing Platforms
Authors:
Nima Mahmoudi,
Hamzeh Khazaei
Abstract:
Serving machine learning inference workloads on the cloud is still a challenging task on the production level. Optimal configuration of the inference workload to meet SLA requirements while optimizing the infrastructure costs is highly complicated due to the complex interaction between batch configuration, resource configurations, and variable arrival process. Serverless computing has emerged in recent years to automate most infrastructure management tasks. Workload batching has revealed the potential to improve the response time and cost-effectiveness of machine learning serving workloads. However, it has not yet been supported out of the box by serverless computing platforms. Our experiments have shown that for various machine learning workloads, batching can hugely improve the system's efficiency by reducing the processing overhead per request.
In this work, we present MLProxy, an adaptive reverse proxy to support efficient machine learning serving workloads on serverless computing systems. MLProxy supports adaptive batching to ensure SLA compliance while optimizing serverless costs. We performed rigorous experiments on Knative to demonstrate the effectiveness of MLProxy. We showed that MLProxy could reduce the cost of serverless deployment by up to 92% while reducing SLA violations by up to 99%, and that these gains can be generalized across state-of-the-art model serving frameworks.
Submitted 22 February, 2022;
originally announced February 2022.
-
Maximizing Social Welfare and Agreement via Information Design in Linear-Quadratic-Gaussian Games
Authors:
Furkan Sezer,
Hossein Khazaei,
Ceyhun Eksin
Abstract:
We consider linear-quadratic Gaussian (LQG) games in which players have quadratic payoffs that depend on the players' actions and an unknown payoff-relevant state, and signals on the state that follow a Gaussian distribution conditional on the state realization. An information designer decides the fidelity of information revealed to the players in order to maximize the social welfare of the players or reduce the disagreement among players' actions. Leveraging the semi-definiteness of the information design problem, we derive analytical solutions for these objectives under specific LQG games. We show that full information disclosure maximizes social welfare when there is a common payoff-relevant state, when there is strategic substitutability in the actions of players, or when the signals are public. Numerical results show that as strategic substitution increases, the value of the information disclosure increases. When the objective is to induce conformity among players' actions, hiding information is optimal. Lastly, we consider the information design objective that is a weighted combination of social welfare and cohesiveness of players' actions. We obtain an interval for the weights where full information disclosure is optimal under public signals for games with strategic substitutability. Numerical solutions show that the actual interval where full information disclosure is optimal gets close to the analytical interval obtained as substitution increases.
Submitted 26 February, 2023; v1 submitted 25 February, 2021;
originally announced February 2021.
-
SimFaaS: A Performance Simulator for Serverless Computing Platforms
Authors:
Nima Mahmoudi,
Hamzeh Khazaei
Abstract:
Developing accurate and extendable performance models for serverless platforms, aka Function-as-a-Service (FaaS) platforms, is a very challenging task. Also, implementation and experimentation on real serverless platforms is both costly and time-consuming. However, at the moment, there is no comprehensive simulation tool or framework to be used instead of the real platform. As a result, in this paper, we fill this gap by proposing a simulation platform, called SimFaaS, which assists serverless application developers to develop optimized Function-as-a-Service applications in terms of cost and performance. On the other hand, SimFaaS can be leveraged by FaaS providers to tailor their platforms to be workload-aware so that they can increase profit and quality of service at the same time. Also, serverless platform providers can evaluate new designs, implementations, and deployments on SimFaaS in a timely and cost-efficient manner.
SimFaaS is open-source, well-documented, and publicly available, making it easily usable and extendable to incorporate more use case scenarios in the future. Besides, it provides performance engineers with a set of tools that can calculate several characteristics of serverless platform internal states, which is otherwise hard (mostly impossible) to extract from real platforms. We show how SimFaaS facilitates the prediction of essential performance metrics such as average response time, probability of cold start, and the average number of instances reflecting the infrastructure cost incurred by the serverless computing provider. We evaluate the accuracy and applicability of SimFaaS by comparing the prediction results with real-world traces from Amazon AWS Lambda.
Submitted 17 February, 2021;
originally announced February 2021.
-
A Pub-Sub Architecture to Promote Blockchain Interoperability
Authors:
Sara Ghaemi,
Sara Rouhani,
Rafael Belchior,
Rui S. Cruz,
Hamzeh Khazaei,
Petr Musilek
Abstract:
The maturing of blockchain technology leads to heterogeneity, where multiple solutions specialize in a particular use case. While the development of different blockchain networks shows great potential for blockchains, the isolated networks have led to data and asset silos, limiting the applications of this technology. Blockchain interoperability solutions are essential to enable distributed ledgers to reach their full potential. Such solutions allow blockchains to support asset and data transfer, resulting in the development of innovative applications.
This paper proposes a novel blockchain interoperability solution for permissioned blockchains based on the publish/subscribe architecture. We implemented a prototype of this platform to show the feasibility of our design. We evaluate our solution by implementing examples of the different publisher and subscriber networks, such as Hyperledger Besu, which is an Ethereum client, and two different versions of Hyperledger Fabric. We present a performance analysis of the whole network that indicates its limits and bottlenecks. Finally, we discuss the extensibility and scalability of the platform in different scenarios. Our evaluation shows that our system can handle a throughput in the order of the hundreds of transactions per second.
Submitted 28 January, 2021;
originally announced January 2021.
-
Anonymizing Sensor Data on the Edge: A Representation Learning and Transformation Approach
Authors:
Omid Hajihassani,
Omid Ardakanian,
Hamzeh Khazaei
Abstract:
The abundance of data collected by sensors in Internet of Things (IoT) devices, and the success of deep neural networks in uncovering hidden patterns in time series data have led to mounting privacy concerns. This is because private and sensitive information can be potentially learned from sensor data by applications that have access to this data. In this paper, we aim to examine the tradeoff between utility and privacy loss by learning low-dimensional representations that are useful for data obfuscation. We propose deterministic and probabilistic transformations in the latent space of a variational autoencoder to synthesize time series data such that intrusive inferences are prevented while desired inferences can still be made with sufficient accuracy. In the deterministic case, we use a linear transformation to move the representation of input data in the latent space such that the reconstructed data is likely to have the same public attribute but a different private attribute than the original input data. In the probabilistic case, we apply the linear transformation to the latent representation of input data with some probability. We compare our technique with autoencoder-based anonymization techniques and additionally show that it can anonymize data in real time on resource-constrained edge devices.
Submitted 27 August, 2021; v1 submitted 16 November, 2020;
originally announced November 2020.
-
Stochastic Decision-Making Model for Aggregation of Residential Units with PV-Systems and Storages
Authors:
Hossein Khazaei,
Ramin Moslemi,
Ratnesh Sharma
Abstract:
Many residential energy consumers have installed photovoltaic (PV) panels and energy storage systems. These residential users can aggregate and participate in the energy markets. A stochastic decision-making model for an aggregation of these residential units for participation in two-settlement markets is proposed in this paper. Scenarios are generated using a Seasonal Autoregressive Integrated Moving Average (SARIMA) model and the joint probability distribution function of the forecast errors to model the uncertainties of the real-time prices, PV generation, and demand. The proposed scenario generation model treats forecast errors as random variables, which allows new information observed in the real-time market to be reflected in the scenario generation process without retraining SARIMA or re-fitting probability distribution functions over the forecast errors. This approach significantly improves the computational time of the proposed model. A simulation study is conducted for an aggregation of 6 residential units, and the results highlight the benefits of aggregation as well as of the proposed stochastic decision-making model.
Submitted 20 February, 2020;
originally announced February 2020.
-
On the Equilibria and Efficiency of Electricity Markets with Renewable Power Producers and Congestion Constraints
Authors:
Hossein Khazaei,
X. Andy Sun,
Yue Zhao
Abstract:
With increasing renewable penetration in power systems, a prominent challenge in efficient and reliable power system operation is handling the uncertainties inherent in the renewable generation. In this paper, we propose a simple two-settlement market mechanism in which renewable power producers (RPPs) participate, so that a) the independent system operator (ISO) does not need to consider the uncertainties of the renewables in its economic dispatch, and yet b) the market equilibrium is shown to approach social efficiency as if the ISO solves a stochastic optimization taking into account all the uncertainties. In showing this result, a key innovation is a new approach of efficiently computing the Nash equilibrium (NE) among the strategic RPPs in congestion-constrained power networks. In particular, the proposed approach decouples finding an NE into searching over congestion patterns and computing an NE candidate assuming a congestion pattern. As such, the computational complexity of finding an NE grows only cubically with the number of RPPs in the market. We demonstrate our results in the IEEE 14-bus system and show that the NE approaches social efficiency as the number of RPPs grows.
Submitted 29 November, 2019;
originally announced December 2019.
-
Executability of Python Snippets in Stack Overflow
Authors:
Md Monir Hossain,
Nima Mahmoudi,
Changyuan Lin,
Hamzeh Khazaei,
Abram Hindle
Abstract:
Online resources today contain an abundant amount of code snippets for documentation, collaboration, learning, and problem-solving purposes. Their executability in a "plug and play" manner enables us to confirm their quality and use them directly in projects. In practice, however, that is often not the case due to several requirements violations or incompleteness. Moreover, it is a difficult task to investigate the executability on a large scale due to different possible errors during the execution. We have developed a scalable framework to investigate this for SOTorrent Python snippets. We found that with minor adjustments, 27.92% of snippets are executable. The executability has not changed significantly over time. The code snippets referenced in GitHub are more likely to be directly executable. But executability does not significantly affect the chances of the answer being selected as the accepted answer. These properties help us understand and improve the interaction of users with online resources that include code snippets.
Submitted 10 July, 2019;
originally announced July 2019.
-
Performance Modeling of Microservice Platforms
Authors:
Hamzeh Khazaei,
Nima Mahmoudi,
Cornel Barna,
Marin Litoiu
Abstract:
Microservice architecture has transformed the way developers are building and deploying applications in today's cloud computing centers. This new approach provides increased scalability, flexibility, manageability, and performance while reducing the complexity of the whole software development life cycle. The increase in cloud resource utilization also benefits microservice providers. Various microservice platforms have emerged to facilitate the DevOps of containerized services by enabling continuous integration and delivery. Microservice platforms deploy application containers on virtual or physical machines provided by public/private cloud infrastructures in a seamless manner. In this paper, we study and evaluate the provisioning performance of microservice platforms by incorporating the details of all layers (i.e., both micro and macro layers) in the modelling process. To this end, we first build a microservice platform on top of Amazon EC2 cloud and then leverage it to develop a comprehensive performance model to perform what-if analysis and capacity planning for microservice platforms at scale. In other words, the proposed performance model provides a systematic approach to measure the elasticity of the microservice platform by analyzing the provisioning performance at both the microservice platform and the back-end macroservice infrastructures.
Submitted 3 October, 2020; v1 submitted 9 February, 2019;
originally announced February 2019.
-
Indirect Mechanism Design for Efficient and Stable Renewable Energy Aggregation
Authors:
Hossein Khazaei,
Yue Zhao
Abstract:
Mechanism design is studied for aggregating renewable power producers (RPPs) in a two-settlement power market. Employing an indirect mechanism design framework, a payoff allocation mechanism (PAM) is derived from the competitive equilibrium (CE) of a specially formulated market with transferrable payoff. Given the designed mechanism, the strategic behaviors of the participating RPPs entail a non-cooperative game: It is proven that a unique pure Nash equilibrium (NE) exists among the RPPs, for which a closed-form expression is found. Moreover, it is proven that the designed mechanism achieves a number of key desirable properties at the NE: these include efficiency (i.e., an ideal "Price of Anarchy" of one), stability (i.e., "in the core" from a coalitional game theoretic perspective), and no collusion. In addition, it is shown that a set of desirable "ex-post" properties are also achieved by the designed mechanism. Extensive simulations are conducted and corroborate the theoretical results.
Submitted 6 October, 2018;
originally announced October 2018.
-
Elascale: Autoscaling and Monitoring as a Service
Authors:
Hamzeh Khazaei,
Rajsimman Ravichandiran,
Byungchul Park,
Hadi Bannazadeh,
Ali Tizghadam,
Alberto Leon-Garcia
Abstract:
Auto-scalability has become an evident feature for cloud software systems including but not limited to big data and IoT applications. Cloud application providers now are in full control over their applications' microservices and macroservices; virtual machines and containers can be provisioned or deprovisioned on demand at runtime. Elascale strives to adjust both micro/macro resources with respect to workload and changes in the internal state of the whole application stack. Elascale leverages the Elasticsearch stack for collection, analysis and storage of performance metrics. Elascale then uses its default scaling engine to elastically adapt the managed application. Extendibility is guaranteed through provider, schema, plug-in and policy elements in Elascale by which flexible scalability algorithms, including both reactive and proactive techniques, can be designed and implemented for various technologies, infrastructures and software stacks. In this paper, we present the architecture and initial implementation of Elascale; an instance will be leveraged to add auto-scalability to a generic IoT application. Because it has zero dependency on the target software system, Elascale can be leveraged to provide auto-scalability and monitoring as-a-service for any type of cloud software system.
Submitted 8 November, 2017;
originally announced November 2017.
-
Ex-post Stable and Fair Payoff Allocation for Renewable Energy Aggregation
Authors:
Hossein Khazaei,
Yue Zhao
Abstract:
Aggregating statistically diverse renewable power producers (RPPs) is an effective way to reduce the uncertainty of the RPPs. The key question in aggregation of RPPs is how to allocate payoffs among the RPPs. In this paper, a payoff allocation mechanism (PAM) with a simple closed-form expression is proposed: It achieves stability (in the core) and fairness both in the "ex-post" sense, i.e., for all possible realizations of renewable power generation. Furthermore, this PAM can in fact be derived from the competitive equilibrium in a market. The proposed PAM is evaluated in a simulation study with ten wind power producers in the PJM interconnection.
Submitted 11 January, 2017; v1 submitted 30 December, 2016;
originally announced December 2016.