Search | arXiv e-print repository

Informed and Assessable Observability Design Decisions in Cloud-native Microservice Applications

Authors: Maria C. Borges, Joshua Bauer, Sebastian Werner, Michael Gebauer, Stefan Tai

Abstract: Observability is important to ensure the reliability of microservice applications. These applications are often prone to failures, since they have many independent services deployed on heterogeneous environments. When employed "correctly", observability can help developers identify and troubleshoot faults quickly. However, instrumenting and configuring the observability of a microservice applicati… ▽ More Observability is important to ensure the reliability of microservice applications. These applications are often prone to failures, since they have many independent services deployed on heterogeneous environments. When employed "correctly", observability can help developers identify and troubleshoot faults quickly. However, instrumenting and configuring the observability of a microservice application is not trivial but tool-dependent and tied to costs. Architects need to understand observability-related trade-offs in order to weigh between different observability design alternatives. Still, these architectural design decisions are not supported by systematic methods and typically just rely on "professional intuition". In this paper, we argue for a systematic method to arrive at informed and continuously assessable observability design decisions. Specifically, we focus on fault observability of cloud-native microservice applications, and turn this into a testable and quantifiable property. Towards our goal, we first model the scale and scope of observability design decisions across the cloud-native stack. Then, we propose observability metrics which can be determined for any microservice application through so-called observability experiments. We present a proof-of-concept implementation of our experiment tool OXN. OXN is able to inject arbitrary faults into an application, similar to Chaos Engineering, but also possesses the unique capability to modify the observability configuration, allowing for the assessment of design decisions that were previously left unexplored. We demonstrate our approach using a popular open source microservice application and show the trade-offs involved in different observability design decisions. △ Less

Submitted 1 March, 2024; originally announced March 2024.

Journal ref: International Conference on Software Architecture 2024

arXiv:2402.10665 [pdf, other]

Selective Prediction for Semantic Segmentation using Post-Hoc Confidence Estimation and Its Performance under Distribution Shift

Authors: Bruno Laboissiere Camargos Borges, Bruno Machado Pacheco, Danilo Silva

Abstract: Semantic segmentation plays a crucial role in various computer vision applications, yet its efficacy is often hindered by the lack of high-quality labeled data. To address this challenge, a common strategy is to leverage models trained on data from different populations, such as publicly available datasets. This approach, however, leads to the distribution shift problem, presenting a reduced perfo… ▽ More Semantic segmentation plays a crucial role in various computer vision applications, yet its efficacy is often hindered by the lack of high-quality labeled data. To address this challenge, a common strategy is to leverage models trained on data from different populations, such as publicly available datasets. This approach, however, leads to the distribution shift problem, presenting a reduced performance on the population of interest. In scenarios where model errors can have significant consequences, selective prediction methods offer a means to mitigate risks and reduce reliance on expert supervision. This paper investigates selective prediction for semantic segmentation in low-resource settings, thus focusing on post-hoc confidence estimators applied to pre-trained models operating under distribution shift. We propose a novel image-level confidence measure tailored for semantic segmentation and demonstrate its effectiveness through experiments on three medical imaging tasks. Our findings show that post-hoc confidence estimators offer a cost-effective approach to reducing the impacts of distribution shift. △ Less

Submitted 6 May, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

arXiv:2311.05047 [pdf, ps, other]

doi 10.26615/978-954-452-084-7_042

DeepLearningBrasil@LT-EDI-2023: Exploring Deep Learning Techniques for Detecting Depression in Social Media Text

Authors: Eduardo Garcia, Juliana Gomes, Adalberto Barbosa Júnior, Cardeque Borges, Nádia da Silva

Abstract: In this paper, we delineate the strategy employed by our team, DeepLearningBrasil, which secured us the first place in the shared task DepSign-LT-EDI@RANLP-2023, achieving a 47.0% Macro F1-Score and a notable 2.4% advantage. The task was to classify social media texts into three distinct levels of depression - "not depressed," "moderately depressed," and "severely depressed." Leveraging the power… ▽ More In this paper, we delineate the strategy employed by our team, DeepLearningBrasil, which secured us the first place in the shared task DepSign-LT-EDI@RANLP-2023, achieving a 47.0% Macro F1-Score and a notable 2.4% advantage. The task was to classify social media texts into three distinct levels of depression - "not depressed," "moderately depressed," and "severely depressed." Leveraging the power of the RoBERTa and DeBERTa models, we further pre-trained them on a collected Reddit dataset, specifically curated from mental health-related Reddit's communities (Subreddits), leading to an enhanced understanding of nuanced mental health discourse. To address lengthy textual data, we used truncation techniques that retained the essence of the content by focusing on its beginnings and endings. Our model was robust against unbalanced data by incorporating sample weights into the loss. Cross-validation and ensemble techniques were then employed to combine our k-fold trained models, delivering an optimal solution. The accompanying code is made available for transparency and further development. △ Less

Submitted 8 November, 2023; originally announced November 2023.

Report number: 2023.ltedi-1.42

arXiv:2212.08736 [pdf, other]

doi 10.1016/j.jcp.2023.112341

A Neural Network Warm-Start Approach for the Inverse Acoustic Obstacle Scattering Problem

Authors: Mo Zhou, Jiequn Han, Manas Rachh, Carlos Borges

Abstract: We consider the inverse acoustic obstacle problem for sound-soft star-shaped obstacles in two dimensions wherein the boundary of the obstacle is determined from measurements of the scattered field at a collection of receivers outside the object. One of the standard approaches for solving this problem is to reformulate it as an optimization problem: finding the boundary of the domain that minimizes… ▽ More We consider the inverse acoustic obstacle problem for sound-soft star-shaped obstacles in two dimensions wherein the boundary of the obstacle is determined from measurements of the scattered field at a collection of receivers outside the object. One of the standard approaches for solving this problem is to reformulate it as an optimization problem: finding the boundary of the domain that minimizes the $L^2$ distance between computed values of the scattered field and the given measurement data. The optimization problem is computationally challenging since the local set of convexity shrinks with increasing frequency and results in an increasing number of local minima in the vicinity of the true solution. In many practical experimental settings, low frequency measurements are unavailable due to limitations of the experimental setup or the sensors used for measurement. Thus, obtaining a good initial guess for the optimization problem plays a vital role in this environment. We present a neural network warm-start approach for solving the inverse scattering problem, where an initial guess for the optimization problem is obtained using a trained neural network. We demonstrate the effectiveness of our method with several numerical examples. For high frequency problems, this approach outperforms traditional iterative methods such as Gauss-Newton initialized without any prior (i.e., initialized using a unit circle), or initialized using the solution of a direct method such as the linear sampling method. The algorithm remains robust to noise in the scattered field measurements and also converges to the true solution for limited aperture data. However, the number of training samples required to train the neural network scales exponentially in frequency and the complexity of the obstacles considered. We conclude with a discussion of this phenomenon and potential directions for future research. △ Less

Submitted 3 August, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

Journal ref: Journal of Computational Physics, 488, 112243 (2023)

arXiv:2110.03471 [pdf, other]

FaaSter Troubleshooting -- Evaluating Distributed Tracing Approaches for Serverless Applications

Authors: Maria C. Borges, Sebastian Werner, Ahmet Kilic

Abstract: Serverless applications can be particularly difficult to troubleshoot, as these applications are often composed of various managed and partly managed services. Faults are often unpredictable and can occur at multiple points, even in simple compositions. Each additional function or service in a serverless composition introduces a new possible fault source and a new layer to obfuscate faults. Curren… ▽ More Serverless applications can be particularly difficult to troubleshoot, as these applications are often composed of various managed and partly managed services. Faults are often unpredictable and can occur at multiple points, even in simple compositions. Each additional function or service in a serverless composition introduces a new possible fault source and a new layer to obfuscate faults. Currently, serverless platforms offer only limited support for identifying runtime faults. Developers looking to observe their serverless compositions often have to rely on scattered logs and ambiguous error messages to pinpoint root causes. In this paper, we investigate the use of distributed tracing for improving the observability of faults in serverless applications. To this end, we first introduce a model for characterizing fault observability, then provide a prototypical tracing implementation - specifically, a developer-driven and a platform-supported tracing approach. We compare both approaches with our model, measure associated trade-offs (execution latency, resource utilization), and contribute new insights for troubleshooting serverless compositions. △ Less

Submitted 7 October, 2021; originally announced October 2021.

Comments: 2021 IEEE International Conference on Cloud Engineering (IC2E)

arXiv:2106.06001 [pdf, other]

doi 10.1109/EuroSPW54576.2021.00039

TIRA: An OpenAPI Extension and Toolbox for GDPR Transparency in RESTful Architectures

Authors: Elias Grünewald, Paul Wille, Frank Pallas, Maria C. Borges, Max-R. Ulbricht

Abstract: Transparency - the provision of information about what personal data is collected for which purposes, how long it is stored, or to which parties it is transferred - is one of the core privacy principles underlying regulations such as the GDPR. Technical approaches for implementing transparency in practice are, however, only rarely considered. In this paper, we present a novel approach for doing so… ▽ More Transparency - the provision of information about what personal data is collected for which purposes, how long it is stored, or to which parties it is transferred - is one of the core privacy principles underlying regulations such as the GDPR. Technical approaches for implementing transparency in practice are, however, only rarely considered. In this paper, we present a novel approach for doing so in current, RESTful application architectures and in line with prevailing agile and DevOps-driven practices. For this purpose, we introduce 1) a transparency-focused extension of OpenAPI specifications that allows individual service descriptions to be enriched with transparency-related annotations in a bottom-up fashion and 2) a set of higher-order tools for aggregating respective information across multiple, interdependent services and for coherently integrating our approach into automated CI/CD-pipelines. Together, these building blocks pave the way for providing transparency information that is more specific and at the same time better reflects the actual implementation givens within complex service architectures than current, overly broad privacy statements. △ Less

Submitted 10 June, 2021; originally announced June 2021.

Comments: Accepted for publication at the 2021 International Workshop on Privacy Engineering (IWPE'21). This is a preprint manuscript (authors' own version before final copy-editing)

arXiv:2102.02649 [pdf, ps, other]

A step toward a reinforcement learning de novo genome assembler

Authors: Kleber Padovani, Roberto Xavier, Rafael Cabral Borges, Andre Carvalho, Anna Reali, Annie Chateau, Ronnie Alves

Abstract: De novo genome assembly is a relevant but computationally complex task in genomics. Although de novo assemblers have been used successfully in several genomics projects, there is still no 'best assembler', and the choice and setup of assemblers still rely on bioinformatics experts. Thus, as with other computationally complex problems, machine learning may emerge as an alternative (or complementary… ▽ More De novo genome assembly is a relevant but computationally complex task in genomics. Although de novo assemblers have been used successfully in several genomics projects, there is still no 'best assembler', and the choice and setup of assemblers still rely on bioinformatics experts. Thus, as with other computationally complex problems, machine learning may emerge as an alternative (or complementary) way for develo** more accurate and automated assemblers. Reinforcement learning has proven promising for solving complex activities without supervision - such games - and there is a pressing need to understand the limits of this approach to 'real' problems, such as the DFA problem. This study aimed to shed light on the application of machine learning, using reinforcement learning (RL), in genome assembly. We expanded upon the sole previous approach found in the literature to solve this problem by carefully exploring the learning aspects of the proposed intelligent agent, which uses the Q-learning algorithm, and we provided insights for the next steps of automated genome assembly development. We improved the reward system and optimized the exploration of the state space based on pruning and in collaboration with evolutionary computing. We tested the new approaches on 23 new larger environments, which are all available on the internet. Our results suggest consistent performance progress; however, we also found limitations, especially concerning the high dimensionality of state and action spaces. Finally, we discuss paths for achieving efficient and automated genome assembly in real scenarios considering successful RL applications - including deep reinforcement learning. △ Less

Submitted 7 March, 2024; v1 submitted 2 February, 2021; originally announced February 2021.

arXiv:1903.09312 [pdf]

doi 10.1016/j.jenvman.2018.11.037

New methodology for facilitating the food wastage quantification. Identifying gaps and data inconsistencies

Authors: Hector Barco Cobalea, Iraia Oribe Garcia, Maria Virginia Vargas Viedma, Cruz Enrique Borges, Cristina Martin Andonegui, Ainhoa Alonso Vicario

Abstract: The work aims at providing a new methodology to facilitate the process of quantifying the food waste according to European standards all along the agrifood chain combining information that is becoming available at local level. This new methodology generates straightforward and easy-to-interpret results for the decision making process in the framework of the quantification of the food waste at lo… ▽ More The work aims at providing a new methodology to facilitate the process of quantifying the food waste according to European standards all along the agrifood chain combining information that is becoming available at local level. This new methodology generates straightforward and easy-to-interpret results for the decision making process in the framework of the quantification of the food waste at local and supralocal scale and it provides adequate procedures which are easy adaptable to the specific circumstances in each municipality. Moreover, this method could have applications for larger territorial contexts, as the national scale, detecting possible points for improvement of the current official figures at this respect. △ Less

Submitted 6 May, 2019; v1 submitted 7 March, 2019; originally announced March 2019.

Comments: Accepted Manuscript version

Journal ref: Journal of Environmental Management. 2019

arXiv:1609.04735 [pdf, other]

doi 10.1145/3019612.3019730

RALL - Routing-Aware Of Path Length, Link Quality, And Traffic Load For Wireless Sensor Networks

Authors: Vinicius N. Medeiros, Douglas V. Santana, Bruno Silvestre, Vinicius da C. M. Borges

Abstract: Due to the enormous variety of application scenarios and ubiquity,Internet of Things (IoT) brought a new perspective of applications for the current and future Internet. The Wireless Sensor Networks provide key devices for develo** the IoT communication paradigm, such as the sensors collecting various kind of information and the routing and MAC protocols. However, this type of network has strong… ▽ More Due to the enormous variety of application scenarios and ubiquity,Internet of Things (IoT) brought a new perspective of applications for the current and future Internet. The Wireless Sensor Networks provide key devices for develo** the IoT communication paradigm, such as the sensors collecting various kind of information and the routing and MAC protocols. However, this type of network has strong power consumption and transmission capacity restrictions (low speed wireless links and subject to interference). In this context, it is necessary to develop solutions that enable a more efficient communication based on the optimized utilization of the network resources. This papers aims to present a multi-objective routing algorithm, named Routing-Aware of path Length, Link quality, and traffic Load (RALL), that seeks to balance three objectives: to minimize bottlenecks, to minimize path length, and to avoid links with low quality. RALL results in good performance when taking into consideration delivery rate, overhead, delay, and power consumption. △ Less

Submitted 15 September, 2016; originally announced September 2016.

Showing 1–9 of 9 results for author: Borges, C