Search | arXiv e-print repository

doi 10.1109/IISWC55918.2022.00024

Performance Characterization of AutoNUMA Memory Tiering on Graph Analytics

Authors: Diego Moura, Vinicius Petrucci, Daniel Mosse

Abstract: Non-Volatile Memory (NVM) can deliver higher density and lower cost per bit when compared with DRAM. Its main drawback is that it is slower than DRAM. On the other hand, DRAM has scalability problems due to its cost and energy consumption. NVM will likely coexist with DRAM in computer systems and the biggest challenge is to know which data to allocate on each type of memory. A state-of-the-art app… ▽ More Non-Volatile Memory (NVM) can deliver higher density and lower cost per bit when compared with DRAM. Its main drawback is that it is slower than DRAM. On the other hand, DRAM has scalability problems due to its cost and energy consumption. NVM will likely coexist with DRAM in computer systems and the biggest challenge is to know which data to allocate on each type of memory. A state-of-the-art approach is AutoNUMA, in the Linux kernel. Prior work is limited to measuring AutoNUMA solely in terms of the application execution time, without understanding AutoNUMA's behavior. In this work we provide a more in-depth characterization of AutoNUMA, for instance, identifying where exactly a set of pages are allocated, while kee** track of promotion and demotion decisions performed by AutoNUMA. Our analysis shows that AutoNUMA's benefits can be modest when running graph processing applications, or graph analytics, because most pages have only one access over the entire execution time and other pages accesses have no temporal locality. We make a case for exploring application characteristics using object-level map**s between DRAM and NVM. Our preliminary experiments show that an object-level memory tiering can better capture the application behavior and reduce the execution time of graph analytics by 21% (avg) and 51% (max), when compared to AutoNUMA, while significantly reducing the number of memory accesses in NVM. △ Less

Submitted 9 November, 2022; originally announced December 2022.

arXiv:2211.02195 [pdf, other]

doi 10.1145/3488423.3519324

Learning to Rank Graph-based Application Objects on Heterogeneous Memories

Authors: Diego Moura, Vinicius Petrucci, Daniel Mosse

Abstract: Persistent Memory (PMEM), also known as Non-Volatile Memory (NVM), can deliver higher density and lower cost per bit when compared with DRAM. Its main drawback is that it is typically slower than DRAM. On the other hand, DRAM has scalability problems due to its cost and energy consumption. Soon, PMEM will likely coexist with DRAM in computer systems but the biggest challenge is to know which data… ▽ More Persistent Memory (PMEM), also known as Non-Volatile Memory (NVM), can deliver higher density and lower cost per bit when compared with DRAM. Its main drawback is that it is typically slower than DRAM. On the other hand, DRAM has scalability problems due to its cost and energy consumption. Soon, PMEM will likely coexist with DRAM in computer systems but the biggest challenge is to know which data to allocate on each type of memory. This paper describes a methodology for identifying and characterizing application objects that have the most influence on the application's performance using Intel Optane DC Persistent Memory. In the first part of our work, we built a tool that automates the profiling and analysis of application objects. In the second part, we build a machine learning model to predict the most critical object within large-scale graph-based applications. Our results show that using isolated features does not bring the same benefit compared to using a carefully chosen set of features. By performing data placement using our predictive model, we can reduce the execution time degradation by 12\% (average) and 30\% (max) when compared to the baseline's approach based on LLC misses indicator. △ Less

Submitted 3 November, 2022; originally announced November 2022.

arXiv:2206.04397 [pdf, other]

doi 10.1145/3533767.3543294

ESBMC-Jimple: Verifying Kotlin Programs via Jimple Intermediate Representation

Authors: Rafael Menezes, Daniel Moura, Helena Cavalcante, Rosiane de Freitas, Lucas C. Cordeiro

Abstract: In this work, we describe and evaluate the first model checker for verifying Kotlin programs through the Jimple intermediate representation. The verifier, named ESBMC-Jimple, is built on top of the Efficient SMT-based Context-Bounded Model Checker (ESBMC). It uses the Soot framework to obtain the Jimple IR, representing a simplified version of the Kotlin source code, containing a maximum of three… ▽ More In this work, we describe and evaluate the first model checker for verifying Kotlin programs through the Jimple intermediate representation. The verifier, named ESBMC-Jimple, is built on top of the Efficient SMT-based Context-Bounded Model Checker (ESBMC). It uses the Soot framework to obtain the Jimple IR, representing a simplified version of the Kotlin source code, containing a maximum of three operands per instruction. ESBMC-Jimple processes Kotlin source code together with a model of the standard Kotlin libraries and checks a set of safety properties. Experimental results show that ESBMC-Jimple can correctly verify a set of Kotlin benchmarks from the literature and that it is competitive with state-of-the-art Java bytecode verifiers. A demonstration is available at https://youtu.be/J6WhNfXvJNc. △ Less

Submitted 20 July, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

Comments: ACM SIGSOFT International Symposium on Software Testing and Analysis 2022

arXiv:2110.12009 [pdf]

Definition and Implications of the Digital Near-Death Experience: A Theoretical Essay on Preliminary Empiricism

Authors: Pedro Jacome de Moura Jr

Abstract: Purpose: People are increasingly adhering to social networking platforms (SNP), and this adhesion is often unreflective, which makes them alienate data, actions, and decisions to tech giants. This essay discusses what happens when, eventually, someone chooses to cancel their participation in a large SNP. Methodology/design: This is a theoretical essay, whose narrative resembles a theoretical-empir… ▽ More Purpose: People are increasingly adhering to social networking platforms (SNP), and this adhesion is often unreflective, which makes them alienate data, actions, and decisions to tech giants. This essay discusses what happens when, eventually, someone chooses to cancel their participation in a large SNP. Methodology/design: This is a theoretical essay, whose narrative resembles a theoretical-empirical manuscript, grounded on the author's experience and his subjective perceptions regarding being out of the WhatsApp network (nowadays, the main SNP instance in the world). Findings/highlights: This study proposes a definition and implications of the supposedly new "digital near-death experience" concept, a metaphor for the classic near-death experience (NDE). A research agenda is also proposed. Limitations: The resulting propositions are grounded on a set of assumptions, that if falsified, make the findings invalid. △ Less

Submitted 1 August, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

Comments: 5 pages. Accepted for presentation at Information Systems in Latin America (ISLA) 2022 conference (AIS/AMCIS)

arXiv:2107.09208 [pdf]

Music Tempo Estimation via Neural Networks -- A Comparative Analysis

Authors: Mila Soares de Oliveira de Souza, Pedro Nuno de Souza Moura, Jean-Pierre Briot

Abstract: This paper presents a comparative analysis on two artificial neural networks (with different architectures) for the task of tempo estimation. For this purpose, it also proposes the modeling, training and evaluation of a B-RNN (Bidirectional Recurrent Neural Network) model capable of estimating tempo in bpm (beats per minutes) of musical pieces, without using external auxiliary modules. An extensiv… ▽ More This paper presents a comparative analysis on two artificial neural networks (with different architectures) for the task of tempo estimation. For this purpose, it also proposes the modeling, training and evaluation of a B-RNN (Bidirectional Recurrent Neural Network) model capable of estimating tempo in bpm (beats per minutes) of musical pieces, without using external auxiliary modules. An extensive database (12,550 pieces in total) was curated to conduct a quantitative and qualitative analysis over the experiment. Percussion-only tracks were also included in the dataset. The performance of the B-RNN is compared to that of state-of-the-art models. For further comparison, a state-of-the-art CNN was also retrained with the same datasets used for the B-RNN training. Evaluation results for each model and datasets are presented and discussed, as well as observations and ideas for future research. Tempo estimation was more accurate for the percussion only dataset, suggesting that the estimation can be more accurate for percussion-only tracks, although further experiments (with more of such datasets) should be made to gather stronger evidence. △ Less

Submitted 19 July, 2021; originally announced July 2021.

ACM Class: I.2; I.5.4; J.5

arXiv:2107.06664 [pdf]

EnergySaver Software Manual

Authors: Davi Guimarães da Silva, Marla Teresinha Barbosa Geller, Dalton Felipe Silva Varão, João Bentes, Mauro Sérgio dos Santos Moura, Yasmin Braga Teixeira, Clayton André Maia dos Santos, Anderson Alvarenga de Moura Meneses

Abstract: Energy efficiency is a topic that has attracted the attention of researchers in recent years, in order to seek sustainability solutions for energy production and reduction of its costs, aiming to provide a balance between development and protection of natural resources. Thus, we proposed the EnergySaver software that has as its objective the monitoring of electric energy consumption, from data cap… ▽ More Energy efficiency is a topic that has attracted the attention of researchers in recent years, in order to seek sustainability solutions for energy production and reduction of its costs, aiming to provide a balance between development and protection of natural resources. Thus, we proposed the EnergySaver software that has as its objective the monitoring of electric energy consumption, from data capture to consumption forecast for the following month. To create Energy Saver, we used Open Source technologies applied to the Internet of Things (IoT), embedded systems, and Long Short-Term Memory Neural Networks (LSTM). However, in order to have harmony between the current researchers and those who may manipulate this software in the future, it is essential to create a Software Manual, where all the details of its implementation are described in detail. Therefore, this article describes all the steps for the implementation of the system, from the methodological scheme of the system, its modeling with UML, to the modules that compose it, becoming a Manual for its use. △ Less

Submitted 13 July, 2021; originally announced July 2021.

Comments: 8 pages, in Portuguese, 21 figures

arXiv:2002.06382 [pdf, ps, other]

Automatic lesion segmentation and Pathological Myopia classification in fundus images

Authors: Cefas Rodrigues Freire, Julio Cesar da Costa Moura, Daniele Montenegro da Silva Barros, Ricardo Alexsandro de Medeiros Valentim

Abstract: In this paper we present algorithms to diagnosis Pathological Myopia (PM) and detection of retinal structures and lesions such asOptic Disc (OD), Fovea, Atrophy and Detachment. All these tasks were performed in fundus imaging from PM patients and they are requirements to participate in the Pathologic Myopia Challenge (PALM). The challenge was organized as a half day Challenge, a Satellite Event of… ▽ More In this paper we present algorithms to diagnosis Pathological Myopia (PM) and detection of retinal structures and lesions such asOptic Disc (OD), Fovea, Atrophy and Detachment. All these tasks were performed in fundus imaging from PM patients and they are requirements to participate in the Pathologic Myopia Challenge (PALM). The challenge was organized as a half day Challenge, a Satellite Event of The IEEE International Symposium on Biomedical Imaging in Venice Italy.Our method applies different Deep Learning techniques for each task. Transfer learning is applied in all tasks using Xception as the baseline model. Also, some key ideas of YOLO architecture are used in the Optic Disc segmentation algorithm pipeline. We have evaluated our model's performance according the challenge rules in terms of AUC-ROC, F1-Score, Mean Dice Score and Mean Euclidean Distance. For initial activities our method has shown satisfactory results. △ Less

Submitted 15 February, 2020; originally announced February 2020.

Comments: ISBI 2019 PALM Challenge Report

arXiv:1504.02687 [pdf, ps, other]

3D Density Histograms for Criteria-driven Edge Bundling

Authors: Daniel C. Moura

Abstract: This paper presents a graph bundling algorithm that agglomerates edges taking into account both spatial proximity as well as user-defined criteria in order to reveal patterns that were not perceivable with previous bundling techniques. Each edge belongs to a group that may either be an input of the problem or found by clustering one or more edge properties such as origin, destination, orientation,… ▽ More This paper presents a graph bundling algorithm that agglomerates edges taking into account both spatial proximity as well as user-defined criteria in order to reveal patterns that were not perceivable with previous bundling techniques. Each edge belongs to a group that may either be an input of the problem or found by clustering one or more edge properties such as origin, destination, orientation, length or domain-specific properties. Bundling is driven by a stack of density maps, with each map capturing both the edge density of a given group as well as interactions with edges from other groups. Density maps are efficiently calculated by smoothing 2D histograms of edge occurrence using repeated averaging filters based on integral images. A CPU implementation of the algorithm is tested on several graphs, and different grou** criteria are used to illustrate how the proposed technique can render different visualizations of the same data. Bundling performance is much higher than on previous approaches, being particularly noticeable on large graphs, with millions of edges being bundled in seconds. △ Less

Submitted 10 April, 2015; originally announced April 2015.

Comments: Submitted to a conference (under review)

MSC Class: 68U05 ACM Class: I.3.3; E.1

Showing 1–8 of 8 results for author: Moura, D