-
Investigating the Impact of SOLID Design Principles on Machine Learning Code Understanding
Authors:
Raphael Cabral,
Marcos Kalinowski,
Maria Teresa Baldassarre,
Hugo Villamizar,
Tatiana Escovedo,
Hélio Lopes
Abstract:
[Context] Applying design principles has long been acknowledged as beneficial for understanding and maintainability in traditional software projects. These benefits may similarly hold for Machine Learning (ML) projects, which involve iterative experimentation with data, models, and algorithms. However, ML components are often developed by data scientists with diverse educational backgrounds, poten…
▽ More
[Context] Applying design principles has long been acknowledged as beneficial for understanding and maintainability in traditional software projects. These benefits may similarly hold for Machine Learning (ML) projects, which involve iterative experimentation with data, models, and algorithms. However, ML components are often developed by data scientists with diverse educational backgrounds, potentially resulting in code that doesn't adhere to software design best practices. [Goal] In order to better understand this phenomenon, we investigated the impact of the SOLID design principles on ML code understanding. [Method] We conducted a controlled experiment with three independent trials involving 100 data scientists. We restructured real industrial ML code that did not use SOLID principles. Within each trial, one group was presented with the original ML code, while the other was presented with ML code incorporating SOLID principles. Participants of both groups were asked to analyze the code and fill out a questionnaire that included both open-ended and closed-ended questions on their understanding. [Results] The study results provide statistically significant evidence that the adoption of the SOLID design principles can improve code understanding within the realm of ML projects. [Conclusion] We put forward that software engineering design principles should be spread within the data science community and considered for enhancing the maintainability of ML code.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
A Quantitative and Qualitative Analysis of Suicide Ideation Detection using Deep Learning
Authors:
Siqu Long,
Rina Cabral,
Josiah Poon,
Soyeon Caren Han
Abstract:
For preventing youth suicide, social media platforms have received much attention from researchers. A few researches apply machine learning, or deep learning-based text classification approaches to classify social media posts containing suicidality risk. This paper replicated competitive social media-based suicidality detection/prediction models. We evaluated the feasibility of detecting suicidal…
▽ More
For preventing youth suicide, social media platforms have received much attention from researchers. A few researches apply machine learning, or deep learning-based text classification approaches to classify social media posts containing suicidality risk. This paper replicated competitive social media-based suicidality detection/prediction models. We evaluated the feasibility of detecting suicidal ideation using multiple datasets and different state-of-the-art deep learning models, RNN-, CNN-, and Attention-based models. Using two suicidality evaluation datasets, we evaluated 28 combinations of 7 input embeddings with 4 commonly used deep learning models and 5 pretrained language models in quantitative and qualitative ways. Our replication study confirms that deep learning works well for social media-based suicidality detection in general, but it highly depends on the dataset's quality.
△ Less
Submitted 17 June, 2022;
originally announced June 2022.
-
PINs: Progressive Implicit Networks for Multi-Scale Neural Representations
Authors:
Zoe Landgraf,
Alexander Sorkine Hornung,
Ricardo Silveira Cabral
Abstract:
Multi-layer perceptrons (MLP) have proven to be effective scene encoders when combined with higher-dimensional projections of the input, commonly referred to as \textit{positional encoding}. However, scenes with a wide frequency spectrum remain a challenge: choosing high frequencies for positional encoding introduces noise in low structure areas, while low frequencies result in poor fitting of det…
▽ More
Multi-layer perceptrons (MLP) have proven to be effective scene encoders when combined with higher-dimensional projections of the input, commonly referred to as \textit{positional encoding}. However, scenes with a wide frequency spectrum remain a challenge: choosing high frequencies for positional encoding introduces noise in low structure areas, while low frequencies result in poor fitting of detailed regions. To address this, we propose a progressive positional encoding, exposing a hierarchical MLP structure to incremental sets of frequency encodings. Our model accurately reconstructs scenes with wide frequency bands and learns a scene representation at progressive level of detail \textit{without explicit per-level supervision}. The architecture is modular: each level encodes a continuous implicit representation that can be leveraged separately for its respective resolution, meaning a smaller network for coarser reconstructions. Experiments on several 2D and 3D datasets show improvements in reconstruction accuracy, representational capacity and training speed compared to baselines.
△ Less
Submitted 16 June, 2022; v1 submitted 9 February, 2022;
originally announced February 2022.
-
Temporal complex networks modeling applied to vehicular ad-hoc networks
Authors:
Fillipe Santos,
Andre L. L. Aquino,
Edmundo R. M. Madeira,
Raquel S. Cabral
Abstract:
VANETs solutions use aggregated graph representation to model the interaction among the vehicles and different aggregated complex network measures to quantify some topological characteristics. This modeling ignores the temporal interactions between the cars, causing loss of information or unrealistic behavior. This work proposes the use of both temporal graphs and temporal measures to model VANETs…
▽ More
VANETs solutions use aggregated graph representation to model the interaction among the vehicles and different aggregated complex network measures to quantify some topological characteristics. This modeling ignores the temporal interactions between the cars, causing loss of information or unrealistic behavior. This work proposes the use of both temporal graphs and temporal measures to model VANETs applications. To verify the viability of this model, we initially perform a comparative analysis between the temporal and aggregated modeling considering five different real datasets. This analysis shows that the aggregated model is inefficient in modeling the temporal aspects of networks. After that, we perform a network evaluation through a simulation by considering the impact of temporal modeling applied to the deployment of RSUs. First, we compare a solution based on our temporal modeling with a greedy algorithm based on an aggregated model to choose the positions of RSUs. In a scenario with 70 RSUs, we have 77% and 65% of coverage in the temporal and aggregated model (greedy algorithm), respectively. Second, we evaluate the use of aggregated and temporal measures applied as features in a genetic algorithm. The approach with temporal betweenness had the better result with 90% of the coverage area against 61% of aggregated one applied to the same scenario.
△ Less
Submitted 7 September, 2021;
originally announced September 2021.
-
Computação Urbana da Teoria à Prática: Fundamentos, Aplicações e Desafios
Authors:
Diego O. Rodrigues,
Frances A. Santos,
Geraldo P. Rocha Filho,
Ademar T. Akabane,
Raquel Cabral,
Roger Immich,
Wellington L. Junior,
Felipe D. Cunha,
Daniel L. Guidoni,
Thiago H. Silva,
Denis Rosário,
Eduardo Cerqueira,
Antonio A. F. Loureiro,
Leandro A. Villas
Abstract:
The growing of cities has resulted in innumerable technical and managerial challenges for public administrators such as energy consumption, pollution, urban mobility and even supervision of private and public spaces in an appropriate way. Urban Computing emerges as a promising paradigm to solve such challenges, through the extraction of knowledge, from a large amount of heterogeneous data existing…
▽ More
The growing of cities has resulted in innumerable technical and managerial challenges for public administrators such as energy consumption, pollution, urban mobility and even supervision of private and public spaces in an appropriate way. Urban Computing emerges as a promising paradigm to solve such challenges, through the extraction of knowledge, from a large amount of heterogeneous data existing in urban space. Moreover, Urban Computing correlates urban sensing, data management, and analysis to provide services that have the potential to improve the quality of life of the citizens of large urban centers. Consider this context, this chapter aims to present the fundamentals of Urban Computing and the steps necessary to develop an application in this area. To achieve this goal, the following questions will be investigated, namely: (i) What are the main research problems of Urban Computing?; (ii) What are the technological challenges for the implementation of services in Urban Computing?; (iii) What are the main methodologies used for the development of services in Urban Computing?; and (iv) What are the representative applications in this field?
△ Less
Submitted 2 December, 2019;
originally announced December 2019.
-
Variability Analysis of Complex Networks Measures based on Stochastic Distances
Authors:
Raquel Cabral,
Alejandro Frery,
Jaime Ramírez
Abstract:
Complex networks can model the structure and dynamics of different types of systems. It has been shown that they are characterized by a set of measures. In this work, we evaluate the variability of complex networks measures face to perturbations and, for this purpose, we impose controlled perturbations and quantify their effect. We analyze theoretical models (random, small-world and scale-free) an…
▽ More
Complex networks can model the structure and dynamics of different types of systems. It has been shown that they are characterized by a set of measures. In this work, we evaluate the variability of complex networks measures face to perturbations and, for this purpose, we impose controlled perturbations and quantify their effect. We analyze theoretical models (random, small-world and scale-free) and real networks (a collaboration network and a metabolic networks) along with the shortest path length, vertex degree, local cluster coefficient and betweenness centrality measures. In such analysis, we propose the use of three stochastic quantifiers: the Kullback-Leibler divergence and the Jensen-Shannon and Hellinger distances. The sensitivity of these measures was analyzed with respect to the following perturbations: edge addition, edge removal, edge rewiring and node removal, all of them applied at different intensities. The results reveal that the evaluated measures are influenced by these perturbations. Additionally, hypotheses tests were performed to verify the behavior of the degree distribution to identify the intensity of the perturbations that leads to break this property.
△ Less
Submitted 29 July, 2014;
originally announced July 2014.
-
Feature and Region Selection for Visual Learning
Authors:
Ji Zhao,
Liantao Wang,
Ricardo Cabral,
Fernando De la Torre
Abstract:
Visual learning problems such as object classification and action recognition are typically approached using extensions of the popular bag-of-words (BoW) model. Despite its great success, it is unclear what visual features the BoW model is learning: Which regions in the image or video are used to discriminate among classes? Which are the most discriminative visual words? Answering these questions…
▽ More
Visual learning problems such as object classification and action recognition are typically approached using extensions of the popular bag-of-words (BoW) model. Despite its great success, it is unclear what visual features the BoW model is learning: Which regions in the image or video are used to discriminate among classes? Which are the most discriminative visual words? Answering these questions is fundamental for understanding existing BoW models and inspiring better models for visual recognition.
To answer these questions, this paper presents a method for feature selection and region selection in the visual BoW model. This allows for an intermediate visualization of the features and regions that are important for visual learning. The main idea is to assign latent weights to the features or regions, and jointly optimize these latent variables with the parameters of a classifier (e.g., support vector machine). There are four main benefits of our approach: (1) Our approach accommodates non-linear additive kernels such as the popular $χ^2$ and intersection kernel; (2) our approach is able to handle both regions in images and spatio-temporal regions in videos in a unified way; (3) the feature selection problem is convex, and both problems can be solved using a scalable reduced gradient method; (4) we point out strong connections with multiple kernel learning and multiple instance learning approaches. Experimental results in the PASCAL VOC 2007, MSR Action Dataset II and YouTube illustrate the benefits of our approach.
△ Less
Submitted 18 January, 2016; v1 submitted 20 July, 2014;
originally announced July 2014.
-
Structural Changes in Data Communication in Wireless Sensor Networks
Authors:
Raquel S. Cabral,
Andre L. L. Aquino,
Alejandro C. Frery,
Osvaldo A. Rosso,
Jaime A. Ramírez
Abstract:
Wireless sensor networks are an important technology for making distributed autonomous measures in hostile or inaccessible environments. Among the challenges they pose, the way data travel among them is a relevant issue since their structure is quite dynamic. The operational topology of such devices can often be described by complex networks. In this work, we assess the variation of measures commo…
▽ More
Wireless sensor networks are an important technology for making distributed autonomous measures in hostile or inaccessible environments. Among the challenges they pose, the way data travel among them is a relevant issue since their structure is quite dynamic. The operational topology of such devices can often be described by complex networks. In this work, we assess the variation of measures commonly employed in the complex networks literature applied to wireless sensor networks. Four data communication strategies were considered: geometric, random, small-world, and scale-free models, along with the shortest path length measure. The sensitivity of this measure was analyzed with respect to the following perturbations: insertion and removal of nodes in the geometric strategy; and insertion, removal and rewiring of links in the other models. The assessment was performed using the normalized Kullback-Leibler divergence and Hellinger distance quantifiers, both deriving from the Information Theory framework. The results reveal that the shortest path length is sensitive to perturbations.
△ Less
Submitted 13 August, 2013;
originally announced August 2013.