-
Characterizing Nodes and Edges in Dynamic Attributed Networks: A Social-based Approach
Authors:
Thiago H. P. Silva,
Alberto H. F. Laender,
Pedro O. S. Vaz de Melo
Abstract:
How to characterize nodes and edges in dynamic attributed networks based on social aspects? We address this problem by exploring the strength of the ties between actors and their associated attributes over time, thus capturing the social roles of the actors and the meaning of their dynamic interactions in different social network scenarios. For this, we apply social concepts to promote a better un…
▽ More
How to characterize nodes and edges in dynamic attributed networks based on social aspects? We address this problem by exploring the strength of the ties between actors and their associated attributes over time, thus capturing the social roles of the actors and the meaning of their dynamic interactions in different social network scenarios. For this, we apply social concepts to promote a better understanding of the underlying complexity that involves actors and their social motivations. More specifically, we explore the notion of social capital given by the strategic positioning of a particular actor in a social structure by means of the concepts of brokerage, the ability of creating bridges with diversified patterns, and closure, the ability of aggregating nodes with similar patterns. As a result, we unveil the differences of social interactions in distinct academic coauthorship networks and questions \& answers communities. We also statistically validate our social definitions considering the importance of the nodes and edges in a social structure by means of network properties.
△ Less
Submitted 14 July, 2022;
originally announced July 2022.
-
Science Tree: A Platform for Exploring the Brazilian Academic Genealogy
Authors:
João M. M. C. Cota,
Alberto H. F. Laender,
Raquel O. Prates
Abstract:
Identifying and studying the formation of researchers over the years is a challenging task, as the current repositories of theses and dissertations are cataloged in a decentralized manner in different digital libraries, many of them with limited scope. In this paper, we take a step forward towards building a large repository to record the Brazilian academic genealogy. For this, we collected data f…
▽ More
Identifying and studying the formation of researchers over the years is a challenging task, as the current repositories of theses and dissertations are cataloged in a decentralized manner in different digital libraries, many of them with limited scope. In this paper, we take a step forward towards building a large repository to record the Brazilian academic genealogy. For this, we collected data from the Lattes platform, an internationally recognized initiative that provides a repository of researchers' curricula maintained by the Brazilian National Council for Scientific and Technological Development (CNPq), and developed a user-oriented platform to generate the academic genealogy trees of Brazilian researchers from them, also providing additional data resulting from a series of analyses regarding the main properties of such trees. Our effort has identified interesting aspects related to the academic career of the Brazilian researchers, which highlight the importance of generating and cataloging their academic genealogy trees.
△ Less
Submitted 10 August, 2021;
originally announced August 2021.
-
Overcoming Bias in Community Detection Evaluation
Authors:
Jeancarlo Campos Leão,
Alberto H. F. Laender,
Pedro O. S. Vaz de Melo
Abstract:
Community detection is a key task to further understand the function and the structure of complex networks. Therefore, a strategy used to assess this task must be able to avoid biased and incorrect results that might invalidate further analyses or applications that rely on such communities. Two widely used strategies to assess this task are generally known as structural and functional. The structu…
▽ More
Community detection is a key task to further understand the function and the structure of complex networks. Therefore, a strategy used to assess this task must be able to avoid biased and incorrect results that might invalidate further analyses or applications that rely on such communities. Two widely used strategies to assess this task are generally known as structural and functional. The structural strategy basically consists in detecting and assessing such communities by using multiple methods and structural metrics. On the other hand, the functional strategy might be used when ground truth data are available to assess the detected communities. However, the evaluation of communities based on such strategies is usually done in experimental configurations that are largely susceptible to biases, a situation that is inherent to algorithms, metrics and network data used in this task. Furthermore, such strategies are not systematically combined in a way that allows for the identification and mitigation of bias in the algorithms, metrics or network data to converge into more consistent results. In this context, the main contribution of this article is an approach that supports a robust quality evaluation when detecting communities in real-world networks. In our approach, we measure the quality of a community by applying the structural and functional strategies, and the combination of both, to obtain different pieces of evidence. Then, we consider the divergences and the consensus among the pieces of evidence to identify and overcome possible sources of bias in community detection algorithms, evaluation metrics, and network data. Experiments conducted with several real and synthetic networks provided results that show the effectiveness of our approach to obtain more consistent conclusions about the quality of the detected communities.
△ Less
Submitted 5 February, 2021;
originally announced February 2021.
-
A Brief Survey on Replica Consistency in Cloud Environments
Authors:
Robson A. Campêlo,
Marco A. Casanova,
Dorgival O. Guedes,
Alberto H. F. Laender
Abstract:
Cloud computing is a general term that involves delivering hosted services over the Internet. With the accelerated growth of the volume of data used by applications, many organizations have moved their data into cloud servers to provide scalable, reliable and highly available services. A particularly challenging issue that arises in the context of cloud storage systems with geographically-distribu…
▽ More
Cloud computing is a general term that involves delivering hosted services over the Internet. With the accelerated growth of the volume of data used by applications, many organizations have moved their data into cloud servers to provide scalable, reliable and highly available services. A particularly challenging issue that arises in the context of cloud storage systems with geographically-distributed data replication is how to reach a consistent state for all replicas. This survey reviews major aspects related to consistency issues in cloud data storage systems, categorizing recently proposed methods into three categories: (1) fixed consistency methods, (2) configurable consistency methods and (3) consistency monitoring methods.
△ Less
Submitted 1 September, 2020; v1 submitted 26 August, 2020;
originally announced August 2020.
-
A Multi-Strategy Approach to Overcoming Bias in Community Detection Evaluation
Authors:
Jeancarlo Campos Leão,
Alberto H. F. Laender,
Pedro O. S. Vaz de Melo
Abstract:
Community detection is key to understand the structure of complex networks. However, the lack of appropriate evaluation strategies for this specific task may produce biased and incorrect results that might invalidate further analyses or applications based on such networks. In this context, the main contribution of this paper is an approach that supports a robust quality evaluation when detecting c…
▽ More
Community detection is key to understand the structure of complex networks. However, the lack of appropriate evaluation strategies for this specific task may produce biased and incorrect results that might invalidate further analyses or applications based on such networks. In this context, the main contribution of this paper is an approach that supports a robust quality evaluation when detecting communities in real-world networks. In our approach, we use multiple strategies that capture distinct aspects of the communities. The conclusion on the quality of these communities is based on the consensus among the strategies adopted for the structural evaluation, as well as on the comparison with communities detected by different methods and with their existing ground truths. In this way, our approach allows one to overcome biases in network data, detection algorithms and evaluation metrics, thus providing more consistent conclusions about the quality of the detected communities. Experiments conducted with several real and synthetic networks provided results that show the effectiveness of our approach.
△ Less
Submitted 21 September, 2019;
originally announced September 2019.
-
Improving Community Detection by Mining Social Interactions
Authors:
Jeancarlo Campos Leão,
Michele Amaral Brandão,
Pedro O. S. Vaz de Melo,
Alberto H. F. Laender
Abstract:
Social relationships can be divided into different classes based on the regularity with which they occur and the similarity among them. Thus, rare and somewhat similar relationships are random and cause noise in a social network, thus hiding the actual structure of the network and preventing an accurate analysis of it. In this context, in this paper we propose a process to handle social network da…
▽ More
Social relationships can be divided into different classes based on the regularity with which they occur and the similarity among them. Thus, rare and somewhat similar relationships are random and cause noise in a social network, thus hiding the actual structure of the network and preventing an accurate analysis of it. In this context, in this paper we propose a process to handle social network data that exploits temporal features to improve the detection of communities by existing algorithms. By removing random interactions, we observe that social networks converge to a topology with more purely social relationships and more modular communities.
△ Less
Submitted 4 October, 2018; v1 submitted 3 October, 2018;
originally announced October 2018.
-
Building the Brazilian Academic Genealogy Tree
Authors:
Wellington Dores,
Elias Soares,
Fabrício Benevenuto,
Alberto H. F. Laender
Abstract:
Along the history, many researchers provided remarkable contributions to science, not only advancing knowledge but also in terms of mentoring new scientists. Currently, identifying and studying the formation of researchers over the years is a challenging task as current repositories of theses and dissertations are cataloged in a decentralized way through many local digital libraries. Following our…
▽ More
Along the history, many researchers provided remarkable contributions to science, not only advancing knowledge but also in terms of mentoring new scientists. Currently, identifying and studying the formation of researchers over the years is a challenging task as current repositories of theses and dissertations are cataloged in a decentralized way through many local digital libraries. Following our previous work in which we created and analyzed a large collection of genealogy trees extracted from NDLTD, in this paper we focus our attention on building such trees for the Brazilian research community. For this, we use data from the Lattes Platform, an internationally renowned initiative from CNPq, the Brazilian National Council for Scientific and Technological Development, for managing information about individual researchers and research groups in Brazil.
△ Less
Submitted 27 December, 2017;
originally announced December 2017.
-
The H-index Paradox: Your Coauthors Have a Higher H-index than You Do
Authors:
Fabrício Benevenuto,
Alberto H. F. Laender,
Bruno L. Alves
Abstract:
One interesting phenomenon that emerges from the typical structure of social networks is the friendship paradox. It states that your friends have on average more friends than you do. Recent efforts have explored variations of it, with numerous implications for the dynamics of social networks. However, the friendship paradox and its variations consider only the topological structure of the networks…
▽ More
One interesting phenomenon that emerges from the typical structure of social networks is the friendship paradox. It states that your friends have on average more friends than you do. Recent efforts have explored variations of it, with numerous implications for the dynamics of social networks. However, the friendship paradox and its variations consider only the topological structure of the networks and neglect many other characteristics that are correlated with node degree. In this article, we take the case of scientific collaborations to investigate whether a similar paradox also arises in terms of a researcher's scientific productivity as measured by her H-index. The H-index is a widely used metric in academia to capture both the quality and the quantity of a researcher's scientific output. It is likely that a researcher may use her coauthors' H-indexes as a way to infer whether her own H-index is adequate in her research area. Nevertheless, in this article, we show that the average H-index of a researcher's coauthors is usually higher than her own H-index. We present empirical evidence of this paradox and discuss some of its potential consequences.
△ Less
Submitted 19 October, 2015; v1 submitted 15 October, 2015;
originally announced October 2015.