-
Science Tree: A Platform for Exploring the Brazilian Academic Genealogy
Authors:
João M. M. C. Cota,
Alberto H. F. Laender,
Raquel O. Prates
Abstract:
Identifying and studying the formation of researchers over the years is a challenging task, as the current repositories of theses and dissertations are cataloged in a decentralized manner in different digital libraries, many of them with limited scope. In this paper, we take a step forward towards building a large repository to record the Brazilian academic genealogy. For this, we collected data f…
▽ More
Identifying and studying the formation of researchers over the years is a challenging task, as the current repositories of theses and dissertations are cataloged in a decentralized manner in different digital libraries, many of them with limited scope. In this paper, we take a step forward towards building a large repository to record the Brazilian academic genealogy. For this, we collected data from the Lattes platform, an internationally recognized initiative that provides a repository of researchers' curricula maintained by the Brazilian National Council for Scientific and Technological Development (CNPq), and developed a user-oriented platform to generate the academic genealogy trees of Brazilian researchers from them, also providing additional data resulting from a series of analyses regarding the main properties of such trees. Our effort has identified interesting aspects related to the academic career of the Brazilian researchers, which highlight the importance of generating and cataloging their academic genealogy trees.
△ Less
Submitted 10 August, 2021;
originally announced August 2021.
-
Typed Graph Networks
Authors:
Marcelo O. R. Prates,
Pedro H. C. Avelar,
Henrique Lemos,
Marco Gori,
Luis Lamb
Abstract:
Recently, the deep learning community has given growing attention to neural architectures engineered to learn problems in relational domains. Convolutional Neural Networks employ parameter sharing over the image domain, tying the weights of neural connections on a grid topology and thus enforcing the learning of a number of convolutional kernels. By instantiating trainable neural modules and assem…
▽ More
Recently, the deep learning community has given growing attention to neural architectures engineered to learn problems in relational domains. Convolutional Neural Networks employ parameter sharing over the image domain, tying the weights of neural connections on a grid topology and thus enforcing the learning of a number of convolutional kernels. By instantiating trainable neural modules and assembling them in varied configurations (apart from grids), one can enforce parameter sharing over graphs, yielding models which can effectively be fed with relational data. In this context, vertices in a graph can be projected into a hyperdimensional real space and iteratively refined over many message-passing iterations in an end-to-end differentiable architecture. Architectures of this family have been referred to with several definitions in the literature, such as Graph Neural Networks, Message-passing Neural Networks, Relational Networks and Graph Networks. In this paper, we revisit the original Graph Neural Network model and show that it generalises many of the recent models, which in turn benefit from the insight of thinking about vertex \textbf{types}. To illustrate the generality of the original model, we present a Graph Neural Network formalisation, which partitions the vertices of a graph into a number of types. Each type represents an entity in the ontology of the problem one wants to learn. This allows - for instance - one to assign embeddings to edges, hyperedges, and any number of global attributes of the graph. As a companion to this paper we provide a Python/Tensorflow library to facilitate the development of such architectures, with which we instantiate the formalisation to reproduce a number of models proposed in the current literature.
△ Less
Submitted 24 February, 2019; v1 submitted 23 January, 2019;
originally announced January 2019.
-
Multitask Learning on Graph Neural Networks: Learning Multiple Graph Centrality Measures with a Unified Network
Authors:
Pedro H. C. Avelar,
Henrique Lemos,
Marcelo O. R. Prates,
Luis Lamb
Abstract:
The application of deep learning to symbolic domains remains an active research endeavour. Graph neural networks (GNN), consisting of trained neural modules which can be arranged in different topologies at run time, are sound alternatives to tackle relational problems which lend themselves to graph representations. In this paper, we show that GNNs are capable of multitask learning, which can be na…
▽ More
The application of deep learning to symbolic domains remains an active research endeavour. Graph neural networks (GNN), consisting of trained neural modules which can be arranged in different topologies at run time, are sound alternatives to tackle relational problems which lend themselves to graph representations. In this paper, we show that GNNs are capable of multitask learning, which can be naturally enforced by training the model to refine a single set of multidimensional embeddings $\in \mathbb{R}^d$ and decode them into multiple outputs by connecting MLPs at the end of the pipeline. We demonstrate the multitask learning capability of the model in the relevant relational problem of estimating network centrality measures, focusing primarily on producing rankings based on these measures, i.e. is vertex $v_1$ more central than vertex $v_2$ given centrality $c$?. We then show that a GNN can be trained to develop a \emph{lingua franca} of vertex embeddings from which all relevant information about any of the trained centrality measures can be decoded. The proposed model achieves $89\%$ accuracy on a test dataset of random instances with up to 128 vertices and is shown to generalise to larger problem sizes. The model is also shown to obtain reasonable accuracy on a dataset of real world instances with up to 4k vertices, vastly surpassing the sizes of the largest instances with which the model was trained ($n=128$). Finally, we believe that our contributions attest to the potential of GNNs in symbolic domains in general and in relational learning in particular.
△ Less
Submitted 28 November, 2019; v1 submitted 11 September, 2018;
originally announced September 2018.
-
Learning to Solve NP-Complete Problems - A Graph Neural Network for Decision TSP
Authors:
Marcelo O. R. Prates,
Pedro H. C. Avelar,
Henrique Lemos,
Luis Lamb,
Moshe Vardi
Abstract:
Graph Neural Networks (GNN) are a promising technique for bridging differential programming and combinatorial domains. GNNs employ trainable modules which can be assembled in different configurations that reflect the relational structure of each problem instance. In this paper, we show that GNNs can learn to solve, with very little supervision, the decision variant of the Traveling Salesperson Pro…
▽ More
Graph Neural Networks (GNN) are a promising technique for bridging differential programming and combinatorial domains. GNNs employ trainable modules which can be assembled in different configurations that reflect the relational structure of each problem instance. In this paper, we show that GNNs can learn to solve, with very little supervision, the decision variant of the Traveling Salesperson Problem (TSP), a highly relevant $\mathcal{NP}$-Complete problem. Our model is trained to function as an effective message-passing algorithm in which edges (embedded with their weights) communicate with vertices for a number of iterations after which the model is asked to decide whether a route with cost $<C$ exists. We show that such a network can be trained with sets of dual examples: given the optimal tour cost $C^{*}$, we produce one decision instance with target cost $x\%$ smaller and one with target cost $x\%$ larger than $C^{*}$. We were able to obtain $80\%$ accuracy training with $-2\%,+2\%$ deviations, and the same trained model can generalize for more relaxed deviations with increasing performance. We also show that the model is capable of generalizing for larger problem sizes. Finally, we provide a method for predicting the optimal route cost within $2\%$ deviation from the ground truth. In summary, our work shows that Graph Neural Networks are powerful enough to solve $\mathcal{NP}$-Complete problems which combine symbolic and numeric data.
△ Less
Submitted 16 November, 2018; v1 submitted 7 September, 2018;
originally announced September 2018.
-
Assessing Gender Bias in Machine Translation -- A Case Study with Google Translate
Authors:
Marcelo O. R. Prates,
Pedro H. C. Avelar,
Luis Lamb
Abstract:
Recently there has been a growing concern about machine bias, where trained statistical models grow to reflect controversial societal asymmetries, such as gender or racial bias. A significant number of AI tools have recently been suggested to be harmfully biased towards some minority, with reports of racist criminal behavior predictors, Iphone X failing to differentiate between two Asian people an…
▽ More
Recently there has been a growing concern about machine bias, where trained statistical models grow to reflect controversial societal asymmetries, such as gender or racial bias. A significant number of AI tools have recently been suggested to be harmfully biased towards some minority, with reports of racist criminal behavior predictors, Iphone X failing to differentiate between two Asian people and Google photos' mistakenly classifying black people as gorillas. Although a systematic study of such biases can be difficult, we believe that automated translation tools can be exploited through gender neutral languages to yield a window into the phenomenon of gender bias in AI.
In this paper, we start with a comprehensive list of job positions from the U.S. Bureau of Labor Statistics (BLS) and used it to build sentences in constructions like "He/She is an Engineer" in 12 different gender neutral languages such as Hungarian, Chinese, Yoruba, and several others. We translate these sentences into English using the Google Translate API, and collect statistics about the frequency of female, male and gender-neutral pronouns in the translated output. We show that GT exhibits a strong tendency towards male defaults, in particular for fields linked to unbalanced gender distribution such as STEM jobs. We ran these statistics against BLS' data for the frequency of female participation in each job position, showing that GT fails to reproduce a real-world distribution of female workers. We provide experimental evidence that even if one does not expect in principle a 50:50 pronominal gender distribution, GT yields male defaults much more frequently than what would be expected from demographic data alone.
We are hopeful that this work will ignite a debate about the need to augment current statistical translation tools with debiasing techniques which can already be found in the scientific literature.
△ Less
Submitted 11 March, 2019; v1 submitted 6 September, 2018;
originally announced September 2018.
-
Astrometric and photometric study of Dias 4, Dias 6 and other five open clusters using ground based and Gaia DR2 data
Authors:
Wilton S. Dias,
Hektor Monteiro,
Jacques R. D. Lépine,
R. Prates,
C. D. Gneiding,
M. Sacchi
Abstract:
We present a study of 7 southern open clusters based on UBVRI CCD photomety (Johnsons-Cousins system) and Gaia DR2 data. Dias 4, Dias 6 and four other clusters had UBVRI photometric observations determined for the first time. From the observational UBVRI data we obtained photometric membership probability estimates and, using the proper motions from the UCAC5 catalog, we also determined the kinema…
▽ More
We present a study of 7 southern open clusters based on UBVRI CCD photomety (Johnsons-Cousins system) and Gaia DR2 data. Dias 4, Dias 6 and four other clusters had UBVRI photometric observations determined for the first time. From the observational UBVRI data we obtained photometric membership probability estimates and, using the proper motions from the UCAC5 catalog, we also determined the kinematic membership. From Gaia DR2 astrometric data we determine the stellar membership using proper motions and parallaxes, taking into account the full covariance matrix. For both independent sets of data and membership we apply our non subjective multidimensional global optimization tool to fit theoretical isochrones to determine the distance, age, reddening, metallicity and binary fraction of the clusters. The results of the mean proper motions, distances and ages are in agreement, but the ones obtained from Gaia DR2 data are more precise in both membership selection and estimated parameters. In the case of NGC 6087, the Cepheid S Nor, member of the open cluster, was used to obtain an independent distance estimate, confirming the one determined by our fitting method. We also report a serendipitous discovery of two new clusters in the extended field near what was originally Dias 4.
△ Less
Submitted 24 August, 2018;
originally announced August 2018.
-
Kernel Cross-View Collaborative Representation based Classification for Person Re-Identification
Authors:
Raphael Prates,
William Robson Schwartz
Abstract:
Person re-identification aims at the maintenance of a global identity as a person moves among non-overlap** surveillance cameras. It is a hard task due to different illumination conditions, viewpoints and the small number of annotated individuals from each pair of cameras (small-sample-size problem). Collaborative Representation based Classification (CRC) has been employed successfully to addres…
▽ More
Person re-identification aims at the maintenance of a global identity as a person moves among non-overlap** surveillance cameras. It is a hard task due to different illumination conditions, viewpoints and the small number of annotated individuals from each pair of cameras (small-sample-size problem). Collaborative Representation based Classification (CRC) has been employed successfully to address the small-sample-size problem in computer vision. However, the original CRC formulation is not well-suited for person re-identification since it does not consider that probe and gallery samples are from different cameras. Furthermore, it is a linear model, while appearance changes caused by different camera conditions indicate a strong nonlinear transition between cameras. To overcome such limitations, we propose the Kernel Cross-View Collaborative Representation based Classification (Kernel X-CRC) that represents probe and gallery images by balancing representativeness and similarity nonlinearly. It assumes that a probe and its corresponding gallery image are represented with similar coding vectors using individuals from the training set. Experimental results demonstrate that our assumption is true when using a high-dimensional feature vector and becomes more compelling when dealing with a low-dimensional and discriminative representation computed using a common subspace learning method. We achieve state-of-the-art for rank-1 matching rates in two person re-identification datasets (PRID450S and GRID) and the second best results on VIPeR and CUHK01 datasets.
△ Less
Submitted 21 November, 2016;
originally announced November 2016.
-
He II $λ$4686 emission from the massive binary system in $η$ Car: constraints to the orbital elements and the nature of the periodic minima
Authors:
M. Teodoro,
A. Damineli,
B. Heathcote,
N. D. Richardson,
A. F. J. Moffat,
L. St-Jean,
C. Russell,
T. R. Gull,
T. I. Madura,
K. R. Pollard,
F. Walter,
A. Coimbra,
R. Prates,
E. Fernández-Lajús,
R. C. Gamen,
G. Hickel,
W. Henrique,
F. Navarete,
T. Andrade,
F. Jablonski,
P. Luckas,
M. Locke,
J. Powles,
T. Bohlsen,
R. Chini
, et al. (5 additional authors not shown)
Abstract:
η Carinae is an extremely massive binary system in which rapid spectrum variations occur near periastron. Most notably, near periastron the He II $λ4686$ line increases rapidly in strength, drops to a minimum value, then increases briefly before fading away. To understand this behavior, we conducted an intense spectroscopic monitoring of the He II $λ4686$ emission line across the 2014.6 periastron…
▽ More
η Carinae is an extremely massive binary system in which rapid spectrum variations occur near periastron. Most notably, near periastron the He II $λ4686$ line increases rapidly in strength, drops to a minimum value, then increases briefly before fading away. To understand this behavior, we conducted an intense spectroscopic monitoring of the He II $λ4686$ emission line across the 2014.6 periastron passage using ground- and space-based telescopes. Comparison with previous data confirmed the overall repeatability of EW(He II $λ4686$), the line radial velocities, and the timing of the minimum, though the strongest peak was systematically larger in 2014 than in 2009 by 26%. The EW(He II $λ4686$) variations, combined with other measurements, yield an orbital period $2022.7\pm0.3$ d. The observed variability of the EW(He II $λ4686$) was reproduced by a model in which the line flux primarily arises at the apex of the wind-wind collision and scales inversely with the square of the stellar separation, if we account for the excess emission as the companion star plunges into the hot inner layers of the primary's atmosphere, and including absorption from the disturbed primary wind between the source and the observer. This model constrains the orbital inclination to $135^\circ$-$153^\circ$, and the longitude of periastron to $234^\circ$-$252^\circ$. It also suggests that periastron passage occurred on $T_0 = 2456874.4\pm1.3$ d. Our model also reproduced EW(He II $λ4686$) variations from a polar view of the primary star as determined from the observed He II $λ4686$ emission scattered off the Homunculus nebula.
△ Less
Submitted 13 January, 2016;
originally announced January 2016.
-
Breaking the News: First Impressions Matter on Online News
Authors:
Julio Reis,
Fabrıcio Benevenuto,
Pedro O. S. Vaz de Melo,
Raquel Prates,
Haewoon Kwak,
Jisun An
Abstract:
A growing number of people are changing the way they consume news, replacing the traditional physical newspapers and magazines by their virtual online versions or/and weblogs. The interactivity and immediacy present in online news are changing the way news are being produced and exposed by media corporations. News websites have to create effective strategies to catch people's attention and attract…
▽ More
A growing number of people are changing the way they consume news, replacing the traditional physical newspapers and magazines by their virtual online versions or/and weblogs. The interactivity and immediacy present in online news are changing the way news are being produced and exposed by media corporations. News websites have to create effective strategies to catch people's attention and attract their clicks. In this paper we investigate possible strategies used by online news corporations in the design of their news headlines. We analyze the content of 69,907 headlines produced by four major global media corporations during a minimum of eight consecutive months in 2014. In order to discover strategies that could be used to attract clicks, we extracted features from the text of the news headlines related to the sentiment polarity of the headline. We discovered that the sentiment of the headline is strongly related to the popularity of the news and also with the dynamics of the posted comments on that particular news.
△ Less
Submitted 16 April, 2015; v1 submitted 26 March, 2015;
originally announced March 2015.
-
Brazilian License Plate Detection Using Histogram of Oriented Gradients and Sliding Windows
Authors:
R. F. Prates,
G. Cámara-Chávez,
William R. Schwartz,
D. Menotti
Abstract:
Due to the increasingly need for automatic traffic monitoring, vehicle license plate detection is of high interest to perform automatic toll collection, traffic law enforcement, parking lot access control, among others. In this paper, a sliding window approach based on Histogram of Oriented Gradients (HOG) features is used for Brazilian license plate detection. This approach consists in scanning t…
▽ More
Due to the increasingly need for automatic traffic monitoring, vehicle license plate detection is of high interest to perform automatic toll collection, traffic law enforcement, parking lot access control, among others. In this paper, a sliding window approach based on Histogram of Oriented Gradients (HOG) features is used for Brazilian license plate detection. This approach consists in scanning the whole image in a multiscale fashion such that the license plate is located precisely. The main contribution of this work consists in a deep study of the best setup for HOG descriptors on the detection of Brazilian license plates, in which HOG have never been applied before. We also demonstrate the reliability of this method ensured by a recall higher than 98% (with a precision higher than 78%) in a publicly available data set.
△ Less
Submitted 9 January, 2014;
originally announced January 2014.