-
Investigating dynamics and asymptotic trend to equilibrium in a reactive BGK model
Authors:
Giorgio Martalò,
Ana Jacinta Soares,
Romina Travaglini
Abstract:
We investigate numerically a recent BGK-type model for a multi-component mixture of monatomic gases, undergoing a reversible bimolecular chemical reaction. The model replaces each collisional term of the Boltzmann equation with a relaxation term, thereby describing separately the effects of the mechanical processes and the chemical reaction. Additionally, the model exhibits consistency properties.…
▽ More
We investigate numerically a recent BGK-type model for a multi-component mixture of monatomic gases, undergoing a reversible bimolecular chemical reaction. The model replaces each collisional term of the Boltzmann equation with a relaxation term, thereby describing separately the effects of the mechanical processes and the chemical reaction. Additionally, the model exhibits consistency properties. The correct entropy production is ensured when auxiliary temperatures in the chemical contributions share a common value. We assume isotropic distributions and perform numerical simulations for the macroscopic fields to appraise how the dynamics push the mixture toward thermalization and chemical equilibrium. We show that the hypothesis on the equalization of fictitious species temperatures is justifiable to ensure the monotonicity of the classical $H$-Boltzmann functional. Simulations show that, when initial temperatures are far from equilibrium, the relaxation towards equilibrium occurs at a later stage and the classical $H$-Boltzmann functional is not monotone during the initial transient.
△ Less
Submitted 9 July, 2024;
originally announced July 2024.
-
A non-conservative kinetic framework for a closed-market society subject to shock events
Authors:
Marco Menale,
Ana Jacinta Soares,
Romina Travaglini
Abstract:
Recently, several events have shockingly impacted society, carrying tough consequences. This was evident in the recent COVID-19 pandemic. However, not all individuals are affected by shock events in the same way. Among other factors, the consequences can vary depending on the wealth class. In our presented work, the approach typical of kinetic theory is used to analyze the dynamics of a closed-mar…
▽ More
Recently, several events have shockingly impacted society, carrying tough consequences. This was evident in the recent COVID-19 pandemic. However, not all individuals are affected by shock events in the same way. Among other factors, the consequences can vary depending on the wealth class. In our presented work, the approach typical of kinetic theory is used to analyze the dynamics of a closed-market society exposed to various types of shock events. To achieve this, we introduce non-conservative equations, incorporating proliferative and destructive binary interactions as well as external actions. Specifically, the latter term reproduces the shock events, and to accomplish this, we introduce an appropriate external force field into the kinetic framework, modeled using Gaussian functions. Several numerical simulations exploring different scenarios are presented to illustrate the behavior of the solution predicted by the model and to gain some insights when complex situations are investigated.
△ Less
Submitted 9 July, 2024;
originally announced July 2024.
-
A model for slowing particles in random media
Authors:
François Golse,
Valeria Ricci,
Ana Jacinta Soares
Abstract:
We present a simple model in dimension $d\geq 2$ for slowing particles in random media, where point particles move in straight lines among and inside spherical identical obstacles with Poisson distributed centres. When crossing an obstacle, a particle is slowed down according to the law $\dot{V}= -\fracκε S(|V|) V$, where $V$ is the velocity of the point particle, $κ$ is a positive constant, $ε$ i…
▽ More
We present a simple model in dimension $d\geq 2$ for slowing particles in random media, where point particles move in straight lines among and inside spherical identical obstacles with Poisson distributed centres. When crossing an obstacle, a particle is slowed down according to the law $\dot{V}= -\fracκε S(|V|) V$, where $V$ is the velocity of the point particle, $κ$ is a positive constant, $ε$ is the radius of the obstacle and $S(|V|)$ is a given slowing profile. With this choice, the slowing rate in the obstacles is such that the variation of speed at each crossing is of order $1$. We study the asymptotic limit of the particle system when $ε$ vanishes and the mean free path of the point particles stays finite. We prove the convergence of the point particles density measure to the solution of a kinetic-like equation with a collision term which includes a contribution proportional to a $δ$ function in $v=0$; this contribution guarantees the conservation of mass for the limit equation.
△ Less
Submitted 11 July, 2024; v1 submitted 9 June, 2024;
originally announced June 2024.
-
Cross Language Soccer Framework: An Open Source Framework for the RoboCup 2D Soccer Simulation
Authors:
Nader Zare,
Aref Sayareh,
Alireza Sadraii,
Arad Firouzkouhi,
Amilcar Soares
Abstract:
RoboCup Soccer Simulation 2D (SS2D) research is hampered by the complexity of existing Cpp-based codes like Helios, Cyrus, and Gliders, which also suffer from limited integration with modern machine learning frameworks. This development paper introduces a transformative solution a gRPC-based, language-agnostic framework that seamlessly integrates with the high-performance Helios base code. This ap…
▽ More
RoboCup Soccer Simulation 2D (SS2D) research is hampered by the complexity of existing Cpp-based codes like Helios, Cyrus, and Gliders, which also suffer from limited integration with modern machine learning frameworks. This development paper introduces a transformative solution a gRPC-based, language-agnostic framework that seamlessly integrates with the high-performance Helios base code. This approach not only facilitates the use of diverse programming languages including CSharp, JavaScript, and Python but also maintains the computational efficiency critical for real time decision making in SS2D. By breaking down language barriers, our framework significantly enhances collaborative potential and flexibility, empowering researchers to innovate without the overhead of mastering or develo** extensive base codes. We invite the global research community to leverage and contribute to the Cross Language Soccer (CLS) framework, which is openly available under the MIT License, to drive forward the capabilities of multi-agent systems in soccer simulations.
△ Less
Submitted 8 June, 2024;
originally announced June 2024.
-
Enhancing Global Maritime Traffic Network Forecasting with Gravity-Inspired Deep Learning Models
Authors:
Ruixin Song,
Gabriel Spadon,
Ronald Pelot,
Stan Matwin,
Amilcar Soares
Abstract:
Aquatic non-indigenous species (NIS) pose significant threats to biodiversity, disrupting ecosystems and inflicting substantial economic damages across agriculture, forestry, and fisheries. Due to the fast growth of global trade and transportation networks, NIS has been introduced and spread unintentionally in new environments. This study develops a new physics-informed model to forecast maritime…
▽ More
Aquatic non-indigenous species (NIS) pose significant threats to biodiversity, disrupting ecosystems and inflicting substantial economic damages across agriculture, forestry, and fisheries. Due to the fast growth of global trade and transportation networks, NIS has been introduced and spread unintentionally in new environments. This study develops a new physics-informed model to forecast maritime ship** traffic between port regions worldwide. The predicted information provided by these models, in turn, is used as input for risk assessment of NIS spread through transportation networks to evaluate the capability of our solution. Inspired by the gravity model for international trades, our model considers various factors that influence the likelihood and impact of vessel activities, such as ship** flux density, distance between ports, trade flow, and centrality measures of transportation hubs. Accordingly, this paper introduces transformers to gravity models to rebuild the short- and long-term dependencies that make the risk analysis feasible. Thus, we introduce a physics-inspired framework that achieves an 89% binary accuracy for existing and non-existing trajectories and an 84.8% accuracy for the number of vessels flowing between key port areas, representing more than 10% improvement over the traditional deep-gravity model. Along these lines, this research contributes to a better understanding of NIS risk assessment. It allows policymakers, conservationists, and stakeholders to prioritize management actions by identifying high-risk invasion pathways. Besides, our model is versatile and can include new data sources, making it suitable for assessing international vessel traffic flow in a changing global landscape.
△ Less
Submitted 10 July, 2024; v1 submitted 23 January, 2024;
originally announced January 2024.
-
On the Klein-Gordon oscillators in Eddington-inspired Born-Infeld gravity global monopole spacetime and a Wu-Yang magnetic monopole
Authors:
Omar Mustafa,
Adriano R. Soares,
Carlos F. S. Pereira,
Ricardo L. L. Vitória
Abstract:
We consider Klein-Gordon (KG) particles in a global monopole (GM) spacetime within Eddington-inspired Born-Infeld gravity (EiBI-gravity) and in a Wu-Yang magnetic monopole (WYMM). We discuss a set of KG-oscillators in such spacetime settings. We propose a textbook power series expansion for the KG radial wave function that allows us to retrieve the exact energy levels for KG-oscillators in a GM sp…
▽ More
We consider Klein-Gordon (KG) particles in a global monopole (GM) spacetime within Eddington-inspired Born-Infeld gravity (EiBI-gravity) and in a Wu-Yang magnetic monopole (WYMM). We discuss a set of KG-oscillators in such spacetime settings. We propose a textbook power series expansion for the KG radial wave function that allows us to retrieve the exact energy levels for KG-oscillators in a GM spacetime and a WYMM without EiBI-gravity. We, moreover, report some \textit{conditionally exact}, closed form, energy levels (through some parametric correlations) for KG-oscillators in a GM spacetime and a WYMM within EiBI-gravity, and for massless KG-oscillators in a GM spacetime and a WYMM within EiBI-gravity under the influence of a Coulomb plus linear Lorentz scalar potential. We study and discuss the effects of the Eddington parameter $κ$, GM-parameter $α$, WYMM strength $σ$, KG-oscillators' frequency $Ω$, and the coupling parameters of the Coulomb plus linear Lorentz scalar potential, on the spectroscopic structure of the KG-oscillators at hand. Such effects are studied over a vast range of the radial quantum number $n_r\geq 0$ and include energy levels clustering at $κ>>1$ (i.e., extreme EiBI-gravity), and at $|σ|>>1$ (i.e., extreme WYMM strength).
△ Less
Submitted 17 January, 2024;
originally announced January 2024.
-
Lie symmetry analysis for fractional evolution equation with $ψ$-Riemann-Liouville derivative
Authors:
Junior C. A. Soares,
Felix S. Costa,
J. Vanterler C. Sousa,
Maria V. S. Sousa,
Amália R. E. Pereira
Abstract:
We present the applycation of theory of Lie group analysis with $ψ$-Riemann-Liouville fractional derivative detailing the construction of infinitesimal prolongation to obtain Lie symmetries. In additional, is addressed the invariance condition without the need to impose that the lower limit of fractional integral is fixed. We find an expression that expands the knowledge regarding the study of exa…
▽ More
We present the applycation of theory of Lie group analysis with $ψ$-Riemann-Liouville fractional derivative detailing the construction of infinitesimal prolongation to obtain Lie symmetries. In additional, is addressed the invariance condition without the need to impose that the lower limit of fractional integral is fixed. We find an expression that expands the knowledge regarding the study of exact solutions for fractional differential equations. We use of the framework developed in \cite{zaky2022note} to present our understanding of the extension of $ψ$-Riemann-Liouville fractional derivative. It is demonstrate the Leibniz type rule for the derivative operator in question for built the prolongation. At last, we calculate the Lie symmetries of the generalized Burgers equation and fractional porous medium equation.
△ Less
Submitted 20 November, 2023;
originally announced January 2024.
-
Engineering Features to Improve Pass Prediction in Soccer Simulation 2D Games
Authors:
Nader Zare,
Mahtab Sarvmaili,
Aref Sayareh,
Omid Amini,
Stan Matwin Amilcar Soares
Abstract:
Soccer Simulation 2D (SS2D) is a simulation of a real soccer game in two dimensions. In soccer, passing behavior is an essential action for kee** the ball in possession of our team and creating goal opportunities. Similarly, for SS2D, predicting the passing behaviors of both opponents and our teammates helps manage resources and score more goals. Therefore, in this research, we have tried to add…
▽ More
Soccer Simulation 2D (SS2D) is a simulation of a real soccer game in two dimensions. In soccer, passing behavior is an essential action for kee** the ball in possession of our team and creating goal opportunities. Similarly, for SS2D, predicting the passing behaviors of both opponents and our teammates helps manage resources and score more goals. Therefore, in this research, we have tried to address the modeling of passing behavior of soccer 2D players using Deep Neural Networks (DNN) and Random Forest (RF). We propose an embedded data extraction module that can record the decision-making of agents in an online format. Afterward, we apply four data sorting techniques for training data preparation. After, we evaluate the trained models' performance playing against 6 top teams of RoboCup 2019 that have distinctive playing strategies. Finally, we examine the importance of different feature groups on the prediction of a passing strategy. All results in each step of this work prove our suggested methodology's effectiveness and improve the performance of the pass prediction in Soccer Simulation 2D games ranging from 5\% (e.g., playing against the same team) to 10\% (e.g., playing against Robocup top teams).
△ Less
Submitted 7 January, 2024;
originally announced January 2024.
-
Improving Dribbling, Passing, and Marking Actions in Soccer Simulation 2D Games Using Machine Learning
Authors:
Nader Zare,
Omid Amini,
Aref Sayareh,
Mahtab Sarvmaili,
Arad Firouzkouhi,
Stan Matwin,
Amilcar Soares
Abstract:
The RoboCup competition was started in 1997, and is known as the oldest RoboCup league. The RoboCup 2D Soccer Simulation League is a stochastic, partially observable soccer environment in which 24 autonomous agents play on two opposing teams. In this paper, we detail the main strategies and functionalities of CYRUS, the RoboCup 2021 2D Soccer Simulation League champions. The new functionalities pr…
▽ More
The RoboCup competition was started in 1997, and is known as the oldest RoboCup league. The RoboCup 2D Soccer Simulation League is a stochastic, partially observable soccer environment in which 24 autonomous agents play on two opposing teams. In this paper, we detail the main strategies and functionalities of CYRUS, the RoboCup 2021 2D Soccer Simulation League champions. The new functionalities presented and discussed in this work are (i) Multi Action Dribble, (ii) Pass Prediction and (iii) Marking Decision. The Multi Action Dribbling strategy enabled CYRUS to succeed more often and to be safer when dribbling actions were performed during a game. The Pass Prediction enhanced our gameplay by predicting our teammate's passing behavior, anticipating and making our agents collaborate better towards scoring goals. Finally, the Marking Decision addressed the multi-agent matching problem to improve CYRUS defensive strategy by finding an optimal solution to mark opponents' players.
△ Less
Submitted 7 January, 2024;
originally announced January 2024.
-
Deep Learning Brasil at ABSAPT 2022: Portuguese Transformer Ensemble Approaches
Authors:
Juliana Resplande Santanna Gomes,
Eduardo Augusto Santos Garcia,
Adalberto Ferreira Barbosa Junior,
Ruan Chaves Rodrigues,
Diogo Fernandes Costa Silva,
Dyonnatan Ferreira Maia,
Nádia Félix Felipe da Silva,
Arlindo Rodrigues Galvão Filho,
Anderson da Silva Soares
Abstract:
Aspect-based Sentiment Analysis (ABSA) is a task whose objective is to classify the individual sentiment polarity of all entities, called aspects, in a sentence. The task is composed of two subtasks: Aspect Term Extraction (ATE), identify all aspect terms in a sentence; and Sentiment Orientation Extraction (SOE), given a sentence and its aspect terms, the task is to determine the sentiment polarit…
▽ More
Aspect-based Sentiment Analysis (ABSA) is a task whose objective is to classify the individual sentiment polarity of all entities, called aspects, in a sentence. The task is composed of two subtasks: Aspect Term Extraction (ATE), identify all aspect terms in a sentence; and Sentiment Orientation Extraction (SOE), given a sentence and its aspect terms, the task is to determine the sentiment polarity of each aspect term (positive, negative or neutral). This article presents we present our participation in Aspect-Based Sentiment Analysis in Portuguese (ABSAPT) 2022 at IberLEF 2022. We submitted the best performing systems, achieving new state-of-the-art results on both subtasks.
△ Less
Submitted 8 November, 2023;
originally announced November 2023.
-
Multi-Path Long-Term Vessel Trajectories Forecasting with Probabilistic Feature Fusion for Problem Shifting
Authors:
Gabriel Spadon,
Jay Kumar,
Derek Eden,
Josh van Berkel,
Tom Foster,
Amilcar Soares,
Ronan Fablet,
Stan Matwin,
Ronald Pelot
Abstract:
This paper addresses the challenge of boosting the precision of multi-path long-term vessel trajectory forecasting on engineered sequences of Automatic Identification System (AIS) data using feature fusion for problem shifting. We have developed a deep auto-encoder model and a phased framework approach to predict the next 12 hours of vessel trajectories using 1 to 3 hours of AIS data as input. To…
▽ More
This paper addresses the challenge of boosting the precision of multi-path long-term vessel trajectory forecasting on engineered sequences of Automatic Identification System (AIS) data using feature fusion for problem shifting. We have developed a deep auto-encoder model and a phased framework approach to predict the next 12 hours of vessel trajectories using 1 to 3 hours of AIS data as input. To this end, we fuse the spatiotemporal features from the AIS messages with probabilistic features engineered from historical AIS data referring to potential routes and destinations. As a result, we reduce the forecasting uncertainty by shifting the problem into a trajectory reconstruction problem. The probabilistic features have an F1-Score of approximately 85% and 75% for the vessel route and destination prediction, respectively. Under such circumstances, we achieved an R2 Score of over 98% with different layer structures and varying feature combinations; the high R2 Score is a natural outcome of the well-defined ship** lanes in the study region. However, our proposal stands out among competing approaches as it demonstrates the capability of complex decision-making during turnings and route selection. Furthermore, we have shown that our model achieves more accurate forecasting with average and median errors of 11km and 6km, respectively, a 25% improvement from the current state-of-the-art approaches. The resulting model from this proposal is deployed as part of a broader Decision Support System to safeguard whales by preventing the risk of vessel-whale collisions under the smartWhales initiative and acting on the Gulf of St. Lawrence in Atlantic Canada.
△ Less
Submitted 10 July, 2024; v1 submitted 29 October, 2023;
originally announced October 2023.
-
Yin Yang Convolutional Nets: Image Manifold Extraction by the Analysis of Opposites
Authors:
Augusto Seben da Rosa,
Frederico Santos de Oliveira,
Anderson da Silva Soares,
Arnaldo Candido Junior
Abstract:
Computer vision in general presented several advances such as training optimizations, new architectures (pure attention, efficient block, vision language models, generative models, among others). This have improved performance in several tasks such as classification, and others. However, the majority of these models focus on modifications that are taking distance from realistic neuroscientific app…
▽ More
Computer vision in general presented several advances such as training optimizations, new architectures (pure attention, efficient block, vision language models, generative models, among others). This have improved performance in several tasks such as classification, and others. However, the majority of these models focus on modifications that are taking distance from realistic neuroscientific approaches related to the brain. In this work, we adopt a more bio-inspired approach and present the Yin Yang Convolutional Network, an architecture that extracts visual manifold, its blocks are intended to separate analysis of colors and forms at its initial layers, simulating occipital lobe's operations. Our results shows that our architecture provides State-of-the-Art efficiency among low parameter architectures in the dataset CIFAR-10. Our first model reached 93.32\% test accuracy, 0.8\% more than the older SOTA in this category, while having 150k less parameters (726k in total). Our second model uses 52k parameters, losing only 3.86\% test accuracy. We also performed an analysis on ImageNet, where we reached 66.49\% validation accuracy with 1.6M parameters. We make the code publicly available at: https://github.com/NoSavedDATA/YinYang_CNN.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
Denoising Opponents Position in Partial Observation Environment
Authors:
Aref Sayareh,
Aria Sardari,
Vahid Khoddami,
Nader Zare,
Vinicius Prado da Fonseca,
Amilcar Soares
Abstract:
The RoboCup competitions hold various leagues, and the Soccer Simulation 2D League is a major among them. Soccer Simulation 2D (SS2D) match involves two teams, including 11 players and a coach for each team, competing against each other. The players can only communicate with the Soccer Simulation Server during the game. Several code bases are released publicly to simplify team development. So rese…
▽ More
The RoboCup competitions hold various leagues, and the Soccer Simulation 2D League is a major among them. Soccer Simulation 2D (SS2D) match involves two teams, including 11 players and a coach for each team, competing against each other. The players can only communicate with the Soccer Simulation Server during the game. Several code bases are released publicly to simplify team development. So researchers can easily focus on decision-making and implementing machine learning methods. SS2D actions and behaviors are only partially accurate due to different challenges, such as noise and partial observation. Therefore, one strategy is to implement alternative denoising methods to tackle observation inaccuracy. Our idea is to predict opponent positions while they have yet to be seen in a finite number of cycles using machine learning methods to make more accurate actions such as pass. We will explain our position prediction idea powered by Long Short-Term Memory models (LSTM) and Deep Neural Networks (DNN). The results show that the LSTM and DNN predict the opponents' position more accurately than the standard algorithm, such as the last-seen method.
△ Less
Submitted 23 October, 2023;
originally announced October 2023.
-
On the Computational Complexities of Complex-valued Neural Networks
Authors:
Kayol Soares Mayer,
Jonathan Aguiar Soares,
Ariadne Arrais Cruz,
Dalton Soares Arantes
Abstract:
Complex-valued neural networks (CVNNs) are nonlinear filters used in the digital signal processing of complex-domain data. Compared with real-valued neural networks~(RVNNs), CVNNs can directly handle complex-valued input and output signals due to their complex domain parameters and activation functions. With the trend toward low-power systems, computational complexity analysis has become essential…
▽ More
Complex-valued neural networks (CVNNs) are nonlinear filters used in the digital signal processing of complex-domain data. Compared with real-valued neural networks~(RVNNs), CVNNs can directly handle complex-valued input and output signals due to their complex domain parameters and activation functions. With the trend toward low-power systems, computational complexity analysis has become essential for measuring an algorithm's power consumption. Therefore, this paper presents both the quantitative and asymptotic computational complexities of CVNNs. This is a crucial tool in deciding which algorithm to implement. The mathematical operations are described in terms of the number of real-valued multiplications, as these are the most demanding operations. To determine which CVNN can be implemented in a low-power system, quantitative computational complexities can be used to accurately estimate the number of floating-point operations. We have also investigated the computational complexities of CVNNs discussed in some studies presented in the literature.
△ Less
Submitted 19 October, 2023;
originally announced October 2023.
-
Holonomy corrected Schwarzschild black hole lensing
Authors:
A. R. Soares,
C. F. S. Pereira,
R. L. L. Vitória
Abstract:
In the present work, we theoretically investigate gravitational lensing in the spacetime of a holonomy corrected Schwarzschild black hole. Analytical expressions for the light deflection angle are obtained in both the weak field limit and the strong field limit. Furthermore, we analyze observables, such as relativistic images and magnifications, and compare the results with those expected in a Sch…
▽ More
In the present work, we theoretically investigate gravitational lensing in the spacetime of a holonomy corrected Schwarzschild black hole. Analytical expressions for the light deflection angle are obtained in both the weak field limit and the strong field limit. Furthermore, we analyze observables, such as relativistic images and magnifications, and compare the results with those expected in a Schwarzschild spacetime. We discuss the possibilities and difficulties of investigating such a solution in practice.
△ Less
Submitted 25 November, 2023; v1 submitted 10 September, 2023;
originally announced September 2023.
-
CVNN-based Channel Estimation and Equalization in OFDM Systems Without Cyclic Prefix
Authors:
Heitor dos Santos Sousa,
Jonathan Aguiar Soares,
Kayol Soares Mayer,
Dalton Soares Arantes
Abstract:
In modern communication systems operating with Orthogonal Frequency-Division Multiplexing (OFDM), channel estimation requires minimal complexity with one-tap equalizers. However, this depends on cyclic prefixes, which must be sufficiently large to cover the channel impulse response. Conversely, the use of cyclic prefix (CP) decreases the useful information that can be conveyed in an OFDM frame, th…
▽ More
In modern communication systems operating with Orthogonal Frequency-Division Multiplexing (OFDM), channel estimation requires minimal complexity with one-tap equalizers. However, this depends on cyclic prefixes, which must be sufficiently large to cover the channel impulse response. Conversely, the use of cyclic prefix (CP) decreases the useful information that can be conveyed in an OFDM frame, thereby degrading the spectral efficiency of the system. In this context, we study the impact of CPs on channel estimation with complex-valued neural networks (CVNNs). We show that the phase-transmittance radial basis function neural network offers superior results, in terms of required energy per bit, compared to classical minimum mean-squared error and least squares algorithms in scenarios without CP.
△ Less
Submitted 25 August, 2023;
originally announced August 2023.
-
Pyrus Base: An Open Source Python Framework for the RoboCup 2D Soccer Simulation
Authors:
Nader Zare,
Aref Sayareh,
Omid Amini,
Mahtab Sarvmaili,
Arad Firouzkouhi,
Stan Matwin,
Amilcar Soares
Abstract:
Soccer, also known as football in some parts of the world, involves two teams of eleven players whose objective is to score more goals than the opposing team. To simulate this game and attract scientists from all over the world to conduct research and participate in an annual computer-based soccer world cup, Soccer Simulation 2D (SS2D) was one of the leagues initiated in the RoboCup competition. I…
▽ More
Soccer, also known as football in some parts of the world, involves two teams of eleven players whose objective is to score more goals than the opposing team. To simulate this game and attract scientists from all over the world to conduct research and participate in an annual computer-based soccer world cup, Soccer Simulation 2D (SS2D) was one of the leagues initiated in the RoboCup competition. In every SS2D game, two teams of 11 players and one coach connect to the RoboCup Soccer Simulation Server and compete against each other. Over the past few years, several C++ base codes have been employed to control agents' behavior and their communication with the server. Although C++ base codes have laid the foundation for the SS2D, develo** them requires an advanced level of C++ programming. C++ language complexity is a limiting disadvantage of C++ base codes for all users, especially for beginners. To conquer the challenges of C++ base codes and provide a powerful baseline for develo** machine learning concepts, we introduce Pyrus, the first Python base code for SS2D. Pyrus is developed to encourage researchers to efficiently develop their ideas and integrate machine learning algorithms into their teams. Pyrus base is open-source code, and it is publicly available under MIT License on GitHub
△ Less
Submitted 21 July, 2023;
originally announced July 2023.
-
CML-TTS A Multilingual Dataset for Speech Synthesis in Low-Resource Languages
Authors:
Frederico S. Oliveira,
Edresson Casanova,
Arnaldo Cândido Júnior,
Anderson S. Soares,
Arlindo R. Galvão Filho
Abstract:
In this paper, we present CML-TTS, a recursive acronym for CML-Multi-Lingual-TTS, a new Text-to-Speech (TTS) dataset developed at the Center of Excellence in Artificial Intelligence (CEIA) of the Federal University of Goias (UFG). CML-TTS is based on Multilingual LibriSpeech (MLS) and adapted for training TTS models, consisting of audiobooks in seven languages: Dutch, French, German, Italian, Port…
▽ More
In this paper, we present CML-TTS, a recursive acronym for CML-Multi-Lingual-TTS, a new Text-to-Speech (TTS) dataset developed at the Center of Excellence in Artificial Intelligence (CEIA) of the Federal University of Goias (UFG). CML-TTS is based on Multilingual LibriSpeech (MLS) and adapted for training TTS models, consisting of audiobooks in seven languages: Dutch, French, German, Italian, Portuguese, Polish, and Spanish. Additionally, we provide the YourTTS model, a multi-lingual TTS model, trained using 3,176.13 hours from CML-TTS and also with 245.07 hours from LibriTTS, in English. Our purpose in creating this dataset is to open up new research possibilities in the TTS area for multi-lingual models. The dataset is publicly available under the CC-BY 4.0 license1.
△ Less
Submitted 16 June, 2023;
originally announced June 2023.
-
Evaluation of Speech Representations for MOS prediction
Authors:
Frederico S. Oliveira,
Edresson Casanova,
Arnaldo Cândido Júnior,
Lucas R. S. Gris,
Anderson S. Soares,
Arlindo R. Galvão Filho
Abstract:
In this paper, we evaluate feature extraction models for predicting speech quality. We also propose a model architecture to compare embeddings of supervised learning and self-supervised learning models with embeddings of speaker verification models to predict the metric MOS. Our experiments were performed on the VCC2018 dataset and a Brazilian-Portuguese dataset called BRSpeechMOS, which was creat…
▽ More
In this paper, we evaluate feature extraction models for predicting speech quality. We also propose a model architecture to compare embeddings of supervised learning and self-supervised learning models with embeddings of speaker verification models to predict the metric MOS. Our experiments were performed on the VCC2018 dataset and a Brazilian-Portuguese dataset called BRSpeechMOS, which was created for this work. The results show that the Whisper model is appropriate in all scenarios: with both the VCC2018 and BRSpeech- MOS datasets. Among the supervised and self-supervised learning models using BRSpeechMOS, Whisper-Small achieved the best linear correlation of 0.6980, and the speaker verification model, SpeakerNet, had linear correlation of 0.6963. Using VCC2018, the best supervised and self-supervised learning model, Whisper-Large, achieved linear correlation of 0.7274, and the best model speaker verification, TitaNet, achieved a linear correlation of 0.6933. Although the results of the speaker verification models are slightly lower, the SpeakerNet model has only 5M parameters, making it suitable for real-time applications, and the TitaNet model produces an embedding of size 192, the smallest among all the evaluated models. The experiment results are reproducible with publicly available source-code1 .
△ Less
Submitted 16 June, 2023;
originally announced June 2023.
-
Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person
Authors:
Lucas Rafael Stefanel Gris,
Ricardo Marcacini,
Arnaldo Candido Junior,
Edresson Casanova,
Anderson Soares,
Sandra Maria Aluísio
Abstract:
Automatic speech recognition (ASR) systems play a key role in applications involving human-machine interactions. Despite their importance, ASR models for the Portuguese language proposed in the last decade have limitations in relation to the correct identification of punctuation marks in automatic transcriptions, which hinder the use of transcriptions by other systems, models, and even by humans.…
▽ More
Automatic speech recognition (ASR) systems play a key role in applications involving human-machine interactions. Despite their importance, ASR models for the Portuguese language proposed in the last decade have limitations in relation to the correct identification of punctuation marks in automatic transcriptions, which hinder the use of transcriptions by other systems, models, and even by humans. However, recently Whisper ASR was proposed by OpenAI, a general-purpose speech recognition model that has generated great expectations in dealing with such limitations. This chapter presents the first study on the performance of Whisper for punctuation prediction in the Portuguese language. We present an experimental evaluation considering both theoretical aspects involving pausing points (comma) and complete ideas (exclamation, question, and fullstop), as well as practical aspects involving transcript-based topic modeling - an application dependent on punctuation marks for promising performance. We analyzed experimental results from videos of Museum of the Person, a virtual museum that aims to tell and preserve people's life histories, thus discussing the pros and cons of Whisper in a real-world scenario. Although our experiments indicate that Whisper achieves state-of-the-art results, we conclude that some punctuation marks require improvements, such as exclamation, semicolon and colon.
△ Less
Submitted 26 May, 2023; v1 submitted 23 May, 2023;
originally announced May 2023.
-
Gravitational lensing in a topologically charged Eddington-inspired Born-Infeld spacetime
Authors:
A. R. Soares,
R. L. L. Vitória,
C. F. S. Pereira
Abstract:
In the present paper, we study several aspects of gravitational lensing caused by a topologically charged Monopole/Wormhole, both in the weak field limit and in the strong field limit. We calculate the light deflection and then use it to determine the observables, with which one can investigate the existence of these objects through observational tools. We emphasize that the presence of the topolo…
▽ More
In the present paper, we study several aspects of gravitational lensing caused by a topologically charged Monopole/Wormhole, both in the weak field limit and in the strong field limit. We calculate the light deflection and then use it to determine the observables, with which one can investigate the existence of these objects through observational tools. We emphasize that the presence of the topological charge produces changes in the observables in relation to the case of General Relativity Ellis-Bronnikov wormhole.
△ Less
Submitted 28 August, 2023; v1 submitted 18 May, 2023;
originally announced May 2023.
-
Exponential Integrators for Phase-Field Equations using Pseudo-spectral Methods: A Python Implementation
Authors:
Elvis do A. Soares,
Amaro G. Barreto Jr.,
Frederico W. Tavares
Abstract:
In this paper, we implement exponential integrators, specifically Integrating Factor (IF) and Exponential Time Differencing (ETD) methods, using pseudo-spectral techniques to solve phase-field equations within a Python framework. These exponential integrators have showcased robust performance and accuracy when addressing stiff nonlinear partial differential equations. We compare these integrators…
▽ More
In this paper, we implement exponential integrators, specifically Integrating Factor (IF) and Exponential Time Differencing (ETD) methods, using pseudo-spectral techniques to solve phase-field equations within a Python framework. These exponential integrators have showcased robust performance and accuracy when addressing stiff nonlinear partial differential equations. We compare these integrators to the well-known implicit-explicit (IMEX) Euler integrators used in phase-field modeling. The synergy between pseudo-spectral techniques and exponential integrators yields significant benefits for modeling intricate systems governed by phase-field dynamics, such as solidification processes and pattern formation. Our comprehensive Python implementation illustrates the effectiveness of this combined approach in solving phase-field model equations. The results obtained from this implementation highlight the accuracy and computational advantages of the ETD method compared to other numerical techniques.
△ Less
Submitted 15 May, 2023;
originally announced May 2023.
-
Classical Density Functional Theory Reveals Structural Information of H2 and CH4 Fluids Adsorbed in MOF-5
Authors:
Elvis do A. Soares,
Amaro G. Barreto Jr.,
Frederico W. Tavares
Abstract:
This study employs classical Density Functional Theory (cDFT) to investigate the adsorption isotherms and structural information of H2 and CH4 fluids inside MOF-5. The results indicate that the adsorption of both fluids is highly dependent on the fluid temperature and the shape of the MOF-5 structure. Specifically, the CH4 molecules exhibit stronger interactions with the MOF-5 framework, resulting…
▽ More
This study employs classical Density Functional Theory (cDFT) to investigate the adsorption isotherms and structural information of H2 and CH4 fluids inside MOF-5. The results indicate that the adsorption of both fluids is highly dependent on the fluid temperature and the shape of the MOF-5 structure. Specifically, the CH4 molecules exhibit stronger interactions with the MOF-5 framework, resulting in a greater adsorbed quantity compared to H2. Additionally, the cDFT calculations reveal that the adsorption process is influenced by the fluid-fluid spatial correlations between the fluid molecules and the external potential produced by the MOF-5 solid atoms. These findings are supported by comparison with experimental data of adsorbed amount and the structure factor of the adsorbed fluid inside the MOF-5. We demonstrate the importance of choosing the appropriate grid size in calculating the adsorption isotherm and the fluid structure factors within the MOF-5. Overall, this work provides valuable insights into the adsorption mechanism of H2 and CH4 in MOF-5, emphasizing the importance of considering the structural properties of the adsorbed fluids in MOFs for predicting and designing their gas storage capacity at different thermodynamic conditions.
△ Less
Submitted 5 July, 2023; v1 submitted 20 March, 2023;
originally announced March 2023.
-
Cyrus2D base: Source Code Base for RoboCup 2D Soccer Simulation League
Authors:
Nader Zare,
Omid Amini,
Aref Sayareh,
Mahtab Sarvmaili,
Arad Firouzkouhi,
Saba Ramezani Rad,
Stan Matwin,
Amilcar Soares
Abstract:
Soccer Simulation 2D League is one of the major leagues of RoboCup competitions. In a Soccer Simulation 2D (SS2D) game, two teams of 11 players and one coach compete against each other. Several base codes have been released for the RoboCup soccer simulation 2D (RCSS2D) community that have promoted the application of multi-agent and AI algorithms in this field. In this paper, we introduce "Cyrus2D…
▽ More
Soccer Simulation 2D League is one of the major leagues of RoboCup competitions. In a Soccer Simulation 2D (SS2D) game, two teams of 11 players and one coach compete against each other. Several base codes have been released for the RoboCup soccer simulation 2D (RCSS2D) community that have promoted the application of multi-agent and AI algorithms in this field. In this paper, we introduce "Cyrus2D Base", which is derived from the base code of the RCSS2D 2021 champion. We merged Gliders2D base V2.6 with the newest version of the Helios base. We applied several features of Cyrus2021 to improve the performance and capabilities of this base alongside a Data Extractor to facilitate the implementation of machine learning in the field. We have tested this base code in different teams and scenarios, and the obtained results demonstrate significant improvements in the defensive and offensive strategy of the team.
△ Less
Submitted 15 November, 2022;
originally announced November 2022.
-
Development of a Cobalt Electrochemical Sensor for Measuring Phosphate in Municipal Wastewaters
Authors:
Saif S. S. Al Wahaibi,
Benjamin D. Martin,
Ana Soares
Abstract:
The introduction of the Water Framework directive sets stringent limits on phosphorous discharge from wastewater treatment plants to maintain the complex interdependent relationship between water tributaries and the ecosystem. This paper studies a cobalt based electrochemical sensor for phosphate detection in wastewater. An evaluation of the sensors operational envelope, impact of pH, detection li…
▽ More
The introduction of the Water Framework directive sets stringent limits on phosphorous discharge from wastewater treatment plants to maintain the complex interdependent relationship between water tributaries and the ecosystem. This paper studies a cobalt based electrochemical sensor for phosphate detection in wastewater. An evaluation of the sensors operational envelope, impact of pH, detection limits, linearity of response, accuracy and reproducibility in a single ion solution was conducted. An indirect method was employed to assess the effect of all of these parameters; the parameter was kept constant, while the phosphate concertation was varied. Tests on real wastewater samples verified the effect of the interfering factors, as phosphate measurements from three different sampling points (influent, activated sludge mixed liquors and effluent) did not correlate favourably with measurements acquired from a specialised laboratory. The success of this sensor is probably dependent on the simultaneous measurement of, or the calibration for, interfering parameters. However, the former approach would most likely require additional probes to measure these interfering parameters and the latter would probably require a complex calibrating matrix to account for all the interfering parameters. Nonetheless, variations of such sensors reviewed in this paper and their encouraging results offer an optimistic field of improvement on the design of the sensor studied in this paper for it to be employed on real wastewater systems.
△ Less
Submitted 3 October, 2022;
originally announced October 2022.
-
PCA-based Channel Estimation for MIMO Communications
Authors:
Jonathan Aguiar Soares,
Kayol Soares Mayer,
Pedro Benevenuto Valadares,
Dalton Soares Arantes
Abstract:
In multiple-input multiple-output communications, channel estimation is paramount to keep base stations and users on track. This paper proposes a novel PCA-based-principal component analysis-channel estimation approach for MIMO orthogonal frequency division multiplexing systems. The channel frequency response is firstly estimated with the least squares method, and then PCA is used to filter only t…
▽ More
In multiple-input multiple-output communications, channel estimation is paramount to keep base stations and users on track. This paper proposes a novel PCA-based-principal component analysis-channel estimation approach for MIMO orthogonal frequency division multiplexing systems. The channel frequency response is firstly estimated with the least squares method, and then PCA is used to filter only the higher singular components of the channel impulse response, which is then converted back to the frequency domain. The proposed approach is compared with the MMSE, the minimum mean square error estimation, in terms of bit error rate versus Eb/N0.
△ Less
Submitted 27 September, 2022;
originally announced September 2022.
-
A semi-supervised methodology for fishing activity detection using the geometry behind the trajectory of multiple vessels
Authors:
Martha Dais Ferreira,
Gabriel Spadon,
Amilcar Soares,
Stan Matwin
Abstract:
Automatic Identification System (AIS) messages are useful for tracking vessel activity across oceans worldwide using radio links and satellite transceivers. Such data plays a significant role in tracking vessel activity and map** mobility patterns such as those found in fishing. Accordingly, this paper proposes a geometric-driven semi-supervised approach for fishing activity detection from AIS d…
▽ More
Automatic Identification System (AIS) messages are useful for tracking vessel activity across oceans worldwide using radio links and satellite transceivers. Such data plays a significant role in tracking vessel activity and map** mobility patterns such as those found in fishing. Accordingly, this paper proposes a geometric-driven semi-supervised approach for fishing activity detection from AIS data. Through the proposed methodology we show how to explore the information included in the messages to extract features describing the geometry of the vessel route. To this end, we leverage the unsupervised nature of cluster analysis to label the trajectory geometry highlighting the changes in the vessel's moving pattern which tends to indicate fishing activity. The labels obtained by the proposed unsupervised approach are used to detect fishing activities, which we approach as a time-series classification task. In this context, we propose a solution using recurrent neural networks on AIS data streams with roughly 87% of the overall $F$-score on the whole trajectories of 50 different unseen fishing vessels. Such results are accompanied by a broad benchmark study assessing the performance of different Recurrent Neural Network (RNN) architectures. In conclusion, this work contributes by proposing a thorough process that includes data preparation, labeling, data modeling, and model validation. Therefore, we present a novel solution for mobility pattern detection that relies upon unfolding the trajectory in time and observing their inherent geometry.
△ Less
Submitted 22 August, 2022; v1 submitted 12 July, 2022;
originally announced July 2022.
-
Fluorescence angiography classification in colorectal surgery -- A preliminary report
Authors:
Antonio S Soares,
Sophia Bano,
Neil T Clancy,
Laurence B Lovat,
Danail Stoyanov,
Manish Chand
Abstract:
Background: Fluorescence angiography has shown very promising results in reducing anastomotic leaks by allowing the surgeon to select optimally perfused tissue. However, subjective interpretation of the fluorescent signal still hinders broad application of the technique, as significant variation between different surgeons exists. Our aim is to develop an artificial intelligence algorithm to classi…
▽ More
Background: Fluorescence angiography has shown very promising results in reducing anastomotic leaks by allowing the surgeon to select optimally perfused tissue. However, subjective interpretation of the fluorescent signal still hinders broad application of the technique, as significant variation between different surgeons exists. Our aim is to develop an artificial intelligence algorithm to classify colonic tissue as 'perfused' or 'not perfused' based on intraoperative fluorescence angiography data.
Methods: A classification model with a Resnet architecture was trained on a dataset of fluorescence angiography videos of colorectal resections at a tertiary referral centre. Frames corresponding to fluorescent and non-fluorescent segments of colon were used to train a classification algorithm. Validation using frames from patients not used in the training set was performed, including both data collected using the same equipment and data collected using a different camera. Performance metrics were calculated, and saliency maps used to further analyse the output. A decision boundary was identified based on the tissue classification.
Results: A convolutional neural network was successfully trained on 1790 frames from 7 patients and validated in 24 frames from 14 patients. The accuracy on the training set was 100%, on the validation set was 80%. Recall and precision were respectively 100% and 100% on the training set and 68.8% and 91.7% on the validation set.
Conclusion: Automated classification of intraoperative fluorescence angiography with a high degree of accuracy is possible and allows automated decision boundary identification. This will enable surgeons to standardise the technique of fluorescence angiography. A web based app was made available to deploy the algorithm.
△ Less
Submitted 13 June, 2022;
originally announced June 2022.
-
CYRUS Soccer Simulation 2D Team Description Paper 2021
Authors:
Nader Zare,
Aref Sayareh,
Mahtab Sarvmaili,
Omid Amini,
Amilcar Soares,
Stan Matwin
Abstract:
In this report, we briefly present the technical procedure and simulation steps for the 2D soccer simulation of team Cyrus. We emphasize on this document on how the prediction of teammates' behavior is performed. In our proposed method, the agent receives the noisy inputs from the server, and predicts the ball holder full state behavior. Taking advantage of this approach for choosing the optimal v…
▽ More
In this report, we briefly present the technical procedure and simulation steps for the 2D soccer simulation of team Cyrus. We emphasize on this document on how the prediction of teammates' behavior is performed. In our proposed method, the agent receives the noisy inputs from the server, and predicts the ball holder full state behavior. Taking advantage of this approach for choosing the optimal view angle shows 11.30% improvement on the expected win rate.
△ Less
Submitted 5 June, 2022;
originally announced June 2022.
-
CYRUS Soccer Simulation 2D Team Description Paper 2022
Authors:
Nader Zare,
Arad Firouzkouhi,
Omid Amini,
Mahtab Sarvmaili,
Aref Sayareh,
Saba Ramezani Rad,
Stan Matwin,
Amilcar Soares
Abstract:
Soccer Simulation 2D League is one of the major leagues of RoboCup competitions. In a Soccer Simulation 2D (SS2D) game, two teams of 11 players and one coach compete against each other. The players are only allowed to communicate with the server that is called Soccer Simulation Server. This paper introduces the previous and current research of the CYRUS soccer simulation team, the champion of Robo…
▽ More
Soccer Simulation 2D League is one of the major leagues of RoboCup competitions. In a Soccer Simulation 2D (SS2D) game, two teams of 11 players and one coach compete against each other. The players are only allowed to communicate with the server that is called Soccer Simulation Server. This paper introduces the previous and current research of the CYRUS soccer simulation team, the champion of RoboCup 2021. We will present our idea about improving Unmarking Decisioning and Positioning by using Pass Prediction Deep Neural Network. Based on our experimental results, this idea proven to be effective on increasing the winning rate of Cyrus against opponents.
△ Less
Submitted 22 May, 2022;
originally announced May 2022.
-
ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion
Authors:
Edresson Casanova,
Christopher Shulby,
Alexander Korolev,
Arnaldo Candido Junior,
Anderson da Silva Soares,
Sandra Aluísio,
Moacir Antonelli Ponti
Abstract:
We explore cross-lingual multi-speaker speech synthesis and cross-lingual voice conversion applied to data augmentation for automatic speech recognition (ASR) systems in low/medium-resource scenarios. Through extensive experiments, we show that our approach permits the application of speech synthesis and voice conversion to improve ASR systems using only one target-language speaker during model tr…
▽ More
We explore cross-lingual multi-speaker speech synthesis and cross-lingual voice conversion applied to data augmentation for automatic speech recognition (ASR) systems in low/medium-resource scenarios. Through extensive experiments, we show that our approach permits the application of speech synthesis and voice conversion to improve ASR systems using only one target-language speaker during model training. We also managed to close the gap between ASR models trained with synthesized versus human speech compared to other works that use many speakers. Finally, we show that it is possible to obtain promising ASR training results with our data augmentation method using only a single real speaker in a target language.
△ Less
Submitted 20 May, 2023; v1 submitted 29 March, 2022;
originally announced April 2022.
-
Visualizing Energy Transfer Between Redox-Active Colloids
Authors:
Subing Qu,
Zihao Ou,
Yavuz Savsatli,
Lehan Yao,
Yu Cao,
Elena C. Montoto,
Hao Yu,
**gshu Hui,
Bo Li,
Julio A. N. T. Soares,
Lydia Kisley,
Brian Bailey,
Elizabeth A. Murphy,
Junsheng Liu,
Christopher M. Evans,
Charles M. Schroeder,
Joaquín Rodríguez-López,
Jeffrey S. Moore,
Qian Chen,
Paul V. Braun
Abstract:
Redox-based electrical conduction in nonconjugated polymers has been explored less than a decade, yet is already showing promise as a new concept for electrical energy transport. Here using monolayers and sub-monolayers of touching micron-sized redox active colloids (RAC) containing high densities of ethyl-viologen (EV) side groups, intercolloid redox-based electron transport was directly observed…
▽ More
Redox-based electrical conduction in nonconjugated polymers has been explored less than a decade, yet is already showing promise as a new concept for electrical energy transport. Here using monolayers and sub-monolayers of touching micron-sized redox active colloids (RAC) containing high densities of ethyl-viologen (EV) side groups, intercolloid redox-based electron transport was directly observed via fluorescence microscopy. This observation was enabled by the discovery that these RAC exhibit a highly non-linear electrofluorochromism which can be quantitatively coupled to the colloid redox state. By evaluating the quasi-Fickian nature of the charge transfer (CT) kinetics, the apparent CT diffusion coefficient DCT was extracted. Along with addressing more fundamental questions regarding energy transport in colloidal materials, this first real-time real-space imaging of energy transport within monolayers of redox-active colloids may provide insights into energy transfer in flow batteries, and enable design of new forms of conductive polymers for applications including organic electronics.
△ Less
Submitted 1 April, 2022;
originally announced April 2022.
-
Synthesis of Eu(HCOO)$_3$ and Eu(HCOO)$_{3}\cdot$(HCONH$_2$)$_2$ crystals and observation of their $^5$D$_{0}\rightarrow ^{7}$F$_0$ transition for quantum information systems
Authors:
Zachary W. Riedel,
Donny R. Pearson Jr.,
Manohar H. Karigerasi,
Julio A. N. T. Soares,
Elizabeth A. Goldschmidt,
Daniel P. Shoemaker
Abstract:
Two stoichiometric metal-organic frameworks containing Eu$^{3+}$ cations are probed as candidates for photon-based quantum information storage. Synthesis procedures for growing 0.2 mm, rod-shaped Eu(HCOO)$_3$ and 1-3 mm, rhombohedral Eu(HCOO)$_{3}\cdot$(HCONH$_2$)$_2$ single crystals are presented with visible precipitation as soon as 1 h into heating for Eu(HCOO)$_3$ and 24 h for Eu(HCOO)…
▽ More
Two stoichiometric metal-organic frameworks containing Eu$^{3+}$ cations are probed as candidates for photon-based quantum information storage. Synthesis procedures for growing 0.2 mm, rod-shaped Eu(HCOO)$_3$ and 1-3 mm, rhombohedral Eu(HCOO)$_{3}\cdot$(HCONH$_2$)$_2$ single crystals are presented with visible precipitation as soon as 1 h into heating for Eu(HCOO)$_3$ and 24 h for Eu(HCOO)$_{3}\cdot$(HCONH$_2$)$_2$. Room temperature and 1.4 K photoluminescence measurements of the $^5$D$_{0}\rightarrow {^7}$F$_J$ transitions of Eu$^{3+}$ are analyzed for both compounds. Comparisons of peak width and intensity are discussed along with the notable first report for both of the $^5$D$_{0}\rightarrow {^7}$F$_0$ transition, the hyperfine structure of which has potential use in quantum memory applications. The air instability of Eu(HCOO)$_{3}\cdot$(HCONH$_2$)$_2$ and the transformation of its photoluminescence properties are discussed.
△ Less
Submitted 6 May, 2022; v1 submitted 8 March, 2022;
originally announced March 2022.
-
Unfolding AIS transmission behavior for vessel movement modeling on noisy data leveraging machine learning
Authors:
Gabriel Spadon,
Martha D. Ferreira,
Amilcar Soares,
Stan Matwin
Abstract:
The oceans are a source of an impressive mixture of complex data that could be used to uncover relationships yet to be discovered. Such data comes from the oceans and their surface, such as Automatic Identification System (AIS) messages used for tracking vessels' trajectories. AIS messages are transmitted over radio or satellite at ideally periodic time intervals but vary irregularly over time. As…
▽ More
The oceans are a source of an impressive mixture of complex data that could be used to uncover relationships yet to be discovered. Such data comes from the oceans and their surface, such as Automatic Identification System (AIS) messages used for tracking vessels' trajectories. AIS messages are transmitted over radio or satellite at ideally periodic time intervals but vary irregularly over time. As such, this paper aims to model the AIS message transmission behavior through neural networks for forecasting upcoming AIS messages' content from multiple vessels, particularly in a simultaneous approach despite messages' temporal irregularities as outliers. We present a set of experiments comprising multiple algorithms for forecasting tasks with horizon sizes of varying lengths. Deep learning models (e.g., neural networks) revealed themselves to adequately preserve vessels' spatial awareness regardless of temporal irregularity. We show how convolutional layers, feed-forward networks, and recurrent neural networks can improve such tasks by working together. Experimenting with short, medium, and large-sized sequences of messages, our model achieved 36/37/38% of the Relative Percentage Difference - the lower, the better, whereas we observed 92/45/96% on the Elman's RNN, 51/52/40% on the GRU, and 129/98/61% on the LSTM. These results support our model as a driver for improving the prediction of vessel routes when analyzing multiple vessels of diverging types simultaneously under temporally noise data.
△ Less
Submitted 5 July, 2022; v1 submitted 24 February, 2022;
originally announced February 2022.
-
Predição da Idade Cerebral a partir de Imagens de Ressonância Magnética utilizando Redes Neurais Convolucionais
Authors:
Victor H. R. Oliveira,
Augusto Antunes,
Alexandre S. Soares,
Arthur D. Reys,
Robson Z. Júnior,
Saulo D. S. Pedro,
Danilo Silva
Abstract:
In this work, deep learning techniques for brain age prediction from magnetic resonance images are investigated, aiming to assist in the identification of biomarkers of the natural aging process. The identification of biomarkers is useful for detecting an early-stage neurodegenerative process, as well as for predicting age-related or non-age-related cognitive decline. Two techniques are implemente…
▽ More
In this work, deep learning techniques for brain age prediction from magnetic resonance images are investigated, aiming to assist in the identification of biomarkers of the natural aging process. The identification of biomarkers is useful for detecting an early-stage neurodegenerative process, as well as for predicting age-related or non-age-related cognitive decline. Two techniques are implemented and compared in this work: a 3D Convolutional Neural Network applied to the volumetric image and a 2D Convolutional Neural Network applied to slices from the axial plane, with subsequent fusion of individual predictions. The best result was obtained by the 2D model, which achieved a mean absolute error of 3.83 years.
--
Neste trabalho são investigadas técnicas de aprendizado profundo para a predição da idade cerebral a partir de imagens de ressonância magnética, visando auxiliar na identificação de biomarcadores do processo natural de envelhecimento. A identificação de biomarcadores é útil para a detecção de um processo neurodegenerativo em estágio inicial, além de possibilitar prever um declínio cognitivo relacionado ou não à idade. Duas técnicas são implementadas e comparadas neste trabalho: uma Rede Neural Convolucional 3D aplicada na imagem volumétrica e uma Rede Neural Convolucional 2D aplicada a fatias do plano axial, com posterior fusão das predições individuais. O melhor resultado foi obtido pelo modelo 2D, que alcançou um erro médio absoluto de 3.83 anos.
△ Less
Submitted 23 December, 2021;
originally announced December 2021.
-
Survey of Generative Methods for Social Media Analysis
Authors:
Stan Matwin,
Aristides Milios,
Paweł Prałat,
Amilcar Soares,
François Théberge
Abstract:
This survey draws a broad-stroke, panoramic picture of the State of the Art (SoTA) of the research in generative methods for the analysis of social media data. It fills a void, as the existing survey articles are either much narrower in their scope or are dated. We included two important aspects that currently gain importance in mining and modeling social media: dynamics and networks. Social dynam…
▽ More
This survey draws a broad-stroke, panoramic picture of the State of the Art (SoTA) of the research in generative methods for the analysis of social media data. It fills a void, as the existing survey articles are either much narrower in their scope or are dated. We included two important aspects that currently gain importance in mining and modeling social media: dynamics and networks. Social dynamics are important for understanding the spreading of influence or diseases, formation of friendships, the productivity of teams, etc. Networks, on the other hand, may capture various complex relationships providing additional insight and identifying important patterns that would otherwise go unnoticed.
△ Less
Submitted 13 December, 2021;
originally announced December 2021.
-
Exact modifications on a vacuum spacetime due to a gradient bumblebee field at its vacuum expectation value
Authors:
F. P. Poulis,
M. A. C. Soares
Abstract:
This work belongs to the context of the standard-model extension, in which a Lorentz symmetry violation is induced by a bumblebee field as it acquires a nonzero vacuum expectation value. The mathematical formulation of a generic bumblebee model and its associated dynamical equations are presented. Then, these equations are considered for the vacuum and a substantial simplification is performed for…
▽ More
This work belongs to the context of the standard-model extension, in which a Lorentz symmetry violation is induced by a bumblebee field as it acquires a nonzero vacuum expectation value. The mathematical formulation of a generic bumblebee model and its associated dynamical equations are presented. Then, these equations are considered for the vacuum and a substantial simplification is performed for the particular case of a gradient bumblebee field at its vacuum expectation value. After some further manipulation, a method to easily find solutions to the model is developed, in which the exact effect on the spacetime description due to the presence of this bumblebee field is explicitly provided. As some examples, the method is applied to determine the implications of the bumblebee field on the Schwarzschild spacetime and also on a rotating one. A previously published solution is recovered and some new ones are obtained. In the rotating situation, a simple solution is found which contains both the Kerr solution and the already published one as special cases. It is also shown its distinguished surfaces are still given by the same corresponding expressions for the Kerr solution. In conclusion, the mathematical improvement made is considered to be a significant contribution to the theory as a powerful tool to investigate its many aspects and consequences.
△ Less
Submitted 18 July, 2022; v1 submitted 7 December, 2021;
originally announced December 2021.
-
CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese
Authors:
Arnaldo Candido Junior,
Edresson Casanova,
Anderson Soares,
Frederico Santos de Oliveira,
Lucas Oliveira,
Ricardo Corso Fernandes Junior,
Daniel Peixoto Pinto da Silva,
Fernando Gorgulho Fayet,
Bruno Baldissera Carlotto,
Lucas Rafael Stefanel Gris,
Sandra Maria Aluísio
Abstract:
Automatic Speech recognition (ASR) is a complex and challenging task. In recent years, there have been significant advances in the area. In particular, for the Brazilian Portuguese (BP) language, there were about 376 hours public available for ASR task until the second half of 2020. With the release of new datasets in early 2021, this number increased to 574 hours. The existing resources, however,…
▽ More
Automatic Speech recognition (ASR) is a complex and challenging task. In recent years, there have been significant advances in the area. In particular, for the Brazilian Portuguese (BP) language, there were about 376 hours public available for ASR task until the second half of 2020. With the release of new datasets in early 2021, this number increased to 574 hours. The existing resources, however, are composed of audios containing only read and prepared speech. There is a lack of datasets including spontaneous speech, which are essential in different ASR applications. This paper presents CORAA (Corpus of Annotated Audios) v1. with 290.77 hours, a publicly available dataset for ASR in BP containing validated pairs (audio-transcription). CORAA also contains European Portuguese audios (4.69 hours). We also present a public ASR model based on Wav2Vec 2.0 XLSR-53 and fine-tuned over CORAA. Our model achieved a Word Error Rate of 24.18% on CORAA test set and 20.08% on Common Voice test set. When measuring the Character Error Rate, we obtained 11.02% and 6.34% for CORAA and Common Voice, respectively. CORAA corpora were assembled to both improve ASR models in BP with phenomena from spontaneous speech and motivate young researchers to start their studies on ASR for Portuguese. All the corpora are publicly available at https://github.com/nilc-nlp/CORAA under the CC BY-NC-ND 4.0 license.
△ Less
Submitted 18 November, 2021; v1 submitted 14 October, 2021;
originally announced October 2021.
-
PTRAIL -- A python package for parallel trajectory data preprocessing
Authors:
Salman Haidri,
Yaksh J. Haranwala,
Vania Bogorny,
Chiara Renso,
Vinicius Prado da Fonseca,
Amilcar Soares
Abstract:
Trajectory data represent a trace of an object that changes its position in space over time. This kind of data is complex to handle and analyze, since it is generally produced in huge quantities, often prone to errors generated by the geolocation device, human mishandling, or area coverage limitation. Therefore, there is a need for software specifically tailored to preprocess trajectory data. In t…
▽ More
Trajectory data represent a trace of an object that changes its position in space over time. This kind of data is complex to handle and analyze, since it is generally produced in huge quantities, often prone to errors generated by the geolocation device, human mishandling, or area coverage limitation. Therefore, there is a need for software specifically tailored to preprocess trajectory data. In this work we propose PTRAIL, a python package offering several trajectory preprocessing steps, including filtering, feature extraction, and interpolation. PTRAIL uses parallel computation and vectorization, being suitable for large datasets and fast compared to other python libraries.
△ Less
Submitted 26 August, 2021;
originally announced August 2021.
-
A Weakly Supervised Dataset of Fine-Grained Emotions in Portuguese
Authors:
Diogo Cortiz,
Jefferson O. Silva,
Newton Calegari,
Ana Luísa Freitas,
Ana Angélica Soares,
Carolina Botelho,
Gabriel Gaudencio Rêgo,
Waldir Sampaio,
Paulo Sergio Boggio
Abstract:
Affective Computing is the study of how computers can recognize, interpret and simulate human affects. Sentiment Analysis is a common task inNLP related to this topic, but it focuses only on emotion valence (positive, negative, neutral). An emerging approach in NLP is Emotion Recognition, which relies on fined-grained classification. This research describes an approach to create a lexical-based we…
▽ More
Affective Computing is the study of how computers can recognize, interpret and simulate human affects. Sentiment Analysis is a common task inNLP related to this topic, but it focuses only on emotion valence (positive, negative, neutral). An emerging approach in NLP is Emotion Recognition, which relies on fined-grained classification. This research describes an approach to create a lexical-based weakly supervised corpus for fine-grained emotion in Portuguese. We evaluated our dataset by fine-tuning a transformer-based language model (BERT) and validating it on a Gold Standard annotated validation set. Our results (F1-score=.64) suggest lexical-based weak supervision as an appropriate strategy for initial work in low resourced environment.
△ Less
Submitted 8 October, 2021; v1 submitted 17 August, 2021;
originally announced August 2021.
-
An Integrated Progressive Hedging and Benders Decomposition with Multiple Master Method to Solve the Brazilian Generation Expansion Problem
Authors:
Alessandro Soares,
Alexandre Street,
Tiago Andrade,
Joaquim Dias Garcia
Abstract:
This paper exploits the decomposition structure of the large-scale hydrothermal generation expansion planning problem with an integrated modified Benders Decomposition and Progressive Hedging approach. We consider detailed and realistic data from the Brazilian power system to represent hourly chronological constraints based on typical days per month and year. Also, we represent the multistage stoc…
▽ More
This paper exploits the decomposition structure of the large-scale hydrothermal generation expansion planning problem with an integrated modified Benders Decomposition and Progressive Hedging approach. We consider detailed and realistic data from the Brazilian power system to represent hourly chronological constraints based on typical days per month and year. Also, we represent the multistage stochastic nature of the optimal hydrothermal operational policy through co-optimized linear decision rules for individual reservoirs. Therefore, we ensure investment decisions compatible with a nonanticipative (implementable) operational policy. To solve the large-scale optimization problem, we propose an improved Benders Decomposition method with multiple instances of the master problem, each of which strengthened by primal cuts and new Benders cuts generated by each master's trial solution. Additionally, our new approach allows using Progressive Hedging penalization terms for accelerating the convergence of the method. We show that our method is 60\% faster than the benchmark. Finally, the consideration of a nonanticipative operational policy can save 7.64\% of the total cost (16.18\% of the investment costs) and significantly improve spot price profiles.
△ Less
Submitted 1 December, 2021; v1 submitted 6 August, 2021;
originally announced August 2021.
-
Brazilian Portuguese Speech Recognition Using Wav2vec 2.0
Authors:
Lucas Rafael Stefanel Gris,
Edresson Casanova,
Frederico Santos de Oliveira,
Anderson da Silva Soares,
Arnaldo Candido Junior
Abstract:
Deep learning techniques have been shown to be efficient in various tasks, especially in the development of speech recognition systems, that is, systems that aim to transcribe an audio sentence in a sequence of written words. Despite the progress in the area, speech recognition can still be considered difficult, especially for languages lacking available data, such as Brazilian Portuguese (BP). In…
▽ More
Deep learning techniques have been shown to be efficient in various tasks, especially in the development of speech recognition systems, that is, systems that aim to transcribe an audio sentence in a sequence of written words. Despite the progress in the area, speech recognition can still be considered difficult, especially for languages lacking available data, such as Brazilian Portuguese (BP). In this sense, this work presents the development of an public Automatic Speech Recognition (ASR) system using only open available audio data, from the fine-tuning of the Wav2vec 2.0 XLSR-53 model pre-trained in many languages, over BP data. The final model presents an average word error rate of 12.4% over 7 different datasets (10.5% when applying a language model). According to our knowledge, the obtained error is the lowest among open end-to-end (E2E) ASR models for BP.
△ Less
Submitted 22 December, 2021; v1 submitted 23 July, 2021;
originally announced July 2021.
-
Continuous Control with Deep Reinforcement Learning for Autonomous Vessels
Authors:
Nader Zare,
Bruno Brandoli,
Mahtab Sarvmaili,
Amilcar Soares,
Stan Matwin
Abstract:
Maritime autonomous transportation has played a crucial role in the globalization of the world economy. Deep Reinforcement Learning (DRL) has been applied to automatic path planning to simulate vessel collision avoidance situations in open seas. End-to-end approaches that learn complex map**s directly from the input have poor generalization to reach the targets in different environments. In this…
▽ More
Maritime autonomous transportation has played a crucial role in the globalization of the world economy. Deep Reinforcement Learning (DRL) has been applied to automatic path planning to simulate vessel collision avoidance situations in open seas. End-to-end approaches that learn complex map**s directly from the input have poor generalization to reach the targets in different environments. In this work, we present a new strategy called state-action rotation to improve agent's performance in unseen situations by rotating the obtained experience (state-action-state) and preserving them in the replay buffer. We designed our model based on Deep Deterministic Policy Gradient, local view maker, and planner. Our agent uses two deep Convolutional Neural Networks to estimate the policy and action-value functions. The proposed model was exhaustively trained and tested in maritime scenarios with real maps from cities such as Montreal and Halifax. Experimental results show that the state-action rotation on top of the CVN consistently improves the rate of arrival to a destination (RATD) by up 11.96% with respect to the Vessel Navigator with Planner and Local View (VNPLV), as well as it achieves superior performance in unseen map**s by up 30.82%. Our proposed approach exhibits advantages in terms of robustness when tested in a new environment, supporting the idea that generalization can be achieved by using state-action rotation.
△ Less
Submitted 26 June, 2021;
originally announced June 2021.
-
Modeling the geospatial evolution of COVID-19 using spatio-temporal convolutional sequence-to-sequence neural networks
Authors:
Mário Cardoso,
André Cavalheiro,
Alexandre Borges,
Ana F. Duarte,
Amílcar Soares,
Maria João Pereira,
Nuno J. Nunes,
Leonardo Azevedo,
Arlindo L. Oliveira
Abstract:
Europe was hit hard by the COVID-19 pandemic and Portugal was one of the most affected countries, having suffered three waves in the first twelve months. Approximately between Jan 19th and Feb 5th 2021 Portugal was the country in the world with the largest incidence rate, with 14-days incidence rates per 100,000 inhabitants in excess of 1000. Despite its importance, accurate prediction of the geos…
▽ More
Europe was hit hard by the COVID-19 pandemic and Portugal was one of the most affected countries, having suffered three waves in the first twelve months. Approximately between Jan 19th and Feb 5th 2021 Portugal was the country in the world with the largest incidence rate, with 14-days incidence rates per 100,000 inhabitants in excess of 1000. Despite its importance, accurate prediction of the geospatial evolution of COVID-19 remains a challenge, since existing analytical methods fail to capture the complex dynamics that result from both the contagion within a region and the spreading of the infection from infected neighboring regions.
We use a previously developed methodology and official municipality level data from the Portuguese Directorate-General for Health (DGS), relative to the first twelve months of the pandemic, to compute an estimate of the incidence rate in each location of mainland Portugal. The resulting sequence of incidence rate maps was then used as a gold standard to test the effectiveness of different approaches in the prediction of the spatial-temporal evolution of the incidence rate. Four different methods were tested: a simple cell level autoregressive moving average (ARMA) model, a cell level vector autoregressive (VAR) model, a municipality-by-municipality compartmental SIRD model followed by direct block sequential simulation and a convolutional sequence-to-sequence neural network model based on the STConvS2S architecture. We conclude that the convolutional sequence-to-sequence neural network is the best performing method, when predicting the medium-term future incidence rate, using the available information.
△ Less
Submitted 6 May, 2021;
originally announced May 2021.
-
SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model
Authors:
Edresson Casanova,
Christopher Shulby,
Eren Gölge,
Nicolas Michael Müller,
Frederico Santos de Oliveira,
Arnaldo Candido Junior,
Anderson da Silva Soares,
Sandra Maria Aluisio,
Moacir Antonelli Ponti
Abstract:
In this paper, we propose SC-GlowTTS: an efficient zero-shot multi-speaker text-to-speech model that improves similarity for speakers unseen during training. We propose a speaker-conditional architecture that explores a flow-based decoder that works in a zero-shot scenario. As text encoders, we explore a dilated residual convolutional-based encoder, gated convolutional-based encoder, and transform…
▽ More
In this paper, we propose SC-GlowTTS: an efficient zero-shot multi-speaker text-to-speech model that improves similarity for speakers unseen during training. We propose a speaker-conditional architecture that explores a flow-based decoder that works in a zero-shot scenario. As text encoders, we explore a dilated residual convolutional-based encoder, gated convolutional-based encoder, and transformer-based encoder. Additionally, we have shown that adjusting a GAN-based vocoder for the spectrograms predicted by the TTS model on the training dataset can significantly improve the similarity and speech quality for new speakers. Our model converges using only 11 speakers, reaching state-of-the-art results for similarity with new speakers, as well as high speech quality.
△ Less
Submitted 15 June, 2021; v1 submitted 2 April, 2021;
originally announced April 2021.
-
Hybrid Model with Time Modeling for Sequential Recommender Systems
Authors:
Marlesson R. O. Santana,
Anderson Soares
Abstract:
Deep learning based methods have been used successfully in recommender system problems. Approaches using recurrent neural networks, transformers, and attention mechanisms are useful to model users' long- and short-term preferences in sequential interactions. To explore different session-based recommendation solutions, Booking.com recently organized the WSDM WebTour 2021 Challenge, which aims to be…
▽ More
Deep learning based methods have been used successfully in recommender system problems. Approaches using recurrent neural networks, transformers, and attention mechanisms are useful to model users' long- and short-term preferences in sequential interactions. To explore different session-based recommendation solutions, Booking.com recently organized the WSDM WebTour 2021 Challenge, which aims to benchmark models to recommend the final city in a trip. This study presents our approach to this challenge. We conducted several experiments to test different state-of-the-art deep learning architectures for recommender systems. Further, we proposed some changes to Neural Attentive Recommendation Machine (NARM), adapted its architecture for the challenge objective, and implemented training approaches that can be used in any session-based model to improve accuracy. Our experimental result shows that the improved NARM outperforms all other state-of-the-art benchmark methods.
△ Less
Submitted 7 March, 2021;
originally announced March 2021.
-
PyEquIon: A Python Package For Automatic Speciation Calculations of Aqueous Electrolyte Solutions
Authors:
Caio Felippe Curitiba Marcellos,
Gerson Francisco da Silva Junior,
Elvis do Amaral Soares,
Fabio Ramos,
Amaro G. Barreto Jr
Abstract:
In several industrial applications, such as crystallization, pollution control, and flow assurance, an accurate understanding of the aqueous electrolyte solutions is crucial. Electrolyte equilibrium calculation contributes with the design and optimization of processes by providing important information, such as species concentration, solution pH and potential for solid formation. In this work, a p…
▽ More
In several industrial applications, such as crystallization, pollution control, and flow assurance, an accurate understanding of the aqueous electrolyte solutions is crucial. Electrolyte equilibrium calculation contributes with the design and optimization of processes by providing important information, such as species concentration, solution pH and potential for solid formation. In this work, a pure Python library distributed under BSD-3 license was developed for the calculation of aqueous electrolyte equilibrium. The package takes as inputs the feed components of a given solution, and it automatically identifies its composing ions and the chemical reactions involved to calculate equilibrium conditions. Moreover, there is no established electrolyte activity coefficient model for a broad range of operational conditions. Hence, in this package, built-in activity coefficient models are structured in a modular approach, so that the non-ideality calculation can be performed by a user provided function, which allows further research in the topic. The package can be used by researchers to readily identify the equilibrium reactions and possible solid phases in a user friendly language.
△ Less
Submitted 10 May, 2021; v1 submitted 18 January, 2021;
originally announced January 2021.
-
Strong gravitational lensing in a spacetime with topological charge within the Eddington-inspired Born-Infeld gravity
Authors:
C. Furtado,
J. R. Nascimento,
A. Yu. Petrov,
P. J. Porfírio,
A. R. Soares
Abstract:
In this work we calculate the angular deflection of light in the strong field limit in two spacetimes which were previously studied within the Eddington-inspired Born-Infeld gravity (EiBI), namely, a black hole and a wormhole, both with topological charge. We show that the presence of the parameters characterizing EiBI and the topological charge promote significant changes in the angular deflectio…
▽ More
In this work we calculate the angular deflection of light in the strong field limit in two spacetimes which were previously studied within the Eddington-inspired Born-Infeld gravity (EiBI), namely, a black hole and a wormhole, both with topological charge. We show that the presence of the parameters characterizing EiBI and the topological charge promote significant changes in the angular deflection of light with respect to that one obtained in Schwarzschild spacetime. Using the expression for angular deflection in the strong field limit, we calculate the position and magnification of the respective relativistic images.
△ Less
Submitted 24 October, 2020; v1 submitted 22 October, 2020;
originally announced October 2020.
-
MARS-Gym: A Gym framework to model, train, and evaluate Recommender Systems for Marketplaces
Authors:
Marlesson R. O. Santana,
Luckeciano C. Melo,
Fernando H. F. Camargo,
Bruno Brandão,
Anderson Soares,
Renan M. Oliveira,
Sandor Caetano
Abstract:
Recommender Systems are especially challenging for marketplaces since they must maximize user satisfaction while maintaining the healthiness and fairness of such ecosystems. In this context, we observed a lack of resources to design, train, and evaluate agents that learn by interacting within these environments. For this matter, we propose MARS-Gym, an open-source framework to empower researchers…
▽ More
Recommender Systems are especially challenging for marketplaces since they must maximize user satisfaction while maintaining the healthiness and fairness of such ecosystems. In this context, we observed a lack of resources to design, train, and evaluate agents that learn by interacting within these environments. For this matter, we propose MARS-Gym, an open-source framework to empower researchers and engineers to quickly build and evaluate Reinforcement Learning agents for recommendations in marketplaces. MARS-Gym addresses the whole development pipeline: data processing, model design and optimization, and multi-sided evaluation. We also provide the implementation of a diverse set of baseline agents, with a metrics-driven analysis of them in the Trivago marketplace dataset, to illustrate how to conduct a holistic assessment using the available metrics of recommendation, off-policy estimation, and fairness. With MARS-Gym, we expect to bridge the gap between academic research and production systems, as well as to facilitate the design of new algorithms and applications.
△ Less
Submitted 30 September, 2020;
originally announced October 2020.
-
Zero-Shot Heterogeneous Transfer Learning from Recommender Systems to Cold-Start Search Retrieval
Authors:
Tao Wu,
Ellie Ka-In Chio,
Heng-Tze Cheng,
Yu Du,
Steffen Rendle,
Dima Kuzmin,
Ritesh Agarwal,
Li Zhang,
John Anderson,
Sarvjeet Singh,
Tushar Chandra,
Ed H. Chi,
Wen Li,
Ankit Kumar,
Xiang Ma,
Alex Soares,
Nitin **dal,
Pei Cao
Abstract:
Many recent advances in neural information retrieval models, which predict top-K items given a query, learn directly from a large training set of (query, item) pairs. However, they are often insufficient when there are many previously unseen (query, item) combinations, often referred to as the cold start problem. Furthermore, the search system can be biased towards items that are frequently shown…
▽ More
Many recent advances in neural information retrieval models, which predict top-K items given a query, learn directly from a large training set of (query, item) pairs. However, they are often insufficient when there are many previously unseen (query, item) combinations, often referred to as the cold start problem. Furthermore, the search system can be biased towards items that are frequently shown to a query previously, also known as the 'rich get richer' (a.k.a. feedback loop) problem. In light of these problems, we observed that most online content platforms have both a search and a recommender system that, while having heterogeneous input spaces, can be connected through their common output item space and a shared semantic representation. In this paper, we propose a new Zero-Shot Heterogeneous Transfer Learning framework that transfers learned knowledge from the recommender system component to improve the search component of a content platform. First, it learns representations of items and their natural-language features by predicting (item, item) correlation graphs derived from the recommender system as an auxiliary task. Then, the learned representations are transferred to solve the target search retrieval task, performing query-to-item prediction without having seen any (query, item) pairs in training. We conduct online and offline experiments on one of the world's largest search and recommender systems from Google, and present the results and lessons learned. We demonstrate that the proposed approach can achieve high performance on offline search retrieval tasks, and more importantly, achieved significant improvements on relevance and user interactions over the highly-optimized production system in online experiments.
△ Less
Submitted 18 August, 2020; v1 submitted 6 August, 2020;
originally announced August 2020.