-
Full Iso-recursive Types
Authors:
Litao Zhou,
Qianyong Wan,
Bruno C. d. S. Oliveira
Abstract:
There are two well-known formulations of recursive types: iso-recursive and equi-recursive types. Abadi and Fiore [1996] have shown that iso- and equi-recursive types have the same expressive power. However, their encoding of equi-recursive types in terms of iso-recursive types requires explicit coercions. These coercions come with significant additional computational overhead, and complicate reas…
▽ More
There are two well-known formulations of recursive types: iso-recursive and equi-recursive types. Abadi and Fiore [1996] have shown that iso- and equi-recursive types have the same expressive power. However, their encoding of equi-recursive types in terms of iso-recursive types requires explicit coercions. These coercions come with significant additional computational overhead, and complicate reasoning about the equivalence of the two formulations of recursive types.
This paper proposes a generalization of iso-recursive types called full iso-recursive types. Full iso-recursive types allow encoding all programs with equi-recursive types without computational overhead. Instead of explicit term coercions, all type transformations are captured by computationally irrelevant casts, which can be erased at runtime without affecting the semantics of the program. Consequently, reasoning about the equivalence between the two approaches can be greatly simplified. We present a calculus called $λ^μ_{Fi}$, which extends the simply typed lambda calculus (STLC) with full iso-recursive types. The $λ^μ_{Fi}$ calculus is proved to be type sound, and shown to have the same expressive power as a calculus with equi-recursive types. We also extend our results to subty**, and show that equi-recursive subty** can be expressed in terms of iso-recursive subty** with cast operators.
△ Less
Submitted 7 July, 2024; v1 submitted 30 June, 2024;
originally announced July 2024.
-
Index estimates for harmonic Gauss maps
Authors:
Alcides de Carvalho,
Marcos P. Cavalcante,
Wagner Costa-Filho,
Darlan de Oliveira
Abstract:
Let $Σ$ denote a closed surface with constant mean curvature in $\mathbb{G}^3$, a 3-dimensional Lie group equipped with a bi-invariant metric. For such surfaces, there is a harmonic Gauss map which maps values to the unit sphere within the Lie algebra of $\mathbb{G}$. We prove that the energy index of the Gauss map of $Σ$ is bounded below by its topological genus. We also obtain index estimates in…
▽ More
Let $Σ$ denote a closed surface with constant mean curvature in $\mathbb{G}^3$, a 3-dimensional Lie group equipped with a bi-invariant metric. For such surfaces, there is a harmonic Gauss map which maps values to the unit sphere within the Lie algebra of $\mathbb{G}$. We prove that the energy index of the Gauss map of $Σ$ is bounded below by its topological genus. We also obtain index estimates in the case of complete non compact surfaces.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement
Authors:
Danilo de Oliveira,
Simon Welker,
Julius Richter,
Timo Gerkmann
Abstract:
To obtain improved speech enhancement models, researchers often focus on increasing performance according to specific instrumental metrics. However, when the same metric is used in a loss function to optimize models, it may be detrimental to aspects that the given metric does not see. The goal of this paper is to illustrate the risk of overfitting a speech enhancement model to the metric used for…
▽ More
To obtain improved speech enhancement models, researchers often focus on increasing performance according to specific instrumental metrics. However, when the same metric is used in a loss function to optimize models, it may be detrimental to aspects that the given metric does not see. The goal of this paper is to illustrate the risk of overfitting a speech enhancement model to the metric used for evaluation. For this, we introduce enhancement models that exploit the widely used PESQ measure. Our "PESQetarian" model achieves 3.82 PESQ on VB-DMD while scoring very poorly in a listening experiment. While the obtained PESQ value of 3.82 would imply "state-of-the-art" PESQ-performance on the VB-DMD benchmark, our examples show that when optimizing w.r.t. a metric, an isolated evaluation on the same metric may be misleading. Instead, other metrics should be included in the evaluation and the resulting performance predictions should be confirmed by listening.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Story Generation from Visual Inputs: Techniques, Related Tasks, and Challenges
Authors:
Daniel A. P. Oliveira,
Eugénio Ribeiro,
David Martins de Matos
Abstract:
Creating engaging narratives from visual data is crucial for automated digital media consumption, assistive technologies, and interactive entertainment. This survey covers methodologies used in the generation of these narratives, focusing on their principles, strengths, and limitations.
The survey also covers tasks related to automatic story generation, such as image and video captioning, and vi…
▽ More
Creating engaging narratives from visual data is crucial for automated digital media consumption, assistive technologies, and interactive entertainment. This survey covers methodologies used in the generation of these narratives, focusing on their principles, strengths, and limitations.
The survey also covers tasks related to automatic story generation, such as image and video captioning, and visual question answering, as well as story generation without visual inputs. These tasks share common challenges with visual story generation and have served as inspiration for the techniques used in the field. We analyze the main datasets and evaluation metrics, providing a critical perspective on their limitations.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
Euclid. I. Overview of the Euclid mission
Authors:
Euclid Collaboration,
Y. Mellier,
Abdurro'uf,
J. A. Acevedo Barroso,
A. Achúcarro,
J. Adamek,
R. Adam,
G. E. Addison,
N. Aghanim,
M. Aguena,
V. Ajani,
Y. Akrami,
A. Al-Bahlawan,
A. Alavi,
I. S. Albuquerque,
G. Alestas,
G. Alguero,
A. Allaoui,
S. W. Allen,
V. Allevato,
A. V. Alonso-Tetilla,
B. Altieri,
A. Alvarez-Candal,
A. Amara,
L. Amendola
, et al. (1086 additional authors not shown)
Abstract:
The current standard model of cosmology successfully describes a variety of measurements, but the nature of its main ingredients, dark matter and dark energy, remains unknown. Euclid is a medium-class mission in the Cosmic Vision 2015-2025 programme of the European Space Agency (ESA) that will provide high-resolution optical imaging, as well as near-infrared imaging and spectroscopy, over about 14…
▽ More
The current standard model of cosmology successfully describes a variety of measurements, but the nature of its main ingredients, dark matter and dark energy, remains unknown. Euclid is a medium-class mission in the Cosmic Vision 2015-2025 programme of the European Space Agency (ESA) that will provide high-resolution optical imaging, as well as near-infrared imaging and spectroscopy, over about 14,000 deg^2 of extragalactic sky. In addition to accurate weak lensing and clustering measurements that probe structure formation over half of the age of the Universe, its primary probes for cosmology, these exquisite data will enable a wide range of science. This paper provides a high-level overview of the mission, summarising the survey characteristics, the various data-processing steps, and data products. We also highlight the main science objectives and expected performance.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
First direct observations of interplanetary shock impact angle effects on actual geomagnetically induced currents: The case of the Finnish natural gas pipeline system
Authors:
Denny M. Oliveira,
Eftyhia Zesta,
Sergio Vidal-Luengo
Abstract:
The impact of interplanetary (IP) shocks on the Earth's magnetosphere can greatly disturb the geomagnetic field and electric currents in the magnetosphere-ionosphere system. At high latitudes, the current systems most affected by the shocks are the auroral electrojet currents. These currents then generate ground geomagnetically induced currents (GICs) that couple with and are highly detrimental to…
▽ More
The impact of interplanetary (IP) shocks on the Earth's magnetosphere can greatly disturb the geomagnetic field and electric currents in the magnetosphere-ionosphere system. At high latitudes, the current systems most affected by the shocks are the auroral electrojet currents. These currents then generate ground geomagnetically induced currents (GICs) that couple with and are highly detrimental to ground artificial conductors including power transmission lines, oil/gas pipelines, railways, and submarine cables. Recent research has shown that the shock impact angle, the angle the shock normal vector performs with the Sun-Earth line, plays a major role in controlling the subsequent geomagnetic activity. More specifically, due to more symmetric magnetospheric compressions, nearly frontal shocks are usually more geoeffective than highly inclined shocks. In this study, we utilize a subset (332 events) of a shock list with more than 600 events to investigate, for the first time, shock impact angle effects on the subsequent GICs right after shock impact (compression effects) and several minutes after shock impact (substorm-like effects). We use GIC recordings from the Finnish natural gas pipeline performed near the Mäntsälä compression station in southern Finland. We find that GIC peaks (> 5 A) occurring after shock impacts are mostly caused by nearly frontal shocks and occur in the post-noon/dusk magnetic local time sector. These GIC peaks are presumably triggered by partial ring current intensifications in the dusk sector. On the other hand, more intense GIC peaks (> 20 A) generally occur several minutes after shock impacts and are located around the magnetic midnight terminator. These GIC peaks are most likely caused by intense energetic particle injections from the magnetotail which frequently occur during substorms.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Super-suppression of long wavelength phonons in constricted nanoporous geometries
Authors:
Alex Greaney,
S. Aria Hosseini,
Laura de Sousa Oliveira,
Alathea Davies,
Neophytos Neophytou
Abstract:
In a typical semiconductor material, the majority of heat is carried by long wavelength, long mean-free-path phonons. Nanostructuring strategies to reduce thermal conductivity, a promising direction in the field of thermoelectrics, place scattering centers of size and spatial separation comparable to the mean-free-paths of the dominant phonons to selectively scatter them. The resultant thermal con…
▽ More
In a typical semiconductor material, the majority of heat is carried by long wavelength, long mean-free-path phonons. Nanostructuring strategies to reduce thermal conductivity, a promising direction in the field of thermoelectrics, place scattering centers of size and spatial separation comparable to the mean-free-paths of the dominant phonons to selectively scatter them. The resultant thermal conductivity is in most cases well predicted using Matthiessens rule. In general, however, long wavelength phonons are not as effectively scattered as the rest of the phonon spectrum. In this work, using large-scale Molecular Dynamics simulations, Non-Equilibrium Greens Function simulations, and Monte Carlo simulations, we show that specific nanoporous geometries, which create narrow constrictions in the passage of phonons, lead to anticorrelated heat currents in the phonon spectrum. This results in super-suppression of long-wavelength phonons due to heat trap**, and reductions in the thermal conductivity well below what is predicted by Matthiessens rule.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Contrastive Pretraining for Visual Concept Explanations of Socioeconomic Outcomes
Authors:
Ivica Obadic,
Alex Levering,
Lars Pennig,
Dario Oliveira,
Diego Marcos,
Xiaoxiang Zhu
Abstract:
Predicting socioeconomic indicators from satellite imagery with deep learning has become an increasingly popular research direction. Post-hoc concept-based explanations can be an important step towards broader adoption of these models in policy-making as they enable the interpretation of socioeconomic outcomes based on visual concepts that are intuitive to humans. In this paper, we study the inter…
▽ More
Predicting socioeconomic indicators from satellite imagery with deep learning has become an increasingly popular research direction. Post-hoc concept-based explanations can be an important step towards broader adoption of these models in policy-making as they enable the interpretation of socioeconomic outcomes based on visual concepts that are intuitive to humans. In this paper, we study the interplay between representation learning using an additional task-specific contrastive loss and post-hoc concept explainability for socioeconomic studies. Our results on two different geographical locations and tasks indicate that the task-specific pretraining imposes a continuous ordering of the latent space embeddings according to the socioeconomic outcomes. This improves the model's interpretability as it enables the latent space of the model to associate concepts encoding typical urban and natural area patterns with continuous intervals of socioeconomic outcomes. Further, we illustrate how analyzing the model's conceptual sensitivity for the intervals of socioeconomic outcomes can shed light on new insights for urban studies.
△ Less
Submitted 13 June, 2024; v1 submitted 15 April, 2024;
originally announced April 2024.
-
ProvDeploy: Provenance-oriented Containerization of High Performance Computing Scientific Workflows
Authors:
Liliane Kunstmann,
Débora Pina,
Daniel de Oliveira,
Marta Mattoso
Abstract:
Many existing scientific workflows require High Performance Computing environments to produce results in a timely manner. These workflows have several software library components and use different environments, making the deployment and execution of the software stack not trivial. This complexity increases if the user needs to add provenance data capture services to the workflow. This manuscript i…
▽ More
Many existing scientific workflows require High Performance Computing environments to produce results in a timely manner. These workflows have several software library components and use different environments, making the deployment and execution of the software stack not trivial. This complexity increases if the user needs to add provenance data capture services to the workflow. This manuscript introduces ProvDeploy to assist the user in configuring containers for scientific workflows with integrated provenance data capture. ProvDeploy was evaluated with a Scientific Machine Learning workflow, exploring containerization strategies focused on provenance in two distinct HPC environments
△ Less
Submitted 25 March, 2024; v1 submitted 22 March, 2024;
originally announced March 2024.
-
Ontologia para monitorar a deficiência mental em seus déficts no processamento da informação por declínio cognitivo e evitar agressões psicológicas e físicas em ambientes educacionais com ajuda da I.A*
Authors:
Bruna Araújo de Castro Oliveira
Abstract:
The intention of this article is to propose the use of artificial intelligence to detect through analysis by UFO ontology the emergence of verbal and physical aggression related to psychosocial deficiencies and their provoking agents, in an attempt to prevent catastrophic consequences within school environments.
The intention of this article is to propose the use of artificial intelligence to detect through analysis by UFO ontology the emergence of verbal and physical aggression related to psychosocial deficiencies and their provoking agents, in an attempt to prevent catastrophic consequences within school environments.
△ Less
Submitted 31 January, 2024;
originally announced March 2024.
-
Opening the Black-Box: A Systematic Review on Explainable AI in Remote Sensing
Authors:
Adrian Höhl,
Ivica Obadic,
Miguel Ángel Fernández Torres,
Hiba Najjar,
Dario Oliveira,
Zeynep Akata,
Andreas Dengel,
Xiao Xiang Zhu
Abstract:
In recent years, black-box machine learning approaches have become a dominant modeling paradigm for knowledge extraction in Remote Sensing. Despite the potential benefits of uncovering the inner workings of these models with explainable AI, a comprehensive overview summarizing the used explainable AI methods and their objectives, findings, and challenges in Remote Sensing applications is still mis…
▽ More
In recent years, black-box machine learning approaches have become a dominant modeling paradigm for knowledge extraction in Remote Sensing. Despite the potential benefits of uncovering the inner workings of these models with explainable AI, a comprehensive overview summarizing the used explainable AI methods and their objectives, findings, and challenges in Remote Sensing applications is still missing. In this paper, we address this issue by performing a systematic review to identify the key trends of how explainable AI is used in Remote Sensing and shed light on novel explainable AI approaches and emerging directions that tackle specific Remote Sensing challenges. We also reveal the common patterns of explanation interpretation, discuss the extracted scientific insights in Remote Sensing, and reflect on the approaches used for explainable AI methods evaluation. Our review provides a complete summary of the state-of-the-art in the field. Further, we give a detailed outlook on the challenges and promising research directions, representing a basis for novel methodological development and a useful starting point for new researchers in the field of explainable AI in Remote Sensing.
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
Are Fact-Checking Tools Reliable? An Evaluation of Google Fact Check
Authors:
Qiangeng Yang,
Tess Christensen,
Shlok Gilda,
Juliana Fernandes,
Daniela Oliveira
Abstract:
Fact-checking is an effective approach to combat misinformation on social media, especially regarding significant social events such as the COVID-19 pandemic and the U.S. presidential elections. In this study, we evaluated the performance of Google Fact Check, a fact-checking search engine. By analyzing the search results regarding 1,000 COVID-19-related false claims, we found Google Fact Check no…
▽ More
Fact-checking is an effective approach to combat misinformation on social media, especially regarding significant social events such as the COVID-19 pandemic and the U.S. presidential elections. In this study, we evaluated the performance of Google Fact Check, a fact-checking search engine. By analyzing the search results regarding 1,000 COVID-19-related false claims, we found Google Fact Check not likely to provide sufficient fact-checking information for most false claims, even though the results obtained are generally reliable and helpful. We also found that the corresponding false claims of different fact-checking verdicts (i.e., "False", "Partly False", "True", and "Unratable") tend to reflect diverse emotional tones, and fact-checking sources are likely to check the claims in different lengths and using dictionary words to various extents. Claims addressing the same issue yet described differently are likely to obtain disparate fact-checking results. This research aims to shed light on the best practices for performing fact-checking searches for the general public.
△ Less
Submitted 22 April, 2024; v1 submitted 20 February, 2024;
originally announced February 2024.
-
Transition to chaos and magnetic field generation in rotating Rayleigh-Bénard convection
Authors:
Dalton N. Oliveira,
Roman Chertovskih,
Erico L. Rempel,
Francis F. Franco
Abstract:
Hydrodynamic and magnetohydrodynamic convective attractors in three-dimensional rotating Rayleigh-Bénard convection are studied numerically by varying the Taylor and Rayleigh numbers as control parameters. First, an analysis of hydrodynamic attractors and their bifurcations is conducted, where routes to chaos via quasiperiodicity are identified. Second, the behaviour of the magnetohydrodynamic sys…
▽ More
Hydrodynamic and magnetohydrodynamic convective attractors in three-dimensional rotating Rayleigh-Bénard convection are studied numerically by varying the Taylor and Rayleigh numbers as control parameters. First, an analysis of hydrodynamic attractors and their bifurcations is conducted, where routes to chaos via quasiperiodicity are identified. Second, the behaviour of the magnetohydrodynamic system is investigated by introducing a seed magnetic field and measuring its growth or decay as a function of the Taylor number, while kee** the Rayleigh number fixed. Analysis of the attractors shows that rotation has a significant impact on magnetic field generation in Rayleigh-Bénard convection, with the critical magnetic Prandtl number changing nonmonotonically with the rotation rate. It is argued that a nonhysteretic blowout bifurcation with on-off intermittency is responsible for the transitions to dynamo.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
Controlling the biodegradation rates of poly(globalide-co-ε-caprolactone) copolymers by post polymerization modification
Authors:
Camila Guindani,
Graziâni Candiotto,
Pedro H. H. Araújo,
Sandra R. S. Ferreira,
Débora de Oliveira,
Frederik R. Wurm,
Katharina Landfester
Abstract:
Controlling the degradation rates of polymers is crucial for their application in tissue engineering or to achieve degradation of the polymers in the wastewater purification. As hydrophobic polyesters often exhibit very slow degradation rates, we report here increased biodegradation rates of poly(globalide-co-ε-caprolactone) copolymers (PGlCL) produced by enzymatic ring-opening copolymerization an…
▽ More
Controlling the degradation rates of polymers is crucial for their application in tissue engineering or to achieve degradation of the polymers in the wastewater purification. As hydrophobic polyesters often exhibit very slow degradation rates, we report here increased biodegradation rates of poly(globalide-co-ε-caprolactone) copolymers (PGlCL) produced by enzymatic ring-opening copolymerization and post-functionalized with N-acetylcysteine by thiol-ene reaction. The degradation rates of the PGlCL and post-modified PGlCL-NAC films were determined by weight-loss experiments. The polymer films were immersed in phosphate-buffered saline (PBS) solution, and PBS containing lipase from Pseudomonas cepacia. The degree of functionalization affected the degradation behavior, and samples with a higher degree of functionalization presented higher weight loss. Finally, a degradation assay was performed in activated sludge, and PGlCL-NAC presented high degradability, having a degradation behavior similar to starch. Density Functional Theory (DFT) calculations were used to assess the changes in chemical properties and electronic charge distribution of PGlCL after its functionalization with NAC, hel** to understand its influence in their degradability. The results obtained confirm the possibility to increase the degradation rates of copolyesters based on caprolactone and globalide by thiol-ene post-functionalization, being a promising alternative for applications in biomedicine or the packaging sector.
△ Less
Submitted 10 January, 2024;
originally announced January 2024.
-
Evolution of urban areas and land surface temperature
Authors:
Sudipan Saha,
Tushar Verma,
Dario Augusto Borges Oliveira
Abstract:
With the global population on the rise, our cities have been expanding to accommodate the growing number of people. The expansion of cities generally leads to the engulfment of peripheral areas. However, such expansion of urban areas is likely to cause increment in areas with increased land surface temperature (LST). By considering each summer as a data point, we form LST multi-year time-series an…
▽ More
With the global population on the rise, our cities have been expanding to accommodate the growing number of people. The expansion of cities generally leads to the engulfment of peripheral areas. However, such expansion of urban areas is likely to cause increment in areas with increased land surface temperature (LST). By considering each summer as a data point, we form LST multi-year time-series and cluster it to obtain spatio-temporal pattern. We observe several interesting phenomena from these patterns, e.g., some clusters show reasonable similarity to the built-up area, whereas the locations with high temporal variation are seen more in the peripheral areas. Furthermore, the LST center of mass shifts over the years for cities with development activities tilted towards a direction. We conduct the above-mentioned studies for three different cities in three different continents.
△ Less
Submitted 5 January, 2024;
originally announced January 2024.
-
Existence and multiplicity for fractional Dirichlet problem with $γ(ξ)$-Laplacian equation and Nehari manifold
Authors:
J. Vanterler da C. Sousa,
D. S. Oliveira,
Ravi P. Agarwal
Abstract:
This paper is divided in two parts. In the first part, we prove coercivity results and minimization of the Euler energy functional. In the second part, we focus on the existence and multiplicity of a positive solution of fractional Dirichlet problem involving the $γ(ξ)$-Laplacian equation with non-negative weight functions in $\mathcal{H}^{α,β;χ}_{γ(ξ)}(Λ,\mathbb{R})$ using some variational techni…
▽ More
This paper is divided in two parts. In the first part, we prove coercivity results and minimization of the Euler energy functional. In the second part, we focus on the existence and multiplicity of a positive solution of fractional Dirichlet problem involving the $γ(ξ)$-Laplacian equation with non-negative weight functions in $\mathcal{H}^{α,β;χ}_{γ(ξ)}(Λ,\mathbb{R})$ using some variational techniques and Nehari manifold.
△ Less
Submitted 3 October, 2023;
originally announced November 2023.
-
Foundation Models for Generalist Geospatial Artificial Intelligence
Authors:
Johannes Jakubik,
Sujit Roy,
C. E. Phillips,
Paolo Fraccaro,
Denys Godwin,
Bianca Zadrozny,
Daniela Szwarcman,
Carlos Gomes,
Gabby Nyirjesy,
Blair Edwards,
Daiki Kimura,
Naomi Simumba,
Linsong Chu,
S. Karthik Mukkavilli,
Devyani Lambhate,
Kamal Das,
Ran**i Bangalore,
Dario Oliveira,
Michal Muszynski,
Kumar Ankur,
Muthukumaran Ramasubramanian,
Iksha Gurung,
Sam Khallaghi,
Hanxi,
Li
, et al. (8 additional authors not shown)
Abstract:
Significant progress in the development of highly adaptable and reusable Artificial Intelligence (AI) models is expected to have a significant impact on Earth science and remote sensing. Foundation models are pre-trained on large unlabeled datasets through self-supervision, and then fine-tuned for various downstream tasks with small labeled datasets. This paper introduces a first-of-a-kind framewo…
▽ More
Significant progress in the development of highly adaptable and reusable Artificial Intelligence (AI) models is expected to have a significant impact on Earth science and remote sensing. Foundation models are pre-trained on large unlabeled datasets through self-supervision, and then fine-tuned for various downstream tasks with small labeled datasets. This paper introduces a first-of-a-kind framework for the efficient pre-training and fine-tuning of foundational models on extensive geospatial data. We have utilized this framework to create Prithvi, a transformer-based geospatial foundational model pre-trained on more than 1TB of multispectral satellite imagery from the Harmonized Landsat-Sentinel 2 (HLS) dataset. Our study demonstrates the efficacy of our framework in successfully fine-tuning Prithvi to a range of Earth observation tasks that have not been tackled by previous work on foundation models involving multi-temporal cloud gap imputation, flood map**, wildfire scar segmentation, and multi-temporal crop segmentation. Our experiments show that the pre-trained model accelerates the fine-tuning process compared to leveraging randomly initialized weights. In addition, pre-trained Prithvi compares well against the state-of-the-art, e.g., outperforming a conditional GAN model in multi-temporal cloud imputation by up to 5pp (or 5.7%) in the structural similarity index. Finally, due to the limited availability of labeled data in the field of Earth observation, we gradually reduce the quantity of available labeled data for refining the model to evaluate data efficiency and demonstrate that data can be decreased significantly without affecting the model's accuracy. The pre-trained 100 million parameter model and corresponding fine-tuning workflows have been released publicly as open source contributions to the global Earth sciences community through Hugging Face.
△ Less
Submitted 8 November, 2023; v1 submitted 28 October, 2023;
originally announced October 2023.
-
Validation of SOLPS-ITER Simulations against the TCV-X21 Reference Case
Authors:
Y. Wang,
C. Colandrea,
D. S. Oliveira,
C. Theiler,
H. Reimerdes,
T. Body,
D. Galassi,
L. Martinelli,
K. Lee,
TCV team
Abstract:
This paper presents a quantitative validation of SOLPS-ITER simulations against the TCV-X21 reference case and provides insights into the neutral dynamics and ionization source distribution in this scenario. TCV-X21 is a well-diagnosed diverted L-mode sheath-limited plasma scenario in both toroidal field directions, designed specifically for the validation of turbulence codes [D.S. Oliveira, T. Bo…
▽ More
This paper presents a quantitative validation of SOLPS-ITER simulations against the TCV-X21 reference case and provides insights into the neutral dynamics and ionization source distribution in this scenario. TCV-X21 is a well-diagnosed diverted L-mode sheath-limited plasma scenario in both toroidal field directions, designed specifically for the validation of turbulence codes [D.S. Oliveira, T. Body, et al 2022 Nucl. Fusion 62 096001]. Despite the optimization to reduce the impact of the neutral dynamics, the absence of neutrals in previous turbulence simulations of TCV-X21 was identified as a possible explanation for the disagreements with the experimental data in the divertor region. This motivates the present study with SOLPS-ITER that includes kinetic neutral dynamics via EIRENE. Five new observables are added to the extensive, publicly available TCV-X21 dataset. These are three deuterium Balmer lines in the divertor and neutral pressure in the common and private flux regions. The quantitative agreement metric is combined with the conjugate gradient method to approach the SOLPS-ITER input parameters that return the best overall agreement with the experiment. A proof-of-principle of this method results in a modest improvement in the level-of-agreement; shortcomings of the method and how to improve it are discussed. Alternatively, a scan of the particle and heat diffusion coefficients shows an improvement of 10.4% beyond the agreement level achieved by the gradient method. The result is found for an increased transport coefficient compared to what is usually used for TCV L-mode plasmas, suggesting the need for accurate self-consistent turbulence models for predictive boundary simulations. The simulations indicate that ~65% of the total ionization occurs in the SOL, motivating the inclusion of neutrals in future turbulence simulations towards improved agreement with the experiment.
△ Less
Submitted 26 October, 2023;
originally announced October 2023.
-
Distilling HuBERT with LSTMs via Decoupled Knowledge Distillation
Authors:
Danilo de Oliveira,
Timo Gerkmann
Abstract:
Much research effort is being applied to the task of compressing the knowledge of self-supervised models, which are powerful, yet large and memory consuming. In this work, we show that the original method of knowledge distillation (and its more recently proposed extension, decoupled knowledge distillation) can be applied to the task of distilling HuBERT. In contrast to methods that focus on distil…
▽ More
Much research effort is being applied to the task of compressing the knowledge of self-supervised models, which are powerful, yet large and memory consuming. In this work, we show that the original method of knowledge distillation (and its more recently proposed extension, decoupled knowledge distillation) can be applied to the task of distilling HuBERT. In contrast to methods that focus on distilling internal features, this allows for more freedom in the network architecture of the compressed model. We thus propose to distill HuBERT's Transformer layers into an LSTM-based distilled model that reduces the number of parameters even below DistilHuBERT and at the same time shows improved performance in automatic speech recognition.
△ Less
Submitted 18 September, 2023;
originally announced September 2023.
-
Crystallization and refluidization in very-narrow fluidized beds
Authors:
Vinícius Pereira da Silva Oliveira,
Danilo da Silva Borges,
Erick de Moraes Franklin
Abstract:
Fluidization of solid particles by an ascending fluid is frequent in industry because of the high rates of mass and heat transfers achieved. However, in some cases blockages occur and hinder the correct functioning of the fluidized bed. In this paper, we investigate the crystallization (defluidization) and refluidization that take place in very-narrow solid-liquid fluidized beds under steady flow…
▽ More
Fluidization of solid particles by an ascending fluid is frequent in industry because of the high rates of mass and heat transfers achieved. However, in some cases blockages occur and hinder the correct functioning of the fluidized bed. In this paper, we investigate the crystallization (defluidization) and refluidization that take place in very-narrow solid-liquid fluidized beds under steady flow conditions. For that, we carried out experiments where either monodisperse or bidisperse beds were immersed in water flows whose velocities were above those necessary for fluidization, and the ratio between the tube and grain diameters was smaller than 6. For monodisperse beds consisting of regular spheres, we observed that crystallization and refluidization alternate successively along time, which we quantify in terms of macroscopic structures and agitation of individual grains. We found the characteristic times for crystallization, and propose a new macroscopic parameter quantifying the degree of bed agitation. The bidisperse beds consisted of less-regular spheres placed on the bottom of a layer of regular spheres (the latter was identical to the monodisperse beds tested). We measured the changes that macroscopic structures and agitation of grains undergo, and show that the higher agitation in the bottom layer hinders crystallization of the top layer. Our results bring new insights into the dynamics of very-narrow beds, in addition to proposing a way of mitigating defluidization.
△ Less
Submitted 14 September, 2023;
originally announced September 2023.
-
Thermal conductivity of Barium Bismuthate at low temperatures
Authors:
A. Henriques,
D. M. N. Oliveira,
M. Baksi,
M. Naveed,
W. H. Brito,
J. Larrea-Jimenéz,
D. Kumah,
S. Wirth,
V. Martelli
Abstract:
The perovskite BaBiO$_3$ crystallizes in a cubic structure and undergoes structural transitions toward lower symmetry phases upon cooling. The two low-temperature monoclinic phases are insulating, and the origin of this unexpected non-metallic character has been under debate. Both monoclinic phases exhibit tilting and breathing distortions, which are connected with the insulating nature of this co…
▽ More
The perovskite BaBiO$_3$ crystallizes in a cubic structure and undergoes structural transitions toward lower symmetry phases upon cooling. The two low-temperature monoclinic phases are insulating, and the origin of this unexpected non-metallic character has been under debate. Both monoclinic phases exhibit tilting and breathing distortions, which are connected with the insulating nature of this compound and may have important effects on phononic heat conductivity. Here, we report the first thermal conductivity measurement, $κ$(T), in pristine polycrystalline BaBiO$_3$ from 1.5 K to 310 K. At low and intermediate temperatures, we observe features reminiscent of a glass-like behavior, whereas at high-temperatures we find a downturn - typical of a crystalline solid. We compare our findings with available data of other recently investigated perovskite oxides displaying similar temperature dependence.
△ Less
Submitted 25 August, 2023;
originally announced August 2023.
-
Interplanetary Shock Data Base
Authors:
Denny M. Oliveira
Abstract:
In this manuscript, I provide an updated interplanetary shock data base I published in previous works. This list has now 603 events. I also present and describe the data and methodologies used to compile this list. The main contribution of this work is to provide an updated end accurate interplanetary shock data base for future space physics and space weather investigations. The list has been uplo…
▽ More
In this manuscript, I provide an updated interplanetary shock data base I published in previous works. This list has now 603 events. I also present and describe the data and methodologies used to compile this list. The main contribution of this work is to provide an updated end accurate interplanetary shock data base for future space physics and space weather investigations. The list has been uploaded to Zenodo, and a link is provided for accessing the data files. As for Frontiers requirements, the access of the list has kept to be restricted during the review process. The list will be made public if/when the manuscript is published.
△ Less
Submitted 21 August, 2023;
originally announced August 2023.
-
ProWis: A Visual Approach for Building, Managing, and Analyzing Weather Simulation Ensembles at Runtime
Authors:
Carolina Veiga Ferreira de Souza,
Suzanna Maria Bonnet,
Daniel de Oliveira,
Marcio Cataldi,
Fabio Miranda,
Marcos Lage
Abstract:
Weather forecasting is essential for decision-making and is usually performed using numerical modeling. Numerical weather models, in turn, are complex tools that require specialized training and laborious setup and are challenging even for weather experts. Moreover, weather simulations are data-intensive computations and may take hours to days to complete. When the simulation is finished, the expe…
▽ More
Weather forecasting is essential for decision-making and is usually performed using numerical modeling. Numerical weather models, in turn, are complex tools that require specialized training and laborious setup and are challenging even for weather experts. Moreover, weather simulations are data-intensive computations and may take hours to days to complete. When the simulation is finished, the experts face challenges analyzing its outputs, a large mass of spatiotemporal and multivariate data. From the simulation setup to the analysis of results, working with weather simulations involves several manual and error-prone steps. The complexity of the problem increases exponentially when the experts must deal with ensembles of simulations, a frequent task in their daily duties. To tackle these challenges, we propose ProWis: an interactive and provenance-oriented system to help weather experts build, manage, and analyze simulation ensembles at runtime. Our system follows a human-in-the-loop approach to enable the exploration of multiple atmospheric variables and weather scenarios. ProWis was built in close collaboration with weather experts, and we demonstrate its effectiveness by presenting two case studies of rainfall events in Brazil.
△ Less
Submitted 9 August, 2023;
originally announced August 2023.
-
The use of the EM algorithm for regularization problems in high-dimensional linear mixed-effects models
Authors:
Daniela C. R. Oliveira,
Fernanda L. Schumacher,
Victor H. Lachos
Abstract:
The EM algorithm is a popular tool for maximum likelihood estimation but has not been used much for high-dimensional regularization problems in linear mixed-effects models. In this paper, we introduce the EMLMLasso algorithm, which combines the EM algorithm and the popular and efficient R package glmnet for Lasso variable selection of fixed effects in linear mixed-effects models. We compare the pe…
▽ More
The EM algorithm is a popular tool for maximum likelihood estimation but has not been used much for high-dimensional regularization problems in linear mixed-effects models. In this paper, we introduce the EMLMLasso algorithm, which combines the EM algorithm and the popular and efficient R package glmnet for Lasso variable selection of fixed effects in linear mixed-effects models. We compare the performance of our proposed EMLMLasso algorithm with the one implemented in the well-known R package glmmLasso through the analyses of both simulated and real-world applications. The simulations and applications demonstrated good properties, such as consistency, and the effectiveness of the proposed variable selection procedure, for both $p < n$ and $p > n$. Moreover, in all evaluated scenarios, the EMLMLasso algorithm outperformed glmmLasso. The proposed method is quite general and can be easily extended for ridge and elastic net penalties in linear mixed-effects models.
△ Less
Submitted 2 August, 2023;
originally announced August 2023.
-
IRQ Coloring and the Subtle Art of Mitigating Interrupt-generated Interference
Authors:
Diogo Costa,
Luca Cuomo,
Daniel Oliveira,
Ida Maria Savino,
Bruno Morelli,
José Martins,
Alessandro Biasci,
Sandro Pinto
Abstract:
Integrating workloads with differing criticality levels presents a formidable challenge in achieving the stringent spatial and temporal isolation requirements imposed by safety-critical standards such as ISO26262. The shift towards high-performance multicore platforms has been posing increasing issues to the so-called mixed-criticality systems (MCS) due to the reciprocal interference created by co…
▽ More
Integrating workloads with differing criticality levels presents a formidable challenge in achieving the stringent spatial and temporal isolation requirements imposed by safety-critical standards such as ISO26262. The shift towards high-performance multicore platforms has been posing increasing issues to the so-called mixed-criticality systems (MCS) due to the reciprocal interference created by consolidated subsystems vying for access to shared (microarchitectural) resources (e.g., caches, bus interconnect, memory controller). The research community has acknowledged all these challenges. Thus, several techniques, such as cache partitioning and memory throttling, have been proposed to mitigate such interference; however, these techniques have some drawbacks and limitations that impact performance, memory footprint, and availability. In this work, we look from a different perspective. Departing from the observation that safety-critical workloads are typically event- and thus interrupt-driven, we mask "colored" interrupts based on the \ac{QoS} assessment, providing fine-grain control to mitigate interference on critical workloads without entirely suspending non-critical workloads. We propose the so-called IRQ coloring technique. We implement and evaluate the IRQ Coloring on a reference high-performance multicore platform, i.e., Xilinx ZCU102. Results demonstrate negligible performance overhead, i.e., <1% for a 100 microseconds period, and reasonable throughput guarantees for medium-critical workloads. We argue that the IRQ coloring technique presents predictability and intermediate guarantees advantages compared to state-of-art mechanisms
△ Less
Submitted 2 August, 2023;
originally announced August 2023.
-
Conformal solitons for the mean curvature flow in hyperbolic space
Authors:
Luciano Mari,
Jose Danuso Rocha de Oliveira,
Andreas Savas-Halilaj,
Renivaldo Sodre de Sena
Abstract:
In this paper we study conformal solitons for the mean curvature flow in hyperbolic space $\mathbb{H}^{n+1}$. Working in the upper half-space model, we focus on horo-expanders, which relate to the conformal field $-\partial_0$. We classify cylindrical and rotationally symmetric examples, finding appropriate analogues of grim-reaper cylinders, bowl and winglike solitons. Moreover, we address the Pl…
▽ More
In this paper we study conformal solitons for the mean curvature flow in hyperbolic space $\mathbb{H}^{n+1}$. Working in the upper half-space model, we focus on horo-expanders, which relate to the conformal field $-\partial_0$. We classify cylindrical and rotationally symmetric examples, finding appropriate analogues of grim-reaper cylinders, bowl and winglike solitons. Moreover, we address the Plateau and the Dirichlet problems at infinity. For the latter, we provide the sharp boundary convexity condition to guarantee its solvability, and address the case of noncompact boundaries contained between two parallel hyperplanes of $\partial_{\infty}\mathbb{H}^{n+1}$. We conclude by proving rigidity results for bowl and grim-reaper cylinders.
△ Less
Submitted 11 July, 2023;
originally announced July 2023.
-
Kernels, Data & Physics
Authors:
Francesco Cagnetta,
Deborah Oliveira,
Mahalakshmi Sabanayagam,
Nikolaos Tsilivis,
Julia Kempe
Abstract:
Lecture notes from the course given by Professor Julia Kempe at the summer school "Statistical physics of Machine Learning" in Les Houches. The notes discuss the so-called NTK approach to problems in machine learning, which consists of gaining an understanding of generally unsolvable problems by finding a tractable kernel formulation. The notes are mainly focused on practical applications such as…
▽ More
Lecture notes from the course given by Professor Julia Kempe at the summer school "Statistical physics of Machine Learning" in Les Houches. The notes discuss the so-called NTK approach to problems in machine learning, which consists of gaining an understanding of generally unsolvable problems by finding a tractable kernel formulation. The notes are mainly focused on practical applications such as data distillation and adversarial robustness, examples of inductive bias are also discussed.
△ Less
Submitted 5 July, 2023;
originally announced July 2023.
-
On the Behavior of Intrusive and Non-intrusive Speech Enhancement Metrics in Predictive and Generative Settings
Authors:
Danilo de Oliveira,
Julius Richter,
Jean-Marie Lemercier,
Tal Peer,
Timo Gerkmann
Abstract:
Since its inception, the field of deep speech enhancement has been dominated by predictive (discriminative) approaches, such as spectral map** or masking. Recently, however, novel generative approaches have been applied to speech enhancement, attaining good denoising performance with high subjective quality scores. At the same time, advances in deep learning also allowed for the creation of neur…
▽ More
Since its inception, the field of deep speech enhancement has been dominated by predictive (discriminative) approaches, such as spectral map** or masking. Recently, however, novel generative approaches have been applied to speech enhancement, attaining good denoising performance with high subjective quality scores. At the same time, advances in deep learning also allowed for the creation of neural network-based metrics, which have desirable traits such as being able to work without a reference (non-intrusively). Since generatively enhanced speech tends to exhibit radically different residual distortions, its evaluation using instrumental speech metrics may behave differently compared to predictively enhanced speech. In this paper, we evaluate the performance of the same speech enhancement backbone trained under predictive and generative paradigms on a variety of metrics and show that intrusive and non-intrusive measures correlate differently for each paradigm. This analysis motivates the search for metrics that can together paint a complete and unbiased picture of speech enhancement performance, irrespective of the model's training process.
△ Less
Submitted 5 June, 2023;
originally announced June 2023.
-
Datasets for Portuguese Legal Semantic Textual Similarity: Comparing weak supervision and an annotation process approaches
Authors:
Daniel da Silva Junior,
Paulo Roberto dos S. Corval,
Aline Paes,
Daniel de Oliveira
Abstract:
The Brazilian judiciary has a large workload, resulting in a long time to finish legal proceedings. Brazilian National Council of Justice has established in Resolution 469/2022 formal guidance for document and process digitalization opening up the possibility of using automatic techniques to help with everyday tasks in the legal field, particularly in a large number of texts yielded on the routine…
▽ More
The Brazilian judiciary has a large workload, resulting in a long time to finish legal proceedings. Brazilian National Council of Justice has established in Resolution 469/2022 formal guidance for document and process digitalization opening up the possibility of using automatic techniques to help with everyday tasks in the legal field, particularly in a large number of texts yielded on the routine of law procedures. Notably, Artificial Intelligence (AI) techniques allow for processing and extracting useful information from textual data, potentially speeding up the process. However, datasets from the legal domain required by several AI techniques are scarce and difficult to obtain as they need labels from experts. To address this challenge, this article contributes with four datasets from the legal domain, two with documents and metadata but unlabeled, and another two labeled with a heuristic aiming at its use in textual semantic similarity tasks. Also, to evaluate the effectiveness of the proposed heuristic label process, this article presents a small ground truth dataset generated from domain expert annotations. The analysis of ground truth labels highlights that semantic analysis of domain text can be challenging even for domain experts. Also, the comparison between ground truth and heuristic labels shows that heuristic labels are useful.
△ Less
Submitted 29 May, 2023;
originally announced June 2023.
-
Leveraging Semantic Information for Efficient Self-Supervised Emotion Recognition with Audio-Textual Distilled Models
Authors:
Danilo de Oliveira,
Navin Raj Prabhu,
Timo Gerkmann
Abstract:
In large part due to their implicit semantic modeling, self-supervised learning (SSL) methods have significantly increased the performance of valence recognition in speech emotion recognition (SER) systems. Yet, their large size may often hinder practical implementations. In this work, we take HuBERT as an example of an SSL model and analyze the relevance of each of its layers for SER. We show tha…
▽ More
In large part due to their implicit semantic modeling, self-supervised learning (SSL) methods have significantly increased the performance of valence recognition in speech emotion recognition (SER) systems. Yet, their large size may often hinder practical implementations. In this work, we take HuBERT as an example of an SSL model and analyze the relevance of each of its layers for SER. We show that shallow layers are more important for arousal recognition while deeper layers are more important for valence. This observation motivates the importance of additional textual information for accurate valence recognition, as the distilled framework lacks the depth of its large-scale SSL teacher. Thus, we propose an audio-textual distilled SSL framework that, while having only ~20% of the trainable parameters of a large SSL model, achieves on par performance across the three emotion dimensions (arousal, valence, dominance) on the MSP-Podcast v1.10 dataset.
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
Nuclear activity in $z<0.3$ QSO 2's mainly triggered by galaxy mergers
Authors:
Bruna L. C. Araujo,
Thaisa Storchi-Bergmann,
Sandro B. Rembold,
André L. P. Kaipper,
Bruno Dall'Agnol de Oliveira
Abstract:
We investigate the role of the close environment on the nuclear activity of a sample of 436 nearby ($z<0.3$) QSO 2's -- selected from SDSS-III spectra, via comparison of their environment and interaction parameters with those of a control sample of 1308 galaxies. We have used the corresponding SDSS images to obtain the number of neighbour galaxies $N$, tidal strength parameter $Q$ and asymmetry pa…
▽ More
We investigate the role of the close environment on the nuclear activity of a sample of 436 nearby ($z<0.3$) QSO 2's -- selected from SDSS-III spectra, via comparison of their environment and interaction parameters with those of a control sample of 1308 galaxies. We have used the corresponding SDSS images to obtain the number of neighbour galaxies $N$, tidal strength parameter $Q$ and asymmetry parameters. We find a small excess of $N$ in the QSOs compared to its three controls, and no difference in $Q$. The main difference is an excess of asymmetry in the QSOs hosts, which is almost twice that of the control galaxies. This difference is not due to the hosts' morphology, since there is no difference in their Galaxy Zoo classifications. HST images of two highly asymmetric QSO 2 hosts of our sample show that both sources have a close companion (at projected separations $\sim$ 5 kpc), which we thus conclude is the cause of the observed asymmetry in the lower resolution SDSS images. The mean projected radius of the controls is $ \langle r \rangle = 8.53\pm$0.06 kpc, while that of the QSO hosts is $ \langle r \rangle = 9.39\pm$0.12 kpc, supporting the presence of interaction signatures in the outer regions of the QSO hosts. Our results favour a scenario in which nuclear activity in QSO 2's is triggered by close galaxy interactions -- when the distance between the host and companion is of the order of the galaxy radius, implying that they are already in the process of merger.
△ Less
Submitted 24 April, 2023;
originally announced April 2023.
-
Self-consistent multi-component simulation of plasma turbulence and neutrals in detached conditions
Authors:
D. Mancini,
P. Ricci,
N. Vianello,
G. Van Parys,
D. S. Oliveira
Abstract:
Simulations of high-density deuterium plasmas in a lower single-null magnetic configuration based on a TCV discharge are presented. We evolve the dynamics of three charged species (electrons, D$^{+}$ and D$_{2}^{+}$), interacting with two neutrals species (D and D$_2$) through ionization, charge-exchange, recombination and molecular dissociation processes. The plasma is modelled by using the drift…
▽ More
Simulations of high-density deuterium plasmas in a lower single-null magnetic configuration based on a TCV discharge are presented. We evolve the dynamics of three charged species (electrons, D$^{+}$ and D$_{2}^{+}$), interacting with two neutrals species (D and D$_2$) through ionization, charge-exchange, recombination and molecular dissociation processes. The plasma is modelled by using the drift-reduced fluid Braginskii equations, while the neutral dynamics is described by a kinetic model. To control the divertor conditions, a D$_2$ puffing is used and the effect of increasing the puffing strength is investigated. The increase in fuelling leads to an increase of density in the scrape-off layer and a decrease of the plasma temperature. At the same time, the particle and heat fluxes to the divertor target decrease and the detachment of the inner target is observed. The analysis of particle and transport balance in the divertor volume shows that the decrease of the particle flux is caused by a decrease of the local neutral ionization together with a decrease of the parallel velocity, both caused by the lower plasma temperature. The relative importance of the different collision terms is assessed, showing the crucial role of molecular interactions, as they are responsible for increasing the atomic neutral density and temperature, since most of the D neutrals are produced by molecular activated recombination and D$_2$ dissociation. The presence of strong electric fields in high-density plasmas is also shown, revealing the role of the $E \times B$ drift in setting the asymmetry between the divertor targets. Simulation results are in agreement with experimental observations of increased density decay length, attributed to a decrease of parallel transport, together with an increase of plasma blob size and radial velocity.
△ Less
Submitted 3 September, 2023; v1 submitted 19 April, 2023;
originally announced April 2023.
-
Geoeffectiveness of Interplanetary Shocks Controlled by Impact Angles: Past Research, Recent Advancements, and Future Work
Authors:
Denny M. Oliveira
Abstract:
Interplanetary (IP) shocks are disturbances commonly observed in the solar wind. IP shock impacts can cause a myriad of space weather effects in the Earth's magnetopause, inner magnetosphere, ionosphere, thermosphere, and ground magnetic field. The shock impact angle, measured as the angle the shock normal vector performs with the Sun-Earth line, has been shown to be a very important parameter tha…
▽ More
Interplanetary (IP) shocks are disturbances commonly observed in the solar wind. IP shock impacts can cause a myriad of space weather effects in the Earth's magnetopause, inner magnetosphere, ionosphere, thermosphere, and ground magnetic field. The shock impact angle, measured as the angle the shock normal vector performs with the Sun-Earth line, has been shown to be a very important parameter that controls shock geoeffectivess. An extensive review provided by Oliveira and Samsonov (2018) summarized all the work known at the time with respect to shock impact angles and geomagnetic activity; however, this topic has had some progress since Oliveira and Samsonov (2018) and the main goal of this mini review is to summarize all achievements to date in the topic to the knowledge of the author. Finally, this mini review also brings a few suggestions and ideas for future research in the area of IP shock impact angle geoeffectiveness.
△ Less
Submitted 17 April, 2023;
originally announced April 2023.
-
Cold molecular gas outflow encasing the ionised one in the Seyfert galaxy NGC 3281
Authors:
Bruno Dall'Agnol de Oliveira,
Thaisa Storchi-Bergmann,
Raffaella Morganti,
Rogemar A. Riffel,
Venkatessh Ramakrishnan
Abstract:
We present ALMA CO(2-1) observations of the Seyfert 2 galaxy NGC 3281 at $\sim$ 100 pc spatial resolution. This galaxy was previously known to present a bi-conical ionised gas outflow extending to 2 kpc from the nucleus. The analysis of the CO moment and channel maps, as well as kinematic modelling reveals two main components in the molecular gas: one rotating in the galaxy plane and another outfl…
▽ More
We present ALMA CO(2-1) observations of the Seyfert 2 galaxy NGC 3281 at $\sim$ 100 pc spatial resolution. This galaxy was previously known to present a bi-conical ionised gas outflow extending to 2 kpc from the nucleus. The analysis of the CO moment and channel maps, as well as kinematic modelling reveals two main components in the molecular gas: one rotating in the galaxy plane and another outflowing and extending up to $\sim$ 1.8 -- 2.6 kpc from the nucleus, partially encasing the ionised component. The mass of the outflowing molecular gas component is $M_{\mathrm{mol},\mathrm{out}}$ = $(2.5\pm1.6){\times}10^{6}$ $\rm{M_{\odot}}$, representing $\sim$ 1.7 -- 2 % of the total molecular gas seen in emission within the inner 2.3 kpc. The corresponding mass outflow rate and power are $\dot{M}_{\mathrm{mol},\mathrm{out}}$ = 0.12 -- 0.72 $\rm{M_{\odot} yr^{-1}}$ and $\dot{E}_{\mathrm{mol},\mathrm{out}}$ = (0.045 -- 1.6) ${\times} 10^{40}$ $\rm{erg s^{-1}}$, which translates to a kinetic coupling efficiency with the AGN power of only $10^{-4}$ -- 0.02 %. This value reaches up to 0.1 % when including both the feedback in the ionised and molecular gas, as well as considering that only part of the energy couples kinetically with the gas. Some of the non-rotating CO emission can also be attributed to inflow in the galaxy plane towards the nucleus. The similarity of the CO outflow -- encasing the ionised gas one and the X-ray emission -- to those seen in other sources, suggests that this may be a common property of galactic outflows.
△ Less
Submitted 8 April, 2023;
originally announced April 2023.
-
Exploring quantum thermodynamics with NMR
Authors:
Carlos H. S. Vieira,
Jefferson L. D. de Oliveira,
Jonas F. G. Santos,
Pedro R. Dieguez,
Roberto M. Serra
Abstract:
Quantum thermodynamics seeks to extend non-equilibrium stochastic thermodynamics to small quantum systems where non-classical features are essential to its description. Such a research area has recently provided meaningful theoretical and experimental advances by exploring the wealth and the power of quantum features along with informational aspects of a system's thermodynamics. The relevance of s…
▽ More
Quantum thermodynamics seeks to extend non-equilibrium stochastic thermodynamics to small quantum systems where non-classical features are essential to its description. Such a research area has recently provided meaningful theoretical and experimental advances by exploring the wealth and the power of quantum features along with informational aspects of a system's thermodynamics. The relevance of such investigations is related to the fact that quantum technological devices are currently at the forefront of science and engineering applications. This short review article provides an overview of some concepts in quantum thermodynamics highlighting test-of-principles experiments using nuclear magnetic resonance techniques.
△ Less
Submitted 15 March, 2023;
originally announced March 2023.
-
Existence, uniqueness and controllability for Hilfer differential equations on times scales
Authors:
J. Vanterler da C. Sousa,
D. S. Oliveira,
Gastao S. F. Frederico,
Delfim F. M. Torres
Abstract:
We introduce a new version of $ψ$-Hilfer fractional derivative, on an arbitrary time scale. The fundamental properties of the new operator are investigated and, in particular, we prove an integration by parts formula. Using the Laplace transform and the obtained integration by parts formula, we then propose a $ψ$-Riemann-Liouville fractional integral on times scales. The applicability of the new o…
▽ More
We introduce a new version of $ψ$-Hilfer fractional derivative, on an arbitrary time scale. The fundamental properties of the new operator are investigated and, in particular, we prove an integration by parts formula. Using the Laplace transform and the obtained integration by parts formula, we then propose a $ψ$-Riemann-Liouville fractional integral on times scales. The applicability of the new operators is illustrated by considering a fractional initial value problem on an arbitrary time scale, for which we prove existence, uniqueness and controllability of solutions in a suitable Banach space. The obtained results are interesting and nontrivial even for particular choices: (i) of the time scale; (ii) of the order of differentiation; and/or (iii) function $ψ$; opening new directions of investigation.
△ Less
Submitted 21 February, 2023;
originally announced February 2023.
-
Diminimal families of arbitrary diameter
Authors:
Luiz Emilio Allem,
Rodrigo Orsini Braga,
Carlos Hoppen,
Elismar da Rosa Oliveira,
Lucas Siviero Sibemberg,
Vilmar Trevisan
Abstract:
Given a tree $T$, let $q(T)$ be the minimum number of distinct eigenvalues in a symmetric matrix whose underlying graph is $T$. It is well known that $q(T)\geq d(T)+1$, where $d(T)$ is the diameter of $T$, and a tree $T$ is said to be diminimal if $q(T)=d(T)+1$. In this paper, we present families of diminimal trees of any fixed diameter. Our proof is constructive, allowing us to compute, for any d…
▽ More
Given a tree $T$, let $q(T)$ be the minimum number of distinct eigenvalues in a symmetric matrix whose underlying graph is $T$. It is well known that $q(T)\geq d(T)+1$, where $d(T)$ is the diameter of $T$, and a tree $T$ is said to be diminimal if $q(T)=d(T)+1$. In this paper, we present families of diminimal trees of any fixed diameter. Our proof is constructive, allowing us to compute, for any diminimal tree $T$ of diameter $d$ in these families, a symmetric matrix $M$ with underlying graph $T$ whose spectrum has exactly $d+1$ distinct eigenvalues.
△ Less
Submitted 1 February, 2023;
originally announced February 2023.
-
Index bounds for closed minimal surfaces in 3-manifolds with the Killing property
Authors:
Marcos P. Cavalcante,
Darlan F. de Oliveira,
Robson dos S. Silva
Abstract:
Let $Σ$ be a closed minimal surface immersed in a Riemannian 3-manifold carrying an orthonormal Killing frame. This class of ambient spaces includes Lie groups with a bi-invariant metric. In this paper, we prove that the sum of the Morse index and the nullity of $Σ$ is bounded from below by a constant times its genus.
Let $Σ$ be a closed minimal surface immersed in a Riemannian 3-manifold carrying an orthonormal Killing frame. This class of ambient spaces includes Lie groups with a bi-invariant metric. In this paper, we prove that the sum of the Morse index and the nullity of $Σ$ is bounded from below by a constant times its genus.
△ Less
Submitted 28 January, 2023;
originally announced January 2023.
-
An interpretable machine learning system for colorectal cancer diagnosis from pathology slides
Authors:
Pedro C. Neto,
Diana Montezuma,
Sara P. Oliveira,
Domingos Oliveira,
João Fraga,
Ana Monteiro,
João Monteiro,
Liliana Ribeiro,
Sofia Gonçalves,
Stefan Reinhard,
Inti Zlobec,
Isabel M. Pinto,
Jaime S. Cardoso
Abstract:
Considering the profound transformation affecting pathology practice, we aimed to develop a scalable artificial intelligence (AI) system to diagnose colorectal cancer from whole-slide images (WSI). For this, we propose a deep learning (DL) system that learns from weak labels, a sampling strategy that reduces the number of training samples by a factor of six without compromising performance, an app…
▽ More
Considering the profound transformation affecting pathology practice, we aimed to develop a scalable artificial intelligence (AI) system to diagnose colorectal cancer from whole-slide images (WSI). For this, we propose a deep learning (DL) system that learns from weak labels, a sampling strategy that reduces the number of training samples by a factor of six without compromising performance, an approach to leverage a small subset of fully annotated samples, and a prototype with explainable predictions, active learning features and parallelisation. Noting some problems in the literature, this study is conducted with one of the largest WSI colorectal samples dataset with approximately 10,500 WSIs. Of these samples, 900 are testing samples. Furthermore, the robustness of the proposed method is assessed with two additional external datasets (TCGA and PAIP) and a dataset of samples collected directly from the proposed prototype. Our proposed method predicts, for the patch-based tiles, a class based on the severity of the dysplasia and uses that information to classify the whole slide. It is trained with an interpretable mixed-supervision scheme to leverage the domain knowledge introduced by pathologists through spatial annotations. The mixed-supervision scheme allowed for an intelligent sampling strategy effectively evaluated in several different scenarios without compromising the performance. On the internal dataset, the method shows an accuracy of 93.44% and a sensitivity between positive (low-grade and high-grade dysplasia) and non-neoplastic samples of 0.996. On the external test samples varied with TCGA being the most challenging dataset with an overall accuracy of 84.91% and a sensitivity of 0.996.
△ Less
Submitted 30 April, 2024; v1 submitted 6 January, 2023;
originally announced January 2023.
-
On the number of rational points of Artin-Schreier curves and hypersurfaces
Authors:
Fabio Enrique Brochero Martínez,
Daniela Alves de Oliveira
Abstract:
Let $\mathbb F_{q^n}$ denote the finite field with $q^n$ elements. In this paper we determine the number of $\mathbb F_{q^n}$-rational points of the affine Artin-Schreier curve given by $y^q-y = x(x^{q^i}-x)-λ$ and of the Artin-Schreier hypersurface $y^q-y=\sum_{j=1}^r a_jx_j(x_j^{q^{i_j}}-x_j)-λ.$ Moreover in both cases, we show that the Weil bound is attained only in the case where the trace of…
▽ More
Let $\mathbb F_{q^n}$ denote the finite field with $q^n$ elements. In this paper we determine the number of $\mathbb F_{q^n}$-rational points of the affine Artin-Schreier curve given by $y^q-y = x(x^{q^i}-x)-λ$ and of the Artin-Schreier hypersurface $y^q-y=\sum_{j=1}^r a_jx_j(x_j^{q^{i_j}}-x_j)-λ.$ Moreover in both cases, we show that the Weil bound is attained only in the case where the trace of $λ\in\mathbb F_{q^n}$ over $\mathbb F_q$ is zero. We use quadratic forms and permutation matrices to determine the number of affine rational points of these curves and hypersurfaces.
△ Less
Submitted 6 July, 2023; v1 submitted 21 November, 2022;
originally announced November 2022.
-
Chronic pain patient narratives allow for the estimation of current pain intensity
Authors:
Diogo A. P. Nunes,
Joana Ferreira-Gomes,
Daniela Oliveira,
Carlos Vaz,
Sofia Pimenta,
Fani Neto,
David Martins de Matos
Abstract:
Chronic pain is a multi-dimensional experience, and pain intensity plays an important part, impacting the patients emotional balance, psychology, and behaviour. Standard self-reporting tools, such as the Visual Analogue Scale for pain, fail to capture this burden. Moreover, this type of tools is susceptible to a degree of subjectivity, dependent on the patients clear understanding of how to use it…
▽ More
Chronic pain is a multi-dimensional experience, and pain intensity plays an important part, impacting the patients emotional balance, psychology, and behaviour. Standard self-reporting tools, such as the Visual Analogue Scale for pain, fail to capture this burden. Moreover, this type of tools is susceptible to a degree of subjectivity, dependent on the patients clear understanding of how to use it, social biases, and their ability to translate a complex experience to a scale. To overcome these and other self-reporting challenges, pain intensity estimation has been previously studied based on facial expressions, electroencephalograms, brain imaging, and autonomic features. However, to the best of our knowledge, it has never been attempted to base this estimation on the patient narratives of the personal experience of chronic pain, which is what we propose in this work. Indeed, in the clinical assessment and management of chronic pain, verbal communication is essential to convey information to physicians that would otherwise not be easily accessible through standard reporting tools, since language, sociocultural, and psychosocial variables are intertwined. We show that language features from patient narratives indeed convey information relevant for pain intensity estimation, and that our computational models can take advantage of that. Specifically, our results show that patients with mild pain focus more on the use of verbs, whilst moderate and severe pain patients focus on adverbs, and nouns and adjectives, respectively, and that these differences allow for the distinction between these three pain classes.
△ Less
Submitted 17 November, 2022; v1 submitted 31 October, 2022;
originally announced October 2022.
-
Exploring Self-Attention for Crop-type Classification Explainability
Authors:
Ivica Obadic,
Ribana Roscher,
Dario Augusto Borges Oliveira,
Xiao Xiang Zhu
Abstract:
Automated crop-type classification using Sentinel-2 satellite time series is essential to support agriculture monitoring. Recently, deep learning models based on transformer encoders became a promising approach for crop-type classification. Using explainable machine learning to reveal the inner workings of these models is an important step towards improving stakeholders' trust and efficient agricu…
▽ More
Automated crop-type classification using Sentinel-2 satellite time series is essential to support agriculture monitoring. Recently, deep learning models based on transformer encoders became a promising approach for crop-type classification. Using explainable machine learning to reveal the inner workings of these models is an important step towards improving stakeholders' trust and efficient agriculture monitoring.
In this paper, we introduce a novel explainability framework that aims to shed a light on the essential crop disambiguation patterns learned by a state-of-the-art transformer encoder model. More specifically, we process the attention weights of a trained transformer encoder to reveal the critical dates for crop disambiguation and use domain knowledge to uncover the phenological events that support the model performance. We also present a sensitivity analysis approach to understand better the attention capability for revealing crop-specific phenological events.
We report compelling results showing that attention patterns strongly relate to key dates, and consequently, to the critical phenological events for crop-type classification. These findings might be relevant for improving stakeholder trust and optimizing agriculture monitoring processes. Additionally, our sensitivity analysis demonstrates the limitation of attention weights for identifying the important events in the crop phenology as we empirically show that the unveiled phenological events depend on the other crops in the data considered during training.
△ Less
Submitted 24 October, 2022;
originally announced October 2022.
-
Improving Data Quality with Training Dynamics of Gradient Boosting Decision Trees
Authors:
Moacir Antonelli Ponti,
Lucas de Angelis Oliveira,
Mathias Esteban,
Valentina Garcia,
Juan Martín Román,
Luis Argerich
Abstract:
Real world datasets contain incorrectly labeled instances that hamper the performance of the model and, in particular, the ability to generalize out of distribution. Also, each example might have different contribution towards learning. This motivates studies to better understanding of the role of data instances with respect to their contribution in good metrics in models. In this paper we propose…
▽ More
Real world datasets contain incorrectly labeled instances that hamper the performance of the model and, in particular, the ability to generalize out of distribution. Also, each example might have different contribution towards learning. This motivates studies to better understanding of the role of data instances with respect to their contribution in good metrics in models. In this paper we propose a method based on metrics computed from training dynamics of Gradient Boosting Decision Trees (GBDTs) to assess the behavior of each training example. We focus on datasets containing mostly tabular or structured data, for which the use of Decision Trees ensembles are still the state-of-the-art in terms of performance. Our methods achieved the best results overall when compared with confident learning, direct heuristics and a robust boosting algorithm. We show results on detecting noisy labels in order clean datasets, improving models' metrics in synthetic and real public datasets, as well as on a industry case in which we deployed a model based on the proposed solution.
△ Less
Submitted 22 February, 2024; v1 submitted 20 October, 2022;
originally announced October 2022.
-
Transfer-learning for video classification: Video Swin Transformer on multiple domains
Authors:
Daniel Oliveira,
David Martins de Matos
Abstract:
The computer vision community has seen a shift from convolutional-based to pure transformer architectures for both image and video tasks. Training a transformer from zero for these tasks usually requires a lot of data and computational resources. Video Swin Transformer (VST) is a pure-transformer model developed for video classification which achieves state-of-the-art results in accuracy and effic…
▽ More
The computer vision community has seen a shift from convolutional-based to pure transformer architectures for both image and video tasks. Training a transformer from zero for these tasks usually requires a lot of data and computational resources. Video Swin Transformer (VST) is a pure-transformer model developed for video classification which achieves state-of-the-art results in accuracy and efficiency on several datasets. In this paper, we aim to understand if VST generalizes well enough to be used in an out-of-domain setting. We study the performance of VST on two large-scale datasets, namely FCVID and Something-Something using a transfer learning approach from Kinetics-400, which requires around 4x less memory than training from scratch. We then break down the results to understand where VST fails the most and in which scenarios the transfer-learning approach is viable. Our experiments show an 85\% top-1 accuracy on FCVID without retraining the whole model which is equal to the state-of-the-art for the dataset and a 21\% accuracy on Something-Something. The experiments also suggest that the performance of the VST decreases on average when the video duration increases which seems to be a consequence of a design choice of the model. From the results, we conclude that VST generalizes well enough to classify out-of-domain videos without retraining when the target classes are from the same type as the classes used to train the model. We observed this effect when we performed transfer-learning from Kinetics-400 to FCVID, where most datasets target mostly objects. On the other hand, if the classes are not from the same type, then the accuracy after the transfer-learning approach is expected to be poor. We observed this effect when we performed transfer-learning from Kinetics-400, where the classes represent mostly objects, to Something-Something, where the classes represent mostly actions.
△ Less
Submitted 18 October, 2022;
originally announced October 2022.
-
Image-Based Detection of Modifications in Gas Pump PCBs with Deep Convolutional Autoencoders
Authors:
Diulhio Candido de Oliveira,
Bogdan Tomoyuki Nassu,
Marco Aurelio Wehrmeister
Abstract:
In this paper, we introduce an approach for detecting modifications in assembled printed circuit boards based on photographs taken without tight control over perspective and illumination conditions. One instance of this problem is the visual inspection of gas pumps PCBs, which can be modified by fraudsters wishing to deceive costumers or evade taxes. Given the uncontrolled environment and the huge…
▽ More
In this paper, we introduce an approach for detecting modifications in assembled printed circuit boards based on photographs taken without tight control over perspective and illumination conditions. One instance of this problem is the visual inspection of gas pumps PCBs, which can be modified by fraudsters wishing to deceive costumers or evade taxes. Given the uncontrolled environment and the huge number of possible modifications, we address the problem as a case of anomaly detection, proposing an approach that is directed towards the characteristics of that scenario, while being well-suited for other similar applications. The proposed approach employs a deep convolutional autoencoder trained to reconstruct images of an unmodified board, but which remains unable to do the same for images showing modifications. By comparing the input image with its reconstruction, it is possible to segment anomalies and modifications in a pixel-wise manner. Experiments performed on a dataset built to represent real-world situations (and which we will make publicly available) show that our approach outperforms other state-of-the-art approaches for anomaly segmentation in the considered scenario, while producing comparable results on the popular MVTec-AD dataset for a more general object anomaly detection task.
△ Less
Submitted 6 October, 2022; v1 submitted 30 September, 2022;
originally announced October 2022.
-
The number of rational points of a class of superelliptic curves
Authors:
José Alves Oliveira,
Daniela Oliveira,
F. E. Brochero Martínez
Abstract:
In this paper, we study the number of $\mathbb F_{q^n}$-rational points on the affine curve $\mathcal{X}_{d,a,b}$ given by the equation $$ y^d=ax\text{Tr}(x)+b,$$ where $\text{Tr}$ denote the trace function from $\mathbb F_{q^n}$ to $\mathbb F_{q}$ and $d$ is a positive integer. In particular, we present bounds for the number of $\mathbb F_{q}$-rational points on $\mathcal{X}_{d,a,b}$ and, for the…
▽ More
In this paper, we study the number of $\mathbb F_{q^n}$-rational points on the affine curve $\mathcal{X}_{d,a,b}$ given by the equation $$ y^d=ax\text{Tr}(x)+b,$$ where $\text{Tr}$ denote the trace function from $\mathbb F_{q^n}$ to $\mathbb F_{q}$ and $d$ is a positive integer. In particular, we present bounds for the number of $\mathbb F_{q}$-rational points on $\mathcal{X}_{d,a,b}$ and, for the cases where $d$ satisfies a natural condition, explicit formulas for the number of rational points are obtained. Particularly, a complete characterization is given for the case $d=2$. As a consequence of our results, we compute the number of elements $α$ in $\mathbb F_{q^n}$ such that $α$ and $\text{Tr}(α)$ are quadratic residues in $\mathbb F_{q^n}$.
△ Less
Submitted 14 September, 2022;
originally announced September 2022.
-
A Systematic Literature Review on the Impact of Formatting Elements on Code Legibility
Authors:
Delano Oliveira,
Reydne Santos,
Fernanda Madeiral,
Hidehiko Masuhara,
Fernando Castor
Abstract:
Context: Software programs can be written in different but functionally equivalent ways. Even though previous research has compared specific formatting elements to find out which alternatives affect code legibility, seeing the bigger picture of what makes code more or less legible is challenging. Goal: We aim to find which formatting elements have been investigated in empirical studies and which a…
▽ More
Context: Software programs can be written in different but functionally equivalent ways. Even though previous research has compared specific formatting elements to find out which alternatives affect code legibility, seeing the bigger picture of what makes code more or less legible is challenging. Goal: We aim to find which formatting elements have been investigated in empirical studies and which alternatives were found to be more legible for human subjects. Method: We conducted a systematic literature review and identified 15 papers containing human-centric studies that directly compared alternative formatting elements. We analyzed and organized these formatting elements using a card-sorting method. Results: We identified 13 formatting elements (e.g., indentation) and 33 levels of formatting elements (e.g., two-space indentation), which are about formatting styles, spacing, block delimiters, long or complex code lines, and word boundary styles. While some levels were found to be statistically better than other equivalent ones in terms of code legibility, e.g., appropriate use of indentation with blocks, others were not, e.g., formatting layout. For identifier style, we found divergent results, where one study found a significant difference in favor of camel case, while another study found a positive result in favor of snake case. Conclusion: The number of identified papers, some of which are outdated, and the many null and contradictory results emphasize the relative lack of work in this area and underline the importance of more research. There is much to be understood about how formatting elements influence code legibility before the creation of guidelines and automated aids to help developers make their code more legible.
△ Less
Submitted 1 June, 2023; v1 submitted 25 August, 2022;
originally announced August 2022.
-
Learning crop type map** from regional label proportions in large-scale SAR and optical imagery
Authors:
Laura E. C. La Rosa,
Dario A. B. Oliveira,
Pedram Ghamisi
Abstract:
The application of deep learning algorithms to Earth observation (EO) in recent years has enabled substantial progress in fields that rely on remotely sensed data. However, given the data scale in EO, creating large datasets with pixel-level annotations by experts is expensive and highly time-consuming. In this context, priors are seen as an attractive way to alleviate the burden of manual labelin…
▽ More
The application of deep learning algorithms to Earth observation (EO) in recent years has enabled substantial progress in fields that rely on remotely sensed data. However, given the data scale in EO, creating large datasets with pixel-level annotations by experts is expensive and highly time-consuming. In this context, priors are seen as an attractive way to alleviate the burden of manual labeling when training deep learning methods for EO. For some applications, those priors are readily available. Motivated by the great success of contrastive-learning methods for self-supervised feature representation learning in many computer-vision tasks, this study proposes an online deep clustering method using crop label proportions as priors to learn a sample-level classifier based on government crop-proportion data for a whole agricultural region. We evaluate the method using two large datasets from two different agricultural regions in Brazil. Extensive experiments demonstrate that the method is robust to different data types (synthetic-aperture radar and optical images), reporting higher accuracy values considering the major crop types in the target regions. Thus, it can alleviate the burden of large-scale image annotation in EO applications.
△ Less
Submitted 24 August, 2022;
originally announced August 2022.
-
Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge
Authors:
Alef Iury Siqueira Ferreira,
Gustavo dos Reis Oliveira
Abstract:
This paper presents our efforts to build a robust ASR model for the shared task Automatic Speech Recognition for spontaneous and prepared speech & Speech Emotion Recognition in Portuguese (SE&R 2022). The goal of the challenge is to advance the ASR research for the Portuguese language, considering prepared and spontaneous speech in different dialects. Our method consist on fine-tuning an ASR model…
▽ More
This paper presents our efforts to build a robust ASR model for the shared task Automatic Speech Recognition for spontaneous and prepared speech & Speech Emotion Recognition in Portuguese (SE&R 2022). The goal of the challenge is to advance the ASR research for the Portuguese language, considering prepared and spontaneous speech in different dialects. Our method consist on fine-tuning an ASR model in a domain-specific approach, applying gain normalization and selective noise insertion. The proposed method improved over the strong baseline provided on the test set in 3 of the 4 tracks available
△ Less
Submitted 28 July, 2022;
originally announced July 2022.
-
Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes
Authors:
Danilo de Oliveira,
Tal Peer,
Timo Gerkmann
Abstract:
The SepFormer architecture shows very good results in speech separation. Like other learned-encoder models, it uses short frames, as they have been shown to obtain better performance in these cases. This results in a large number of frames at the input, which is problematic; since the SepFormer is transformer-based, its computational complexity drastically increases with longer sequences. In this…
▽ More
The SepFormer architecture shows very good results in speech separation. Like other learned-encoder models, it uses short frames, as they have been shown to obtain better performance in these cases. This results in a large number of frames at the input, which is problematic; since the SepFormer is transformer-based, its computational complexity drastically increases with longer sequences. In this paper, we employ the SepFormer in a speech enhancement task and show that by replacing the learned-encoder features with a magnitude short-time Fourier transform (STFT) representation, we can use long frames without compromising perceptual enhancement performance. We obtained equivalent quality and intelligibility evaluation scores while reducing the number of operations by a factor of approximately 8 for a 10-second utterance.
△ Less
Submitted 23 June, 2022;
originally announced June 2022.