-
Magneto-electric decoupling in bismuth ferrite
Authors:
Thien Thanh Dang,
Juliana Heiniger-Schell,
Astita Dubey,
João Nuno Gonçalves,
Marianela Escobar Castillo,
Daniil Lewin,
Ian Chang Jie Yap,
Adeleh Mokhles Gerami,
Sobhan Mohammadi Fathabad,
Dmitry Zyabkin,
Doru Constantin Lupascu
Abstract:
How magnetoelectric coupling actually occurs at the atomic scale in multiferroic BiFeO3 is still under intensive discussion. Nuclear solid-state techniques monitor local fields at the atomic scale. Using such an approach, we show that, contrary to our own expectation, ferroelectric and magnetic ordering in bismuth ferrite (BiFeO3 or BFO) decouple at the unit-cell level. Time differential perturbed angular correlation (TDPAC) data at temperatures below, close to, and above the magnetic Néel temperature show that the coupling of the ferroelectric order to magnetization is completely absent at the bismuth site. It is common understanding that the antiferromagnetic order and the cycloidal ordering due to the Dzyaloshinskii-Moriya interaction generate a net zero magnetization of the sample, cancelling any magnetoelectric effect at the macroscopic level. Our previous data show that a very large coupling of magnetic moment and electrical distortions arises on the magnetic sub-lattice (Fe-site). The oxygen octahedra around the iron site experience a large tilt due to the onset of magnetic ordering. Nevertheless, the Bi-containing complementary sub-lattice carrying the ferroelectric order is practically unaffected by this large structural change in its direct vicinity. The magnetoelectric coupling thus vanishes already at the unit-cell level. These experimental results agree well with an ab-initio density functional theory (DFT) calculation.
Submitted 23 June, 2024;
originally announced June 2024.
-
Improved convergence rates for the multiobjective Frank-Wolfe method
Authors:
Douglas S. Gonçalves,
Max L. N. Gonçalves,
Jefferson G. Melo
Abstract:
This paper analyzes the convergence rates of the Frank-Wolfe method for solving convex constrained multiobjective optimization. We establish improved convergence rates under different assumptions on the objective function, the feasible set, and the localization of the limit point of the sequence generated by the method. In terms of the objective function values, we firstly show that if the objective function is strongly convex and the limit point of the sequence generated by the method lies in the relative interior of the feasible set, then the algorithm achieves a linear convergence rate. Next, we focus on a special class of problems where the feasible constraint set is $(\alpha,q)$-uniformly convex for some $\alpha>0$ and $q \geq 2$, including, in particular, $\ell_p$-balls for all $p>1$. In this context, we prove that the method attains: (i) a rate of $\mathcal{O}(1/k^{\frac{q}{q-1}})$ when the objective function is strongly convex; and (ii) a linear rate (if $q=2$) or a rate of $\mathcal{O}(1/k^{\frac{q}{q-2}})$ (if $q>2$) under an additional assumption, which always holds if the feasible set does not contain an unconstrained weak Pareto point. We also discuss enhanced convergence rates for the algorithm in terms of an optimality measure. Finally, we provide some simple examples to illustrate the convergence rates and the set of assumptions.
Submitted 10 June, 2024;
originally announced June 2024.
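For orientation, the following is a minimal sketch of the classical single-objective Frank-Wolfe iteration on an $\ell_2$-ball, a standard example of a uniformly convex set. It is not the multiobjective method analyzed in the paper. The function name `frank_wolfe_l2_ball`, the step size $2/(k+2)$, and the test problem are illustrative assumptions.

```python
import math

def frank_wolfe_l2_ball(grad, x0, radius=1.0, iters=200):
    """Classical Frank-Wolfe on the l2-ball {x : ||x|| <= radius}.

    grad: callable returning the gradient of the objective at x (a list).
    This is the single-objective textbook scheme, used only as an illustration.
    """
    x = list(x0)
    for k in range(iters):
        g = grad(x)
        norm = math.sqrt(sum(gi * gi for gi in g))
        if norm == 0.0:
            break  # stationary point of the unconstrained problem
        # Linear minimization oracle: argmin over the ball of <g, s>.
        s = [-radius * gi / norm for gi in g]
        gamma = 2.0 / (k + 2.0)  # standard open-loop step size
        x = [xi + gamma * (si - xi) for xi, si in zip(x, s)]
    return x

# Hypothetical test problem: minimize 0.5 * ||x - b||^2 over the unit ball,
# with b outside the ball, so the minimizer is b / ||b||.
b = [3.0, 4.0]
x_star = frank_wolfe_l2_ball(lambda x: [xi - bi for xi, bi in zip(x, b)], [0.0, 0.0])
```

Here the unconstrained minimizer lies outside the feasible set, so the solution sits on the boundary of the ball, which is the setting in which the curvature of the set matters.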
-
A Relative Inexact Proximal Gradient Method With an Explicit Linesearch
Authors:
Yunier Bello-Cruz,
Max L. N. Gonçalves,
Jefferson G. Melo,
Cassandra Mohr
Abstract:
This paper presents and investigates an inexact proximal gradient method for solving composite convex optimization problems characterized by an objective function composed of a sum of a full domain differentiable convex function and a non-differentiable convex function. We introduce an explicit linesearch strategy that requires only a relative inexact solution of the proximal subproblem per iteration. We prove the convergence of the sequence generated by our scheme and establish its iteration complexity, considering both the functional values and a residual associated with first-order stationary solutions. Additionally, we provide numerical experiments to illustrate the practical efficacy of our method.
Submitted 16 April, 2024;
originally announced April 2024.
-
Quadruplet Loss For Improving the Robustness to Face Morphing Attacks
Authors:
Iurii Medvedev,
Nuno Gonçalves
Abstract:
Recent advancements in deep learning have revolutionized technology and security measures, necessitating robust identification methods. Biometric approaches, leveraging personalized characteristics, offer a promising solution. However, Face Recognition Systems are vulnerable to sophisticated attacks, notably face morphing techniques, enabling the creation of fraudulent documents. In this study, we introduce a novel quadruplet loss function for increasing the robustness of face recognition systems against morphing attacks. Our approach involves specific sampling of face image quadruplets, combined with face morphs, for network training. Experimental results demonstrate the efficiency of our strategy in improving the robustness of face recognition networks against morphing attacks.
Submitted 22 February, 2024;
originally announced February 2024.
-
Fused Classification For Differential Face Morphing Detection
Authors:
Iurii Medvedev,
Joana Pimenta,
Nuno Gonçalves
Abstract:
Face morphing, a sophisticated presentation attack technique, poses significant security risks to face recognition systems. Traditional methods struggle to detect morphing attacks, which involve blending multiple face images to create a synthetic image that can match different individuals. In this paper, we focus on the differential detection of face morphing and propose an extended approach based on fused classification method for no-reference scenario. We introduce a public face morphing detection benchmark for the differential scenario and utilize a specific data mining technique to enhance the performance of our approach. Experimental results demonstrate the effectiveness of our method in detecting morphing attacks.
Submitted 1 September, 2023;
originally announced September 2023.
-
Impact of Image Context for Single Deep Learning Face Morphing Attack Detection
Authors:
Joana Pimenta,
Iurii Medvedev,
Nuno Gonçalves
Abstract:
The increase in security concerns due to technological advancements has led to the popularity of biometric approaches that utilize physiological or behavioral characteristics for enhanced recognition. Face recognition systems (FRSs) have become prevalent, but they are still vulnerable to image manipulation techniques such as face morphing attacks. This study investigates the impact of the alignment settings of input images on deep learning face morphing detection performance. We analyze the interconnections between the face contour and image context and suggest optimal alignment conditions for face morphing detection.
Submitted 1 September, 2023;
originally announced September 2023.
-
Neural Implicit Morphing of Face Images
Authors:
Guilherme Schardong,
Tiago Novello,
Hallison Paz,
Iurii Medvedev,
Vinícius da Silva,
Luiz Velho,
Nuno Gonçalves
Abstract:
Face morphing is a problem in computer graphics with numerous artistic and forensic applications. It is challenging due to variations in pose, lighting, gender, and ethnicity. This task consists of a warping for feature alignment and a blending for a seamless transition between the warped images. We propose to leverage coord-based neural networks to represent such warpings and blendings of face images. During training, we exploit the smoothness and flexibility of such networks by combining energy functionals employed in classical approaches without discretizations. Additionally, our method is time-dependent, allowing a continuous warping/blending of the images. During morphing inference, we need both direct and inverse transformations of the time-dependent warping. The first (second) is responsible for warping the target (source) image into the source (target) image. Our neural warping stores those maps in a single network dismissing the need for inverting them. The results of our experiments indicate that our method is competitive with both classical and generative models under the lens of image quality and face-morphing detectors. Aesthetically, the resulting images present a seamless blending of diverse faces not yet usual in the literature.
Submitted 13 June, 2024; v1 submitted 26 August, 2023;
originally announced August 2023.
-
The Segment Anything Model (SAM) for Remote Sensing Applications: From Zero to One Shot
Authors:
Lucas Prado Osco,
Qiusheng Wu,
Eduardo Lopes de Lemos,
Wesley Nunes Gonçalves,
Ana Paula Marques Ramos,
Jonathan Li,
José Marcato Junior
Abstract:
Segmentation is an essential step for remote sensing image processing. This study aims to advance the application of the Segment Anything Model (SAM), an innovative image segmentation model by Meta AI, in the field of remote sensing image analysis. SAM is known for its exceptional generalization capabilities and zero-shot learning, making it a promising approach to processing aerial and orbital images from diverse geographical contexts. Our exploration involved testing SAM across multi-scale datasets using various input prompts, such as bounding boxes, individual points, and text descriptors. To enhance the model's performance, we implemented a novel automated technique that combines a text-prompt-derived general example with one-shot training. This adjustment resulted in an improvement in accuracy, underscoring SAM's potential for deployment in remote sensing imagery and reducing the need for manual annotation. Despite the limitations encountered with lower spatial resolution images, SAM exhibits promising adaptability to remote sensing data analysis. We recommend future research to enhance the model's proficiency through integration with supplementary fine-tuning techniques and other networks. Furthermore, we provide the open-source code of our modifications on online repositories, encouraging further and broader adaptations of SAM to the remote sensing domain.
Submitted 31 October, 2023; v1 submitted 28 June, 2023;
originally announced June 2023.
-
MTLSegFormer: Multi-task Learning with Transformers for Semantic Segmentation in Precision Agriculture
Authors:
Diogo Nunes Goncalves,
Jose Marcato Junior,
Pedro Zamboni,
Hemerson Pistori,
Jonathan Li,
Keiller Nogueira,
Wesley Nunes Goncalves
Abstract:
Multi-task learning has proven to be effective in improving the performance of correlated tasks. Most of the existing methods use a backbone to extract initial features with independent branches for each task, and the exchange of information between the branches usually occurs through the concatenation or sum of the feature maps of the branches. However, this type of information exchange does not directly consider the local characteristics of the image nor the level of importance or correlation between the tasks. In this paper, we propose a semantic segmentation method, MTLSegFormer, which combines multi-task learning and attention mechanisms. After the backbone feature extraction, two feature maps are learned for each task. The first map is proposed to learn features related to its task, while the second map is obtained by applying learned visual attention to locally re-weigh the feature maps of the other tasks. In this way, weights are assigned to local regions of the image of other tasks that have greater importance for the specific task. Finally, the two maps are combined and used to solve a task. We tested the performance in two challenging problems with correlated tasks and observed a significant improvement in accuracy, mainly in tasks with high dependence on the others.
Submitted 4 May, 2023;
originally announced May 2023.
-
The Potential of Visual ChatGPT For Remote Sensing
Authors:
Lucas Prado Osco,
Eduardo Lopes de Lemos,
Wesley Nunes Gonçalves,
Ana Paula Marques Ramos,
José Marcato Junior
Abstract:
Recent advancements in Natural Language Processing (NLP), particularly in Large Language Models (LLMs), associated with deep learning-based computer vision techniques, have shown substantial potential for automating a variety of tasks. One notable model is Visual ChatGPT, which combines ChatGPT's LLM capabilities with visual computation to enable effective image analysis. The model's ability to process images based on textual inputs can revolutionize diverse fields. However, its application in the remote sensing domain remains unexplored. This is the first paper to examine the potential of Visual ChatGPT, a cutting-edge LLM founded on the GPT architecture, to tackle the aspects of image processing related to the remote sensing domain. Among its current capabilities, Visual ChatGPT can generate textual descriptions of images, perform canny edge and straight line detection, and conduct image segmentation. These offer valuable insights into image content and facilitate the interpretation and extraction of information. By exploring the applicability of these techniques within publicly available datasets of satellite images, we demonstrate the current model's limitations in dealing with remote sensing images, highlighting its challenges and future prospects. Although still in early development, we believe that the combination of LLMs and visual models holds a significant potential to transform remote sensing image processing, creating accessible and practical application opportunities in the field.
Submitted 5 July, 2023; v1 submitted 25 April, 2023;
originally announced April 2023.
-
Ten Quick Tips for Harnessing the Power of ChatGPT/GPT-4 in Computational Biology
Authors:
Tiago Lubiana,
Rafael Lopes,
Pedro Medeiros,
Juan Carlo Silva,
Andre Nicolau Aquime Goncalves,
Vinicius Maracaja-Coutinho,
Helder I Nakaya
Abstract:
The rise of advanced chatbots, such as ChatGPT, has sparked curiosity in the scientific community. ChatGPT is a general-purpose chatbot powered by large language models (LLMs) GPT-3.5 and GPT-4, with the potential to impact numerous fields, including computational biology. In this article, we offer ten tips based on our experience with ChatGPT to assist computational biologists in optimizing their workflows. We have collected relevant prompts and reviewed the nascent literature in the field, compiling tips we project to remain pertinent for future ChatGPT and LLM iterations, ranging from code refactoring to scientific writing to prompt engineering. We hope our work will help bioinformaticians to complement their workflows while staying aware of the various implications of using this technology. Additionally, to track new and creative applications for bioinformatics tools such as ChatGPT, we have established a GitHub repository at https://github.com/csbl-br/awesome-compbio-chatgpt. Our belief is that ethical adherence to ChatGPT and other LLMs will increase the efficiency of computational biologists, ultimately advancing the pace of scientific discovery in the life sciences.
Submitted 28 March, 2023;
originally announced March 2023.
-
RADAM: Texture Recognition through Randomized Aggregated Encoding of Deep Activation Maps
Authors:
Leonardo Scabini,
Kallil M. Zielinski,
Lucas C. Ribas,
Wesley N. Gonçalves,
Bernard De Baets,
Odemir M. Bruno
Abstract:
Texture analysis is a classical yet challenging task in computer vision for which deep neural networks are actively being applied. Most approaches are based on building feature aggregation modules around a pre-trained backbone and then fine-tuning the new architecture on specific texture recognition tasks. Here we propose a new method named Random encoding of Aggregated Deep Activation Maps (RADAM) which extracts rich texture representations without ever changing the backbone. The technique consists of encoding the output at different depths of a pre-trained deep convolutional network using a Randomized Autoencoder (RAE). The RAE is trained locally to each image using a closed-form solution, and its decoder weights are used to compose a 1-dimensional texture representation that is fed into a linear SVM. This means that no fine-tuning or backpropagation is needed. We explore RADAM on several texture benchmarks and achieve state-of-the-art results with different computational budgets. Our results suggest that pre-trained backbones may not require additional fine-tuning for texture recognition if their learned representations are better encoded.
Submitted 8 March, 2023;
originally announced March 2023.
-
Ultrarelativistic electron beams accelerated by terawatt scalable kHz laser
Authors:
C. M. Lazzarini,
G. M. Grittani,
P. Valenta,
I. Zymak,
R. Antipenkov,
U. Chaulagain,
L. V. N. Goncalves,
A. Grenfell,
M. Lamac,
S. Lorenz,
M. Nevrkla,
V. Sobr,
A. Spacek,
W. Szuba,
P. Bakule,
G. Korn,
S. V. Bulanov
Abstract:
We show the laser-driven acceleration of unprecedented, collimated ($ 2 \ \mathrm{mrad} $ divergence), and quasi-monoenergetic ($ 25 \ \% $ energy spread) electron beams with energy up to $ 50 \ \mathrm{MeV} $ at $ 1 \ \mathrm{kHz} $ repetition rate. The laser driver is a multi-cycle ($ 15 \ \mathrm{fs} $) $ 1 \ \mathrm{kHz} $ optical parametric chirped pulse amplification (OPCPA) system, operating at $ 26 \ \mathrm{mJ} $ ($ 1.7 \ \mathrm{TW} $). The scalability of the driver laser technology and the electron beams reported in this work pave the way towards developing high-brilliance x-ray sources for medical imaging, innovative devices for brain cancer treatment, and represent a step towards the realization of a kHz GeV electron beamline.
Submitted 29 May, 2024; v1 submitted 22 February, 2023;
originally announced February 2023.
-
Implementation and Performance Analysis of a Low Resolution OFDM System Prototype With Low Cost Hardware
Authors:
Eder O. de Souza,
João T. Dias,
Demerson N. Gonçalves
Abstract:
The present work focuses on the implementation and performance analysis of a low-resolution OFDM system prototype with low-cost hardware. A software-defined radio (SDR) system was chosen for this implementation due to its various advantages over a traditional radio system. Among the available SDR devices, universal software radio peripherals (USRPs) were avoided due to their high cost, despite their popularity in this field of research. Instead, a combination of two low-cost SDRs, "Hackrf One" and "RTL-SDR Blog V3", was used with GNU Radio, a popular free and open-source radio software. Thus, it was possible to emulate the behavior of a low-resolution ADC in the receiver, characterize its performance, and estimate its energy savings. This allowed us to determine the feasibility of building a component with the analog-to-digital conversion function with few bits of resolution. We conclude that the performance of an ADC with at least 5 bits of resolution is reasonable and that this reduction in the number of bits, in comparison to an 8-bit ADC, represents a considerable energy saving.
Submitted 9 November, 2023; v1 submitted 11 February, 2023;
originally announced February 2023.
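As a rough illustration of what emulating a low-resolution ADC can mean, the sketch below applies a uniform mid-rise quantizer to a sampled sine wave and compares the signal-to-quantization-noise ratio (SQNR) at 5 and 8 bits. It is not the authors' GNU Radio setup. The helper names `quantize` and `sqnr_db`, the test tone, and the full-scale range are assumptions made for this example.

```python
import math

def quantize(samples, bits, full_scale=1.0):
    """Uniform mid-rise quantizer with 2**bits levels over [-full_scale, full_scale)."""
    levels = 2 ** bits
    step = 2.0 * full_scale / levels
    out = []
    for s in samples:
        index = math.floor(s / step)
        # Clamp to the valid code range so out-of-range inputs saturate.
        index = max(-levels // 2, min(levels // 2 - 1, index))
        out.append((index + 0.5) * step)  # reconstruct at the cell midpoint
    return out

def sqnr_db(samples, quantized):
    """Signal-to-quantization-noise ratio in dB."""
    signal = sum(s * s for s in samples)
    noise = sum((s - q) ** 2 for s, q in zip(samples, quantized))
    return 10.0 * math.log10(signal / noise)

# Hypothetical test tone: near-full-scale sine at a frequency that is not a
# simple fraction of the sample rate, so the quantization error is well spread.
tone = [0.99 * math.sin(2.0 * math.pi * 0.01237 * n) for n in range(4096)]
sqnr5 = sqnr_db(tone, quantize(tone, 5))
sqnr8 = sqnr_db(tone, quantize(tone, 8))
```

For an ideal quantizer driven by a full-scale sine, the usual rule of thumb is about 6.02 dB of SQNR per bit plus 1.76 dB, so going from 8 to 5 bits costs roughly 18 dB. Energy consumption is not modeled here.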
-
Young Labeled Faces in the Wild (YLFW): A Dataset for Children Faces Recognition
Authors:
Iurii Medvedev,
Farhad Shadmand,
Nuno Gonçalves
Abstract:
Face recognition has achieved outstanding performance in the last decade with the development of deep learning techniques.
Nowadays, the challenges in face recognition are related to specific scenarios, for instance, the performance under diverse image quality, the robustness for aging and edge cases of person age (children and elders), distinguishing of related identities.
In this set of problems, recognizing children's faces is one of the most sensitive and important. One of the reasons for this problem is the existing bias towards adults in existing face datasets.
In this work, we present a benchmark dataset for children's face recognition, which is compiled similarly to the famous face recognition benchmarks LFW, CALFW, CPLFW, XQLFW and AgeDB.
We also present a development dataset (separated into train and test parts) for adapting face recognition models for face images of children.
The proposed data is balanced for African, Asian, Caucasian, and Indian races. To the best of our knowledge, this is the first standardized data tool set for benchmarking and the largest collection for development for children's face recognition. Several face recognition experiments are presented to demonstrate the performance of the proposed data tool set.
Submitted 13 January, 2023;
originally announced January 2023.
-
MorDeephy: Face Morphing Detection Via Fused Classification
Authors:
Iurii Medvedev,
Farhad Shadmand,
Nuno Gonçalves
Abstract:
Face morphing attack detection (MAD) is one of the most challenging tasks in the field of face recognition nowadays. In this work, we introduce a novel deep learning strategy for a single image face morphing detection, which implies the discrimination of morphed face images along with a sophisticated face recognition task in a complex classification scheme. It is directed onto learning the deep facial features, which carry information about the authenticity of these features. Our work also introduces several additional contributions: the public and easy-to-use face morphing detection benchmark and the results of our wild datasets filtering strategy. Our method, which we call MorDeephy, achieved the state of the art performance and demonstrated a prominent ability for generalising the task of morphing detection to unseen scenarios.
Submitted 5 August, 2022;
originally announced August 2022.
-
Energy Efficiency of Web Browsers in the Android Ecosystem
Authors:
Nélson Gonçalves,
Rui Rua,
Jácome Cunha,
Rui Pereira,
João Saraiva
Abstract:
This paper presents an empirical study regarding the energy consumption of the most used web browsers on the Android ecosystem. In order to properly compare the web browsers in terms of energy consumption, we defined a set of typical usage scenarios to be replicated in the different browsers, executed in the same testing environment and conditions. The results of our study show that there are significant differences in terms of energy consumption among the considered browsers. Furthermore, we conclude that some browsers are energy efficient in several user actions, but energy greedy in other ones, allowing us to conclude that no browser is universally more efficient for all usage scenarios.
Submitted 23 May, 2022;
originally announced May 2022.
-
Poisson-Birnbaum-Saunders Regression Model for Clustered Count Data
Authors:
Jussiane Nader Gonçalves,
Wagner Barreto-Souza,
Hernando Ombao
Abstract:
The premise of independence among subjects in the same cluster/group often fails in practice, and models that rely on such untenable assumption can produce misleading results. To overcome this severe deficiency, we introduce a new regression model to handle overdispersed and correlated clustered counts. To account for correlation within clusters, we propose a Poisson regression model where the observations within the same cluster are driven by the same latent random effect that follows the Birnbaum-Saunders distribution with a parameter that controls the strength of dependence among the individuals. This novel multivariate count model is called Clustered Poisson Birnbaum-Saunders (CPBS) regression. As illustrated in this paper, the CPBS model is analytically tractable, and its moment structure can be explicitly obtained. Estimation of parameters is performed through the maximum likelihood method, and an Expectation-Maximization (EM) algorithm is also developed. Simulation results to evaluate the finite-sample performance of our proposed estimators are presented. We also discuss diagnostic tools for checking model adequacy. An empirical application concerning the number of inpatient admissions by individuals to hospital emergency rooms, from the Medical Expenditure Panel Survey (MEPS) conducted by the United States Agency for Health Research and Quality, illustrates the usefulness of our proposed methodology.
Submitted 21 February, 2022;
originally announced February 2022.
-
Reducing Overconfidence Predictions for Autonomous Driving Perception
Authors:
Gledson Melotti,
Cristiano Premebida,
Jordan J. Bird,
Diego R. Faria,
Nuno Gonçalves
Abstract:
In state-of-the-art deep learning for object recognition, SoftMax and Sigmoid functions are most commonly employed as the predictor outputs. Such layers often produce overconfident predictions rather than proper probabilistic scores, which can thus harm the decision-making of `critical' perception systems applied in autonomous driving and robotics. Given this, the experiments in this work propose a probabilistic approach based on distributions calculated out of the Logit layer scores of pre-trained networks. We demonstrate that Maximum Likelihood (ML) and Maximum a-Posteriori (MAP) functions are more suitable for probabilistic interpretations than SoftMax and Sigmoid-based predictions for object recognition. We explore distinct sensor modalities via RGB images and LiDARs (RV: range-view) data from the KITTI and Lyft Level-5 datasets, where our approach shows promising performance compared to the usual SoftMax and Sigmoid layers, with the benefit of enabling interpretable probabilistic predictions. Another advantage of the approach introduced in this paper is that the ML and MAP functions can be implemented in existing trained networks, that is, the approach benefits from the output of the Logit layer of pre-trained networks. Thus, there is no need to carry out a new training phase since the ML and MAP functions are used in the test/prediction phase.
Submitted 11 May, 2022; v1 submitted 15 February, 2022;
originally announced February 2022.
-
Rotation signature of TESS B-type stars. A comprehensive analysis
Authors:
L. F. Barraza,
R. L. Gomes,
Y. S. Messias,
I. C. Leão,
L. A. Almeida,
E. Janot-Pacheco,
A. C. Brito,
F. A. C. Brito,
J. V. Santana,
N. S. Gonçalves,
M. L. das Chagas,
M. A. Teixeira,
J. R. De Medeiros,
B. L. Canto Martins
Abstract:
Stellar rotation is a fundamental observable that drives different aspects of stellar and planetary evolution. In this work, we present an unprecedented manifold analysis of 160 B-type stars with light curves collected by the TESS space mission using three different procedures (Fast Fourier Transform, Lomb-Scargle, and wavelet techniques), accompanied by rigorous visual inspection in the search for rotation periodicities. This effort provides rotational periodicities for 6 new TESS B-type stars and confirmed periodicities for 22 targets with rotation periods previously listed in the literature. For another 61 stars, already classified as possible rotational variables, we identify noisy, pulsational, binarity, or ambiguous variability behavior rather than rotation signatures. The total sample of 28 potential rotators shows an overlap of different classes of rotational variables, composed of $α^2$ Canum Venaticorum, rotating ellipsoidal and SX Arietis stars. The combination of the three techniques applied in our analysis offers a solid path to overcome the challenges in the discrimination of rotation from other variabilities in stellar light curves, such as pulsation, binarity or other effects that have no physical meaning. Finally, the rotational periodicities reported in the present study may represent important constraints for improving stellar evolution models with rotation, as well as asteroseismic studies of hot stars.
Submitted 2 February, 2022;
originally announced February 2022.
-
Stepwise Migration of a Monolith to a Microservices Architecture: Performance and Migration Effort Evaluation
Authors:
Diogo Faustino,
Nuno Gonçalves,
Manuel Portela,
António Rito Silva
Abstract:
The agility inherent to today's business promotes the definition of software architectures where the business entities are decoupled into modules and/or services. However, there are advantages in having a rich domain model, where domain entities are tightly connected, because it fosters an initial quick development. On the other hand, the split of the business logic into modules and/or services, its encapsulation through well-defined interfaces and the introduction of inter-service communication introduce a cost in terms of performance. In this paper we analyze the stepwise migration of a monolith, using a rich domain object, into a microservice architecture, where a modular monolith architecture is used as an intermediate step. The impact on the migration effort and on performance is measured for both steps. Current state of the art analyses the migration of monolith systems to a microservices architecture, but we observed that migration effort and performance issues are already significant in the migration to a modular monolith. Therefore, a clear distinction is established for each one of the steps, which may inform software architects on the planning of the migration of monolith systems, in particular on the trade-offs between carrying out the whole migration process and migrating only to a modular monolith.
Submitted 17 January, 2022;
originally announced January 2022.
-
Probabilistic Approach for Road-Users Detection
Authors:
G. Melotti,
W. Lu,
P. Conde,
D. Zhao,
A. Asvadi,
N. Gonçalves,
C. Premebida
Abstract:
Object detection in autonomous driving applications implies the detection and tracking of semantic objects that are commonly native to urban driving environments, such as pedestrians and vehicles. One of the major challenges in state-of-the-art deep-learning based object detection is false positives that occur with overconfident scores. This is highly undesirable in autonomous driving and other critical robotic-perception domains because of safety concerns. This paper proposes an approach to alleviate the problem of overconfident predictions by introducing a novel probabilistic layer to deep object detection networks in testing. The suggested approach avoids the traditional Sigmoid or Softmax prediction layer which often produces overconfident predictions. It is demonstrated that the proposed technique reduces overconfidence in the false positives without degrading the performance on the true positives. The approach is validated on 2D-KITTI object detection with YOLOV4 and SECOND (a Lidar-based detector). The proposed approach enables interpretable probabilistic predictions without the requirement of re-training the network and therefore is very practical.
Submitted 21 April, 2023; v1 submitted 2 December, 2021;
originally announced December 2021.
-
Abordagem probabilística para análise de confiabilidade de dados gerados em sequenciamentos multiplex na plataforma ABI SOLiD
Authors:
Fabio M. F. Lobato,
Carlos D. N. Damasceno,
Péricles L. Machado,
Nandamudi L. Vijaykumar,
André R. dos Santos,
Sylvain H. Darnet,
André N. A. Gonçalves,
Dayse O. de Alencar,
Ádamo L. de Santana
Abstract:
The next-generation sequencers such as Illumina and SOLiD platforms generate a large amount of data, commonly above 10 Gigabytes of text files. Particularly, the SOLiD platform allows the sequencing of multiple samples in a single run, called multiplex run, through a tagging system called Barcode. This feature requires a computational process for separation of the data sample because the sequencer provides a mixture of all samples in a single output. This process must be secure to avoid any harm that may scramble further analysis. In this context, we realized the need to develop a probabilistic model capable of assigning a degree of confidence in the marking system used in multiplex sequencing. The results confirmed the adequacy of the model obtained, which makes it possible, among other things, to guide the data filtering process and the evaluation of the sequencing protocol used.
Submitted 11 August, 2021; v1 submitted 27 July, 2021;
originally announced July 2021.
-
Semantic Segmentation with Labeling Uncertainty and Class Imbalance
Authors:
Patrik Olã Bressan,
José Marcato Junior,
José Augusto Correa Martins,
Diogo Nunes Gonçalves,
Daniel Matte Freitas,
Lucas Prado Osco,
Jonathan de Andrade Silva,
Zhipeng Luo,
Jonathan Li,
Raymundo Cordero Garcia,
Wesley Nunes Gonçalves
Abstract:
Recently, methods based on Convolutional Neural Networks (CNN) achieved impressive success in semantic segmentation tasks. However, challenges such as the class imbalance and the uncertainty in the pixel-labeling process are not completely addressed. As such, we present a new approach that calculates a weight for each pixel considering its class and uncertainty during the labeling process. The pixel-wise weights are used during training to increase or decrease the importance of the pixels. Experimental results show that the proposed approach leads to significant improvements in three challenging segmentation tasks in comparison to baseline methods. It was also proved to be more invariant to noise. The approach presented here may be used within a wide range of semantic segmentation methods to improve their robustness.
Submitted 8 February, 2021;
originally announced February 2021.
-
Counting and Locating High-Density Objects Using Convolutional Neural Network
Authors:
Mauro dos Santos de Arruda,
Lucas Prado Osco,
Plabiany Rodrigo Acosta,
Diogo Nunes Gonçalves,
José Marcato Junior,
Ana Paula Marques Ramos,
Edson Takashi Matsubara,
Zhipeng Luo,
Jonathan Li,
Jonathan de Andrade Silva,
Wesley Nunes Gonçalves
Abstract:
This paper presents a Convolutional Neural Network (CNN) approach for counting and locating objects in high-density imagery. To the best of our knowledge, this is the first object counting and locating method based on a feature map enhancement and a Multi-Stage Refinement of the confidence map. The proposed method was evaluated in two counting datasets: tree and car. For the tree dataset, our method returned a mean absolute error (MAE) of 2.05, a root-mean-squared error (RMSE) of 2.87 and a coefficient of determination (R$^2$) of 0.986. For the car dataset (CARPK and PUCPR+), our method was superior to state-of-the-art methods. On these datasets, our approach achieved an MAE of 4.45 and 3.16, an RMSE of 6.18 and 4.39, and an R$^2$ of 0.975 and 0.999, respectively. The proposed method is suitable for dealing with high object-density, returning a state-of-the-art performance for counting and locating objects.
Submitted 8 February, 2021;
originally announced February 2021.
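The abstract above reports counting quality as MAE, RMSE and R$^2$. As a minimal sketch of how such figures are typically computed from per-image true and predicted counts (the function names are illustrative, and R$^2$ is assumed here to be 1 - SS_res/SS_tot; the paper may instead use the squared correlation coefficient):

```python
import math

def mae(y_true, y_pred):
    """Mean absolute error between true and predicted counts."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    """Root-mean-squared error between true and predicted counts."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

def r2(y_true, y_pred):
    """Coefficient of determination, assumed to be 1 - SS_res / SS_tot."""
    mean_t = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

# Hypothetical per-image object counts, for illustration only.
true_counts = [10, 20, 30]
pred_counts = [12, 18, 30]
print(mae(true_counts, pred_counts), rmse(true_counts, pred_counts), r2(true_counts, pred_counts))
```

MAE and RMSE are in units of objects per image, while R$^2$ is dimensionless, which is why the three are usually reported together.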
-
A Deep Learning Approach Based on Graphs to Detect Plantation Lines
Authors:
Diogo Nunes Gonçalves,
Mauro dos Santos de Arruda,
Hemerson Pistori,
Vanessa Jordão Marcato Fernandes,
Ana Paula Marques Ramos,
Danielle Elis Garcia Furuya,
Lucas Prado Osco,
Hongjie He,
Jonathan Li,
José Marcato Junior,
Wesley Nunes Gonçalves
Abstract:
Deep learning-based networks are among the most prominent methods to learn linear patterns and extract this type of information from diverse imagery conditions. Here, we propose a deep learning approach based on graphs to detect plantation lines in UAV-based RGB imagery presenting a challenging scenario containing spaced plants. The first module of our method extracts a feature map throughout the backbone, which consists of the initial layers of the VGG16. This feature map is used as an input to the Knowledge Estimation Module (KEM), organized in three concatenated branches for detecting 1) the plant positions, 2) the plantation lines, and 3) for the displacement vectors between the plants. A graph modeling is applied considering each plant position on the image as vertices, and edges are formed between two vertices (i.e. plants). Finally, the edge is classified as pertaining to a certain plantation line based on three probabilities (higher than 0.5): i) in visual features obtained from the backbone; ii) a chance that the edge pixels belong to a line, from the KEM step; and iii) an alignment of the displacement vectors with the edge, also from KEM. Experiments were conducted in corn plantations with different growth stages and patterns with aerial RGB imagery. A total of 564 patches with 256 x 256 pixels were used and randomly divided into training, validation, and testing sets in a proportion of 60\%, 20\%, and 20\%, respectively. The proposed method was compared against state-of-the-art deep learning methods, and achieved superior performance with a significant margin, returning precision, recall, and F1-score of 98.7\%, 91.9\%, and 95.1\%, respectively. This approach is useful in extracting lines with spaced plantation patterns and could be implemented in scenarios where plantation gaps occur, generating lines with few-to-none interruptions.
Submitted 5 February, 2021;
originally announced February 2021.
-
A Review on Deep Learning in UAV Remote Sensing
Authors:
Lucas Prado Osco,
José Marcato Junior,
Ana Paula Marques Ramos,
Lúcio André de Castro Jorge,
Sarah Narges Fatholahi,
Jonathan de Andrade Silva,
Edson Takashi Matsubara,
Hemerson Pistori,
Wesley Nunes Gonçalves,
Jonathan Li
Abstract:
Deep Neural Networks (DNNs) learn representation from data with an impressive capability, and brought important breakthroughs for processing images, time-series, natural language, audio, video, and many others. In the remote sensing field, surveys and literature revisions specifically involving DNNs algorithms' applications have been conducted in an attempt to summarize the amount of information produced in its subfields. Recently, Unmanned Aerial Vehicles (UAV) based applications have dominated aerial sensing research. However, a literature revision that combines both "deep learning" and "UAV remote sensing" thematics has not yet been conducted. The motivation for our work was to present a comprehensive review of the fundamentals of Deep Learning (DL) applied in UAV-based imagery. We focused mainly on describing classification and regression techniques used in recent applications with UAV-acquired data. For that, a total of 232 papers published in international scientific journal databases was examined. We gathered the published material and evaluated their characteristics regarding application, sensor, and technique used. We relate how DL presents promising results and has the potential for processing tasks associated with UAV-based image data. Lastly, we project future perspectives, commentating on prominent DL paths to be explored in the UAV remote sensing field. Our revision consists of a friendly-approach to introduce, commentate, and summarize the state-of-the-art in UAV-based image applications with DNNs algorithms in diverse subfields of remote sensing, grouping it in the environmental, urban, and agricultural contexts.
Submitted 20 August, 2023; v1 submitted 22 January, 2021;
originally announced January 2021.
-
A CNN Approach to Simultaneously Count Plants and Detect Plantation-Rows from UAV Imagery
Authors:
Lucas Prado Osco,
Mauro dos Santos de Arruda,
Diogo Nunes Gonçalves,
Alexandre Dias,
Juliana Batistoti,
Mauricio de Souza,
Felipe David Georges Gomes,
Ana Paula Marques Ramos,
Lúcio André de Castro Jorge,
Veraldo Liesenberg,
Jonathan Li,
Lingfei Ma,
José Marcato Junior,
Wesley Nunes Gonçalves
Abstract:
In this paper, we propose a novel deep learning method based on a Convolutional Neural Network (CNN) that simultaneously detects and geolocates plantation-rows while counting its plants considering highly-dense plantation configurations. The experimental setup was evaluated in a cornfield with different growth stages and in a Citrus orchard. Both datasets characterize different plant density scenarios, locations, types of crops, sensors, and dates. A two-branch architecture was implemented in our CNN method, where the information obtained within the plantation-row is updated into the plant detection branch and retro-feed to the row branch; which are then refined by a Multi-Stage Refinement method. In the corn plantation datasets (with both growth phases, young and mature), our approach returned a mean absolute error (MAE) of 6.224 plants per image patch, a mean relative error (MRE) of 0.1038, precision and recall values of 0.856, and 0.905, respectively, and an F-measure equal to 0.876. These results were superior to the results from other deep networks (HRNet, Faster R-CNN, and RetinaNet) evaluated with the same task and dataset. For the plantation-row detection, our approach returned precision, recall, and F-measure scores of 0.913, 0.941, and 0.925, respectively. To test the robustness of our model with a different type of agriculture, we performed the same task in the citrus orchard dataset. It returned an MAE equal to 1.409 citrus-trees per patch, MRE of 0.0615, precision of 0.922, recall of 0.911, and F-measure of 0.965. For citrus plantation-row detection, our approach resulted in precision, recall, and F-measure scores equal to 0.965, 0.970, and 0.964, respectively. The proposed method achieved state-of-the-art performance for counting and geolocating plants and plant-rows in UAV images from different types of crops.
Submitted 14 February, 2021; v1 submitted 31 December, 2020;
originally announced December 2020.
-
From Time Series to Euclidean Spaces: On Spatial Transformations for Temporal Clustering
Authors:
Nuno Mota Goncalves,
Ioana Giurgiu,
Anika Schumann
Abstract:
Unsupervised clustering of temporal data is both challenging and crucial in machine learning. In this paper, we show that neither traditional clustering methods nor time-series-specific or even deep learning-based alternatives generalise well when both varying sampling rates and high dimensionality are present in the input data. We propose a novel approach to temporal clustering, in which we (1) transform the input time series into a distance-based projected representation by using similarity measures suitable for dealing with temporal data, (2) feed these projections into a multi-layer CNN-GRU autoencoder to generate meaningful domain-aware latent representations, which ultimately (3) allow for a natural separation of clusters beneficial for most important traditional clustering algorithms. We evaluate our approach on time series datasets from various domains and show that it not only outperforms existing methods in all cases, by up to 32%, but is also robust and incurs negligible computation overheads.
Submitted 2 October, 2020;
originally announced October 2020.
-
The exponent of the non-abelian tensor square and related constructions of $p$-groups
Authors:
R. Bastos,
E. de Melo,
N. Gonçalves,
C. Monetta
Abstract:
Let $G$ be a finite $p$-group. In this paper we obtain bounds for the exponent of the non-abelian tensor square $G \otimes G$ and of $ν(G)$, which is a certain extension of $G \otimes G$ by $G \times G$. In particular, we bound $\exp(ν(G))$ in terms of $\exp(ν(G/N))$ and $\exp(N)$ when $G$ admits some specific normal subgroup $N$. We also establish bounds for $\exp(G \otimes G)$ in terms of $\exp(G)$ and either the nilpotency class or the coclass of the group $G$, improving some existing bounds.
Submitted 30 March, 2021; v1 submitted 21 July, 2020;
originally announced July 2020.
-
Physical pendulum model: Fractional differential equation and memory effects
Authors:
L. N. Gonçalves,
J. C. Fernandes,
A. Ferraz,
A. G. Silva,
P. J. Sebastião
Abstract:
A detailed analysis of three pendular motion models is presented. Inertial effects, self-oscillation, and memory, together with non-constant moment of inertia, hysteresis and negative damping are shown to be required for the comprehensive description of the free pendulum oscillatory regime. The effects of very high initial amplitudes, friction in the roller bearing axle, drag, and pendulum geometry are also analysed and discussed. The model that consists of a fractional differential equation provides both the best explanation of, and the best fits to, experimental high resolution and long-time data gathered from standard action-camera videos.
Submitted 11 December, 2020; v1 submitted 28 June, 2020;
originally announced June 2020.
-
An inexact version of the symmetric proximal ADMM for solving separable convex optimization
Authors:
Vando A. Adona,
Max L. N. Gonçalves
Abstract:
In this paper, we propose and analyze an inexact version of the symmetric proximal alternating direction method of multipliers (ADMM) for solving linearly constrained optimization problems. Basically, the method allows its first subproblem to be solved inexactly in such a way that a relative approximate criterion is satisfied. In terms of the iteration number $k$, we establish global $\mathcal{O} (1/ \sqrt{k})$ pointwise and $\mathcal{O} (1/ {k})$ ergodic convergence rates of the method for a domain of the acceleration parameters, which is consistent with the largest known one in the exact case. Since the symmetric proximal ADMM can be seen as a class of ADMM variants, the new algorithm as well as its convergence rates generalize, in particular, many others in the literature. Numerical experiments illustrating the practical advantages of the method are reported. To the best of our knowledge, this work is the first one to study an inexact version of the symmetric proximal ADMM.
Submitted 4 June, 2020;
originally announced June 2020.
-
Probabilistic Object Classification using CNN ML-MAP layers
Authors:
G. Melotti,
C. Premebida,
J. J. Bird,
D. R. Faria,
N. Gonçalves
Abstract:
Deep networks are currently the state-of-the-art for sensory perception in autonomous driving and robotics. However, deep models often generate overconfident predictions precluding proper probabilistic interpretation which we argue is due to the nature of the SoftMax layer. To reduce the overconfidence without compromising the classification performance, we introduce a CNN probabilistic approach based on distributions calculated in the network's Logit layer. The approach enables Bayesian inference by means of ML and MAP layers. Experiments with calibrated and the proposed prediction layers are carried out on object classification using data from the KITTI database. Results are reported for camera ($RGB$) and LiDAR (range-view) modalities, where the new approach shows promising performance compared to SoftMax.
Submitted 24 August, 2020; v1 submitted 29 May, 2020;
originally announced May 2020.
-
Three-dimensional dynamics of falling films in the presence of insoluble surfactants
Authors:
Assen Batchvarov,
Lyes Kahouadji,
Cristian R. Constante-Amores,
Gabriel Farah Norões Gonçalves,
Seungwon Shin,
Jalel Chergui,
Damir Juric,
Omar K. Matar
Abstract:
We study the effect of insoluble surfactants on the wave dynamics of vertically-falling liquid films. We use three-dimensional numerical simulations and employ a hybrid interface-tracking/level-set method, taking into account Marangoni stresses induced by gradients of interfacial surfactant concentration. Our numerical predictions for the evolution of the surfactant-free, three-dimensional wave topology are validated against the experimental work of Park & Nosoko (2003). The addition of surfactants is found to influence significantly the development of horseshoe-shaped waves. At low Marangoni numbers, we show that the wave fronts exhibit spanwise oscillations before eventually acquiring a quasi two-dimensional shape. In addition, the presence of Marangoni stresses is found to suppress the peaks of the travelling waves and preceding capillary wave structures. At high Marangoni numbers, a near complete rigidification of the interface is observed.
Submitted 25 May, 2020;
originally announced May 2020.
-
On Inexact Accelerated Proximal Gradient Methods with Relative Error Rules
Authors:
Yunier Bello-Cruz,
Max L. N. Gonçalves,
Nathan Krislock
Abstract:
One of the most popular and important first-order iterations that provides optimal complexity of the classical proximal gradient method (PGM) is the "Fast Iterative Shrinkage/Thresholding Algorithm" (FISTA). In this paper, two inexact versions of FISTA for minimizing the sum of two convex functions are studied. The proposed schemes inexactly solve their subproblems by using relative error criteria instead of exogenous and diminishing error rules. When the evaluation of the proximal operator is difficult, inexact versions of FISTA are necessary and the relative error rules proposed here may have certain advantages over previous error rules. The same optimal convergence rate of FISTA is recovered for both proposed schemes. Some numerical experiments are reported to illustrate the numerical behavior of the new approaches.
Submitted 7 May, 2020;
originally announced May 2020.
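For orientation on the entry above, the following is a minimal sketch of the classical, exact FISTA iteration that the paper's inexact variants build on. It is not the authors' method: the lasso objective 0.5*||Ax - b||^2 + lam*||x||_1, the closed-form soft-thresholding proximal step, the function names, and the crude Lipschitz bound are all assumptions chosen for illustration. The paper's relative error rules concern the case where the proximal step cannot be evaluated exactly.

```python
import math

def soft_threshold(v, tau):
    """Proximal operator of tau * ||.||_1 (exact, closed form)."""
    return [math.copysign(max(abs(vi) - tau, 0.0), vi) for vi in v]

def fista_lasso(A, b, lam, n_iter=200):
    """Exact FISTA for min_x 0.5*||Ax - b||^2 + lam*||x||_1 (A must be nonzero)."""
    m, n = len(A), len(A[0])
    # Squared Frobenius norm: an upper bound on the Lipschitz constant of the gradient.
    L = sum(a * a for row in A for a in row)
    x = [0.0] * n   # current iterate
    y = x[:]        # extrapolated point
    t = 1.0         # momentum parameter
    for _ in range(n_iter):
        # Gradient of the smooth part at y: A^T (A y - b).
        r = [sum(A[i][j] * y[j] for j in range(n)) - b[i] for i in range(m)]
        g = [sum(A[i][j] * r[i] for i in range(m)) for j in range(n)]
        # Proximal gradient step with step size 1/L.
        x_new = soft_threshold([y[j] - g[j] / L for j in range(n)], lam / L)
        # Momentum update and extrapolation.
        t_new = (1.0 + math.sqrt(1.0 + 4.0 * t * t)) / 2.0
        y = [x_new[j] + ((t - 1.0) / t_new) * (x_new[j] - x[j]) for j in range(n)]
        x, t = x_new, t_new
    return x

# Tiny illustrative problem: with A = I the minimizer is soft_threshold(b, lam).
x_hat = fista_lasso([[1.0, 0.0], [0.0, 1.0]], [3.0, -0.5], lam=1.0)
print(x_hat)
```

With the identity matrix the solution is known in closed form, which makes this a convenient sanity check before swapping in a problem whose proximal operator has to be approximated.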
-
Data-driven surrogate modelling and benchmarking for process equipment
Authors:
Gabriel F. N. Gonçalves,
Assen Batchvarov,
Yuyi Liu,
Yuxin Liu,
Lachlan Mason,
Indranil Pan,
Omar K. Matar
Abstract:
In chemical process engineering, surrogate models of complex systems are often necessary for tasks of domain exploration, sensitivity analysis of the design parameters, and optimization. A suite of computational fluid dynamics (CFD) simulations geared toward chemical process equipment modeling has been developed and validated with experimental results from the literature. Various regression-based active learning strategies are explored with these CFD simulators in-the-loop under the constraints of a limited function evaluation budget. Specifically, five different sampling strategies and five regression techniques are compared, considering a set of four test cases of industrial significance and varying complexity. Gaussian process regression was observed to have a consistently good performance for these applications. The present quantitative study outlines the pros and cons of the different available techniques and highlights the best practices for their adoption. The test cases and tools are available with an open-source license to ensure reproducibility and engage the wider research community in contributing to both the CFD models and developing and benchmarking new improved algorithms tailored to this field.
Submitted 8 September, 2020; v1 submitted 13 March, 2020;
originally announced March 2020.
-
Experimental and theoretical study of electronic and hyperfine properties of hydrogenated anatase (TiO$_2$): defects interplay and thermal stability
Authors:
D. V. Zyabkin,
H. P. Gunnlaugsson,
J. N. Goncalves,
K. Bharuth-Ram,
B. Qi,
I. Unzueta,
D. Naidoo,
R. Mantovan,
H. Masenda,
S. Olafsson,
G. Peters,
J. Schell,
U. Vetter,
A. Dimitrova,
S. Krischok,
P. Schaaf
Abstract:
In this study we report on the results from emission $^{57}$Fe Mössbauer spectroscopy experiments, using dilute $^{57}$Mn implantation into pristine (TiO$_2$) and hydrogenated anatase held at temperatures between 300 and 700 K. Results on the electronic structure and local environment are complemented with ab-initio calculations. Upon implantation both Fe$^{2+}$ and Fe$^{3+}$ are observed in pristine anatase, where the latter demonstrates spin-lattice relaxation. The spectra obtained for hydrogenated anatase show no Fe$^{3+}$ contribution, suggesting that hydrogen acts as a donor. Due to the low threshold, hydrogen diffuses out of the lattice, thus showing a dynamic behavior on the time scale of the $^{57}$Fe 14.4 keV state. The surrounding oxygen vacancies favor the high-spin Fe$^{2+}$ state. The sample treated at room temperature shows two distinct processes of hydrogen motion. The motion commences with the interstitial hydrogen, followed by switching to the covalently bound state. Hydrogen out-diffusion is hindered by bulk defects, which could cause both processes to overlap. Supplementary UV-Vis and electrical conductivity measurements show an improved electrical conductivity and higher optical absorption after the hydrogenation. X-ray photoelectron spectroscopy at room temperature reveals that the sample hydrogenated at 573 K shows the presence of both Ti$^{3+}$ and Ti$^{2+}$ states. This could imply that a significant amount of oxygen vacancies and -OH bonds are present in the samples. Theory suggests that in the anatase sample implanted with Mn(Fe), probes were located near equatorial vacancies as next-nearest-neighbours, whilst a metastable hydrogen configuration is responsible for the annealing behavior.
Submitted 17 March, 2020;
originally announced March 2020.
-
Bio-Inspired Modality Fusion for Active Speaker Detection
Authors:
Gustavo Assunção,
Nuno Gonçalves,
Paulo Menezes
Abstract:
Human beings have developed fantastic abilities to integrate information from various sensory sources exploring their inherent complementarity. Perceptual capabilities are therefore heightened, enabling, for instance, the well-known "cocktail party" and McGurk effects, i.e., speech disambiguation from a panoply of sound signals. This fusion ability is also key in refining the perception of sound source location, as in distinguishing whose voice is being heard in a group conversation. Furthermore, neuroscience has successfully identified the superior colliculus region in the brain as the one responsible for this modality fusion, with a handful of biological models having been proposed to approach its underlying neurophysiological process. Deriving inspiration from one of these models, this paper presents a methodology for effectively fusing correlated auditory and visual information for active speaker detection. Such an ability can have a wide range of applications, from teleconferencing systems to social robotics. The detection approach initially routes auditory and visual information through two specialized neural network structures. The resulting embeddings are fused via a novel layer based on the superior colliculus, whose topological structure emulates spatial neuron cross-mapping of unimodal perceptual fields. The validation process employed two publicly available datasets, with achieved results confirming and greatly surpassing initial expectations.
Submitted 13 April, 2021; v1 submitted 28 February, 2020;
originally announced March 2020.
-
Levenberg-Marquardt methods with inexact projections for constrained nonlinear systems
Authors:
Douglas S. Gonçalves,
Max L. N. Gonçalves,
Fabrícia R. Oliveira
Abstract:
In this paper, we first propose a new Levenberg-Marquardt method for solving constrained (and not necessarily square) nonlinear systems. Basically, the method combines the unconstrained Levenberg-Marquardt method with a type of feasible inexact projection. The local convergence of the new method as well as results on its rate are established by using an error bound condition, which is weaker than the standard full-rank assumption. We further present and analyze a global version of the first method by means of a nonmonotone line search technique. Finally, numerical experiments illustrating the practical advantages of the proposed schemes are reported.
Submitted 16 August, 2019;
originally announced August 2019.
-
Non-abelian tensor square and related constructions of $p$-groups
Authors:
Raimundo Bastos,
Emerson de Melo,
Nathália Gonçalves,
Ricardo Nunes
Abstract:
Let $G$ be a group. We denote by $ν(G)$ a certain extension of the non-abelian tensor square $[G,G^{\varphi}]$ by $G \times G$. We prove that if $G$ is a finite potent $p$-group, then $[G,G^{\varphi}]$ and the $k$-th term of the lower central series $γ_k(ν(G))$ are potently embedded in $ν(G)$ (Theorem A). Moreover, we show that if $G$ is a potent $p$-group, then the exponent $\exp(ν(G))$ divides $p \cdot \exp(G)$ (Theorem B). We also study the weak commutativity construction of powerful $p$-groups (Theorem C).
Submitted 18 June, 2019;
originally announced June 2019.
-
Dynamic texture analysis with diffusion in networks
Authors:
Lucas C. Ribas,
Wesley N. Goncalves,
Odemir M. Bruno
Abstract:
Dynamic texture is a field of research that has gained considerable interest from the computer vision community due to the explosive growth of multimedia databases. In addition, dynamic texture is present in a wide range of videos, which makes it very important in video-based expert systems such as medical systems, traffic monitoring systems, and forest fire detection systems, among others. In this paper, a new method for dynamic texture characterization based on diffusion in directed networks is proposed. The dynamic texture is modeled as a directed network. The method consists of analyzing the dynamics of this network after a series of graph cut transformations based on the edge weights. For each network transformation, the activity of each vertex is estimated. The activity is the relative frequency with which a vertex is visited by random walks in balance. Then, a texture descriptor is constructed by concatenating the activity histograms. The main contributions of this paper are the use of directed network modeling and diffusion in networks for dynamic texture characterization. These tend to provide better performance in dynamic texture classification. Experiments with rotation and interference of the motion pattern were conducted in order to demonstrate the robustness of the method. The proposed approach is compared to other dynamic texture methods on two well-known dynamic texture databases and on traffic condition classification, and it outperforms them in most cases.
Submitted 27 June, 2018;
originally announced June 2018.
-
On the global convergent of an inexact quasi-Newton conditional gradient method for constrained nonlinear systems
Authors:
M. L. N. Gonçalves,
F. R. Oliveira
Abstract:
In this paper, we propose a globally convergent method for solving constrained nonlinear systems. The method combines an efficient Newton conditional gradient method with a derivative-free and nonmonotone linesearch strategy. The global convergence analysis of the proposed method is established under suitable conditions, and some preliminary numerical experiments are given to illustrate its performance.
Submitted 5 June, 2018;
originally announced June 2018.
-
A Partially Inexact Alternating Direction Method of Multipliers and its Iteration-Complexity Analysis
Authors:
Vando A. Adona,
Max L. N. Goncalves,
Jefferson G. Melo
Abstract:
This paper proposes a partially inexact alternating direction method of multipliers for computing approximate solutions of a linearly constrained convex optimization problem. This method allows its first subproblem to be solved inexactly using a relative approximate criterion, whereas a proximal term is added to its second subproblem in order to simplify it. A stepsize parameter is included in the updating rule of the Lagrangian multiplier to improve its computational performance.
Pointwise and ergodic iteration-complexity bounds for the proposed method are established. To the best of our knowledge, this is the first time that complexity results for an inexact ADMM with relative error criteria have been analyzed.
Some preliminary numerical experiments are reported to illustrate the advantages of the new method.
Submitted 18 May, 2018;
originally announced May 2018.
-
Multilayer Complex Network Descriptors for Color-Texture Characterization
Authors:
Leonardo F S Scabini,
Rayner H M Condori,
Wesley N Gonçalves,
Odemir M Bruno
Abstract:
A new method based on complex networks is proposed for color-texture analysis. The proposal consists of modeling the image as a multilayer complex network where each color channel is a layer, and each pixel (in each color channel) is represented as a network vertex. The network dynamic evolution is assessed using a set of modeling parameters (radii and thresholds), and new characterization techniques are introduced to capture information regarding within- and between-color-channel spatial interaction. An automatic and adaptive approach for threshold selection is also proposed. We conduct classification experiments on 5 well-known datasets: Vistex, Usptex, Outex13, CURet and MBT. Results are compared against various methods from the literature, including deep convolutional neural networks with pre-trained architectures. The proposed method presented the highest overall performance over the 5 datasets, with a mean accuracy of 97.7 against 97.0 achieved by the ResNet convolutional neural network with 50 layers.
Submitted 2 April, 2018;
originally announced April 2018.
-
A smartphone application to measure the quality of pest control spraying machines via image analysis
Authors:
Bruno B. Machado,
Gabriel Spadon,
Mauro S. Arruda,
Wesley N. Goncalves,
Andre C. P. L. F. Carvalho,
Jose F. Rodrigues-Jr
Abstract:
The need for higher agricultural productivity has demanded the intensive use of pesticides. However, their correct use depends on assessment methods that can accurately predict how well the pesticides' spraying covered the intended crop region. Some methods have been proposed in the literature, but their high cost and low portability harm their widespread use. This paper proposes and experimentally evaluates a new methodology based on the use of a smartphone-based mobile application, named DropLeaf. Experiments performed using DropLeaf showed that, in addition to its versatility, it can predict with high accuracy the pesticide spraying. DropLeaf is a five-fold image-processing methodology based on: (i) color space conversion, (ii) threshold noise removal, (iii) convolutional operations of dilation and erosion, (iv) detection of contour markers in the water-sensitive card, and, (v) identification of droplets via the marker-controlled watershed transformation. The authors performed successful experiments over two case studies, the first using a set of synthetic cards and the second using a real-world crop. The proposed tool can be broadly used by farmers equipped with conventional mobile phones, improving the use of pesticides with health, environmental and financial benefits.
Submitted 16 December, 2017; v1 submitted 21 November, 2017;
originally announced November 2017.
-
An Inexact Newton-like conditional gradient method for constrained nonlinear systems
Authors:
M. L. N. Goncalves,
F. R. Oliveira
Abstract:
In this paper, we propose an inexact Newton-like conditional gradient method for solving constrained systems of nonlinear equations. The local convergence of the new method as well as results on its rate are established by using a general majorant condition. Two applications of such a condition are provided: one is for functions whose derivative satisfies a Hölder-like condition and the other is for functions that satisfy a Smale condition, which includes a substantial class of analytic functions. Some preliminary numerical experiments illustrating the applicability of the proposed method for medium and large problems are also presented.
Submitted 22 May, 2017;
originally announced May 2017.
-
Iteration-complexity analysis of a generalized alternating direction method of multipliers
Authors:
V. A. Adona,
M. L. N. Goncalves,
J. G. Melo
Abstract:
This paper analyzes the iteration-complexity of a generalized alternating direction method of multipliers (G-ADMM) for solving linearly constrained convex problems. This ADMM variant, which was first proposed by Bertsekas and Eckstein, introduces a relaxation parameter $α\in (0,2)$ into the second ADMM subproblem. Our approach is to show that the G-ADMM is an instance of a hybrid proximal extragradient framework with some special properties, and, as a by-product, we obtain ergodic iteration-complexity for the G-ADMM with $α\in (0,2]$, improving and complementing related results in the literature. Additionally, we present pointwise iteration-complexity for the G-ADMM.
Submitted 17 May, 2017;
originally announced May 2017.
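To make the role of the relaxation parameter in the abstract above concrete, here is a minimal sketch of an over-relaxed ADMM iteration in the common scaled-dual textbook form. It is not taken from the paper: the scalar test problem (minimize 0.5*(x - a)^2 + lam*|z| subject to x - z = 0), the names `relaxed_admm`, `soft`, `rho`, and the default `alpha = 1.5` are assumptions made for illustration.

```python
def soft(v, t):
    """Scalar soft-thresholding: proximal operator of t * |.|."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def relaxed_admm(a, lam, rho=1.0, alpha=1.5, iters=200):
    """Over-relaxed ADMM (scaled dual form) for the hypothetical problem
    min 0.5*(x - a)^2 + lam*|z|  subject to  x - z = 0."""
    x = z = u = 0.0
    for _ in range(iters):
        # First subproblem: exact minimization of the quadratic in x.
        x = (a + rho * (z - u)) / (1.0 + rho)
        # Relaxation step: blend the new x with the previous z using alpha.
        x_hat = alpha * x + (1.0 - alpha) * z
        # Second subproblem: uses the relaxed point x_hat in place of x.
        z = soft(x_hat + u, lam / rho)
        # Scaled multiplier update, also using the relaxed point.
        u = u + x_hat - z
    return x, z


x_opt, z_opt = relaxed_admm(a=3.0, lam=1.0)
```

Setting `alpha = 1.0` recovers the standard ADMM iteration; values above 1 correspond to over-relaxation. For this toy problem the minimizer is the soft-thresholding of `a` with threshold `lam`, so both `x_opt` and `z_opt` can be checked against `soft(3.0, 1.0)`.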
-
On the pointwise iteration-complexity of a dynamic regularized ADMM with over-relaxation stepsize
Authors:
M. L. N. Goncalves
Abstract:
In this paper, we extend the improved pointwise iteration-complexity result of a dynamic regularized alternating direction method of multipliers (ADMM) to a new stepsize domain. In this complexity analysis, the stepsize parameter can even be chosen in the interval $(0,2)$ instead of the interval $(0,(1+\sqrt{5})/2)$. As usual, our analysis is established by interpreting this ADMM variant as an instance of a hybrid proximal extragradient framework applied to a specific monotone inclusion problem.
Submitted 8 May, 2017;
originally announced May 2017.
-
On the nature of the (de)coupling of the magnetostructural transition in Er$_5$Si$_4$
Authors:
Rui M. Costa,
João H. Belo,
Marcelo B. Barbosa,
Pedro A. Algarabel,
César Magén,
Luis Morellon,
Manuel R. Ibarra,
João N. Gonçalves,
Nuno M. Fortunato,
João S. Amaral,
João P. Araújo,
André M. Pereira
Abstract:
In this report, a thermodynamical model was successfully employed to understand the structural transition in Er$_5$Si$_4$ and to explain the decoupling of the magnetic and structural transitions. This was achieved through DFT calculations, which were used to determine the energy differences at 0 K using an LSDA+U approximation. The M structure was found to be the stable phase at low temperatures, as verified experimentally, with $ΔF_0 = -$0.262 eV. Finally, a variation of the Seebeck coefficient ($\sim$ 6 $μ$V) was obtained at the structural transition, which allows us to conclude that the electronic entropy variation is negligible in the transition.
Submitted 24 August, 2017; v1 submitted 31 March, 2017;
originally announced March 2017.
-
Pointwise and ergodic convergence rates of a variable metric proximal ADMM
Authors:
Max L. N. Goncalves,
Jefferson G. Melo,
M. Marques Alves
Abstract:
In this paper, we obtain global $\mathcal{O} (1/ \sqrt{k})$ pointwise and $\mathcal{O} (1/ {k})$ ergodic convergence rates for a variable metric proximal alternating direction method of multipliers (VM-PADMM) for solving linearly constrained convex optimization problems. The VM-PADMM can be seen as a class of ADMM variants, allowing the use of degenerate metrics (defined by noninvertible linear operators). We first propose and study nonasymptotic convergence rates of a variable metric hybrid proximal extragradient (VM-HPE) framework for solving monotone inclusions. Then, the above-mentioned convergence rates for the VM-PADMM are obtained essentially by showing that it falls within the latter framework. To the best of our knowledge, this is the first time that global pointwise (resp. pointwise and ergodic) convergence rates are obtained for the VM-PADMM (resp. VM-HPE framework).
Submitted 4 May, 2017; v1 submitted 21 February, 2017;
originally announced February 2017.