-
Identifying type II quasars at intermediate redshift with few-shot learning photometric classification
Authors:
P. A. C. Cunha,
A. Humphrey,
J. Brinchmann,
S. G. Morais,
R. Carvajal,
J. M. Gomes,
I. Matute,
A. Paulino-Afonso
Abstract:
We aim to identify QSO2 candidates in the redshift desert using optical and infrared photometry. At this intermediate redshift range, most of the prominent optical emission lines in QSO2 sources (e.g. CIV1549; [OIII]4959,5008) fall either outside the wavelength range of the SDSS optical spectra or in particularly noisy wavelength ranges, making QSO2 identification challenging. Therefore, we adopte…
▽ More
We aim to identify QSO2 candidates in the redshift desert using optical and infrared photometry. At this intermediate redshift range, most of the prominent optical emission lines in QSO2 sources (e.g. CIV1549; [OIII]4959,5008) fall either outside the wavelength range of the SDSS optical spectra or in particularly noisy wavelength ranges, making QSO2 identification challenging. Therefore, we adopted a semi-supervised machine learning approach to select candidates in the SDSS galaxy sample. Recent applications of machine learning in astronomy focus on problems involving large data sets, with small data sets often being overlooked. We developed a few-shot learning approach for the identification and classification of rare-object classes using limited training data (200 sources). The new AMELIA pipeline uses a transfer-learning based approach with decision trees, distance-based, and deep learning methods to build a classifier capable of identifying rare objects on the basis of an observational training data set. We validated the performance of AMELIA by addressing the problem of identifying QSO2s at 1 $\leq$ z $\leq$ 2 using SDSS and WISE photometry, obtaining an F1-score above 0.8 in a supervised approach. We then used AMELIA to select new QSO2 candidates in the redshift desert and examined the nature of the candidates using SDSS spectra, when available. In particular, we identified a sub-population of [NeV]3426 emitters at z $\sim$ 1.1, which are highly likely to contain obscured AGNs. We used X-ray and radio cross-matching to validate our classification and investigated the performance of photometric criteria from the literature showing that our candidates have an inherent dusty nature. Finally, we derived physical properties for our QSO2 sample using photoionisation models and verified the AGN classification using an SED fitting.
△ Less
Submitted 27 May, 2024; v1 submitted 22 May, 2024;
originally announced May 2024.
-
Euclid. I. Overview of the Euclid mission
Authors:
Euclid Collaboration,
Y. Mellier,
Abdurro'uf,
J. A. Acevedo Barroso,
A. Achúcarro,
J. Adamek,
R. Adam,
G. E. Addison,
N. Aghanim,
M. Aguena,
V. Ajani,
Y. Akrami,
A. Al-Bahlawan,
A. Alavi,
I. S. Albuquerque,
G. Alestas,
G. Alguero,
A. Allaoui,
S. W. Allen,
V. Allevato,
A. V. Alonso-Tetilla,
B. Altieri,
A. Alvarez-Candal,
A. Amara,
L. Amendola
, et al. (1086 additional authors not shown)
Abstract:
The current standard model of cosmology successfully describes a variety of measurements, but the nature of its main ingredients, dark matter and dark energy, remains unknown. Euclid is a medium-class mission in the Cosmic Vision 2015-2025 programme of the European Space Agency (ESA) that will provide high-resolution optical imaging, as well as near-infrared imaging and spectroscopy, over about 14…
▽ More
The current standard model of cosmology successfully describes a variety of measurements, but the nature of its main ingredients, dark matter and dark energy, remains unknown. Euclid is a medium-class mission in the Cosmic Vision 2015-2025 programme of the European Space Agency (ESA) that will provide high-resolution optical imaging, as well as near-infrared imaging and spectroscopy, over about 14,000 deg^2 of extragalactic sky. In addition to accurate weak lensing and clustering measurements that probe structure formation over half of the age of the Universe, its primary probes for cosmology, these exquisite data will enable a wide range of science. This paper provides a high-level overview of the mission, summarising the survey characteristics, the various data-processing steps, and data products. We also highlight the main science objectives and expected performance.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Photon Many-body Dispersion: an Exchange-correlation Functional for Strongly Coupled Light-matter Systems
Authors:
Cankut Tasci,
Leonardo A. Cunha,
Johannes Flick
Abstract:
We introduce an electron-photon exchange-correlation functional for quantum electrodynamical density-functional theory (QEDFT). The approach, photon MBD (pMBD), is inspired by the many-body dispersion (MBD) method for weak intermolecular interactions, which is generalized to include both electronic and photonic (electromagnetic) degrees of freedom on the same footing. We demonstrate that pMBD accu…
▽ More
We introduce an electron-photon exchange-correlation functional for quantum electrodynamical density-functional theory (QEDFT). The approach, photon MBD (pMBD), is inspired by the many-body dispersion (MBD) method for weak intermolecular interactions, which is generalized to include both electronic and photonic (electromagnetic) degrees of freedom on the same footing. We demonstrate that pMBD accurately captures effects that arise in the context of strong light-matter interactions, such as anisotropic electron-photon interactions, beyond single-photon effects, and cavity modulated van der Waals interactions. Moreover, we show that pMBD is computationally efficient and allows simulations of large complex systems coupled to optical cavities.
△ Less
Submitted 6 April, 2024;
originally announced April 2024.
-
Incorporating Competition into Dual Accessibility Assessment: The Competitive Equilibrium Method
Authors:
Andre Borgato Morelli,
Andre Luiz Cunha
Abstract:
This study proposes a new approach to assessing urban accessibility using the competitive equilibrium method, an adaptation of the balancing cost method that incorporates competition among users into dual accessibility metrics. The need for this method arises from verifying the inability of the balancing cost method to measure the competitive dynamics of transport systems precisely. The method wor…
▽ More
This study proposes a new approach to assessing urban accessibility using the competitive equilibrium method, an adaptation of the balancing cost method that incorporates competition among users into dual accessibility metrics. The need for this method arises from verifying the inability of the balancing cost method to measure the competitive dynamics of transport systems precisely. The method works by continuously updating available opportunities based on the number of competitors, capturing competition effects, and differentiating regions according to demand and supply. The application results in three cities in the interior of São Paulo demonstrate that competitive equilibrium is significantly more sensitive to employment supply variations and can measure potential competitors by area. In addition, the data used, and the computational tools developed are made available, allowing for the replication of the work and application in future studies.
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
Boosting, Voting Classifiers and Randomized Sample Compression Schemes
Authors:
Arthur da Cunha,
Kasper Green Larsen,
Martin Ritzert
Abstract:
In boosting, we aim to leverage multiple weak learners to produce a strong learner. At the center of this paradigm lies the concept of building the strong learner as a voting classifier, which outputs a weighted majority vote of the weak learners. While many successful boosting algorithms, such as the iconic AdaBoost, produce voting classifiers, their theoretical performance has long remained sub-…
▽ More
In boosting, we aim to leverage multiple weak learners to produce a strong learner. At the center of this paradigm lies the concept of building the strong learner as a voting classifier, which outputs a weighted majority vote of the weak learners. While many successful boosting algorithms, such as the iconic AdaBoost, produce voting classifiers, their theoretical performance has long remained sub-optimal: the best known bounds on the number of training examples necessary for a voting classifier to obtain a given accuracy has so far always contained at least two logarithmic factors above what is known to be achievable by general weak-to-strong learners. In this work, we break this barrier by proposing a randomized boosting algorithm that outputs voting classifiers whose generalization error contains a single logarithmic dependency on the sample size. We obtain this result by building a general framework that extends sample compression methods to support randomized learning algorithms based on sub-sampling.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
Building rigid networks with prestress and selective pruning
Authors:
Marco Aurelio Galvani Cunha,
John C. Crocker,
Andrea J. Liu
Abstract:
The mechanical properties of biopolymer networks depend on their mean coordination number and stress state. Models based on spring networks have been shown to capture some properties of materials such as the actin cortex, but the effect of prestress combined with filament pruning, known to exist in the cortex, has not been studied. We show that in central-force spring networks below isostatic coor…
▽ More
The mechanical properties of biopolymer networks depend on their mean coordination number and stress state. Models based on spring networks have been shown to capture some properties of materials such as the actin cortex, but the effect of prestress combined with filament pruning, known to exist in the cortex, has not been studied. We show that in central-force spring networks below isostatic coordination that are rigidified by prestress, details of the pruning method significantly affect mechanical properties: networks pruned by a tension-inhibited method not only have a larger shear modulus than randomly-pruned networks, but require far smaller initial prestrains in order to remain rigid at biologically-relevant coordinations. These findings suggest a possible reason for the common motif of tension-inhibited filament-cleaving proteins in biopolymer networks.
△ Less
Submitted 11 December, 2023;
originally announced December 2023.
-
Resilience in Highways: Proposal of Roadway Redundancy Indicators and Application in Segments of the Brazilian Network
Authors:
André Borgato Morelli,
André Luiz Cunha
Abstract:
With the growing realization that transport systems must operate satisfactorily not only in typical situations, but also in adverse circumstances, ensuring redundancies in road systems has gained crucial importance. In this context, several methods have been proposed for measuring the vulnerabilities and resilience of transport systems. However, a simple metric to understand and quantify the degre…
▽ More
With the growing realization that transport systems must operate satisfactorily not only in typical situations, but also in adverse circumstances, ensuring redundancies in road systems has gained crucial importance. In this context, several methods have been proposed for measuring the vulnerabilities and resilience of transport systems. However, a simple metric to understand and quantify the degree of redundancy of a given road segment is still necessary, mainly to guide the responsible bodies regarding the need for intervention or special care with certain sections of the system. Thus, this paper proposes a redundancy indicator based on network analyses in the vicinity of an element. The proposed indicator was first calculated on nine application examples and then on a substantial sample of the Brazilian road network (~10% of segments). The results demonstrate that the indicator can satisfactorily describe the variety of cases in the Brazilian network, capturing cases where there is significant redundancy in the elements, as in some regions of the Southeast and South; or cases of very low redundancy, such as the sparse grid in the north of the country. It was also verified that the indicator has a particular sensitivity to parameters of the defined function, requiring further research for an acceptable calibration.
△ Less
Submitted 1 December, 2023;
originally announced December 2023.
-
Polynomially Over-Parameterized Convolutional Neural Networks Contain Structured Strong Winning Lottery Tickets
Authors:
Arthur da Cunha,
Francesco d'Amore,
Emanuele Natale
Abstract:
The Strong Lottery Ticket Hypothesis (SLTH) states that randomly-initialised neural networks likely contain subnetworks that perform well without any training. Although unstructured pruning has been extensively studied in this context, its structured counterpart, which can deliver significant computational and memory efficiency gains, has been largely unexplored. One of the main reasons for this g…
▽ More
The Strong Lottery Ticket Hypothesis (SLTH) states that randomly-initialised neural networks likely contain subnetworks that perform well without any training. Although unstructured pruning has been extensively studied in this context, its structured counterpart, which can deliver significant computational and memory efficiency gains, has been largely unexplored. One of the main reasons for this gap is the limitations of the underlying mathematical tools used in formal analyses of the SLTH. In this paper, we overcome these limitations: we leverage recent advances in the multidimensional generalisation of the Random Subset-Sum Problem and obtain a variant that admits the stochastic dependencies that arise when addressing structured pruning in the SLTH. We apply this result to prove, for a wide class of random Convolutional Neural Networks, the existence of structured subnetworks that can approximate any sufficiently smaller network.
This result provides the first sub-exponential bound around the SLTH for structured pruning, opening up new avenues for further research on the hypothesis and contributing to the understanding of the role of over-parameterization in deep learning.
△ Less
Submitted 16 November, 2023;
originally announced November 2023.
-
Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets
Authors:
Dominique Beaini,
Shenyang Huang,
Joao Alex Cunha,
Zhiyi Li,
Gabriela Moisescu-Pareja,
Oleksandr Dymov,
Samuel Maddrell-Mander,
Callum McLean,
Frederik Wenkel,
Luis Müller,
Jama Hussein Mohamud,
Ali Parviz,
Michael Craig,
Michał Koziarski,
Jiarui Lu,
Zhaocheng Zhu,
Cristian Gabellini,
Kerstin Klaser,
Josef Dean,
Cas Wognum,
Maciej Sypetkowski,
Guillaume Rabusseau,
Reihaneh Rabbany,
Jian Tang,
Christopher Morris
, et al. (10 additional authors not shown)
Abstract:
Recently, pre-trained foundation models have enabled significant advancements in multiple fields. In molecular machine learning, however, where datasets are often hand-curated, and hence typically small, the lack of datasets with labeled features, and codebases to manage those datasets, has hindered the development of foundation models. In this work, we present seven novel datasets categorized by…
▽ More
Recently, pre-trained foundation models have enabled significant advancements in multiple fields. In molecular machine learning, however, where datasets are often hand-curated, and hence typically small, the lack of datasets with labeled features, and codebases to manage those datasets, has hindered the development of foundation models. In this work, we present seven novel datasets categorized by size into three distinct categories: ToyMix, LargeMix and UltraLarge. These datasets push the boundaries in both the scale and the diversity of supervised labels for molecular learning. They cover nearly 100 million molecules and over 3000 sparsely defined tasks, totaling more than 13 billion individual labels of both quantum and biological nature. In comparison, our datasets contain 300 times more data points than the widely used OGB-LSC PCQM4Mv2 dataset, and 13 times more than the quantum-only QM1B dataset. In addition, to support the development of foundational models based on our proposed datasets, we present the Graphium graph machine learning library which simplifies the process of building and training molecular machine learning models for multi-task and multi-level molecular datasets. Finally, we present a range of baseline results as a starting point of multi-task and multi-level training on these datasets. Empirically, we observe that performance on low-resource biological datasets show improvement by also training on large amounts of quantum data. This indicates that there may be potential in multi-task and multi-level training of a foundation model and fine-tuning it to resource-constrained downstream tasks.
△ Less
Submitted 18 October, 2023; v1 submitted 6 October, 2023;
originally announced October 2023.
-
Selection of powerful radio galaxies with machine learning
Authors:
R. Carvajal,
I. Matute,
J. Afonso,
R. P. Norris,
K. J. Luken,
P. Sánchez-Sáez,
P. A. C. Cunha,
A. Humphrey,
H. Messias,
S. Amarantidis,
D. Barbosa,
H. A. Cruz,
H. Miranda,
A. Paulino-Afonso,
C. Pappalardo
Abstract:
We developed and trained a pipeline of three machine learning (ML) models than can predict which sources are more likely to be an AGN and to be detected in specific radio surveys. Also, it can estimate redshift values for predicted radio-detectable AGNs. These models, which combine predictions from tree-based and gradient-boosting algorithms, have been trained with multi-wavelength data from near-…
▽ More
We developed and trained a pipeline of three machine learning (ML) models than can predict which sources are more likely to be an AGN and to be detected in specific radio surveys. Also, it can estimate redshift values for predicted radio-detectable AGNs. These models, which combine predictions from tree-based and gradient-boosting algorithms, have been trained with multi-wavelength data from near-infrared-selected sources in the Hobby-Eberly Telescope Dark Energy Experiment (HETDEX) Spring field. Training, testing, calibration, and validation were carried out in the HETDEX field. Further validation was performed on near-infrared-selected sources in the Stripe 82 field. In the HETDEX validation subset, our pipeline recovers 96% of the initially labelled AGNs and, from AGNs candidates, we recover 50% of previously detected radio sources. For Stripe 82, these numbers are 94% and 55%. Compared to random selection, these rates are two and four times better for HETDEX, and 1.2 and 12 times better for Stripe 82. The pipeline can also recover the redshift distribution of these sources with $σ_{\mathrm{NMAD}}$ = 0.07 for HETDEX ($σ_{\mathrm{NMAD}}$ = 0.09 for Stripe 82) and an outlier fraction of 19% (25% for Stripe 82), compatible with previous results based on broad-band photometry. Feature importance analysis stresses the relevance of near- and mid-infrared colours to select AGNs and identify their radio and redshift nature. Combining different algorithms in ML models shows an improvement in the prediction power of our pipeline over a random selection of sources. Tree-based ML models (in contrast to deep learning techniques) facilitate the analysis of the impact that features have on the predictions. This prediction can give insight into the potential physical interplay between the properties of radio AGNs (e.g. mass of black hole and accretion rate).
△ Less
Submitted 1 December, 2023; v1 submitted 20 September, 2023;
originally announced September 2023.
-
MedShapeNet -- A Large-Scale Dataset of 3D Medical Shapes for Computer Vision
Authors:
Jianning Li,
Zongwei Zhou,
Jiancheng Yang,
Antonio Pepe,
Christina Gsaxner,
Gijs Luijten,
Chongyu Qu,
Tiezheng Zhang,
Xiaoxi Chen,
Wenxuan Li,
Marek Wodzinski,
Paul Friedrich,
Kangxian Xie,
Yuan **,
Narmada Ambigapathy,
Enrico Nasca,
Naida Solak,
Gian Marco Melito,
Viet Duc Vu,
Afaque R. Memon,
Christopher Schlachta,
Sandrine De Ribaupierre,
Rajnikant Patel,
Roy Eagleson,
Xiaojun Chen
, et al. (132 additional authors not shown)
Abstract:
Prior to the deep learning era, shape was commonly used to describe the objects. Nowadays, state-of-the-art (SOTA) algorithms in medical imaging are predominantly diverging from computer vision, where voxel grids, meshes, point clouds, and implicit surface models are used. This is seen from numerous shape-related publications in premier vision conferences as well as the growing popularity of Shape…
▽ More
Prior to the deep learning era, shape was commonly used to describe the objects. Nowadays, state-of-the-art (SOTA) algorithms in medical imaging are predominantly diverging from computer vision, where voxel grids, meshes, point clouds, and implicit surface models are used. This is seen from numerous shape-related publications in premier vision conferences as well as the growing popularity of ShapeNet (about 51,300 models) and Princeton ModelNet (127,915 models). For the medical domain, we present a large collection of anatomical shapes (e.g., bones, organs, vessels) and 3D models of surgical instrument, called MedShapeNet, created to facilitate the translation of data-driven vision algorithms to medical applications and to adapt SOTA vision algorithms to medical problems. As a unique feature, we directly model the majority of shapes on the imaging data of real patients. As of today, MedShapeNet includes 23 dataset with more than 100,000 shapes that are paired with annotations (ground truth). Our data is freely accessible via a web interface and a Python application programming interface (API) and can be used for discriminative, reconstructive, and variational benchmarks as well as various applications in virtual, augmented, or mixed reality, and 3D printing. Exemplary, we present use cases in the fields of classification of brain tumors, facial and skull reconstructions, multi-class anatomy completion, education, and 3D printing. In future, we will extend the data and improve the interfaces. The project pages are: https://medshapenet.ikim.nrw/ and https://github.com/Jianningli/medshapenet-feedback
△ Less
Submitted 12 December, 2023; v1 submitted 30 August, 2023;
originally announced August 2023.
-
Well-posed problem for a combustion model in a multilayer porous medium
Authors:
M. R. Batista,
A. Cunha,
J. C. Da Mota,
R. A. Santos
Abstract:
Combustion occurring in porous media has various practical applications, such as in in-situ combustion processes in oil reservoirs, the combustion of biogas in sanitary landfills, and many others. A porous medium where combustion takes place can consist of layers with different physical properties. This study demonstrates that the initial value problem for a combustion model in a multi-layer porou…
▽ More
Combustion occurring in porous media has various practical applications, such as in in-situ combustion processes in oil reservoirs, the combustion of biogas in sanitary landfills, and many others. A porous medium where combustion takes place can consist of layers with different physical properties. This study demonstrates that the initial value problem for a combustion model in a multi-layer porous medium has a unique solution, which is continuous with respect to the initial data and parameters in $\mathtt{L}^2(\mathbb{R})^n$. In summary, it establishes that the initial value problem is well-posed in $\mathtt{L}^2(\mathbb{R})^n$. The model is governed by a one-dimensional reaction-diffusion-convection system, where the unknowns are the temperatures in the layers. Previous studies have addressed the same problem in $\mathtt{H}^2(\mathbb{R})^n$. However, in this study, we solve the problem in a less restrictive space, namely $\mathtt{L}^2(\mathbb{R})^n$. The proof employs a novel approach to combustion problems in porous media, utilizing an evolution operator defined from the theory of semigroups in Hilbert space and Kato's theory for a well-posed associated initial value problem.
△ Less
Submitted 28 August, 2023;
originally announced August 2023.
-
Um banco de dados de empregos formais georreferenciados em cidades brasileiras
Authors:
Andre Borgato Morelli,
André de Carvalho Fiedler,
André Luiz Cunha
Abstract:
Currently, transport planning has changed its paradigm from projects oriented to guarantee service levels to projects oriented to guarantee accessibility to opportunities. In this context, a number of studies and tools aimed at calculating accessibility are being made available, however these tools depend on job location data that are not always easily accessible. Thus, this work proposes the crea…
▽ More
Currently, transport planning has changed its paradigm from projects oriented to guarantee service levels to projects oriented to guarantee accessibility to opportunities. In this context, a number of studies and tools aimed at calculating accessibility are being made available, however these tools depend on job location data that are not always easily accessible. Thus, this work proposes the creation of a database with the locations of formal jobs in Brazilian cities. The method uses the RAIS jobs database and the CNEFE street faces database to infer the location of jobs in urban regions from the zip code and the number of non-residential addresses on street faces. As a result, jobs can be located more accurately in large and medium-sized cities and approximately in single zip code cities. Finally, the databases are made available openly so that researchers and planning professionals can easily apply accessibility analyzes throughout the national territory.
△ Less
Submitted 16 March, 2023;
originally announced March 2023.
-
Probabilistic maps on bistable vibration energy harvesters
Authors:
João Pedro Norenberg,
Americo Cunha Jr,
Samuel da Silva,
Paulo Sergio Varoto
Abstract:
This paper analyzes the impact of parametric uncertainties on the dynamics of bistable energy harvesters, focusing on obtaining statistical information about how each parameter's variability affects the energy harvesting process. To model the parametric uncertainties, we use a probability distribution derived from the maximum entropy principle, while polynomial chaos is employed to propagate uncer…
▽ More
This paper analyzes the impact of parametric uncertainties on the dynamics of bistable energy harvesters, focusing on obtaining statistical information about how each parameter's variability affects the energy harvesting process. To model the parametric uncertainties, we use a probability distribution derived from the maximum entropy principle, while polynomial chaos is employed to propagate uncertainty. Conditional probabilities and probability maps are obtained to investigate the effect of uncertainty on harvesting energy. We consider different models of bistable energy harvesters that account for nonlinear piezoelectric coupling and asymmetries. Our findings suggest a higher probability of increasing harvested power in the intrawell motion regime as the excitation frequency increases. In contrast, increasing the excitation amplitude and piezoelectric coupling are more likely to increase power in the chaotic and interwell motion regimes, respectively. An illustrative example is presented to emphasize the importance of investigating the influence when all parameters vary simultaneously.
△ Less
Submitted 16 October, 2023; v1 submitted 19 February, 2023;
originally announced February 2023.
-
The generalized fractional KdV equation in weighted Sobolev spaces
Authors:
Alysson Cunha,
Oscar Riaño
Abstract:
This work concerns the study of persistence property in polynomial weighted spaces for solutions of the generalized fractional KdV equation in any spatial dimension $d\geq 1$. By establishing well-posedness results in conjunction with some asymptotic at infinity unique continuation principles, it is verified that dispersive effects and dimensionality mainly determine the maximum spatial decay allo…
▽ More
This work concerns the study of persistence property in polynomial weighted spaces for solutions of the generalized fractional KdV equation in any spatial dimension $d\geq 1$. By establishing well-posedness results in conjunction with some asymptotic at infinity unique continuation principles, it is verified that dispersive effects and dimensionality mainly determine the maximum spatial decay allowed by solutions of this model. In particular, we recover and extend some known results on weighted spaces for different models such as the Benjamin-Ono equation, and the dispersion generalized Benjamin-Ono equation. The estimates obtained for the linear equation seem to be of independent interest, and they are useful to obtain persistence properties in weighted spaces for models with different nonlinearities as the fractional KdV equation with combined nonlinearities.
△ Less
Submitted 21 December, 2022;
originally announced December 2022.
-
Improving machine learning-derived photometric redshifts and physical property estimates using unlabelled observations
Authors:
A. Humphrey,
P. A. C. Cunha,
A. Paulino-Afonso,
S. Amarantidis,
R. Carvajal,
J. M. Gomes,
I. Matute,
P. Papaderos
Abstract:
In the era of huge astronomical surveys, machine learning offers promising solutions for the efficient estimation of galaxy properties. The traditional, `supervised' paradigm for the application of machine learning involves training a model on labelled data, and using this model to predict the labels of previously unlabelled data. The semi-supervised `pseudo-labelling' technique offers an alternat…
▽ More
In the era of huge astronomical surveys, machine learning offers promising solutions for the efficient estimation of galaxy properties. The traditional, `supervised' paradigm for the application of machine learning involves training a model on labelled data, and using this model to predict the labels of previously unlabelled data. The semi-supervised `pseudo-labelling' technique offers an alternative paradigm, allowing the model training algorithm to learn from both labelled data and as-yet unlabelled data. We test the pseudo-labelling method on the problems of estimating redshift, stellar mass, and star formation rate, using COSMOS2015 broad band photometry and one of several publicly available machine learning algorithms, and we obtain significant improvements compared to purely supervised learning. We find that the gradient-boosting tree methods CatBoost, XGBoost, and LightGBM benefit the most, with reductions of up to ~15% in metrics of absolute error. We also find similar improvements in the photometric redshift catastrophic outlier fraction. We argue that the pseudo-labellng technique will be useful for the estimation of redshift and physical properties of galaxies in upcoming large imaging surveys such as Euclid and LSST, which will provide photometric data for billions of sources.
△ Less
Submitted 5 December, 2022;
originally announced December 2022.
-
Machine-learning classification of astronomical sources: estimating F1-score in the absence of ground truth
Authors:
A. Humphrey,
W. Kuberski,
J. Bialek,
N. Perrakis,
W. Cools,
N. Nuyttens,
H. Elakhrass,
P. A. C. Cunha
Abstract:
Machine-learning based classifiers have become indispensable in the field of astrophysics, allowing separation of astronomical sources into various classes, with computational efficiency suitable for application to the enormous data volumes that wide-area surveys now typically produce. In the standard supervised classification paradigm, a model is typically trained and validated using data from re…
▽ More
Machine-learning based classifiers have become indispensable in the field of astrophysics, allowing separation of astronomical sources into various classes, with computational efficiency suitable for application to the enormous data volumes that wide-area surveys now typically produce. In the standard supervised classification paradigm, a model is typically trained and validated using data from relatively small areas of sky, before being used to classify sources in other areas of the sky. However, population shifts between the training examples and the sources to be classified can lead to `silent' degradation in model performance, which can be challenging to identify when the ground-truth is not available. In this Letter, we present a novel methodology using the NannyML Confidence-Based Performance Estimation (CBPE) method to predict classifier F1-score in the presence of population shifts, but without ground-truth labels. We apply CBPE to the selection of quasars with decision-tree ensemble models, using broad-band photometry, and show that the F1-scores are predicted remarkably well (MAPE ~ 10%; R^2 = 0.74-0.92). We discuss potential use-cases in the domain of astronomy, including machine-learning model and/or hyperparameter selection, and evaluation of the suitability of training datasets for a particular classification problem.
△ Less
Submitted 29 September, 2022;
originally announced September 2022.
-
Euclid preparation: XXII. Selection of Quiescent Galaxies from Mock Photometry using Machine Learning
Authors:
Euclid Collaboration,
A. Humphrey,
L. Bisigello,
P. A. C. Cunha,
M. Bolzonella,
S. Fotopoulou,
K. Caputi,
C. Tortora,
G. Zamorani,
P. Papaderos,
D. Vergani,
J. Brinchmann,
M. Moresco,
A. Amara,
N. Auricchio,
M. Baldi,
R. Bender,
D. Bonino,
E. Branchini,
M. Brescia,
S. Camera,
V. Capobianco,
C. Carbone,
J. Carretero,
F. J. Castander
, et al. (184 additional authors not shown)
Abstract:
The Euclid Space Telescope will provide deep imaging at optical and near-infrared wavelengths, along with slitless near-infrared spectroscopy, across ~15,000 sq deg of the sky. Euclid is expected to detect ~12 billion astronomical sources, facilitating new insights into cosmology, galaxy evolution, and various other topics. To optimally exploit the expected very large data set, there is the need t…
▽ More
The Euclid Space Telescope will provide deep imaging at optical and near-infrared wavelengths, along with slitless near-infrared spectroscopy, across ~15,000 sq deg of the sky. Euclid is expected to detect ~12 billion astronomical sources, facilitating new insights into cosmology, galaxy evolution, and various other topics. To optimally exploit the expected very large data set, there is the need to develop appropriate methods and software. Here we present a novel machine-learning based methodology for selection of quiescent galaxies using broad-band Euclid I_E, Y_E, J_E, H_E photometry, in combination with multiwavelength photometry from other surveys. The ARIADNE pipeline uses meta-learning to fuse decision-tree ensembles, nearest-neighbours, and deep-learning methods into a single classifier that yields significantly higher accuracy than any of the individual learning methods separately. The pipeline has `sparsity-awareness', so that missing photometry values are still informative for the classification. Our pipeline derives photometric redshifts for galaxies selected as quiescent, aided by the `pseudo-labelling' semi-supervised method. After application of the outlier filter, our pipeline achieves a normalized mean absolute deviation of ~< 0.03 and a fraction of catastrophic outliers of ~< 0.02 when measured against the COSMOS2015 photometric redshifts. We apply our classification pipeline to mock galaxy photometry catalogues corresponding to three main scenarios: (i) Euclid Deep Survey with ancillary ugriz, WISE, and radio data; (ii) Euclid Wide Survey with ancillary ugriz, WISE, and radio data; (iii) Euclid Wide Survey only. Our classification pipeline outperforms UVJ selection, in addition to the Euclid I_E-Y_E, J_E-H_E and u-I_E,I_E-J_E colour-colour methods, with improvements in completeness and the F1-score of up to a factor of 2. (Abridged)
△ Less
Submitted 5 December, 2022; v1 submitted 26 September, 2022;
originally announced September 2022.
-
Nonlinear dynamics of asymmetric bistable energy harvesters
Authors:
João Pedro Norenberg,
Roberto Luo,
Vinicius Goncaalves Lopes,
João Victor L. L. Peterson,
Americo Cunha Jr
Abstract:
The paper investigates asymmetries effects over a nonlinear vibration energy harvester dynamics. The asymmetric system performance is compared with symmetric ones. Different asymmetry levels on restoring force and gravity action are investigated from a system-slo** angle variation. Bifurcation diagrams and basins of attraction are used to examine the local and global characteristics underlying d…
▽ More
The paper investigates asymmetries effects over a nonlinear vibration energy harvester dynamics. The asymmetric system performance is compared with symmetric ones. Different asymmetry levels on restoring force and gravity action are investigated from a system-slo** angle variation. Bifurcation diagrams and basins of attraction are used to examine the local and global characteristics underlying dynamical systems under different excitation energy. The results show the adverse effects of asymmetries on system dynamics. They also reveal ways to overcome them by canceling asymmetric influence from optimal slo** angle values and improving asymmetric system performance over symmetrical ones. This comprehensive numerical study provides novel valuable insights into asymmetrical energy harvester dynamics, a wide and still less explored topic.
△ Less
Submitted 9 June, 2023; v1 submitted 20 August, 2022;
originally announced September 2022.
-
Cell response in free-packed granular systems
Authors:
Ana F. Cunha,
André F. V. Matias,
Cristóvão S. Dias,
Mariana B. Oliveira,
Nuno A. M. Araújo,
João F. Mano
Abstract:
The study of the interactions of living adherent cells with mechanically stable (visco)elastic materials enables understanding and exploiting physiological phenomena mediated by cell-extracellular communication. However, insight on the interaction of cells and surrounding objects with different stability patterns upon cell contact might unveil cell responses that may be engineered for innovative a…
▽ More
The study of the interactions of living adherent cells with mechanically stable (visco)elastic materials enables understanding and exploiting physiological phenomena mediated by cell-extracellular communication. However, insight on the interaction of cells and surrounding objects with different stability patterns upon cell contact might unveil cell responses that may be engineered for innovative applications. Here, it is hypothesized that the efficiency of cell attachment, spreading and movement across a free-packed granular bed of microparticles depend on microparticle diameter, raising the possibility of a necessary minimum traction force for the reinforcement of cell-particle bonds, and long-term cell adhesion. The results suggest that microparticles with 14-20 μm are prone to cell-mediated mobility, holding the potential of inducing early cell detachment, while objects with diameters from 38-85 μm enable long-lasting cell adhesion and proliferation. An in-silico hybrid particle-based model that addresses time-dependent biological mechanisms of cell adhesion is proposed, providing inspiration for engineering platforms to address healthcare-related challenges.
△ Less
Submitted 5 September, 2022; v1 submitted 8 August, 2022;
originally announced August 2022.
-
On the Multidimensional Random Subset Sum Problem
Authors:
Luca Becchetti,
Arthur Carvalho Walraven da Cunha,
Andrea Clementi,
Francesco d'Amore,
Hicham Lesfari,
Emanuele Natale,
Luca Trevisan
Abstract:
In the Random Subset Sum Problem, given $n$ i.i.d. random variables $X_1, ..., X_n$, we wish to approximate any point $z \in [-1,1]$ as the sum of a suitable subset $X_{i_1(z)}, ..., X_{i_s(z)}$ of them, up to error $\varepsilon$. Despite its simple statement, this problem is of fundamental interest to both theoretical computer science and statistical mechanics. More recently, it gained renewed at…
▽ More
In the Random Subset Sum Problem, given $n$ i.i.d. random variables $X_1, ..., X_n$, we wish to approximate any point $z \in [-1,1]$ as the sum of a suitable subset $X_{i_1(z)}, ..., X_{i_s(z)}$ of them, up to error $\varepsilon$. Despite its simple statement, this problem is of fundamental interest to both theoretical computer science and statistical mechanics. More recently, it gained renewed attention for its implications in the theory of Artificial Neural Networks. An obvious multidimensional generalisation of the problem is to consider $n$ i.i.d. $d$-dimensional random vectors, with the objective of approximating every point $\mathbf{z} \in [-1,1]^d$. In 1998, G. S. Lueker showed that, in the one-dimensional setting, $n=\mathcal{O}(\log \frac 1\varepsilon)$ samples guarantee the approximation property with high probability.In this work, we prove that, in $d$ dimensions, $n = \mathcal{O}(d^3\log \frac 1\varepsilon \cdot (\log \frac 1\varepsilon + \log d))$ samples suffice for the approximation property to hold with high probability. As an application highlighting the potential interest of this result, we prove that a recently proposed neural network model exhibits universality: with high probability, the model can approximate any neural network within a polynomial overhead in the number of parameters.
△ Less
Submitted 17 November, 2022; v1 submitted 28 July, 2022;
originally announced July 2022.
-
Uncertainty quantification in mechanistic epidemic models via cross-entropy approximate Bayesian computation
Authors:
Americo Cunha Jr,
David A. W. Barton,
Thiago G. Ritto
Abstract:
This paper proposes a data-driven approximate Bayesian computation framework for parameter estimation and uncertainty quantification of epidemic models, which incorporates two novelties: (i) the identification of the initial conditions by using plausible dynamic states that are compatible with observational data; (ii) learning of an informative prior distribution for the model parameters via the c…
▽ More
This paper proposes a data-driven approximate Bayesian computation framework for parameter estimation and uncertainty quantification of epidemic models, which incorporates two novelties: (i) the identification of the initial conditions by using plausible dynamic states that are compatible with observational data; (ii) learning of an informative prior distribution for the model parameters via the cross-entropy method. The new methodology's effectiveness is illustrated with the aid of actual data from the COVID-19 epidemic in Rio de Janeiro city in Brazil, employing an ordinary differential equation-based model with a generalized SEIR mechanistic structure that includes time-dependent transmission rate, asymptomatics, and hospitalizations. A minimization problem with two cost terms (number of hospitalizations and deaths) is formulated, and twelve parameters are identified. The calibrated model provides a consistent description of the available data, able to extrapolate forecasts over a few weeks, making the proposed methodology very appealing for real-time epidemic modeling.
△ Less
Submitted 2 February, 2023; v1 submitted 13 July, 2022;
originally announced July 2022.
-
Multilingual Disinformation Detection for Digital Advertising
Authors:
Zofia Trstanova,
Nadir El Manouzi,
Maryline Chen,
Andre L. V. da Cunha,
Sergei Ivanov
Abstract:
In today's world, the presence of online disinformation and propaganda is more widespread than ever. Independent publishers are funded mostly via digital advertising, which is unfortunately also the case for those publishing disinformation content. The question of how to remove such publishers from advertising inventory has long been ignored, despite the negative impact on the open internet. In th…
▽ More
In today's world, the presence of online disinformation and propaganda is more widespread than ever. Independent publishers are funded mostly via digital advertising, which is unfortunately also the case for those publishing disinformation content. The question of how to remove such publishers from advertising inventory has long been ignored, despite the negative impact on the open internet. In this work, we make the first step towards quickly detecting and red-flagging websites that potentially manipulate the public with disinformation. We build a machine learning model based on multilingual text embeddings that first determines whether the page mentions a topic of interest, then estimates the likelihood of the content being malicious, creating a shortlist of publishers that will be reviewed by human experts. Our system empowers internal teams to proactively, rather than defensively, blacklist unsafe content, thus protecting the reputation of the advertisement provider.
△ Less
Submitted 4 July, 2022;
originally announced July 2022.
-
On non-compact gradient solitons
Authors:
Antonio W. Cunha,
Erin Griffin
Abstract:
In this paper we extend existing results for generalized solitons, called $q$-solitons, to the complete case by considering non-compact solitons. By placing regularity conditions on the vector field $X$ and curvature conditions on $M$, we are able to use the chosen properties of the tensor $q$ to see that such non-compact $q$-solitons are stationary and $q$-flat.
We conclude by applying our resu…
▽ More
In this paper we extend existing results for generalized solitons, called $q$-solitons, to the complete case by considering non-compact solitons. By placing regularity conditions on the vector field $X$ and curvature conditions on $M$, we are able to use the chosen properties of the tensor $q$ to see that such non-compact $q$-solitons are stationary and $q$-flat.
We conclude by applying our results to the examples of ambient obstruction solitons, Cotton solitons, and Bach solitons to demonstrate the utility of these general theorems for various flows.
△ Less
Submitted 9 June, 2023; v1 submitted 12 July, 2022;
originally announced July 2022.
-
Application of the semigroup theory to a combustion problem in a multi-layer porous medium
Authors:
E. A. Alarcon,
M. R. Batista,
A. Cunha,
J. C. Da Mota,
R. A. Santos
Abstract:
This study proved that the Cauchy problem for a one-dimensional reaction-diffusion-convection system is locally and globally well-posed in $\mathtt{H}^2(\mathbb{R})$. The system modeled a gasless combustion front through a multi-layer porous medium when the fuel concentration in each layer was a known function. Combustion has critical practical porous media applications, such as in in-situ combust…
▽ More
This study proved that the Cauchy problem for a one-dimensional reaction-diffusion-convection system is locally and globally well-posed in $\mathtt{H}^2(\mathbb{R})$. The system modeled a gasless combustion front through a multi-layer porous medium when the fuel concentration in each layer was a known function. Combustion has critical practical porous media applications, such as in in-situ combustion processes in oil reservoirs and several other areas.
Earlier studies considered physical parameters (e.g., porosity, thermal conductivity, heat capacity, and initial fuel concentration ) constant. Here, we consider a more realistic model where these parameters are functions of the spatial variable rather than constants. Furthermore, in previous studies, we did not consider the continuity of the solution regarding the initial data and parameters, unlike the current study. This proof uses a novel approach to combustion problems in porous media. We follow the abstract semigroups theory of operators in the Hilbert space and the well-known Kato's theory for a well-posed associated initial value problem.
△ Less
Submitted 14 June, 2022;
originally announced June 2022.
-
On the reduction of nonlinear electromechanical systems
Authors:
Americo Cunha Jr,
Marcelo Pereira,
Rafael Avanço,
Angelo Marcelo Tusset,
José Manoel Balthazar
Abstract:
The present work revisits the reduction of the nonlinear dynamics of an electromechanical system through a quasi-steady state hypothesis, discussing the fundamental aspects of this type of approach and clarifying some confusing points found in the literature. Expressions for the characteristic time scales of dynamics are deduced from a physical analysis that establishes an analogy between electrom…
▽ More
The present work revisits the reduction of the nonlinear dynamics of an electromechanical system through a quasi-steady state hypothesis, discussing the fundamental aspects of this type of approach and clarifying some confusing points found in the literature. Expressions for the characteristic time scales of dynamics are deduced from a physical analysis that establishes an analogy between electromechanical dynamics and the kinetics of a chemical reaction. It provides a physical justification, supplemented by non-dimensionalization and scaling of the equations, to reduce the dynamics of interest by assuming a quasi-steady state for the electrical subsystem, eliminating the inductive term from the electrical equation. Numerical experiments help to illustrate the typical behavior of the electromechanical system, a boundary layer phenomenon near the initial dynamic state, and the validity limits of the electromechanical quasi-steady-state assumption discussed here.
△ Less
Submitted 27 May, 2022;
originally announced May 2022.
-
Electron-Affinity Time-Dependent Density Functional Theory: Formalism and Applications to Core-Excited States
Authors:
Kevin Carter-Fenk,
Leonardo A. Cunha,
Juan E. Arias-Martinez,
Martin Head-Gordon
Abstract:
The particle-hole interaction problem is longstanding within time-dependent density functional theory (TDDFT) and leads to extreme errors in the prediction of K-edge X-ray absorption spectra (XAS). We derive a linear-response formalism that uses optimized orbitals of the n-1-electron system as reference, building orbital relaxation and a proper hole into the initial density. Our approach is an exa…
▽ More
The particle-hole interaction problem is longstanding within time-dependent density functional theory (TDDFT) and leads to extreme errors in the prediction of K-edge X-ray absorption spectra (XAS). We derive a linear-response formalism that uses optimized orbitals of the n-1-electron system as reference, building orbital relaxation and a proper hole into the initial density. Our approach is an exact generalization of the static-exchange approximation that ameliorates particle-hole interaction error associated with the adiabatic approximation and reduces errors in TDDFT XAS by orders of magnitude. With a statistical performance of just 0.5 eV root-mean-square error and the same computational scaling as TDDFT under the core-valence separation approximation, we anticipate that this approach will be of great utility in XAS calculations of large systems.
△ Less
Submitted 19 September, 2022; v1 submitted 17 May, 2022;
originally announced May 2022.
-
Revisiting the Random Subset Sum problem
Authors:
Arthur da Cunha,
Francesco d'Amore,
Frédéric Giroire,
Hicham Lesfari,
Emanuele Natale,
Laurent Viennot
Abstract:
The average properties of the well-known Subset Sum Problem can be studied by the means of its randomised version, where we are given a target value $z$, random variables $X_1, \ldots, X_n$, and an error parameter $\varepsilon > 0$, and we seek a subset of the $X_i$s whose sum approximates $z$ up to error $\varepsilon$. In this setup, it has been shown that, under mild assumptions on the distribut…
▽ More
The average properties of the well-known Subset Sum Problem can be studied by the means of its randomised version, where we are given a target value $z$, random variables $X_1, \ldots, X_n$, and an error parameter $\varepsilon > 0$, and we seek a subset of the $X_i$s whose sum approximates $z$ up to error $\varepsilon$. In this setup, it has been shown that, under mild assumptions on the distribution of the random variables, a sample of size $\mathcal{O}(\log(1/\varepsilon))$ suffices to obtain, with high probability, approximations for all values in $[-1/2, 1/2]$. Recently, this result has been rediscovered outside the algorithms community, enabling meaningful progress in other fields. In this work we present an alternative proof for this theorem, with a more direct approach and resourcing to more elementary tools.
△ Less
Submitted 30 March, 2023; v1 submitted 29 April, 2022;
originally announced April 2022.
-
Accurate core excitation and ionization energies from a state-specific coupled-cluster singles and doubles approach
Authors:
Juan E. Arias-Martinez,
Leonardo A. Cunha,
Katherine J. Oosterbaan,
Joonho Lee,
Martin Head-Gordon
Abstract:
We investigate the use of orbital-optimized references in conjunction with single-reference coupled-cluster theory with single and double substitutions (CCSD) for the study of core excitations and ionizations of 18 small organic molecules, without any use of response theory or equation-of-motion formalisms. Three schemes are employed to successfully address the convergence difficulties associated…
▽ More
We investigate the use of orbital-optimized references in conjunction with single-reference coupled-cluster theory with single and double substitutions (CCSD) for the study of core excitations and ionizations of 18 small organic molecules, without any use of response theory or equation-of-motion formalisms. Three schemes are employed to successfully address the convergence difficulties associated with the coupled-cluster equations, and the spin contamination resulting from the use of a spin symmetry-broken reference, in the case of excitations. In order to gauge the inherent potential of the methods studied, an effort is made to provide reasonable basis set limit estimates for the transition energies. Overall, we find that the two best-performing schemes studied here for Delta-CCSD are capable of predicting excitation and ionization energies with errors comparable to experimental accuracies. The proposed Delta-CCSD schemes seem to fare better than the widely used equation-of-motion CCSD (EOM-CCSD) with core-valence separation protocol, with statistical errors being reduced by more than a factor of two when compared to FC-CVS-EOM-CCSD.
△ Less
Submitted 11 July, 2022; v1 submitted 27 April, 2022;
originally announced April 2022.
-
On decay of the solutions for the dispersion generalized-Benjamin-Ono and Benjamin-Ono equations
Authors:
Alysson Cunha
Abstract:
We show that uniqueness results of the kind those obtained for KdV and Schrödinger equations ([7], [28]), are not valid for the dispersion generalized-Benjamin-Ono equation in the weighted Sobolev spaces
$$H^s(\R)\cap L^2(x^{2r}dx),$$ for appropriated $s$ and $r$.
In particular, we obtain that the uniqueness result proved for the dispersion generalized-Benjamin-Ono equation ([13]), is not true…
▽ More
We show that uniqueness results of the kind those obtained for KdV and Schrödinger equations ([7], [28]), are not valid for the dispersion generalized-Benjamin-Ono equation in the weighted Sobolev spaces
$$H^s(\R)\cap L^2(x^{2r}dx),$$ for appropriated $s$ and $r$.
In particular, we obtain that the uniqueness result proved for the dispersion generalized-Benjamin-Ono equation ([13]), is not true for all pairs of solutions $u_1\neq 0$ and $u_2\neq 0$. To achieve these results we employ the techniques present in our recent work [6]. We also improve some Theorems established for the dispersion generalized-Benjamin-Ono equation and for the Benjamin-Ono equation ([13], [12]).
△ Less
Submitted 5 April, 2022;
originally announced April 2022.
-
Photometric redshift-aided classification using ensemble learning
Authors:
P. A. C. Cunha,
A. Humphrey
Abstract:
We present SHEEP, a new machine learning approach to the classic problem of astronomical source classification, which combines the outputs from the XGBoost, LightGBM, and CatBoost learning algorithms to create stronger classifiers. A novel step in our pipeline is that prior to performing the classification, SHEEP first estimates photometric redshifts, which are then placed into the data set as an…
▽ More
We present SHEEP, a new machine learning approach to the classic problem of astronomical source classification, which combines the outputs from the XGBoost, LightGBM, and CatBoost learning algorithms to create stronger classifiers. A novel step in our pipeline is that prior to performing the classification, SHEEP first estimates photometric redshifts, which are then placed into the data set as an additional feature for classification model training; this results in significant improvements in the subsequent classification performance. SHEEP contains two distinct classification methodologies: (i) Multi-class and (ii) one versus all with correction by a meta-learner. We demonstrate the performance of SHEEP for the classification of stars, galaxies, and quasars using a data set composed of SDSS and WISE photometry of 3.5 million astronomical sources. The resulting F1-scores are as follows: 0.992 for galaxies; 0.967 for quasars; and 0.985 for stars. In terms of the F1-scores for the three classes, SHEEP is found to outperform a recent RandomForest-based classification approach using an essentially identical data set. Our methodology also facilitates model and data set explainability via feature importances; it also allows the selection of sources whose uncertain classifications may make them interesting sources for follow-up observations.
△ Less
Submitted 19 May, 2022; v1 submitted 5 April, 2022;
originally announced April 2022.
-
Truck Axle Detection with Convolutional Neural Networks
Authors:
Leandro Arab Marcomini,
André Luiz Cunha
Abstract:
Axle count in trucks is important to the classification of vehicles and to the operation of road systems. It is used in the determination of service fees and in the impact on the pavement. Although axle count can be achieved with traditional methods, such as manual labor, it is increasingly possible to count axles using deep learning and computer vision methods. This paper aims to compare three de…
▽ More
Axle count in trucks is important to the classification of vehicles and to the operation of road systems. It is used in the determination of service fees and in the impact on the pavement. Although axle count can be achieved with traditional methods, such as manual labor, it is increasingly possible to count axles using deep learning and computer vision methods. This paper aims to compare three deep-learning object detection algorithms, YOLO, Faster R-CNN, and SSD, for the detection of truck axles. A dataset was built to provide training and testing examples for the neural networks. The training was done on different base models, to increase training time efficiency and to compare results. We evaluated results based on five metrics: precision, recall, mAP, F1-score, and FPS count. Results indicate that YOLO and SSD have similar accuracy and performance, with more than 96\% mAP for both models. Datasets and codes are publicly available for download.
△ Less
Submitted 3 March, 2023; v1 submitted 4 April, 2022;
originally announced April 2022.
-
The starting dates of COVID-19 multiple waves
Authors:
Paulo Roberto de Lima Gianfelice,
Ricardo Sovek Oyarzabal,
Americo Cunha Jr,
Jose Mario Vicensi Grzybowski,
Fernando da Conceição Batista,
Elbert E. N. Macau
Abstract:
The severe acute respiratory syndrome of coronavirus 2 spread globally very quickly, causing great concern at the international level due to the severity of the associated respiratory disease, the so-called COVID-19. Considering Rio de Janeiro city (Brazil) as an example, the first diagnosis of this disease occurred in March 2020, but the exact moment when the local spread of the virus started is…
▽ More
The severe acute respiratory syndrome of coronavirus 2 spread globally very quickly, causing great concern at the international level due to the severity of the associated respiratory disease, the so-called COVID-19. Considering Rio de Janeiro city (Brazil) as an example, the first diagnosis of this disease occurred in March 2020, but the exact moment when the local spread of the virus started is uncertain as the Brazilian epidemiological surveillance system was not widely prepared to detect suspected cases of COVID-19 at that time. Improvements in this surveillance system occurred over the pandemic, but due to the complex nature of the disease transmission process, specifying the exact moment of emergence of new community contagion outbreaks is a complicated task. This work aims to propose a general methodology to determine possible start dates for the multiple community outbreaks of COVID-19, using for this purpose a parametric statistical approach that combines surveillance data, nonlinear regression, and information criteria to obtain a statistical model capable of describing the multiple waves of contagion observed. The dynamics of COVID-19 in the city of Rio de Janeiro is taken as a case study, and the results suggest that the original strain of the virus was already circulating in Rio de Janeiro city as early as late February 2020, probably being massively disseminated in the population during the carnival festivities.
△ Less
Submitted 3 March, 2022;
originally announced March 2022.
-
Changes in polarization dictate necessary approximations for modeling electronic de-excitation intensity: an application to X-ray emission
Authors:
Subhayan Roychoudhury,
Leonardo A. Cunha,
Martin Head-Gordon,
David Prendergast
Abstract:
We systematically investigate the underlying relations among different levels of approximation for simulating electronic de-excitations, with a focus on modeling X-ray emission spectroscopy (XES). Using Fermi's golden rule and explicit modeling of the initial, core-excited state and the final, valence-hole state, we show that XES can be accurately modeled by using orbital optimization for the vari…
▽ More
We systematically investigate the underlying relations among different levels of approximation for simulating electronic de-excitations, with a focus on modeling X-ray emission spectroscopy (XES). Using Fermi's golden rule and explicit modeling of the initial, core-excited state and the final, valence-hole state, we show that XES can be accurately modeled by using orbital optimization for the various final states within a Slater-determinant framework. However, in this paper, we introduce a much cheaper approach reliant only on a single self-consistent field for all the final states, and show that it is typically sufficient. Further approximations reveal that these fundamentally many-body transitions can be reasonably approximated by projections of ground state orbitals, but that the ground state alone is insufficient. Furthermore, except in cases where the core-ionization induces negligible changes in polarization, linear-response approaches within the adiabatic approximation will have difficulty in accurately modeling de-excitation to the core level. Therefore, change in the net dipole moment of the valence electrons can serve as a metric for the validity of the linear-response approximation.
△ Less
Submitted 18 December, 2021;
originally announced December 2021.
-
Probing the nonequilibrium dynamics of stress, orientation and entanglements in polymer melts with orthogonal interrupted shear simulations
Authors:
Marco Aurelio Galvani Cunha,
Peter D. Olmsted,
Mark O. Robbins
Abstract:
Both entangled and unentangled polymer melts exhibit stress overshoots when subject to shearing flow. The size of the overshoot depends on the applied shear rate and is related to relaxation mechanisms such as reptation, chain stretch and convective constraint release. Previous experimental work shows that melts subjected to interrupted shear flows exhibit a smaller overshoot when sheared after pa…
▽ More
Both entangled and unentangled polymer melts exhibit stress overshoots when subject to shearing flow. The size of the overshoot depends on the applied shear rate and is related to relaxation mechanisms such as reptation, chain stretch and convective constraint release. Previous experimental work shows that melts subjected to interrupted shear flows exhibit a smaller overshoot when sheared after partial relaxation. This has been shown to be consistent with predictions by constitutive models. Here, we report molecular dynamics simulations of interrupted shear of polymer melts where the shear flow after the relaxation stage is orthogonal to the original applied flow. We observe that, for a given relaxation time, the size of the stress overshoot under orthogonal interrupted shear is larger than observed during parallel interrupted shear, which is not captured by constitutive models. Differences in maxima are also observed for overshoots in the first normal stress and chain end-to-end distance. We also show that measurements of the average number of entanglements per chain and average orientation at different scales along the chain are affected by the change in shear direction, leading to non-monotonic relaxation of the off-diagonal components of orientation and an appearance of a 'double peak' in the average number of entanglements during the transient. We propose that such complex behavior of entanglements is responsible for the increase in the overshoots of stress components, and that models of the dynamics of entanglements might be improved upon by considering a tensorial measurement of entanglements that can be coupled to orientation.
△ Less
Submitted 11 February, 2022; v1 submitted 29 November, 2021;
originally announced November 2021.
-
Relativistic Orbital Optimized Density Functional Theory for Accurate Core-Level Spectroscopy
Authors:
Leonardo A. Cunha,
Diptarka Hait,
Richard Kang,
Yuezhi Mao,
Martin Head-Gordon
Abstract:
Core-level spectra of 1s electrons of elements heavier than Ne show significant relativistic effects. We combine advances in orbital optimized DFT (OO-DFT) with the spin-free exact two-component (X2C) model for scalar relativistic effects, to study K-edge spectra of third period elements. OO-DFT/X2C is found to be quite accurate at predicting energies, yielding $\sim 0.5$ eV RMS error vs experimen…
▽ More
Core-level spectra of 1s electrons of elements heavier than Ne show significant relativistic effects. We combine advances in orbital optimized DFT (OO-DFT) with the spin-free exact two-component (X2C) model for scalar relativistic effects, to study K-edge spectra of third period elements. OO-DFT/X2C is found to be quite accurate at predicting energies, yielding $\sim 0.5$ eV RMS error vs experiment with the modern SCAN (and related) functionals. This marks a significant improvement over the $>50$ eV deviations that are typical for the popular time-dependent DFT (TDDFT) approach. Consequently, experimental spectra are quite well reproduced by OO-DFT/X2C, sans empirical shifts for alignment. OO-DFT/X2C combines high accuracy with ground state DFT cost and is thus a promising route for computing core-level spectra of third period elements. We also explored K and L edges of 3d transition metals to identify limitations of the OO-DFT/X2C approach in modeling the spectra of heavier atoms.
△ Less
Submitted 30 March, 2022; v1 submitted 16 November, 2021;
originally announced November 2021.
-
Approaching the Basis Set Limit in Gaussian-Orbital-Based Periodic Calculations with Transferability: Performance of Pure Density Functionals for Simple Semiconductors
Authors:
Joonho Lee,
Xintian Feng,
Leonardo A. Cunha,
Jerome F. Gonthier,
Evgeny Epifanovsky,
Martin Head-Gordon
Abstract:
Simulating solids with quantum chemistry methods and Gaussian-type orbitals (GTOs) has been gaining popularity. Nonetheless, there are few systematic studies that assess the basis set incompleteness error (BSIE) in these GTO-based simulations over a variety of solids. In this work, we report a GTO-based implementation for solids, and apply it to address the basis set convergence issue. We employ a…
▽ More
Simulating solids with quantum chemistry methods and Gaussian-type orbitals (GTOs) has been gaining popularity. Nonetheless, there are few systematic studies that assess the basis set incompleteness error (BSIE) in these GTO-based simulations over a variety of solids. In this work, we report a GTO-based implementation for solids, and apply it to address the basis set convergence issue. We employ a simple strategy to generate large uncontracted (unc) GTO basis sets, that we call the unc-def2-GTH sets. These basis sets exhibit systematic improvement towards the basis set limit as well as good transferability based on application to a total of 43 simple semiconductors. Most notably, we found the BSIE of unc-def2-QZVP-GTH to be smaller than 0.7 m$E_h$ per atom in total energies and 20 meV in band gaps for all systems considered here. Using unc-def2-QZVP-GTH, we report band gap benchmarks of a combinatorially designed meta generalized gradient functional (mGGA), B97M-rV, and show that B97M-rV performs similarly (a root-mean-square-deviation (RMSD) of 1.18 eV) to other modern mGGA functionals, M06-L (1.26 eV), MN15-L (1.29 eV), and SCAN (1.20 eV). This represents a clear improvement over older pure functionals such as LDA (1.71 eV) and PBE (1.49 eV) though all these mGGAs are still far from being quantitatively accurate. We also provide several cautionary notes on the use of our uncontracted bases and on future research on GTO basis set development for solids.
△ Less
Submitted 7 October, 2021; v1 submitted 29 August, 2021;
originally announced August 2021.
-
Non-intrusive polynomial chaos expansion for topology optimization using polygonal meshes
Authors:
Nilton Cuellar,
Anderson Pereira,
Ivan F. M. Menezes,
Americo Cunha Jr
Abstract:
This paper deals with the applications of stochastic spectral methods for structural topology optimization in the presence of uncertainties. A non-intrusive polynomial chaos expansion is integrated into a topology optimization algorithm to calculate low-order statistical moments of the mechanical-mathematical model response. This procedure, known as robust topology optimization, can optimize the m…
▽ More
This paper deals with the applications of stochastic spectral methods for structural topology optimization in the presence of uncertainties. A non-intrusive polynomial chaos expansion is integrated into a topology optimization algorithm to calculate low-order statistical moments of the mechanical-mathematical model response. This procedure, known as robust topology optimization, can optimize the mean of the compliance while simultaneously minimizing its standard deviation. In order to address possible variabilities in the loads applied to the mechanical system of interest, magnitude and direction of the external forces are assumed to be uncertain. In this probabilistic framework, forces are described as a random field or a set of random variables. Representation of the random objects and propagation of load uncertainties through the model are efficiently done through Karhunen-Loève and polynomial chaos expansions. We take advantage of using polygonal elements, which have been shown to be effective in suppressing checkerboard patterns and reducing mesh dependency in the solution of topology optimization problems. Accuracy and applicability of the proposed methodology are demonstrated by means of several topology optimization examples. The obtained results, which are in excellent agreement with reference solutions computed via Monte Carlo method, show that load uncertainties play an important role in optimal design of structural systems, so that they must be taken into account to ensure a reliable optimization process.
△ Less
Submitted 14 July, 2021;
originally announced August 2021.
-
The nonlinear dynamics of a bistable energy harvesting system with colored noise disturbances
Authors:
Vinicius Gonçalves Lopes,
João Victor L. L. Peterson,
Americo Cunha Jr
Abstract:
This paper deals with the nonlinear stochastic dynamics of a piezoelectric energy harvesting system subjected to a harmonic external excitation disturbed by Gaussian colored noise. A parametric analysis is conducted, where the effects of the standard deviation and the correlation time of colored noise on the system response are investigated. The numerical results suggest a strong influence of nois…
▽ More
This paper deals with the nonlinear stochastic dynamics of a piezoelectric energy harvesting system subjected to a harmonic external excitation disturbed by Gaussian colored noise. A parametric analysis is conducted, where the effects of the standard deviation and the correlation time of colored noise on the system response are investigated. The numerical results suggest a strong influence of noise on the system response for higher values of correlation time and standard deviation, and a low (noise level independent) influence for low values of correlation time.
△ Less
Submitted 14 July, 2021;
originally announced July 2021.
-
Identification of parameters in the torsional dynamics of a drilling process through Bayesian statistics
Authors:
Mario Germán Sandoval,
Americo Cunha Jr,
Rubens Sampaio
Abstract:
This work presents the estimation of the parameters of an experimental setup, which is modeled as a system with three degrees of freedom, composed by a shaft, two rotors, and a DC motor, that emulates a drilling process. A Bayesian technique is used in the estimation process, to take into account the uncertainties and variabilities intrinsic to the measurement taken, which are modeled as a noise o…
▽ More
This work presents the estimation of the parameters of an experimental setup, which is modeled as a system with three degrees of freedom, composed by a shaft, two rotors, and a DC motor, that emulates a drilling process. A Bayesian technique is used in the estimation process, to take into account the uncertainties and variabilities intrinsic to the measurement taken, which are modeled as a noise of Gaussian nature. With this procedure it is expected to check the reliability of the nominal values of the physical parameters of the test rig. An estimation process assuming that nine parameters of the experimental apparatus are unknown is conducted, and the results show that for some quantities the relative deviation with respect to the nominal values is very high. This deviation evidentiates a strong deficiency in the mathematical model used to describe the dynamic behavior of the experimental apparatus.
△ Less
Submitted 26 July, 2021;
originally announced July 2021.
-
Effect of an attached end mass in the dynamics of uncertainty nonlinear continuous random system
Authors:
Americo Cunha Jr,
Rubens Sampaio
Abstract:
This work studies the dynamics of a one dimensional elastic bar with random elastic modulus and prescribed boundary conditions, say, fixed at one end, and attached to a lumped mass and two springs (one linear and another nonlinear) on the other extreme. The system analysis assumes that the elastic modulus has gamma probability distribution and uses Monte Carlo simulations to compute the propagatio…
▽ More
This work studies the dynamics of a one dimensional elastic bar with random elastic modulus and prescribed boundary conditions, say, fixed at one end, and attached to a lumped mass and two springs (one linear and another nonlinear) on the other extreme. The system analysis assumes that the elastic modulus has gamma probability distribution and uses Monte Carlo simulations to compute the propagation of uncertainty in this continuous--discrete system. After describing the deterministic and the stochastic modeling of the system, some configurations of the model are analyzed in order to characterize the effect of the lumped mass in the overall behavior of this dynamical system.
△ Less
Submitted 26 July, 2021;
originally announced July 2021.
-
Quantification of parametric uncertainties induced by irregular soil loading in orchard tower sprayer nonlinear dynamics
Authors:
Americo Cunha Jr,
Jorge Luis Palacios Felix,
José Manoel Balthazar
Abstract:
This paper deals with the nonlinear stochastic dynamics of an orchard tower sprayer subjected to random excitations due to soil irregularities. A consistent stochastic model of uncertainties is constructed to describe random loadings and to predict variabilities in mechanical system response. The dynamics is addressed in time and frequency domains. Monte Carlo method is employed to compute the pro…
▽ More
This paper deals with the nonlinear stochastic dynamics of an orchard tower sprayer subjected to random excitations due to soil irregularities. A consistent stochastic model of uncertainties is constructed to describe random loadings and to predict variabilities in mechanical system response. The dynamics is addressed in time and frequency domains. Monte Carlo method is employed to compute the propagation of uncertainties through the stochastic model. Numerical simulations reveals a very rich dynamics, which is able to produce chaos. This numerical study also indicates that lateral vibrations follow a direct energy cascade law. A probabilistic analysis reveals the possibility of large lateral vibrations during the equipment operation.
△ Less
Submitted 14 July, 2021;
originally announced July 2021.
-
Global sensitivity analysis of asymmetric energy harvesters
Authors:
João Pedro Norenberg,
Americo Cunha Jr,
Samuel da Silva,
Paulo Sérgio Varoto
Abstract:
Parametric variability is inevitable in actual energy harvesters. It can significantly affect crucial aspects of the system performance, especially in harvesting systems that present geometric parameters, material properties, or excitation conditions that are susceptible to small perturbations. This work aims to develop an investigation to identify the most critical parameters in the dynamic behav…
▽ More
Parametric variability is inevitable in actual energy harvesters. It can significantly affect crucial aspects of the system performance, especially in harvesting systems that present geometric parameters, material properties, or excitation conditions that are susceptible to small perturbations. This work aims to develop an investigation to identify the most critical parameters in the dynamic behavior of asymmetric bistable energy harvesters with nonlinear piezoelectric coupling, considering the variability of their physical and excitation properties. For this purpose, a global sensitivity analysis based on orthogonal variance decomposition, employing Sobol indices, is performed to quantify the effect of the harvester parameters on the variance of the recovered power. This technique quantifies the variance concerning each parameter individually and collectively regarding the total variation of the model. The results indicate that the frequency and amplitude of excitation, asymmetric terms and electrical proprieties of the piezoelectric coupling are the most critical parameters that affect the mean power harvested. It is also shown that the order of importance of the parameters can change according to the stability of the harvester's dynamic response. In this way, a better understanding of the system under analysis is obtained since the study allows the identification of vital parameters that rule the change of dynamic behavior and therefore constitutes a powerful tool in the robust design, optimization, and response prediction of nonlinear harvesters.
△ Less
Submitted 25 May, 2022; v1 submitted 9 July, 2021;
originally announced July 2021.
-
Exploring Spin Symmetry-Breaking Effects for Static Field Ionization of Atoms: Is There an Analog to the Coulson-Fischer Point in Bond Dissociation?
Authors:
Leonardo A. Cunha,
Joonho Lee,
Diptarka Hait,
C. William McCurdy,
Martin Head-Gordon
Abstract:
Löwdin's symmetry dilemma is an ubiquitous issue in approximate quantum chemistry. In the context of Hartree-Fock (HF) theory, the use of Slater determinants with some imposed constraints to preserve symmetries of the exact problem may lead to physically unreasonable potential energy surfaces. On the other hand, lifting these constraints leads to the so-called broken symmetry solutions that usuall…
▽ More
Löwdin's symmetry dilemma is an ubiquitous issue in approximate quantum chemistry. In the context of Hartree-Fock (HF) theory, the use of Slater determinants with some imposed constraints to preserve symmetries of the exact problem may lead to physically unreasonable potential energy surfaces. On the other hand, lifting these constraints leads to the so-called broken symmetry solutions that usually provide better energetics, at the cost of losing information about good quantum numbers that describe the state of the system. This behavior has been previously extensively studied in the context of bond dissociation. This paper studies the behavior of different classes of Hartree-Fock spin polarized solutions (restricted, unrestricted, generalized) in the context of ionization by strong static electric fields. We find that, for simple two-electron systems, UHF is able to provide a qualitatively good description of states involved during the ionization process (neutral, singly-ionized and doubly ionized states), whereas RHF fails to describe the singly ionized state. For more complex systems, even though UHF is able to capture some of the expected characteristics of the ionized states, it is constrained to a single $M_s$ (diabatic) manifold in the energy surface as a function of field intensity. In this case a better qualitative picture can be painted by GHF as it is able to explore different spin manifolds and follow the lowest solution due to lack of collinearity constraints on the spin quantization axis.
△ Less
Submitted 7 June, 2021;
originally announced June 2021.
-
Assessment of a transient homogeneous reactor through in situ adaptive tabulation
Authors:
Americo Cunha Jr,
Luis Fernando Figueira da Silva
Abstract:
The development of computational models for the numerical simulation of chemically reacting flows operating in the turbulent regime requires the solution of partial differential equations that represent the balance of mass, linear momentum, chemical species, and energy. The chemical reactions of the model may involve detailed reaction mechanisms for the description of the physicochemical phenomena…
▽ More
The development of computational models for the numerical simulation of chemically reacting flows operating in the turbulent regime requires the solution of partial differential equations that represent the balance of mass, linear momentum, chemical species, and energy. The chemical reactions of the model may involve detailed reaction mechanisms for the description of the physicochemical phenomena. One of the biggest challenges is the stiffness of the numerical simulation of these models and the nonlinear nature of species rate of reaction. This work presents a study of in situ adaptive tabulation (ISAT) technique, focusing on the accuracy, efficiency, and memory usage in the simulation of homogeneous stirred reactor models using simple and complex reaction mechanisms. The combustion of carbon monoxide with oxygen and methane with air mixtures are considered, using detailed reaction mechanisms with 4 and 53 species, 3 and 325 reactions, respectively. The results of these simulations indicate that the developed implementation of ISAT technique has a absolute global error smaller than 1 %. Moreover, ISAT technique provides gains, in terms of computational time, of up to 80% when compared with the direct integration of the full chemical kinetics. However, in terms of memory usage the present implementation of ISAT technique is found to be excessively demanding.
△ Less
Submitted 27 May, 2021;
originally announced June 2021.
-
Enhancing the performance of a bistable energy harvesting device via the cross-entropy method
Authors:
Americo Cunha Jr
Abstract:
This work deals with the solution of a non-convex optimization problem to enhance the performance of an energy harvesting device, which involves a nonlinear objective function and a discontinuous constraint. This optimization problem, which seeks to find a suitable configuration of parameters that maximize the electrical power recovered by a bistable energy harvesting system, is formulated in term…
▽ More
This work deals with the solution of a non-convex optimization problem to enhance the performance of an energy harvesting device, which involves a nonlinear objective function and a discontinuous constraint. This optimization problem, which seeks to find a suitable configuration of parameters that maximize the electrical power recovered by a bistable energy harvesting system, is formulated in terms of the dynamical system response and a binary classifier obtained from 0 to 1 test for chaos. A stochastic solution strategy that combines penalization and the cross-entropy method is proposed and numerically tested. Computational experiments are conducted to address the performance of the proposed optimization approach by comparison with a reference solution, obtained via an exhaustive search in a refined numerical mesh. The obtained results illustrate the effectiveness and robustness of the cross-entropy optimization strategy (even in the presence of noise or in moderately higher dimensions), showing that the proposed framework may be a very useful and powerful tool to solve optimization problems involving nonlinear energy harvesting dynamical systems.
△ Less
Submitted 27 May, 2021;
originally announced May 2021.
-
Computational modeling of the nonlinear stochastic dynamics of horizontal drillstrings
Authors:
Americo Cunha Jr,
Christian Soize,
Rubens Sampaio
Abstract:
This work intends to analyze the nonlinear stochastic dynamics of drillstrings in horizontal configuration. For this purpose, it considers a beam theory, with effects of rotatory inertia and shear deformation, which is capable of reproducing the large displacements that the beam undergoes. The friction and shock effects, due to beam/borehole wall transversal impacts, as well as the force and torqu…
▽ More
This work intends to analyze the nonlinear stochastic dynamics of drillstrings in horizontal configuration. For this purpose, it considers a beam theory, with effects of rotatory inertia and shear deformation, which is capable of reproducing the large displacements that the beam undergoes. The friction and shock effects, due to beam/borehole wall transversal impacts, as well as the force and torque induced by bit-rock interaction, are also considered in the model. Uncertainties of bit-rock interaction model are taken into account using a parametric probabilistic approach. Numerical simulations have shown that the mechanical system of interest has a very rich nonlinear stochastic dynamics, which generate phenomena such as bit-bounce, stick-slip, and transverse impacts. A study aiming to maximize the drilling process efficiency, varying drillstring velocities of translation and rotation is presented. Also, the work presents the definition and solution of two optimizations problems, one deterministic and one robust, where the objective is to maximize drillstring rate of penetration into the soil respecting its structural limits.
△ Less
Submitted 27 May, 2021;
originally announced May 2021.
-
On the nonlinear stochastic dynamics of a continuous system with discrete attached elements
Authors:
Americo Cunha Jr,
Rubens Sampaio
Abstract:
This paper presents a theoretical study on the influence of a discrete element in the nonlinear dynamics of a continuous mechanical system subject to randomness in the model parameters. This system is composed by an elastic bar, attached to springs and a lumped mass, with a random elastic modulus and subjected to a Gaussian white-noise distributed external force. One can note that the dynamic beha…
▽ More
This paper presents a theoretical study on the influence of a discrete element in the nonlinear dynamics of a continuous mechanical system subject to randomness in the model parameters. This system is composed by an elastic bar, attached to springs and a lumped mass, with a random elastic modulus and subjected to a Gaussian white-noise distributed external force. One can note that the dynamic behavior of the bar is significantly altered when the lumped mass is varied, becoming, on the right extreme and for large values of the concentrated mass, similar to a mass-spring system. It is also observed that the system response is more influenced by the randomness for small values of the lumped mass. The study conducted also show an irregular distribution of energy through the spectrum of frequencies, asymmetries and multimodal behavior in the probability distributions of the lumped mass velocity.
△ Less
Submitted 20 May, 2021;
originally announced May 2021.
-
Uncertainty quantification through Monte Carlo method in a cloud computing setting
Authors:
A. Cunha Jr,
R. Nasser,
R. Sampaio,
H. Lopes,
K. Breitman
Abstract:
The Monte Carlo (MC) method is the most common technique used for uncertainty quantification, due to its simplicity and good statistical results. However, its computational cost is extremely high, and, in many cases, prohibitive. Fortunately, the MC algorithm is easily parallelizable, which allows its use in simulations where the computation of a single realization is very costly. This work presen…
▽ More
The Monte Carlo (MC) method is the most common technique used for uncertainty quantification, due to its simplicity and good statistical results. However, its computational cost is extremely high, and, in many cases, prohibitive. Fortunately, the MC algorithm is easily parallelizable, which allows its use in simulations where the computation of a single realization is very costly. This work presents a methodology for the parallelization of the MC method, in the context of cloud computing. This strategy is based on the MapReduce paradigm, and allows an efficient distribution of tasks in the cloud. This methodology is illustrated on a problem of structural dynamics that is subject to uncertainties. The results show that the technique is capable of producing good results concerning statistical moments of low order. It is shown that even a simple problem may require many realizations for convergence of histograms, which makes the cloud computing strategy very attractive (due to its high scalability capacity and low-cost). Additionally, the results regarding the time of processing and storage space usage allow one to qualify this new methodology as a solution for simulations that require a number of MC realizations beyond the standard.
△ Less
Submitted 20 May, 2021;
originally announced May 2021.
-
On uniqueness results for the Benjamin equation
Authors:
Alysson Cunha
Abstract:
We prove that the uniqueness results obtained in \cite{urrea} for the Benjamin equation, cannot be extended for any pair of non-vanishing solutions. On the other hand, we study uniqueness results of solutions of the Benjamin equation. With this purpose, we showed that for any solutions $u$ and $v$ defined in $\R\times [0,T]$, if there exists an open set $I\subset \R$ such that $u(\cdot,0)$ and…
▽ More
We prove that the uniqueness results obtained in \cite{urrea} for the Benjamin equation, cannot be extended for any pair of non-vanishing solutions. On the other hand, we study uniqueness results of solutions of the Benjamin equation. With this purpose, we showed that for any solutions $u$ and $v$ defined in $\R\times [0,T]$, if there exists an open set $I\subset \R$ such that $u(\cdot,0)$ and $v(\cdot,0)$ agree in $I$, $\p_t u(\cdot,0)$ and $\p_t v(\cdot,0)$ agree in $I$, then $u\equiv v$. To finish, a better version of this uniqueness result is also established.
△ Less
Submitted 1 July, 2021; v1 submitted 17 May, 2021;
originally announced May 2021.