Search | arXiv e-print repository

ToffA-DSPL: an approach of trade-off analysis for designing dynamic software product lines

Authors: Michelle Larissa Luciano Carvalho, Paulo Cesar Masiero, Ismayle de Sousa Santos, Eduardo Santana de Almeida

Abstract: Software engineers have adopted the Dynamic Software Product Lines (DSPL) engineering practices to develop Dynamically Adaptable Software (DAS). DAS is seen as a DSPL application and must cope with a large number of configurations of features, Non-functional Requirements (NFRs), and contexts. However, the accurate representation of the impact of features over NFRs and contexts for the identification of optimal configurations is not a trivial task. Software engineers need to have domain knowledge and design DAS before deploying to satisfy those requirements. Aiming to handle them, we proposed an approach of Trade-off Analysis for DSPL at design-time, named ToffA-DSPL. It deals with the configuration selection process considering interactions between NFRs and contexts. We performed an exploratory study based on simulations to identify the usefulness of the ToffA-DSPL approach. In general, the configurations suggested by ToffA-DSPL provide high satisfaction levels of NFRs. Based on simulations, we evidenced that our approach aims to explore reuse and is useful for generating valid and optimal configurations. In addition, ToffA-DSPL enables software engineers to conduct trade-off analysis, evaluate changes in the context feature, and define an adaptation model from optimal configurations found in the analysis.

Submitted 1 July, 2024; originally announced July 2024.

arXiv:2407.00273 [pdf]

Please do not go: understanding turnover of software engineers from different perspectives

Authors: Michelle Larissa Luciano Carvalho, Paulo da Silva Cruz, Eduardo Santana de Almeida, Paulo Anselmo da Mota Silveira Neto, Rafael Prikladnicki

Abstract: Turnover is the movement of professional employees into and out of a company in a given period. Such a phenomenon significantly impacts the software industry since it generates knowledge loss, delays in the schedule, and increased costs in the final project. Despite the efforts made by researchers and professionals to minimize the turnover, more studies are needed to understand the motivation that drives Software Engineers to leave their jobs and the main strategies CEOs adopt to retain these professionals in software development companies. In this paper, we contribute a mixed methods study involving semi-structured interviews with Software Engineers and CEOs to obtain a wider opinion of these professionals about turnover and a subsequent validation survey with additional software engineers to check and review the insights from interviews. In studying such aspects, we identified 19 different reasons for software engineers' turnover and 18 more efficient strategies used in the software development industry to reduce it. Our findings provide several implications for industry and academia, which can drive future research.

Submitted 28 June, 2024; originally announced July 2024.

arXiv:2406.05713 [pdf, other]

Enhancing the light yield of He:CF$_4$ based gaseous detector

Authors: F. D. Amaro, R. Antonietti, E. Baracchini, L. Benussi, S. Bianco, R. Campagnola, C. Capoccia, M. Caponero, D. S. Cardoso, L. G. M. de Carvalho, G. Cavoto, I. Abritta Costa, A. Croce, E. Dané, G. Dho, F. Di Giambattista, E. Di Marco, M. D'Astolfo, G. D'Imperio, D. Fiorina, F. Iacoangeli, Z. Islam, H. P. L. Jùnior, E. Kemp, G. Maccarrone, et al. (29 additional authors not shown)

Abstract: The CYGNO experiment aims to build a large ($\mathcal{O}(10)$ m$^3$) directional detector for rare event searches, such as nuclear recoils (NRs) induced by dark matter (DM) particles such as weakly interacting massive particles (WIMPs). The detector concept comprises a time projection chamber (TPC), filled with a He:CF$_4$ 60/40 scintillating gas mixture at room temperature and atmospheric pressure, equipped with an amplification stage made of a stack of three gas electron multipliers (GEMs) which are coupled to an optical readout. The latter consists of scientific CMOS (sCMOS) cameras and photomultiplier tubes (PMTs). The maximisation of the light yield of the amplification stage plays a major role in the determination of the energy threshold of the experiment. In this paper, we simulate the effect of the addition of a strong electric field below the last GEM plane on the GEM field structure and we experimentally test it by means of a 10$\times$10 cm$^2$ readout area prototype. The experimental measurements analyse stacks of different GEMs and helium concentrations in the gas mixture combined with this extra electric field, studying their performance in terms of light yield, energy resolution and intrinsic diffusion. It is found that the use of this additional electric field permits large light yield increases without degrading intrinsic characteristics of the amplification stage with respect to the regular use of GEMs.

Submitted 9 June, 2024; originally announced June 2024.

arXiv:2406.04825 [pdf, other]

Graph Mining under Data scarcity

Authors: Appan Rakaraddi, Lam Siew-Kei, Mahardhika Pratama, Marcus de Carvalho

Abstract: A multitude of deep learning models have been proposed for node classification in graphs. However, they tend to perform poorly under labeled-data scarcity. Although Few-shot learning for graphs has been introduced to overcome this problem, the existing models are not easily adaptable for generic graph learning frameworks like Graph Neural Networks (GNNs). Our work proposes an Uncertainty Estimator framework that can be applied on top of any generic GNN backbone network (which are typically designed for supervised/semi-supervised node classification) to improve the node classification performance. A neural network is used to model the Uncertainty Estimator as a probability distribution rather than probabilistic discrete scalar values. We train these models under the classic episodic learning paradigm in the $n$-way, $k$-shot fashion, in an end-to-end setting. Our work demonstrates that implementation of the uncertainty estimator on a GNN backbone network improves the classification accuracy under Few-shot setting without any meta-learning specific architecture. We conduct experiments on multiple datasets under different Few-shot settings and different GNN-based backbone networks. Our method outperforms the baselines, which demonstrates the efficacy of the Uncertainty Estimator for Few-shot node classification on graphs with a GNN.

Submitted 11 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

Comments: 7 pages, 2 figures
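
Note: the $n$-way, $k$-shot episodic setup mentioned in the abstract above can be illustrated with a minimal sketch. The function name `sample_episode`, the `q_query` parameter, and the flat list of node labels are hypothetical and are not taken from the paper; the sketch only shows how the support and query sets of a single episode might be drawn from labeled nodes.

```python
import random
from collections import defaultdict


def sample_episode(labels, n_way, k_shot, q_query, rng):
    """Draw one n-way, k-shot episode from a list of node labels.

    Returns (classes, support, query), where support and query are
    lists of node indices. Illustrative only; not the paper's code.
    """
    # Group node indices by class label.
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    # Keep only classes with enough nodes for support plus query.
    eligible = sorted(c for c, idxs in by_class.items()
                      if len(idxs) >= k_shot + q_query)
    if len(eligible) < n_way:
        raise ValueError("not enough classes with sufficient labeled nodes")

    # Pick n classes, then k support and q query nodes per class.
    classes = rng.sample(eligible, n_way)
    support, query = [], []
    for c in classes:
        picked = rng.sample(by_class[c], k_shot + q_query)
        support.extend(picked[:k_shot])
        query.extend(picked[k_shot:])
    return classes, support, query
```

Calling `sample_episode(labels, 3, 2, 4, random.Random(0))` on a label list with at least three sufficiently populated classes yields 6 support indices and 12 disjoint query indices.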

arXiv:2406.03288 [pdf, other]

Embarrassingly Parallel GFlowNets

Authors: Tiago da Silva, Luiz Max Carvalho, Amauri Souza, Samuel Kaski, Diego Mesquita

Abstract: GFlowNets are a promising alternative to MCMC sampling for discrete compositional random variables. Training GFlowNets requires repeated evaluations of the unnormalized target distribution or reward function. However, for large-scale posterior sampling, this may be prohibitive since it requires traversing the data several times. Moreover, if the data are distributed across clients, employing standard GFlowNets leads to intensive client-server communication. To alleviate both these issues, we propose embarrassingly parallel GFlowNet (EP-GFlowNet). EP-GFlowNet is a provably correct divide-and-conquer method to sample from product distributions of the form $R(\cdot) \propto R_1(\cdot) ... R_N(\cdot)$ -- e.g., in parallel or federated Bayes, where each $R_n$ is a local posterior defined on a data partition. First, in parallel, we train a local GFlowNet targeting each $R_n$ and send the resulting models to the server. Then, the server learns a global GFlowNet by enforcing our newly proposed \emph{aggregating balance} condition, requiring a single communication step. Importantly, EP-GFlowNets can also be applied to multi-objective optimization and model reuse. Our experiments illustrate the EP-GFlowNets' effectiveness on many tasks, including parallel Bayesian phylogenetics, multi-objective multiset, sequence generation, and federated Bayesian structure learning.

Submitted 5 June, 2024; originally announced June 2024.

Comments: Accepted to ICML 2024

arXiv:2406.01756 [pdf, ps, other]

On the completeness of several fortification-interdiction games in the Polynomial Hierarchy

Authors: Alberto Boggio Tomasaz, Margarida Carvalho, Roberto Cordone, Pierre Hosteins

Abstract: Fortification-interdiction games are tri-level adversarial games where two opponents act in succession to protect, disrupt and simply use an infrastructure for a specific purpose. Many such games have been formulated and tackled in the literature through specific algorithmic methods, however very few investigations exist on the completeness of such fortification problems in order to locate them rigorously in the polynomial hierarchy. We clarify the completeness status of several well-known fortification problems, such as the Tri-level Interdiction Knapsack Problem with unit fortification and attack weights, the Max-flow Interdiction Problem and Shortest Path Interdiction Problem with Fortification, the Multi-level Critical Node Problem with unit weights, as well as a well-studied electric grid defence planning problem. For all of these problems, we prove their completeness either for the $Σ^p_2$ or the $Σ^p_3$ class of the polynomial hierarchy. We also prove that the Multi-level Fortification-Interdiction Knapsack Problem with an arbitrary number of protection and interdiction rounds and unit fortification and attack weights is complete for any level of the polynomial hierarchy, therefore providing a useful basis for further attempts at proving the completeness of protection-interdiction games at any level of said hierarchy.

Submitted 3 June, 2024; originally announced June 2024.

arXiv:2405.12361 [pdf, other]

The influence of ionized gas kinematics on HII galaxies. The cases of Tol 1004-296 and Tol 0957-278

Authors: Henri Plana, Vitor G. Alves, Maiara S. Carvalho

Abstract: Blue Compact Galaxies (BCGs), also known as HII galaxies, are dwarf, star-forming objects with relatively simple dynamics, which allows for the investigation of star formation mechanisms in a cleaner manner compared to late-type objects. In this study, we have examined various characteristics of the interstellar medium, in connection with the kinematics and dynamics of ionized gas, in Tol 1004-296 and Tol 0957-278. These two objects were observed using the SOAR Integral Field Spectrometer (SIFS) attached to the Southern Observatory for Astrophysical Research (SOAR). Both galaxies were observed with two gratings: one with medium resolution for monochromatic and abundance maps, and another with high resolution for kinematics and profile analysis. Additionally, we conducted an analysis on the velocity and velocity dispersion maps using intensity-velocity dispersion (I - $σ$) and velocity-velocity dispersion (Vr - $σ$) diagrams. Neither object exhibits a rotation pattern, and only Tol 1004-296 shows a velocity gradient between the two principal knots. However, the study reveals the significant role played by velocity dispersion in the star formation process. Specifically, we identified a relationship between monochromatic intensity, metallicity, and velocity dispersion, where high emission corresponds to low metallicity and low velocity dispersion. Tol 1004-296, in particular, exhibits a distinctive linear high velocity dispersion pattern between the two main knots, suggesting that both star formation sites are pushing the gas in opposite directions.

Submitted 20 May, 2024; originally announced May 2024.

Comments: 22 pages, 23 figures. Accepted for publication in MNRAS

arXiv:2405.05836 [pdf, other]

Informed Decision-Making through Advancements in Open Set Recognition and Unknown Sample Detection

Authors: Atefeh Mahdavi, Marco Carvalho

Abstract: Machine learning-based techniques open up many opportunities and improvements to derive deeper and more practical insights from data that can help businesses make informed decisions. However, the majority of these techniques focus on the conventional closed-set scenario, in which the label spaces for the training and test sets are identical. Open set recognition (OSR) aims to bring classification tasks in a situation that is more like reality, which focuses on classifying the known classes as well as handling unknown classes effectively. In such an open-set problem the gathered samples in the training set cannot encompass all the classes and the system needs to identify unknown samples at test time. On the other hand, building an accurate and comprehensive model in a real dynamic environment presents a number of obstacles, because it is prohibitively expensive to train for every possible example of unknown items, and the model may fail when tested in testbeds. This study provides an algorithm exploring a new representation of feature space to improve classification in OSR tasks. The efficacy and efficiency of business processes and decision-making can be improved by integrating OSR, which offers more precise and insightful predictions of outcomes. We demonstrate the performance of the proposed method on three established datasets. The results indicate that the proposed model outperforms the baseline methods in accuracy and F1-score.

Submitted 9 May, 2024; originally announced May 2024.

Comments: Accepted for proceedings of the 57th Hawaii International Conference on System Sciences: 10 pages, 6 figures, 3-6 January 2024, Honolulu, United States

Journal ref: Atefeh, M., & Marco, C. (2024). "Informed Decision-Making through Advancements in Open Set Recognition and Unknown Sample Detection." Proceedings of the 57th Hawaii International Conference on System Sciences, 1090-1999

arXiv:2405.02439 [pdf, ps, other]

Dynamic Single Facility Location under Cumulative Customer Demand

Authors: Warley Almeida Silva, Margarida Carvalho, Sanjay Dominik Jena

Abstract: Dynamic facility location problems aim at placing one or more valuable resources over a planning horizon to meet customer demand. Existing literature commonly assumes that customer demand quantities are defined independently for each time period. In many planning contexts, however, unmet demand carries over to future time periods. Unmet demand at some time periods may therefore affect decisions of subsequent time periods. This work studies a novel location problem, where the decision maker relocates a single temporary facility over time to capture cumulative customer demand. We propose two mixed-integer programming models for this problem, and show that one of them has a tighter continuous relaxation and allows the representation of more general customer demand behaviour. We characterize the computational complexity for this problem, and analyze which problem characteristics result in NP-hardness. We then propose an exact branch-and-Benders-cut method, and show how optimality cuts can be computed efficiently through an analytical procedure. Computational experiments show that our method is approximately 30 times faster than solving the tighter formulation directly. Our results also quantify the benefit of accounting for cumulative customer demand within the optimization framework, since the corresponding planning solutions perform much better than those obtained by ignoring cumulative demand or employing myopic heuristics.

Submitted 3 May, 2024; originally announced May 2024.

Comments: 35 pages, 4 figures

arXiv:2404.08480 [pdf, other]

Decoding AI: The inside story of data analysis in ChatGPT

Authors: Ozan Evkaya, Miguel de Carvalho

Abstract: As a result of recent advancements in generative AI, the field of Data Science is prone to various changes. This review critically examines the Data Analysis (DA) capabilities of ChatGPT assessing its performance across a wide range of tasks. While DA provides researchers and practitioners with unprecedented analytical capabilities, it is far from being perfect, and it is important to recognize and address its limitations.

Submitted 12 April, 2024; originally announced April 2024.

Comments: 15 pages with figures and appendix

arXiv:2404.06266 [pdf, other]

Quantifying the U $5f$ covalence and degree of localization in U intermetallics

Authors: Andrea Marino, Denise S. Christovam, Daisuke Takegami, Johannes Falke, Miguel M. F. Carvalho, Takaki Okauchi, Chung-Fu Chang, Simone G. Altendorf, Andrea Amorese, Martin Sundermann, Andrei Gloskovskii, Hlynur Gretarsson, Bernhard Keimer, Alexandr V. Andreev, Ladislav Havela, Andreas Leithe-Jasper, Andrea Severing, Jan Kunes, Liu Hao Tjeng, Atsushi Hariki

Abstract: A procedure for quantifying the U $5f$ electrons' covalence and degree of localization in U intermetallic compounds is presented. To this end, bulk sensitive hard and soft x-ray photoelectron spectroscopy were utilized in combination with density-functional theory (DFT) plus dynamical mean-field theory (DMFT) calculations. The energy dependence of the photoionization cross-sections allows the disentanglement of the U\,$5f$ contribution to the valence band from the various other atomic subshells so that the computational parameters in the DFT\,+\,DMFT can be reliably determined. Applying this method to UGa$_2$ and UB$_2$ as model compounds from opposite ends of the (de)localization range, we have achieved excellent simulations of the valence band and core-level spectra. The width in the distribution of atomic U\,$5f$ configurations contributing to the ground state, as obtained from the calculations, quantifies the correlated nature and degree of localization of the U\,5$f$. The findings permit answering the longstanding question why different spectroscopic techniques give seemingly different numbers for the U 5$f$ valence in intermetallic U compounds.

Submitted 9 April, 2024; originally announced April 2024.

Comments: 14 pages, 9 figures

arXiv:2404.02453 [pdf, other]

Exploring the Connection Between the Normalized Power Prior and Bayesian Hierarchical Models

Authors: Yueqi Shen, Matthew A. Psioda, Luiz M. Carvalho, Joseph G. Ibrahim

Abstract: The power prior is a popular class of informative priors for incorporating information from historical data. It involves raising the likelihood for the historical data to a power, which acts as a discounting parameter. When the discounting parameter is modeled as random, the normalized power prior is recommended. Bayesian hierarchical modeling is a widely used method for synthesizing information from different sources, including historical data. In this work, we examine the analytical relationship between the normalized power prior (NPP) and Bayesian hierarchical models (BHM) for \emph{i.i.d.} normal data. We establish a direct relationship between the prior for the discounting parameter of the NPP and the prior for the variance parameter of the BHM. Such a relationship is first established for the case of a single historical dataset, and then extended to the case with multiple historical datasets with dataset-specific discounting parameters. For multiple historical datasets, we develop and establish theory for the BHM-matching NPP (BNPP) which establishes dependence between the dataset-specific discounting parameters leading to inferences that are identical to the BHM. Establishing this relationship not only justifies the NPP from the perspective of hierarchical modeling, but also provides insight on prior elicitation for the NPP. We present strategies on inducing priors on the discounting parameter based on hierarchical models, and investigate the borrowing properties of the BNPP.

Submitted 3 April, 2024; originally announced April 2024.
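
Note: for readers unfamiliar with the terms in the abstract above, the standard textbook definitions can be sketched as follows. The notation is an assumption chosen for illustration and is not taken from the paper: $D_0$ is the historical data, $L(\theta \mid D_0)$ its likelihood, $\pi_0(\theta)$ an initial prior, and $a_0 \in [0, 1]$ the discounting parameter. With $a_0$ fixed, the power prior is usually written as

$$\pi(\theta \mid D_0, a_0) \propto L(\theta \mid D_0)^{a_0} \, \pi_0(\theta).$$

When $a_0$ is treated as random with prior $\pi(a_0)$, the normalized power prior divides by the normalizing constant, which depends on $a_0$:

$$\pi(\theta, a_0 \mid D_0) = \frac{L(\theta \mid D_0)^{a_0} \, \pi_0(\theta)}{\int L(\theta' \mid D_0)^{a_0} \, \pi_0(\theta') \, d\theta'} \; \pi(a_0).$$

Setting $a_0 = 0$ discards the historical data entirely, while $a_0 = 1$ weights it the same as current data.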

arXiv:2403.12923 [pdf, other]

Solving Combinatorial Pricing Problems using Embedded Dynamic Programming Models

Authors: Quang Minh Bui, Margarida Carvalho, José Neto

Abstract: The combinatorial pricing problem (CPP) is a bilevel problem in which the leader maximizes their revenue by imposing tolls on certain items that they can control. Based on the tolls set by the leader, the follower selects a subset of items corresponding to an optimal solution of a combinatorial optimization problem. To accomplish the leader's goal, the tolls need to be sufficiently low to discourage the follower from choosing the items offered by the competitors. In this paper, we derive a single-level reformulation for the CPP by rewriting the follower's problem as a longest path problem using a dynamic programming model, and then taking its dual and applying strong duality. We proceed to solve the reformulation in a dynamic fashion with a cutting plane method. We apply this methodology to 2 distinct dynamic programming models, namely, a novel formulation designated as selection diagram and the well-known decision diagram. We also produce numerical results to evaluate their performances across 3 different specializations of the CPP and a closely related problem that is the knapsack interdiction problem. Our results showcase the potential of the 2 proposed reformulations over the natural value function approach, expanding the set of tools to solve combinatorial bilevel programs.

Submitted 19 March, 2024; originally announced March 2024.

MSC Class: 90C46; 90C27; 90C39

arXiv:2403.07132 [pdf, other]

A New Machine Learning Dataset of Bulldog Nostril Images for Stenosis Degree Classification

Authors: Gabriel Toshio Hirokawa Higa, Joyce Katiuccia Medeiros Ramos Carvalho, Paolo Brito Pascoalini Zanoni, Gisele Braziliano de Andrade, Hemerson Pistori

Abstract: Brachycephaly, a conformation trait in some dog breeds, causes BOAS, a respiratory disorder that affects the health and welfare of the dogs with various symptoms. In this paper, a new annotated dataset composed of 190 images of bulldogs' nostrils is presented. Three degrees of stenosis are approximately equally represented in the dataset: mild, moderate and severe stenosis. The dataset also comprises a small quantity of non stenotic nostril images. To the best of our knowledge, this is the first image dataset addressing this problem. Furthermore, deep learning is investigated as an alternative to automatically infer stenosis degree using nostril images. In this work, several neural networks were tested: ResNet50, MobileNetV3, DenseNet201, SwinV2 and MaxViT. For this evaluation, the problem was modeled in two different ways: first, as a three-class classification problem (mild or open, moderate, and severe); second, as a binary classification problem, with severe stenosis as target. For the multiclass classification, a maximum median f-score of 53.77\% was achieved by the MobileNetV3. For binary classification, a maximum median f-score of 72.08\% has been reached by ResNet50, indicating that the problem is challenging but possibly tractable.

Submitted 11 March, 2024; originally announced March 2024.

arXiv:2402.15486 [pdf, ps, other]

Solving Two-Stage Stochastic Programs with Endogenous Uncertainty via Random Variable Transformation

Authors: Maria Bazotte, Margarida Carvalho, Thibaut Vidal

Abstract: Real-world decision-making problems involve decision-dependent uncertainty, where the probability distribution of the random vector depends on the model decisions. However, few studies focus on two-stage stochastic programs with this type of endogenous uncertainty, and those that do lack general methodologies. We thus propose a general method for solving a class of these programs based on random variable transformation, a technique widely employed in probability and statistics. The random variable transformation converts a stochastic program with endogenous uncertainty (original program) into an equivalent stochastic program with decision-independent uncertainty (transformed program), for which solution procedures are well-studied. Moreover, endogenous uncertainty usually leads to nonlinear nonconvex programs, which are theoretically intractable. Nonetheless, we show that, for some classical endogenous distributions, the proposed method yields mixed-integer linear or convex programs with exogenous uncertainty. We validate this method by applying it to a network design and facility-protection problem, considering distinct decision-dependent distributions for the random variables. Whereas the original formulation of this problem is nonlinear non-convex for most endogenous distributions, the proposed method transforms it into mixed-integer linear programs with exogenous uncertainty. Particularly, we solve these obtained programs with the sample average approximation (SAA) method. Finally, the transformed program outperforms the case in which a mixed-integer linear formulation of the original program exists.

Submitted 1 July, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

arXiv:2402.12490 [pdf, other]

Towards Cross-Domain Continual Learning

Authors: Marcus de Carvalho, Mahardhika Pratama, Jie Zhang, Chua Haoyan, Edward Yapp

Abstract: Continual learning is a process that involves training learning agents to sequentially master a stream of tasks or classes without revisiting past data. The challenge lies in leveraging previously acquired knowledge to learn new tasks efficiently, while avoiding catastrophic forgetting. Existing methods primarily focus on single domains, restricting their applicability to specific problems. In this work, we introduce a novel approach called Cross-Domain Continual Learning (CDCL) that addresses the limitation of being restricted to single supervised domains. Our method combines inter- and intra-task cross-attention mechanisms within a compact convolutional network. This integration enables the model to maintain alignment with features from previous tasks, thereby delaying the data drift that may occur between tasks, while performing unsupervised cross-domain adaptation (UDA) between related domains. By leveraging an intra-task-specific pseudo-labeling method, we ensure accurate input pairs for both labeled and unlabeled samples, enhancing the learning process. To validate our approach, we conduct extensive experiments on public UDA datasets, showcasing its positive performance on cross-domain continual learning challenges. Additionally, our work introduces incremental ideas that contribute to the advancement of this field. We make our code and models available to encourage further exploration and reproduction of our results: \url{https://github.com/Ivsucram/CDCL}

Submitted 19 February, 2024; originally announced February 2024.

Comments: 12 pages, 2 Figures, 4 Tables. To be published at the IEEE International Conference on Data Engineering (ICDE) 2024

arXiv:2402.12463 [pdf, other]

Quantum phase transitions in one-dimensional nanostructures: a comparison between DFT and DMRG methodologies

Authors: T. Pauletti, M. Sanino, L. Gimenes, I. M. Carvalho, V. V. França

Abstract: In the realm of quantum chemistry, the accurate prediction of electronic structure and properties of nanostructures remains a formidable challenge. Density Functional Theory (DFT) and Density Matrix Renormalization Group (DMRG) have emerged as two powerful computational methods for addressing electronic correlation effects in diverse molecular systems. We compare ground-state energies ($e_0$), density profiles ($n$) and average entanglement entropies ($\bar S$) in metals, insulators and at the transition from metal to insulator, in homogeneous chains, superlattices and harmonically confined chains described by the fermionic one-dimensional Hubbard model. For the homogeneous systems there is a clear hierarchy between the deviations, $D\%(\bar S)<D\%(e_0)< \bar D\%(n)$, and all the deviations decrease with the chain size. For superlattices and harmonic confinement, the relation among the deviations is less trivial and strongly dependent on the superlattice structure and the confinement strength considered. For the superlattices, increasing the number of impurities in the unit cell in general reduces the precision of the DFT calculations. For the confined chains, DFT performs better for metallic phases, while the highest deviations appear for the Mott and band-insulator phases. This work provides a comprehensive comparative analysis of these methodologies, shedding light on their respective strengths, limitations, and applications.

Submitted 19 February, 2024; originally announced February 2024.

arXiv:2402.11657 [pdf, other]

On the importance of assessing topological convergence in Bayesian phylogenetic inference

Authors: Marius Brusselmans, Luiz Max Carvalho, Samuel L. Hong, Jiansi Gao, Frederick A. Matsen IV, Andrew Rambaut, Philippe Lemey, Marc A. Suchard, Gytis Dudas, Guy Baele

Abstract: Modern phylogenetics research is often performed within a Bayesian framework, using sampling algorithms such as Markov chain Monte Carlo (MCMC) to approximate the posterior distribution. These algorithms require careful evaluation of the quality of the generated samples. Within the field of phylogenetics, one frequently adopted diagnostic approach is to evaluate the effective sample size (ESS) and to investigate trace graphs of the sampled parameters. A major limitation of these approaches is that they are developed for continuous parameters and therefore incompatible with a crucial parameter in these inferences: the tree topology. Several recent advancements have aimed at extending these diagnostics to topological space. In this short reflection paper, we present a case study illustrating how these topological diagnostics can contain information not found in standard diagnostics, and how decisions regarding which of these diagnostics to compute can impact inferences regarding MCMC convergence and mixing. Given the major importance of detecting convergence and mixing issues in Bayesian phylogenetic analyses, the lack of a unified approach to this problem warrants further action, especially now that additional tools are becoming available to researchers.

Submitted 18 February, 2024; originally announced February 2024.

arXiv:2402.04250 [pdf, other]

Computing Approximate Nash Equilibria for Integer Programming Games

Authors: Aloïs Duguet, Margarida Carvalho, Gabriele Dragotto, Sandra Ulrich Ngueveu

Abstract: We propose a framework to compute approximate Nash equilibria in integer programming games with nonlinear payoffs, i.e., simultaneous and non-cooperative games where each player solves a parametrized mixed-integer nonlinear program. We prove that using absolute approximations of the players' objective functions and then computing its Nash equilibria is equivalent to computing approximate Nash equilibria where the approximation factor is doubled. In practice, we propose an algorithm to approximate the players' objective functions via piecewise linear approximations. Our numerical experiments on a cybersecurity investment game show the computational effectiveness of our approach.

Submitted 6 February, 2024; originally announced February 2024.

arXiv:2401.06538 [pdf, other]

doi 10.5281/zenodo.10479965

Intelligent Data-Driven Architectural Features Orchestration for Network Slicing

Authors: Rodrigo Moreira, Flavio de Oliveira Silva, Tereza Cristina Melo de Brito Carvalho, Joberto S. B. Martins

Abstract: Network slicing is a crucial enabler and a trend for the Next Generation Mobile Network (NGMN) and various other new systems like the Internet of Vehicles (IoV) and Industrial IoT (IIoT). Orchestration and machine learning are key elements with a crucial role in the network-slicing processes since the NS process needs to orchestrate resources and functionalities, and machine learning can potentially optimize the orchestration process. However, existing network-slicing architectures lack the ability to define intelligent approaches to orchestrate features and resources in the slicing process. This paper discusses machine learning-based orchestration of features and capabilities in network slicing architectures. Initially, the slice resource orchestration and allocation in the slicing planning, configuration, commissioning, and operation phases are analyzed. In sequence, we highlight the need for optimized architectural feature orchestration and recommend using ML-embed agents, federated learning intrinsic mechanisms for knowledge acquisition, and a data-driven approach embedded in the network slicing architecture. We further develop an architectural features orchestration case embedded in the SFI2 network slicing architecture. An attack prevention security mechanism is developed for the SFI2 architecture using distributed embedded and cooperating ML agents. The case presented illustrates the architectural feature's orchestration process and benefits, highlighting its importance for the network slicing process.

Submitted 12 January, 2024; originally announced January 2024.

Comments: 12 pages, 6 figures, Conference ADVANCE 24 - International Workshop on ADVANCEs in ICT Infrastructures and Services - February 26--29, 2024 - Hanoi, Vietnam

ACM Class: I.2.11; C.2.1; I.2.1

arXiv:2312.15747 [pdf, other]

A Comparison of Image and Scalar-Based Approaches in Preconditioner Selection

Authors: Michael Souza, Luiz M. Carvalho, Douglas Augusto, Jairo Panetta, Paulo Goldfeld, José R. P. Rodrigues

Abstract: Within high-performance computing (HPC), solving large sparse linear systems efficiently remains paramount, with iterative methods being the predominant choice. However, the performance of these methods is tightly coupled to the aptness of the chosen preconditioner. The multifaceted nature of sparse matrices makes the universal prescription of preconditioners elusive. Notably, the key attribute of sparsity is not precisely captured by scalar metrics such as bandwidth or matrix dimensions. Advancing prior methodologies, this research introduces matrix sparsity depiction via RGB images. Utilizing a convolutional neural network (CNN), the task of preconditioner selection turns into a multi-class classification problem. Extensive tests on 126 SuiteSparse matrices emphasize the enhanced prowess of the CNN model, noting a 32% boost in accuracy and a 25% reduction in computational slowdown.

Submitted 25 December, 2023; originally announced December 2023.

Comments: 23 pages, 8 figures, 9 tables

MSC Class: 65F08; 65F10; 68T20

arXiv:2312.05980 [pdf, other]

Maximum flow-based formulation for the optimal location of electric vehicle charging stations

Authors: Pierre-Luc Parent, Margarida Carvalho, Miguel F. Anjos, Ribal Atallah

Abstract: With the increasing effects of climate change, the urgency to step away from fossil fuels is greater than ever before. Electric vehicles (EVs) are one way to diminish these effects, but their widespread adoption is often limited by the insufficient availability of charging stations. In this work, our goal is to expand the infrastructure of EV charging stations, in order to provide a better quality of service in terms of user satisfaction (and availability of charging stations). Specifically, our focus is directed towards urban areas. We first propose a model for the assignment of EV charging demand to stations, framing it as a maximum flow problem. This model is the basis for the evaluation of user satisfaction with a given charging infrastructure. Secondly, we incorporate the maximum flow model into a mixed-integer linear program, where decisions on the opening of new stations and on the expansion of their capacity through additional outlets are accounted for. We showcase our methodology for the city of Montreal, demonstrating the scalability of our approach to handle real-world scenarios. We conclude that considering both spatial and temporal variations in charging demand is meaningful when solving realistic instances.

Submitted 10 December, 2023; originally announced December 2023.

arXiv:2312.02306 [pdf, other]

Pulse vaccination in a modified SIR model: global dynamics, bifurcations and seasonality

Authors: João P. S. Maurício de Carvalho, Alexandre A. Rodrigues

Abstract: We analyze a periodically-forced dynamical system inspired by the SIR model with pulse vaccination. We fully characterize its dynamics according to the proportion $p$ of vaccinated individuals and the time $T$ between doses. If the basic reproduction number is less than 1 (i.e. $\mathcal{R}_p<1$), then we obtain precise conditions for the existence and global stability of a disease-free $T$-periodic solution. Otherwise, if $\mathcal{R}_p>1$, then a globally stable $T$-periodic solution emerges with positive coordinates. We draw a bifurcation diagram $(T,p)$ and we describe the associated bifurcations. We also find analytically and numerically chaotic dynamics by adding seasonality to the disease transmission rate. In a realistic context, low vaccination coverage and intense seasonality may result in unpredictable dynamics. Previous experiments have suggested chaos in periodically-forced biological impulsive models, but no analytic proof has been given.

Submitted 4 December, 2023; originally announced December 2023.

arXiv:2309.17140 [pdf, other]

A Snapshot of the Mental Health of Software Professionals

Authors: Eduardo Santana de Almeida, Ingrid Oliveira de Nunes, Raphael Pereira de Oliveira, Michelle Larissa Luciano Carvalho, Andre Russowsky Brunoni, Shiyue Rong, Iftekhar Ahmed

Abstract: Mental health disorders affect a large number of people, leading to many lives being lost every year. These disorders affect struggling individuals and businesses whose productivity decreases due to days of lost work or lower employee performance. Recent studies provide alarming numbers of individuals who suffer from mental health disorders, e.g., depression and anxiety, in particular contexts, such as academia. In the context of the software industry, there are limited studies that aim to understand the presence of mental health disorders and the characteristics of jobs in this context that can be triggers for the deterioration of the mental health of software professionals. In this paper, we present the results of a survey with 500 software professionals. We investigate different aspects of their mental health and the characteristics of their work to identify possible triggers of mental health deterioration. Our results provide the first evidence that mental health is a critical issue to be addressed in the software industry, and point to changes that can be made in this context to improve the mental health of software professionals.

Submitted 29 September, 2023; originally announced September 2023.

Comments: 12 pages, 3 figures

arXiv:2309.13421 [pdf, other]

Penalties and Rewards for Fair Learning in Paired Kidney Exchange Programs

Authors: Margarida Carvalho, Alison Caulfield, Yi Lin, Adrian Vetta

Abstract: A kidney exchange program, also called a kidney paired donation program, can be viewed as a repeated, dynamic trading and allocation mechanism. This suggests that a dynamic algorithm for transplant exchange selection may have superior performance in comparison to the repeated use of a static algorithm. We confirm this hypothesis using a full scale simulation of the Canadian Kidney Paired Donation Program: learning algorithms, that attempt to learn optimal patient-donor weights in advance via dynamic simulations, do lead to improved outcomes. Specifically, our learning algorithms, designed with the objective of fairness (that is, equity in terms of transplant accessibility across cPRA groups), also lead to an increased number of transplants and shorter average waiting times. Indeed, our highest performing learning algorithm improves egalitarian fairness by 10% whilst also increasing the number of transplants by 6% and decreasing waiting times by 24%. However, our main result is much more surprising. We find that the most critical factor in determining the performance of a kidney exchange program is not the judicious assignment of positive weights (rewards) to patient-donor pairs. Rather, the key factor in increasing the number of transplants, decreasing waiting times and improving group fairness is the judicious assignment of a negative weight (penalty) to the small number of non-directed donors in the kidney exchange program.

Submitted 23 September, 2023; originally announced September 2023.

Comments: Shorter version accepted in WINE 2023

arXiv:2309.13026 [pdf, other]

doi 10.1142/S0129183124500827

Directed propaganda in the majority-rule model

Authors: Fabricio L. Forgerini, Nuno Crokidakis, Marcio A. V. Carvalho

Abstract: Advertisement and propaganda have changed continuously in the past decades, mainly due to people's interactions on online platforms and social networks, and nowadays operate by reaching a highly specific online audience instead of targeting the masses. The impact of this new media effect, oriented directly at a specific audience, is investigated in this study, in which we focus on the opinion evolution of agents in the majority-rule model, considering the presence of directed propaganda. We introduce $p$ as the probability of a "positive" external propaganda and $q$ as the probability that the agents follow the external propaganda. Our results show that the usual majority-rule model stationary state is reached, with a full consensus, only for two cases, namely when the external propaganda is absent or when the media favors only one of the two opinions. However, even for a small influence of external propaganda, the final state is reached with a majority opinion dominating the population. For the case in which the propaganda influence is strong enough among the agents, we show that the consensus cannot be reached at all, and we observe the polarization of opinions. In addition, we show through analytical and numerical results that the system undergoes an order-disorder phase transition that occurs at $q_c = 1/3$ for the case $p = 0.5$.

Submitted 15 November, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

Comments: Accepted for publication in International Journal of Modern Physics C

Journal ref: Int. J. Mod. Phys. C 35, 2450082 (2024)

arXiv:2309.00702 [pdf, other]

Accelerated Benders Decomposition and Local Branching for Dynamic Maximum Covering Location Problems

Authors: Steven Lamontagne, Margarida Carvalho, Ribal Atallah

Abstract: The maximum covering location problem (MCLP) is a key problem in facility location, with many applications and variants. One such variant is the dynamic (or multi-period) MCLP, which considers the installation of facilities across multiple time periods. To the best of our knowledge, no exact solution method has been proposed to tackle large-scale instances of this problem. To that end, in this work, we expand upon the current state-of-the-art branch-and-Benders-cut solution method in the static case, by exploring several acceleration techniques. Additionally, we propose a specialised local branching scheme, that uses a novel distance metric in its definition of subproblems and features a new method for efficient and exact solving of the subproblems. These methods are then compared through extensive computational experiments, highlighting the strengths of the proposed methodologies.

Submitted 25 October, 2023; v1 submitted 1 September, 2023; originally announced September 2023.

Comments: V2: Minor corrections for references and invalid URL

arXiv:2308.15663 [pdf, other]

Adaptive Attack Detection in Text Classification: Leveraging Space Exploration Features for Text Sentiment Classification

Authors: Atefeh Mahdavi, Neda Keivandarian, Marco Carvalho

Abstract: Adversarial example detection plays a vital role in adaptive cyber defense, especially in the face of rapidly evolving attacks. In adaptive cyber defense, the nature and characteristics of attacks continuously change, making it crucial to have robust mechanisms in place to detect and counter these threats effectively. By incorporating adversarial example detection techniques, adaptive cyber defense systems can enhance their ability to identify and mitigate attacks that attempt to exploit vulnerabilities in machine learning models or other systems. Adversarial examples are inputs that are crafted by applying intentional perturbations to natural inputs that result in incorrect classification. In this paper, we propose a novel approach that leverages the power of BERT (Bidirectional Encoder Representations from Transformers) and introduces the concept of Space Exploration Features. We utilize the feature vectors obtained from the BERT model's output to capture a new representation of feature space to improve the density estimation method.

Submitted 29 August, 2023; originally announced August 2023.

Comments: Presented at 2nd International Workshop on Adaptive Cyber Defense, 2023 (arXiv:2308.09520)

Report number: ACD/2023/108

arXiv:2308.09520

Proceedings of the 2nd International Workshop on Adaptive Cyber Defense

Authors: Marco Carvalho, Damian Marriott, Mark Bilinski, Ahmad Ridley

Abstract: The 2nd International Workshop on Adaptive Cyber Defense was held at the Florida Institute of Technology, Florida. This workshop was organized to share research that explores unique applications of Artificial Intelligence (AI) and Machine Learning (ML) as foundational capabilities for the pursuit of adaptive cyber defense. The cyber domain cannot currently be reliably and effectively defended without extensive reliance on human experts. Skilled cyber defenders are in short supply and often cannot respond fast enough to cyber threats. Building on recent advances in AI and ML, the cyber defense research community has been motivated to develop new dynamic and sustainable defenses through the adoption of AI and ML techniques in cyber settings. Bridging critical gaps between AI and cyber researchers and practitioners can accelerate efforts to create semi-autonomous cyber defenses that can learn to recognize and respond to cyber attacks or discover and mitigate weaknesses in cooperation with other cyber operation systems and human experts. Furthermore, these defenses are expected to be adaptive and able to evolve over time to thwart changes in attacker behavior, changes in the system health and readiness, and natural shifts in user behavior over time. The workshop comprised invited keynote talks, technical presentations and a panel discussion about how AI/ML can enable autonomous mitigation of current and future cyber attacks.
Workshop submissions were peer reviewed by a panel of domain experts, with the proceedings consisting of six technical articles exploring challenging problems of critical importance to national and global security. Participation in this workshop offered new opportunities to stimulate research and innovation in the emerging domain of adaptive and autonomous cyber defense.

Submitted 2 July, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

arXiv:2307.15060 [pdf, other]

News from the Swampland -- Constraining string theory with astrophysics and cosmology

Authors: Nils Schöneberg, Léo Vacher, J. D. F. Dias, Martim M. C. D. Carvalho, C. J. A. P. Martins

Abstract: Our current best guess for a unified theory of gravitation and quantum field theory (string theory) generically predicts a set of requirements for a consistently quantized theory, the Swampland criteria. Refined versions of these criteria have recently been shown to be in mild tension with cosmological observations. We summarize the status of the current impact of and constraints on the Swampland conjectures from cosmology, and subject a variety of dark energy quintessence models to recently released cosmological datasets. We find that instead of tightening the tension, the new data allows for slightly more freedom in the Swampland criteria. We further demonstrate that if there is no theoretical argument made to prevent interactions of the moduli fields with the electromagnetic sector, a novel fine-tuning argument arises from the extremely tight current constraints on such interactions. Finally, we conclude with a cautionary tale on model-independent reconstructions of the Swampland criteria from expansion rate data.

Submitted 8 August, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

Comments: 35 pages, 20 figures, 4 tables. All comments are welcome! [v2: Added citations, corrected eq. 2.6]

arXiv:2307.02562 [pdf, ps, other]

On the completely irregular set for systems with the shadowing property

Authors: Maria Carvalho, Vinícius Coelho, Luciana Salgado

Abstract: We prove that the completely irregular set is Baire generic for every non-uniquely ergodic transitive continuous map which satisfies the shadowing property and acts on a compact metric space without isolated points. We also show that, under the previous assumptions, the orbit of any completely irregular point is dense. Afterwards, we analyze the connection between transitivity and the shadowing property, draw a few consequences of their joint action within the family of expansive homeomorphisms, and discuss several examples to test the scope of our results.

Submitted 18 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

Comments: 32 pages, no figures, Changes in Abstract and distribution of sections

MSC Class: 37A30; 37C10; 37C40; 37D20

arXiv:2306.02817 [pdf, other]

Integer Programming Games: A Gentle Computational Overview

Authors: Margarida Carvalho, Gabriele Dragotto, Andrea Lodi, Sriram Sankaranarayanan

Abstract: In this tutorial, we present a computational overview on computing Nash equilibria in Integer Programming Games (IPGs), i.e., how to compute solutions for a class of non-cooperative and nonconvex games where each player solves a mixed-integer optimization problem. IPGs are a broad class of games extending the modeling power of mixed-integer optimization to multi-agent settings. This class of games includes, for instance, any finite game and any multi-agent extension of traditional combinatorial optimization problems. After providing some background motivation and context of applications, we systematically review and classify the state-of-the-art algorithms to compute Nash equilibria. We propose an essential taxonomy of the algorithmic ingredients needed to compute equilibria, and we describe the theoretical and practical challenges associated with equilibria computation. Finally, we quantitatively and qualitatively compare a sequential Stackelberg game with a simultaneous IPG to highlight the different properties of their solutions.

Submitted 12 June, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

Comments: To appear in INFORMS TutORials in Operations Research 2023

arXiv:2305.07334 [pdf, other]

Locking and Quacking: Stacking Bayesian model predictions by log-pooling and superposition

Authors: Yuling Yao, Luiz Max Carvalho, Diego Mesquita, Yann McLatchie

Abstract: Combining predictions from different models is a central problem in Bayesian inference and machine learning more broadly. Currently, these predictive distributions are almost exclusively combined using linear mixtures such as Bayesian model averaging, Bayesian stacking, and mixture of experts. Such linear mixtures impose idiosyncrasies that might be undesirable for some applications, such as multi-modality. While there exist alternative strategies (e.g. geometric bridge or superposition), optimising their parameters usually involves computing an intractable normalising constant repeatedly. We present two novel Bayesian model combination tools. These are generalisations of model stacking, but combine posterior densities by log-linear pooling (locking) and quantum superposition (quacking). To optimise model weights while avoiding the burden of normalising constants, we investigate the Hyvarinen score of the combined posterior predictions. We demonstrate locking with an illustrative example and discuss its practical application with importance sampling.

Submitted 12 May, 2023; originally announced May 2023.

Comments: An earlier version appeared at the NeurIPS 2022 Workshop on Score-Based Methods
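
The contrast drawn in the abstract above between linear mixtures and log-linear pooling can be illustrated in a special case. The sketch below is not the authors' method: it assumes univariate Gaussian predictive densities, for which the pooled density proportional to the product of the components raised to their weights is again Gaussian, so no intractable normalising constant arises. The function names `log_pool_gaussians` and `linear_mixture_pdf` are hypothetical, and the weights are fixed by hand rather than optimised with a scoring rule.

```python
import math


def gaussian_pdf(x, mean, sd):
    """Density of a univariate Gaussian N(mean, sd^2) at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def log_pool_gaussians(means, sds, weights):
    """Log-linear pool of Gaussians: density proportional to prod_k N(x; m_k, s_k^2)^(w_k).

    Illustrative closed form: the pooled precision is the weighted sum of the
    component precisions, and the pooled mean is the precision-weighted mean.
    Returns (mean, sd) of the pooled Gaussian.
    """
    if not (len(means) == len(sds) == len(weights)):
        raise ValueError("means, sds and weights must have the same length")
    precision = sum(w / s**2 for w, s in zip(weights, sds))
    if precision <= 0:
        raise ValueError("pooled precision must be positive")
    mean = sum(w * m / s**2 for w, m, s in zip(weights, means, sds)) / precision
    return mean, math.sqrt(1.0 / precision)


def linear_mixture_pdf(x, means, sds, weights):
    """Linear mixture sum_k w_k N(x; m_k, s_k^2), shown for comparison."""
    return sum(w * gaussian_pdf(x, m, s) for w, m, s in zip(weights, means, sds))


if __name__ == "__main__":
    means, sds, weights = [0.0, 4.0], [1.0, 1.0], [0.5, 0.5]
    print(log_pool_gaussians(means, sds, weights))  # (2.0, 1.0)
    # The linear mixture keeps two modes near 0 and 4, with a dip at 2.
    print(linear_mixture_pdf(0.0, means, sds, weights) > linear_mixture_pdf(2.0, means, sds, weights))
```

With equal weights on two well-separated components, the log-linear pool is a single Gaussian centred between them, whereas the linear mixture stays bimodal; this is the kind of difference in behaviour the abstract alludes to.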

arXiv:2304.06097 [pdf, other]

doi 10.1364/OL.504202

Quartic solitons of a mode-locked laser distributed model

Authors: D. Malheiro, M. Facão, M. I. Carvalho

Abstract: Dissipative quartic solitons have gained interest in the field of mode-locked lasers for their energy-width scaling, which allows the generation of ultrashort pulses with high energies. Pursuing the characterization of such pulses, here we found soliton solutions of a distributed model for mode-locked lasers in the presence of either positive or negative fourth order dispersion (4OD). We studied the impact the laser parameters may have on the profiles, range of existence and energy-width relation of the output pulses. The most energetic and narrowest solutions occur for negative 4OD, with the energy having an inverse cubic dependence on the width in most cases. Our simulations showed that the spectral filtering has the biggest contribution in the generation of short (widths as low as 39 fs) and very energetic (392 nJ) optical pulses.

Submitted 31 August, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

arXiv:2304.04813 [pdf, ps, other]

Asymptotic behavior of Musielak-Orlicz-Sobolev modulars

Authors: J. C. de Albuquerque, L. R. S. de Assis, M. L. M. Carvalho, A. Salort

Abstract: In this article we study the asymptotic behavior of anisotropic nonlocal nonstandard growth seminorms and modulars as the fractional parameter goes to 1. This gives a so-called Bourgain-Brezis-Mironescu type formula for a very general family of functionals. In the particular case of fractional Sobolev spaces with variable exponent, we point out that our proof asks for a weaker regularity of the exponent than that considered in previous articles.

Submitted 13 April, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

arXiv:2303.15292 [pdf]

A comprehensive thermodynamic model for temperature change in i-caloric effects

Authors: A. M. G. Carvalho, W. Imamura

Abstract: Solid-state cooling based on i-caloric effects may be an alternative to conventional vapor-compression refrigeration systems. The adiabatic temperature change ($ΔT_{S}$) is one of the parameters that characterize the i-caloric effects; therefore, it is important to obtain the correct $ΔT_{S}$ values and, whenever possible, to correlate this parameter with thermodynamic and microscopic quantities. In this work, we propose a comprehensive thermodynamic model that allows us to determine the adiabatic temperature change from non-adiabatic measurements of temperature change induced by a field change. Our model efficiently fits temperature versus time data and temperature change versus the inverse of the field change rate data for three different materials presenting different i-caloric effects. The results indicate the present model is a very useful and robust tool to obtain the correct $ΔT_{S}$ values and to correlate $ΔT_{S}$ with other thermodynamic quantities.

Submitted 5 March, 2023; originally announced March 2023.

Comments: 13 pages, 3 figures, 1 table

arXiv:2303.07202 [pdf, other]

Optimization of the location and design of urban green spaces

Authors: Caroline Leboeuf, Margarida Carvalho, Yan Kestens, Benoît Thierry

Abstract: The recent promotion of sustainable urban planning, combined with a growing need for public interventions to improve well-being and health, has led to an increased collective interest in green spaces in and around cities. In particular, parks have proven to offer a wide range of benefits in urban areas. This also means inequities in park accessibility may contribute to health inequities. In this work, we showcase the application of classic tools from Operations Research to assist decision-makers in improving parks' accessibility, distribution and design. Given the context of public decision-making, we are particularly concerned with equity and environmental justice, and are focused on an advanced assessment of users' behavior through a spatial interaction model. We present a two-stage fair facility location and design model, which serves as a template model to assist public decision-makers at the city level in the planning of urban green spaces. The first stage of the optimization model concerns the optimal city-budget allocation to neighborhoods based on data exposing inequality attributes. The second stage seeks the optimal location and design of parks for each neighborhood, and the objective consists of maximizing the total expected probability of individuals visiting parks. We show how to reformulate the latter as a mixed-integer linear program. We further introduce a clustering method to reduce the size of the problem and determine a close-to-optimal solution within reasonable time. The model is tested using the case study of the city of Montreal, and comparative results are discussed in detail to justify the performance of the model.

Submitted 13 March, 2023; originally announced March 2023.

arXiv:2303.01271 [pdf, other]

Bivariate beta distribution: parameter inference and diagnostics

Authors: Lucas Machado Moschen, Luiz Max Carvalho

Abstract: Correlated proportions appear in many real-world applications and present a unique challenge in terms of finding an appropriate probabilistic model due to their constrained nature. The bivariate beta is a natural extension of the well-known beta distribution to the space of correlated quantities on $[0, 1]^2$. Its construction is not unique, however. Over the years, many bivariate beta distributions have been proposed, ranging from three to eight or more parameters, and for which the joint density and distribution moments vary in terms of mathematical tractability. In this paper, we investigate the construction proposed by Olkin & Trikalinos (2015), which strikes a balance between parameter-richness and tractability. We provide classical (frequentist) and Bayesian approaches to estimation in the form of method-of-moments and latent variable/data augmentation coupled with Hamiltonian Monte Carlo, respectively. The elicitation of bivariate beta as a prior distribution is also discussed. The development of diagnostics for checking model fit and adequacy is explored in depth with the aid of Monte Carlo experiments under both well-specified and misspecified data-generating settings. Keywords: Bayesian estimation; bivariate beta; correlated proportions; diagnostics; method of moments.

Submitted 2 March, 2023; originally announced March 2023.

Comments: 32 pages, 23 figures

MSC Class: 62H12 (Primary) 62F15 (Secondary)

arXiv:2302.14230 [pdf, other]

Optimal Priors for the Discounting Parameter of the Normalized Power Prior

Authors: Yueqi Shen, Luiz M. Carvalho, Matthew A. Psioda, Joseph G. Ibrahim

Abstract: The power prior is a popular class of informative priors for incorporating information from historical data. It involves raising the likelihood for the historical data to a power, which acts as a discounting parameter. When the discounting parameter is modelled as random, the normalized power prior is recommended. In this work, we prove that the marginal posterior for the discounting parameter for generalized linear models converges to a point mass at zero if there is any discrepancy between the historical and current data, and that it does not converge to a point mass at one when they are fully compatible. In addition, we explore the construction of optimal priors for the discounting parameter in a normalized power prior. In particular, we are interested in achieving the dual objectives of encouraging borrowing when the historical and current data are compatible and limiting borrowing when they are in conflict. We propose intuitive procedures for eliciting the shape parameters of a beta prior for the discounting parameter based on two minimization criteria, the Kullback-Leibler divergence and the mean squared error. Based on the proposed criteria, the optimal priors derived are often quite different from commonly used priors such as the uniform prior.

Submitted 8 April, 2024; v1 submitted 27 February, 2023; originally announced February 2023.

arXiv:2302.11219 [pdf, other]

doi 10.1088/2057-1976/acba9f

Deformable registration with intensity correction for CESM monitoring response to Neoadjuvant Chemotherapy

Authors: Clément Jailin, Pablo Milioni De Carvalho, Sara Mohamed, Laurence Vancamberg, Amr Farouk Ibrahim Moustafa, Mohammed Gomaa, Rasha Mohammed Kamal, Serge Muller

Abstract: This paper proposes a robust longitudinal registration method for Contrast Enhanced Spectral Mammography (CESM) in monitoring neoadjuvant chemotherapy (NAC). Because breast texture intensity changes with the treatment, a non-rigid registration procedure with local intensity compensations is developed. The approach allows registering the low energy images of the exams acquired before and after the chemotherapy. The measured motion is then applied to the corresponding recombined images. The difference of registered images, called the residual, removes the breast texture that did not change between the two exams. Consequently, this registered residual allows identifying local density and iodine changes, especially in the lesion area. The method is validated with a synthetic NAC case where ground truths are available. Then the procedure is applied to 51 patients with 208 CESM image pairs acquired before and after the chemotherapy treatment. The proposed registration converged in all 208 cases. The intensity-compensated registration approach is evaluated with different mathematical metrics and through the repositioning of clinical landmarks (RMSE: 5.9 mm) and outperforms state-of-the-art registration techniques.

Submitted 22 February, 2023; originally announced February 2023.

Journal ref: Biomedical Physics & Engineering Express (2023)

arXiv:2302.07005 [pdf]

doi 10.1038/s41592-023-01987-9

Community-developed checklists for publishing images and image analysis

Authors: Christopher Schmied, Michael Nelson, Sergiy Avilov, Gert-Jan Bakker, Cristina Bertocchi, Johanna Bischof, Ulrike Boehm, Jan Brocher, Mariana Carvalho, Catalin Chiritescu, Jana Christopher, Beth Cimini, Eduardo Conde-Sousa, Michael Ebner, Rupert Ecker, Kevin Eliceiri, Julia Fernandez-Rodriguez, Nathalie Gaudreault, Laurent Gelman, David Grunwald, Tingting Gu, Nadia Halidi, Mathias Hammer, Matthew Hartley, Marie Held , et al. (29 additional authors not shown)

Abstract: Images document scientific discoveries and are prevalent in modern biomedical research. Microscopy imaging in particular is currently undergoing rapid technological advancements. However, for scientists wishing to publish the obtained images and image analysis results, there are to date no unified guidelines. Consequently, microscopy images and image data in publications may be unclear or difficult to interpret. Here we present community-developed checklists for preparing light microscopy images and image analysis for publications. These checklists offer authors, readers, and publishers key recommendations for image formatting and annotation, color selection, data availability, and for reporting image analysis workflows. The goal of our guidelines is to increase the clarity and reproducibility of image figures and thereby heighten the quality of microscopy data in publications.

Submitted 14 September, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

Comments: 28 pages, 8 Figures, 3 Supplementary Figures, Manuscript, Essential recommendations for publication of microscopy image data

arXiv:2301.04601 [pdf, ps, other]

On Fractional Musielak-Sobolev spaces and applications to nonlocal problems

Authors: J. C. de Albuquerque, L. R. S. de Assis, M. L. M. Carvalho, A. Salort

Abstract: In this work, we establish some abstract results from the perspective of the fractional Musielak-Sobolev spaces, such as: uniform convexity, Radon-Riesz property with respect to the modular function, $(S_{+})$-property, Brezis-Lieb type Lemma for the modular function and monotonicity results. Moreover, we apply the theory developed to study the existence of solutions to the following class of nonlocal problems \begin{equation*} \left\{ \begin{array}{ll} (-Δ)_{Φ_{x,y}}^s u = f(x,u), & \mbox{in } Ω, \\ u = 0, & \mbox{on } \mathbb{R}^N\setminus Ω, \end{array} \right. \end{equation*} where $N\geq 2$, $Ω\subset \mathbb{R}^N$ is a bounded domain with Lipschitz boundary $\partial Ω$ and $f:Ω\times \mathbb{R} \rightarrow \mathbb{R}$ is a Carathéodory function not necessarily satisfying the Ambrosetti-Rabinowitz condition. This class of problems covers many particular operators, for instance, the fractional operator with variable exponent, double-phase and double-phase with variable exponent operators, and the anisotropic fractional $p$-Laplacian, among others.

Submitted 11 January, 2023; originally announced January 2023.

MSC Class: 46E30; 35R11; 47G20

arXiv:2212.10673 [pdf, ps, other]

doi 10.1007/s10107-023-02043-2

Asymmetry in the Complexity of the Multi-Commodity Network Pricing Problem

Authors: Quang Minh Bui, Margarida Carvalho, José Neto

Abstract: The network pricing problem (NPP) is a bilevel problem, where the leader optimizes its revenue by deciding on the prices of certain arcs in a graph, while expecting the followers (also known as the commodities) to choose a shortest path based on those prices. In this paper, we investigate the complexity of the NPP with respect to two parameters: the number of tolled arcs, and the number of commodities. We devise a simple algorithm showing that if the number of tolled arcs is fixed, then the problem can be solved in polynomial time with respect to the number of commodities. In contrast, even if there is only one commodity, once the number of tolled arcs is not fixed, the problem becomes NP-hard. We characterize this asymmetry in the complexity with a novel property named strong bilevel feasibility. Finally, we describe an algorithm to generate valid inequalities for the NPP based on this property, accompanied by numerical results to demonstrate its effectiveness in solving the NPP with a high number of commodities.

Submitted 12 January, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

Comments: 37 pages, 12 figures. Mathematical Programming (2024)

MSC Class: 90C46; 90C25; 90C35 ACM Class: G.1.6; G.2.2

arXiv:2212.04009 [pdf, other]

A parallelizable model-based approach for marginal and multivariate clustering

Authors: Miguel de Carvalho, Gabriel Martos Venturini, Andrej Svetlošák

Abstract: This paper develops a clustering method that takes advantage of the sturdiness of model-based clustering, while attempting to mitigate some of its pitfalls. First, we note that standard model-based clustering likely leads to the same number of clusters per margin, which seems a rather artificial assumption for a variety of datasets. We tackle this issue by specifying a finite mixture model per margin that allows each margin to have a different number of clusters, and then cluster the multivariate data using a strategy game-inspired algorithm that we call Reign-and-Conquer. Second, since the proposed clustering approach only specifies a model for the margins (but leaves the joint unspecified), it has the advantage of being partially parallelizable; hence, the proposed approach is computationally appealing as well as more tractable for moderate to high dimensions than a 'full' (joint) model-based clustering approach. A battery of numerical experiments on artificial data indicates an overall good performance of the proposed methods in a variety of scenarios, and real datasets are used to showcase their application in practice.

Submitted 7 December, 2022; originally announced December 2022.

arXiv:2211.16339 [pdf, other]

SIR model with vaccination: bifurcation analysis

Authors: João P. S. Maurício de Carvalho, Alexandre A. Rodrigues

Abstract: There are few adapted SIR models in the literature that combine vaccination and logistic growth. In this article, we study bifurcations of an SIR model where the class of Susceptible individuals grows logistically and is subject to constant vaccination. We explicitly prove that the endemic equilibrium is a codimension two singularity in the parameter space $(\mathcal{R}_0, p)$, where $\mathcal{R}_0$ is the basic reproduction number and $p$ is the proportion of Susceptible individuals successfully vaccinated at birth. We exhibit explicitly the Hopf, transcritical, Belyakov, heteroclinic and saddle-node bifurcation curves unfolding the singularity. The two parameters $(\mathcal{R}_0, p)$ are written in a useful way to evaluate the proportion of vaccinated individuals necessary to eliminate the disease and to conclude how the vaccination may affect the outcome of the epidemic. We also exhibit the region in the parameter space where the disease persists and we illustrate our main result with numerical simulations, emphasizing the role of the parameters.

Submitted 25 April, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

MSC Class: 37G10; 37G15; 34C23; 37D05; 92B05

arXiv:2211.00867 [pdf, other]

Heavy-Tailed Pitman--Yor Mixture Models

Authors: Vianey Palacios Ramirez, Miguel de Carvalho, Luis Gutierrez Inostroza

Abstract: Heavy tails are often found in practice, and yet they are an Achilles heel of a variety of mainstream random probability measures such as the Dirichlet process. The first contribution of this paper focuses on the characterization of the tails of the so-called Pitman--Yor process, which includes the Dirichlet process as a particular case. We show that the right tail of a Pitman--Yor process, known as the stable law process, is heavy-tailed, provided that the centering distribution is itself heavy-tailed. A second contribution of the paper rests on the development of two classes of heavy-tailed mixture models and the assessment of their relative merits. Multivariate extensions of the proposed heavy-tailed mixtures are devised here, along with a predictor-dependent version, so as to learn about the effect of covariates on a multivariate heavy-tailed response. The simulation study suggests that the proposed method performs well in a variety of scenarios, and we showcase the application of the proposed methods in a neuroscience dataset.

Submitted 2 November, 2022; originally announced November 2022.

Comments: 24 pages

arXiv:2210.04808 [pdf, other]

doi 10.1111/itor.13298

A stochastic integer programming approach to reserve staff scheduling with preferences

Authors: Carl Perreault-Lafleur, Margarida Carvalho, Guy Desaulniers

Abstract: Nowadays, reaching a high level of employee satisfaction in efficient schedules is an important and difficult task faced by companies. We tackle a new variant of the personnel scheduling problem under unknown demand by considering employee satisfaction via endogenous uncertainty depending on the combination of their preferred and received schedules. We address this problem in the context of reserve staff scheduling, an unstudied operational problem from the transit industry. To handle the challenges brought by the two uncertainty sources, regular employee and reserve employee absences, we formulate this problem as a two-stage stochastic integer program with mixed-integer recourse. The first-stage decisions consist in finding the days off of the reserve employees. After the unknown regular employee absences are revealed, the second-stage decisions are to schedule the reserve staff duties. We incorporate reserve employees' days-off preferences into the model to examine how employee satisfaction may affect their own absence rates.

Submitted 10 October, 2022; originally announced October 2022.

Comments: 25 pages, 4 figures, International Transactions in Operational Research (2023)

arXiv:2210.03216 [pdf, other]

Beyond the shortest path: the path length index as a distribution

Authors: Leonardo B. L. Santos, Luiz Max Carvalho, Giovanni G. Soares, Leonardo N. Ferreira, Igor M. Sokolov

Abstract: The traditional complex network approach considers only the shortest paths from one node to another, not taking into account several other possible paths. This limitation is significant, for example, in urban mobility studies. In this short report, as a first step, we present an exhaustive approach to address that problem and show that we can go beyond the shortest path, but we do not need to go so far: we present an interactive procedure and an early stop possibility. After presenting some fundamental concepts in graph theory, we present an analytical solution for the problem of counting the number of possible paths between two nodes in complete graphs, and a depth-limited approach to get all possible paths between each pair of nodes in a general graph (an NP-hard problem). We do not collapse the distribution of path lengths between a pair of nodes into a scalar number; we look at the distribution itself, taking all paths up to a pre-defined path length (considering a truncated distribution), and show the impact of that approach on the most straightforward distance-based graph index: the walk/path length.

Submitted 6 October, 2022; originally announced October 2022.
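
The two ingredients named in the abstract above, a closed-form path count for complete graphs and a depth-limited enumeration giving a truncated distribution of path lengths, can be sketched as follows. This is an illustration under stated assumptions, not the authors' code: paths are taken to be simple (no repeated nodes), length is counted in edges, and the names `count_paths_complete` and `path_length_distribution` are hypothetical. The closed form uses the standard counting argument that a simple path between two fixed nodes of the complete graph on n nodes is an ordered choice of k intermediate nodes from the remaining n - 2.

```python
from collections import Counter
from math import perm


def count_paths_complete(n):
    """Number of simple paths between two distinct nodes of the complete graph K_n.

    Sums, over k = 0..n-2 intermediate nodes, the ordered selections
    (n-2)! / (n-2-k)!. Requires n >= 2.
    """
    if n < 2:
        raise ValueError("need at least two nodes")
    return sum(perm(n - 2, k) for k in range(n - 1))


def path_length_distribution(adj, source, target, max_len):
    """Count simple source-target paths by length (in edges), up to max_len.

    adj maps each node to an iterable of neighbours. The search is a
    depth-limited DFS, so the result is a truncated distribution.
    """
    counts = Counter()

    def dfs(node, visited, length):
        if node == target:
            counts[length] += 1
            return
        if length == max_len:
            return  # early stop: do not extend paths past the depth limit
        for nxt in adj[node]:
            if nxt not in visited:
                visited.add(nxt)
                dfs(nxt, visited, length + 1)
                visited.remove(nxt)

    dfs(source, {source}, 0)
    return dict(counts)


if __name__ == "__main__":
    n = 4
    complete = {u: [v for v in range(n) if v != u] for u in range(n)}
    print(count_paths_complete(n))                      # 5
    print(path_length_distribution(complete, 0, 1, 3))  # {1: 1, 2: 2, 3: 2}
    print(path_length_distribution(complete, 0, 1, 2))  # {1: 1, 2: 2}
```

On the complete graph with four nodes, the full enumeration recovers the closed-form total of 5, while lowering `max_len` keeps only the short-path part of the distribution.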

arXiv:2209.06647 [pdf]

Value of Bidirectional V2G Smart Charging Responsive Services: Insights from a Simple CA Model

Authors: Pedro M. S. Carvalho, Luis A. F. M. Ferreira

Abstract: In this paper, particle-hopping cellular automaton (CA) models of elastic demand are used to investigate the value added to plug-in electric vehicle (PEV) aggregators by adopting vehicle-to-grid (V2G) responsive services. CA models used earlier to study load-shifting responses are modified to capture discharge/recharge capabilities of V2G. Results on ramping responses from CA are then analysed to discuss the small contribution to system controllability added by V2G responsive services.

Submitted 14 September, 2022; originally announced September 2022.

Comments: 3 pages, 4 figures

arXiv:2209.05569 [pdf, other]

Uncovering Regions of Maximum Dissimilarity on Random Process Data

Authors: Miguel de Carvalho, Gabriel Martos Venturini

Abstract: The comparison of local characteristics of two random processes can shed light on periods of time or space at which the processes differ the most. This paper proposes a method that learns about regions with a certain volume, where the marginal attributes of two processes are less similar. The proposed methods are devised in full generality for the setting where the data of interest are themselves stochastic processes, and thus the proposed method can be used for pointing out the regions of maximum dissimilarity with a certain volume, in the contexts of functional data, time series, and point processes. The parameter functions underlying both stochastic processes of interest are modeled via a basis representation, and Bayesian inference is conducted via an integrated nested Laplace approximation. The numerical studies validate the proposed methods, and we showcase their application with case studies on criminology, finance, and medicine.

Submitted 12 September, 2022; originally announced September 2022.

ACM Class: G.3

Showing 1–50 of 337 results for author: Carvalho, M