-
The PLATO Mission
Authors:
Heike Rauer,
Conny Aerts,
Juan Cabrera,
Magali Deleuil,
Anders Erikson,
Laurent Gizon,
Mariejo Goupil,
Ana Heras,
Jose Lorenzo-Alvarez,
Filippo Marliani,
Cesar Martin-Garcia,
J. Miguel Mas-Hesse,
Laurence O'Rourke,
Hugh Osborn,
Isabella Pagano,
Giampaolo Piotto,
Don Pollacco,
Roberto Ragazzoni,
Gavin Ramsay,
Stéphane Udry,
Thierry Appourchaux,
Willy Benz,
Alexis Brandeker,
Manuel Güdel,
Eduardo Janot-Pacheco
, et al. (801 additional authors not shown)
Abstract:
PLATO (PLAnetary Transits and Oscillations of stars) is ESA's M3 mission designed to detect and characterise extrasolar planets and perform asteroseismic monitoring of a large number of stars. PLATO will detect small planets (down to <2 R_(Earth)) around bright stars (<11 mag), including terrestrial planets in the habitable zone of solar-like stars. With the complement of radial velocity observati…
▽ More
PLATO (PLAnetary Transits and Oscillations of stars) is ESA's M3 mission designed to detect and characterise extrasolar planets and perform asteroseismic monitoring of a large number of stars. PLATO will detect small planets (down to <2 R_(Earth)) around bright stars (<11 mag), including terrestrial planets in the habitable zone of solar-like stars. With the complement of radial velocity observations from the ground, planets will be characterised for their radius, mass, and age with high accuracy (5 %, 10 %, 10 % for an Earth-Sun combination respectively). PLATO will provide us with a large-scale catalogue of well-characterised small planets up to intermediate orbital periods, relevant for a meaningful comparison to planet formation theories and to better understand planet evolution. It will make possible comparative exoplanetology to place our Solar System planets in a broader context. In parallel, PLATO will study (host) stars using asteroseismology, allowing us to determine the stellar properties with high accuracy, substantially enhancing our knowledge of stellar structure and evolution.
The payload instrument consists of 26 cameras with 12cm aperture each. For at least four years, the mission will perform high-precision photometric measurements. Here we review the science objectives, present PLATO's target samples and fields, provide an overview of expected core science performance as well as a description of the instrument and the mission profile at the beginning of the serial production of the flight cameras. PLATO is scheduled for a launch date end 2026. This overview therefore provides a summary of the mission to the community in preparation of the upcoming operational phases.
△ Less
Submitted 8 June, 2024;
originally announced June 2024.
-
$K^+Λ(1520)$ photoproduction at forward angles near threshold with the BGOOD experiment
Authors:
E. O. Rosanowski,
T. C. Jude,
S. Alef,
A. J. Clara Figueiredo,
R. Di Salvo,
D. Elsner,
A. Fantini,
O. Freyermuth,
F. Frommberger,
F. Ghio,
J. Groß,
K. Kohl,
P. Levi Sandri,
G. Mandaglio,
R. Messi,
D. Moricciani,
P. Pedroni,
B. -E. Reitz,
M. Romaniuk,
G. Scheluchin,
H. Schmieden,
A. Sonnenschein
Abstract:
The differential cross section for $γp\rightarrow K^+Λ(1520)$ was measured from threshold to a centre-of-mass energy of 2090\,MeV at forward angles at the BGOOD experiment. The high statistical precision and resolution in centre-of-mass energy and angle allows a detailed characterisation of this low-momentum transfer kinematic region. The data agree with a previous LEPS measurement and support eff…
▽ More
The differential cross section for $γp\rightarrow K^+Λ(1520)$ was measured from threshold to a centre-of-mass energy of 2090\,MeV at forward angles at the BGOOD experiment. The high statistical precision and resolution in centre-of-mass energy and angle allows a detailed characterisation of this low-momentum transfer kinematic region. The data agree with a previous LEPS measurement and support effective Lagrangian models that indicate that the contact term dominates the cross section near threshold.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
On the Origin of Fast Rotating Stars. I. Photometric calibration and results of AO-assisted BVRI+Halpha imaging of NGC330 with SAMI/SOAR
Authors:
Felipe Navarete,
Pedro Ticiani dos Santos,
Alex Cavaliéri Carciofi,
André Luiz Figueiredo
Abstract:
H$α$ emission is a clear indicator of circumstellar activity in Be stars, historically employed to assess the classical Be star (CBe) population in young open clusters (YOCs). The YOC NGC330 in the Small Magellanic Cloud exhibits a large known fraction of {CBe} stars and was selected for a pilot study to establish a comprehensive methodology for identifying H$α$ emitters in the Magellanic Clouds,…
▽ More
H$α$ emission is a clear indicator of circumstellar activity in Be stars, historically employed to assess the classical Be star (CBe) population in young open clusters (YOCs). The YOC NGC330 in the Small Magellanic Cloud exhibits a large known fraction of {CBe} stars and was selected for a pilot study to establish a comprehensive methodology for identifying H$α$ emitters in the Magellanic Clouds, encompassing the entire B-type spectral range. Using the SOAR Adaptative Module Imager (SAMI), we investigated the stellar population of NGC330 using multi-band BVRI+H$α$ imaging. We identified H$α$ emitters within the entire V-band range covered by SAMI/SOAR observations ($V\lesssim22$), comprising the complete B-type stellar population and offering a unique opportunity to explore the Be phenomenon across all spectral sub-classes. The stellar radial distribution shows a clear bimodal pattern between the most massive (B5 or earlier) and the lower-mass main-sequence objects (later than B6) within the cluster. The former is concentrated towards the cluster center (showing a dispersion of $σ=4.26\pm0.20$ pc), whereas the latter extends across larger radii ($σ=10.83\pm0.65$ pc), indicating mass stratification within NGC330. The total fraction of emitters is $4.4\pm0.5\%$, notably smaller than previous estimates from flux- or seeing-limited observations. However, a higher fraction of H$α$ emitters is observed among higher-mass stars ($32.8\pm3.4\%$) than within lower-mass ($4.4\pm0.9\%$). Consequently, the putative CBe population exhibits distinct dynamical characteristics compared to the bulk of the stellar population in NGC330. These findings highlight the significance of the current observations in providing a complete picture of the CBe population in NGC330.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Coherent $π^0ηd$ photoproduction at forward deuteron angles measured at BGOOD
Authors:
A. J. Clara Figueiredo,
T. C. Jude,
S. Alef,
P. L. Cole,
R. Di Salvo,
D. Elsner,
A. Fantini,
O. Freyermuth,
F. Frommberger,
F. Ghio,
J. Groß,
K. Kohl,
P. Levi Sandri,
G. Mandaglio,
P. Pedroni,
B. -E. Reitz,
M. Romaniuk,
G. Scheluchin,
H. Schmieden,
A. Sonnenschein,
C. Tillmanns
Abstract:
The coherent reaction, $γd \rightarrow π^0ηd$ was studied with the BGOOD experiment at ELSA from threshold to a centre-of-mass energy of 3200\,MeV. A full kinematic reconstruction was made, with final state deuterons identified in the forward spectrometer and $π^0$ and $η$ decays in the central BGO Rugby Ball. The strength of the differential cross section exceeds what can be described by models o…
▽ More
The coherent reaction, $γd \rightarrow π^0ηd$ was studied with the BGOOD experiment at ELSA from threshold to a centre-of-mass energy of 3200\,MeV. A full kinematic reconstruction was made, with final state deuterons identified in the forward spectrometer and $π^0$ and $η$ decays in the central BGO Rugby Ball. The strength of the differential cross section exceeds what can be described by models of coherent photoproduction at forward angles by orders of magnitude. The distribution of the differential cross section has an excellent agreement with a model including quasi-free $Δπ$ photoproduction, pion re-scattering and $N(1535)$ formation and subsequent nucleon coalescence to the deuteron. This also gives a reasonable description of the two-body invariant mass distributions and naturally explains the similar magnitudes of this channel and $π^0π^0 d$ coherent photoproduction.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
The oldest stars with low neutron-capture element abundances and origins in ancient dwarf galaxies
Authors:
Hillary Diane Andales,
Ananda Santos Figueiredo,
Casey Gordon Fienberg,
Mohammad K. Mardini,
Anna Frebel
Abstract:
We present a detailed chemical abundance and kinematic analysis of six extremely metal-poor ($-4.2 \leq$ [Fe/H] $\leq-$2.9) halo stars with very low neutron-capture abundances ([Sr/H] and [Ba/H]) based on high-resolution Magellan/MIKE spectra. Three of our stars have [Sr/Ba] and [Sr/H] ratios that resemble those of metal-poor stars in ultra-faint dwarf galaxies (UFDs). Since early UFDs may be the…
▽ More
We present a detailed chemical abundance and kinematic analysis of six extremely metal-poor ($-4.2 \leq$ [Fe/H] $\leq-$2.9) halo stars with very low neutron-capture abundances ([Sr/H] and [Ba/H]) based on high-resolution Magellan/MIKE spectra. Three of our stars have [Sr/Ba] and [Sr/H] ratios that resemble those of metal-poor stars in ultra-faint dwarf galaxies (UFDs). Since early UFDs may be the building blocks of the Milky Way, extremely metal-poor halo stars with low, UFD-like Sr and Ba abundances may thus be ancient stars from the earliest small galactic systems that were accreted by the proto-Milky Way. We label these objects as Small Accreted Stellar System (SASS) stars, and we find an additional 61 similar ones in the literature. A kinematic analysis of our sample and literature stars reveals them to be fast-moving halo objects, all with retrograde motion, indicating an accretion origin. Because SASS stars are much brighter than typical UFD stars, identifying them offers promising ways towards detailed studies of early star formation environments. From the chemical abundances of SASS stars, it appears that the earliest accreted systems were likely enriched by a few supernovae whose light element yields varied from system to system. Neutron-capture elements were sparsely produced and/or diluted, with $r$-process nucleosynthesis playing a role. These insights offer a glimpse into the early formation of the Galaxy. Using neutron-capture elements as a distinguishing criterion for early formation, we have access to a unique metal-poor population that consists of the oldest stars in the universe.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Conformal Prediction for Natural Language Processing: A Survey
Authors:
Margarida M. Campos,
António Farinhas,
Chrysoula Zerva,
Mário A. T. Figueiredo,
André F. T. Martins
Abstract:
The rapid proliferation of large language models and natural language processing (NLP) applications creates a crucial need for uncertainty quantification to mitigate risks such as hallucinations and to enhance decision-making reliability in critical applications. Conformal prediction is emerging as a theoretically sound and practically useful framework, combining flexibility with strong statistica…
▽ More
The rapid proliferation of large language models and natural language processing (NLP) applications creates a crucial need for uncertainty quantification to mitigate risks such as hallucinations and to enhance decision-making reliability in critical applications. Conformal prediction is emerging as a theoretically sound and practically useful framework, combining flexibility with strong statistical guarantees. Its model-agnostic and distribution-free nature makes it particularly promising to address the current shortcomings of NLP systems that stem from the absence of uncertainty quantification. This paper provides a comprehensive survey of conformal prediction techniques, their guarantees, and existing applications in NLP, pointing to directions for future research and open challenges.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
A Measure of Synergy based on Union Information
Authors:
André F. C. Gomes,
Mário A. T. Figueiredo
Abstract:
The partial information decomposition (PID) framework is concerned with decomposing the information that a set of (two or more) random variables (the sources) has about another variable (the target) into three types of information: unique, redundant, and synergistic. Classical information theory alone does not provide a unique way to decompose information in this manner and additional assumptions…
▽ More
The partial information decomposition (PID) framework is concerned with decomposing the information that a set of (two or more) random variables (the sources) has about another variable (the target) into three types of information: unique, redundant, and synergistic. Classical information theory alone does not provide a unique way to decompose information in this manner and additional assumptions have to be made. One often overlooked way to achieve this decomposition is using a so-called measure of union information - which quantifies the information that is present in at least one of the sources - from which a synergy measure stems. In this paper, we introduce a new measure of union information based on adopting a communication channel perspective, compare it with existing measures, and study some of its properties. We also include a comprehensive critical review of characterizations of union information and synergy measures that have been proposed in the literature.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
Cost-Sensitive Learning to Defer to Multiple Experts with Workload Constraints
Authors:
Jean V. Alves,
Diogo Leitão,
Sérgio Jesus,
Marco O. P. Sampaio,
Javier Liébana,
Pedro Saleiro,
Mário A. T. Figueiredo,
Pedro Bizarro
Abstract:
Learning to defer (L2D) aims to improve human-AI collaboration systems by learning how to defer decisions to humans when they are more likely to be correct than an ML classifier. Existing research in L2D overlooks key aspects of real-world systems that impede its practical adoption, namely: i) neglecting cost-sensitive scenarios, where type 1 and type 2 errors have different costs; ii) requiring c…
▽ More
Learning to defer (L2D) aims to improve human-AI collaboration systems by learning how to defer decisions to humans when they are more likely to be correct than an ML classifier. Existing research in L2D overlooks key aspects of real-world systems that impede its practical adoption, namely: i) neglecting cost-sensitive scenarios, where type 1 and type 2 errors have different costs; ii) requiring concurrent human predictions for every instance of the training dataset and iii) not dealing with human work capacity constraints. To address these issues, we propose the deferral under cost and capacity constraints framework (DeCCaF). DeCCaF is a novel L2D approach, employing supervised learning to model the probability of human error under less restrictive data requirements (only one expert prediction per instance) and using constraint programming to globally minimize the error cost subject to workload limitations. We test DeCCaF in a series of cost-sensitive fraud detection scenarios with different teams of 9 synthetic fraud analysts, with individual work capacity constraints. The results demonstrate that our approach performs significantly better than the baselines in a wide array of scenarios, achieving an average 8.4% reduction in the misclassification cost.
△ Less
Submitted 21 March, 2024; v1 submitted 11 March, 2024;
originally announced March 2024.
-
DiConStruct: Causal Concept-based Explanations through Black-Box Distillation
Authors:
Ricardo Moreira,
Jacopo Bono,
Mário Cardoso,
Pedro Saleiro,
Mário A. T. Figueiredo,
Pedro Bizarro
Abstract:
Model interpretability plays a central role in human-AI decision-making systems. Ideally, explanations should be expressed using human-interpretable semantic concepts. Moreover, the causal relations between these concepts should be captured by the explainer to allow for reasoning about the explanations. Lastly, explanation methods should be efficient and not compromise the performance of the predi…
▽ More
Model interpretability plays a central role in human-AI decision-making systems. Ideally, explanations should be expressed using human-interpretable semantic concepts. Moreover, the causal relations between these concepts should be captured by the explainer to allow for reasoning about the explanations. Lastly, explanation methods should be efficient and not compromise the performance of the predictive task. Despite the rapid advances in AI explainability in recent years, as far as we know to date, no method fulfills these three properties. Indeed, mainstream methods for local concept explainability do not produce causal explanations and incur a trade-off between explainability and prediction performance. We present DiConStruct, an explanation method that is both concept-based and causal, with the goal of creating more interpretable local explanations in the form of structural causal models and concept attributions. Our explainer works as a distillation model to any black-box machine learning model by approximating its predictions while producing the respective explanations. Because of this, DiConStruct generates explanations efficiently while not impacting the black-box prediction task. We validate our method on an image dataset and a tabular dataset, showing that DiConStruct approximates the black-box models with higher fidelity than other concept explainability baselines, while providing explanations that include the causal relations between the concepts.
△ Less
Submitted 26 January, 2024; v1 submitted 16 January, 2024;
originally announced January 2024.
-
Impulsive Control on Invariant Surfaces
Authors:
C. C. Silva Jr.,
J. Marao,
A. Figueiredo,
T. M. Rocha Filho
Abstract:
An impulsive feedback-adaptive control is developed in order to drive trajectories of a dynamical system towards an invariant manifold with fixed and spaced impulsive controls. The approach requires the explicit knowledge of the set of equations defining the invariant manifold and is based on the concept of stability exponents of invariant manifolds.
An impulsive feedback-adaptive control is developed in order to drive trajectories of a dynamical system towards an invariant manifold with fixed and spaced impulsive controls. The approach requires the explicit knowledge of the set of equations defining the invariant manifold and is based on the concept of stability exponents of invariant manifolds.
△ Less
Submitted 16 August, 2023;
originally announced January 2024.
-
FiFAR: A Fraud Detection Dataset for Learning to Defer
Authors:
Jean V. Alves,
Diogo Leitão,
Sérgio Jesus,
Marco O. P. Sampaio,
Pedro Saleiro,
Mário A. T. Figueiredo,
Pedro Bizarro
Abstract:
Public dataset limitations have significantly hindered the development and benchmarking of learning to defer (L2D) algorithms, which aim to optimally combine human and AI capabilities in hybrid decision-making systems. In such systems, human availability and domain-specific concerns introduce difficulties, while obtaining human predictions for training and evaluation is costly. Financial fraud det…
▽ More
Public dataset limitations have significantly hindered the development and benchmarking of learning to defer (L2D) algorithms, which aim to optimally combine human and AI capabilities in hybrid decision-making systems. In such systems, human availability and domain-specific concerns introduce difficulties, while obtaining human predictions for training and evaluation is costly. Financial fraud detection is a high-stakes setting where algorithms and human experts often work in tandem; however, there are no publicly available datasets for L2D concerning this important application of human-AI teaming. To fill this gap in L2D research, we introduce the Financial Fraud Alert Review Dataset (FiFAR), a synthetic bank account fraud detection dataset, containing the predictions of a team of 50 highly complex and varied synthetic fraud analysts, with varied bias and feature dependence. We also provide a realistic definition of human work capacity constraints, an aspect of L2D systems that is often overlooked, allowing for extensive testing of assignment systems under real-world conditions. We use our dataset to develop a capacity-aware L2D method and rejection learning approach under realistic data availability conditions, and benchmark these baselines under an array of 300 distinct testing scenarios. We believe that this dataset will serve as a pivotal instrument in facilitating a systematic, rigorous, reproducible, and transparent evaluation and comparison of L2D methods, thereby fostering the development of more synergistic human-AI collaboration in decision-making systems. The public dataset and detailed synthetic expert information are available at: https://github.com/feedzai/fifar-dataset
△ Less
Submitted 20 December, 2023;
originally announced December 2023.
-
Energetic Particle Tracing in Optimized Quasisymmetric Stellarator Equilibria
Authors:
P. A. Figueiredo,
R. Jorge,
J. Ferreira,
P. Rodrigues
Abstract:
Recent developments in the design of magnetic confinement fusion devices have allowed the construction of exceptionally optimized stellarator configurations. The near-axis expansion in particular has proven to enable the construction of magnetic configurations with good confinement properties while taking only a fraction of the usual computation time to generate optimized magnetic equilibria. Howe…
▽ More
Recent developments in the design of magnetic confinement fusion devices have allowed the construction of exceptionally optimized stellarator configurations. The near-axis expansion in particular has proven to enable the construction of magnetic configurations with good confinement properties while taking only a fraction of the usual computation time to generate optimized magnetic equilibria. However, not much is known about the overall features of fast-particle orbits computed in such analytical, yet simplified, equilibria when compared to those originating from accurate equilibrium solutions. This work aims to assess and demonstrate the potential of the near-axis expansion to provide accurate information on particle orbits and to compute loss fractions in moderate to high aspect ratios. The configurations used here are all scaled to fusion-relevant parameters and approximate quasisymmetry in various degrees. This allows us to understand how deviations from quasisymmetry affect particle orbits and what are their effects on the estimation of the loss fraction. Guiding-center trajectories of fusion-born alpha particles are traced using gyronimo and SIMPLE codes under the NEAT framework, showing good numerical agreement. Discrepancies between near-axis and MHD fields have minor effects on passing particles but significant effects on trapped particles, especially in quasihelically symmetric magnetic fields. Effective expressions were found for estimating orbit widths and passing-trapped separatrix in quasisymmetric near-axis fields. Loss fractions agree in the prompt losses regime but diverge afterward.
△ Less
Submitted 15 April, 2024; v1 submitted 13 November, 2023;
originally announced November 2023.
-
Synge's dynamic problem for two isolated point charges. A new method to find global solutions for Functional Differential Equations System
Authors:
Rodrigo R. Silva,
Annibal Figueiredo
Abstract:
Synge's problem consists in to determine the dynamics of two point electrical charges interacting through their electromagnetic fields, without to take into account the radiation terms due to the self-forces in each charge. We discuss how this problem is related to the question on to establish initial conditions for the electromagnetic fields that are compatible with the two point charges system i…
▽ More
Synge's problem consists in to determine the dynamics of two point electrical charges interacting through their electromagnetic fields, without to take into account the radiation terms due to the self-forces in each charge. We discuss how this problem is related to the question on to establish initial conditions for the electromagnetic fields that are compatible with the two point charges system isolation, that is, the charges are free from the action of external forces. This problem stems from the existence of inter-temporal constraints for the charges trajectories, which implies that the relativistic Newton equations for the charges is not a system of ODEs, but rather a system of Functional Differential Equations (FDEs). We developed a new method to obtain global solutions that satisfies this system of FDEs and a given initial condition for the charges positions and velocities. This method allows the construction of a recursive numerical algorithm that only use integration methods for ODEs systems. Finally, we apply this algorithm to obtain numerical approximations for the quasi-circular solutions that are predicted in Synge's problem.
△ Less
Submitted 16 August, 2023;
originally announced August 2023.
-
On the probability distributions of the force and potential energy for a system with an infinite number of random point sources
Authors:
E. L. S. Silva,
L. H. Miranda-Filho,
A. Figueiredo
Abstract:
In this work, we study the probability distribution for the force and potential energy of a test particle interacting with $N$ point random sources in the limit $N\rightarrow\infty$. The interaction is given by a central potential $V(R)=k/R^{δ-1}$ in a $ d$-dimensional euclidean space, where $R$ is the random relative distance between the source and the test particle, $δ$ is the force exponent, an…
▽ More
In this work, we study the probability distribution for the force and potential energy of a test particle interacting with $N$ point random sources in the limit $N\rightarrow\infty$. The interaction is given by a central potential $V(R)=k/R^{δ-1}$ in a $ d$-dimensional euclidean space, where $R$ is the random relative distance between the source and the test particle, $δ$ is the force exponent, and $k$ is the coupling parameter. In order to assure a well-defined limit for the probability distribution of the force and potential energy, we { must} renormalize the coupling parameter and/or the system size as a function of the number $N$ of sources.
We show the existence of three non-singular limits, depending on the exponent $δ$ and the spatial dimension $d$. (i) For $δ<d$ the force and potential energy { converge} to their respective mean values. This limit is called Mean Field Limit. (ii) For $δ>d+1$ the potential energy converges to a random variable and the force to a random vector. This limit is called Thermodynamic Limit. (iii) For $d<δ<d+1$ the potential energy converges to its mean and the force to a random vector. This limit is called Mixed Limit
Also, we show the existence of two singular limits: (iv) for $δ=d$ the potential energy converges to its mean and the force to zero, and (v) for $δ=d+1$ the energy converges to a finite value and the force to a random vector.
△ Less
Submitted 16 August, 2023;
originally announced August 2023.
-
Mathematical Properties of Strategies to Control Epidemic Outbreaks in the Context of SEIR Models with Multiple Infectious Stages
Authors:
Annibal Figueiredo,
Tarcısio Marciano da Rocha Filho
Abstract:
In this work we analyze mathematically the consequences and effectiveness of strategies to control an epidemic in the framework of classical SEIR models with multiple parallel infectious stages. We define the mathematical concept of a control strategy, showing that it implies turning classic epidemiological models into systems of non-autonomous differential equations. The analysis of these non-aut…
▽ More
In this work we analyze mathematically the consequences and effectiveness of strategies to control an epidemic in the framework of classical SEIR models with multiple parallel infectious stages. We define the mathematical concept of a control strategy, showing that it implies turning classic epidemiological models into systems of non-autonomous differential equations. The analysis of these non-autonomous systems is based on the two main results obtained in this work: the first establishes a condition that implies a dynamic without epidemic outbreaks; the second establishes a maximum value for the susceptible population associated to the fixed points that are attractors, moreover, we proof that any trajectory converges to some of these attractors. An important consequence of this last result is the existence of an insurmountable limit on the number of infected individuals after the end of a given control strategy. This restriction can only be mitigated by changing the maximum value of susceptible population associated to the system attractors, which could only be done with permanent control action, that is, without returning to normality. Another interesting result of our work is to show how the moment to start and the way how the control strategy ends strongly impacts the asymptotic value for the total number of infected individuals. We illustrate our analysis and results in a SEIR model (with two or three parallel stages) applied to describe the COVID-19 epidemic.
△ Less
Submitted 16 August, 2023;
originally announced August 2023.
-
Modified Verhulst-Solow model for long-term population and economic growths
Authors:
Iram Gleriaa,
Sergio Da Silvab,
Leon Brenig,
Tarcısio M. Rocha Filho,
Annibal Figueiredo
Abstract:
In this study, we analyze the relationship between human population growth and economic dynamics. To do so, we present a modified version of the Verhulst model and the Solow model, which together simulate population dynamics and the role of economic variables in capital accumulation. The model incorporates support and foraging functions, which participate in the dynamic relationship between popula…
▽ More
In this study, we analyze the relationship between human population growth and economic dynamics. To do so, we present a modified version of the Verhulst model and the Solow model, which together simulate population dynamics and the role of economic variables in capital accumulation. The model incorporates support and foraging functions, which participate in the dynamic relationship between population growth and the creation and destruction of carrying capacity. The validity of the model is demonstrated using empirical data.
△ Less
Submitted 16 August, 2023;
originally announced August 2023.
-
Orders between channels and implications for partial information decomposition
Authors:
André F. C. Gomes,
Máario A. T. Figueiredo
Abstract:
The partial information decomposition (PID) framework is concerned with decomposing the information that a set of random variables has with respect to a target variable into three types of components: redundant, synergistic, and unique. Classical information theory alone does not provide a unique way to decompose information in this manner and additional assumptions have to be made. Recently, Kolc…
▽ More
The partial information decomposition (PID) framework is concerned with decomposing the information that a set of random variables has with respect to a target variable into three types of components: redundant, synergistic, and unique. Classical information theory alone does not provide a unique way to decompose information in this manner and additional assumptions have to be made. Recently, Kolchinsky proposed a new general axiomatic approach to obtain measures of redundant information, based on choosing an order relation between information sources (equivalently, order between communication channels). In this paper, we exploit this approach to introduce three new measures of redundant information (and the resulting decompositions) based on well-known preorders between channels, thus contributing to the enrichment of the PID landscape. We relate the new decompositions to existing ones, study some of their properties, and provide examples illustrating their novelty. As a side result, we prove that any preorder that satisfies Kolchinsky's axioms yields a decomposition that meets the axioms originally introduced by Williams and Beer when they first propose the PID.
△ Less
Submitted 14 July, 2023; v1 submitted 10 May, 2023;
originally announced May 2023.
-
Fairness-Aware Data Valuation for Supervised Learning
Authors:
José Pombal,
Pedro Saleiro,
Mário A. T. Figueiredo,
Pedro Bizarro
Abstract:
Data valuation is a ML field that studies the value of training instances towards a given predictive task. Although data bias is one of the main sources of downstream model unfairness, previous work in data valuation does not consider how training instances may influence both performance and fairness of ML models. Thus, we propose Fairness-Aware Data vauatiOn (FADO), a data valuation framework tha…
▽ More
Data valuation is a ML field that studies the value of training instances towards a given predictive task. Although data bias is one of the main sources of downstream model unfairness, previous work in data valuation does not consider how training instances may influence both performance and fairness of ML models. Thus, we propose Fairness-Aware Data vauatiOn (FADO), a data valuation framework that can be used to incorporate fairness concerns into a series of ML-related tasks (e.g., data pre-processing, exploratory data analysis, active learning). We propose an entropy-based data valuation metric suited to address our two-pronged goal of maximizing both performance and fairness, which is more computationally efficient than existing metrics. We then show how FADO can be applied as the basis for unfairness mitigation pre-processing techniques. Our methods achieve promising results -- up to a 40 p.p. improvement in fairness at a less than 1 p.p. loss in performance compared to a baseline -- and promote fairness in a data-centric way, where a deeper understanding of data quality takes center stage.
△ Less
Submitted 29 March, 2023;
originally announced March 2023.
-
Distinguishing Cause from Effect on Categorical Data: The Uniform Channel Model
Authors:
Mário A. T. Figueiredo,
Catarina A. Oliveira
Abstract:
Distinguishing cause from effect using observations of a pair of random variables is a core problem in causal discovery. Most approaches proposed for this task, namely additive noise models (ANM), are only adequate for quantitative data. We propose a criterion to address the cause-effect problem with categorical variables (living in sets with no meaningful order), inspired by seeing a conditional…
▽ More
Distinguishing cause from effect using observations of a pair of random variables is a core problem in causal discovery. Most approaches proposed for this task, namely additive noise models (ANM), are only adequate for quantitative data. We propose a criterion to address the cause-effect problem with categorical variables (living in sets with no meaningful order), inspired by seeing a conditional probability mass function (pmf) as a discrete memoryless channel. We select as the most likely causal direction the one in which the conditional pmf is closer to a uniform channel (UC). The rationale is that, in a UC, as in an ANM, the conditional entropy (of the effect given the cause) is independent of the cause distribution, in agreement with the principle of independence of cause and mechanism. Our approach, which we call the uniform channel model (UCM), thus extends the ANM rationale to categorical variables. To assess how close a conditional pmf (estimated from data) is to a UC, we use statistical testing, supported by a closed-form estimate of a UC channel. On the theoretical front, we prove identifiability of the UCM and show its equivalence with a structural causal model with a low-cardinality exogenous variable. Finally, the proposed method compares favorably with recent state-of-the-art alternatives in experiments on synthetic, benchmark, and real data.
△ Less
Submitted 14 March, 2023;
originally announced March 2023.
-
High-order coupling of shear and sonic continua in JET plasmas
Authors:
Paulo Rodrigues,
Duarte Borba,
Francesca Cella,
Rui Coelho,
Jorge Ferreira,
António Figueiredo,
Mervi Mantsinen,
Fernando Nabais,
Sergei Sharapov,
Paula Sirén,
JET Contributors
Abstract:
A recent model coupling the shear-Alfvén and acoustic continua, which depends strongly on the equilibrium sha** and on elongation in particular, is employed to explain the properties of Alfvénic activity observed on JET plasmas below but close to the typical frequency of toroidicity-induced Alfvén eigenmodes (TAEs). The frequency gaps predicted by the model result from high-order harmonics of th…
▽ More
A recent model coupling the shear-Alfvén and acoustic continua, which depends strongly on the equilibrium sha** and on elongation in particular, is employed to explain the properties of Alfvénic activity observed on JET plasmas below but close to the typical frequency of toroidicity-induced Alfvén eigenmodes (TAEs). The frequency gaps predicted by the model result from high-order harmonics of the geodesic field-line curvature caused by plasma sha** (as opposed to lower-order toroidicity) and give rise to high-order geodesic acoustic eigenmodes (HOGAEs), their frequency value being close to one-half of the TAEs one. The theoretical predictions of HOGAE frequency and radial location are found to be in fair agreement with measurements in JET experiments, including magnetic, reflectometry and soft x-ray data. The stability of the observed HOGAEs is evaluated with the linear hybrid MHD/drift-kinetic code CASTOR-K, taking into account the energetic-ion populations produced by the NBI and ICRH heating systems. Wave-particle resonances, along with drive/dam** mechanisms, are also discussed in order to understand the conditions leading to HOGAEs destabilization in JET plasmas.
△ Less
Submitted 18 April, 2023; v1 submitted 13 September, 2022;
originally announced September 2022.
-
ProBoost: a Boosting Method for Probabilistic Classifiers
Authors:
Fábio Mendonça,
Sheikh Shanawaz Mostafa,
Fernando Morgado-Dias,
Antonio G. Ravelo-García,
Mário A. T. Figueiredo
Abstract:
ProBoost, a new boosting algorithm for probabilistic classifiers, is proposed in this work. This algorithm uses the epistemic uncertainty of each training sample to determine the most challenging/uncertain ones; the relevance of these samples is then increased for the next weak learner, producing a sequence that progressively focuses on the samples found to have the highest uncertainty. In the end…
▽ More
ProBoost, a new boosting algorithm for probabilistic classifiers, is proposed in this work. This algorithm uses the epistemic uncertainty of each training sample to determine the most challenging/uncertain ones; the relevance of these samples is then increased for the next weak learner, producing a sequence that progressively focuses on the samples found to have the highest uncertainty. In the end, the weak learners' outputs are combined into a weighted ensemble of classifiers. Three methods are proposed to manipulate the training set: undersampling, oversampling, and weighting the training samples according to the uncertainty estimated by the weak learners. Furthermore, two approaches are studied regarding the ensemble combination. The weak learner herein considered is a standard convolutional neural network, and the probabilistic models underlying the uncertainty estimation use either variational inference or Monte Carlo dropout. The experimental evaluation carried out on MNIST benchmark datasets shows that ProBoost yields a significant performance improvement. The results are further highlighted by assessing the relative achievable improvement, a metric proposed in this work, which shows that a model with only four weak learners leads to an improvement exceeding 12% in this metric (for either accuracy, sensitivity, or specificity), in comparison to the model learned without ProBoost.
△ Less
Submitted 4 September, 2022;
originally announced September 2022.
-
Aveiro Tech City Living Lab: A Communication, Sensing and Computing Platform for City Environments
Authors:
Pedro Rito,
Ana Almeida,
Andreia Figueiredo,
Christian Gomes,
Pedro Teixeira,
Rodrigo Rosmaninho,
Rui Lopes,
Duarte Dias,
Gonçalo Vítor,
Gonçalo Perna,
Miguel Silva,
Carlos Senna,
Duarte Raposo,
Miguel Luís,
Susana Sargento,
Arnaldo Oliveira,
Nuno Borges de Carvalho
Abstract:
This article presents the deployment and experimentation architecture of the Aveiro Tech City Living Lab (ATCLL) in Aveiro, Portugal. This platform comprises a large number of Internet-of-Things devices with communication, sensing and computing capabilities. The communication infrastructure, built on fiber and Millimeter-wave (mmWave) links, integrates a communication network with radio terminals…
▽ More
This article presents the deployment and experimentation architecture of the Aveiro Tech City Living Lab (ATCLL) in Aveiro, Portugal. This platform comprises a large number of Internet-of-Things devices with communication, sensing and computing capabilities. The communication infrastructure, built on fiber and Millimeter-wave (mmWave) links, integrates a communication network with radio terminals (WiFi, ITS-G5, C-V2X, 5G and LoRa(WAN)), multiprotocol, spread throughout 44 connected points of access in the city. Additionally, public transportation has also been equipped with communication and sensing units. All these points combine and interconnect a set of sensors, such as mobility (Radars, Lidars, video cameras) and environmental sensors. Combining edge computing and cloud management to deploy the services and manage the platform, and a data platform to gather and process the data, the living lab supports a wide range of services and applications: IoT, intelligent transportation systems and assisted driving, environmental monitoring, emergency and safety, among others. This article describes the architecture, implementation and deployment to make the overall platform to work and integrate researchers and citizens. Moreover, it showcases some examples of the performance metrics achieved in the city infrastructure, the data that can be collected, visualized and used to build services and applications to the cities, and, finally, different use cases in the mobility and safety scenarios.
△ Less
Submitted 25 July, 2022;
originally announced July 2022.
-
Understanding Unfairness in Fraud Detection through Model and Data Bias Interactions
Authors:
José Pombal,
André F. Cruz,
João Bravo,
Pedro Saleiro,
Mário A. T. Figueiredo,
Pedro Bizarro
Abstract:
In recent years, machine learning algorithms have become ubiquitous in a multitude of high-stakes decision-making applications. The unparalleled ability of machine learning algorithms to learn patterns from data also enables them to incorporate biases embedded within. A biased model can then make decisions that disproportionately harm certain groups in society -- limiting their access to financial…
▽ More
In recent years, machine learning algorithms have become ubiquitous in a multitude of high-stakes decision-making applications. The unparalleled ability of machine learning algorithms to learn patterns from data also enables them to incorporate biases embedded within. A biased model can then make decisions that disproportionately harm certain groups in society -- limiting their access to financial services, for example. The awareness of this problem has given rise to the field of Fair ML, which focuses on studying, measuring, and mitigating unfairness in algorithmic prediction, with respect to a set of protected groups (e.g., race or gender). However, the underlying causes for algorithmic unfairness still remain elusive, with researchers divided between blaming either the ML algorithms or the data they are trained on. In this work, we maintain that algorithmic unfairness stems from interactions between models and biases in the data, rather than from isolated contributions of either of them. To this end, we propose a taxonomy to characterize data bias and we study a set of hypotheses regarding the fairness-accuracy trade-offs that fairness-blind ML algorithms exhibit under different data bias settings. On our real-world account-opening fraud use case, we find that each setting entails specific trade-offs, affecting fairness in expected value and variance -- the latter often going unnoticed. Moreover, we show how algorithms compare differently in terms of accuracy and fairness, depending on the biases affecting the data. Finally, we note that under specific data bias conditions, simple pre-processing interventions can successfully balance group-wise error rates, while the same techniques fail in more complex settings.
△ Less
Submitted 13 July, 2022;
originally announced July 2022.
-
Human-AI Collaboration in Decision-Making: Beyond Learning to Defer
Authors:
Diogo Leitão,
Pedro Saleiro,
Mário A. T. Figueiredo,
Pedro Bizarro
Abstract:
Human-AI collaboration (HAIC) in decision-making aims to create synergistic teaming between human decision-makers and AI systems. Learning to defer (L2D) has been presented as a promising framework to determine who among humans and AI should make which decisions in order to optimize the performance and fairness of the combined system. Nevertheless, L2D entails several often unfeasible requirements…
▽ More
Human-AI collaboration (HAIC) in decision-making aims to create synergistic teaming between human decision-makers and AI systems. Learning to defer (L2D) has been presented as a promising framework to determine who among humans and AI should make which decisions in order to optimize the performance and fairness of the combined system. Nevertheless, L2D entails several often unfeasible requirements, such as the availability of predictions from humans for every instance or ground-truth labels that are independent from said humans. Furthermore, neither L2D nor alternative approaches tackle fundamental issues of deploying HAIC systems in real-world settings, such as capacity management or dealing with dynamic environments. In this paper, we aim to identify and review these and other limitations, pointing to where opportunities for future research in HAIC may lie.
△ Less
Submitted 13 July, 2022; v1 submitted 27 June, 2022;
originally announced June 2022.
-
Prisoners of Their Own Devices: How Models Induce Data Bias in Performative Prediction
Authors:
José Pombal,
Pedro Saleiro,
Mário A. T. Figueiredo,
Pedro Bizarro
Abstract:
The unparalleled ability of machine learning algorithms to learn patterns from data also enables them to incorporate biases embedded within. A biased model can then make decisions that disproportionately harm certain groups in society. Much work has been devoted to measuring unfairness in static ML environments, but not in dynamic, performative prediction ones, in which most real-world use cases o…
▽ More
The unparalleled ability of machine learning algorithms to learn patterns from data also enables them to incorporate biases embedded within. A biased model can then make decisions that disproportionately harm certain groups in society. Much work has been devoted to measuring unfairness in static ML environments, but not in dynamic, performative prediction ones, in which most real-world use cases operate. In the latter, the predictive model itself plays a pivotal role in sha** the distribution of the data. However, little attention has been heeded to relating unfairness to these interactions. Thus, to further the understanding of unfairness in these settings, we propose a taxonomy to characterize bias in the data, and study cases where it is shaped by model behaviour. Using a real-world account opening fraud detection case study as an example, we study the dangers to both performance and fairness of two typical biases in performative prediction: distribution shifts, and the problem of selective labels.
△ Less
Submitted 27 June, 2022;
originally announced June 2022.
-
Vibrational and structural properties of the $R$Fe$_{4}$Sb$_{12}$ ($R=$Na, K, Ca, Sr, Ba) filled skutterudites
Authors:
Juliana G. de Abrantes,
Marli R. Cantarino,
Wagner R. da Silva Neto,
Victória V. Freire,
Alvaro G. Figueiredo,
Tarsis M. Germano,
Bassim Mounssef Jr.,
Eduardo M. Bittar,
Andreas Leithe-Jasper,
Fernando A. Garcia
Abstract:
Vibrational and elastic properties of the $R$Fe$_{4}$Sb$_{12}$ skutterudites are investigated by, respectively, temperature $(T)$ dependent extended X-ray absorption fine structure (EXAFS) and pressure $(P)$ dependent x-ray diffraction (XRD) experiments. The Fe $K$-edge EXAFS experiments of the $R=$ K, Ca and Ba materials were performed in the $T$-interval $6<T<300$ K and XRD experiments of the…
▽ More
Vibrational and elastic properties of the $R$Fe$_{4}$Sb$_{12}$ skutterudites are investigated by, respectively, temperature $(T)$ dependent extended X-ray absorption fine structure (EXAFS) and pressure $(P)$ dependent x-ray diffraction (XRD) experiments. The Fe $K$-edge EXAFS experiments of the $R=$ K, Ca and Ba materials were performed in the $T$-interval $6<T<300$ K and XRD experiments of the $R=$ Na, K, Ca, Sr and Ba materials were performed in the $P$-interval $1\text{ atm }<P<16$ GPa. From EXAFS, we obtained the correlated Debye-Waller parameters that were thus analyzed to extract effective spring constants connected with the Fe-$Y$ (where $Y=$ either $R$, Fe or Sb) scattering paths. Our findings suggest that in the case of the light cations, $R=$ K or Ca, the $R$ atoms are relatively weakly coupled to the cage, in a scenario reminiscent to the Einstein oscillators. From the XRD experiments, we obtained the bulk modulus $B_{0}$ for all $R=$Na, K, Ca, Sr and Ba materials, with values ranging from $77$ GPa ($R=$ K) to $R=99$ GPa ($R=$ Ba) as well as the compressibility $β$ as a function of $P$. The trend in $β$ as a function of the $R$ filler is discussed and it is shown that it does not correlate with simple geometrical considerations but rather with the filler-cage bonding properties.
△ Less
Submitted 15 August, 2022; v1 submitted 28 March, 2022;
originally announced March 2022.
-
Differentiable Causal Discovery Under Latent Interventions
Authors:
Gonçalo R. A. Faria,
André F. T. Martins,
Mário A. T. Figueiredo
Abstract:
Recent work has shown promising results in causal discovery by leveraging interventional data with gradient-based methods, even when the intervened variables are unknown. However, previous work assumes that the correspondence between samples and interventions is known, which is often unrealistic. We envision a scenario with an extensive dataset sampled from multiple intervention distributions and…
▽ More
Recent work has shown promising results in causal discovery by leveraging interventional data with gradient-based methods, even when the intervened variables are unknown. However, previous work assumes that the correspondence between samples and interventions is known, which is often unrealistic. We envision a scenario with an extensive dataset sampled from multiple intervention distributions and one observation distribution, but where we do not know which distribution originated each sample and how the intervention affected the system, \textit{i.e.}, interventions are entirely latent. We propose a method based on neural networks and variational inference that addresses this scenario by framing it as learning a shared causal graph among an infinite mixture (under a Dirichlet process prior) of intervention structural causal models. Experiments with synthetic and real data show that our approach and its semi-supervised variant are able to discover causal relations in this challenging scenario.
△ Less
Submitted 4 March, 2022;
originally announced March 2022.
-
Orbital localization and the role of the Fe and As $4p$ orbitals in BaFe$_{2}$As$_{2}$ probed by XANES
Authors:
A. G. de Figueiredo,
M. R. Cantarino,
W. R. da Silva Neto,
K. R. Pakuszewski,
R. Grossi,
D. S. Christovam,
J. C. Souza,
M. M. Piva,
G. S. Freitas,
P. G. Pagliuso,
C. Adriano,
F. A. Garcia
Abstract:
The polarization dependence of the near edge x-ray absorption spectroscopy (XANES) is an element specific probe to the real-space distribution of the density of unoccupied states in solid-state materials. In this paper, we present Fe and As $K$-edge experiments of Ba(Fe$_{1-x}$$M_{x}$)$_{2}$As$_{2}$ ($M=$ Mn, Co and $x=0.0$ and $0.08$). The experiments reveal a strong polarization dependence of th…
▽ More
The polarization dependence of the near edge x-ray absorption spectroscopy (XANES) is an element specific probe to the real-space distribution of the density of unoccupied states in solid-state materials. In this paper, we present Fe and As $K$-edge experiments of Ba(Fe$_{1-x}$$M_{x}$)$_{2}$As$_{2}$ ($M=$ Mn, Co and $x=0.0$ and $0.08$). The experiments reveal a strong polarization dependence of the probed XANES spectra, which concerns mainly an increase of the intensity of electronic transitions when the beam polarization is set out of the sample's $ab$ crystallographic plane. The results show that states with $p_{z}$-orbital character dominate the density of unoccupied states close to the Fermi level. Partial substitution of Fe by Co is shown to decrease the intensity anisotropy, suggesting that Co promotes electronic transfer preferentially to states with $p_{z}$-orbital character. On the other hand, Mn substitution causes the increase of the spectra $p_{z}$-orbital anisotropy, which is proposed to take place by means of an enhanced local Fe $3d4p$ mixing, unveiling the role of Fe $4p$ states in the localization of the Fe $3d$ orbitals. Moreover, by comparing our results to previous experiments, we identify the relative mixing between Fe and the pnictide $4p_{x,y,z}$ orbitals as a clear divide between the electronic properties of iron arsenides and selenides. Our conclusions are supported by multiple-scattering theory calculations of the XANES spectra and by quantum chemistry calculations of Fe coordination electronic structure.
△ Less
Submitted 24 January, 2022; v1 submitted 18 December, 2021;
originally announced December 2021.
-
Sparse Continuous Distributions and Fenchel-Young Losses
Authors:
André F. T. Martins,
Marcos Treviso,
António Farinhas,
Pedro M. Q. Aguiar,
Mário A. T. Figueiredo,
Mathieu Blondel,
Vlad Niculae
Abstract:
Exponential families are widely used in machine learning, including many distributions in continuous and discrete domains (e.g., Gaussian, Dirichlet, Poisson, and categorical distributions via the softmax transformation). Distributions in each of these families have fixed support. In contrast, for finite domains, recent work on sparse alternatives to softmax (e.g., sparsemax, $α$-entmax, and fused…
▽ More
Exponential families are widely used in machine learning, including many distributions in continuous and discrete domains (e.g., Gaussian, Dirichlet, Poisson, and categorical distributions via the softmax transformation). Distributions in each of these families have fixed support. In contrast, for finite domains, recent work on sparse alternatives to softmax (e.g., sparsemax, $α$-entmax, and fusedmax), has led to distributions with varying support.
This paper develops sparse alternatives to continuous distributions, based on several technical contributions: First, we define $Ω$-regularized prediction maps and Fenchel-Young losses for arbitrary domains (possibly countably infinite or continuous). For linearly parametrized families, we show that minimization of Fenchel-Young losses is equivalent to moment matching of the statistics, generalizing a fundamental property of exponential families. When $Ω$ is a Tsallis negentropy with parameter $α$, we obtain ``deformed exponential families,'' which include $α$-entmax and sparsemax ($α=2$) as particular cases. For quadratic energy functions, the resulting densities are $β$-Gaussians, an instance of elliptical distributions that contain as particular cases the Gaussian, biweight, triweight, and Epanechnikov densities, and for which we derive closed-form expressions for the variance, Tsallis entropy, and Fenchel-Young loss. When $Ω$ is a total variation or Sobolev regularizer, we obtain a continuous version of the fusedmax. Finally, we introduce continuous-domain attention mechanisms, deriving efficient gradient backpropagation algorithms for $α\in \{1, 4/3, 3/2, 2\}$. Using these algorithms, we demonstrate our sparse continuous distributions for attention-based audio classification and visual question answering, showing that they allow attending to time intervals and compact regions.
△ Less
Submitted 4 August, 2022; v1 submitted 4 August, 2021;
originally announced August 2021.
-
Distributed Banach-Picard Iteration: Application to Distributed EM and Distributed PCA
Authors:
Francisco L. Andrade,
Mário A. T. Figueiredo,
João Xavier
Abstract:
In recent work, we proposed a distributed Banach-Picard iteration (DBPI) that allows a set of agents, linked by a communication network, to find a fixed point of a locally contractive (LC) map that is the average of individual maps held by said agents. In this work, we build upon the DBPI and its local linear convergence (LLC) guarantees to make several contributions. We show that Sanger's algorit…
▽ More
In recent work, we proposed a distributed Banach-Picard iteration (DBPI) that allows a set of agents, linked by a communication network, to find a fixed point of a locally contractive (LC) map that is the average of individual maps held by said agents. In this work, we build upon the DBPI and its local linear convergence (LLC) guarantees to make several contributions. We show that Sanger's algorithm for principal component analysis (PCA) corresponds to the iteration of an LC map that can be written as the average of local maps, each map known to each agent holding a subset of the data. Similarly, we show that a variant of the expectation-maximization (EM) algorithm for parameter estimation from noisy and faulty measurements in a sensor network can be written as the iteration of an LC map that is the average of local maps, each available at just one node. Consequently, via the DBPI, we derive two distributed algorithms - distributed EM and distributed PCA - whose LLC guarantees follow from those that we proved for the DBPI. The verification of the LC condition for EM is challenging, as the underlying operator depends on random samples, thus the LC condition is of probabilistic nature.
△ Less
Submitted 26 January, 2022; v1 submitted 20 June, 2021;
originally announced June 2021.
-
Distributed Banach-Picard Iteration for Locally Contractive Maps
Authors:
Francisco L. Andrade,
Mário A. T. Figueiredo,
João Xavier
Abstract:
The Banach-Picard iteration is widely used to find fixed points of locally contractive (LC) maps. This paper extends the Banach-Picard iteration to distributed settings; specifically, we assume the map of which the fixed point is sought to be the average of individual (not necessarily LC) maps held by a set of agents linked by a communication network. An additional difficulty is that the LC map is…
▽ More
The Banach-Picard iteration is widely used to find fixed points of locally contractive (LC) maps. This paper extends the Banach-Picard iteration to distributed settings; specifically, we assume the map of which the fixed point is sought to be the average of individual (not necessarily LC) maps held by a set of agents linked by a communication network. An additional difficulty is that the LC map is not assumed to come from an underlying optimization problem, which prevents exploiting strong global properties such as convexity or Lipschitzianity. Yet, we propose a distributed algorithm and prove its convergence, in fact showing that it maintains the linear rate of the standard Banach-Picard iteration for the average LC map. As another contribution, our proof imports tools from perturbation theory of linear operators, which, to the best of our knowledge, had not been used before in the theory of distributed computation.
△ Less
Submitted 28 December, 2021; v1 submitted 31 March, 2021;
originally announced April 2021.
-
TimeSHAP: Explaining Recurrent Models through Sequence Perturbations
Authors:
João Bento,
Pedro Saleiro,
André F. Cruz,
Mário A. T. Figueiredo,
Pedro Bizarro
Abstract:
Although recurrent neural networks (RNNs) are state-of-the-art in numerous sequential decision-making tasks, there has been little research on explaining their predictions. In this work, we present TimeSHAP, a model-agnostic recurrent explainer that builds upon KernelSHAP and extends it to the sequential domain. TimeSHAP computes feature-, timestep-, and cell-level attributions. As sequences may b…
▽ More
Although recurrent neural networks (RNNs) are state-of-the-art in numerous sequential decision-making tasks, there has been little research on explaining their predictions. In this work, we present TimeSHAP, a model-agnostic recurrent explainer that builds upon KernelSHAP and extends it to the sequential domain. TimeSHAP computes feature-, timestep-, and cell-level attributions. As sequences may be arbitrarily long, we further propose a pruning method that is shown to dramatically decrease both its computational cost and the variance of its attributions. We use TimeSHAP to explain the predictions of a real-world bank account takeover fraud detection RNN model, and draw key insights from its explanations: i) the model identifies important features and events aligned with what fraud analysts consider cues for account takeover; ii) positive predicted sequences can be pruned to only 10% of the original length, as older events have residual attribution values; iii) the most recent input event of positive predictions only contributes on average to 41% of the model's score; iv) notably high attribution to client's age, suggesting a potential discriminatory reasoning, later confirmed as higher false positive rates for older clients.
△ Less
Submitted 26 June, 2021; v1 submitted 30 November, 2020;
originally announced December 2020.
-
Secure Vehicular Communications through Reconfigurable Intelligent Surfaces
Authors:
Yun Ai,
Felipe A. P. de Figueiredo,
Long Kong,
Michael Cheffena,
Symeon Chatzinotas,
Björn Ottersten
Abstract:
Reconfigurable intelligent surfaces (RIS) is considered as a revolutionary technique to improve the wireless system performance by reconfiguring the radio wave propagation environment artificially. Motivated by the potential of RIS in vehicular networks, we analyze the secrecy outage performance of RIS-aided vehicular communications in this paper. More specifically, two vehicular communication sce…
▽ More
Reconfigurable intelligent surfaces (RIS) is considered as a revolutionary technique to improve the wireless system performance by reconfiguring the radio wave propagation environment artificially. Motivated by the potential of RIS in vehicular networks, we analyze the secrecy outage performance of RIS-aided vehicular communications in this paper. More specifically, two vehicular communication scenarios are considered, i.e., a vehicular-to-vehicular (V2V) communication where the RIS acts as a relay and a vehicular-to-infrastructure (V2I) scenario where the RIS functions as the receiver. In both scenarios, a passive eavesdropper is present attempting to retrieve the transmitted information. Closed-form expressions for the secrecy outage probability (SOP) are derived and verified. The results demonstrate the potential of improving secrecy with the aid of RIS under both V2V and V2I communications.
△ Less
Submitted 2 December, 2020; v1 submitted 30 November, 2020;
originally announced November 2020.
-
Control with adaptive Q-learning
Authors:
João Pedro Araújo,
Mário A. T. Figueiredo,
Miguel Ayala Botto
Abstract:
This paper evaluates adaptive Q-learning (AQL) and single-partition adaptive Q-learning (SPAQL), two algorithms for efficient model-free episodic reinforcement learning (RL), in two classical control problems (Pendulum and Cartpole). AQL adaptively partitions the state-action space of a Markov decision process (MDP), while learning the control policy, i. e., the map** from states to actions. The…
▽ More
This paper evaluates adaptive Q-learning (AQL) and single-partition adaptive Q-learning (SPAQL), two algorithms for efficient model-free episodic reinforcement learning (RL), in two classical control problems (Pendulum and Cartpole). AQL adaptively partitions the state-action space of a Markov decision process (MDP), while learning the control policy, i. e., the map** from states to actions. The main difference between AQL and SPAQL is that the latter learns time-invariant policies, where the map** from states to actions does not depend explicitly on the time step. This paper also proposes the SPAQL with terminal state (SPAQL-TS), an improved version of SPAQL tailored for the design of regulators for control problems. The time-invariant policies are shown to result in a better performance than the time-variant ones in both problems studied. These algorithms are particularly fitted to RL problems where the action space is finite, as is the case with the Cartpole problem. SPAQL-TS solves the OpenAI Gym Cartpole problem, while also displaying a higher sample efficiency than trust region policy optimization (TRPO), a standard RL algorithm for solving control tasks. Moreover, the policies learned by SPAQL are interpretable, while TRPO policies are typically encoded as neural networks, and therefore hard to interpret. Yielding interpretable policies while being sample-efficient are the major advantages of SPAQL.
△ Less
Submitted 3 November, 2020;
originally announced November 2020.
-
Variational Mixture of Normalizing Flows
Authors:
Guilherme G. P. Freitas Pires,
Mário A. T. Figueiredo
Abstract:
In the past few years, deep generative models, such as generative adversarial networks \autocite{GAN}, variational autoencoders \autocite{vaepaper}, and their variants, have seen wide adoption for the task of modelling complex data distributions. In spite of the outstanding sample quality achieved by those early methods, they model the target distributions \emph{implicitly}, in the sense that the…
▽ More
In the past few years, deep generative models, such as generative adversarial networks \autocite{GAN}, variational autoencoders \autocite{vaepaper}, and their variants, have seen wide adoption for the task of modelling complex data distributions. In spite of the outstanding sample quality achieved by those early methods, they model the target distributions \emph{implicitly}, in the sense that the probability density functions induced by them are not explicitly accessible. This fact renders those methods unfit for tasks that require, for example, scoring new instances of data with the learned distributions. Normalizing flows have overcome this limitation by leveraging the change-of-variables formula for probability density functions, and by using transformations designed to have tractable and cheaply computable Jacobians. Although flexible, this framework lacked (until recently \autocites{semisuplearning_nflows, RAD}) a way to introduce discrete structure (such as the one found in mixtures) in the models it allows to construct, in an unsupervised scenario. The present work overcomes this by using normalizing flows as components in a mixture model and devising an end-to-end training procedure for such a model. This procedure is based on variational inference, and uses a variational posterior parameterized by a neural network. As will become clear, this model naturally lends itself to (multimodal) density estimation, semi-supervised learning, and clustering. The proposed model is illustrated on two synthetic datasets, as well as on a real-world dataset.
Keywords: Deep generative models, normalizing flows, variational inference, probabilistic modelling, mixture models.
△ Less
Submitted 1 September, 2020;
originally announced September 2020.
-
Equilibrium Propagation for Complete Directed Neural Networks
Authors:
Matilde Tristany Farinha,
Sérgio Pequito,
Pedro A. Santos,
Mário A. T. Figueiredo
Abstract:
Artificial neural networks, one of the most successful approaches to supervised learning, were originally inspired by their biological counterparts. However, the most successful learning algorithm for artificial neural networks, backpropagation, is considered biologically implausible. We contribute to the topic of biologically plausible neuronal learning by building upon and extending the equilibr…
▽ More
Artificial neural networks, one of the most successful approaches to supervised learning, were originally inspired by their biological counterparts. However, the most successful learning algorithm for artificial neural networks, backpropagation, is considered biologically implausible. We contribute to the topic of biologically plausible neuronal learning by building upon and extending the equilibrium propagation learning framework. Specifically, we introduce: a new neuronal dynamics and learning rule for arbitrary network architectures; a sparsity-inducing method able to prune irrelevant connections; a dynamical-systems characterization of the models, using Lyapunov theory.
△ Less
Submitted 17 June, 2020; v1 submitted 15 June, 2020;
originally announced June 2020.
-
Constraints on the Physical Properties of GW190814 through Simulations based on DECam Follow-up Observations by the Dark Energy Survey
Authors:
R. Morgan,
M. Soares-Santos,
J. Annis,
K. Herner,
A. Garcia,
A. Palmese,
A. Drlica-Wagner,
R. Kessler,
J. Garcia-Bellido,
T. G. Bachmann N. Sherman,
S. Allam,
K. Bechtol,
C. R. Bom,
D. Brout,
R. E. Butler,
M. Butner,
R. Cartier,
H. Chen,
C. Conselice,
E. Cook,
T. M. Davis,
Z. Doctor,
B. Farr,
A. L. Figueiredo,
D. A. Finley
, et al. (77 additional authors not shown)
Abstract:
On 14 August 2019, the LIGO and Virgo Collaborations detected gravitational waves from a black hole and a 2.6 solar mass compact object, possibly the first neutron star -- black hole (NSBH) merger. In search of an optical counterpart, the Dark Energy Survey (DES) obtained deep imaging of the entire 90 percent confidence level localization area with Blanco/DECam 0, 1, 2, 3, 6, and 16 nights after t…
▽ More
On 14 August 2019, the LIGO and Virgo Collaborations detected gravitational waves from a black hole and a 2.6 solar mass compact object, possibly the first neutron star -- black hole (NSBH) merger. In search of an optical counterpart, the Dark Energy Survey (DES) obtained deep imaging of the entire 90 percent confidence level localization area with Blanco/DECam 0, 1, 2, 3, 6, and 16 nights after the merger. Objects with varying brightness were detected by the DES Pipeline and we systematically reduced the candidate counterparts through catalog matching, light curve properties, host-galaxy photometric redshifts, SOAR spectroscopic follow-up observations, and machine-learning-based photometric classification. All candidates were rejected as counterparts to the merger. To quantify the sensitivity of our search, we applied our selection criteria to full light curve simulations of supernovae and kilonovae as they would appear in the DECam observations. Since the source class of the merger was uncertain, we utilized an agnostic, three-component kilonova model based on tidally-disrupted NS ejecta properties to quantify our detection efficiency of a counterpart if the merger included a NS. We find that if a kilonova occurred during this merger, configurations where the ejected matter is greater than 0.07 solar masses, has lanthanide abundance less than $10^{-8.56}$, and has a velocity between $0.18c$ and $0.21c$ are disfavored at the $2σ$ level. Furthermore, we estimate that our background reduction methods are capable of associating gravitational wave signals with a detected electromagnetic counterpart at the $4σ$ level in $95\%$ of future follow-up observations.
△ Less
Submitted 19 May, 2022; v1 submitted 12 June, 2020;
originally announced June 2020.
-
Sparse and Continuous Attention Mechanisms
Authors:
André F. T. Martins,
António Farinhas,
Marcos Treviso,
Vlad Niculae,
Pedro M. Q. Aguiar,
Mário A. T. Figueiredo
Abstract:
Exponential families are widely used in machine learning; they include many distributions in continuous and discrete domains (e.g., Gaussian, Dirichlet, Poisson, and categorical distributions via the softmax transformation). Distributions in each of these families have fixed support. In contrast, for finite domains, there has been recent work on sparse alternatives to softmax (e.g. sparsemax and a…
▽ More
Exponential families are widely used in machine learning; they include many distributions in continuous and discrete domains (e.g., Gaussian, Dirichlet, Poisson, and categorical distributions via the softmax transformation). Distributions in each of these families have fixed support. In contrast, for finite domains, there has been recent work on sparse alternatives to softmax (e.g. sparsemax and alpha-entmax), which have varying support, being able to assign zero probability to irrelevant categories. This paper expands that work in two directions: first, we extend alpha-entmax to continuous domains, revealing a link with Tsallis statistics and deformed exponential families. Second, we introduce continuous-domain attention mechanisms, deriving efficient gradient backpropagation algorithms for alpha in {1,2}. Experiments on attention-based text classification, machine translation, and visual question answering illustrate the use of continuous attention in 1D and 2D, showing that it allows attending to time intervals and compact regions.
△ Less
Submitted 29 October, 2020; v1 submitted 12 June, 2020;
originally announced June 2020.
-
A Proposed IoT Smart Trap using Computer Vision for Sustainable Pest Control in Coffee Culture
Authors:
Vitor Alexandre Campos Figueiredo,
Samuel Mafra,
Joel Rodrigues
Abstract:
The Internet of Things (IoT) is emerging as a multi-purpose technology with enormous potential for improving the quality of life in several areas. In particular, IoT has been applied in agriculture to make it more sustainable ecologically. For instance, electronic traps have the potential to perform pest control without any pesticide. In this paper, a smart trap with IoT capabilities that uses com…
▽ More
The Internet of Things (IoT) is emerging as a multi-purpose technology with enormous potential for improving the quality of life in several areas. In particular, IoT has been applied in agriculture to make it more sustainable ecologically. For instance, electronic traps have the potential to perform pest control without any pesticide. In this paper, a smart trap with IoT capabilities that uses computer vision to identify the insect of interest is proposed. The solution includes 1) an embedded system with camera, GPS sensor and motor actuators; 2) an IoT middleware as database service provider, and 3) a Web application to present data by a configurable heat map. The demonstration of proposed solution is exposed and the main conclusions are the perception about pest concentration at the plantation and the viability as alternative pest control over traditional control based on pesticides.
△ Less
Submitted 9 April, 2020;
originally announced April 2020.
-
A Classification-Based Approach to Semi-Supervised Clustering with Pairwise Constraints
Authors:
Marek Śmieja,
Łukasz Struski,
Mário A. T. Figueiredo
Abstract:
In this paper, we introduce a neural network framework for semi-supervised clustering (SSC) with pairwise (must-link or cannot-link) constraints. In contrast to existing approaches, we decompose SSC into two simpler classification tasks/stages: the first stage uses a pair of Siamese neural networks to label the unlabeled pairs of points as must-link or cannot-link; the second stage uses the fully…
▽ More
In this paper, we introduce a neural network framework for semi-supervised clustering (SSC) with pairwise (must-link or cannot-link) constraints. In contrast to existing approaches, we decompose SSC into two simpler classification tasks/stages: the first stage uses a pair of Siamese neural networks to label the unlabeled pairs of points as must-link or cannot-link; the second stage uses the fully pairwise-labeled dataset produced by the first stage in a supervised neural-network-based clustering method. The proposed approach, S3C2 (Semi-Supervised Siamese Classifiers for Clustering), is motivated by the observation that binary classification (such as assigning pairwise relations) is usually easier than multi-class clustering with partial supervision. On the other hand, being classification-based, our method solves only well-defined classification problems, rather than less well specified clustering tasks. Extensive experiments on various datasets demonstrate the high performance of the proposed method.
△ Less
Submitted 18 January, 2020;
originally announced January 2020.
-
Hard-core collisional dynamics in the hamiltonian mean-field model
Authors:
Luciano Miranda Filho,
Igor Melo,
Annibal Figueiredo,
Tarcisio Rocha Filho,
L Filho,
Yves Elskens
Abstract:
We consider a modification of the well studied Hamiltonian Mean-Field model by introducing a hard-core point-like repulsive interaction and propose a numerical integration scheme to integrate numerically its dynamics. Our results show that the outcome of the initial violent relaxation is altered, and also that the phase-diagram is modified with a critical temperature at a higher value than in the…
▽ More
We consider a modification of the well studied Hamiltonian Mean-Field model by introducing a hard-core point-like repulsive interaction and propose a numerical integration scheme to integrate numerically its dynamics. Our results show that the outcome of the initial violent relaxation is altered, and also that the phase-diagram is modified with a critical temperature at a higher value than in the non-collisional counterpart.
△ Less
Submitted 23 April, 2020; v1 submitted 9 September, 2019;
originally announced September 2019.
-
The VISCACHA survey -- deep and resolved photometry of star clusters in the Magellanic Clouds
Authors:
Bruno Dias,
Francisco Maia,
Leandro Kerber,
João F. C. dos Santos Jr.,
Eduardo Bica,
Tina Armond,
Beatriz Barbuy,
Luciano Fraga,
Jose A. Hernandez-Jimenez,
Orlando J. Katime Santrich,
Raphael A. P. Oliveira,
Angeles Pérez-Villegas,
Andres Piatti,
Bruno Quint,
David Sanmartin,
Mateus S. Angelo,
Stefano O. Souza,
Rodrigo G. Vieira,
Pieter Westera,
Celeste Parisi,
Doug Geisler,
Dante Minniti,
Roberto Saito,
Lilia Bassino,
Bruno De Bortoli
, et al. (2 additional authors not shown)
Abstract:
The VISCACHA (VIsible Soar photometry of star Clusters in tApii and Coxi HuguA\footnote{LMC and SMC names in the Tupi-Guarani language spoken by native people in Brazil}) Survey is an ongoing project based on deep and spatiallyresolved photometric observations of Magellanic Cloud star clusters, collected using the SOuthern Astrophysical Research (SOAR) telescope together with the SOAR Adaptive Mod…
▽ More
The VISCACHA (VIsible Soar photometry of star Clusters in tApii and Coxi HuguA\footnote{LMC and SMC names in the Tupi-Guarani language spoken by native people in Brazil}) Survey is an ongoing project based on deep and spatiallyresolved photometric observations of Magellanic Cloud star clusters, collected using the SOuthern Astrophysical Research (SOAR) telescope together with the SOAR Adaptive Module Imager. So far we have used $>$300h of telescope time to observe $\sim$150 star clusters, mostly with low mass ($M < 10^4 M_{\odot}$) on the outskirts of the LMC and SMC. With this high-quality data set, we homogeneously determine physical properties using deep colour-magnitude diagrams (ages, metallicities, reddening, distances, mass, luminosity and mass functions) and structural parameters (radial density profiles, sizes) for these clusters which are used as a proxy to investigate the interplay between the Magellanic Clouds and their evolution. We present the VISCACHA survey and its initial results, based on our first two papers. The project's long term goals and expected legacy to the community are also addressed.
△ Less
Submitted 5 September, 2019;
originally announced September 2019.
-
Distribution Probability of Force for a Physical System of N Random Particles
Authors:
A. D. Figueiredo,
T. M. da Rocha Filho,
M. A. Amato
Abstract:
The present paper attempts to address a discussion on mathematical grounds of a model to associate the generalized version of the CLT and the $N$-body problem related to the calculation of the force on a single star or particle due to the $N-1$ stars or particles whenever they are randomly distributed in the space and $N\rightarrow\infty$. We calculate the resultant force on a test particle immers…
▽ More
The present paper attempts to address a discussion on mathematical grounds of a model to associate the generalized version of the CLT and the $N$-body problem related to the calculation of the force on a single star or particle due to the $N-1$ stars or particles whenever they are randomly distributed in the space and $N\rightarrow\infty$. We calculate the resultant force on a test particle immersed in a $N$-particle system under a $1/r^δ$ force ($δ>0$) and discuss the limit force under different approaches referred to as the Vlasov limit and Fluctuation Limit. Also one shows the behaviour of the limit force in different domains for the Lévy exponent ($α$).
△ Less
Submitted 18 June, 2019;
originally announced June 2019.
-
On Whitney embedding of o-minimal manifolds
Authors:
Ricardo Bianconi,
Rodrigo Figueiredo,
Robson A. Figueiredo
Abstract:
We prove a definable version of the Whitney embedding theorem for abstract-definable $\mathcal{C}^p$ manifolds with $1\leq p<\infty$, namely: every abstract-definable $\mathcal{C}^p$ manifold is abstract-definable $C^p$ embedded into $R^N$, for some positive integer $N$. As a consequence, we show that every abstract-definable $\mathcal{C}^p$ manifold has a compatible $\mathcal{C}^{p+1}$ atlas.
We prove a definable version of the Whitney embedding theorem for abstract-definable $\mathcal{C}^p$ manifolds with $1\leq p<\infty$, namely: every abstract-definable $\mathcal{C}^p$ manifold is abstract-definable $C^p$ embedded into $R^N$, for some positive integer $N$. As a consequence, we show that every abstract-definable $\mathcal{C}^p$ manifold has a compatible $\mathcal{C}^{p+1}$ atlas.
△ Less
Submitted 10 April, 2019;
originally announced April 2019.
-
SURE-fuse WFF: A Multi-resolution Windowed Fourier Analysis for Interferometric Phase Denoising
Authors:
Joshin P. Krishnan,
Mário A. T. Figueiredo,
José M. Bioucas-Dias
Abstract:
Interferometric phase (InPhase) imaging is an important part of many present-day coherent imaging technologies. Often in such imaging techniques, the acquired images, known as interferograms, suffer from two major degradations: 1) phase wrap** caused by the fact that the sensing mechanism can only measure sinusoidal $2π$-periodic functions of the actual phase, and 2) noise introduced by the acqu…
▽ More
Interferometric phase (InPhase) imaging is an important part of many present-day coherent imaging technologies. Often in such imaging techniques, the acquired images, known as interferograms, suffer from two major degradations: 1) phase wrap** caused by the fact that the sensing mechanism can only measure sinusoidal $2π$-periodic functions of the actual phase, and 2) noise introduced by the acquisition process or the system. This work focusses on InPhase denoising which is a fundamental restoration step to many posterior applications of InPhase, namely to phase unwrap**. The presence of sharp fringes that arises from phase wrap** makes InPhase denoising a hard-inverse problem. Motivated by the fact that the InPhase images are often locally sparse in Fourier domain, we propose a multi-resolution windowed Fourier filtering (WFF) analysis that fuses WFF estimates with different resolutions, thus overcoming the WFF fixed resolution limitation. The proposed fusion relies on an unbiased estimate of the mean square error derived using the Stein's lemma adapted to complex-valued signals. This estimate, known as SURE, is minimized using an optimization framework to obtain the fusion weights. Strong experimental evidence, using synthetic and real (InSAR & MRI) data, that the developed algorithm, termed as SURE-fuse WFF, outperforms the best hand-tuned fixed resolution WFF as well as other state-of-the-art InPhase denoising algorithms, is provided.
△ Less
Submitted 26 February, 2019; v1 submitted 9 November, 2018;
originally announced November 2018.
-
Conditional Random Fields as Recurrent Neural Networks for 3D Medical Imaging Segmentation
Authors:
Miguel Monteiro,
Mário A. T. Figueiredo,
Arlindo L. Oliveira
Abstract:
The Conditional Random Field as a Recurrent Neural Network layer is a recently proposed algorithm meant to be placed on top of an existing Fully-Convolutional Neural Network to improve the quality of semantic segmentation. In this paper, we test whether this algorithm, which was shown to improve semantic segmentation for 2D RGB images, is able to improve segmentation quality for 3D multi-modal med…
▽ More
The Conditional Random Field as a Recurrent Neural Network layer is a recently proposed algorithm meant to be placed on top of an existing Fully-Convolutional Neural Network to improve the quality of semantic segmentation. In this paper, we test whether this algorithm, which was shown to improve semantic segmentation for 2D RGB images, is able to improve segmentation quality for 3D multi-modal medical images. We developed an implementation of the algorithm which works for any number of spatial dimensions, input/output image channels, and reference image channels. As far as we know this is the first publicly available implementation of this sort. We tested the algorithm with two distinct 3D medical imaging datasets, we concluded that the performance differences observed were not statistically significant. Finally, in the discussion section of the paper, we go into the reasons as to why this technique transfers poorly from natural images to medical images.
△ Less
Submitted 19 July, 2018;
originally announced July 2018.
-
Equivalence between nonlinear dynamical systems and urn processes
Authors:
Léon Brenig,
Iram Gleria,
Tarcísio M. Rocha Filho,
Annibal Figueiredo,
Benito Hernández-Bermejo
Abstract:
An equivalence is shown between a large class of deterministic dynamical systems and a class of stochastic processes, the balanced urn processes. These dynamical systems are governed by quasi-polynomial differential systems that are widely used in mathematical modeling while urn processes are actively studied in combinatorics and probability theory. The presented equivalence extends a theorem by F…
▽ More
An equivalence is shown between a large class of deterministic dynamical systems and a class of stochastic processes, the balanced urn processes. These dynamical systems are governed by quasi-polynomial differential systems that are widely used in mathematical modeling while urn processes are actively studied in combinatorics and probability theory. The presented equivalence extends a theorem by Flajolet et al. (Flajolet, Dumas and Puyhaubert Discr. Math. Theor. Comp. Sc. AG - 2006, DMTCS Proceedings) already establishing an isomorphism between urn processes and a particular class of differential systems with monomial vector fields. The present result is based on the fact that such monomial differential systems are canonical forms for more general dynamical systems.
△ Less
Submitted 17 July, 2018;
originally announced July 2018.
-
Image Restoration Using Conditional Random Fields and Scale Mixtures of Gaussians
Authors:
Milad Niknejad,
Jose M. Bioucas-Dias,
Mario A. T. Figueiredo
Abstract:
This paper proposes a general framework for internal patch-based image restoration based on Conditional Random Fields (CRF). Unlike related models based on Markov Random Fields (MRF), our approach explicitly formulates the posterior distribution for the entire image. The potential functions are taken as proportional to the product of a likelihood and prior for each patch. By assuming identical par…
▽ More
This paper proposes a general framework for internal patch-based image restoration based on Conditional Random Fields (CRF). Unlike related models based on Markov Random Fields (MRF), our approach explicitly formulates the posterior distribution for the entire image. The potential functions are taken as proportional to the product of a likelihood and prior for each patch. By assuming identical parameters for similar patches, our approach can be classified as a model-based non-local method. For the prior term in the potential function of the CRF model, multivariate Gaussians and multivariate scale-mixture of Gaussians are considered, with the latter being a novel prior for image patches. Our results show that the proposed approach outperforms methods based on Gaussian mixture models for image denoising and state-of-the-art methods for image interpolation/inpainting.
△ Less
Submitted 9 July, 2018;
originally announced July 2018.
-
External Patch-Based Image Restoration Using Importance Sampling
Authors:
Milad Niknejad,
Jose M. Bioucas-Dias,
Mario A. T. Figueiredo
Abstract:
This paper introduces a new approach to patch-based image restoration based on external datasets and importance sampling. The Minimum Mean Squared Error (MMSE) estimate of the image patches, the computation of which requires solving a multidimensional (typically intractable) integral, is approximated using samples from an external dataset. The new method, which can be interpreted as a generalizati…
▽ More
This paper introduces a new approach to patch-based image restoration based on external datasets and importance sampling. The Minimum Mean Squared Error (MMSE) estimate of the image patches, the computation of which requires solving a multidimensional (typically intractable) integral, is approximated using samples from an external dataset. The new method, which can be interpreted as a generalization of the external non-local means (NLM), uses self-normalized importance sampling to efficiently approximate the MMSE estimates. The use of self-normalized importance sampling endows the proposed method with great flexibility, namely regarding the statistical properties of the measurement noise. The effectiveness of the proposed method is shown in a series of experiments using both generic large-scale and class-specific external datasets.
△ Less
Submitted 9 July, 2018;
originally announced July 2018.
-
Impulsive Noise Robust Sparse Recovery via Continuous Mixed Norm
Authors:
Amirhossein Javaheri,
Hadi Zayyani,
Mario A. T. Figueiredo,
Farrokh Marvasti
Abstract:
This paper investigates the problem of sparse signal recovery in the presence of additive impulsive noise. The heavytailed impulsive noise is well modelled with stable distributions. Since there is no explicit formulation for the probability density function of $SαS$ distribution, alternative approximations like Generalized Gaussian Distribution (GGD) are used which impose $\ell_p$-norm fidelity o…
▽ More
This paper investigates the problem of sparse signal recovery in the presence of additive impulsive noise. The heavytailed impulsive noise is well modelled with stable distributions. Since there is no explicit formulation for the probability density function of $SαS$ distribution, alternative approximations like Generalized Gaussian Distribution (GGD) are used which impose $\ell_p$-norm fidelity on the residual error. In this paper, we exploit a Continuous Mixed Norm (CMN) for robust sparse recovery instead of $\ell_p$-norm. We show that in blind conditions, i.e., in case where the parameters of noise distribution are unknown, incorporating CMN can lead to near optimal recovery. We apply Alternating Direction Method of Multipliers (ADMM) for solving the problem induced by utilizing CMN for robust sparse recovery. In this approach, CMN is replaced with a surrogate function and Majorization-Minimization technique is incorporated to solve the problem. Simulation results confirm the efficiency of the proposed method compared to some recent algorithms in the literature for impulsive noise robust sparse recovery.
△ Less
Submitted 12 April, 2018;
originally announced April 2018.