-
Can LLMs Generate Visualizations with Dataless Prompts?
Authors:
Darius Coelho,
Harshit Barot,
Naitik Rathod,
Klaus Mueller
Abstract:
Recent advancements in large language models have revolutionized information access, as these models harness data available on the web to address complex queries, becoming the preferred information source for many users. In certain cases, queries are about publicly available data, which can be effectively answered with data visualizations. In this paper, we investigate the ability of large languag…
▽ More
Recent advancements in large language models have revolutionized information access, as these models harness data available on the web to address complex queries, becoming the preferred information source for many users. In certain cases, queries are about publicly available data, which can be effectively answered with data visualizations. In this paper, we investigate the ability of large language models to provide accurate data and relevant visualizations in response to such queries. Specifically, we investigate the ability of GPT-3 and GPT-4 to generate visualizations with dataless prompts, where no data accompanies the query. We evaluate the results of the models by comparing them to visualization cheat sheets created by visualization experts.
△ Less
Submitted 22 June, 2024;
originally announced June 2024.
-
PRIBOOT: A New Data-Driven Expert for Improved Driving Simulations
Authors:
Daniel Coelho,
Miguel Oliveira,
Vitor Santos,
Antonio M. Lopez
Abstract:
The development of Autonomous Driving (AD) systems in simulated environments like CARLA is crucial for advancing real-world automotive technologies. To drive innovation, CARLA introduced Leaderboard 2.0, significantly more challenging than its predecessor. However, current AD methods have struggled to achieve satisfactory outcomes due to a lack of sufficient ground truth data. Human driving logs p…
▽ More
The development of Autonomous Driving (AD) systems in simulated environments like CARLA is crucial for advancing real-world automotive technologies. To drive innovation, CARLA introduced Leaderboard 2.0, significantly more challenging than its predecessor. However, current AD methods have struggled to achieve satisfactory outcomes due to a lack of sufficient ground truth data. Human driving logs provided by CARLA are insufficient, and previously successful expert agents like Autopilot and Roach, used for collecting datasets, have seen reduced effectiveness under these more demanding conditions. To overcome these data limitations, we introduce PRIBOOT, an expert agent that leverages limited human logs with privileged information. We have developed a novel BEV representation specifically tailored to meet the demands of this new benchmark and processed it as an RGB image to facilitate the application of transfer learning techniques, instead of using a set of masks. Additionally, we propose the Infraction Rate Score (IRS), a new evaluation metric designed to provide a more balanced assessment of driving performance over extended routes. PRIBOOT is the first model to achieve a Route Completion (RC) of 75% in Leaderboard 2.0, along with a Driving Score (DS) and IRS of 20% and 45%, respectively. With PRIBOOT, researchers can now generate extensive datasets, potentially solving the data availability issues that have hindered progress in this benchmark.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
A community palm model
Authors:
Nicholas Clinton,
Andreas Vollrath,
Remi D'annunzio,
Desheng Liu,
Henry B. Glick,
Adrià Descals,
Alicia Sullivan,
Oliver Guinan,
Jacob Abramowitz,
Fred Stolle,
Chris Goodman,
Tanya Birch,
David Quinn,
Olga Danylo,
Tijs Lips,
Daniel Coelho,
Enikoe Bihari,
Bryce Cronkite-Ratcliff,
Ate Poortinga,
Atena Haghighattalab,
Evan Notman,
Michael DeWitt,
Aaron Yonas,
Gennadii Donchyts,
Devaja Shah
, et al. (5 additional authors not shown)
Abstract:
Palm oil production has been identified as one of the major drivers of deforestation for tropical countries. To meet supply chain objectives, commodity producers and other stakeholders need timely information of land cover dynamics in their supply shed. However, such data are difficult to obtain from suppliers who may lack digital geographic representations of their supply sheds and production loc…
▽ More
Palm oil production has been identified as one of the major drivers of deforestation for tropical countries. To meet supply chain objectives, commodity producers and other stakeholders need timely information of land cover dynamics in their supply shed. However, such data are difficult to obtain from suppliers who may lack digital geographic representations of their supply sheds and production locations. Here we present a "community model," a machine learning model trained on pooled data sourced from many different stakeholders, to develop a specific land cover probability map, in this case a semi-global oil palm map. An advantage of this method is the inclusion of varied inputs, the ability to easily update the model as new training data becomes available and run the model on any year that input imagery is available. Inclusion of diverse data sources into one probability map can help establish a shared understanding across stakeholders on the presence and absence of a land cover or commodity (in this case oil palm). The model predictors are annual composites built from publicly available satellite imagery provided by Sentinel-1, Sentinel-2, and ALOS DSM. We provide map outputs as the probability of palm in a given pixel, to reflect the uncertainty of the underlying state (palm or not palm). The initial version of this model provides global accuracy estimated to be approximately 90% (at 0.5 probability threshold) from spatially partitioned test data. This model, and resulting oil palm probability map products are useful for accurately identifying the geographic footprint of palm cultivation. Used in conjunction with timely deforestation information, this palm model is useful for understanding the risk of continued oil palm plantation expansion in sensitive forest areas.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
Optimization of Quantum Systems Emulation via a Variant of the Bandwidth Minimization Problem
Authors:
M. Yassine Naghmouchi,
Joseph Vovrosh,
Wesley da Silva Coelho,
Alexandre Dauphin
Abstract:
This paper introduces weighted-BMP, a variant of the Bandwidth Minimization Problem (BMP), with a significant application in optimizing quantum emulation. Weighted-BMP optimizes particles ordering to reduce the emulation costs, by designing a particle interaction matrix where strong interactions are placed as close as possible to the diagonal. We formulate the problem using a Mixed Integer Linear…
▽ More
This paper introduces weighted-BMP, a variant of the Bandwidth Minimization Problem (BMP), with a significant application in optimizing quantum emulation. Weighted-BMP optimizes particles ordering to reduce the emulation costs, by designing a particle interaction matrix where strong interactions are placed as close as possible to the diagonal. We formulate the problem using a Mixed Integer Linear Program (MILP) and solve it to optimality with a state of the art solver. To strengthen our MILP model, we introduce symmetry-breaking inequalities and establish a lower bound. Through extensive numerical analysis, we examine the impacts of these enhancements on the solver's performance. The introduced reinforcements result in an average CPU time reduction of 25.61 percent. Additionally, we conduct quantum emulations of realistic instances. Our numerical tests show that the weighted-BMP approach outperforms the Reverse Cuthill-McKee (RCM) algorithm, an efficient heuristic used for site ordering tasks in quantum emulation, achieving an average memory storage reduction of 24.48 percent. From an application standpoint, this study is the first to apply an exact optimization method, weighted-BMP, that considers interactions for site ordering in quantum emulation pre-processing, and shows its crucial role in cost reduction. From an algorithmic perspective, it contributes by introducing important reinforcements and lays the groundwork for future research on further enhancements, particularly on strengthening the weak linear relaxation of the MILP.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
Graph Algorithms with Neutral Atom Quantum Processors
Authors:
Constantin Dalyac,
Lucas Leclerc,
Louis Vignoli,
Mehdi Djellabi,
Wesley da Silva Coelho,
Bruno Ximenez,
Alexandre Dareau,
Davide Dreon,
VIncent E. Elfving,
Adrien Signoles,
Louis-Paul Henry,
Loïc Henriet
Abstract:
Neutral atom technology has steadily demonstrated significant theoretical and experimental advancements, positioning itself as a front-runner platform for running quantum algorithms. One unique advantage of this technology lies in the ability to reconfigure the geometry of the qubit register, from shot to shot. This unique feature makes possible the native embedding of graph-structured problems at…
▽ More
Neutral atom technology has steadily demonstrated significant theoretical and experimental advancements, positioning itself as a front-runner platform for running quantum algorithms. One unique advantage of this technology lies in the ability to reconfigure the geometry of the qubit register, from shot to shot. This unique feature makes possible the native embedding of graph-structured problems at the hardware level, with profound consequences for the resolution of complex optimization and machine learning tasks. By driving qubits, one can generate processed quantum states which retain graph complex properties. These states can then be leveraged to offer direct solutions to problems or as resources in hybrid quantum-classical schemes. In this paper, we review the advancements in quantum algorithms for graph problems running on neutral atom Quantum Processing Units (QPUs), and discuss recently introduced embedding and problem-solving techniques. In addition, we clarify ongoing advancements in hardware, with an emphasis on enhancing the scalability, controllability and computation repetition rate of neutral atom QPUs.
△ Less
Submitted 18 March, 2024;
originally announced March 2024.
-
Discrete Fourier Transform Approximations Based on the Cooley-Tukey Radix-2 Algorithm
Authors:
D. F. G. Coelho,
R. J. Cintra
Abstract:
This report elaborates on approximations for the discrete Fourier transform by means of replacing the exact Cooley-Tukey algorithm twiddle-factors by low-complexity integers, such as $0, \pm \frac{1}{2}, \pm 1$.
This report elaborates on approximations for the discrete Fourier transform by means of replacing the exact Cooley-Tukey algorithm twiddle-factors by low-complexity integers, such as $0, \pm \frac{1}{2}, \pm 1$.
△ Less
Submitted 25 February, 2024;
originally announced February 2024.
-
Mixed Integer Linear Programming Solver Using Benders Decomposition Assisted by Neutral Atom Quantum Processor
Authors:
M. Yassine Naghmouchi,
Wesley da Silva Coelho
Abstract:
This paper presents a new hybrid classical-quantum approach to solve Mixed Integer Linear Programming (MILP) using neutral atom quantum computations. We apply Benders decomposition (BD) to segment MILPs into a master problem (MP) and a subproblem (SP), where the MP is addressed using a neutral-atom device, after being transformed into a Quadratic Unconstrained Binary Optimization (QUBO) model, wit…
▽ More
This paper presents a new hybrid classical-quantum approach to solve Mixed Integer Linear Programming (MILP) using neutral atom quantum computations. We apply Benders decomposition (BD) to segment MILPs into a master problem (MP) and a subproblem (SP), where the MP is addressed using a neutral-atom device, after being transformed into a Quadratic Unconstrained Binary Optimization (QUBO) model, with an automatized procedure. Our MILP to QUBO conversion tightens the upper bounds of the involved continuous variables, positively impacting the required qubit count, and the convergence of the algorithm. To solve the QUBO, we develop a heuristic for atom register embedding and apply a variational algorithm for pulse sha**. In addition, we implement a Proof of Concept (PoC) that outperforms existing solutions. We also conduct preliminary numerical results: in a series of small MILP instances our algorithm identifies over 95 percent of feasible solutions of high quality, outperforming classical BD approaches where the MP is solved using simulated annealing. To the best of our knowledge, this work is the first to utilize a neutral atom quantum processor in develo** an automated, problem-agnostic framework for solving MILPs through BD.
△ Less
Submitted 17 June, 2024; v1 submitted 8 February, 2024;
originally announced February 2024.
-
Do Digital Jobs Need an Image Filter? Factors Contributing to Negative Attitudes
Authors:
Paul H. P. Hanel,
Gabriel Lins de Holanda Coelho,
Jennifer Haase
Abstract:
The rapid expansion of high-speed internet has led to the emergence of new digital jobs, such as digital influencers, fitness models, and adult models who share content on subscription-based social media platforms. Across two experiments involving 1,002 participants, we combined theories from both social psychology and information systems to investigate perceptions of digital jobs compared to matc…
▽ More
The rapid expansion of high-speed internet has led to the emergence of new digital jobs, such as digital influencers, fitness models, and adult models who share content on subscription-based social media platforms. Across two experiments involving 1,002 participants, we combined theories from both social psychology and information systems to investigate perceptions of digital jobs compared to matched established jobs, and predictors of attitudes toward digital jobs (e.g., symbolic threat, contact, perceived usefulness). We found that individuals in digital professions were perceived as less favorably and as less hard-working than those in matched established jobs. Digital jobs were also regarded as more threatening to societal values and less useful. The relation between job type and attitudes toward these jobs was partially mediated by contact with people working in these jobs, perceived usefulness, perception of hard-working, and symbolic threat. These effects were consistent across openness to new experiences, attitudes toward digitalization, political orientation, and age. Among the nine jobs examined, lecturers were perceived as the most favorable, while adult models were viewed least favorably. Overall, our findings demonstrate that integrating theories from social psychology and information systems can enhance our understanding of how attitudes are formed.
△ Less
Submitted 22 September, 2023;
originally announced September 2023.
-
RLAD: Reinforcement Learning from Pixels for Autonomous Driving in Urban Environments
Authors:
Daniel Coelho,
Miguel Oliveira,
Vitor Santos
Abstract:
Current approaches of Reinforcement Learning (RL) applied in urban Autonomous Driving (AD) focus on decoupling the perception training from the driving policy training. The main reason is to avoid training a convolution encoder alongside a policy network, which is known to have issues related to sample efficiency, degenerated feature representations, and catastrophic self-overfitting. However, thi…
▽ More
Current approaches of Reinforcement Learning (RL) applied in urban Autonomous Driving (AD) focus on decoupling the perception training from the driving policy training. The main reason is to avoid training a convolution encoder alongside a policy network, which is known to have issues related to sample efficiency, degenerated feature representations, and catastrophic self-overfitting. However, this paradigm can lead to representations of the environment that are not aligned with the downstream task, which may result in suboptimal performances. To address this limitation, this paper proposes RLAD, the first Reinforcement Learning from Pixels (RLfP) method applied in the urban AD domain. We propose several techniques to enhance the performance of an RLfP algorithm in this domain, including: i) an image encoder that leverages both image augmentations and Adaptive Local Signal Mixing (A-LIX) layers; ii) WayConv1D, which is a waypoint encoder that harnesses the 2D geometrical information of the waypoints using 1D convolutions; and iii) an auxiliary loss to increase the significance of the traffic lights in the latent representation of the environment. Experimental results show that RLAD significantly outperforms all state-of-the-art RLfP methods on the NoCrash benchmark. We also present an infraction analysis on the NoCrash-regular benchmark, which indicates that RLAD performs better than all other methods in terms of both collision rate and red light infractions.
△ Less
Submitted 29 May, 2023;
originally announced May 2023.
-
Synfeal: A Data-Driven Simulator for End-to-End Camera Localization
Authors:
Daniel Coelho,
Miguel Oliveira,
Paulo Dias
Abstract:
Collecting real-world data is often considered the bottleneck of Artificial Intelligence, stalling the research progress in several fields, one of which is camera localization. End-to-end camera localization methods are still outperformed by traditional methods, and we argue that the inconsistencies associated with the data collection techniques are restraining the potential of end-to-end methods.…
▽ More
Collecting real-world data is often considered the bottleneck of Artificial Intelligence, stalling the research progress in several fields, one of which is camera localization. End-to-end camera localization methods are still outperformed by traditional methods, and we argue that the inconsistencies associated with the data collection techniques are restraining the potential of end-to-end methods. Inspired by the recent data-centric paradigm, we propose a framework that synthesizes large localization datasets based on realistic 3D reconstructions of the real world. Our framework, termed Synfeal: Synthetic from Real, is an open-source, data-driven simulator that synthesizes RGB images by moving a virtual camera through a realistic 3D textured mesh, while collecting the corresponding ground-truth camera poses. The results validate that the training of camera localization algorithms on datasets generated by Synfeal leads to better results when compared to datasets generated by state-of-the-art methods. Using Synfeal, we conducted the first analysis of the relationship between the size of the dataset and the performance of camera localization algorithms. Results show that the performance significantly increases with the dataset size. Our results also suggest that when a large localization dataset with high quality is available, training from scratch leads to better performances. Synfeal is publicly available at https://github.com/DanielCoelho112/synfeal.
△ Less
Submitted 29 May, 2023;
originally announced May 2023.
-
Graph Neural Networks for the Offline Nanosatellite Task Scheduling Problem
Authors:
Bruno Machado Pacheco,
Laio Oriel Seman,
Cezar Antonio Rigo,
Eduardo Camponogara,
Eduardo Augusto Bezerra,
Leandro dos Santos Coelho
Abstract:
This study investigates how to schedule nanosatellite tasks more efficiently using Graph Neural Networks (GNNs). In the Offline Nanosatellite Task Scheduling (ONTS) problem, the goal is to find the optimal schedule for tasks to be carried out in orbit while taking into account Quality-of-Service (QoS) considerations such as priority, minimum and maximum activation events, execution time-frames, pe…
▽ More
This study investigates how to schedule nanosatellite tasks more efficiently using Graph Neural Networks (GNNs). In the Offline Nanosatellite Task Scheduling (ONTS) problem, the goal is to find the optimal schedule for tasks to be carried out in orbit while taking into account Quality-of-Service (QoS) considerations such as priority, minimum and maximum activation events, execution time-frames, periods, and execution windows, as well as constraints on the satellite's power resources and the complexity of energy harvesting and management. The ONTS problem has been approached using conventional mathematical formulations and exact methods, but their applicability to challenging cases of the problem is limited. This study examines the use of GNNs in this context, which has been effectively applied to optimization problems such as the traveling salesman, scheduling, and facility placement problems. More specifically, we investigate whether GNNs can learn the complex structure of the ONTS problem with respect to feasibility and optimality of candidate solutions. Furthermore, we evaluate using GNN-based heuristic solutions to provide better solutions (w.r.t. the objective value) to the ONTS problem and reduce the optimization cost. Our experiments show that GNNs are not only able to learn feasibility and optimality for instances of the ONTS problem, but they can generalize to harder instances than those seen during training. Furthermore, the GNN-based heuristics improved the expected objective value of the best solution found under the time limit in 45%, and reduced the expected time to find a feasible solution in 35%, when compared to the SCIP (Solving Constraint Integer Programs) solver in its off-the-shelf configuration
△ Less
Submitted 20 September, 2023; v1 submitted 23 March, 2023;
originally announced March 2023.
-
Could Nucleobases form in the ISM? A Theoretical Study in the Hoursehead nebula
Authors:
Luciene da Silva Coelho,
Edgar Mendoza,
Amancio Cesar dos Santos Friaça
Abstract:
This work presents the results of a theoretical study that analyzed the possibility of nucleobases to form in the interstellar medium, in the Horsehead nebula, which is a region considered an archetype of molecular cloud. Performing the Meudon PDR code, the reactions of the nitrogen bases formation from formamide, which is a precursor compound identified in several interstellar environment, where…
▽ More
This work presents the results of a theoretical study that analyzed the possibility of nucleobases to form in the interstellar medium, in the Horsehead nebula, which is a region considered an archetype of molecular cloud. Performing the Meudon PDR code, the reactions of the nitrogen bases formation from formamide, which is a precursor compound identified in several interstellar environment, where simulated. The model showed that at least cytosine and uracil presented significant abundances. Finally, from thermochemical and quantum calculations, a investigation was carried out on the formation reactions considered for the nucleobases and no insurmountable energy barrier which would prevent the reactions was found.
△ Less
Submitted 7 March, 2023;
originally announced March 2023.
-
Patterns of Social Vulnerability -- An Interactive Dashboard to Explore Risks to Public Health on the US County Level
Authors:
Darius Coelho,
Nikita Gupta,
Eric Papenhausen,
Klaus Mueller
Abstract:
Social vulnerability is the susceptibility of a community to be adversely impacted by natural hazards and public health emergencies, such as drought, earthquakes, flooding, virus outbreaks, and the like. Climate change is at the root of many recent natural hazards while the COVID-19 pandemic is still an active threat. Social vulnerability also refers to resilience, or the ability to recover from s…
▽ More
Social vulnerability is the susceptibility of a community to be adversely impacted by natural hazards and public health emergencies, such as drought, earthquakes, flooding, virus outbreaks, and the like. Climate change is at the root of many recent natural hazards while the COVID-19 pandemic is still an active threat. Social vulnerability also refers to resilience, or the ability to recover from such adverse events. To gauge the many aspects of social vulnerability the US Center of Disease Control (CDC) has subdivided social vulnerabilities into distinct themes, such as socioeconomic status, household composition, and others. Knowing a community's social vulnerabilities can help policymakers and responders to recognize risks to community health, prepare for possible hazards, or recover from disasters. In this paper we study social vulnerabilities on the US county level and present research that suggests that there are certain combinations, or patterns, of social vulnerability indicators into which US counties can be grouped. We then present an interactive dashboard that allows analysts to explore these patterns in various ways. We demonstrate our methodology using COVID-19 death rate as the hazard and show that the patterns we identified have high predictive capabilities of the pandemic's local impact.
△ Less
Submitted 7 January, 2023;
originally announced January 2023.
-
Quantum pricing-based column-generation framework for hard combinatorial problems
Authors:
Wesley da Silva Coelho,
Loïc Henriet,
Louis-Paul Henry
Abstract:
In this work, we present a complete hybrid classical-quantum algorithm involving a quantum sampler based on neutral atom platforms. This approach is inspired by classical column generation frameworks developed in the field of Operations Research and shows how quantum procedures can assist classical solvers in addressing hard combinatorial problems. We benchmark our method on the Minimum Vertex Col…
▽ More
In this work, we present a complete hybrid classical-quantum algorithm involving a quantum sampler based on neutral atom platforms. This approach is inspired by classical column generation frameworks developed in the field of Operations Research and shows how quantum procedures can assist classical solvers in addressing hard combinatorial problems. We benchmark our method on the Minimum Vertex Coloring problem and show that the proposed hybrid quantum-classical column generation algorithm can yield good solutions in relatively few iterations. We compare our results with state-of-the-art classical and quantum approaches.
△ Less
Submitted 4 April, 2023; v1 submitted 6 January, 2023;
originally announced January 2023.
-
Clinical Deterioration Prediction in Brazilian Hospitals Based on Artificial Neural Networks and Tree Decision Models
Authors:
Hamed Yazdanpanah,
Augusto C. M. Silva,
Murilo Guedes,
Hugo M. P. Morales,
Leandro dos S. Coelho,
Fernando G. Moro
Abstract:
Early recognition of clinical deterioration (CD) has vital importance in patients' survival from exacerbation or death. Electronic health records (EHRs) data have been widely employed in Early Warning Scores (EWS) to measure CD risk in hospitalized patients. Recently, EHRs data have been utilized in Machine Learning (ML) models to predict mortality and CD. The ML models have shown superior perform…
▽ More
Early recognition of clinical deterioration (CD) has vital importance in patients' survival from exacerbation or death. Electronic health records (EHRs) data have been widely employed in Early Warning Scores (EWS) to measure CD risk in hospitalized patients. Recently, EHRs data have been utilized in Machine Learning (ML) models to predict mortality and CD. The ML models have shown superior performance in CD prediction compared to EWS. Since EHRs data are structured and tabular, conventional ML models are generally applied to them, and less effort is put into evaluating the artificial neural network's performance on EHRs data. Thus, in this article, an extremely boosted neural network (XBNet) is used to predict CD, and its performance is compared to eXtreme Gradient Boosting (XGBoost) and random forest (RF) models. For this purpose, 103,105 samples from thirteen Brazilian hospitals are used to generate the models. Moreover, the principal component analysis (PCA) is employed to verify whether it can improve the adopted models' performance. The performance of ML models and Modified Early Warning Score (MEWS), an EWS candidate, are evaluated in CD prediction regarding the accuracy, precision, recall, F1-score, and geometric mean (G-mean) metrics in a 10-fold cross-validation approach. According to the experiments, the XGBoost model obtained the best results in predicting CD among Brazilian hospitals' data.
△ Less
Submitted 17 December, 2022;
originally announced December 2022.
-
Low-Complexity Loeffler DCT Approximations for Image and Video Coding
Authors:
D. F. G. Coelho,
R. J. Cintra,
F. M. Bayer,
S. Kulasekera,
A. Madanayake,
P. A. C. Martinez,
T. L. T. Silveira,
R. S. Oliveira,
V. S. Dimitrov
Abstract:
This paper introduced a matrix parametrization method based on the Loeffler discrete cosine transform (DCT) algorithm. As a result, a new class of eight-point DCT approximations was proposed, capable of unifying the mathematical formalism of several eight-point DCT approximations archived in the literature. Pareto-efficient DCT approximations are obtained through multicriteria optimization, where…
▽ More
This paper introduced a matrix parametrization method based on the Loeffler discrete cosine transform (DCT) algorithm. As a result, a new class of eight-point DCT approximations was proposed, capable of unifying the mathematical formalism of several eight-point DCT approximations archived in the literature. Pareto-efficient DCT approximations are obtained through multicriteria optimization, where computational complexity, proximity, and coding performance are considered. Efficient approximations and their scaled 16- and 32-point versions are embedded into image and video encoders, including a JPEG-like codec and H.264/AVC and H.265/HEVC standards. Results are compared to the unmodified standard codecs. Efficient approximations are mapped and implemented on a Xilinx VLX240T FPGA and evaluated for area, speed, and power consumption.
△ Less
Submitted 28 July, 2022;
originally announced July 2022.
-
Efficient protocol for solving combinatorial graph problems on neutral-atom quantum processors
Authors:
Wesley da Silva Coelho,
Mauro D'Arcangelo,
Louis-Paul Henry
Abstract:
On neutral atom platforms, preparing specific quantum states is usually achieved by pulse sha**, i.e., by optimizing the time-dependence of the Hamiltonian related to the system. This process can be extremely costly, as it requires sampling of the final state in the quantum processor many times. Hence, determining a good pulse, as well as a good embedding, to solve specific combinatorial graph p…
▽ More
On neutral atom platforms, preparing specific quantum states is usually achieved by pulse sha**, i.e., by optimizing the time-dependence of the Hamiltonian related to the system. This process can be extremely costly, as it requires sampling of the final state in the quantum processor many times. Hence, determining a good pulse, as well as a good embedding, to solve specific combinatorial graph problems is one of the most important bottlenecks of the analog approach. In this work, we propose a novel protocol for solving hard combinatorial graph problems that combines variational analog quantum computing and machine learning. Our numerical simulations show that the proposed protocol can reduce dramatically the number of iterations to be run on the quantum device. Finally, we assess the quality of the proposed approach by estimating the related Q-score, a recently proposed metric aimed at benchmarking QPUs.
△ Less
Submitted 2 August, 2022; v1 submitted 26 July, 2022;
originally announced July 2022.
-
Towards a Low-SWaP 1024-beam Digital Array: A 32-beam Sub-system at 5.8 GHz
Authors:
Arjuna Madanayake,
Viduneth Ariyarathna,
Suresh Madishetty,
Sravan Pulipati,
R. J. Cintra,
Diego Coelho,
Raíza Oliveira,
Fábio M. Bayer,
Leonid Belostotski,
Soumyajit Mandal,
Theodore S. Rappaport
Abstract:
Millimeter wave communications require multibeam beamforming in order to utilize wireless channels that suffer from obstructions, path loss, and multi-path effects. Digital multibeam beamforming has maximum degrees of freedom compared to analog phased arrays. However, circuit complexity and power consumption are important constraints for digital multibeam systems. A low-complexity digital computin…
▽ More
Millimeter wave communications require multibeam beamforming in order to utilize wireless channels that suffer from obstructions, path loss, and multi-path effects. Digital multibeam beamforming has maximum degrees of freedom compared to analog phased arrays. However, circuit complexity and power consumption are important constraints for digital multibeam systems. A low-complexity digital computing architecture is proposed for a multiplication-free 32-point linear transform that approximates multiple simultaneous RF beams similar to a discrete Fourier transform (DFT). Arithmetic complexity due to multiplication is reduced from the FFT complexity of $\mathcal{O}(N\: \log N)$ for DFT realizations, down to zero, thus yielding a 46% and 55% reduction in chip area and dynamic power consumption, respectively, for the $N=32$ case considered. The paper describes the proposed 32-point DFT approximation targeting a 1024-beams using a 2D array, and shows the multiplierless approximation and its map** to a 32-beam sub-system consisting of 5.8 GHz antennas that can be used for generating 1024 digital beams without multiplications. Real-time beam computation is achieved using a Xilinx FPGA at 120 MHz bandwidth per beam. Theoretical beam performance is compared with measured RF patterns from both a fixed-point FFT as well as the proposed multiplier-free algorithm and are in good agreement.
△ Less
Submitted 29 May, 2024; v1 submitted 18 July, 2022;
originally announced July 2022.
-
Fast Radix-32 Approximate DFTs for 1024-Beam Digital RF Beamforming
Authors:
A. Madanayake,
R. J. Cintra,
N. Akram,
V. Ariyarathna,
S. Mandal,
V. A. Coutinho,
F. M. Bayer,
D. Coelho,
T. S. Rappaport
Abstract:
The discrete Fourier transform (DFT) is widely employed for multi-beam digital beamforming. The DFT can be efficiently implemented through the use of fast Fourier transform (FFT) algorithms, thus reducing chip area, power consumption, processing time, and consumption of other hardware resources. This paper proposes three new hybrid DFT 1024-point DFT approximations and their respective fast algori…
▽ More
The discrete Fourier transform (DFT) is widely employed for multi-beam digital beamforming. The DFT can be efficiently implemented through the use of fast Fourier transform (FFT) algorithms, thus reducing chip area, power consumption, processing time, and consumption of other hardware resources. This paper proposes three new hybrid DFT 1024-point DFT approximations and their respective fast algorithms. These approximate DFT (ADFT) algorithms have significantly reduced circuit complexity and power consumption compared to traditional FFT approaches while trading off a subtle loss in computational precision which is acceptable for digital beamforming applications in RF antenna implementations. ADFT algorithms have not been introduced for beamforming beyond $N = 32$, but this paper anticipates the need for massively large adaptive arrays for future 5G and 6G systems. Digital CMOS circuit designs for the ADFTs show the resulting improvements in both circuit complexity and power consumption metrics. Simulation results show similar or lower critical path delay with up to 48.5% lower chip area compared to a standard Cooley-Tukey FFT. The time-area and dynamic power metrics are reduced up to 66.0%. The 1024-point ADFT beamformers produce signal-to-noise ratio (SNR) gains between 29.2--30.1 dB, which is a loss of $\le$ 0.9 dB SNR gain compared to exact 1024-point DFT beamformers (worst case) realizable at using an FFT.
△ Less
Submitted 12 July, 2022;
originally announced July 2022.
-
A Class of Low-complexity DCT-like Transforms for Image and Video Coding
Authors:
T. L. T. da Silveira,
D. R. Canterle,
D. F. G. Coelho,
V. A. Coutinho,
F. M. Bayer,
R. J. Cintra
Abstract:
The discrete cosine transform (DCT) is a relevant tool in signal processing applications, mainly known for its good decorrelation properties. Current image and video coding standards -- such as JPEG and HEVC -- adopt the DCT as a fundamental building block for compression. Recent works have introduced low-complexity approximations for the DCT, which become paramount in applications demanding real-…
▽ More
The discrete cosine transform (DCT) is a relevant tool in signal processing applications, mainly known for its good decorrelation properties. Current image and video coding standards -- such as JPEG and HEVC -- adopt the DCT as a fundamental building block for compression. Recent works have introduced low-complexity approximations for the DCT, which become paramount in applications demanding real-time computation and low-power consumption. The design of DCT approximations involves a trade-off between computational complexity and performance. This paper introduces a new multiparametric transform class encompassing the round-off DCT (RDCT) and the modified RDCT (MRDCT), two relevant multiplierless 8-point approximate DCTs. The associated fast algorithm is provided. Four novel orthogonal low-complexity 8-point DCT approximations are obtained by solving a multicriteria optimization problem. The optimal 8-point transforms are scaled to lengths 16 and 32 while kee** the arithmetic complexity low. The proposed methods are assessed by proximity and coding measures with respect to the exact DCT. Image and video coding experiments hardware realization are performed. The novel transforms perform close to or outperform the current state-of-the-art DCT approximations.
△ Less
Submitted 8 December, 2022; v1 submitted 31 May, 2022;
originally announced June 2022.
-
UHI in Fortaleza and trends on screen-level air temperature and precipitation
Authors:
A. de A. Coelho
Abstract:
There is a consensus that Urban Heat Island phenomenon - UHI occurs in every large city. This effect is characterized by higher air temperatures in cities than in the neighboring countryside at night. However, to date, there has been no systematic study on the Fortaleza case, the Brazil's 5th largest city. By the comparison between screen-level air temperature measured by two automatic weather sta…
▽ More
There is a consensus that Urban Heat Island phenomenon - UHI occurs in every large city. This effect is characterized by higher air temperatures in cities than in the neighboring countryside at night. However, to date, there has been no systematic study on the Fortaleza case, the Brazil's 5th largest city. By the comparison between screen-level air temperature measured by two automatic weather stations, one located in the city and the other in the neighboring region, this work shows the occurrence of the UHI in Fortaleza, even during the rainy season. In an attempt to find some effect of the UHI on precipitation (and vice-versa), historical series of air temperature and precipitation were analyzed from 1966 to 2020. Besides the considerable increase in air temperature over the years, a slight downward trend was observed in UHI and even more in accumulated precipitation between the hours of 15:00 and 21:00 (local time). However, one believes that these trends may be related to climate change at large scale rather than an urban scale.
△ Less
Submitted 30 April, 2022;
originally announced May 2022.
-
Experimental and theoretical studies of the gas-phase reactions of O($^1$D) with H$_2$O and D$_2$O at low temperature
Authors:
Kevin Hickson,
Somnath Bhowmick,
Yury Suleimanov,
João Brandão,
Daniela Coelho
Abstract:
Here we report the results of an experimental and theoretical study of the gas-phase reactions between O($^1$D) and H$_2$O and O($^1$D) and D$_2$O at room temperature and below. On the experimental side, the kinetics of these reactions have been investigated over the 50-127 K range using a continuous flow Laval nozzle apparatus, coupled with pulsed laser photolysis and pulsed laser induced fluores…
▽ More
Here we report the results of an experimental and theoretical study of the gas-phase reactions between O($^1$D) and H$_2$O and O($^1$D) and D$_2$O at room temperature and below. On the experimental side, the kinetics of these reactions have been investigated over the 50-127 K range using a continuous flow Laval nozzle apparatus, coupled with pulsed laser photolysis and pulsed laser induced fluorescence for the production and detection of O($^1$D) atoms respectively. Experiments were also performed at 296 K in the absence of a Laval nozzle. On the theoretical side, the existing full-dimensional ground X$^1$A potential energy surface for the H$_2$O$_2$ system involved in this process has been reinvestigated and enhanced to provide a better description of the barrierless H-atom abstraction pathway. Based on this enhanced potential energy surface, quasiclassical trajectory calculations and ring polymer molecular dynamics simulations have been performed to obtain low temperature rate constants. The measured and calculated rate constants display similar behaviour above 100 K, showing little or no variation as a function of temperature. Below 100 K, the experimental rate constants increase dramatically, in contrast to the essentially temperature independent theoretical values. The possible origins of the divergence between experiment and theory at low temperatures are discussed.
△ Less
Submitted 10 December, 2021;
originally announced December 2021.
-
Low-complexity Scaling Methods for DCT-II Approximations
Authors:
D. F. G. Coelho,
R. J. Cintra,
A. Madanayake,
S. Perera
Abstract:
This paper introduces a collection of scaling methods for generating $2N$-point DCT-II approximations based on $N$-point low-complexity transformations. Such scaling is based on the Hou recursive matrix factorization of the exact $2N$-point DCT-II matrix. Encompassing the widely employed Jridi-Alfalou-Meher scaling method, the proposed techniques are shown to produce DCT-II approximations that out…
▽ More
This paper introduces a collection of scaling methods for generating $2N$-point DCT-II approximations based on $N$-point low-complexity transformations. Such scaling is based on the Hou recursive matrix factorization of the exact $2N$-point DCT-II matrix. Encompassing the widely employed Jridi-Alfalou-Meher scaling method, the proposed techniques are shown to produce DCT-II approximations that outperform the transforms resulting from the JAM scaling method according to total error energy and mean squared error. Orthogonality conditions are derived and an extensive error analysis based on statistical simulation demonstrates the good performance of the introduced scaling methods. A hardware implementation is also provided demonstrating the competitiveness of the proposed methods when compared to the JAM scaling method.
△ Less
Submitted 11 February, 2024; v1 submitted 4 August, 2021;
originally announced August 2021.
-
Possible routes for the Formation of Prebiotic Molecules in the Horsehead Nebula
Authors:
Luciene da Silva Coelho,
Amâncio César dos Santos Friaça,
Edgar Mendoza
Abstract:
This article presents the results of a study concerning interstellar molecules which are useful for the bookkee** of the organic content of the universe and for providing a glimpse into prebiotic conditions on Earth and in other environments in the universe. We explored production channels for astrobiological relevant nitrogen-bearing cyclic molecules (N-heterocycles), e. g. pyrrole and pyridine…
▽ More
This article presents the results of a study concerning interstellar molecules which are useful for the bookkee** of the organic content of the universe and for providing a glimpse into prebiotic conditions on Earth and in other environments in the universe. We explored production channels for astrobiological relevant nitrogen-bearing cyclic molecules (N-heterocycles), e. g. pyrrole and pyridine. The present simulations demonstrate how the exploration of a few possible routes of production of N-heterocycles resulted in significant abundances for these species. One particularly efficient class of channels for the production of N-heterocycles incorporates polycyclic aromatic hydrocarbons (PAHs) as catalysts. Thereby, an exploration of a variety of production paths should reveal more species to be target of astrophysical observations.
△ Less
Submitted 8 June, 2021;
originally announced June 2021.
-
A Systematic Comparison of Forecasting for Gross Domestic Product in an Emergent Economy
Authors:
Kleyton da Costa,
Felipe Leite Coelho da Silva,
Josiane da Silva Cordeiro Coelho,
André de Melo Modenesi
Abstract:
Gross domestic product (GDP) is an important economic indicator that aggregates useful information to assist economic agents and policymakers in their decision-making process. In this context, GDP forecasting becomes a powerful decision optimization tool in several areas. In order to contribute in this direction, we investigated the efficiency of classical time series models, the state-space model…
▽ More
Gross domestic product (GDP) is an important economic indicator that aggregates useful information to assist economic agents and policymakers in their decision-making process. In this context, GDP forecasting becomes a powerful decision optimization tool in several areas. In order to contribute in this direction, we investigated the efficiency of classical time series models, the state-space models, and the neural network models, applied to Brazilian gross domestic product. The models used were: a Seasonal Autoregressive Integrated Moving Average (SARIMA) and a Holt-Winters method, which are classical time series models; the dynamic linear model, a state-space model; and neural network autoregression and the multilayer perceptron, artificial neural network models. Based on statistical metrics of model comparison, the multilayer perceptron presented the best in-sample and out-sample forecasting performance for the analyzed period, also incorporating the growth rate structure significantly.
△ Less
Submitted 3 March, 2022; v1 submitted 25 October, 2020;
originally announced October 2020.
-
Low-complexity Architecture for AR(1) Inference
Authors:
A. Borges Jr.,
R. J. Cintra,
D. F. G. Coelho,
V. S. Dimitrov
Abstract:
In this Letter, we propose a low-complexity estimator for the correlation coefficient based on the signed $\operatorname{AR}(1)$ process. The introduced approximation is suitable for implementation in low-power hardware architectures. Monte Carlo simulations reveal that the proposed estimator performs comparably to the competing methods in literature with maximum error in order of $10^{-2}$. Howev…
▽ More
In this Letter, we propose a low-complexity estimator for the correlation coefficient based on the signed $\operatorname{AR}(1)$ process. The introduced approximation is suitable for implementation in low-power hardware architectures. Monte Carlo simulations reveal that the proposed estimator performs comparably to the competing methods in literature with maximum error in order of $10^{-2}$. However, the hardware implementation of the introduced method presents considerable advantages in several relevant metrics, offering more than 95% reduction in dynamic power and doubling the maximum operating frequency when compared to the reference method.
△ Less
Submitted 21 August, 2020;
originally announced August 2020.
-
Stripe patterns orientation resulting from nonuniform forcings and other competitive effects in the Swift-Hohenberg dynamics
Authors:
Daniel L. Coelho,
Eduardo Vitral,
José Pontes,
Norberto Mangiavacchi
Abstract:
Spatio-temporal pattern formation in complex systems presents rich nonlinear dynamics which leads to the emergence of periodic nonequilibrium structures. One of the most prominent equations for the theoretical and numerical study of the evolution of these textures is the Swift-Hohenberg (SH) equation, which presents a bifurcation parameter (forcing) that controls the dynamics by changing the energ…
▽ More
Spatio-temporal pattern formation in complex systems presents rich nonlinear dynamics which leads to the emergence of periodic nonequilibrium structures. One of the most prominent equations for the theoretical and numerical study of the evolution of these textures is the Swift-Hohenberg (SH) equation, which presents a bifurcation parameter (forcing) that controls the dynamics by changing the energy landscape of the system, and has been largely employed in phase-field models. Though a large part of the literature on pattern formation addresses uniformly forced systems, nonuniform forcings are also observed in several natural systems, for instance, in developmental biology and in soft matter applications. In these cases, an orientation effect due to forcing gradients is a new factor playing a role in the development of patterns, particularly in the class of stripe patterns, which we investigate through the nonuniformly forced SH dynamics. The present work addresses amplitude instability of stripe textures induced by forcing gradients, and the competition between the orientation effect of the gradient and other bulk, boundary, and geometric effects taking part in the selection of the emerging patterns. A weakly nonlinear analysis suggests that stripes are stable with respect to small amplitude perturbations when aligned with the gradient, and become unstable to such perturbations when when aligned perpendicularly to the gradient. This analysis is vastly complemented by a numerical work that accounts for other effects, confirming that forcing gradients drive stripe alignment, or even reorient them from preexisting conditions. However, we observe that the orientation effect does not always prevail in the face of competing effects, whose hierarchy is suggested to depend on the magnitude of the forcing gradient.
△ Less
Submitted 28 August, 2021; v1 submitted 1 August, 2020;
originally announced August 2020.
-
Numerical scheme for solving the nonuniformly forced cubic and quintic Swift-Hohenberg equations strictly respecting the Lyapunov functional
Authors:
D. L. Coelho,
E. Vitral,
J. Pontes,
N. Mangiavacchi
Abstract:
Computational modeling of pattern formation in nonequilibrium systems is a fundamental tool for studying complex phenomena in biology, chemistry, materials science and engineering. The pursuit for theoretical descriptions of some among those physical problems led to the Swift-Hohenberg equation (SH3) which describes pattern selection in the vicinity of instabilities. A finite differences scheme, k…
▽ More
Computational modeling of pattern formation in nonequilibrium systems is a fundamental tool for studying complex phenomena in biology, chemistry, materials science and engineering. The pursuit for theoretical descriptions of some among those physical problems led to the Swift-Hohenberg equation (SH3) which describes pattern selection in the vicinity of instabilities. A finite differences scheme, known as Stabilizing Correction (Christov & Pontes; 2001 DOI: 10.1016/S0895-7177(01)00151-0), developed to integrate the cubic Swift-Hohenberg equation in two dimensions, is reviewed and extended in the present paper. The original scheme features Generalized Dirichlet boundary conditions (GDBC), forcings with a spatial ramp of the control parameter, strict implementation of the associated Lyapunov functional, and second order representation of all derivatives. We now extend these results by including periodic boundary conditions (PBC), forcings with gaussian distributions of the control parameter and the quintic Swift-Hohenberg (SH35) model. The present scheme also features a strict implementation of the functional for all test cases. A code verification was accomplished, showing unconditional stability, along with second order accuracy in both time and space. Test cases confirmed the monotonic decay of the Lyapunov functional and all numerical experiments exhibit the main physical features: highly nonlinear behaviour, wavelength filter and competition between bulk and boundary effects.
△ Less
Submitted 4 February, 2022; v1 submitted 29 July, 2020;
originally announced July 2020.
-
Short-term forecasting COVID-19 cumulative confirmed cases: Perspectives for Brazil
Authors:
Matheus Henrique Dal Molin Ribeiro,
Ramon Gomes da Silva,
Viviana Cocco Mariani,
Leandro dos Santos Coelho
Abstract:
The new Coronavirus (COVID-19) is an emerging disease responsible for infecting millions of people since the first notification until nowadays. Develo** efficient short-term forecasting models allow knowing the number of future cases. In this context, it is possible to develop strategic planning in the public health system to avoid deaths. In this paper, autoregressive integrated moving average…
▽ More
The new Coronavirus (COVID-19) is an emerging disease responsible for infecting millions of people since the first notification until nowadays. Develo** efficient short-term forecasting models allow knowing the number of future cases. In this context, it is possible to develop strategic planning in the public health system to avoid deaths. In this paper, autoregressive integrated moving average (ARIMA), cubist (CUBIST), random forest (RF), ridge regression (RIDGE), support vector regression (SVR), and stacking-ensemble learning are evaluated in the task of time series forecasting with one, three, and six-days ahead the COVID-19 cumulative confirmed cases in ten Brazilian states with a high daily incidence. In the stacking learning approach, the cubist, RF, RIDGE, and SVR models are adopted as base-learners and Gaussian process (GP) as meta-learner. The models' effectiveness is evaluated based on the improvement index, mean absolute error, and symmetric mean absolute percentage error criteria. In most of the cases, the SVR and stacking ensemble learning reach a better performance regarding adopted criteria than compared models. In general, the developed models can generate accurate forecasting, achieving errors in a range of 0.87% - 3.51%, 1.02% - 5.63%, and 0.95% - 6.90% in one, three, and six-days-ahead, respectively. The ranking of models in all scenarios is SVR, stacking ensemble learning, ARIMA, CUBIST, RIDGE, and RF models. The use of evaluated models is recommended to forecasting and monitor the ongoing growth of COVID-19 cases, once these models can assist the managers in the decision-making support systems.
△ Less
Submitted 21 July, 2020;
originally announced July 2020.
-
Forecasting Brazilian and American COVID-19 cases based on artificial intelligence coupled with climatic exogenous variables
Authors:
Ramon Gomes da Silva,
Matheus Henrique Dal Molin Ribeiro,
Viviana Cocco Mariani,
Leandro dos Santos Coelho
Abstract:
The novel coronavirus disease (COVID-19) is a public health problem once according to the World Health Organization up to June 10th, 2020, more than 7.1 million people were infected, and more than 400 thousand have died worldwide. In the current scenario, the Brazil and the United States of America present a high daily incidence of new cases and deaths. It is important to forecast the number of ne…
▽ More
The novel coronavirus disease (COVID-19) is a public health problem once according to the World Health Organization up to June 10th, 2020, more than 7.1 million people were infected, and more than 400 thousand have died worldwide. In the current scenario, the Brazil and the United States of America present a high daily incidence of new cases and deaths. It is important to forecast the number of new cases in a time window of one week, once this can help the public health system develo** strategic planning to deals with the COVID-19. In this paper, Bayesian regression neural network, cubist regression, k-nearest neighbors, quantile random forest, and support vector regression, are used stand-alone, and coupled with the recent pre-processing variational mode decomposition (VMD) employed to decompose the time series into several intrinsic mode functions. All Artificial Intelligence techniques are evaluated in the task of time-series forecasting with one, three, and six-days-ahead the cumulative COVID-19 cases in five Brazilian and American states up to April 28th, 2020. Previous cumulative COVID-19 cases and exogenous variables as daily temperature and precipitation were employed as inputs for all forecasting models. The hybridization of VMD outperformed single forecasting models regarding the accuracy, specifically when the horizon is six-days-ahead, achieving better accuracy in 70% of the cases. Regarding the exogenous variables, the importance ranking as predictor variables is past cases, temperature, and precipitation. Due to the efficiency of evaluated models to forecasting cumulative COVID-19 cases up to six-days-ahead, the adopted models can be recommended as a promising models for forecasting and be used to assist in the development of public policies to mitigate the effects of COVID-19 outbreak.
△ Less
Submitted 21 July, 2020;
originally announced July 2020.
-
Short-term forecasting of Amazon rainforest fires based on ensemble decomposition model
Authors:
Ramon Gomes da Silva,
Matheus Henrique Dal Molin Ribeiro,
Viviana Cocco Mariani,
Leandro dos Santos Coelho
Abstract:
Accurate forecasting is important for decision-makers. Recently, the Amazon rainforest is reaching record levels of the number of fires, a situation that concerns both climate and public health problems. Obtaining the desired forecasting accuracy becomes difficult and challenging. In this paper were developed a novel heterogeneous decomposition-ensemble model by using Seasonal and Trend decomposit…
▽ More
Accurate forecasting is important for decision-makers. Recently, the Amazon rainforest is reaching record levels of the number of fires, a situation that concerns both climate and public health problems. Obtaining the desired forecasting accuracy becomes difficult and challenging. In this paper were developed a novel heterogeneous decomposition-ensemble model by using Seasonal and Trend decomposition based on Loess in combination with algorithms for short-term load forecasting multi-month-ahead, to explore temporal patterns of Amazon rainforest fires in Brazil. The results demonstrate the proposed decomposition-ensemble models can provide more accurate forecasting evaluated by performance measures. Diebold-Mariano statistical test showed the proposed models are better than other compared models, but it is statistically equal to one of them.
△ Less
Submitted 23 July, 2020; v1 submitted 15 July, 2020;
originally announced July 2020.
-
Multi-Stage Transfer Learning with an Application to Selection Process
Authors:
Andre Mendes,
Julian Togelius,
Leandro dos Santos Coelho
Abstract:
In multi-stage processes, decisions happen in an ordered sequence of stages. Many of them have the structure of dual funnel problem: as the sample size decreases from one stage to the other, the information increases. A related example is a selection process, where applicants apply for a position, prize, or grant. In each stage, more applicants are evaluated and filtered out, and from the remainin…
▽ More
In multi-stage processes, decisions happen in an ordered sequence of stages. Many of them have the structure of dual funnel problem: as the sample size decreases from one stage to the other, the information increases. A related example is a selection process, where applicants apply for a position, prize, or grant. In each stage, more applicants are evaluated and filtered out, and from the remaining ones, more information is collected. In the last stage, decision-makers use all available information to make their final decision. To train a classifier for each stage becomes impracticable as they can underfit due to the low dimensionality in early stages or overfit due to the small sample size in the latter stages. In this work, we proposed a \textit{Multi-StaGe Transfer Learning} (MSGTL) approach that uses knowledge from simple classifiers trained in early stages to improve the performance of classifiers in the latter stages. By transferring weights from simpler neural networks trained in larger datasets, we able to fine-tune more complex neural networks in the latter stages without overfitting due to the small sample size. We show that it is possible to control the trade-off between conserving knowledge and fine-tuning using a simple probabilistic map. Experiments using real-world data demonstrate the efficacy of our approach as it outperforms other state-of-the-art methods for transfer learning and regularization.
△ Less
Submitted 1 June, 2020;
originally announced June 2020.
-
MDE-ITMF and DEwI: Two New Multiple Solution Algorithms for Multimodal Optimization
Authors:
Vinícius Magno de Oliveira Coelho,
Gustavo Barbosa Libotte,
Francisco Duarte Moura Neto,
Gustavo Mendes Platt,
Fran Sérgio Lobato
Abstract:
Mathematical formulations of real world optimization studies frequently present characteristics such as non-linearity, discontinuity and high complexity. This class of problems may also exhibit a high number of global minimum/maximum points, especially for optimization problems arising from nonlinear algebraic systems (where null minima correspond to the solutions of the original algebraic system)…
▽ More
Mathematical formulations of real world optimization studies frequently present characteristics such as non-linearity, discontinuity and high complexity. This class of problems may also exhibit a high number of global minimum/maximum points, especially for optimization problems arising from nonlinear algebraic systems (where null minima correspond to the solutions of the original algebraic system). Due to the multimodal nature of these functions, multipopulation methods have been employed in order to obtain the highest number of points of global minimum/maximum. In this work, two new approaches were analyzed, employing an iterative penalization technique and a multipopulation procedure---together with the Differential Evolution algorithm---devoted to obtain the full set of solutions for multimodal optimization problems. The first method proposed is the Multipopulation Differential Evolution with iterative technique of modification of the objective function, MDE-ITMF, and the second method proposed is the Differential Evolution with Initialization, DEwI. In this second proposal, the MDE-ITMF method is used as an initializer of the initial populations and from a given moment the Differential Evolution is used to solve the problem at hand. In both approaches, subpopulations evolve simultaneously throughout the iterative process. MDE-ITMF and DEwI methods were applied in a set of ten multimodal benchmark functions. Based on the results obtained, we can conclude that MDE-ITMF and DEwI are suitable and promising tools for multimodal optimization.
△ Less
Submitted 29 May, 2020;
originally announced June 2020.
-
Unified Multi-Domain Learning and Data Imputation using Adversarial Autoencoder
Authors:
Andre Mendes,
Julian Togelius,
Leandro dos Santos Coelho
Abstract:
We present a novel framework that can combine multi-domain learning (MDL), data imputation (DI) and multi-task learning (MTL) to improve performance for classification and regression tasks in different domains. The core of our method is an adversarial autoencoder that can: (1) learn to produce domain-invariant embeddings to reduce the difference between domains; (2) learn the data distribution for…
▽ More
We present a novel framework that can combine multi-domain learning (MDL), data imputation (DI) and multi-task learning (MTL) to improve performance for classification and regression tasks in different domains. The core of our method is an adversarial autoencoder that can: (1) learn to produce domain-invariant embeddings to reduce the difference between domains; (2) learn the data distribution for each domain and correctly perform data imputation on missing data. For MDL, we use the Maximum Mean Discrepancy (MMD) measure to align the domain distributions. For DI, we use an adversarial approach where a generator fill in information for missing data and a discriminator tries to distinguish between real and imputed values. Finally, using the universal feature representation in the embeddings, we train a classifier using MTL that given input from any domain, can predict labels for all domains. We demonstrate the superior performance of our approach compared to other state-of-art methods in three distinct settings, DG-DI in image recognition with unstructured data, MTL-DI in grade estimation with structured data and MDMTL-DI in a selection process using mixed data.
△ Less
Submitted 15 March, 2020;
originally announced March 2020.
-
Adversarial Encoder-Multi-Task-Decoder for Multi-Stage Processes
Authors:
Andre Mendes,
Julian Togelius,
Leandro dos Santos Coelho
Abstract:
In multi-stage processes, decisions occur in an ordered sequence of stages. Early stages usually have more observations with general information (easier/cheaper to collect), while later stages have fewer observations but more specific data. This situation can be represented by a dual funnel structure, in which the sample size decreases from one stage to the other while the information increases. T…
▽ More
In multi-stage processes, decisions occur in an ordered sequence of stages. Early stages usually have more observations with general information (easier/cheaper to collect), while later stages have fewer observations but more specific data. This situation can be represented by a dual funnel structure, in which the sample size decreases from one stage to the other while the information increases. Training classifiers in this scenario is challenging since information in the early stages may not contain distinct patterns to learn (underfitting). In contrast, the small sample size in later stages can cause overfitting. We address both cases by introducing a framework that combines adversarial autoencoders (AAE), multi-task learning (MTL), and multi-label semi-supervised learning (MLSSL). We improve the decoder of the AAE with an MTL component so it can jointly reconstruct the original input and use feature nets to predict the features for the next stages. We also introduce a sequence constraint in the output of an MLSSL classifier to guarantee the sequential pattern in the predictions. Using real-world data from different domains (selection process, medical diagnosis), we show that our approach outperforms other state-of-the-art methods.
△ Less
Submitted 15 March, 2020;
originally announced March 2020.
-
Document classification using a Bi-LSTM to unclog Brazil's supreme court
Authors:
Fabricio Ataides Braz,
Nilton Correia da Silva,
Teofilo Emidio de Campos,
Felipe Borges S. Chaves,
Marcelo H. S. Ferreira,
Pedro Henrique Inazawa,
Victor H. D. Coelho,
Bernardo Pablo Sukiennik,
Ana Paula Goncalves Soares de Almeida,
Flavio Barros Vidal,
Davi Alves Bezerra,
Davi B. Gusmao,
Gabriel G. Ziegler,
Ricardo V. C. Fernandes,
Roberta Zumblick,
Fabiano Hartmann Peixoto
Abstract:
The Brazilian court system is currently the most clogged up judiciary system in the world. Thousands of lawsuit cases reach the supreme court every day. These cases need to be analyzed in order to be associated to relevant tags and allocated to the right team. Most of the cases reach the court as raster scanned documents with widely variable levels of quality. One of the first steps for the analys…
▽ More
The Brazilian court system is currently the most clogged up judiciary system in the world. Thousands of lawsuit cases reach the supreme court every day. These cases need to be analyzed in order to be associated to relevant tags and allocated to the right team. Most of the cases reach the court as raster scanned documents with widely variable levels of quality. One of the first steps for the analysis is to classify these documents. In this paper we present a Bidirectional Long Short-Term Memory network (Bi-LSTM) to classify these pieces of legal document.
△ Less
Submitted 27 November, 2018;
originally announced November 2018.
-
Fast Matrix Inversion and Determinant Computation for Polarimetric Synthetic Aperture Radar
Authors:
D. F. G. Coelho,
R. J. Cintra,
A. C. Frery,
V. S. Dimitrov
Abstract:
This paper introduces a fast algorithm for simultaneous inversion and determinant computation of small sized matrices in the context of fully Polarimetric Synthetic Aperture Radar (PolSAR) image processing and analysis. The proposed fast algorithm is based on the computation of the adjoint matrix and the symmetry of the input matrix. The algorithm is implemented in a general purpose graphical proc…
▽ More
This paper introduces a fast algorithm for simultaneous inversion and determinant computation of small sized matrices in the context of fully Polarimetric Synthetic Aperture Radar (PolSAR) image processing and analysis. The proposed fast algorithm is based on the computation of the adjoint matrix and the symmetry of the input matrix. The algorithm is implemented in a general purpose graphical processing unit (GPGPU) and compared to the usual approach based on Cholesky factorization. The assessment with simulated observations and data from an actual PolSAR sensor show a speedup factor of about two when compared to the usual Cholesky factorization. Moreover, the expressions provided here can be implemented in any platform.
△ Less
Submitted 21 July, 2018;
originally announced July 2018.
-
Efficient Computation of the 8-point DCT via Summation by Parts
Authors:
D. F. G. Coelho,
R. J. Cintra,
V. S. Dimitrov
Abstract:
This paper introduces a new fast algorithm for the 8-point discrete cosine transform (DCT) based on the summation-by-parts formula. The proposed method converts the DCT matrix into an alternative transformation matrix that can be decomposed into sparse matrices of low multiplicative complexity. The method is capable of scaled and exact DCT computation and its associated fast algorithm achieves the…
▽ More
This paper introduces a new fast algorithm for the 8-point discrete cosine transform (DCT) based on the summation-by-parts formula. The proposed method converts the DCT matrix into an alternative transformation matrix that can be decomposed into sparse matrices of low multiplicative complexity. The method is capable of scaled and exact DCT computation and its associated fast algorithm achieves the theoretical minimal multiplicative complexity for the 8-point DCT. Depending on the nature of the input signal simplifications can be introduced and the overall complexity of the proposed algorithm can be further reduced. Several types of input signal are analyzed: arbitrary, null mean, accumulated, and null mean/accumulated signal. The proposed tool has potential application in harmonic detection, image enhancement, and feature extraction, where input signal DC level is discarded and/or the signal is required to be integrated.
△ Less
Submitted 28 March, 2018; v1 submitted 17 January, 2018;
originally announced January 2018.
-
A Sequence-Based Mesh Classifier for the Prediction of Protein-Protein Interactions
Authors:
Edgar D. Coelho,
Igor N. Cruz,
André Santiago,
José Luis Oliveira,
António Dourado,
Joel P. Arrais
Abstract:
The worldwide surge of multiresistant microbial strains has propelled the search for alternative treatment options. The study of Protein-Protein Interactions (PPIs) has been a cornerstone in the clarification of complex physiological and pathogenic processes, thus being a priority for the identification of vital components and mechanisms in pathogens. Despite the advances of laboratorial technique…
▽ More
The worldwide surge of multiresistant microbial strains has propelled the search for alternative treatment options. The study of Protein-Protein Interactions (PPIs) has been a cornerstone in the clarification of complex physiological and pathogenic processes, thus being a priority for the identification of vital components and mechanisms in pathogens. Despite the advances of laboratorial techniques, computational models allow the screening of protein interactions between entire proteomes in a fast and inexpensive manner. Here, we present a supervised machine learning model for the prediction of PPIs based on the protein sequence. We cluster amino acids regarding their physicochemical properties, and use the discrete cosine transform to represent protein sequences. A mesh of classifiers was constructed to create hyper-specialised classifiers dedicated to the most relevant pairs of molecular function annotations from Gene Ontology. Based on an exhaustive evaluation that includes datasets with different configurations, cross-validation and out-of-sampling validation, the obtained results outscore the state-of-the-art for sequence-based methods. For the final mesh model using SVM with RBF, a consistent average AUC of 0.84 was attained.
△ Less
Submitted 12 November, 2017;
originally announced November 2017.
-
On the Computation of Neumann Series
Authors:
Vassil Dimitrov,
Diego Coelho
Abstract:
This paper proposes new factorizations for computing the Neumann series. The factorizations are based on fast algorithms for small prime sizes series and the splitting of large sizes into several smaller ones. We propose a different basis for factorizations other than the well-known binary and ternary basis. We show that is possible to reduce the overall complexity for the usual binary decompositi…
▽ More
This paper proposes new factorizations for computing the Neumann series. The factorizations are based on fast algorithms for small prime sizes series and the splitting of large sizes into several smaller ones. We propose a different basis for factorizations other than the well-known binary and ternary basis. We show that is possible to reduce the overall complexity for the usual binary decomposition from 2log2(N)-2 multiplications to around 1.72log2(N)-2 using a basis of size five. Merging different basis we can demonstrate that we can build fast algorithms for particular sizes. We also show the asymptotic case where one can reduce the number of multiplications to around 1.70log2(N)-2. Simulations are performed for applications in the context of wireless communications and image rendering, where is necessary perform large sized matrices inversion.
△ Less
Submitted 18 July, 2017;
originally announced July 2017.
-
Magnetocaloric functional properties of $Sm_{0.6}Sr_{0.4}MnO_3$ manganite due to advanced nanostructured morphology
Authors:
V. M. Andrade,
S. S. Pedro,
R. J. Caraballo Vivas,
D. L. Rocco,
M. S. Reis,
A. P. C. Campos,
A. de A. Coelho,
M. Escote,
A. Zenatti
Abstract:
The magnetocaloric effect (MCE) is the key concept to produce new, advanced, freon-like free, low cost and environmental friendly magnetic refrigerators. Among several potential materials, $Sm_{0.6}Sr_{0.4}MnO_3$ manganite presents one of the highest MCE value in comparison to all other known manganites; however, its studied was only concentrated on the bulk material. To overcame this lack of the…
▽ More
The magnetocaloric effect (MCE) is the key concept to produce new, advanced, freon-like free, low cost and environmental friendly magnetic refrigerators. Among several potential materials, $Sm_{0.6}Sr_{0.4}MnO_3$ manganite presents one of the highest MCE value in comparison to all other known manganites; however, its studied was only concentrated on the bulk material. To overcame this lack of the information we successfully produced advanced nanostructures, namely nanoparticles and nanotubes of that highlighted manganite by using a sol-gel modified method. High resolution transmission electron microscopy revealed nanoparticle and nanotube diameters of 29 nm and 200 nm, respectively; and, in addition, this technique also showed that the wall of the nanotube is formed by the nanoparticles with 25 nm of diameter. The magnetocaloric potentials, $ΔS_M$ versus T curves, of the nanostructures were obtained and they are broader than the their bulk counterpart. This increases the useful temperature range of a magnetic refrigerator. But also an undesired M-shape profile for the nanotube sample was observed, due to the rising of a superparamagnetic behavior. These results also evidenced the existence of a nanoparticle size threshold below which the advantage to make the transition wider is no longer valid.
△ Less
Submitted 10 August, 2015;
originally announced August 2015.