-
Evolutionary Computation for the Design and Enrichment of General-Purpose Artificial Intelligence Systems: Survey and Prospects
Authors:
Javier Poyatos,
Javier Del Ser,
Salvador Garcia,
Hisao Ishibuchi,
Daniel Molina,
Isaac Triguero,
Bing Xue,
Xin Yao,
Francisco Herrera
Abstract:
In Artificial Intelligence, there is an increasing demand for adaptive models capable of dealing with a diverse spectrum of learning tasks, surpassing the limitations of systems devised to cope with a single task. The recent emergence of General-Purpose Artificial Intelligence Systems (GPAIS) poses model configuration and adaptability challenges at far greater complexity scales than the optimal de…
▽ More
In Artificial Intelligence, there is an increasing demand for adaptive models capable of dealing with a diverse spectrum of learning tasks, surpassing the limitations of systems devised to cope with a single task. The recent emergence of General-Purpose Artificial Intelligence Systems (GPAIS) poses model configuration and adaptability challenges at far greater complexity scales than the optimal design of traditional Machine Learning models. Evolutionary Computation (EC) has been a useful tool for both the design and optimization of Machine Learning models, endowing them with the capability to configure and/or adapt themselves to the task under consideration. Therefore, their application to GPAIS is a natural choice. This paper aims to analyze the role of EC in the field of GPAIS, exploring the use of EC for their design or enrichment. We also match GPAIS properties to Machine Learning areas in which EC has had a notable contribution, highlighting recent milestones of EC for GPAIS. Furthermore, we discuss the challenges of harnessing the benefits of EC for GPAIS, presenting different strategies to both design and improve GPAIS with EC, covering tangential areas, identifying research niches, and outlining potential research directions for EC and GPAIS.
△ Less
Submitted 3 June, 2024;
originally announced July 2024.
-
Numerical semigroups from rational matrices I: power-integral matrices and nilpotent representations
Authors:
Arsh Chhabra,
Stephan Ramon Garcia,
Fangqian Zhang,
Hechun Zhang
Abstract:
Our aim in this paper is to initiate the study of exponent semigroups for rational matrices. We prove that every numerical semigroup is the exponent semigroup of some rational matrix. We also obtain lower bounds on the size of such matrices and discuss the related class of power-integral matrices.
Our aim in this paper is to initiate the study of exponent semigroups for rational matrices. We prove that every numerical semigroup is the exponent semigroup of some rational matrix. We also obtain lower bounds on the size of such matrices and discuss the related class of power-integral matrices.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Predict. Optimize. Revise. On Forecast and Policy Stability in Energy Management Systems
Authors:
Evgenii Genov,
Julian Ruddick,
Christoph Bergmeir,
Majid Vafaeipour,
Thierry Coosemans,
Salvador Garcia,
Maarten Messagie
Abstract:
This research addresses the challenge of integrating forecasting and optimization in energy management systems, focusing on the impacts of switching costs, forecast accuracy, and stability. It proposes a novel framework for analyzing online optimization problems with switching costs and enabled by deterministic and probabilistic forecasts. Through empirical evaluation and theoretical analysis, the…
▽ More
This research addresses the challenge of integrating forecasting and optimization in energy management systems, focusing on the impacts of switching costs, forecast accuracy, and stability. It proposes a novel framework for analyzing online optimization problems with switching costs and enabled by deterministic and probabilistic forecasts. Through empirical evaluation and theoretical analysis, the research reveals the balance between forecast accuracy, stability, and switching costs in sha** policy performance. Conducted in the context of battery scheduling within energy management applications, it introduces a metric for evaluating probabilistic forecast stability and examines the effects of forecast accuracy and stability on optimization outcomes using the real-world case of the Citylearn 2022 competition. Findings indicate that switching costs significantly influence the trade-off between forecast accuracy and stability, highlighting the importance of integrated systems that enable collaboration between forecasting and operational units for improved decision-making. The study shows that committing to a policy for longer periods can be advantageous over frequent updates. Results also show a correlation between forecast stability and policy performance, suggesting that stable forecasts can mitigate switching costs. The proposed framework provides valuable insights for energy sector decision-makers and forecast practitioners when designing the operation of an energy management system.
△ Less
Submitted 11 July, 2024; v1 submitted 29 June, 2024;
originally announced July 2024.
-
Enantiospecificity in NMR Enabled by Chirality-Induced Spin Selectivity
Authors:
T. Georgiou,
J. L. Palma,
V. Mujica,
S. Varela,
M. Galante,
V. Santamarıa Garcıa,
L. Mboning,
R. N. Schwartz,
G. Cuniberti,
L. -S. Bouchard
Abstract:
Spin polarization in chiral molecules is a magnetic molecular response associated with electron transport and enantioselective bond polarization that occurs even in the absence of an external magnetic field. An unexpected finding by Santos and co-workers reported enantiospecific NMR responses in solid-state cross-polarization (CP) experiments, suggesting a possible additional contribution to the i…
▽ More
Spin polarization in chiral molecules is a magnetic molecular response associated with electron transport and enantioselective bond polarization that occurs even in the absence of an external magnetic field. An unexpected finding by Santos and co-workers reported enantiospecific NMR responses in solid-state cross-polarization (CP) experiments, suggesting a possible additional contribution to the indirect nuclear spin-spin coupling in chiral molecules induced by bond polarization in the presence of spin-orbit coupling. Herein we provide a theoretical treatment for this phenomenon, presenting an effective spin-Hamiltonian for helical molecules like DNA and density functional theory (DFT) results on amino acids that confirm the dependence of J-couplings on the choice of enantiomer. The connection between nuclear spin dynamics and chirality could offer insights for molecular sensing and quantum information sciences. These results establish NMR as a potential tool for chiral discrimination without external agents.
△ Less
Submitted 2 July, 2024; v1 submitted 30 June, 2024;
originally announced July 2024.
-
Connected Speech-Based Cognitive Assessment in Chinese and English
Authors:
Saturnino Luz,
Sofia De La Fuente Garcia,
Fasih Haider,
Davida Fromm,
Brian MacWhinney,
Alyssa Lanzi,
Ya-Ning Chang,
Chia-Ju Chou,
Yi-Chien Liu
Abstract:
We present a novel benchmark dataset and prediction tasks for investigating approaches to assess cognitive function through analysis of connected speech. The dataset consists of speech samples and clinical information for speakers of Mandarin Chinese and English with different levels of cognitive impairment as well as individuals with normal cognition. These data have been carefully matched by age…
▽ More
We present a novel benchmark dataset and prediction tasks for investigating approaches to assess cognitive function through analysis of connected speech. The dataset consists of speech samples and clinical information for speakers of Mandarin Chinese and English with different levels of cognitive impairment as well as individuals with normal cognition. These data have been carefully matched by age and sex by propensity score analysis to ensure balance and representativity in model training. The prediction tasks encompass mild cognitive impairment diagnosis and cognitive test score prediction. This framework was designed to encourage the development of approaches to speech-based cognitive assessment which generalise across languages. We illustrate it by presenting baseline prediction models that employ language-agnostic and comparable features for diagnosis and cognitive test score prediction. The models achieved unweighted average recall was 59.2% in diagnosis, and root mean squared error of 2.89 in score prediction.
△ Less
Submitted 18 June, 2024; v1 submitted 11 June, 2024;
originally announced June 2024.
-
A Frame-based Attention Interpretation Method for Relevant Acoustic Feature Extraction in Long Speech Depression Detection
Authors:
Qingkun Deng,
Saturnino Luz,
Sofia de la Fuente Garcia
Abstract:
Speech-based depression detection tools could help early screening of depression. Here, we address two issues that may hinder the clinical practicality of such tools: segment-level labelling noise and a lack of model interpretability. We propose a speech-level Audio Spectrogram Transformer to avoid segment-level labelling. We observe that the proposed model significantly outperforms a segment-leve…
▽ More
Speech-based depression detection tools could help early screening of depression. Here, we address two issues that may hinder the clinical practicality of such tools: segment-level labelling noise and a lack of model interpretability. We propose a speech-level Audio Spectrogram Transformer to avoid segment-level labelling. We observe that the proposed model significantly outperforms a segment-level model, providing evidence for the presence of segment-level labelling noise in audio modality and the advantage of longer-duration speech analysis for depression detection. We introduce a frame-based attention interpretation method to extract acoustic features from prediction-relevant waveform signals for interpretation by clinicians. Through interpretation, we observe that the proposed model identifies reduced loudness and F0 as relevant signals of depression, which aligns with the speech characteristics of depressed patients documented in clinical studies.
△ Less
Submitted 7 June, 2024; v1 submitted 5 June, 2024;
originally announced June 2024.
-
On the Ollivier-Ricci curvature as fragility indicator of the stock markets
Authors:
Joaquín Sánchez García,
Sebastian Gherghe
Abstract:
Recently, an indicator for stock market fragility and crash size in terms of the Ollivier-Ricci curvature has been proposed. We study analytical and empirical properties of such indicator, test its elasticity with respect to different parameters and provide heuristics for the parameters involved. We show when and how the indicator accurately describes a financial crisis. We also propose an alterna…
▽ More
Recently, an indicator for stock market fragility and crash size in terms of the Ollivier-Ricci curvature has been proposed. We study analytical and empirical properties of such indicator, test its elasticity with respect to different parameters and provide heuristics for the parameters involved. We show when and how the indicator accurately describes a financial crisis. We also propose an alternate method for calculating the indicator using a specific sub-graph with special curvature properties.
△ Less
Submitted 11 May, 2024;
originally announced May 2024.
-
Mining higher-order triadic interactions
Authors:
Anthony Baptista,
Marta Niedostatek,
Jun Yamamoto,
Ben MacArthur,
Jurgen Kurths,
Ruben Sanchez Garcia,
Ginestra Bianconi
Abstract:
Complex systems often present higher-order interactions which require us to go beyond their description in terms of pairwise networks. Triadic interactions are a fundamental type of higher-order interaction that occurs when one node regulates the interaction between two other nodes. Triadic interactions are a fundamental type of higher-order networks, found in a large variety of biological systems…
▽ More
Complex systems often present higher-order interactions which require us to go beyond their description in terms of pairwise networks. Triadic interactions are a fundamental type of higher-order interaction that occurs when one node regulates the interaction between two other nodes. Triadic interactions are a fundamental type of higher-order networks, found in a large variety of biological systems, from neuron-glia interactions to gene-regulation and ecosystems. However, triadic interactions have been so far mostly neglected. In this article, we propose a theoretical principle to model and mine triadic interactions from node metadata, and we apply this framework to gene expression data finding new candidates for triadic interactions relevant for Acute Myeloid Leukemia. Our work reveals important aspects of higher-order triadic interactions often ignored, which can transform our understanding of complex systems and be applied to a large variety of systems ranging from biology to the climate.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
Towards Better Understanding of Cybercrime: The Role of Fine-Tuned LLMs in Translation
Authors:
Veronica Valeros,
Anna Širokova,
Carlos Catania,
Sebastian Garcia
Abstract:
Understanding cybercrime communications is paramount for cybersecurity defence. This often involves translating communications into English for processing, interpreting, and generating timely intelligence. The problem is that translation is hard. Human translation is slow, expensive, and scarce. Machine translation is inaccurate and biased. We propose using fine-tuned Large Language Models (LLM) t…
▽ More
Understanding cybercrime communications is paramount for cybersecurity defence. This often involves translating communications into English for processing, interpreting, and generating timely intelligence. The problem is that translation is hard. Human translation is slow, expensive, and scarce. Machine translation is inaccurate and biased. We propose using fine-tuned Large Language Models (LLM) to generate translations that can accurately capture the nuances of cybercrime language. We apply our technique to public chats from the NoName057(16) Russian-speaking hacktivist group. Our results show that our fine-tuned LLM model is better, faster, more accurate, and able to capture nuances of the language. Our method shows it is possible to achieve high-fidelity translations and significantly reduce costs by a factor ranging from 430 to 23,000 compared to a human translator.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
Hunter's positivity theorem and random vector norms
Authors:
Ludovick Bouthat,
Ángel Chávez,
Stephan Ramon Garcia
Abstract:
A theorem of Hunter ensures that the complete homogeneous symmetric polynomials of even degree are positive definite functions. A probabilistic interpretation of Hunter's theorem suggests a broad generalization: the construction of so-called random vector norms on square complex matrices. This paper surveys these ideas, starting from the fundamental notions and develo** the theory to its present…
▽ More
A theorem of Hunter ensures that the complete homogeneous symmetric polynomials of even degree are positive definite functions. A probabilistic interpretation of Hunter's theorem suggests a broad generalization: the construction of so-called random vector norms on square complex matrices. This paper surveys these ideas, starting from the fundamental notions and develo** the theory to its present state. We study numerous examples and present a host of open problems.
△ Less
Submitted 15 March, 2024;
originally announced March 2024.
-
Optimizing Conical Intersections Without Explicit Use of Non-Adiabatic Couplings
Authors:
Juan Sanz García,
Rosa Maskri,
Alexander Mitrushchenkov,
Loïc Joubert-Doriol
Abstract:
We present two alternative methods for optimizing minimum energy conical intersection (MECI) molecular geometries without knowledge of the derivative coupling (DC). These methods are based on the utilization of Lagrange multipliers: i) one method uses an approximate calculation of the DC, while the other ii) do not require the DC. Both methods use the fact that information of the DC is contained i…
▽ More
We present two alternative methods for optimizing minimum energy conical intersection (MECI) molecular geometries without knowledge of the derivative coupling (DC). These methods are based on the utilization of Lagrange multipliers: i) one method uses an approximate calculation of the DC, while the other ii) do not require the DC. Both methods use the fact that information of the DC is contained in the Hessian of the squared energy difference. Tests done on a set of small molecular systems, in comparison with other methods, show the ability of the proposed methods to optimize MECIs. Finally, we apply the methods to the furimamide molecule, to optimize and characterize its S$_1$ /S$_2$ MECI, and to optimizing the S$_0$ /S$_1$ MECI of the silver trimer.
△ Less
Submitted 13 March, 2024;
originally announced March 2024.
-
Resolving chiral transitions in Rydberg arrays with quantum Kibble-Zurek mechanism and finite-time scaling
Authors:
Jose Soto Garcia,
Natalia Chepiga
Abstract:
The experimental realization of the quantum Kibble-Zurek mechanism in arrays of trapped Rydberg atoms has brought the problem of commensurate-incommensurate transition back into the focus of active research. Relying on equilibrium simulations of finite intervals, direct chiral transitions at the boundary of the period-3 and period-4 phases have been predicted. Here, we study how these chiral trans…
▽ More
The experimental realization of the quantum Kibble-Zurek mechanism in arrays of trapped Rydberg atoms has brought the problem of commensurate-incommensurate transition back into the focus of active research. Relying on equilibrium simulations of finite intervals, direct chiral transitions at the boundary of the period-3 and period-4 phases have been predicted. Here, we study how these chiral transitions can be diagnosed experimentally with critical dynamics. We demonstrate that chiral transitions can be distinguished from the floating phases by comparing Kibble-Zurek dynamics on arrays with different numbers of atoms. Furthermore, by swee** in the opposite direction and kee** track of the order parameter, we identify the location of conformal points. Finally, combining forward and backward sweeps, we extract all critical exponents characterizing the transition.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
On the impact of measure pre-conditionings on general parametric ML models and transfer learning via domain adaptation
Authors:
Joaquín Sánchez García
Abstract:
We study a new technique for understanding convergence of learning agents under small modifications of data. We show that such convergence can be understood via an analogue of Fatou's lemma which yields gamma-convergence. We show it's relevance and applications in general machine learning tasks and domain adaptation transfer learning.
We study a new technique for understanding convergence of learning agents under small modifications of data. We show that such convergence can be understood via an analogue of Fatou's lemma which yields gamma-convergence. We show it's relevance and applications in general machine learning tasks and domain adaptation transfer learning.
△ Less
Submitted 4 March, 2024;
originally announced March 2024.
-
Exploring the sensitivity to non-standard and generalized neutrino interactions through coherent elastic neutrino-nucleus scattering with a NaI detector
Authors:
Sabya Sachi Chatterjee,
Stéphane Lavignac,
O. G. Miranda,
G. Sanchez Garcia
Abstract:
After the first observation of coherent elastic neutrino-nucleus scattering (CE$ν$NS) by the COHERENT collaboration, many efforts are being made to improve the measurement of this process, making it possible to constrain new physics in the neutrino sector. In this paper, we study the sensitivity to non-standard interactions (NSIs) and generalized neutrino interactions (GNIs) of a NaI detector with…
▽ More
After the first observation of coherent elastic neutrino-nucleus scattering (CE$ν$NS) by the COHERENT collaboration, many efforts are being made to improve the measurement of this process, making it possible to constrain new physics in the neutrino sector. In this paper, we study the sensitivity to non-standard interactions (NSIs) and generalized neutrino interactions (GNIs) of a NaI detector with characteristics similar to the one that is currently being deployed at the Spallation Neutron Source at Oak Ridge National Laboratory. We show that such a detector, whose target nuclei have significantly different proton to neutron ratios (at variance with the current CsI detector), could help to partially break the parameter degeneracies arising from the interference between the Standard Model and NSI contributions to the CE$ν$NS cross section, as well as between different NSI parameters. By contrast, only a slight improvement over the current CsI constraints is expected for parameters that do not interfere with the SM contribution. We find that a significant reduction of the background level would make the NaI detector considered in this paper very efficient at breaking degeneracies among NSI parameters.
△ Less
Submitted 26 February, 2024;
originally announced February 2024.
-
Graded polynomial identities of the infinite-dimensional upper triangular matrices over an arbitrary field
Authors:
Micael Said Garcia,
Felipe Yukihide Yasumura
Abstract:
We compute the graded polynomial identities of the infinite dimensional upper triangular matrix algebra over an arbitrary field. If the grading group is finite, we prove that the set of graded polynomial identities admits a finite basis. We find conditions under which a grading on such an algebra satisfies a nontrivial graded polynomial identity. Finally, we provide examples showing that two nonis…
▽ More
We compute the graded polynomial identities of the infinite dimensional upper triangular matrix algebra over an arbitrary field. If the grading group is finite, we prove that the set of graded polynomial identities admits a finite basis. We find conditions under which a grading on such an algebra satisfies a nontrivial graded polynomial identity. Finally, we provide examples showing that two nonisomorphic gradings can have the same set of graded polynomial identities.
△ Less
Submitted 16 February, 2024;
originally announced February 2024.
-
Multisource Semisupervised Adversarial Domain Generalization Network for Cross-Scene Sea-Land Clutter Classification
Authors:
Xiaoxuan Zhang,
Quan Pan,
Salvador García
Abstract:
Deep learning (DL)-based sea\textendash land clutter classification for sky-wave over-the-horizon-radar (OTHR) has become a novel research topic. In engineering applications, real-time predictions of sea\textendash land clutter with existing distribution discrepancies are crucial. To solve this problem, this article proposes a novel Multisource Semisupervised Adversarial Domain Generalization Netw…
▽ More
Deep learning (DL)-based sea\textendash land clutter classification for sky-wave over-the-horizon-radar (OTHR) has become a novel research topic. In engineering applications, real-time predictions of sea\textendash land clutter with existing distribution discrepancies are crucial. To solve this problem, this article proposes a novel Multisource Semisupervised Adversarial Domain Generalization Network (MSADGN) for cross-scene sea\textendash land clutter classification. MSADGN can extract domain-invariant and domain-specific features from one labeled source domain and multiple unlabeled source domains, and then generalize these features to an arbitrary unseen target domain for real-time prediction of sea\textendash land clutter. Specifically, MSADGN consists of three modules: domain-related pseudolabeling module, domain-invariant module, and domain-specific module. The first module introduces an improved pseudolabel method called domain-related pseudolabel, which is designed to generate reliable pseudolabels to fully exploit unlabeled source domains. The second module utilizes a generative adversarial network (GAN) with a multidiscriminator to extract domain-invariant features, to enhance the model's transferability in the target domain. The third module employs a parallel multiclassifier branch to extract domain-specific features, to enhance the model's discriminability in the target domain. The effectiveness of our method is validated in twelve domain generalizations (DG) scenarios. Meanwhile, we selected 10 state-of-the-art DG methods for comparison. The experimental results demonstrate the superiority of our method.
△ Less
Submitted 9 March, 2024; v1 submitted 9 February, 2024;
originally announced February 2024.
-
Structural and optical properties of self-assembled AlN nanowires grown on SiO2/Si substrates by molecular beam epitaxy
Authors:
Ž. Gačević,
J. Grandal,
Q. Guo,
R. Kirste,
M. Varela,
Z. Sitar,
M. A. Sánchez García
Abstract:
Self assembled AlN nanowires (NWs) are grown by plasma assisted molecular beam epitaxy (PAMBE) on SiO2 / Si (111) substrates. Using a combination of in-situ reflective high energy electron diffraction and ex situ X ray diffraction (XRD), we show that the NWs grow nearly strain free, preferentially perpendicular to the amorphous SiO2 interlayer and without epitaxial relationship to Si(111) substrat…
▽ More
Self assembled AlN nanowires (NWs) are grown by plasma assisted molecular beam epitaxy (PAMBE) on SiO2 / Si (111) substrates. Using a combination of in-situ reflective high energy electron diffraction and ex situ X ray diffraction (XRD), we show that the NWs grow nearly strain free, preferentially perpendicular to the amorphous SiO2 interlayer and without epitaxial relationship to Si(111) substrate, as expected. Scanning electron microscopy investigation reveals significant NWs coalescence, which results in their progressively increasing diameter and formation of columnar structures with non hexagonal cross section. Making use of scanning transmission electron microscopy (STEM), the NWs initial diameters are found in the 20 to 30 nm range. In addition, the formation of a thin (30 nm) polycrystalline AlN layer is observed on the substrate surface. Regarding the structural quality of the AlN NWs, STEM measurements reveal the formation of extended columnar regions, which grow with a virtually perfect metal-polarity wurtzite arrangement and with extended defects only sporadically observed. Combination of STEM and electron energy loss spectroscopy (EELS) reveals the formation of continuous aluminum oxide (1 to 2 nm) on the NW surface. Low temperature photoluminescence measurements reveal a single near band edge (NBE) emission peak, positioned at 6.03 eV (at 2 K), a value consistent with nearly zero NW strain evidenced by XRD and in agreement with the values obtained on AlN bulk layers synthesized by other growth techniques. The significant full width at half maximum of NBE emission, found at 20 meV (at 2 K), suggests that free and bound excitons are mixed together within this single emission band.
△ Less
Submitted 31 January, 2024;
originally announced February 2024.
-
Final CONUS results on coherent elastic neutrino nucleus scattering at the Brokdorf reactor
Authors:
N. Ackermann,
H. Bonet,
A. Bonhomme,
C. Buck,
K. Fülber,
J. Hakenmüller,
J. Hempfling,
J. Henrichs,
G. Heusser,
M. Lindner,
W. Maneschg,
T. Rink,
E. Sanchez Garcia,
J. Stauber,
H. Strecker,
R. Wink
Abstract:
The CONUS experiment studies coherent elastic neutrino nucleus scattering in four 1 kg germanium spectrometers. Low ionization energy thresholds of 210 eV were achieved. The detectors were operated inside an optimized shield at the Brokdorf nuclear power plant which provided a reactor antineutrino flux of up to $2.3\cdot10^{13}$ cm$^{-2}$s$^{-1}$. In the final phase of data collection at this site…
▽ More
The CONUS experiment studies coherent elastic neutrino nucleus scattering in four 1 kg germanium spectrometers. Low ionization energy thresholds of 210 eV were achieved. The detectors were operated inside an optimized shield at the Brokdorf nuclear power plant which provided a reactor antineutrino flux of up to $2.3\cdot10^{13}$ cm$^{-2}$s$^{-1}$. In the final phase of data collection at this site, the constraints on the neutrino interaction rate were improved by an order of magnitude as compared to the previous CONUS analysis. The new limit of less than 0.34 signal events kg$^{-1}$d$^{-1}$ is within a factor 2 of the rate predicted by the Standard Model.
△ Less
Submitted 5 April, 2024; v1 submitted 15 January, 2024;
originally announced January 2024.
-
Automatic UAV-based Airport Pavement Inspection Using Mixed Real and Virtual Scenarios
Authors:
Pablo Alonso,
Jon Ander Iñiguez de Gordoa,
Juan Diego Ortega,
Sara García,
Francisco Javier Iriarte,
Marcos Nieto
Abstract:
Runway and taxiway pavements are exposed to high stress during their projected lifetime, which inevitably leads to a decrease in their condition over time. To make sure airport pavement condition ensure uninterrupted and resilient operations, it is of utmost importance to monitor their condition and conduct regular inspections. UAV-based inspection is recently gaining importance due to its wide ra…
▽ More
Runway and taxiway pavements are exposed to high stress during their projected lifetime, which inevitably leads to a decrease in their condition over time. To make sure airport pavement condition ensure uninterrupted and resilient operations, it is of utmost importance to monitor their condition and conduct regular inspections. UAV-based inspection is recently gaining importance due to its wide range monitoring capabilities and reduced cost. In this work, we propose a vision-based approach to automatically identify pavement distress using images captured by UAVs. The proposed method is based on Deep Learning (DL) to segment defects in the image. The DL architecture leverages the low computational capacities of embedded systems in UAVs by using an optimised implementation of EfficientNet feature extraction and Feature Pyramid Network segmentation. To deal with the lack of annotated data for training we have developed a synthetic dataset generation methodology to extend available distress datasets. We demonstrate that the use of a mixed dataset composed of synthetic and real training images yields better results when testing the training models in real application scenarios.
△ Less
Submitted 11 January, 2024;
originally announced January 2024.
-
Cheetah: Bridging the Gap Between Machine Learning and Particle Accelerator Physics with High-Speed, Differentiable Simulations
Authors:
Jan Kaiser,
Chenran Xu,
Annika Eichler,
Andrea Santamaria Garcia
Abstract:
Machine learning has emerged as a powerful solution to the modern challenges in accelerator physics. However, the limited availability of beam time, the computational cost of simulations, and the high-dimensionality of optimisation problems pose significant challenges in generating the required data for training state-of-the-art machine learning models. In this work, we introduce Cheetah, a PyTorc…
▽ More
Machine learning has emerged as a powerful solution to the modern challenges in accelerator physics. However, the limited availability of beam time, the computational cost of simulations, and the high-dimensionality of optimisation problems pose significant challenges in generating the required data for training state-of-the-art machine learning models. In this work, we introduce Cheetah, a PyTorch-based high-speed differentiable linear-beam dynamics code. Cheetah enables the fast collection of large data sets by reducing computation times by multiple orders of magnitude and facilitates efficient gradient-based optimisation for accelerator tuning and system identification. This positions Cheetah as a user-friendly, readily extensible tool that integrates seamlessly with widely adopted machine learning tools. We showcase the utility of Cheetah through five examples, including reinforcement learning training, gradient-based beamline tuning, gradient-based system identification, physics-informed Bayesian optimisation priors, and modular neural network surrogate modelling of space charge effects. The use of such a high-speed differentiable simulation code will simplify the development of machine learning-based methods for particle accelerators and fast-track their integration into everyday operations of accelerator facilities.
△ Less
Submitted 11 January, 2024;
originally announced January 2024.
-
A Fully Automated Pipeline Using Swin Transformers for Deep Learning-Based Blood Segmentation on Head CT Scans After Aneurysmal Subarachnoid Hemorrhage
Authors:
Sergio Garcia Garcia,
Santiago Cepeda,
Ignacio Arrese,
Rosario Sarabia
Abstract:
Background: Accurate volumetric assessment of spontaneous subarachnoid hemorrhage (SAH) is a labor-intensive task performed with current manual and semiautomatic methods that might be relevant for its clinical and prognostic implications. In the present research, we sought to develop and validate an artificial intelligence-driven, fully automated blood segmentation tool for SAH patients via noncon…
▽ More
Background: Accurate volumetric assessment of spontaneous subarachnoid hemorrhage (SAH) is a labor-intensive task performed with current manual and semiautomatic methods that might be relevant for its clinical and prognostic implications. In the present research, we sought to develop and validate an artificial intelligence-driven, fully automated blood segmentation tool for SAH patients via noncontrast computed tomography (NCCT) scans employing a transformer-based Swin UNETR architecture. Methods: We retrospectively analyzed NCCT scans from patients with confirmed aneurysmal subarachnoid hemorrhage (aSAH) utilizing the Swin UNETR for segmentation. The performance of the proposed method was evaluated against manually segmented ground truth data using metrics such as Dice score, intersection over union (IoU), the volumetric similarity index (VSI), the symmetric average surface distance (SASD), and sensitivity and specificity. A validation cohort from an external institution was included to test the generalizability of the model. Results: The model demonstrated high accuracy with robust performance metrics across the internal and external validation cohorts. Notably, it achieved high Dice coefficient (0.873), IoU (0.810), VSI (0.840), sensitivity (0.821) and specificity (0.996) values and a low SASD (1.866), suggesting proficiency in segmenting blood in SAH patients. The model's efficiency was reflected in its processing speed, indicating potential for real-time applications. Conclusions: Our Swin UNETR-based model offers significant advances in the automated segmentation of blood after aSAH on NCCT images. Despite the computational intensity, the model operates effectively on standard hardware with a user-friendly interface, facilitating broader clinical adoption. Further validation across diverse datasets is warranted to confirm its clinical reliability.
△ Less
Submitted 29 December, 2023;
originally announced December 2023.
-
Bayesian Optimization Algorithms for Accelerator Physics
Authors:
Ryan Roussel,
Auralee L. Edelen,
Tobias Boltz,
Dylan Kennedy,
Zhe Zhang,
Fuhao Ji,
Xiaobiao Huang,
Daniel Ratner,
Andrea Santamaria Garcia,
Chenran Xu,
Jan Kaiser,
Angel Ferran Pousa,
Annika Eichler,
Jannis O. Lubsen,
Natalie M. Isenberg,
Yuan Gao,
Nikita Kuklev,
Jose Martinez,
Brahim Mustapha,
Verena Kain,
Weijian Lin,
Simone Maria Liuzzo,
Jason St. John,
Matthew J. V. Streeter,
Remi Lehe
, et al. (1 additional authors not shown)
Abstract:
Accelerator physics relies on numerical algorithms to solve optimization problems in online accelerator control and tasks such as experimental design and model calibration in simulations. The effectiveness of optimization algorithms in discovering ideal solutions for complex challenges with limited resources often determines the problem complexity these methods can address. The accelerator physics…
▽ More
Accelerator physics relies on numerical algorithms to solve optimization problems in online accelerator control and tasks such as experimental design and model calibration in simulations. The effectiveness of optimization algorithms in discovering ideal solutions for complex challenges with limited resources often determines the problem complexity these methods can address. The accelerator physics community has recognized the advantages of Bayesian optimization algorithms, which leverage statistical surrogate models of objective functions to effectively address complex optimization challenges, especially in the presence of noise during accelerator operation and in resource-intensive physics simulations. In this review article, we offer a conceptual overview of applying Bayesian optimization techniques towards solving optimization problems in accelerator physics. We begin by providing a straightforward explanation of the essential components that make up Bayesian optimization techniques. We then give an overview of current and previous work applying and modifying these techniques to solve accelerator physics challenges. Finally, we explore practical implementation strategies for Bayesian optimization algorithms to maximize their performance, enabling users to effectively address complex optimization challenges in real-time beam control and accelerator design.
△ Less
Submitted 5 April, 2024; v1 submitted 9 December, 2023;
originally announced December 2023.
-
Protecting Sensitive Tabular Data in Hybrid Clouds
Authors:
Maya Anderson,
Gidon Gershinsky,
Eliot Salant,
Salvador Garcia
Abstract:
Regulated industries, such as Healthcare and Finance, are starting to move parts of their data and workloads to the public cloud. However, they are still reluctant to trust the public cloud with their most sensitive records, and hence leave them in their premises, leveraging the hybrid cloud architecture. We address the security and performance challenges of big data analytics using a hybrid cloud…
▽ More
Regulated industries, such as Healthcare and Finance, are starting to move parts of their data and workloads to the public cloud. However, they are still reluctant to trust the public cloud with their most sensitive records, and hence leave them in their premises, leveraging the hybrid cloud architecture. We address the security and performance challenges of big data analytics using a hybrid cloud in a real-life use case from a hospital. In this use case, the hospital collects sensitive patient data and wants to run analytics on it in order to lower antibiotics resistance, a significant challenge in healthcare. We show that it is possible to run large-scale analytics on data that is securely stored in the public cloud encrypted using Apache Parquet Modular Encryption (PME), without significant performance losses even if the secret encryption keys are stored on-premises. PME is a standard mechanism for data encryption and key management, not specific to any public cloud, and therefore helps prevent vendor lock-in. It also provides privacy and integrity guarantees, and enables granular access control to the data. We also present an innovation in PME for lowering the performance hit incurred by calls to the Key Management Service. Our solution therefore enables protecting large amounts of sensitive data in hybrid clouds and still allows to efficiently gain valuable insights from it.
△ Less
Submitted 3 December, 2023;
originally announced December 2023.
-
Atomically thin current pathways in graphene through Kekulé-O engineering
Authors:
Santiago Galván y García,
Yonatan Betancur-Ocampo,
Francisco Sánchez-Ochoa,
Thomas Stegmann
Abstract:
We demonstrate that the current flow in graphene can be guided on atomically thin current pathways by means of the engineering of Kekulé-O distortions. A grain boundary in these distortions separates the system into topological distinct regions and induces a ballistic domain-wall state. The state does not depend on the precise orientation of the grain boundary with respect to the graphene sublatti…
▽ More
We demonstrate that the current flow in graphene can be guided on atomically thin current pathways by means of the engineering of Kekulé-O distortions. A grain boundary in these distortions separates the system into topological distinct regions and induces a ballistic domain-wall state. The state does not depend on the precise orientation of the grain boundary with respect to the graphene sublattice and therefore, permits to guide the current on arbitrary paths through the system. As the state is gapped, the current flow can be switched by electrostatic gates. Our findings can be explained by a generalization of the Jackiw-Rebbi model, where the electrons behave in one region of the system as fermions with an effective complex mass, making the device not only promising for technological applications but also a test-ground for concepts from high-energy physics. An atomic model supported by DFT calculations demonstrates that the proposed system can be realized by decorating graphene with Ti atoms.
△ Less
Submitted 29 November, 2023;
originally announced November 2023.
-
Probing nuclear properties and neutrino physics with current and future CEνNS experiments
Authors:
R. R. Rossi,
G. Sanchez Garcia,
M. Tórtola
Abstract:
The recent observation of Coherent Elastic Neutrino Nucleus Scattering (CEνNS) with neutrinos from pion decay at rest (π-DAR) sources by the COHERENT Collaboration has raised interest in this process in the search for new physics. Unfortunately, current uncertainties in the determination of nuclear parameters relevant to those processes can hide new physics effects. This is not the case for proces…
▽ More
The recent observation of Coherent Elastic Neutrino Nucleus Scattering (CEνNS) with neutrinos from pion decay at rest (π-DAR) sources by the COHERENT Collaboration has raised interest in this process in the search for new physics. Unfortunately, current uncertainties in the determination of nuclear parameters relevant to those processes can hide new physics effects. This is not the case for processes involving lower-energy neutrino sources such as nuclear reactors. Note, however, that a CEνNS measurement with reactor neutrinos depends largely on the determination of the quenching factor, making its observation more challenging. In the upcoming years, once this signal is confirmed, a combined analysis of π-DAR and reactor CEνNS experiments will be very useful to probe particle and nuclear physics, with a reduced dependence on the nuclear uncertainties. In this work, we explore this idea by simultaneously testing the sensitivity of current and future CEνNS experiments to neutrino non-standard interactions (NSI) and the neutron root mean square (rms) radius, considering different neutrino sources as well as several detection materials. We show how the interplay between future reactor and accelerator CEνNS experiments can help to get robust constraints on the neutron rms, and to break degeneracies between the NSI parameters. Our forecast could be used as a guide to optimize the experimental sensitivity to the parameters under study.
△ Less
Submitted 28 November, 2023;
originally announced November 2023.
-
An Almgren monotonicity formula for discrete harmonic functions
Authors:
Mariana Smit Vega Garcia,
Stefan Steinerberger
Abstract:
The celebrated Almgren monotonicity formula for harmonic functions $u:\mathbb{R}^n \rightarrow \mathbb{R}$ says that its $L^2-$energy concentrated on a sphere of radius $r$, when measured in a suitable sense, is non-decreasing: if $u$ oscillates at a certain scale, it has even larger oscillations at a larger scale. We prove a discrete analogue of the Almgren monotonicity formula for harmonic funct…
▽ More
The celebrated Almgren monotonicity formula for harmonic functions $u:\mathbb{R}^n \rightarrow \mathbb{R}$ says that its $L^2-$energy concentrated on a sphere of radius $r$, when measured in a suitable sense, is non-decreasing: if $u$ oscillates at a certain scale, it has even larger oscillations at a larger scale. We prove a discrete analogue of the Almgren monotonicity formula for harmonic functions on infinite combinatorial graphs $G=(V,E)$. Some applications are discussed.
△ Less
Submitted 20 November, 2023;
originally announced November 2023.
-
Deep Learning Brasil at ABSAPT 2022: Portuguese Transformer Ensemble Approaches
Authors:
Juliana Resplande Santanna Gomes,
Eduardo Augusto Santos Garcia,
Adalberto Ferreira Barbosa Junior,
Ruan Chaves Rodrigues,
Diogo Fernandes Costa Silva,
Dyonnatan Ferreira Maia,
Nádia Félix Felipe da Silva,
Arlindo Rodrigues Galvão Filho,
Anderson da Silva Soares
Abstract:
Aspect-based Sentiment Analysis (ABSA) is a task whose objective is to classify the individual sentiment polarity of all entities, called aspects, in a sentence. The task is composed of two subtasks: Aspect Term Extraction (ATE), identify all aspect terms in a sentence; and Sentiment Orientation Extraction (SOE), given a sentence and its aspect terms, the task is to determine the sentiment polarit…
▽ More
Aspect-based Sentiment Analysis (ABSA) is a task whose objective is to classify the individual sentiment polarity of all entities, called aspects, in a sentence. The task is composed of two subtasks: Aspect Term Extraction (ATE), identify all aspect terms in a sentence; and Sentiment Orientation Extraction (SOE), given a sentence and its aspect terms, the task is to determine the sentiment polarity of each aspect term (positive, negative or neutral). This article presents we present our participation in Aspect-Based Sentiment Analysis in Portuguese (ABSAPT) 2022 at IberLEF 2022. We submitted the best performing systems, achieving new state-of-the-art results on both subtasks.
△ Less
Submitted 8 November, 2023;
originally announced November 2023.
-
On Forecast Stability
Authors:
Rakshitha Godahewa,
Christoph Bergmeir,
Zeynep Erkin Baz,
Chengjun Zhu,
Zhangdi Song,
Salvador García,
Dario Benavides
Abstract:
Forecasts are typically not produced in a vacuum but in a business context, where forecasts are generated on a regular basis and interact with each other. For decisions, it may be important that forecasts do not change arbitrarily, and are stable in some sense. However, this area has received only limited attention in the forecasting literature. In this paper, we explore two types of forecast stab…
▽ More
Forecasts are typically not produced in a vacuum but in a business context, where forecasts are generated on a regular basis and interact with each other. For decisions, it may be important that forecasts do not change arbitrarily, and are stable in some sense. However, this area has received only limited attention in the forecasting literature. In this paper, we explore two types of forecast stability that we call vertical stability and horizontal stability. The existing works in the literature are only applicable to certain base models and extending these frameworks to be compatible with any base model is not straightforward. Furthermore, these frameworks can only stabilise the forecasts vertically. To fill this gap, we propose a simple linear-interpolation-based approach that is applicable to stabilise the forecasts provided by any base model vertically and horizontally. The approach can produce both accurate and stable forecasts. Using N-BEATS, Pooled Regression and LightGBM as the base models, in our evaluation on four publicly available datasets, the proposed framework is able to achieve significantly higher stability and/or accuracy compared to a set of benchmarks including a state-of-the-art forecast stabilisation method across three error metrics and six stability metrics.
△ Less
Submitted 26 October, 2023;
originally announced October 2023.
-
Norms on complex matrices induced by random vectors II: extension of weakly unitarily invariant norms
Authors:
Ángel Chávez,
Stephan Ramon Garcia,
Jackson Hurley
Abstract:
We improve and expand in two directions the theory of norms on complex matrices induced by random vectors. We first provide a simple proof of the classification of weakly unitarily invariant norms on the Hermitian matrices. We use this to extend the main theorem in [7] from exponent $d\geq 2$ to $d \geq 1$. Our proofs are much simpler than the originals: they do not require Lewis' framework for gr…
▽ More
We improve and expand in two directions the theory of norms on complex matrices induced by random vectors. We first provide a simple proof of the classification of weakly unitarily invariant norms on the Hermitian matrices. We use this to extend the main theorem in [7] from exponent $d\geq 2$ to $d \geq 1$. Our proofs are much simpler than the originals: they do not require Lewis' framework for group invariance in convex matrix analysis. This clarification puts the entire theory on simpler foundations while extending its range of applicability.
△ Less
Submitted 25 October, 2023; v1 submitted 11 October, 2023;
originally announced October 2023.
-
Symmetric tensor powers of graphs
Authors:
Weymar Astaiza,
Alexander J. Barrios,
Henry Chimal-Dzul,
Stephan Ramon Garcia,
Jaaziel de la Luz,
Victor H. Moll,
Yunied Puig,
Diego Villamizar
Abstract:
The symmetric tensor power of graphs is introduced and its fundamental properties are explored. A wide range of intriguing phenomena occur when one considers symmetric tensor powers of familiar graphs. A host of open questions are presented, ho** to spur future research.
The symmetric tensor power of graphs is introduced and its fundamental properties are explored. A wide range of intriguing phenomena occur when one considers symmetric tensor powers of familiar graphs. A host of open questions are presented, ho** to spur future research.
△ Less
Submitted 24 September, 2023;
originally announced September 2023.
-
Hierarchical attention interpretation: an interpretable speech-level transformer for bi-modal depression detection
Authors:
Qingkun Deng,
Saturnino Luz,
Sofia de la Fuente Garcia
Abstract:
Depression is a common mental disorder. Automatic depression detection tools using speech, enabled by machine learning, help early screening of depression. This paper addresses two limitations that may hinder the clinical implementations of such tools: noise resulting from segment-level labelling and a lack of model interpretability. We propose a bi-modal speech-level transformer to avoid segment-…
▽ More
Depression is a common mental disorder. Automatic depression detection tools using speech, enabled by machine learning, help early screening of depression. This paper addresses two limitations that may hinder the clinical implementations of such tools: noise resulting from segment-level labelling and a lack of model interpretability. We propose a bi-modal speech-level transformer to avoid segment-level labelling and introduce a hierarchical interpretation approach to provide both speech-level and sentence-level interpretations, based on gradient-weighted attention maps derived from all attention layers to track interactions between input features. We show that the proposed model outperforms a model that learns at a segment level ($p$=0.854, $r$=0.947, $F1$=0.897 compared to $p$=0.732, $r$=0.808, $F1$=0.768). For model interpretation, using one true positive sample, we show which sentences within a given speech are most relevant to depression detection; and which text tokens and Mel-spectrogram regions within these sentences are most relevant to depression detection. These interpretations allow clinicians to verify the validity of predictions made by depression detection tools, promoting their clinical implementations.
△ Less
Submitted 6 October, 2023; v1 submitted 23 September, 2023;
originally announced September 2023.
-
Differences between quantum and classical adiabatic evolution
Authors:
Cyrill Bösch,
Andreas Fichtner,
Marc Serra Garcia
Abstract:
Adiabatic evolution is an emergent design principle for time modulated metamaterials, often inspired by insights from topological quantum computing such as Majorana fermions and braiding operations. However, the pursuit of classical adiabatic metamaterials is rooted on the assumption that classical and quantum adiabatic evolution are equivalent. We show that this is not the case; and some instance…
▽ More
Adiabatic evolution is an emergent design principle for time modulated metamaterials, often inspired by insights from topological quantum computing such as Majorana fermions and braiding operations. However, the pursuit of classical adiabatic metamaterials is rooted on the assumption that classical and quantum adiabatic evolution are equivalent. We show that this is not the case; and some instances of quantum adiabatic evolution, such as those containing zero modes, cannot be reproduced in classical systems. This is because mode coupling is fundamentally different in classical mechanics. We derive classical conditions to ensure adiabaticity and demonstrate that only under these, from quantum mechanics distinct conditions the Berry phase and Wilczek-Zee matrix emerge as meaningful quantities encoding the geometry of classical adiabatic evolution.
△ Less
Submitted 15 September, 2023;
originally announced September 2023.
-
Proceedings of the XII International Workshop on Locational Analysis and Related Problems
Authors:
Marta Baldomero-Naranjo,
Víctor Blanco,
Sergio García,
Ricardo Gázquez,
Jörg Kalcsics,
Luisa I. Martínez-Merino,
Juan M. Muñoz-Ocaña,
Francisco Temprano,
Alberto Torrejón
Abstract:
The International Workshop on Locational Analysis and Related Problems will take place during September 7-8, 2023 in Edinburgh (United Kingdom). It is organized by the Spanish Location Network and the Location Group GELOCA from the Spanish Society of Statistics and Operations Research (SEIO). The Spanish Location Network is a group of more than 140 researchers from several Spanish universities org…
▽ More
The International Workshop on Locational Analysis and Related Problems will take place during September 7-8, 2023 in Edinburgh (United Kingdom). It is organized by the Spanish Location Network and the Location Group GELOCA from the Spanish Society of Statistics and Operations Research (SEIO). The Spanish Location Network is a group of more than 140 researchers from several Spanish universities organized into 7 thematic groups. The Network has been funded by the Spanish Government since 2003. The current project is RED2022-134149-T. One of the main activities of the Network is a yearly meeting aimed at promoting the communication among its members and between them and other researchers, and to contribute to the development of the location field and related problems. As a proof of the internationalization of this research group, this will be the first time that the meeting is held out of Spain. The topics of interest are location analysis and related problems. This includes location models, networks, transportation, logistics, exact and heuristic solution methods, and computational geometry, among others.
△ Less
Submitted 5 October, 2023; v1 submitted 15 September, 2023;
originally announced September 2023.
-
An update on site search activities for SWGO
Authors:
M. Santander,
U. Barres de Almeida,
J. A. Bellido,
T. Bulik,
C. Dib,
B. Dingus,
S. Garcia,
F. Guarino,
P. Huentemeyer,
D. Mandat,
E. Meza,
L. Mendes,
L. Nellen,
C. Ocampo,
L. Otiniano,
E. Quispe,
A. Reisenegger,
A. C. Rovero,
F. Sanchez,
A. Sandoval,
R. Yanyachi,
H. Zhou
Abstract:
The Southern Wide-field Gamma-ray Observatory (SWGO) is a project by scientists and engineers from 14 countries and 78 institutions to design and build the first wide-field, ground-based gamma-ray observatory in the Southern Hemisphere, with high duty cycle and covering an energy range rom hundreds of GeV to the PeV scale. The observatory will cover the Southern sky and aims to map the Galaxy's la…
▽ More
The Southern Wide-field Gamma-ray Observatory (SWGO) is a project by scientists and engineers from 14 countries and 78 institutions to design and build the first wide-field, ground-based gamma-ray observatory in the Southern Hemisphere, with high duty cycle and covering an energy range rom hundreds of GeV to the PeV scale. The observatory will cover the Southern sky and aims to map the Galaxy's large-scale emission, as well as detecting transient and variable phenomena. The host sites under consideration are at a minimum altitude of 4400 m.a.s.l. and comprise two types: flat plateaus of at least 1 km$^{2}$ for the installation of an array of tank-based water Cherenkov detectors (WCD), or large natural lakes for the direct deployment of WCD units. Four South American countries proposed excellent sites to host the observatory meeting these requirements. Argentina proposed two locations in the Salta province, Bolivia presented one site in Chacaltaya, Chile two locations within the Atacama Astronomical Park, and Peru two ground-based locations in the Arequipa district as well as lakes in the Cuzco region. The SWGO collaboration is currently conducting a site characterization study, gathering all the necessary information for site shortlisting and final site selection by the end of 2023. The process has reached the shortlisting phase, in which primary and backup sites for each country have been identified. The primary sites were visited by a team of experts from the collaboration, to investigate and validate the proposed site characteristics. Here we present an update on these site selection activities.
△ Less
Submitted 6 September, 2023;
originally announced September 2023.
-
LLM in the Shell: Generative Honeypots
Authors:
Muris Sladić,
Veronica Valeros,
Carlos Catania,
Sebastian Garcia
Abstract:
Honeypots are essential tools in cybersecurity. However, most of them (even the high-interaction ones) lack the required realism to engage and fool human attackers. This limitation makes them easily discernible, hindering their effectiveness. This work introduces a novel method to create dynamic and realistic software honeypots based on Large Language Models. Preliminary results indicate that LLMs…
▽ More
Honeypots are essential tools in cybersecurity. However, most of them (even the high-interaction ones) lack the required realism to engage and fool human attackers. This limitation makes them easily discernible, hindering their effectiveness. This work introduces a novel method to create dynamic and realistic software honeypots based on Large Language Models. Preliminary results indicate that LLMs can create credible and dynamic honeypots capable of addressing important limitations of previous honeypots, such as deterministic responses, lack of adaptability, etc. We evaluated the realism of each command by conducting an experiment with human attackers who needed to say if the answer from the honeypot was fake or not. Our proposed honeypot, called shelLM, reached an accuracy of 0.92. The source code and prompts necessary for replicating the experiments have been made publicly available.
△ Less
Submitted 9 February, 2024; v1 submitted 31 August, 2023;
originally announced September 2023.
-
The Power of MEME: Adversarial Malware Creation with Model-Based Reinforcement Learning
Authors:
Maria Rigaki,
Sebastian Garcia
Abstract:
Due to the proliferation of malware, defenders are increasingly turning to automation and machine learning as part of the malware detection tool-chain. However, machine learning models are susceptible to adversarial attacks, requiring the testing of model and product robustness. Meanwhile, attackers also seek to automate malware generation and evasion of antivirus systems, and defenders try to gai…
▽ More
Due to the proliferation of malware, defenders are increasingly turning to automation and machine learning as part of the malware detection tool-chain. However, machine learning models are susceptible to adversarial attacks, requiring the testing of model and product robustness. Meanwhile, attackers also seek to automate malware generation and evasion of antivirus systems, and defenders try to gain insight into their methods. This work proposes a new algorithm that combines Malware Evasion and Model Extraction (MEME) attacks. MEME uses model-based reinforcement learning to adversarially modify Windows executable binary samples while simultaneously training a surrogate model with a high agreement with the target model to evade. To evaluate this method, we compare it with two state-of-the-art attacks in adversarial malware creation, using three well-known published models and one antivirus product as targets. Results show that MEME outperforms the state-of-the-art methods in terms of evasion capabilities in almost all cases, producing evasive malware with an evasion rate in the range of 32-73%. It also produces surrogate models with a prediction label agreement with the respective target models between 97-99%. The surrogate could be used to fine-tune and improve the evasion rate in the future.
△ Less
Submitted 31 August, 2023;
originally announced August 2023.
-
Conti Inc.: Understanding the Internal Discussions of a large Ransomware-as-a-Service Operator with Machine Learning
Authors:
Estelle Ruellan,
Masarah Paquet-Clouston,
Sebastian Garcia
Abstract:
Ransomware-as-a-service (RaaS) is increasing the scale and complexity of ransomware attacks. Understanding the internal operations behind RaaS has been a challenge due to the illegality of such activities. The recent chat leak of the Conti RaaS operator, one of the most infamous ransomware operators on the international scene, offers a key opportunity to better understand the inner workings of suc…
▽ More
Ransomware-as-a-service (RaaS) is increasing the scale and complexity of ransomware attacks. Understanding the internal operations behind RaaS has been a challenge due to the illegality of such activities. The recent chat leak of the Conti RaaS operator, one of the most infamous ransomware operators on the international scene, offers a key opportunity to better understand the inner workings of such organizations. This paper analyzes the main topic discussions in the Conti chat leak using machine learning techniques such as Natural Language Processing (NLP) and Latent Dirichlet Allocation (LDA), as well as visualization strategies. Five discussion topics are found: 1) Business, 2) Technical, 3) Internal tasking/Management, 4) Malware, and 5) Customer Service/Problem Solving. Moreover, the distribution of topics among Conti members shows that only 4% of individuals have specialized discussions while almost all individuals (96%) are all-rounders, meaning that their discussions revolve around the five topics. The results also indicate that a significant proportion of Conti discussions are non-tech related. This study thus highlights that running such large RaaS operations requires a workforce skilled beyond technical abilities, with individuals involved in various tasks, from management to customer service or problem solving. The discussion topics also show that the organization behind the Conti RaaS oper5086933ator shares similarities with a large firm. We conclude that, although RaaS represents an example of specialization in the cybercrime industry, only a few members are specialized in one topic, while the rest runs and coordinates the RaaS operation.
△ Less
Submitted 30 August, 2023;
originally announced August 2023.
-
Pulse shape discrimination for the CONUS experiment in the keV and sub-keV regime
Authors:
H. Bonet,
A. Bonhomme,
C. Buck,
K. Fülber,
J. Hakenmüller,
J. Hempfling,
J. Henrichs,
G. Heusser,
M. Lindner,
W. Maneschg,
T. Rink,
E. Sanchez Garcia,
J. Stauber,
H. Strecker,
R. Wink
Abstract:
Point-contact p-type high-purity germanium detectors (PPC HPGe) are particularly suited for detection of sub-keV nuclear recoils from coherent elastic scattering of neutrinos or light dark matter particles. While these particles are expected to interact homogeneously in the entire detector volume, specific classes of external background radiation preferably deposit their energy close to the semi-a…
▽ More
Point-contact p-type high-purity germanium detectors (PPC HPGe) are particularly suited for detection of sub-keV nuclear recoils from coherent elastic scattering of neutrinos or light dark matter particles. While these particles are expected to interact homogeneously in the entire detector volume, specific classes of external background radiation preferably deposit their energy close to the semi-active detector surface, in which diffusion processes dominate that subsequently lead to slower rising pulses compared to the ones from the fully active bulk volume. Dedicated studies of their shape are therefore highly beneficial for the understanding and the rejection of these unwanted events. This article reports about the development of a data-driven pulse shape discrimination (PSD) method for the four 1 kg size PPC HPGe detectors of the CONUS experiment in the keV and sub-keV regime down to 210 eV$_{\text{ee}}$. The impact of the electronic noise at such low energies is carefully examined. It is shown that for an acceptance of 90% of the faster signal-like pulses from the bulk volume, approx. 50% of the surface events can be rejected at the energy threshold and that their contribution is fully suppressed above 800 eV$_{\text{ee}}$. Applied to the CONUS background data, such a PSD rejection cut allows to achieve an overall (15-25)% reduction of the total background budget. The new method allows to improve the sensitivity of future CONUS analyses and to refine the corresponding background model in the sub-keV energy region.
△ Less
Submitted 9 February, 2024; v1 submitted 23 August, 2023;
originally announced August 2023.
-
Out of the Cage: How Stochastic Parrots Win in Cyber Security Environments
Authors:
Maria Rigaki,
Ondřej Lukáš,
Carlos A. Catania,
Sebastian Garcia
Abstract:
Large Language Models (LLMs) have gained widespread popularity across diverse domains involving text generation, summarization, and various natural language processing tasks. Despite their inherent limitations, LLM-based designs have shown promising capabilities in planning and navigating open-world scenarios. This paper introduces a novel application of pre-trained LLMs as agents within cybersecu…
▽ More
Large Language Models (LLMs) have gained widespread popularity across diverse domains involving text generation, summarization, and various natural language processing tasks. Despite their inherent limitations, LLM-based designs have shown promising capabilities in planning and navigating open-world scenarios. This paper introduces a novel application of pre-trained LLMs as agents within cybersecurity network environments, focusing on their utility for sequential decision-making processes.
We present an approach wherein pre-trained LLMs are leveraged as attacking agents in two reinforcement learning environments. Our proposed agents demonstrate similar or better performance against state-of-the-art agents trained for thousands of episodes in most scenarios and configurations. In addition, the best LLM agents perform similarly to human testers of the environment without any additional training process. This design highlights the potential of LLMs to efficiently address complex decision-making tasks within cybersecurity.
Furthermore, we introduce a new network security environment named NetSecGame. The environment is designed to eventually support complex multi-agent scenarios within the network security domain. The proposed environment mimics real network attacks and is designed to be highly modular and adaptable for various scenarios.
△ Less
Submitted 28 August, 2023; v1 submitted 23 August, 2023;
originally announced August 2023.
-
Two-phase almost minimizers for a fractional free boundary problem
Authors:
Mark Allen,
Mariana Smit Vega Garcia
Abstract:
In this paper, we study almost minimizers to a fractional Alt-Caffarelli-Friedman type functional. Our main results concern the optimal $C^{0,s}$ regularity of almost minimizers as well as the structure of the free boundary. We first prove that the two free boundaries $F^+(u)=\partial\{u(\cdot,0)>0\}$ and $F^-(u)=\partial\{u(\cdot,0)<0\}$ cannot touch, that is, $F^+(u)\cap F^-(u)=\emptyset$. Lastl…
▽ More
In this paper, we study almost minimizers to a fractional Alt-Caffarelli-Friedman type functional. Our main results concern the optimal $C^{0,s}$ regularity of almost minimizers as well as the structure of the free boundary. We first prove that the two free boundaries $F^+(u)=\partial\{u(\cdot,0)>0\}$ and $F^-(u)=\partial\{u(\cdot,0)<0\}$ cannot touch, that is, $F^+(u)\cap F^-(u)=\emptyset$. Lastly, we prove a flatness implies $C^{1,γ}$ result for the free boundary.
△ Less
Submitted 28 February, 2024; v1 submitted 11 August, 2023;
originally announced August 2023.
-
Spin-valley locking in Kekulé-distorted graphene with Dirac-Rashba interactions
Authors:
David A. Ruiz-Tijerina,
Jesús Arturo Sánchez-Sánchez,
Ramon Carrillo-Bastos,
Santiago Galván y García,
Francisco Mireles
Abstract:
The joint effects of Kekulé lattice distortions and Rashba-type spin-orbit coupling on the electronic properties of graphene are explored. We modeled the position dependence of the Rashba energy term in a manner that allows its seamless integration into the scheme introduced by Gamayun et al.[New J. Phys. 20, 023016 (2018)] to describe graphene with Kekulé lattice distortion. Particularly for the…
▽ More
The joint effects of Kekulé lattice distortions and Rashba-type spin-orbit coupling on the electronic properties of graphene are explored. We modeled the position dependence of the Rashba energy term in a manner that allows its seamless integration into the scheme introduced by Gamayun et al.[New J. Phys. 20, 023016 (2018)] to describe graphene with Kekulé lattice distortion. Particularly for the Kekulé-Y texture, the effective low energy Dirac Hamiltonian contains a new spin-valley locking term, in addition to the well-known Rashba-induced momentum-pseudospin and spin-pseudospin couplings, and the Kekulé-induced momentum-valley coupling term. We report on the low-energy band structure and Landau level spectra of Rashba-spin-orbit-coupled Kek-Y graphene, and propose an experimental scheme to discern between the presence of Rashba spin-orbit coupling, Kek-Y lattice distortion, and both, based on do**-dependent magnetotransport measurements.
△ Less
Submitted 4 August, 2023;
originally announced August 2023.
-
A neutrino window to scalar leptoquarks: from low energy to colliders
Authors:
Valentina De Romeri,
Victor Martin Lozano,
G. Sanchez Garcia
Abstract:
Leptoquarks are theorized particles of either scalar or vector nature that couple simultaneously to quarks and leptons. Motivated by recent measurements of coherent elastic neutrino-nucleus scattering, we consider the impact of scalar leptoquarks coupling to neutrinos on a few complementary processes, from low energy to colliders. In particular, we set competitive constraints on the typical mass a…
▽ More
Leptoquarks are theorized particles of either scalar or vector nature that couple simultaneously to quarks and leptons. Motivated by recent measurements of coherent elastic neutrino-nucleus scattering, we consider the impact of scalar leptoquarks coupling to neutrinos on a few complementary processes, from low energy to colliders. In particular, we set competitive constraints on the typical mass and coupling of scalar leptoquarks by analyzing recent COHERENT data. We compare these constraints with bounds from atomic parity violation experiments, deep inelastic neutrino-nucleon scattering and LHC data. Our results highlight a strong complementarity between different facilities and demonstrate the compelling power of coherent elastic neutrino-nucleus scattering experiments to probe leptoquark masses in the MeV-GeV range. Finally, we also present prospects for improving current bounds with future upgrades of the COHERENT detectors and the planned European Spallation Source.
△ Less
Submitted 28 May, 2024; v1 submitted 25 July, 2023;
originally announced July 2023.
-
Reading Between the Lanes: Text VideoQA on the Road
Authors:
George Tom,
Minesh Mathew,
Sergi Garcia,
Dimosthenis Karatzas,
C. V. Jawahar
Abstract:
Text and signs around roads provide crucial information for drivers, vital for safe navigation and situational awareness. Scene text recognition in motion is a challenging problem, while textual cues typically appear for a short time span, and early detection at a distance is necessary. Systems that exploit such information to assist the driver should not only extract and incorporate visual and te…
▽ More
Text and signs around roads provide crucial information for drivers, vital for safe navigation and situational awareness. Scene text recognition in motion is a challenging problem, while textual cues typically appear for a short time span, and early detection at a distance is necessary. Systems that exploit such information to assist the driver should not only extract and incorporate visual and textual cues from the video stream but also reason over time. To address this issue, we introduce RoadTextVQA, a new dataset for the task of video question answering (VideoQA) in the context of driver assistance. RoadTextVQA consists of $3,222$ driving videos collected from multiple countries, annotated with $10,500$ questions, all based on text or road signs present in the driving videos. We assess the performance of state-of-the-art video question answering models on our RoadTextVQA dataset, highlighting the significant potential for improvement in this domain and the usefulness of the dataset in advancing research on in-vehicle support systems and text-aware multimodal question answering. The dataset is available at http://cvit.iiit.ac.in/research/projects/cvit-projects/roadtextvqa
△ Less
Submitted 8 July, 2023;
originally announced July 2023.
-
Symmetric and Antisymmetric Tensor Products for the Function-Theoretic Operator Theorist
Authors:
Stephan Ramon Garcia,
Ryan O'Loughlin,
Jiahui Yu
Abstract:
We study symmetric and antisymmetric tensor products of Hilbert-space operators, focusing on norms and spectra for some well-known classes favored by function-theoretic operator theorists. We pose many open questions that should interest the field.
We study symmetric and antisymmetric tensor products of Hilbert-space operators, focusing on norms and spectra for some well-known classes favored by function-theoretic operator theorists. We pose many open questions that should interest the field.
△ Less
Submitted 26 June, 2023;
originally announced June 2023.
-
Methodology for generating synthetic labeled datasets for visual container inspection
Authors:
Guillem Delgado,
Andoni Cortés,
Sara García,
Estíbaliz Loyo,
Maialen Berasategi,
Nerea Aranjuelo
Abstract:
Nowadays, containerized freight transport is one of the most important transportation systems that is undergoing an automation process due to the Deep Learning success. However, it suffers from a lack of annotated data in order to incorporate state-of-the-art neural network models to its systems. In this paper we present an innovative methodology to generate a realistic, varied, balanced, and la…
▽ More
Nowadays, containerized freight transport is one of the most important transportation systems that is undergoing an automation process due to the Deep Learning success. However, it suffers from a lack of annotated data in order to incorporate state-of-the-art neural network models to its systems. In this paper we present an innovative methodology to generate a realistic, varied, balanced, and labelled dataset for visual inspection task of containers in a dock environment. In addition, we validate this methodology with multiple visual tasks recurrently found in the state of the art. We prove that the generated synthetic labelled dataset allows to train a deep neural network that can be used in a real world scenario. On the other side, using this methodology we provide the first open synthetic labelled dataset called SeaFront available in: https://datasets.vicomtech.org/di21-seafront/readme.txt.
△ Less
Submitted 26 June, 2023;
originally announced June 2023.
-
Mesas of Stirling permutations
Authors:
Nicolle González,
Pamela E. Harris,
Gordon Rojas Kirby,
Mariana Smit Vega Garcia,
Bridget Eileen Tenner
Abstract:
Given a Stirling permutation w, we introduce the mesa set of w as the natural generalization of the pinnacle set of a permutation. Our main results characterize admissible mesa sets and give closed enumerative formulas in terms of rational Catalan numbers by providing an explicit bijection between mesa sets and rational Dyck paths.
Given a Stirling permutation w, we introduce the mesa set of w as the natural generalization of the pinnacle set of a permutation. Our main results characterize admissible mesa sets and give closed enumerative formulas in terms of rational Catalan numbers by providing an explicit bijection between mesa sets and rational Dyck paths.
△ Less
Submitted 21 June, 2023;
originally announced June 2023.
-
A primal-dual data-driven method for computational optical imaging with a photonic lantern
Authors:
Carlos Santos Garcia,
Mathilde Larchevêque,
Solal O'Sullivan,
Martin Van Waerebeke,
Robert R. Thomson,
Audrey Repetti,
Jean-Christophe Pesquet
Abstract:
Optical fibres aim to image in-vivo biological processes. In this context, high spatial resolution and stability to fibre movements are key to enable decision-making processes (e.g., for microendoscopy). Recently, a single-pixel imaging technique based on a multicore fibre photonic lantern has been designed, named computational optical imaging using a lantern (COIL). A proximal algorithm based on…
▽ More
Optical fibres aim to image in-vivo biological processes. In this context, high spatial resolution and stability to fibre movements are key to enable decision-making processes (e.g., for microendoscopy). Recently, a single-pixel imaging technique based on a multicore fibre photonic lantern has been designed, named computational optical imaging using a lantern (COIL). A proximal algorithm based on a sparsity prior, dubbed SARA-COIL, has been further proposed to solve the associated inverse problem, to enable image reconstructions for high resolution COIL microendoscopy. In this work, we develop a data-driven approach for COIL. We replace the sparsity prior in the proximal algorithm by a learned denoiser, leading to a plug-and-play (PnP) algorithm. The resulting PnP method, based on a proximal primal-dual algorithm, enables to solve the Morozov formulation of the inverse problem. We use recent results in learning theory to train a network with desirable Lipschitz properties, and we show that the resulting primal-dual PnP algorithm converges to a solution to a monotone inclusion problem. Our simulations highlight that the proposed data-driven approach improves the reconstruction quality over variational SARA-COIL method on both simulated and real data.
△ Less
Submitted 17 April, 2024; v1 submitted 20 June, 2023;
originally announced June 2023.
-
The Reciprocal Schur Inequality
Authors:
Albrecht Boettcher,
Stephan Ramon Garcia,
Mishko Mitkovski
Abstract:
Schur's inequality states that the sum of three special terms is always nonnegative. This note is a short review of inequalities for the sum of the reciprocals of these terms and of extensions of the latter inequalities to an arbitrary number of terms and thus to higher-order divided differences.
Schur's inequality states that the sum of three special terms is always nonnegative. This note is a short review of inequalities for the sum of the reciprocals of these terms and of extensions of the latter inequalities to an arbitrary number of terms and thus to higher-order divided differences.
△ Less
Submitted 16 June, 2023; v1 submitted 6 June, 2023;
originally announced June 2023.
-
Learning to Do or Learning While Doing: Reinforcement Learning and Bayesian Optimisation for Online Continuous Tuning
Authors:
Jan Kaiser,
Chenran Xu,
Annika Eichler,
Andrea Santamaria Garcia,
Oliver Stein,
Erik Bründermann,
Willi Kuropka,
Hannes Dinter,
Frank Mayet,
Thomas Vinatier,
Florian Burkart,
Holger Schlarb
Abstract:
Online tuning of real-world plants is a complex optimisation problem that continues to require manual intervention by experienced human operators. Autonomous tuning is a rapidly expanding field of research, where learning-based methods, such as Reinforcement Learning-trained Optimisation (RLO) and Bayesian optimisation (BO), hold great promise for achieving outstanding plant performance and reduci…
▽ More
Online tuning of real-world plants is a complex optimisation problem that continues to require manual intervention by experienced human operators. Autonomous tuning is a rapidly expanding field of research, where learning-based methods, such as Reinforcement Learning-trained Optimisation (RLO) and Bayesian optimisation (BO), hold great promise for achieving outstanding plant performance and reducing tuning times. Which algorithm to choose in different scenarios, however, remains an open question. Here we present a comparative study using a routine task in a real particle accelerator as an example, showing that RLO generally outperforms BO, but is not always the best choice. Based on the study's results, we provide a clear set of criteria to guide the choice of algorithm for a given tuning task. These can ease the adoption of learning-based autonomous tuning solutions to the operation of complex real-world plants, ultimately improving the availability and pushing the limits of operability of these facilities, thereby enabling scientific and engineering advancements.
△ Less
Submitted 6 June, 2023;
originally announced June 2023.
-
Attacker Profiling Through Analysis of Attack Patterns in Geographically Distributed Honeypots
Authors:
Veronica Valeros,
Maria Rigaki,
Sebastian Garcia
Abstract:
Honeypots are a well-known and widely used technology in the cybersecurity community, where it is assumed that placing honeypots in different geographical locations provides better visibility and increases effectiveness. However, how geolocation affects the usefulness of honeypots is not well-studied, especially for threat intelligence as early warning systems. This paper examines attack patterns…
▽ More
Honeypots are a well-known and widely used technology in the cybersecurity community, where it is assumed that placing honeypots in different geographical locations provides better visibility and increases effectiveness. However, how geolocation affects the usefulness of honeypots is not well-studied, especially for threat intelligence as early warning systems. This paper examines attack patterns in a large public dataset of geographically distributed honeypots by answering methodological questions and creating behavioural profiles of attackers. Results show that the location of honeypots helps identify attack patterns and build profiles for the attackers. We conclude that not all the intelligence collected from geographically distributed honeypots is equally valuable and that a good early warning system against resourceful attackers may be built with only two distributed honeypots and a production server.
△ Less
Submitted 2 May, 2023;
originally announced May 2023.