Search | arXiv e-print repository

arXiv:2407.08745 [pdf, other]

Evolutionary Computation for the Design and Enrichment of General-Purpose Artificial Intelligence Systems: Survey and Prospects

Authors: Javier Poyatos, Javier Del Ser, Salvador Garcia, Hisao Ishibuchi, Daniel Molina, Isaac Triguero, Bing Xue, Xin Yao, Francisco Herrera

Abstract: In Artificial Intelligence, there is an increasing demand for adaptive models capable of dealing with a diverse spectrum of learning tasks, surpassing the limitations of systems devised to cope with a single task. The recent emergence of General-Purpose Artificial Intelligence Systems (GPAIS) poses model configuration and adaptability challenges at far greater complexity scales than the optimal de… ▽ More In Artificial Intelligence, there is an increasing demand for adaptive models capable of dealing with a diverse spectrum of learning tasks, surpassing the limitations of systems devised to cope with a single task. The recent emergence of General-Purpose Artificial Intelligence Systems (GPAIS) poses model configuration and adaptability challenges at far greater complexity scales than the optimal design of traditional Machine Learning models. Evolutionary Computation (EC) has been a useful tool for both the design and optimization of Machine Learning models, endowing them with the capability to configure and/or adapt themselves to the task under consideration. Therefore, their application to GPAIS is a natural choice. This paper aims to analyze the role of EC in the field of GPAIS, exploring the use of EC for their design or enrichment. We also match GPAIS properties to Machine Learning areas in which EC has had a notable contribution, highlighting recent milestones of EC for GPAIS. Furthermore, we discuss the challenges of harnessing the benefits of EC for GPAIS, presenting different strategies to both design and improve GPAIS with EC, covering tangential areas, identifying research niches, and outlining potential research directions for EC and GPAIS. △ Less

Submitted 3 June, 2024; originally announced July 2024.

arXiv:2407.03560 [pdf, ps, other]

Numerical semigroups from rational matrices I: power-integral matrices and nilpotent representations

Authors: Arsh Chhabra, Stephan Ramon Garcia, Fangqian Zhang, Hechun Zhang

Abstract: Our aim in this paper is to initiate the study of exponent semigroups for rational matrices. We prove that every numerical semigroup is the exponent semigroup of some rational matrix. We also obtain lower bounds on the size of such matrices and discuss the related class of power-integral matrices. Our aim in this paper is to initiate the study of exponent semigroups for rational matrices. We prove that every numerical semigroup is the exponent semigroup of some rational matrix. We also obtain lower bounds on the size of such matrices and discuss the related class of power-integral matrices. △ Less

Submitted 3 July, 2024; originally announced July 2024.

Comments: 13 pages

arXiv:2407.03368 [pdf, other]

Predict. Optimize. Revise. On Forecast and Policy Stability in Energy Management Systems

Authors: Evgenii Genov, Julian Ruddick, Christoph Bergmeir, Majid Vafaeipour, Thierry Coosemans, Salvador Garcia, Maarten Messagie

Abstract: This research addresses the challenge of integrating forecasting and optimization in energy management systems, focusing on the impacts of switching costs, forecast accuracy, and stability. It proposes a novel framework for analyzing online optimization problems with switching costs and enabled by deterministic and probabilistic forecasts. Through empirical evaluation and theoretical analysis, the… ▽ More This research addresses the challenge of integrating forecasting and optimization in energy management systems, focusing on the impacts of switching costs, forecast accuracy, and stability. It proposes a novel framework for analyzing online optimization problems with switching costs and enabled by deterministic and probabilistic forecasts. Through empirical evaluation and theoretical analysis, the research reveals the balance between forecast accuracy, stability, and switching costs in sha** policy performance. Conducted in the context of battery scheduling within energy management applications, it introduces a metric for evaluating probabilistic forecast stability and examines the effects of forecast accuracy and stability on optimization outcomes using the real-world case of the Citylearn 2022 competition. Findings indicate that switching costs significantly influence the trade-off between forecast accuracy and stability, highlighting the importance of integrated systems that enable collaboration between forecasting and operational units for improved decision-making. The study shows that committing to a policy for longer periods can be advantageous over frequent updates. Results also show a correlation between forecast stability and policy performance, suggesting that stable forecasts can mitigate switching costs. The proposed framework provides valuable insights for energy sector decision-makers and forecast practitioners when designing the operation of an energy management system. △ Less

Submitted 11 July, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

Comments: 14 pages, contains the Online Appendix with a comment on KPIs, MPC formulation, Theoretical analysis of the MPC performance bounds and extra results on the in-sample performance

arXiv:2407.00861 [pdf, other]

Enantiospecificity in NMR Enabled by Chirality-Induced Spin Selectivity

Authors: T. Georgiou, J. L. Palma, V. Mujica, S. Varela, M. Galante, V. Santamarıa Garcıa, L. Mboning, R. N. Schwartz, G. Cuniberti, L. -S. Bouchard

Abstract: Spin polarization in chiral molecules is a magnetic molecular response associated with electron transport and enantioselective bond polarization that occurs even in the absence of an external magnetic field. An unexpected finding by Santos and co-workers reported enantiospecific NMR responses in solid-state cross-polarization (CP) experiments, suggesting a possible additional contribution to the i… ▽ More Spin polarization in chiral molecules is a magnetic molecular response associated with electron transport and enantioselective bond polarization that occurs even in the absence of an external magnetic field. An unexpected finding by Santos and co-workers reported enantiospecific NMR responses in solid-state cross-polarization (CP) experiments, suggesting a possible additional contribution to the indirect nuclear spin-spin coupling in chiral molecules induced by bond polarization in the presence of spin-orbit coupling. Herein we provide a theoretical treatment for this phenomenon, presenting an effective spin-Hamiltonian for helical molecules like DNA and density functional theory (DFT) results on amino acids that confirm the dependence of J-couplings on the choice of enantiomer. The connection between nuclear spin dynamics and chirality could offer insights for molecular sensing and quantum information sciences. These results establish NMR as a potential tool for chiral discrimination without external agents. △ Less

Submitted 2 July, 2024; v1 submitted 30 June, 2024; originally announced July 2024.

Comments: 102 pages, 16 figures, 40 tables

arXiv:2406.10272 [pdf, other]

Connected Speech-Based Cognitive Assessment in Chinese and English

Authors: Saturnino Luz, Sofia De La Fuente Garcia, Fasih Haider, Davida Fromm, Brian MacWhinney, Alyssa Lanzi, Ya-Ning Chang, Chia-Ju Chou, Yi-Chien Liu

Abstract: We present a novel benchmark dataset and prediction tasks for investigating approaches to assess cognitive function through analysis of connected speech. The dataset consists of speech samples and clinical information for speakers of Mandarin Chinese and English with different levels of cognitive impairment as well as individuals with normal cognition. These data have been carefully matched by age… ▽ More We present a novel benchmark dataset and prediction tasks for investigating approaches to assess cognitive function through analysis of connected speech. The dataset consists of speech samples and clinical information for speakers of Mandarin Chinese and English with different levels of cognitive impairment as well as individuals with normal cognition. These data have been carefully matched by age and sex by propensity score analysis to ensure balance and representativity in model training. The prediction tasks encompass mild cognitive impairment diagnosis and cognitive test score prediction. This framework was designed to encourage the development of approaches to speech-based cognitive assessment which generalise across languages. We illustrate it by presenting baseline prediction models that employ language-agnostic and comparable features for diagnosis and cognitive test score prediction. The models achieved unweighted average recall was 59.2% in diagnosis, and root mean squared error of 2.89 in score prediction. △ Less

Submitted 18 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

Comments: To appear in Proceedings of Interspeech 2024

ACM Class: J.3; I.5.4

arXiv:2406.03138 [pdf, other]

A Frame-based Attention Interpretation Method for Relevant Acoustic Feature Extraction in Long Speech Depression Detection

Authors: Qingkun Deng, Saturnino Luz, Sofia de la Fuente Garcia

Abstract: Speech-based depression detection tools could help early screening of depression. Here, we address two issues that may hinder the clinical practicality of such tools: segment-level labelling noise and a lack of model interpretability. We propose a speech-level Audio Spectrogram Transformer to avoid segment-level labelling. We observe that the proposed model significantly outperforms a segment-leve… ▽ More Speech-based depression detection tools could help early screening of depression. Here, we address two issues that may hinder the clinical practicality of such tools: segment-level labelling noise and a lack of model interpretability. We propose a speech-level Audio Spectrogram Transformer to avoid segment-level labelling. We observe that the proposed model significantly outperforms a segment-level model, providing evidence for the presence of segment-level labelling noise in audio modality and the advantage of longer-duration speech analysis for depression detection. We introduce a frame-based attention interpretation method to extract acoustic features from prediction-relevant waveform signals for interpretation by clinicians. Through interpretation, we observe that the proposed model identifies reduced loudness and F0 as relevant signals of depression, which aligns with the speech characteristics of depressed patients documented in clinical studies. △ Less

Submitted 7 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

Comments: 5 pages, 3 figures. arXiv admin note: substantial text overlap with arXiv:2309.13476

arXiv:2405.07134 [pdf, other]

On the Ollivier-Ricci curvature as fragility indicator of the stock markets

Authors: Joaquín Sánchez García, Sebastian Gherghe

Abstract: Recently, an indicator for stock market fragility and crash size in terms of the Ollivier-Ricci curvature has been proposed. We study analytical and empirical properties of such indicator, test its elasticity with respect to different parameters and provide heuristics for the parameters involved. We show when and how the indicator accurately describes a financial crisis. We also propose an alterna… ▽ More Recently, an indicator for stock market fragility and crash size in terms of the Ollivier-Ricci curvature has been proposed. We study analytical and empirical properties of such indicator, test its elasticity with respect to different parameters and provide heuristics for the parameters involved. We show when and how the indicator accurately describes a financial crisis. We also propose an alternate method for calculating the indicator using a specific sub-graph with special curvature properties. △ Less

Submitted 11 May, 2024; originally announced May 2024.

arXiv:2404.14997 [pdf, other]

Mining higher-order triadic interactions

Authors: Anthony Baptista, Marta Niedostatek, Jun Yamamoto, Ben MacArthur, Jurgen Kurths, Ruben Sanchez Garcia, Ginestra Bianconi

Abstract: Complex systems often present higher-order interactions which require us to go beyond their description in terms of pairwise networks. Triadic interactions are a fundamental type of higher-order interaction that occurs when one node regulates the interaction between two other nodes. Triadic interactions are a fundamental type of higher-order networks, found in a large variety of biological systems… ▽ More Complex systems often present higher-order interactions which require us to go beyond their description in terms of pairwise networks. Triadic interactions are a fundamental type of higher-order interaction that occurs when one node regulates the interaction between two other nodes. Triadic interactions are a fundamental type of higher-order networks, found in a large variety of biological systems, from neuron-glia interactions to gene-regulation and ecosystems. However, triadic interactions have been so far mostly neglected. In this article, we propose a theoretical principle to model and mine triadic interactions from node metadata, and we apply this framework to gene expression data finding new candidates for triadic interactions relevant for Acute Myeloid Leukemia. Our work reveals important aspects of higher-order triadic interactions often ignored, which can transform our understanding of complex systems and be applied to a large variety of systems ranging from biology to the climate. △ Less

Submitted 23 April, 2024; originally announced April 2024.

arXiv:2404.01940 [pdf, other]

Towards Better Understanding of Cybercrime: The Role of Fine-Tuned LLMs in Translation

Authors: Veronica Valeros, Anna Širokova, Carlos Catania, Sebastian Garcia

Abstract: Understanding cybercrime communications is paramount for cybersecurity defence. This often involves translating communications into English for processing, interpreting, and generating timely intelligence. The problem is that translation is hard. Human translation is slow, expensive, and scarce. Machine translation is inaccurate and biased. We propose using fine-tuned Large Language Models (LLM) t… ▽ More Understanding cybercrime communications is paramount for cybersecurity defence. This often involves translating communications into English for processing, interpreting, and generating timely intelligence. The problem is that translation is hard. Human translation is slow, expensive, and scarce. Machine translation is inaccurate and biased. We propose using fine-tuned Large Language Models (LLM) to generate translations that can accurately capture the nuances of cybercrime language. We apply our technique to public chats from the NoName057(16) Russian-speaking hacktivist group. Our results show that our fine-tuned LLM model is better, faster, more accurate, and able to capture nuances of the language. Our method shows it is possible to achieve high-fidelity translations and significantly reduce costs by a factor ranging from 430 to 23,000 compared to a human translator. △ Less

Submitted 2 April, 2024; originally announced April 2024.

Comments: 9 pages, 4 figures

arXiv:2403.10314 [pdf, other]

Hunter's positivity theorem and random vector norms

Authors: Ludovick Bouthat, Ángel Chávez, Stephan Ramon Garcia

Abstract: A theorem of Hunter ensures that the complete homogeneous symmetric polynomials of even degree are positive definite functions. A probabilistic interpretation of Hunter's theorem suggests a broad generalization: the construction of so-called random vector norms on square complex matrices. This paper surveys these ideas, starting from the fundamental notions and develo** the theory to its present… ▽ More A theorem of Hunter ensures that the complete homogeneous symmetric polynomials of even degree are positive definite functions. A probabilistic interpretation of Hunter's theorem suggests a broad generalization: the construction of so-called random vector norms on square complex matrices. This paper surveys these ideas, starting from the fundamental notions and develo** the theory to its present state. We study numerous examples and present a host of open problems. △ Less

Submitted 15 March, 2024; originally announced March 2024.

Comments: 62 pages

arXiv:2403.08574 [pdf, other]

Optimizing Conical Intersections Without Explicit Use of Non-Adiabatic Couplings

Authors: Juan Sanz García, Rosa Maskri, Alexander Mitrushchenkov, Loïc Joubert-Doriol

Abstract: We present two alternative methods for optimizing minimum energy conical intersection (MECI) molecular geometries without knowledge of the derivative coupling (DC). These methods are based on the utilization of Lagrange multipliers: i) one method uses an approximate calculation of the DC, while the other ii) do not require the DC. Both methods use the fact that information of the DC is contained i… ▽ More We present two alternative methods for optimizing minimum energy conical intersection (MECI) molecular geometries without knowledge of the derivative coupling (DC). These methods are based on the utilization of Lagrange multipliers: i) one method uses an approximate calculation of the DC, while the other ii) do not require the DC. Both methods use the fact that information of the DC is contained in the Hessian of the squared energy difference. Tests done on a set of small molecular systems, in comparison with other methods, show the ability of the proposed methods to optimize MECIs. Finally, we apply the methods to the furimamide molecule, to optimize and characterize its S$_1$ /S$_2$ MECI, and to optimizing the S$_0$ /S$_1$ MECI of the silver trimer. △ Less

Submitted 13 March, 2024; originally announced March 2024.

arXiv:2403.03081 [pdf, other]

Resolving chiral transitions in Rydberg arrays with quantum Kibble-Zurek mechanism and finite-time scaling

Authors: Jose Soto Garcia, Natalia Chepiga

Abstract: The experimental realization of the quantum Kibble-Zurek mechanism in arrays of trapped Rydberg atoms has brought the problem of commensurate-incommensurate transition back into the focus of active research. Relying on equilibrium simulations of finite intervals, direct chiral transitions at the boundary of the period-3 and period-4 phases have been predicted. Here, we study how these chiral trans… ▽ More The experimental realization of the quantum Kibble-Zurek mechanism in arrays of trapped Rydberg atoms has brought the problem of commensurate-incommensurate transition back into the focus of active research. Relying on equilibrium simulations of finite intervals, direct chiral transitions at the boundary of the period-3 and period-4 phases have been predicted. Here, we study how these chiral transitions can be diagnosed experimentally with critical dynamics. We demonstrate that chiral transitions can be distinguished from the floating phases by comparing Kibble-Zurek dynamics on arrays with different numbers of atoms. Furthermore, by swee** in the opposite direction and kee** track of the order parameter, we identify the location of conformal points. Finally, combining forward and backward sweeps, we extract all critical exponents characterizing the transition. △ Less

Submitted 5 March, 2024; originally announced March 2024.

Comments: 12 pages, 16 figures

arXiv:2403.02432 [pdf, other]

On the impact of measure pre-conditionings on general parametric ML models and transfer learning via domain adaptation

Authors: Joaquín Sánchez García

Abstract: We study a new technique for understanding convergence of learning agents under small modifications of data. We show that such convergence can be understood via an analogue of Fatou's lemma which yields gamma-convergence. We show it's relevance and applications in general machine learning tasks and domain adaptation transfer learning. We study a new technique for understanding convergence of learning agents under small modifications of data. We show that such convergence can be understood via an analogue of Fatou's lemma which yields gamma-convergence. We show it's relevance and applications in general machine learning tasks and domain adaptation transfer learning. △ Less

Submitted 4 March, 2024; originally announced March 2024.

arXiv:2402.16953 [pdf, other]

Exploring the sensitivity to non-standard and generalized neutrino interactions through coherent elastic neutrino-nucleus scattering with a NaI detector

Authors: Sabya Sachi Chatterjee, Stéphane Lavignac, O. G. Miranda, G. Sanchez Garcia

Abstract: After the first observation of coherent elastic neutrino-nucleus scattering (CE$ν$NS) by the COHERENT collaboration, many efforts are being made to improve the measurement of this process, making it possible to constrain new physics in the neutrino sector. In this paper, we study the sensitivity to non-standard interactions (NSIs) and generalized neutrino interactions (GNIs) of a NaI detector with… ▽ More After the first observation of coherent elastic neutrino-nucleus scattering (CE$ν$NS) by the COHERENT collaboration, many efforts are being made to improve the measurement of this process, making it possible to constrain new physics in the neutrino sector. In this paper, we study the sensitivity to non-standard interactions (NSIs) and generalized neutrino interactions (GNIs) of a NaI detector with characteristics similar to the one that is currently being deployed at the Spallation Neutron Source at Oak Ridge National Laboratory. We show that such a detector, whose target nuclei have significantly different proton to neutron ratios (at variance with the current CsI detector), could help to partially break the parameter degeneracies arising from the interference between the Standard Model and NSI contributions to the CE$ν$NS cross section, as well as between different NSI parameters. By contrast, only a slight improvement over the current CsI constraints is expected for parameters that do not interfere with the SM contribution. We find that a significant reduction of the background level would make the NaI detector considered in this paper very efficient at breaking degeneracies among NSI parameters. △ Less

Submitted 26 February, 2024; originally announced February 2024.

Comments: 31 pages, 10 pdf figures, and 4 tables

arXiv:2402.10839 [pdf, ps, other]

Graded polynomial identities of the infinite-dimensional upper triangular matrices over an arbitrary field

Authors: Micael Said Garcia, Felipe Yukihide Yasumura

Abstract: We compute the graded polynomial identities of the infinite dimensional upper triangular matrix algebra over an arbitrary field. If the grading group is finite, we prove that the set of graded polynomial identities admits a finite basis. We find conditions under which a grading on such an algebra satisfies a nontrivial graded polynomial identity. Finally, we provide examples showing that two nonis… ▽ More We compute the graded polynomial identities of the infinite dimensional upper triangular matrix algebra over an arbitrary field. If the grading group is finite, we prove that the set of graded polynomial identities admits a finite basis. We find conditions under which a grading on such an algebra satisfies a nontrivial graded polynomial identity. Finally, we provide examples showing that two nonisomorphic gradings can have the same set of graded polynomial identities. △ Less

Submitted 16 February, 2024; originally announced February 2024.

arXiv:2402.06315 [pdf, other]

Multisource Semisupervised Adversarial Domain Generalization Network for Cross-Scene Sea-Land Clutter Classification

Authors: Xiaoxuan Zhang, Quan Pan, Salvador García

Abstract: Deep learning (DL)-based sea\textendash land clutter classification for sky-wave over-the-horizon-radar (OTHR) has become a novel research topic. In engineering applications, real-time predictions of sea\textendash land clutter with existing distribution discrepancies are crucial. To solve this problem, this article proposes a novel Multisource Semisupervised Adversarial Domain Generalization Netw… ▽ More Deep learning (DL)-based sea\textendash land clutter classification for sky-wave over-the-horizon-radar (OTHR) has become a novel research topic. In engineering applications, real-time predictions of sea\textendash land clutter with existing distribution discrepancies are crucial. To solve this problem, this article proposes a novel Multisource Semisupervised Adversarial Domain Generalization Network (MSADGN) for cross-scene sea\textendash land clutter classification. MSADGN can extract domain-invariant and domain-specific features from one labeled source domain and multiple unlabeled source domains, and then generalize these features to an arbitrary unseen target domain for real-time prediction of sea\textendash land clutter. Specifically, MSADGN consists of three modules: domain-related pseudolabeling module, domain-invariant module, and domain-specific module. The first module introduces an improved pseudolabel method called domain-related pseudolabel, which is designed to generate reliable pseudolabels to fully exploit unlabeled source domains. The second module utilizes a generative adversarial network (GAN) with a multidiscriminator to extract domain-invariant features, to enhance the model's transferability in the target domain. The third module employs a parallel multiclassifier branch to extract domain-specific features, to enhance the model's discriminability in the target domain. The effectiveness of our method is validated in twelve domain generalizations (DG) scenarios. Meanwhile, we selected 10 state-of-the-art DG methods for comparison. The experimental results demonstrate the superiority of our method. △ Less

Submitted 9 March, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

Comments: 15 pages, 8 figures, 4 tables

arXiv:2402.00216 [pdf]

doi 10.1088/1361-6528/abe2c7

Structural and optical properties of self-assembled AlN nanowires grown on SiO2/Si substrates by molecular beam epitaxy

Authors: Ž. Gačević, J. Grandal, Q. Guo, R. Kirste, M. Varela, Z. Sitar, M. A. Sánchez García

Abstract: Self assembled AlN nanowires (NWs) are grown by plasma assisted molecular beam epitaxy (PAMBE) on SiO2 / Si (111) substrates. Using a combination of in-situ reflective high energy electron diffraction and ex situ X ray diffraction (XRD), we show that the NWs grow nearly strain free, preferentially perpendicular to the amorphous SiO2 interlayer and without epitaxial relationship to Si(111) substrat… ▽ More Self assembled AlN nanowires (NWs) are grown by plasma assisted molecular beam epitaxy (PAMBE) on SiO2 / Si (111) substrates. Using a combination of in-situ reflective high energy electron diffraction and ex situ X ray diffraction (XRD), we show that the NWs grow nearly strain free, preferentially perpendicular to the amorphous SiO2 interlayer and without epitaxial relationship to Si(111) substrate, as expected. Scanning electron microscopy investigation reveals significant NWs coalescence, which results in their progressively increasing diameter and formation of columnar structures with non hexagonal cross section. Making use of scanning transmission electron microscopy (STEM), the NWs initial diameters are found in the 20 to 30 nm range. In addition, the formation of a thin (30 nm) polycrystalline AlN layer is observed on the substrate surface. Regarding the structural quality of the AlN NWs, STEM measurements reveal the formation of extended columnar regions, which grow with a virtually perfect metal-polarity wurtzite arrangement and with extended defects only sporadically observed. Combination of STEM and electron energy loss spectroscopy (EELS) reveals the formation of continuous aluminum oxide (1 to 2 nm) on the NW surface. Low temperature photoluminescence measurements reveal a single near band edge (NBE) emission peak, positioned at 6.03 eV (at 2 K), a value consistent with nearly zero NW strain evidenced by XRD and in agreement with the values obtained on AlN bulk layers synthesized by other growth techniques. The significant full width at half maximum of NBE emission, found at 20 meV (at 2 K), suggests that free and bound excitons are mixed together within this single emission band. △ Less

Submitted 31 January, 2024; originally announced February 2024.

Comments: 9 pages, 5 figures

Journal ref: Nanotechnology 32 (2021) 195601

arXiv:2401.07684 [pdf, other]

Final CONUS results on coherent elastic neutrino nucleus scattering at the Brokdorf reactor

Authors: N. Ackermann, H. Bonet, A. Bonhomme, C. Buck, K. Fülber, J. Hakenmüller, J. Hempfling, J. Henrichs, G. Heusser, M. Lindner, W. Maneschg, T. Rink, E. Sanchez Garcia, J. Stauber, H. Strecker, R. Wink

Abstract: The CONUS experiment studies coherent elastic neutrino nucleus scattering in four 1 kg germanium spectrometers. Low ionization energy thresholds of 210 eV were achieved. The detectors were operated inside an optimized shield at the Brokdorf nuclear power plant which provided a reactor antineutrino flux of up to $2.3\cdot10^{13}$ cm$^{-2}$s$^{-1}$. In the final phase of data collection at this site… ▽ More The CONUS experiment studies coherent elastic neutrino nucleus scattering in four 1 kg germanium spectrometers. Low ionization energy thresholds of 210 eV were achieved. The detectors were operated inside an optimized shield at the Brokdorf nuclear power plant which provided a reactor antineutrino flux of up to $2.3\cdot10^{13}$ cm$^{-2}$s$^{-1}$. In the final phase of data collection at this site, the constraints on the neutrino interaction rate were improved by an order of magnitude as compared to the previous CONUS analysis. The new limit of less than 0.34 signal events kg$^{-1}$d$^{-1}$ is within a factor 2 of the rate predicted by the Standard Model. △ Less

Submitted 5 April, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

Comments: 8 figures, 4 tables

arXiv:2401.06019 [pdf, other]

doi 10.1117/12.2679734

Automatic UAV-based Airport Pavement Inspection Using Mixed Real and Virtual Scenarios

Authors: Pablo Alonso, Jon Ander Iñiguez de Gordoa, Juan Diego Ortega, Sara García, Francisco Javier Iriarte, Marcos Nieto

Abstract: Runway and taxiway pavements are exposed to high stress during their projected lifetime, which inevitably leads to a decrease in their condition over time. To make sure airport pavement condition ensure uninterrupted and resilient operations, it is of utmost importance to monitor their condition and conduct regular inspections. UAV-based inspection is recently gaining importance due to its wide ra… ▽ More Runway and taxiway pavements are exposed to high stress during their projected lifetime, which inevitably leads to a decrease in their condition over time. To make sure airport pavement condition ensure uninterrupted and resilient operations, it is of utmost importance to monitor their condition and conduct regular inspections. UAV-based inspection is recently gaining importance due to its wide range monitoring capabilities and reduced cost. In this work, we propose a vision-based approach to automatically identify pavement distress using images captured by UAVs. The proposed method is based on Deep Learning (DL) to segment defects in the image. The DL architecture leverages the low computational capacities of embedded systems in UAVs by using an optimised implementation of EfficientNet feature extraction and Feature Pyramid Network segmentation. To deal with the lack of annotated data for training we have developed a synthetic dataset generation methodology to extend available distress datasets. We demonstrate that the use of a mixed dataset composed of synthetic and real training images yields better results when testing the training models in real application scenarios. △ Less

Submitted 11 January, 2024; originally announced January 2024.

Comments: 12 pages, 6 figures, published in proceedings of 15th International Conference on Machine Vision (ICMV)

Journal ref: Proc. SPIE 12701, Fifteenth International Conference on Machine Vision (ICMV 2022), 1270118

arXiv:2401.05815 [pdf, other]

doi 10.1103/PhysRevAccelBeams.27.054601

Cheetah: Bridging the Gap Between Machine Learning and Particle Accelerator Physics with High-Speed, Differentiable Simulations

Authors: Jan Kaiser, Chenran Xu, Annika Eichler, Andrea Santamaria Garcia

Abstract: Machine learning has emerged as a powerful solution to the modern challenges in accelerator physics. However, the limited availability of beam time, the computational cost of simulations, and the high-dimensionality of optimisation problems pose significant challenges in generating the required data for training state-of-the-art machine learning models. In this work, we introduce Cheetah, a PyTorc… ▽ More Machine learning has emerged as a powerful solution to the modern challenges in accelerator physics. However, the limited availability of beam time, the computational cost of simulations, and the high-dimensionality of optimisation problems pose significant challenges in generating the required data for training state-of-the-art machine learning models. In this work, we introduce Cheetah, a PyTorch-based high-speed differentiable linear-beam dynamics code. Cheetah enables the fast collection of large data sets by reducing computation times by multiple orders of magnitude and facilitates efficient gradient-based optimisation for accelerator tuning and system identification. This positions Cheetah as a user-friendly, readily extensible tool that integrates seamlessly with widely adopted machine learning tools. We showcase the utility of Cheetah through five examples, including reinforcement learning training, gradient-based beamline tuning, gradient-based system identification, physics-informed Bayesian optimisation priors, and modular neural network surrogate modelling of space charge effects. The use of such a high-speed differentiable simulation code will simplify the development of machine learning-based methods for particle accelerators and fast-track their integration into everyday operations of accelerator facilities. △ Less

Submitted 11 January, 2024; originally announced January 2024.

Comments: 16 pages, 9 figures, 3 tables

Report number: PUBDB-2023-07854

Journal ref: Phys. Rev. Accel. Beams 27 (2024) 054601

arXiv:2312.17553 [pdf, other]

A Fully Automated Pipeline Using Swin Transformers for Deep Learning-Based Blood Segmentation on Head CT Scans After Aneurysmal Subarachnoid Hemorrhage

Authors: Sergio Garcia Garcia, Santiago Cepeda, Ignacio Arrese, Rosario Sarabia

Abstract: Background: Accurate volumetric assessment of spontaneous subarachnoid hemorrhage (SAH) is a labor-intensive task performed with current manual and semiautomatic methods that might be relevant for its clinical and prognostic implications. In the present research, we sought to develop and validate an artificial intelligence-driven, fully automated blood segmentation tool for SAH patients via noncon… ▽ More Background: Accurate volumetric assessment of spontaneous subarachnoid hemorrhage (SAH) is a labor-intensive task performed with current manual and semiautomatic methods that might be relevant for its clinical and prognostic implications. In the present research, we sought to develop and validate an artificial intelligence-driven, fully automated blood segmentation tool for SAH patients via noncontrast computed tomography (NCCT) scans employing a transformer-based Swin UNETR architecture. Methods: We retrospectively analyzed NCCT scans from patients with confirmed aneurysmal subarachnoid hemorrhage (aSAH) utilizing the Swin UNETR for segmentation. The performance of the proposed method was evaluated against manually segmented ground truth data using metrics such as Dice score, intersection over union (IoU), the volumetric similarity index (VSI), the symmetric average surface distance (SASD), and sensitivity and specificity. A validation cohort from an external institution was included to test the generalizability of the model. Results: The model demonstrated high accuracy with robust performance metrics across the internal and external validation cohorts. Notably, it achieved high Dice coefficient (0.873), IoU (0.810), VSI (0.840), sensitivity (0.821) and specificity (0.996) values and a low SASD (1.866), suggesting proficiency in segmenting blood in SAH patients. The model's efficiency was reflected in its processing speed, indicating potential for real-time applications. Conclusions: Our Swin UNETR-based model offers significant advances in the automated segmentation of blood after aSAH on NCCT images. Despite the computational intensity, the model operates effectively on standard hardware with a user-friendly interface, facilitating broader clinical adoption. Further validation across diverse datasets is warranted to confirm its clinical reliability. △ Less

Submitted 29 December, 2023; originally announced December 2023.

arXiv:2312.05667 [pdf, other]

Bayesian Optimization Algorithms for Accelerator Physics

Authors: Ryan Roussel, Auralee L. Edelen, Tobias Boltz, Dylan Kennedy, Zhe Zhang, Fuhao Ji, Xiaobiao Huang, Daniel Ratner, Andrea Santamaria Garcia, Chenran Xu, Jan Kaiser, Angel Ferran Pousa, Annika Eichler, Jannis O. Lubsen, Natalie M. Isenberg, Yuan Gao, Nikita Kuklev, Jose Martinez, Brahim Mustapha, Verena Kain, Weijian Lin, Simone Maria Liuzzo, Jason St. John, Matthew J. V. Streeter, Remi Lehe , et al. (1 additional authors not shown)

Abstract: Accelerator physics relies on numerical algorithms to solve optimization problems in online accelerator control and tasks such as experimental design and model calibration in simulations. The effectiveness of optimization algorithms in discovering ideal solutions for complex challenges with limited resources often determines the problem complexity these methods can address. The accelerator physics… ▽ More Accelerator physics relies on numerical algorithms to solve optimization problems in online accelerator control and tasks such as experimental design and model calibration in simulations. The effectiveness of optimization algorithms in discovering ideal solutions for complex challenges with limited resources often determines the problem complexity these methods can address. The accelerator physics community has recognized the advantages of Bayesian optimization algorithms, which leverage statistical surrogate models of objective functions to effectively address complex optimization challenges, especially in the presence of noise during accelerator operation and in resource-intensive physics simulations. In this review article, we offer a conceptual overview of applying Bayesian optimization techniques towards solving optimization problems in accelerator physics. We begin by providing a straightforward explanation of the essential components that make up Bayesian optimization techniques. We then give an overview of current and previous work applying and modifying these techniques to solve accelerator physics challenges. Finally, we explore practical implementation strategies for Bayesian optimization algorithms to maximize their performance, enabling users to effectively address complex optimization challenges in real-time beam control and accelerator design. △ Less

Submitted 5 April, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

arXiv:2312.01354 [pdf, other]

Protecting Sensitive Tabular Data in Hybrid Clouds

Authors: Maya Anderson, Gidon Gershinsky, Eliot Salant, Salvador Garcia

Abstract: Regulated industries, such as Healthcare and Finance, are starting to move parts of their data and workloads to the public cloud. However, they are still reluctant to trust the public cloud with their most sensitive records, and hence leave them in their premises, leveraging the hybrid cloud architecture. We address the security and performance challenges of big data analytics using a hybrid cloud… ▽ More Regulated industries, such as Healthcare and Finance, are starting to move parts of their data and workloads to the public cloud. However, they are still reluctant to trust the public cloud with their most sensitive records, and hence leave them in their premises, leveraging the hybrid cloud architecture. We address the security and performance challenges of big data analytics using a hybrid cloud in a real-life use case from a hospital. In this use case, the hospital collects sensitive patient data and wants to run analytics on it in order to lower antibiotics resistance, a significant challenge in healthcare. We show that it is possible to run large-scale analytics on data that is securely stored in the public cloud encrypted using Apache Parquet Modular Encryption (PME), without significant performance losses even if the secret encryption keys are stored on-premises. PME is a standard mechanism for data encryption and key management, not specific to any public cloud, and therefore helps prevent vendor lock-in. It also provides privacy and integrity guarantees, and enables granular access control to the data. We also present an innovation in PME for lowering the performance hit incurred by calls to the Key Management Service. Our solution therefore enables protecting large amounts of sensitive data in hybrid clouds and still allows to efficiently gain valuable insights from it. △ Less

Submitted 3 December, 2023; originally announced December 2023.

Comments: 5 pages, 3 figures

ACM Class: D.4.6; K.6.5; J.3

arXiv:2311.17880 [pdf, other]

Atomically thin current pathways in graphene through Kekulé-O engineering

Authors: Santiago Galván y García, Yonatan Betancur-Ocampo, Francisco Sánchez-Ochoa, Thomas Stegmann

Abstract: We demonstrate that the current flow in graphene can be guided on atomically thin current pathways by means of the engineering of Kekulé-O distortions. A grain boundary in these distortions separates the system into topological distinct regions and induces a ballistic domain-wall state. The state does not depend on the precise orientation of the grain boundary with respect to the graphene sublatti… ▽ More We demonstrate that the current flow in graphene can be guided on atomically thin current pathways by means of the engineering of Kekulé-O distortions. A grain boundary in these distortions separates the system into topological distinct regions and induces a ballistic domain-wall state. The state does not depend on the precise orientation of the grain boundary with respect to the graphene sublattice and therefore, permits to guide the current on arbitrary paths through the system. As the state is gapped, the current flow can be switched by electrostatic gates. Our findings can be explained by a generalization of the Jackiw-Rebbi model, where the electrons behave in one region of the system as fermions with an effective complex mass, making the device not only promising for technological applications but also a test-ground for concepts from high-energy physics. An atomic model supported by DFT calculations demonstrates that the proposed system can be realized by decorating graphene with Ti atoms. △ Less

Submitted 29 November, 2023; originally announced November 2023.

Comments: 8 pages, 5 figures

arXiv:2311.17168 [pdf, other]

Probing nuclear properties and neutrino physics with current and future CEνNS experiments

Authors: R. R. Rossi, G. Sanchez Garcia, M. Tórtola

Abstract: The recent observation of Coherent Elastic Neutrino Nucleus Scattering (CEνNS) with neutrinos from pion decay at rest (π-DAR) sources by the COHERENT Collaboration has raised interest in this process in the search for new physics. Unfortunately, current uncertainties in the determination of nuclear parameters relevant to those processes can hide new physics effects. This is not the case for proces… ▽ More The recent observation of Coherent Elastic Neutrino Nucleus Scattering (CEνNS) with neutrinos from pion decay at rest (π-DAR) sources by the COHERENT Collaboration has raised interest in this process in the search for new physics. Unfortunately, current uncertainties in the determination of nuclear parameters relevant to those processes can hide new physics effects. This is not the case for processes involving lower-energy neutrino sources such as nuclear reactors. Note, however, that a CEνNS measurement with reactor neutrinos depends largely on the determination of the quenching factor, making its observation more challenging. In the upcoming years, once this signal is confirmed, a combined analysis of π-DAR and reactor CEνNS experiments will be very useful to probe particle and nuclear physics, with a reduced dependence on the nuclear uncertainties. In this work, we explore this idea by simultaneously testing the sensitivity of current and future CEνNS experiments to neutrino non-standard interactions (NSI) and the neutron root mean square (rms) radius, considering different neutrino sources as well as several detection materials. We show how the interplay between future reactor and accelerator CEνNS experiments can help to get robust constraints on the neutron rms, and to break degeneracies between the NSI parameters. Our forecast could be used as a guide to optimize the experimental sensitivity to the parameters under study. △ Less

Submitted 28 November, 2023; originally announced November 2023.

Comments: 18 pages, 11 figures

arXiv:2311.11887 [pdf, ps, other]

An Almgren monotonicity formula for discrete harmonic functions

Authors: Mariana Smit Vega Garcia, Stefan Steinerberger

Abstract: The celebrated Almgren monotonicity formula for harmonic functions $u:\mathbb{R}^n \rightarrow \mathbb{R}$ says that its $L^2-$energy concentrated on a sphere of radius $r$, when measured in a suitable sense, is non-decreasing: if $u$ oscillates at a certain scale, it has even larger oscillations at a larger scale. We prove a discrete analogue of the Almgren monotonicity formula for harmonic funct… ▽ More The celebrated Almgren monotonicity formula for harmonic functions $u:\mathbb{R}^n \rightarrow \mathbb{R}$ says that its $L^2-$energy concentrated on a sphere of radius $r$, when measured in a suitable sense, is non-decreasing: if $u$ oscillates at a certain scale, it has even larger oscillations at a larger scale. We prove a discrete analogue of the Almgren monotonicity formula for harmonic functions on infinite combinatorial graphs $G=(V,E)$. Some applications are discussed. △ Less

Submitted 20 November, 2023; originally announced November 2023.

arXiv:2311.05051 [pdf, other]

Deep Learning Brasil at ABSAPT 2022: Portuguese Transformer Ensemble Approaches

Authors: Juliana Resplande Santanna Gomes, Eduardo Augusto Santos Garcia, Adalberto Ferreira Barbosa Junior, Ruan Chaves Rodrigues, Diogo Fernandes Costa Silva, Dyonnatan Ferreira Maia, Nádia Félix Felipe da Silva, Arlindo Rodrigues Galvão Filho, Anderson da Silva Soares

Abstract: Aspect-based Sentiment Analysis (ABSA) is a task whose objective is to classify the individual sentiment polarity of all entities, called aspects, in a sentence. The task is composed of two subtasks: Aspect Term Extraction (ATE), identify all aspect terms in a sentence; and Sentiment Orientation Extraction (SOE), given a sentence and its aspect terms, the task is to determine the sentiment polarit… ▽ More Aspect-based Sentiment Analysis (ABSA) is a task whose objective is to classify the individual sentiment polarity of all entities, called aspects, in a sentence. The task is composed of two subtasks: Aspect Term Extraction (ATE), identify all aspect terms in a sentence; and Sentiment Orientation Extraction (SOE), given a sentence and its aspect terms, the task is to determine the sentiment polarity of each aspect term (positive, negative or neutral). This article presents we present our participation in Aspect-Based Sentiment Analysis in Portuguese (ABSAPT) 2022 at IberLEF 2022. We submitted the best performing systems, achieving new state-of-the-art results on both subtasks. △ Less

Submitted 8 November, 2023; originally announced November 2023.

Comments: 11 pages, 3 figures, In Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2022), Online. CEUR. org

Report number: urn:nbn:de:0074-3202-9

arXiv:2310.17332 [pdf, other]

On Forecast Stability

Authors: Rakshitha Godahewa, Christoph Bergmeir, Zeynep Erkin Baz, Chengjun Zhu, Zhangdi Song, Salvador García, Dario Benavides

Abstract: Forecasts are typically not produced in a vacuum but in a business context, where forecasts are generated on a regular basis and interact with each other. For decisions, it may be important that forecasts do not change arbitrarily, and are stable in some sense. However, this area has received only limited attention in the forecasting literature. In this paper, we explore two types of forecast stab… ▽ More Forecasts are typically not produced in a vacuum but in a business context, where forecasts are generated on a regular basis and interact with each other. For decisions, it may be important that forecasts do not change arbitrarily, and are stable in some sense. However, this area has received only limited attention in the forecasting literature. In this paper, we explore two types of forecast stability that we call vertical stability and horizontal stability. The existing works in the literature are only applicable to certain base models and extending these frameworks to be compatible with any base model is not straightforward. Furthermore, these frameworks can only stabilise the forecasts vertically. To fill this gap, we propose a simple linear-interpolation-based approach that is applicable to stabilise the forecasts provided by any base model vertically and horizontally. The approach can produce both accurate and stable forecasts. Using N-BEATS, Pooled Regression and LightGBM as the base models, in our evaluation on four publicly available datasets, the proposed framework is able to achieve significantly higher stability and/or accuracy compared to a set of benchmarks including a state-of-the-art forecast stabilisation method across three error metrics and six stability metrics. △ Less

Submitted 26 October, 2023; originally announced October 2023.

arXiv:2310.07196 [pdf, other]

Norms on complex matrices induced by random vectors II: extension of weakly unitarily invariant norms

Authors: Ángel Chávez, Stephan Ramon Garcia, Jackson Hurley

Abstract: We improve and expand in two directions the theory of norms on complex matrices induced by random vectors. We first provide a simple proof of the classification of weakly unitarily invariant norms on the Hermitian matrices. We use this to extend the main theorem in [7] from exponent $d\geq 2$ to $d \geq 1$. Our proofs are much simpler than the originals: they do not require Lewis' framework for gr… ▽ More We improve and expand in two directions the theory of norms on complex matrices induced by random vectors. We first provide a simple proof of the classification of weakly unitarily invariant norms on the Hermitian matrices. We use this to extend the main theorem in [7] from exponent $d\geq 2$ to $d \geq 1$. Our proofs are much simpler than the originals: they do not require Lewis' framework for group invariance in convex matrix analysis. This clarification puts the entire theory on simpler foundations while extending its range of applicability. △ Less

Submitted 25 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

Comments: 10 pages

MSC Class: 47A30; 15A60; 16R30

arXiv:2309.13741 [pdf, ps, other]

Symmetric tensor powers of graphs

Authors: Weymar Astaiza, Alexander J. Barrios, Henry Chimal-Dzul, Stephan Ramon Garcia, Jaaziel de la Luz, Victor H. Moll, Yunied Puig, Diego Villamizar

Abstract: The symmetric tensor power of graphs is introduced and its fundamental properties are explored. A wide range of intriguing phenomena occur when one considers symmetric tensor powers of familiar graphs. A host of open questions are presented, ho** to spur future research. The symmetric tensor power of graphs is introduced and its fundamental properties are explored. A wide range of intriguing phenomena occur when one considers symmetric tensor powers of familiar graphs. A host of open questions are presented, ho** to spur future research. △ Less

Submitted 24 September, 2023; originally announced September 2023.

MSC Class: 05C76; 05C40

arXiv:2309.13476 [pdf, other]

Hierarchical attention interpretation: an interpretable speech-level transformer for bi-modal depression detection

Authors: Qingkun Deng, Saturnino Luz, Sofia de la Fuente Garcia

Abstract: Depression is a common mental disorder. Automatic depression detection tools using speech, enabled by machine learning, help early screening of depression. This paper addresses two limitations that may hinder the clinical implementations of such tools: noise resulting from segment-level labelling and a lack of model interpretability. We propose a bi-modal speech-level transformer to avoid segment-… ▽ More Depression is a common mental disorder. Automatic depression detection tools using speech, enabled by machine learning, help early screening of depression. This paper addresses two limitations that may hinder the clinical implementations of such tools: noise resulting from segment-level labelling and a lack of model interpretability. We propose a bi-modal speech-level transformer to avoid segment-level labelling and introduce a hierarchical interpretation approach to provide both speech-level and sentence-level interpretations, based on gradient-weighted attention maps derived from all attention layers to track interactions between input features. We show that the proposed model outperforms a model that learns at a segment level ($p$=0.854, $r$=0.947, $F1$=0.897 compared to $p$=0.732, $r$=0.808, $F1$=0.768). For model interpretation, using one true positive sample, we show which sentences within a given speech are most relevant to depression detection; and which text tokens and Mel-spectrogram regions within these sentences are most relevant to depression detection. These interpretations allow clinicians to verify the validity of predictions made by depression detection tools, promoting their clinical implementations. △ Less

Submitted 6 October, 2023; v1 submitted 23 September, 2023; originally announced September 2023.

Comments: 5 pages, 3 figures, submitted to IEEE International Conference on Acoustics, Speech, and Signal Processing

ACM Class: F.2.2; I.2.7

arXiv:2309.08510 [pdf, other]

Differences between quantum and classical adiabatic evolution

Authors: Cyrill Bösch, Andreas Fichtner, Marc Serra Garcia

Abstract: Adiabatic evolution is an emergent design principle for time modulated metamaterials, often inspired by insights from topological quantum computing such as Majorana fermions and braiding operations. However, the pursuit of classical adiabatic metamaterials is rooted on the assumption that classical and quantum adiabatic evolution are equivalent. We show that this is not the case; and some instance… ▽ More Adiabatic evolution is an emergent design principle for time modulated metamaterials, often inspired by insights from topological quantum computing such as Majorana fermions and braiding operations. However, the pursuit of classical adiabatic metamaterials is rooted on the assumption that classical and quantum adiabatic evolution are equivalent. We show that this is not the case; and some instances of quantum adiabatic evolution, such as those containing zero modes, cannot be reproduced in classical systems. This is because mode coupling is fundamentally different in classical mechanics. We derive classical conditions to ensure adiabaticity and demonstrate that only under these, from quantum mechanics distinct conditions the Berry phase and Wilczek-Zee matrix emerge as meaningful quantities encoding the geometry of classical adiabatic evolution. △ Less

Submitted 15 September, 2023; originally announced September 2023.

Comments: 7 pages, 1 figure

arXiv:2309.08337 [pdf]

Proceedings of the XII International Workshop on Locational Analysis and Related Problems

Authors: Marta Baldomero-Naranjo, Víctor Blanco, Sergio García, Ricardo Gázquez, Jörg Kalcsics, Luisa I. Martínez-Merino, Juan M. Muñoz-Ocaña, Francisco Temprano, Alberto Torrejón

Abstract: The International Workshop on Locational Analysis and Related Problems will take place during September 7-8, 2023 in Edinburgh (United Kingdom). It is organized by the Spanish Location Network and the Location Group GELOCA from the Spanish Society of Statistics and Operations Research (SEIO). The Spanish Location Network is a group of more than 140 researchers from several Spanish universities org… ▽ More The International Workshop on Locational Analysis and Related Problems will take place during September 7-8, 2023 in Edinburgh (United Kingdom). It is organized by the Spanish Location Network and the Location Group GELOCA from the Spanish Society of Statistics and Operations Research (SEIO). The Spanish Location Network is a group of more than 140 researchers from several Spanish universities organized into 7 thematic groups. The Network has been funded by the Spanish Government since 2003. The current project is RED2022-134149-T. One of the main activities of the Network is a yearly meeting aimed at promoting the communication among its members and between them and other researchers, and to contribute to the development of the location field and related problems. As a proof of the internationalization of this research group, this will be the first time that the meeting is held out of Spain. The topics of interest are location analysis and related problems. This includes location models, networks, transportation, logistics, exact and heuristic solution methods, and computational geometry, among others. △ Less

Submitted 5 October, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

Comments: The proceedings book of the previous editions can be found at arXiv:2002.08287 arXiv:2002.08293 arXiv:2002.08300 arXiv:2002.01702 arXiv:2202.13878

Report number: ISBN: 978-84-09-53463-0

arXiv:2309.03053 [pdf, other]

doi 10.22323/1.444.0752

An update on site search activities for SWGO

Authors: M. Santander, U. Barres de Almeida, J. A. Bellido, T. Bulik, C. Dib, B. Dingus, S. Garcia, F. Guarino, P. Huentemeyer, D. Mandat, E. Meza, L. Mendes, L. Nellen, C. Ocampo, L. Otiniano, E. Quispe, A. Reisenegger, A. C. Rovero, F. Sanchez, A. Sandoval, R. Yanyachi, H. Zhou

Abstract: The Southern Wide-field Gamma-ray Observatory (SWGO) is a project by scientists and engineers from 14 countries and 78 institutions to design and build the first wide-field, ground-based gamma-ray observatory in the Southern Hemisphere, with high duty cycle and covering an energy range rom hundreds of GeV to the PeV scale. The observatory will cover the Southern sky and aims to map the Galaxy's la… ▽ More The Southern Wide-field Gamma-ray Observatory (SWGO) is a project by scientists and engineers from 14 countries and 78 institutions to design and build the first wide-field, ground-based gamma-ray observatory in the Southern Hemisphere, with high duty cycle and covering an energy range rom hundreds of GeV to the PeV scale. The observatory will cover the Southern sky and aims to map the Galaxy's large-scale emission, as well as detecting transient and variable phenomena. The host sites under consideration are at a minimum altitude of 4400 m.a.s.l. and comprise two types: flat plateaus of at least 1 km$^{2}$ for the installation of an array of tank-based water Cherenkov detectors (WCD), or large natural lakes for the direct deployment of WCD units. Four South American countries proposed excellent sites to host the observatory meeting these requirements. Argentina proposed two locations in the Salta province, Bolivia presented one site in Chacaltaya, Chile two locations within the Atacama Astronomical Park, and Peru two ground-based locations in the Arequipa district as well as lakes in the Cuzco region. The SWGO collaboration is currently conducting a site characterization study, gathering all the necessary information for site shortlisting and final site selection by the end of 2023. The process has reached the shortlisting phase, in which primary and backup sites for each country have been identified. The primary sites were visited by a team of experts from the collaboration, to investigate and validate the proposed site characteristics. Here we present an update on these site selection activities. △ Less

Submitted 6 September, 2023; originally announced September 2023.

Comments: In Proceedings of the 2023 ICRC, Nagoya, Japan

Journal ref: PoS (ICRC2023) 752

arXiv:2309.00155 [pdf, other]

LLM in the Shell: Generative Honeypots

Authors: Muris Sladić, Veronica Valeros, Carlos Catania, Sebastian Garcia

Abstract: Honeypots are essential tools in cybersecurity. However, most of them (even the high-interaction ones) lack the required realism to engage and fool human attackers. This limitation makes them easily discernible, hindering their effectiveness. This work introduces a novel method to create dynamic and realistic software honeypots based on Large Language Models. Preliminary results indicate that LLMs… ▽ More Honeypots are essential tools in cybersecurity. However, most of them (even the high-interaction ones) lack the required realism to engage and fool human attackers. This limitation makes them easily discernible, hindering their effectiveness. This work introduces a novel method to create dynamic and realistic software honeypots based on Large Language Models. Preliminary results indicate that LLMs can create credible and dynamic honeypots capable of addressing important limitations of previous honeypots, such as deterministic responses, lack of adaptability, etc. We evaluated the realism of each command by conducting an experiment with human attackers who needed to say if the answer from the honeypot was fake or not. Our proposed honeypot, called shelLM, reached an accuracy of 0.92. The source code and prompts necessary for replicating the experiments have been made publicly available. △ Less

Submitted 9 February, 2024; v1 submitted 31 August, 2023; originally announced September 2023.

Comments: 6 pages. 2 figures. 2 tables

arXiv:2308.16562 [pdf, other]

doi 10.1007/978-3-031-51482-1_3

The Power of MEME: Adversarial Malware Creation with Model-Based Reinforcement Learning

Authors: Maria Rigaki, Sebastian Garcia

Abstract: Due to the proliferation of malware, defenders are increasingly turning to automation and machine learning as part of the malware detection tool-chain. However, machine learning models are susceptible to adversarial attacks, requiring the testing of model and product robustness. Meanwhile, attackers also seek to automate malware generation and evasion of antivirus systems, and defenders try to gai… ▽ More Due to the proliferation of malware, defenders are increasingly turning to automation and machine learning as part of the malware detection tool-chain. However, machine learning models are susceptible to adversarial attacks, requiring the testing of model and product robustness. Meanwhile, attackers also seek to automate malware generation and evasion of antivirus systems, and defenders try to gain insight into their methods. This work proposes a new algorithm that combines Malware Evasion and Model Extraction (MEME) attacks. MEME uses model-based reinforcement learning to adversarially modify Windows executable binary samples while simultaneously training a surrogate model with a high agreement with the target model to evade. To evaluate this method, we compare it with two state-of-the-art attacks in adversarial malware creation, using three well-known published models and one antivirus product as targets. Results show that MEME outperforms the state-of-the-art methods in terms of evasion capabilities in almost all cases, producing evasive malware with an evasion rate in the range of 32-73%. It also produces surrogate models with a prediction label agreement with the respective target models between 97-99%. The surrogate could be used to fine-tune and improve the evasion rate in the future. △ Less

Submitted 31 August, 2023; originally announced August 2023.

Comments: 12 pages, 3 figures, 3 tables. Accepted at ESORICS 2023

arXiv:2308.16061 [pdf, other]

Conti Inc.: Understanding the Internal Discussions of a large Ransomware-as-a-Service Operator with Machine Learning

Authors: Estelle Ruellan, Masarah Paquet-Clouston, Sebastian Garcia

Abstract: Ransomware-as-a-service (RaaS) is increasing the scale and complexity of ransomware attacks. Understanding the internal operations behind RaaS has been a challenge due to the illegality of such activities. The recent chat leak of the Conti RaaS operator, one of the most infamous ransomware operators on the international scene, offers a key opportunity to better understand the inner workings of suc… ▽ More Ransomware-as-a-service (RaaS) is increasing the scale and complexity of ransomware attacks. Understanding the internal operations behind RaaS has been a challenge due to the illegality of such activities. The recent chat leak of the Conti RaaS operator, one of the most infamous ransomware operators on the international scene, offers a key opportunity to better understand the inner workings of such organizations. This paper analyzes the main topic discussions in the Conti chat leak using machine learning techniques such as Natural Language Processing (NLP) and Latent Dirichlet Allocation (LDA), as well as visualization strategies. Five discussion topics are found: 1) Business, 2) Technical, 3) Internal tasking/Management, 4) Malware, and 5) Customer Service/Problem Solving. Moreover, the distribution of topics among Conti members shows that only 4% of individuals have specialized discussions while almost all individuals (96%) are all-rounders, meaning that their discussions revolve around the five topics. The results also indicate that a significant proportion of Conti discussions are non-tech related. This study thus highlights that running such large RaaS operations requires a workforce skilled beyond technical abilities, with individuals involved in various tasks, from management to customer service or problem solving. The discussion topics also show that the organization behind the Conti RaaS oper5086933ator shares similarities with a large firm. We conclude that, although RaaS represents an example of specialization in the cybercrime industry, only a few members are specialized in one topic, while the rest runs and coordinates the RaaS operation. △ Less

Submitted 30 August, 2023; originally announced August 2023.

arXiv:2308.12105 [pdf, other]

doi 10.1140/epjc/s10052-024-12470-w

Pulse shape discrimination for the CONUS experiment in the keV and sub-keV regime

Authors: H. Bonet, A. Bonhomme, C. Buck, K. Fülber, J. Hakenmüller, J. Hempfling, J. Henrichs, G. Heusser, M. Lindner, W. Maneschg, T. Rink, E. Sanchez Garcia, J. Stauber, H. Strecker, R. Wink

Abstract: Point-contact p-type high-purity germanium detectors (PPC HPGe) are particularly suited for detection of sub-keV nuclear recoils from coherent elastic scattering of neutrinos or light dark matter particles. While these particles are expected to interact homogeneously in the entire detector volume, specific classes of external background radiation preferably deposit their energy close to the semi-a… ▽ More Point-contact p-type high-purity germanium detectors (PPC HPGe) are particularly suited for detection of sub-keV nuclear recoils from coherent elastic scattering of neutrinos or light dark matter particles. While these particles are expected to interact homogeneously in the entire detector volume, specific classes of external background radiation preferably deposit their energy close to the semi-active detector surface, in which diffusion processes dominate that subsequently lead to slower rising pulses compared to the ones from the fully active bulk volume. Dedicated studies of their shape are therefore highly beneficial for the understanding and the rejection of these unwanted events. This article reports about the development of a data-driven pulse shape discrimination (PSD) method for the four 1 kg size PPC HPGe detectors of the CONUS experiment in the keV and sub-keV regime down to 210 eV$_{\text{ee}}$. The impact of the electronic noise at such low energies is carefully examined. It is shown that for an acceptance of 90% of the faster signal-like pulses from the bulk volume, approx. 50% of the surface events can be rejected at the energy threshold and that their contribution is fully suppressed above 800 eV$_{\text{ee}}$. Applied to the CONUS background data, such a PSD rejection cut allows to achieve an overall (15-25)% reduction of the total background budget. The new method allows to improve the sensitivity of future CONUS analyses and to refine the corresponding background model in the sub-keV energy region. △ Less

Submitted 9 February, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

Journal ref: Eur. Phys. J. C 84, 139 (2024)

arXiv:2308.12086 [pdf, other]

doi 10.5220/0012391800003636

Out of the Cage: How Stochastic Parrots Win in Cyber Security Environments

Authors: Maria Rigaki, Ondřej Lukáš, Carlos A. Catania, Sebastian Garcia

Abstract: Large Language Models (LLMs) have gained widespread popularity across diverse domains involving text generation, summarization, and various natural language processing tasks. Despite their inherent limitations, LLM-based designs have shown promising capabilities in planning and navigating open-world scenarios. This paper introduces a novel application of pre-trained LLMs as agents within cybersecu… ▽ More Large Language Models (LLMs) have gained widespread popularity across diverse domains involving text generation, summarization, and various natural language processing tasks. Despite their inherent limitations, LLM-based designs have shown promising capabilities in planning and navigating open-world scenarios. This paper introduces a novel application of pre-trained LLMs as agents within cybersecurity network environments, focusing on their utility for sequential decision-making processes. We present an approach wherein pre-trained LLMs are leveraged as attacking agents in two reinforcement learning environments. Our proposed agents demonstrate similar or better performance against state-of-the-art agents trained for thousands of episodes in most scenarios and configurations. In addition, the best LLM agents perform similarly to human testers of the environment without any additional training process. This design highlights the potential of LLMs to efficiently address complex decision-making tasks within cybersecurity. Furthermore, we introduce a new network security environment named NetSecGame. The environment is designed to eventually support complex multi-agent scenarios within the network security domain. The proposed environment mimics real network attacks and is designed to be highly modular and adaptable for various scenarios. △ Less

Submitted 28 August, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

Comments: Under review. 10 pages plus appendices, 7 figures, 4 tables. Edit: fix e-mails and code repository

arXiv:2308.06357 [pdf, ps, other]

Two-phase almost minimizers for a fractional free boundary problem

Authors: Mark Allen, Mariana Smit Vega Garcia

Abstract: In this paper, we study almost minimizers to a fractional Alt-Caffarelli-Friedman type functional. Our main results concern the optimal $C^{0,s}$ regularity of almost minimizers as well as the structure of the free boundary. We first prove that the two free boundaries $F^+(u)=\partial\{u(\cdot,0)>0\}$ and $F^-(u)=\partial\{u(\cdot,0)<0\}$ cannot touch, that is, $F^+(u)\cap F^-(u)=\emptyset$. Lastl… ▽ More In this paper, we study almost minimizers to a fractional Alt-Caffarelli-Friedman type functional. Our main results concern the optimal $C^{0,s}$ regularity of almost minimizers as well as the structure of the free boundary. We first prove that the two free boundaries $F^+(u)=\partial\{u(\cdot,0)>0\}$ and $F^-(u)=\partial\{u(\cdot,0)<0\}$ cannot touch, that is, $F^+(u)\cap F^-(u)=\emptyset$. Lastly, we prove a flatness implies $C^{1,γ}$ result for the free boundary. △ Less

Submitted 28 February, 2024; v1 submitted 11 August, 2023; originally announced August 2023.

arXiv:2308.02130 [pdf, other]

Spin-valley locking in Kekulé-distorted graphene with Dirac-Rashba interactions

Authors: David A. Ruiz-Tijerina, Jesús Arturo Sánchez-Sánchez, Ramon Carrillo-Bastos, Santiago Galván y García, Francisco Mireles

Abstract: The joint effects of Kekulé lattice distortions and Rashba-type spin-orbit coupling on the electronic properties of graphene are explored. We modeled the position dependence of the Rashba energy term in a manner that allows its seamless integration into the scheme introduced by Gamayun et al.[New J. Phys. 20, 023016 (2018)] to describe graphene with Kekulé lattice distortion. Particularly for the… ▽ More The joint effects of Kekulé lattice distortions and Rashba-type spin-orbit coupling on the electronic properties of graphene are explored. We modeled the position dependence of the Rashba energy term in a manner that allows its seamless integration into the scheme introduced by Gamayun et al.[New J. Phys. 20, 023016 (2018)] to describe graphene with Kekulé lattice distortion. Particularly for the Kekulé-Y texture, the effective low energy Dirac Hamiltonian contains a new spin-valley locking term, in addition to the well-known Rashba-induced momentum-pseudospin and spin-pseudospin couplings, and the Kekulé-induced momentum-valley coupling term. We report on the low-energy band structure and Landau level spectra of Rashba-spin-orbit-coupled Kek-Y graphene, and propose an experimental scheme to discern between the presence of Rashba spin-orbit coupling, Kek-Y lattice distortion, and both, based on do**-dependent magnetotransport measurements. △ Less

Submitted 4 August, 2023; originally announced August 2023.

Comments: 14 pages, including 8 figures and 4 appendices

arXiv:2307.13790 [pdf, other]

doi 10.1103/PhysRevD.109.055014

A neutrino window to scalar leptoquarks: from low energy to colliders

Authors: Valentina De Romeri, Victor Martin Lozano, G. Sanchez Garcia

Abstract: Leptoquarks are theorized particles of either scalar or vector nature that couple simultaneously to quarks and leptons. Motivated by recent measurements of coherent elastic neutrino-nucleus scattering, we consider the impact of scalar leptoquarks coupling to neutrinos on a few complementary processes, from low energy to colliders. In particular, we set competitive constraints on the typical mass a… ▽ More Leptoquarks are theorized particles of either scalar or vector nature that couple simultaneously to quarks and leptons. Motivated by recent measurements of coherent elastic neutrino-nucleus scattering, we consider the impact of scalar leptoquarks coupling to neutrinos on a few complementary processes, from low energy to colliders. In particular, we set competitive constraints on the typical mass and coupling of scalar leptoquarks by analyzing recent COHERENT data. We compare these constraints with bounds from atomic parity violation experiments, deep inelastic neutrino-nucleon scattering and LHC data. Our results highlight a strong complementarity between different facilities and demonstrate the compelling power of coherent elastic neutrino-nucleus scattering experiments to probe leptoquark masses in the MeV-GeV range. Finally, we also present prospects for improving current bounds with future upgrades of the COHERENT detectors and the planned European Spallation Source. △ Less

Submitted 28 May, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

Comments: 30 pages, 9 figures, 2 tables, improved constraints on LQ masses and couplings. Matches published version in PRD

Report number: IFIC/23-27

Journal ref: Phys.Rev.D 109 (2024) 5, 055014

arXiv:2307.03948 [pdf, other]

Reading Between the Lanes: Text VideoQA on the Road

Authors: George Tom, Minesh Mathew, Sergi Garcia, Dimosthenis Karatzas, C. V. Jawahar

Abstract: Text and signs around roads provide crucial information for drivers, vital for safe navigation and situational awareness. Scene text recognition in motion is a challenging problem, while textual cues typically appear for a short time span, and early detection at a distance is necessary. Systems that exploit such information to assist the driver should not only extract and incorporate visual and te… ▽ More Text and signs around roads provide crucial information for drivers, vital for safe navigation and situational awareness. Scene text recognition in motion is a challenging problem, while textual cues typically appear for a short time span, and early detection at a distance is necessary. Systems that exploit such information to assist the driver should not only extract and incorporate visual and textual cues from the video stream but also reason over time. To address this issue, we introduce RoadTextVQA, a new dataset for the task of video question answering (VideoQA) in the context of driver assistance. RoadTextVQA consists of $3,222$ driving videos collected from multiple countries, annotated with $10,500$ questions, all based on text or road signs present in the driving videos. We assess the performance of state-of-the-art video question answering models on our RoadTextVQA dataset, highlighting the significant potential for improvement in this domain and the usefulness of the dataset in advancing research on in-vehicle support systems and text-aware multimodal question answering. The dataset is available at http://cvit.iiit.ac.in/research/projects/cvit-projects/roadtextvqa △ Less

Submitted 8 July, 2023; originally announced July 2023.

arXiv:2306.14807 [pdf, ps, other]

Symmetric and Antisymmetric Tensor Products for the Function-Theoretic Operator Theorist

Authors: Stephan Ramon Garcia, Ryan O'Loughlin, Jiahui Yu

Abstract: We study symmetric and antisymmetric tensor products of Hilbert-space operators, focusing on norms and spectra for some well-known classes favored by function-theoretic operator theorists. We pose many open questions that should interest the field. We study symmetric and antisymmetric tensor products of Hilbert-space operators, focusing on norms and spectra for some well-known classes favored by function-theoretic operator theorists. We pose many open questions that should interest the field. △ Less

Submitted 26 June, 2023; originally announced June 2023.

Comments: 20 pages

MSC Class: 46B28; 30H10; 47A30; 47A10

arXiv:2306.14584 [pdf]

doi 10.1016/j.tre.2023.103174

Methodology for generating synthetic labeled datasets for visual container inspection

Authors: Guillem Delgado, Andoni Cortés, Sara García, Estíbaliz Loyo, Maialen Berasategi, Nerea Aranjuelo

Abstract: Nowadays, containerized freight transport is one of the most important transportation systems that is undergoing an automation process due to the Deep Learning success. However, it suffers from a lack of annotated data in order to incorporate state-of-the-art neural network models to its systems. In this paper we present an innovative methodology to generate a realistic, varied, balanced, and la… ▽ More Nowadays, containerized freight transport is one of the most important transportation systems that is undergoing an automation process due to the Deep Learning success. However, it suffers from a lack of annotated data in order to incorporate state-of-the-art neural network models to its systems. In this paper we present an innovative methodology to generate a realistic, varied, balanced, and labelled dataset for visual inspection task of containers in a dock environment. In addition, we validate this methodology with multiple visual tasks recurrently found in the state of the art. We prove that the generated synthetic labelled dataset allows to train a deep neural network that can be used in a real world scenario. On the other side, using this methodology we provide the first open synthetic labelled dataset called SeaFront available in: https://datasets.vicomtech.org/di21-seafront/readme.txt. △ Less

Submitted 26 June, 2023; originally announced June 2023.

Journal ref: Transportation Research Part E: Logistics and Transportation Review, Volume 175, 2023, 103174

arXiv:2306.12158 [pdf, ps, other]

Mesas of Stirling permutations

Authors: Nicolle González, Pamela E. Harris, Gordon Rojas Kirby, Mariana Smit Vega Garcia, Bridget Eileen Tenner

Abstract: Given a Stirling permutation w, we introduce the mesa set of w as the natural generalization of the pinnacle set of a permutation. Our main results characterize admissible mesa sets and give closed enumerative formulas in terms of rational Catalan numbers by providing an explicit bijection between mesa sets and rational Dyck paths. Given a Stirling permutation w, we introduce the mesa set of w as the natural generalization of the pinnacle set of a permutation. Our main results characterize admissible mesa sets and give closed enumerative formulas in terms of rational Catalan numbers by providing an explicit bijection between mesa sets and rational Dyck paths. △ Less

Submitted 21 June, 2023; originally announced June 2023.

Comments: 11 pages, 3 figures

MSC Class: 05A05 (Primary); 05A15 (Secondary)

arXiv:2306.11679 [pdf, other]

A primal-dual data-driven method for computational optical imaging with a photonic lantern

Authors: Carlos Santos Garcia, Mathilde Larchevêque, Solal O'Sullivan, Martin Van Waerebeke, Robert R. Thomson, Audrey Repetti, Jean-Christophe Pesquet

Abstract: Optical fibres aim to image in-vivo biological processes. In this context, high spatial resolution and stability to fibre movements are key to enable decision-making processes (e.g., for microendoscopy). Recently, a single-pixel imaging technique based on a multicore fibre photonic lantern has been designed, named computational optical imaging using a lantern (COIL). A proximal algorithm based on… ▽ More Optical fibres aim to image in-vivo biological processes. In this context, high spatial resolution and stability to fibre movements are key to enable decision-making processes (e.g., for microendoscopy). Recently, a single-pixel imaging technique based on a multicore fibre photonic lantern has been designed, named computational optical imaging using a lantern (COIL). A proximal algorithm based on a sparsity prior, dubbed SARA-COIL, has been further proposed to solve the associated inverse problem, to enable image reconstructions for high resolution COIL microendoscopy. In this work, we develop a data-driven approach for COIL. We replace the sparsity prior in the proximal algorithm by a learned denoiser, leading to a plug-and-play (PnP) algorithm. The resulting PnP method, based on a proximal primal-dual algorithm, enables to solve the Morozov formulation of the inverse problem. We use recent results in learning theory to train a network with desirable Lipschitz properties, and we show that the resulting primal-dual PnP algorithm converges to a solution to a monotone inclusion problem. Our simulations highlight that the proposed data-driven approach improves the reconstruction quality over variational SARA-COIL method on both simulated and real data. △ Less

Submitted 17 April, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

arXiv:2306.04046 [pdf, ps, other]

The Reciprocal Schur Inequality

Authors: Albrecht Boettcher, Stephan Ramon Garcia, Mishko Mitkovski

Abstract: Schur's inequality states that the sum of three special terms is always nonnegative. This note is a short review of inequalities for the sum of the reciprocals of these terms and of extensions of the latter inequalities to an arbitrary number of terms and thus to higher-order divided differences. Schur's inequality states that the sum of three special terms is always nonnegative. This note is a short review of inequalities for the sum of the reciprocals of these terms and of extensions of the latter inequalities to an arbitrary number of terms and thus to higher-order divided differences. △ Less

Submitted 16 June, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

Comments: 7 pages

MSC Class: 26D15; 05E05; 26A51; 39B62

arXiv:2306.03739 [pdf, other]

Learning to Do or Learning While Doing: Reinforcement Learning and Bayesian Optimisation for Online Continuous Tuning

Authors: Jan Kaiser, Chenran Xu, Annika Eichler, Andrea Santamaria Garcia, Oliver Stein, Erik Bründermann, Willi Kuropka, Hannes Dinter, Frank Mayet, Thomas Vinatier, Florian Burkart, Holger Schlarb

Abstract: Online tuning of real-world plants is a complex optimisation problem that continues to require manual intervention by experienced human operators. Autonomous tuning is a rapidly expanding field of research, where learning-based methods, such as Reinforcement Learning-trained Optimisation (RLO) and Bayesian optimisation (BO), hold great promise for achieving outstanding plant performance and reduci… ▽ More Online tuning of real-world plants is a complex optimisation problem that continues to require manual intervention by experienced human operators. Autonomous tuning is a rapidly expanding field of research, where learning-based methods, such as Reinforcement Learning-trained Optimisation (RLO) and Bayesian optimisation (BO), hold great promise for achieving outstanding plant performance and reducing tuning times. Which algorithm to choose in different scenarios, however, remains an open question. Here we present a comparative study using a routine task in a real particle accelerator as an example, showing that RLO generally outperforms BO, but is not always the best choice. Based on the study's results, we provide a clear set of criteria to guide the choice of algorithm for a given tuning task. These can ease the adoption of learning-based autonomous tuning solutions to the operation of complex real-world plants, ultimately improving the availability and pushing the limits of operability of these facilities, thereby enabling scientific and engineering advancements. △ Less

Submitted 6 June, 2023; originally announced June 2023.

Comments: 17 pages, 8 figures, 2 tables

arXiv:2305.01346 [pdf, other]

Attacker Profiling Through Analysis of Attack Patterns in Geographically Distributed Honeypots

Authors: Veronica Valeros, Maria Rigaki, Sebastian Garcia

Abstract: Honeypots are a well-known and widely used technology in the cybersecurity community, where it is assumed that placing honeypots in different geographical locations provides better visibility and increases effectiveness. However, how geolocation affects the usefulness of honeypots is not well-studied, especially for threat intelligence as early warning systems. This paper examines attack patterns… ▽ More Honeypots are a well-known and widely used technology in the cybersecurity community, where it is assumed that placing honeypots in different geographical locations provides better visibility and increases effectiveness. However, how geolocation affects the usefulness of honeypots is not well-studied, especially for threat intelligence as early warning systems. This paper examines attack patterns in a large public dataset of geographically distributed honeypots by answering methodological questions and creating behavioural profiles of attackers. Results show that the location of honeypots helps identify attack patterns and build profiles for the attackers. We conclude that not all the intelligence collected from geographically distributed honeypots is equally valuable and that a good early warning system against resourceful attackers may be built with only two distributed honeypots and a production server. △ Less

Submitted 2 May, 2023; originally announced May 2023.

Showing 1–50 of 307 results for author: Garcia, S