Search | arXiv e-print repository

Weakly-supervised Autism Severity Assessment in Long Videos

Authors: Abid Ali, Mahmoud Ali, Jean-Marc Odobez, Camilla Barbini, Séverine Dubuisson, Francois Bremond, Susanne Thümmler

Abstract: Autism Spectrum Disorder (ASD) is a diverse collection of neurobiological conditions marked by challenges in social communication and reciprocal interactions, as well as repetitive and stereotypical behaviors. Atypical behavior patterns in a long, untrimmed video can serve as biomarkers for children with ASD. In this paper, we propose a video-based weakly-supervised method that takes spatio-temporal features of long videos to learn typical and atypical behaviors for autism detection. On top of that, we propose a shallow TCN-MLP network, which is designed to further categorize the severity score. We evaluate our method on actual evaluation videos of children with autism collected and annotated (for severity score) by clinical professionals. Experimental results demonstrate the effectiveness of behavioral biomarkers that could help clinicians in autism spectrum analysis.

Submitted 12 July, 2024; originally announced July 2024.

Journal ref: https://cbmi2024.org/

arXiv:2407.07172 [pdf, other]

Lorentzian anti-de Sitter plane

Authors: A. Z. Ali, Yu. L. Sachkov

Abstract: In this paper, we study a two-dimensional Lorentzian problem on the anti-de Sitter plane. Using methods of geometric control theory and differential geometry, we construct an orthonormal frame, calculate extremal trajectories, describe the reachable set, construct an optimal synthesis, and describe the Lorentzian distance.

Submitted 9 July, 2024; originally announced July 2024.

Comments: In Russian

arXiv:2407.06392 [pdf, other]

Effects of Small-Scale User Mobility on Highly Directional XR Communications

Authors: Asad Ali, Olga Galinina, Jiri Hosek, Sergey Andreev

Abstract: The development of next-generation communication systems promises to enable extended reality (XR) applications, such as XR gaming with ultra-realistic content and human-grade sensory feedback. These demanding applications impose stringent performance requirements on the underlying wireless communication infrastructure. To meet the expected Quality of Experience (QoE) for XR applications, high-capacity connections are necessary, which can be achieved by using millimeter-wave (mmWave) frequency bands and employing highly directional beams. However, these narrow beams are susceptible to even minor misalignments caused by small-scale user mobility, such as changes in the orientation of the XR head-mounted device (HMD) or minor shifts in user body position. This article explores the impact of small-scale user mobility on mmWave connectivity for XR and reviews approaches to resolve the challenges arising due to small-scale mobility. To deepen our understanding of small-scale mobility during XR usage, we prepared a dataset of user mobility during XR gaming. We use this dataset to study the effects of user mobility on highly directional communication, identifying specific aspects of user mobility that significantly affect the performance of narrow-beam wireless communication systems. Our results confirm the substantial influence of small-scale mobility on beam misalignment, highlighting the need for enhanced mechanisms to effectively manage the consequences of small-scale mobility.

Submitted 8 July, 2024; originally announced July 2024.

arXiv:2407.05509 [pdf, other]

Physically Accessible and Inaccessible Quantum Correlations of Dirac Fields in Schwarzschild Spacetime

Authors: Samira Elghaayda, Asad Ali, Saif Al-Kuwari, Mostafa Mansour

Abstract: In this study, we investigate the influence of Hawking decoherence on the quantum correlations of Dirac fields between Alice and Bob. Initially, they share a Gisin state near the Schwarzschild black hole (SBH) in an asymptotically flat region. Then, Alice remains stationary in this region, while Bob hovers near the event horizon (EH) of the SBH. We expect that Bob, using his excited detector, will detect a thermal Fermi-Dirac particle distribution. We assess the quantum correlations in the evolved Gisin state using quantum consonance and uncertainty-induced non-locality across physically accessible, physically inaccessible, and spacetime regions. Our investigation examines how these measures vary with Hawking temperature, Dirac particle frequency, and the parameters of the initial Gisin state. Additionally, we analyze the distribution of these quantum correlation measures across all possible regions, noting a redistribution towards the physically inaccessible region. Our findings demonstrate that Hawking decoherence reduces the quantum correlations of Dirac fields in the physically accessible region, with the extent of reduction depending on the initial state parameters. Moreover, as Hawking decoherence intensifies in the physically inaccessible and spacetime regions, the quantum correlations of Dirac fields reemerge and ultimately converge to specific values at infinite Hawking temperature. These results contribute to our understanding of quantum correlation dynamics within the framework of relativistic quantum information (RQI).

Submitted 7 July, 2024; originally announced July 2024.

Comments: 16 pages, 9 figures

arXiv:2407.01915 [pdf, other]

Unraveling the Trigger Mechanism of Explosive Reconnection in Partially Ionized Solar Plasma

Authors: Abdullah Zafar, Lei Ni, Jun Lin, Ahmad Ali

Abstract: Plasmoid instability is usually held to account for the onset of fast reconnection events observed in astrophysical plasmas. However, the reconnection rate measured from observations can be one order of magnitude higher than that derived from MHD simulations. In this study, we present the results of magnetic reconnection in the partially ionized low solar atmosphere based on 2.5D magnetohydrodynamics (MHD) simulations. The whole reconnection process covers two different fast reconnection phases. In the first phase, the slow Sweet-Parker reconnection transitions to plasmoid-mediated reconnection, and the reconnection rate reaches about 0.02. In the second phase, a faster explosive reconnection appears, with the reconnection rate rising above 0.06. At the same time, a sharp decrease in plasma temperature and density at the principal X-point is observed, which is associated with strong radiative cooling, the ejection of hot plasma from the local reconnection region, or the motion of the principal X-point from a hot, denser region to a cooler, less dense one along the narrow current sheet. This causes gas pressure depletion and an increase in magnetic diffusion at the main X-point, resulting in local Petschek-like reconnection and a violent, rapid increase in the reconnection rate. This study reveals for the first time a common phenomenon: plasmoid-dominated reconnection transitions to a faster explosive reconnection, with the rate approaching the order of 0.1, in partially ionized plasma on the MHD scale.

Submitted 1 July, 2024; originally announced July 2024.

arXiv:2406.17973 [pdf, other]

Koopman-LQR Controller for Quadrotor UAVs from Data

Authors: Zeyad M. Manaa, Ayman M. Abdallah, Mohammad A. Abido, Syed S. Azhar Ali

Abstract: Quadrotor systems are common and beneficial for many fields, but their intricate behavior often makes it challenging to design effective and optimal control strategies. Traditional approaches to nonlinear control often rely on local linearizations or complex nonlinear models, which can be inaccurate or computationally expensive. We present a data-driven approach to identify the dynamics of a given quadrotor system using Koopman operator theory. Koopman theory offers a framework for representing nonlinear dynamics as linear operators acting on observable functions of the state space. This allows nonlinear systems to be approximated with globally linear models in a higher-dimensional space, which can be analyzed and controlled using standard linear optimal control techniques. We leverage the method of extended dynamic mode decomposition (EDMD) to identify the Koopman operator from data with total least squares. We demonstrate that the identified model is controllable and can be stabilized by designing a controller using a linear quadratic regulator (LQR).
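
The identification step described above can be illustrated with a minimal sketch. It is not the authors' code: it uses ordinary least squares rather than the total least squares named in the abstract, it omits control inputs and the LQR design, and the toy system, the `lift` dictionary, and all names are assumptions chosen for illustration.

```python
import numpy as np

def lift(x):
    """Hypothetical dictionary of observables: psi(x) = [x, x^2]."""
    x = np.asarray(x, dtype=float)
    return np.stack([x, x**2], axis=0)  # shape (2, n_samples)

def edmd(x_now, x_next):
    """Ordinary least-squares fit of K such that lift(x_next) ~= K @ lift(x_now)."""
    psi_now, psi_next = lift(x_now), lift(x_next)
    return psi_next @ np.linalg.pinv(psi_now)

# Toy scalar system x_{k+1} = 0.9 * x_k (an assumption, not a quadrotor model).
x_now = np.linspace(-1.0, 1.0, 21)
x_next = 0.9 * x_now
K = edmd(x_now, x_next)  # approximately diag(0.9, 0.81) for this toy system
```

For this toy system the lifted dynamics are exactly linear, so the fitted matrix recovers the scaling of each observable; a real quadrotor would need a richer dictionary and input terms.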

Submitted 25 June, 2024; originally announced June 2024.

arXiv:2406.17445 [pdf, other]

Copula-Based Estimation of Causal Effects in Multiple Linear and Path Analysis Models

Authors: Alam Ali, Ashok Kumar Pathak, Mohd Arshad, Ayyub Sheikhi

Abstract: Regression analysis is one of the most widely used statistical techniques, but it measures only the direct effect of independent variables on the dependent variable. Path analysis looks for both direct and indirect effects of independent variables and may overcome several hurdles associated with regression models. It utilizes one or more structural regression equations in the model, which are used to estimate the unknown parameters. The aim of this work is to study path analysis models when the endogenous (dependent) variable and exogenous (independent) variables are linked through elliptical copulas. Using well-organized numerical schemes, we investigate the performance of path models when direct and indirect effects are estimated by applying classical ordinary least squares and copula-based regression approaches in different scenarios. Finally, two real data applications are presented to demonstrate the performance of path analysis using the copula approach.

Submitted 25 June, 2024; originally announced June 2024.

Comments: 23 pages, 3 figures, 11 tables

MSC Class: 62H05; 62J05; 62F10

arXiv:2406.16099 [pdf, other]

Speech Representation Analysis based on Inter- and Intra-Model Similarities

Authors: Yassine El Kheir, Ahmed Ali, Shammur Absar Chowdhury

Abstract: Self-supervised models have revolutionized speech processing, achieving new levels of performance in a wide variety of tasks with limited resources. However, the inner workings of these models are still opaque. In this paper, we aim to analyze the encoded contextual representation of these foundation models based on their inter- and intra-model similarity, independent of any external annotation and task-specific constraint. We examine different SSL models, varying their training paradigm, contrastive (Wav2Vec2.0) and predictive (HuBERT), and their model sizes (base and large). We explore these models on different levels of localization/distributivity of information, including (i) individual neurons; (ii) layer representation; (iii) attention weights; and (iv) we compare the representations with their finetuned counterparts. Our results highlight that these models converge to similar representation subspaces but not to similar neuron-localized concepts (a concept represents a coherent fragment of knowledge, such as "a class containing certain objects as elements, where the objects have certain properties"). To facilitate further research, we have publicly released our code.

Submitted 23 June, 2024; originally announced June 2024.

Comments: 5 pages, Accepted to appear in ICASSP XAI-SA Workshop

arXiv:2406.15044 [pdf, other]

From Overfitting to Robustness: Quantity, Quality, and Variety Oriented Negative Sample Selection in Graph Contrastive Learning

Authors: Adnan Ali, Jinlong Li, Huanhuan Chen, Ali Kashif Bashir

Abstract: Graph contrastive learning (GCL) aims to contrast positive-negative counterparts to learn the node embeddings, whereas graph data augmentation methods are employed to generate these positive-negative samples. The variation, quantity, and quality of negative samples compared to positive samples play crucial roles in learning meaningful embeddings for node classification downstream tasks. Low variation, excessive quantity, and low quality of negative samples cause the model to overfit particular nodes, resulting in less robust models. To solve the overfitting problem in the GCL paradigm, this study proposes a novel Cumulative Sample Selection (CSS) algorithm that comprehensively considers the quality, variation, and quantity of negative samples. Initially, three negative sample pools are constructed: easy, medium, and hard negative samples, which contain 25%, 50%, and 25% of the total available negative samples, respectively. Then, 10% of the negative samples are selected from each of these three pools for training the model. After that, a decision agent module evaluates the model training results and decides whether to explore more negative samples from the three pools by increasing the ratio or to keep exploiting the current sampling ratio. The proposed algorithm is integrated into a proposed graph contrastive learning framework named NegAmplify. NegAmplify is compared with the SOTA methods on nine graph node classification datasets, achieving better node classification accuracy on seven of them, with up to 2.86% improvement.
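
The pool construction and initial selection described above can be sketched roughly as follows. This is an illustrative reading of the abstract, not the NegAmplify implementation: the hardness score, the function names, and the toy data are hypothetical, and the decision agent that later adjusts the sampling ratio is left out.

```python
import random

def build_pools(scored_negatives):
    """Split negatives into easy/medium/hard pools holding 25%/50%/25%.

    `scored_negatives` is a list of (node_id, hardness) pairs. The hardness
    score is a hypothetical stand-in, e.g. similarity to the anchor node.
    """
    ranked = [node for node, _ in sorted(scored_negatives, key=lambda p: p[1])]
    q1, q3 = len(ranked) // 4, 3 * len(ranked) // 4
    return {"easy": ranked[:q1], "medium": ranked[q1:q3], "hard": ranked[q3:]}

def sample_negatives(pools, percent=10, seed=0):
    """Draw `percent` percent of each pool without replacement."""
    rng = random.Random(seed)
    return {name: rng.sample(pool, len(pool) * percent // 100)
            for name, pool in pools.items()}

# Toy data: 200 negatives with made-up hardness scores.
negatives = [(i, i / 200) for i in range(200)]
pools = build_pools(negatives)
batch = sample_negatives(pools)
```

Raising `percent` on later rounds would correspond to the "explore" decision in the abstract, while keeping it fixed would correspond to "exploit".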

Submitted 21 June, 2024; originally announced June 2024.

arXiv:2406.12255 [pdf, other]

A Hopfieldian View-based Interpretation for Chain-of-Thought Reasoning

Authors: Lijie Hu, Liang Liu, Shu Yang, Xin Chen, Hongru Xiao, Mengdi Li, Pan Zhou, Muhammad Asif Ali, Di Wang

Abstract: Chain-of-Thought (CoT) holds a significant place in augmenting the reasoning performance of large language models (LLMs). While some studies focus on improving CoT accuracy through methods like retrieval enhancement, a rigorous explanation for why CoT achieves such success remains unclear. In this paper, we analyze CoT methods under two different settings by asking the following questions: (1) For zero-shot CoT, why does prompting the model with "let's think step by step" significantly impact its outputs? (2) For few-shot CoT, why does providing examples before questioning the model substantially improve its reasoning ability? To answer these questions, we conduct a top-down explainable analysis from the Hopfieldian view and propose a Read-and-Control approach for controlling the accuracy of CoT. Through extensive experiments on seven datasets for three different tasks, we demonstrate that our framework can decipher the inner workings of CoT, provide reasoning error localization, and control the model to arrive at the correct reasoning path.

Submitted 18 June, 2024; originally announced June 2024.

Comments: 21 pages

arXiv:2406.05912 [pdf]

BD-SAT: High-resolution Land Use Land Cover Dataset & Benchmark Results for Developing Division: Dhaka, BD

Authors: Ovi Paul, Abu Bakar Siddik Nayem, Anis Sarker, Amin Ahsan Ali, M Ashraful Amin, AKM Mahbubur Rahman

Abstract: Land Use Land Cover (LULC) analysis on satellite images using deep learning-based methods is significantly helpful in understanding the geography, socio-economic conditions, poverty levels, and urban sprawl in developing countries. Recent works involve segmentation with LULC classes such as farmland, built-up areas, forests, meadows, water bodies, etc. Training deep learning methods on satellite images requires large sets of images annotated with LULC classes. However, annotated data for developing countries are scarce due to a lack of funding, absence of dedicated residential/industrial/economic zones, a large population, and diverse building materials. BD-SAT provides a high-resolution dataset that includes pixel-by-pixel LULC annotations for Dhaka metropolitan city and surrounding rural/urban areas. Using a strict and standardized procedure, the ground truth is created using Bing satellite imagery with a ground spatial distance of 2.22 meters per pixel. A three-stage, well-defined annotation process has been followed with support from GIS experts to ensure the reliability of the annotations. We performed several experiments to establish benchmark results. The results show that the annotated BD-SAT is sufficient to train large deep learning models with adequate accuracy for five major LULC classes: forest, farmland, built-up areas, water bodies, and meadows.

Submitted 9 June, 2024; originally announced June 2024.

Comments: 26 pages, 15 figures and 12 tables

arXiv:2406.05716 [pdf, other]

Near or far: On determining the appropriate channel estimation strategy in cross-field communication

Authors: Simon Tarboush, Anum Ali, Tareq Y. Al-Naffouri

Abstract: The use of ultra-massive multiple-input multiple-output and high-frequency large bandwidth systems is likely in the next-generation wireless communication systems. In such systems, the user moves between near- and far-field regions, and consequently, the channel estimation will need to be carried out in the cross-field scenario. Channel estimation strategies have been proposed for both near- and far-fields, but in the cross-field problem, the first step is to determine whether the near- or far-field is applicable so that an appropriate channel estimation strategy can be employed. In this work, we propose using a hidden Markov model over an ensemble of region estimates to enhance the accuracy of selecting the actual region. The region indicators are calculated using the pair-wise power differences between received signals across the subarrays within an array-of-subarrays architecture. Numerical results show that the proposed method achieves a high success rate in determining the appropriate channel estimation strategy.
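
One way to picture a hidden Markov model over a sequence of noisy region estimates is the Viterbi decoder sketched below. The abstract does not say which inference procedure the authors use, so this is only an assumed illustration: the two-state model (0 = near field, 1 = far field), the probabilities `p_stay` and `p_correct`, and the input sequence are all hypothetical, and the power-difference region indicators are not modeled.

```python
import math

def viterbi_regions(estimates, p_stay=0.9, p_correct=0.8):
    """Most likely near(0)/far(1) sequence given noisy per-step region estimates."""
    log = math.log
    # trans[p][s]: log-probability of moving from region p to region s.
    trans = [[log(p_stay), log(1 - p_stay)],
             [log(1 - p_stay), log(p_stay)]]

    def emit(state, obs):
        # A raw estimate matches the true region with probability p_correct.
        return log(p_correct if state == obs else 1 - p_correct)

    score = [log(0.5) + emit(s, estimates[0]) for s in (0, 1)]
    back = []
    for obs in estimates[1:]:
        prev = score
        choice = [max((0, 1), key=lambda p, s=s: prev[p] + trans[p][s])
                  for s in (0, 1)]
        score = [prev[choice[s]] + trans[choice[s]][s] + emit(s, obs)
                 for s in (0, 1)]
        back.append(choice)

    # Trace the best path backwards from the best final state.
    state = max((0, 1), key=lambda s: score[s])
    path = [state]
    for choice in reversed(back):
        state = choice[state]
        path.append(state)
    return path[::-1]

noisy = [0, 0, 0, 1, 0, 0, 0, 1, 1, 1, 1]
smoothed = viterbi_regions(noisy)
```

With these assumed probabilities, the isolated far-field estimate at the fourth step is treated as noise, while the sustained run at the end is accepted as a genuine move to the far field.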

Submitted 9 June, 2024; originally announced June 2024.

arXiv:2406.01666 [pdf, other]

Feedback in Emerging extragAlactic Star clusTers, FEAST: JWST spots PAH destruction in NGC 628 during the emerging phase of star formation

Authors: Alex Pedrini, Angela Adamo, Daniela Calzetti, Arjan Bik, Benjamin Gregg, Sean T. Linden, Varun Bajaj, Jenna E. Ryon, Ahmad A. Ali, Giacomo Bortolini, Matteo Correnti, Bruce G. Elmegreen, Debra Meloy Elmegreen, John S. Gallagher, Kathryn Grasha, Robert A. Gutermuth, Kelsey E. Johnson, Jens Melinder, Matteo Messa, Göran Östlin, Elena Sabbi, Linda J. Smith, Monica Tosi, Helena Faustino Vieira

Abstract: We investigate the emergence phase of young star clusters in the nearby spiral galaxy NGC 628. We use JWST NIRCam and MIRI observations to create spatially resolved maps of the Pa$α$-1.87 $μ$m and Br$α$-4.05 $μ$m hydrogen recombination lines, as well as the 3.3 $μ$m and 7.7 $μ$m emission from polycyclic aromatic hydrocarbons (PAHs). We extract 953 compact HII regions and analyze the PAH emission and morphology at $\sim$10 pc scales in the associated photo-dissociation regions (PDRs). While HII regions remain compact, radial profiles help us to define three PAH morphological classes: compact ($\sim$ 42%), extended ($\sim$ 34%) and open ($\sim$ 24%). The majority of compact and extended PAH morphologies are associated with very young star clusters ($<$5 Myr), while open PAH morphologies are mainly associated with star clusters older than 3 Myr. We observe a general decrease in the 3.3 $μ$m and 7.7 $μ$m PAH band emission as a function of cluster age, while their ratio remains constant with age (out to 10 Myr) and across morphological classes. The recovered PAH$_{3.3 μ{\rm m}}$/PAH$_{7.7 μ{\rm m}}$ ratio is lower than values reported in the literature for reference models that consider neutral and ionized PAH populations and for analyses conducted at galactic scales. The 3.3 $μ$m and 7.7 $μ$m bands are typically associated with neutral and ionized PAHs, respectively. While we expected neutral PAHs to be suppressed in proximity of the ionizing source, the constant PAH$_{3.3 μ{\rm m}}$/PAH$_{7.7 μ{\rm m}}$ ratio would indicate that both families of molecules are disrupted at similar rates in proximity of the HII regions.

Submitted 26 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

Comments: 25 pages, 14 figures, 3 tables. Accepted for publication in ApJ. V2: Minor changes to Figures 7, 8, and 9, and to the text

arXiv:2406.00887 [pdf, ps, other]

Deep Reinforcement Learning for Sim-to-Real Policy Transfer of VTOL-UAVs Offshore Docking Operations

Authors: Ali M. Ali, Aryaman Gupta, Hashim A. Hashim

Abstract: This paper proposes a novel Reinforcement Learning (RL) approach for sim-to-real policy transfer of Vertical Take-Off and Landing Unmanned Aerial Vehicles (VTOL-UAVs). The proposed approach is designed for VTOL-UAV landing on offshore docking stations in maritime operations. VTOL-UAVs in maritime operations encounter limitations in their operational range, primarily stemming from constraints imposed by their battery capacity. The concept of autonomous landing on a charging platform presents an intriguing prospect for mitigating these limitations by facilitating battery charging and data transfer. However, current Deep Reinforcement Learning (DRL) methods exhibit drawbacks, including lengthy training times and modest success rates. In this paper, we tackle these concerns comprehensively by decomposing the landing procedure into a sequence of more manageable but analogous tasks: an approach phase and a landing phase. The proposed architecture utilizes a model-based control scheme for the approach phase, where the VTOL-UAV is approaching the offshore docking station. In the landing phase, DRL agents were trained offline to learn the optimal policy to dock on the offshore station. The Joint North Sea Wave Project (JONSWAP) spectrum model has been employed to create a wave model for each episode, enhancing policy generalization for sim-to-real transfer. A set of DRL algorithms has been tested through numerical simulations, including value-based and policy-based agents such as Deep Q-Networks (DQN) and Proximal Policy Optimization (PPO), respectively. The numerical experiments show that the PPO agent can learn complicated and efficient policies to land in uncertain environments, which in turn enhances the likelihood of successful sim-to-real transfer.

Submitted 2 June, 2024; originally announced June 2024.

arXiv:2405.16504 [pdf, other]

A Unified Implicit Attention Formulation for Gated-Linear Recurrent Sequence Models

Authors: Itamar Zimerman, Ameen Ali, Lior Wolf

Abstract: Recent advances in efficient sequence modeling have led to attention-free layers, such as Mamba, RWKV, and various gated RNNs, all featuring sub-quadratic complexity in sequence length and excellent scaling properties, enabling the construction of a new type of foundation model. In this paper, we present a unified view of these models, formulating such layers as implicit causal self-attention layers. The formulation includes most of their sub-components and is not limited to a specific part of the architecture. The framework compares the underlying mechanisms on similar grounds for different layers and provides a direct means for applying explainability methods. Our experiments show that our attention matrices and attribution method outperform an alternative, more limited formulation that was recently proposed for Mamba. For the other architectures, for which our method is the first to provide such a view, our method is effective and competitive in the relevant metrics compared to the results obtained by state-of-the-art transformer explainability methods. Our code is publicly available.

Submitted 26 May, 2024; originally announced May 2024.

ACM Class: F.2.2; I.2.7

arXiv:2405.16294 [pdf, other]

Nonclassical characteristics in spin-1/2 Heisenberg XYZ model with added DM and KSEA interactions under sinusoidal magnetic field: Hierarchy of quantum resources

Authors: A. Ali, S. Al-Kuwari, M. T. Rahim, M. Ghominejad, H. Ali, S. Haddadi

Abstract: We investigate the behavior of various measures of quantum coherence and quantum correlation in the spin-1/2 Heisenberg XYZ model with added Dzyaloshinsky-Moriya (DM) and Kaplan–Shekhtman–Entin-Wohlman–Aharony (KSEA) interactions in a thermal regime described by a Gibbs density operator. We aim to understand the restricted hierarchical classification of different quantum resources, where quantum coherence $\supseteq$ quantum discord $\supseteq$ quantum entanglement $\supseteq$ quantum steering $\supseteq$ Bell nonlocality. In order to enhance quantum coherence, quantum correlation, and fidelity of teleportation, our analysis encompasses the effects of independently provided sinusoidal magnetic field control as well as DM and KSEA interactions on the considered system. The results reveal that enhancing the entanglement or quantum correlation of the channel does not always guarantee successful teleportation or even an improvement in teleportation fidelity. Thus, the relationship between teleportation fidelity and the channel's underlying quantum properties is intricate. Our study provides valuable insights into the complex interplay of quantum coherence and correlation hierarchy, offering potential applications for quantum communication and information processing technologies.

Submitted 25 May, 2024; originally announced May 2024.

Comments: 11 pages, 4 figures. All comments are welcome

arXiv:2405.15820 [pdf]

Concurrent Multiphysics and Multiscale Topology Optimization for Lightweight Laser-Driven Porous Actuator Systems

Authors: Musaddiq Al Ali, Masatoshi Shimoda

Abstract: In this research, multi-physics topology optimization is employed to achieve the detailed design of a lightweight porous linear actuation mechanism that harnesses energy through laser activation. A multiscale topology optimization methodology is introduced for micro- and macroscale design, considering energy dissipation via heat convection and radiation. This investigation meticulously considers the impact of heat dissipation mechanisms, including thermal conduction, convection, and radiation. Through various numerical cases, we systematically explore the influence of micro-scale considerations on porous design and understand the effects on the topology optimization process by incorporating various microstructural systems. The results demonstrate that porous actuator designs exhibit superior performance compared to solid actuator designs. This study contributes to advancing the understanding of multiscale effects in topology optimization, paving the way for more efficient and lightweight designs in the field of laser-activated porous actuators.

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2405.15452 [pdf, other]

Leveraging Logical Rules in Knowledge Editing: A Cherry on the Top

Authors: Keyuan Cheng, Muhammad Asif Ali, Shu Yang, Gang Lin, Yuxuan Zhai, Haoyang Fei, Ke Xu, Lu Yu, Lijie Hu, Di Wang

Abstract: Multi-hop Question Answering (MQA) under knowledge editing (KE) is a key challenge in Large Language Models (LLMs). While best-performing solutions in this domain use a plan and solve paradigm to split a question into sub-questions followed by response generation, we claim that this approach is sub-optimal as it fails for hard to decompose questions, and it does not explicitly cater to correlated knowledge updates resulting as a consequence of knowledge edits. This has a detrimental impact on the overall consistency of the updated knowledge. To address these issues, in this paper, we propose a novel framework named RULE-KE, i.e., RULE based Knowledge Editing, which is a cherry on the top for augmenting the performance of all existing MQA methods under KE. Specifically, RULE-KE leverages rule discovery to discover a set of logical rules. Then, it uses these discovered rules to update knowledge about facts highly correlated with the edit. Experimental evaluation using existing and newly curated datasets (i.e., RKE-EVAL) shows that RULE-KE helps augment both performances of parameter-based and memory-based solutions up to 92% and 112.9%, respectively.

Submitted 27 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

Comments: 18 pages

arXiv:2405.15342 [pdf, other]

doi 10.1051/epjconf/202429507026

Implementation of New Security Features in CMSWEB Kubernetes Cluster at CERN

Authors: Aamir Ali, Muhammad Imran, Valentin Kuznetsov, Spyridon Trigazis, Aroosha Pervaiz, Andreas Pfeiffer, Marco Mascheroni

Abstract: The CMSWEB cluster is pivotal to the activities of the Compact Muon Solenoid (CMS) experiment, as it hosts critical services required for the operational needs of the CMS experiment. The security of these services and the corresponding data is crucial to CMS. Any malicious attack can compromise the availability of our services. Therefore, it is important to construct a robust security infrastructure. In this work, we discuss new security features introduced to the CMSWEB Kubernetes ("k8s") cluster. The new features include the implementation of network policies, deployment of Open Policy Agent (OPA), enforcement of OPA policies, and the integration of Vault. The network policies act as an inside-the-cluster firewall to limit the network communication between the pods to the minimum necessary, and their dynamic nature allows us to work with microservices. The OPA validates the objects against some custom-defined policies during create, update, and delete operations to further enhance security. Without recompiling or changing the configuration of the Kubernetes API server, it can apply customized policies on Kubernetes objects, and its audit functionality enables us to detect pre-existing conflicts and issues. Although Kubernetes incorporates the concepts of secrets, they are only base64 encoded and are not dynamically configured. This is where Vault comes into play: Vault dynamically secures, stores, and tightly controls access to sensitive data. This way, the secret information is encrypted, secured, and centralized, making it more scalable and easier to manage. Thus, the implementation of these three security features corroborates the enhanced security and reliability of the CMSWEB Kubernetes infrastructure.

Submitted 24 May, 2024; originally announced May 2024.

Comments: 26TH INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY & NUCLEAR PHYSICS - 2023

arXiv:2405.14242 [pdf]

M2ANET: Mobile Malaria Attention Network for efficient classification of plasmodium parasites in blood cells

Authors: Salam Ahmed Ali, Peshraw Salam Abdulqadir, Shan Ali Abdullah, Haruna Yunusa

Abstract: Malaria is a life-threatening infectious disease caused by Plasmodium parasites, which poses a significant public health challenge worldwide, particularly in tropical and subtropical regions. Timely and accurate detection of malaria parasites in blood cells is crucial for effective treatment and control of the disease. In recent years, deep learning techniques have demonstrated remarkable success in medical image analysis tasks, offering promising avenues for improving diagnostic accuracy, with limited studies on hybrid mobile models due to the complexity of combining two distinct models and the significant memory demand of self-attention mechanism especially for edge devices. In this study, we explore the potential of designing a hybrid mobile model for efficient classification of plasmodium parasites in blood cell images. Therefore, we present M2ANET (Mobile Malaria Attention Network). The model integrates MBConv3 (MobileNetV3 blocks) for efficient capturing of local feature extractions within blood cell images and a modified global-MHSA (multi-head self-attention) mechanism in the latter stages of the network for capturing global context. Through extensive experimentation on benchmark, we demonstrate that M2ANET outperforms some state-of-the-art lightweight and mobile networks in terms of both accuracy and efficiency. Moreover, we discuss the potential implications of M2ANET in advancing malaria diagnosis and treatment, highlighting its suitability for deployment in resource-constrained healthcare settings. The development of M2ANET represents a significant advancement in the pursuit of efficient and accurate malaria detection, with broader implications for medical image analysis and global healthcare initiatives.

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2405.12488 [pdf, other]

First joint oscillation analysis of Super-Kamiokande atmospheric and T2K accelerator neutrino data

Authors: Super-Kamiokande, T2K collaborations: S. Abe, K. Abe, N. Akhlaq, R. Akutsu, H. Alarakia-Charles, A. Ali, Y. I. Alj Hakim, S. Alonso Monsalve, S. Amanai, C. Andreopoulos, L. H. V. Anthony, M. Antonova, S. Aoki, K. A. Apte, T. Arai, T. Arihara, S. Arimoto, Y. Asada, R. Asaka, Y. Ashida, E. T. Atkin, N. Babu, et al. (524 additional authors not shown)

Abstract: The Super-Kamiokande and T2K collaborations present a joint measurement of neutrino oscillation parameters from their atmospheric and beam neutrino data. It uses a common interaction model for events overlapping in neutrino energy and correlated detector systematic uncertainties between the two datasets, which are found to be compatible. Using 3244.4 days of atmospheric data and a beam exposure of $19.7(16.3) \times 10^{20}$ protons on target in (anti)neutrino mode, the analysis finds a 1.9$σ$ exclusion of CP-conservation (defined as $J_{CP}=0$) and a preference for the normal mass ordering.

Submitted 21 May, 2024; originally announced May 2024.

Comments: 10 pages, 3 figures

arXiv:2405.11624 [pdf, other]

On Generalized Transmuted Lifetime Distribution

Authors: Alok Kumar Pandey, Alam Ali, Ashok Kumar Pathak

Abstract: This article presents a new class of generalized transmuted lifetime distributions which includes a large number of lifetime distributions as sub-families. Several important mathematical quantities such as density function, distribution function, quantile function, moments, moment generating function, stress-strength reliability function, order statistics, Rényi and q-entropy, residual and reversed residual life function, and cumulative information generating function are obtained. The methods of maximum likelihood, ordinary least square, weighted least square, Cramér-von Mises, Anderson Darling, and Right-tail Anderson Darling are considered to estimate the model parameters in a general way. Further, well-organized Monte Carlo simulation experiments have been performed to observe the behavior of the estimators. Finally, two real data sets have also been analyzed to demonstrate the effectiveness of the proposed distribution in real-life modeling.

Submitted 19 May, 2024; originally announced May 2024.

Comments: 26 pages, 8 figures

MSC Class: 60E05; 62F10; 62E15; 65C05; 33B20

arXiv:2405.11608 [pdf, other]

Full private delegated quantum computing tailored from user to industry

Authors: Alejandro Mata Ali, Adriano Mauricio Lusso, Edgar Mencia

Abstract: In this paper, we present a set of private and secure delegated quantum computing protocols and techniques tailored to user-level and industry-level use cases, depending on the computational resources available to the client, the specific privacy needs required, and the type of algorithm. Our protocols are presented at a high level as they are independent of the particular algorithm used for such encryption and decryption processes. Additionally, we propose a method to verify the correct execution of operations by the external server.

Submitted 24 May, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

Comments: 15 pages, 9 figures

MSC Class: 81P68

ACM Class: E.3

arXiv:2405.07958 [pdf, other]

Record-based transmuted unit omega distribution: different methods of estimation and applications

Authors: Ashok Kumar Pathak, Mohd. Arshad, Alok Kumar Pandey, Alam Ali

Abstract: Dombi et al. (2019) introduced a three-parameter omega distribution and showed that its asymptotic distribution is the Weibull model. We propose a new record-based transmuted generalization of the unit omega distribution following the approach of Balakrishnan and He (2021). We call it the RTUOMG distribution. We derive expressions for some statistical quantities, such as the probability density function, distribution, hazard function, quantile function, moments, incomplete moments, inverted moments, moment generating function, Lorenz curve, and Bonferroni curve of the proposed distribution. The numerical values of various measures of central tendency and coefficient of skewness and kurtosis are also presented. Concepts of stochastic ordering and some results related to ordered statistics of the RTUOMG distribution are discussed. The parameters of the RTUOMG distribution are estimated using five distinct estimators. Additionally, Monte Carlo simulations are performed to assess the performance of these estimators. Finally, two real data sets are analyzed to demonstrate the utility of the RTUOMG distribution.

Submitted 13 May, 2024; originally announced May 2024.

Comments: 26 pages, 14 figures

MSC Class: 60E05; 62F10; 62E15; 65C05; 33C05

arXiv:2405.06579 [pdf, other]

Terahertz Antenna Impedance Matched to a Graphene Photodetector

Authors: François Joint, Kunyi Zhang, Jayaprakash Poojali, Daniel Lewis, Michael Pedowitz, Brendan Jordan, Gyan Prakash, Ashraf Ali, Kevin Daniels, Rachael L. Myers-Ward, Thomas E. Murphy, Howard D. Drew

Abstract: Developing low-power, high-sensitivity photodetectors for the terahertz (THz) band that operate at room temperature is an important challenge in optoelectronics. In this study, we introduce a photo-thermal-electric (PTE) effect detector based on quasi-free standing bilayer graphene (BLG) on a silicon carbide (SiC) substrate, designed for the THz frequency range. Our detector's performance hinges on a quasi-optical coupling scheme, which integrates an aspherical silicon lens, to optimize impedance matching between the THz antenna and the graphene p-n junction. At room temperature, we achieved a noise equivalent power (NEP) of less than 300 $pW/\sqrt{Hz}$. Through an impedance matching analysis, we coupled a planar antenna with a graphene p-n junction, inserted in parallel to the nano-gap of the antenna, via two coupling capacitors. By adjusting the capacitors and the antenna arm length, we tailored the antenna's maximum infrared power absorption to specific frequencies. The sensitivity, spectral properties, and scalability of our material make it an ideal candidate for future development of far-infrared detectors operating at room temperature.

Submitted 10 May, 2024; originally announced May 2024.

Comments: 21 pages, 4 figures

arXiv:2405.05269 [pdf, other]

doi 10.1016/j.aop.2024.169717

Modified gravity/entropic gravity correspondence due to graviton mass

Authors: Kimet Jusufi, Ahmed Farag Ali, Abdelrahman Yasser, Nader Inan, A. Y. Ellithi

Abstract: Some time ago, it was suggested that gravitons can acquire mass in the process of spontaneous symmetry breaking of diffeomorphisms through the condensation of scalar fields [Chamseddine and Mukhanov, JHEP, 2010]. Taking this possibility into account, in the present paper, first we show how the graviton mass intricately reshapes the gravitational potential akin to a Yukawa-like potential at large distances. Notably, this long-range force modifies Newton's law at large distances and might explain the phenomena of dark matter. The most important finding in the present paper is the derivation of a modified Newton's law of gravity by modifying Verlinde's entropic force relation due to the graviton contribution. The graviton contribution to the entropy basically measures the correlation of graviton and matter fields, which then reproduces the Bekenstein-Hawking entropy at the horizon. This result shows the dual description of gravity: in the language of quantum information and entropy, gravity can be viewed as an entropic force; however, in terms of particles and fields, it can be viewed as a long-range force. Further, we have recovered the corrected Einstein field equations as well as the $Λ$CDM where dark matter emerges as an apparent effect.

Submitted 10 June, 2024; v1 submitted 25 April, 2024; originally announced May 2024.

Comments: Accepted by Annals of Physics

arXiv:2405.02578 [pdf]

Mixat: A Data Set of Bilingual Emirati-English Speech

Authors: Maryam Al Ali, Hanan Aldarmaki

Abstract: This paper introduces Mixat: a dataset of Emirati speech code-mixed with English. Mixat was developed to address the shortcomings of current speech recognition resources when applied to Emirati speech, and in particular, to bilingual Emirati speakers who often mix and switch between their local dialect and English. The data set consists of 15 hours of speech derived from two public podcasts featuring native Emirati speakers, one of which is in the form of conversations between the host and a guest. Therefore, the collection contains examples of Emirati-English code-switching in both formal and natural conversational contexts. In this paper, we describe the process of data collection and annotation, and describe some of the features and statistics of the resulting data set. In addition, we evaluate the performance of pre-trained Arabic and multi-lingual ASR systems on our dataset, demonstrating the shortcomings of existing models on this low-resource dialectal Arabic, and the additional challenge of recognizing code-switching in ASR. The dataset will be made publicly available for research use.

Submitted 4 May, 2024; originally announced May 2024.

Comments: SIGUL 2024

arXiv:2405.02563 [pdf, other]

Deep Representation Learning-Based Dynamic Trajectory Phenotyping for Acute Respiratory Failure in Medical Intensive Care Units

Authors: Alan Wu, Tilendra Choudhary, Pulakesh Upadhyaya, Ayman Ali, Philip Yang, Rishikesan Kamaleswaran

Abstract: Sepsis-induced acute respiratory failure (ARF) is a serious complication with a poor prognosis. This paper presents a deep representation learning-based phenotyping method to identify distinct groups of clinical trajectories of septic patients with ARF. For this retrospective study, we created a dataset from electronic medical records (EMR) consisting of data from sepsis patients admitted to medical intensive care units who required at least 24 hours of invasive mechanical ventilation at a quaternary care academic hospital in southeast USA for the years 2016-2021. A total of N=3349 patient encounters were included in this study. Clustering Representation Learning on Incomplete Time Series Data (CRLI) algorithm was applied to a parsimonious set of EMR variables in this data set. To validate the optimal number of clusters, the K-means algorithm was used in conjunction with dynamic time warping. Our model yielded four distinct patient phenotypes that were characterized as liver dysfunction/heterogeneous, hypercapnia, hypoxemia, and multiple organ dysfunction syndrome by a critical care expert. A Kaplan-Meier analysis to compare the 28-day mortality trends exhibited significant differences (p < 0.005) between the four phenotypes. The study demonstrates the utility of our deep representation learning-based approach in unraveling phenotypes that reflect the heterogeneity in sepsis-induced ARF in terms of different mortality outcomes and severity. These phenotypes might reveal important clinical insights into an effective prognosis and tailored treatment strategies.

Submitted 4 May, 2024; originally announced May 2024.

Comments: 9 pages

arXiv:2405.01173 [pdf, ps, other]

Signature decay modes of the compact doubly-heavy tetraquarks $T_{bb\bar{u} \bar{d}}$ and $T_{bc\bar{u} \bar{d}}$

Authors: Ahmed Ali, Ishtiaq Ahmed, Muhammad Jamil Aslam

Abstract: Based on the expectations that the lowest-lying doubly-bottom tetraquark $T_{bb\bar u \bar d}$ ($J^P = 1^+$) and the bottom-charm tetraquark $T_{bc\bar u \bar d}$ ($J^P = 0^+$) are stable against strong and electromagnetic decays, we work out a number of semileptonic and non-leptonic weak decays of these hadrons, making use of the heavy quark symmetry. In doing this, we concentrate on the exclusive decays involving also tetraquarks in the final states, i.e., transitions such as $T_{bb\bar u \bar d} \to T_{bc\bar u \bar d}\, (\ell^- ν_\ell,\, h^-)$ and $T_{bc\bar u \bar d} \to T_{cc\bar u \bar d}\, (\ell^- ν_\ell,\, h^-)$, where $h^- = π^-, ρ^-, a_1^-, D^-_s, D^{*-}_s$. So far, only the $J^P = 1^+$ tetraquark $T_{cc\bar u \bar d}$ has been discovered, which we identify with the $I = 0$ $T_{cc}^+$ object, compatible with $J^P = 1^+$ and having the pole mass relative to the $D^{*+} D^0$ mass threshold and decay widths $δm = M (T_{cc}^+) - ( M (D^{*+}) + M (D^0) ) = - 360 \pm 40^{+4}_{-0}$ keV and $Γ(T_{cc}^+) = 48^{+2}_{-14}$ keV. Experimental discoveries of the transitions worked out here, and related ones involving doubly-heavy baryons, will quantify the diquark-antidiquark component of these tetraquarks.

Submitted 6 June, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

Comments: 14 pages, 2 figures; Some more decay modes and references included and numerical errors corrected. Version accepted for publication in the Physics Letters B

Report number: DESY-24-061

arXiv:2404.17645 [pdf, other]

Técnicas Quantum-Inspired en Tensor Networks para Contextos Industriales

Authors: Alejandro Mata Ali, Iñigo Perez Delgado, Aitor Moreno Fdez. de Leceta

Abstract: In this paper we present a study of the applicability and feasibility of quantum-inspired algorithms and techniques in tensor networks for industrial environments and contexts, with a compilation of the available literature and an analysis of the use cases that may be affected by such methods. In addition, we explore the limitations of such techniques in order to determine their potential scalability.

Submitted 8 March, 2024; originally announced April 2024.

Comments: 10 pages, in Spanish language, 5 figures

ACM Class: A.1; G.1.3; G.2.1; G.0; I.0; J.0

arXiv:2404.15693 [pdf]

DFT mediated X2AuYZ6 (X= Cs, Rb; Z= Cl, Br, I) double Perovskites for photovoltaic and wasted heat management device applications

Authors: S. Mahmud, M. A. Ali, M. M. Hossain, M. M. Uddin

Abstract: This paper presents the phase stability, opto-electronic and thermo-electric behavior of X2AuYZ6 (X = Cs, Rb; Z = Cl/Br/I) double perovskite halides by using the DFT method. The compounds belong to the cubic arrangement and are verified by the tolerance and octahedral factor. Formation enthalpy and binding energy meet the requirements of structural stability. The ductility behavior was also confirmed by the Cauchy pressure, Pugh's ratio, and Poisson's ratio. The positive frequency of phonon dispersion, except for the Rb2AuYI6 compound, shows the dynamical stability, and the negative formation energy of each identified competing phase confirms the thermo-dynamic equilibrium of all compounds. The band gap values of 2.85(2.91), 2.35(2.40), and 1.74(1.78) eV of Cs2AuYZ6 (Rb2AuYZ6) (Z = Cl, Br, I) double perovskites have been explored in the context of optoelectronic properties, and the results show that these materials might be useful in such devices. The spectral optical response covers the visible-to-UV area, which governs the solar cell and thermo-electric device applications. A comprehensive study of thermo-electric properties such as the thermal conductivity (electrical and electronic part), carrier concentration, thermo-power, and figure of merit was also observed. The investigated compounds [Cs (Rb)-based] exhibit ZT values of 0.51(0.55), 0.53(0.62), and 0.58(0.75) at room temperature with Cl, Br, and I respectively. Additional routine work was also done on the thermo-mechanical characteristics. These studies provide in-depth knowledge of these materials in preparation for their future use.

Submitted 24 April, 2024; originally announced April 2024.

arXiv:2404.12042 [pdf, other]

Exploring Boundaries and Intensities in Offensive and Hate Speech: Unveiling the Complex Spectrum of Social Media Discourse

Authors: Abinew Ali Ayele, Esubalew Alemneh Jalew, Adem Chanie Ali, Seid Muhie Yimam, Chris Biemann

Abstract: The prevalence of digital media and evolving sociopolitical dynamics have significantly amplified the dissemination of hateful content. Existing studies mainly focus on classifying texts into binary categories, often overlooking the continuous spectrum of offensiveness and hatefulness inherent in the text. In this research, we present an extensive benchmark dataset for Amharic, comprising 8,258 tweets annotated for three distinct tasks: category classification, identification of hate targets, and rating offensiveness and hatefulness intensities. Our study highlights that a considerable majority of tweets belong to the less offensive and less hate intensity levels, underscoring the need for early interventions by stakeholders. The prevalence of ethnic and political hatred targets, with significant overlaps in our dataset, emphasizes the complex relationships within Ethiopia's sociopolitical landscape. We build classification and regression models and investigate the efficacy of models in handling these tasks. Our results reveal that hate and offensive speech cannot be addressed by a simplistic binary classification, instead manifesting as variables across a continuous range of values. The Afro-XLMR-large model exhibits the best performances achieving F1-scores of 75.30%, 70.59%, and 29.42% for the category, target, and regression tasks, respectively. The 80.22% correlation coefficient of the Afro-XLMR-large model indicates strong alignments.

Submitted 18 April, 2024; originally announced April 2024.

arXiv:2404.11277 [pdf, other]

Quantum-inspired Techniques in Tensor Networks for Industrial Contexts

Authors: Alejandro Mata Ali, Iñigo Perez Delgado, Aitor Moreno Fdez. de Leceta

Abstract: In this paper we present a study of the applicability and feasibility of quantum-inspired algorithms and techniques in tensor networks for industrial environments and contexts, with a compilation of the available literature and an analysis of the use cases that may be affected by such methods. In addition, we explore the limitations of such techniques in order to determine their potential scalability.

Submitted 17 April, 2024; originally announced April 2024.

Comments: 13 pages, 5 figures

MSC Class: 81P68; 15A69

ACM Class: G.1.3; G.2.1; I.2; I.4

arXiv:2404.11022 [pdf, other]

The Radius Distribution of M dwarf-hosted Planets and its Evolution

Authors: Eric Gaidos, Aleezah Ali, Adam L. Kraus, Jason F. Rowe

Abstract: M dwarf stars are not only the most promising hosts for detection and characterization of small and potentially habitable planets, they provide leverage relative to solar-type stars to test models of planet formation and evolution. Using Gaia astrometry, adaptive optics imaging, and calibrated gyrochronologic relations to estimate stellar properties, filter binaries, and assign ages, we refined the radii of 179 transiting planets orbiting 119 single late K- and early M-type stars detected by the Kepler mission, and assigned stellar rotation-based ages to 115 of these. We constructed the radius distribution of <4R$_{\oplus}$ planets and assessed its evolution with time. As for solar-type stars, the inferred distribution contains distinct populations of "super-Earths" (at ~1.3R$_{\oplus}$) and "sub-Neptunes" (at ~2.2R$_{\oplus}$) separated by a gap or "valley" at $\approx$1.7R$_{\oplus}$ that has a period dependence that is significantly weaker (power law index of -0.026$^{+0.026}_{-0.017}$) than for solar-type stars. Sub-Neptunes are largely absent at short periods ($<$2 days) and high irradiance, a feature analogous to the "Neptune desert" observed around solar-type stars. The relative number of sub-Neptunes to super-Earths declines between the younger and older halves of the sample (median age 3.8 Gyr), although the formal significance is low ($p = 0.06$) because of the small sample size. The decline in sub-Neptunes appears to be more pronounced at long orbital periods vs. short periods; this is not due to detection bias and could indicate that these objects are inflated by a mechanism that operates at elevated irradiance, e.g. a runaway water greenhouse augmented by H/He.

Submitted 16 April, 2024; originally announced April 2024.

Comments: Submitted to MNRAS on 2023 August 12, re-submitted with moderate revisions on 2024 February 12

arXiv:2404.10018 [pdf, other]

A Linear MPC with Control Barrier Functions for Differential Drive Robots

Authors: Ali Mohamed Ali, Chao Shen, Hashim A. Hashim

Abstract: The need for fully autonomous mobile robots has surged over the past decade, with the imperative of ensuring safe navigation in a dynamic setting emerging as a primary challenge impeding advancements in this domain. In this paper, a Safety Critical Model Predictive Control based on Dynamic Feedback Linearization tailored to the application of differential drive robots with two wheels is proposed to generate control signals that result in obstacle-free paths. A barrier function introduces a safety constraint to the optimization problem of the Model Predictive Control (MPC) to prevent collisions. Due to the intrinsic nonlinearities of the differential drive robots, computational complexity arises when implementing a Nonlinear Model Predictive Control (NMPC). To facilitate the real-time implementation of the optimization problem and to accommodate the underactuated nature of the robot, a combination of Linear Model Predictive Control (LMPC) and Dynamic Feedback Linearization (DFL) is proposed. The MPC problem is formulated on a linear equivalent model of the differential drive robot rendered by the DFL controller. The analysis of the closed-loop stability and recursive feasibility of the proposed control design is discussed. Numerical experiments illustrate the robustness and effectiveness of the proposed control synthesis in avoiding obstacles with respect to the benchmark of using Euclidean distance constraints.
Keywords: Model Predictive Control, MPC, Autonomous Ground Vehicles, Nonlinearity, Dynamic Feedback Linearization, Optimal Control, Differential Robots.

Submitted 14 April, 2024; originally announced April 2024.

Comments: Accepted IET Control Theory & Applications. arXiv admin note: text overlap with arXiv:2404.09320

arXiv:2404.09920 [pdf, other]

Combined Pre-Supernova Alert System with KamLAND and Super-Kamiokande

Authors: KamLAND, Super-Kamiokande Collaborations: Seisho Abe, Minori Eizuka, Sawako Futagi, Azusa Gando, Yoshihito Gando, Shun Goto, Takahiko Hachiya, Kazumi Hata, Koichi Ichimura, Sei Ieki, Haruo Ikeda, Kunio Inoue, Koji Ishidoshiro, Yuto Kamei, Nanami Kawada, Yasuhiro Kishimoto, Masayuki Koga, Maho Kurasawa, Tadao Mitsui, Haruhiko Miyake, Daisuke Morita, Takeshi Nakahata, et al. (290 additional authors not shown)

Abstract: Preceding a core-collapse supernova, various processes produce an increasing amount of neutrinos of all flavors characterized by mounting energies from the interior of massive stars. Among them, the electron antineutrinos are potentially detectable by terrestrial neutrino experiments such as KamLAND and Super-Kamiokande via inverse beta decay interactions. Once these pre-supernova neutrinos are observed, an early warning of the upcoming core-collapse supernova can be provided. In light of this, KamLAND and Super-Kamiokande, both located in the Kamioka mine in Japan, have been monitoring pre-supernova neutrinos since 2015 and 2021, respectively. Recently, we performed a joint study between KamLAND and Super-Kamiokande on pre-supernova neutrino detection. A pre-supernova alert system combining the KamLAND detector and the Super-Kamiokande detector was developed and put into operation, which can provide a supernova alert to the astrophysics community. Fully leveraging the complementary properties of these two detectors, the combined alert is expected to resolve a pre-supernova neutrino signal from a 15 M$_{\odot}$ star within 510 pc of the Earth, at a significance level corresponding to a false alarm rate of no more than 1 per century. For a Betelgeuse-like model with optimistic parameters, it can provide early warnings up to 12 hours in advance.

Submitted 1 July, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

Comments: Resubmitted to ApJ. 22 pages, 16 figures, for more information about the combined pre-supernova alert system, see https://www.lowbg.org/presnalarm/

arXiv:2404.09320 [pdf, other]

MPC Based Linear Equivalence with Control Barrier Functions for VTOL-UAVs

Authors: Ali Mohamed Ali, Hashim A. Hashim, Chao Shen

Abstract: In this work, we propose a cascaded scheme of linear Model Predictive Control (MPC) based on Control Barrier Functions (CBF) with Dynamic Feedback Linearization (DFL) for Vertical Take-off and Landing (VTOL) Unmanned Aerial Vehicles (UAVs). CBF is a tool that allows enforcement of forward invariance of a set using Lyapunov-like functions to ensure safety. The first control synthesis that employed CBF was based on a Quadratic Program (QP) that modifies the existing controller to satisfy the safety requirements. However, the CBF-QP-based controllers lead to longer detours and undesirable transient performance. Recent contributions utilize the framework of MPC, benefiting from the prediction capabilities and constraints imposed on the state and control inputs. Due to the intrinsic nonlinearities of the dynamics of robotics systems, all the existing MPC-CBF solutions rely on nonlinear MPC formulations or operate on less accurate linear models. In contrast, our novel solution unlocks the benefits of linear MPC-CBF while considering the full underactuated dynamics without any linear approximations. The cascaded scheme converts the problem of safe VTOL-UAV navigation to a Quadratic Constraint Quadratic Programming (QCQP) problem solved efficiently by off-the-shelf solvers. The closed-loop stability and recursive feasibility are proved along with numerical simulations showing the effective and robust solutions.
Keywords: Unmanned Aerial Vehicles, Vertical Take-off and Landing, Model Predictive Control, MPC, Nonlinearity, Dynamic Feedback Linearization, Optimal Control.

Submitted 14 April, 2024; originally announced April 2024.

Comments: The 2024 IEEE American Control Conference (ACC)

arXiv:2404.08725 [pdf, other]

Development of a data overflow protection system for Super-Kamiokande to maximize data from nearby supernovae

Authors: M. Mori, K. Abe, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, S. Miki, S. Mine, M. Miura, S. Moriyama, Y. Nakano, M. Nakahata, S. Nakayama, Y. Noguchi, K. Okamoto, K. Sato, H. Sekiya, H. Shiba, K. Shimizu, et al. (230 additional authors not shown)

Abstract: Neutrinos from very nearby supernovae, such as Betelgeuse, are expected to generate more than ten million events over 10\,s in Super-Kamiokande (SK). At such large event rates, the buffers of the SK analog-to-digital conversion board (QBEE) will overflow, causing random loss of data that is critical for understanding the dynamics of the supernova explosion mechanism. In order to solve this problem, two new DAQ modules were developed to aid in the observation of very nearby supernovae. The first of these, the SN module, is designed to save only the number of hit PMTs during a supernova burst and the second, the Veto module, prescales the high rate neutrino events to prevent the QBEE from overflowing based on information from the SN module. In the event of a very nearby supernova, these modules allow SK to reconstruct the time evolution of the neutrino event rate from beginning to end using both QBEE and SN module data. This paper presents the development and testing of these modules together with an analysis of supernova-like data generated with a flashing laser diode. We demonstrate that the Veto module successfully prevents DAQ overflows for Betelgeuse-like supernovae as well as the long-term stability of the new modules. During normal running the Veto module is found to issue DAQ vetos a few times per month resulting in a total dead time less than 1\,ms, and does not influence ordinary operations. Additionally, using simulation data we find that supernovae closer than 800~pc will trigger the Veto module, resulting in a prescaling of the observed neutrino data.

Submitted 12 April, 2024; originally announced April 2024.

Comments: 28 pages, 18 figures. Submitted to PTEP

arXiv:2404.05916 [pdf, other]

Prompt-driven Universal Model for View-Agnostic Echocardiography Analysis

Authors: Sekeun Kim, Hui Ren, Peng Guo, Abder-Rahman Ali, Patrick Zhang, Kyungsang Kim, Xiang Li, Quanzheng Li

Abstract: Echocardiography segmentation for cardiac analysis is time-consuming and resource-intensive due to the variability in image quality and the necessity to process scans from various standard views. While current automated segmentation methods in echocardiography show promising performance, they are trained on specific scan views to analyze corresponding data. However, this solution has a limitation as the number of required models increases with the number of standard views. To address this, in this paper, we present a prompt-driven universal method for view-agnostic echocardiography analysis. Considering the domain shift between standard views, we first introduce a method called prompt matching, aimed at learning prompts specific to different views by matching prompts and querying input embeddings using a pre-trained vision model. Then, we utilized a pre-trained medical language model to align textual information with pixel data for accurate segmentation. Extensive experiments on three standard views showed that our approach significantly outperforms the state-of-the-art universal methods and achieves comparable or even better performances over the segmentation model trained and tested on the same views.

Submitted 8 April, 2024; originally announced April 2024.

arXiv:2404.03872 [pdf, ps, other]

Graviton mass due to dark energy as a superconducting medium: theoretical and phenomenological aspects

Authors: Nader Inan, Ahmed Farag Ali, Kimet Jusufi, Abdelrahman Yasser

Abstract: It is well known that the cosmological constant term in the Einstein field equations can be interpreted as a stress tensor for dark energy. This stress tensor is formally analogous to an elastic constitutive equation in continuum mechanics. As a result, the cosmological constant leads to a "shear modulus" and "bulk modulus" affecting all gravitational fields in the universe. The form of the constitutive equation is also analogous to the London constitutive equation for a superconductor. Treating dark energy as a type of superconducting medium for gravitational waves leads to a Yukawa-like gravitational potential and a massive graviton within standard General Relativity. We discuss a number of resulting phenomenological aspects such as a screening length scale that can also be used to describe the effects generally attributed to dark matter. In addition, we find a gravitational wave plasma frequency, index of refraction, and impedance. The expansion of the universe is interpreted as a Meissner-like effect as dark energy causes an outward "expulsion" of space-time similar to a superconductor expelling a magnetic field. The fundamental cause of these effects is interpreted as a type of spontaneous symmetry breaking of a scalar field. There is an associated chemical potential, critical temperature, and an Unruh-Hawking effect associated with the formulation.

Submitted 4 April, 2024; originally announced April 2024.

arXiv:2404.00492 [pdf, other]

Multi-hop Question Answering under Temporal Knowledge Editing

Authors: Keyuan Cheng, Gang Lin, Haoyang Fei, Yuxuan zhai, Lu Yu, Muhammad Asif Ali, Lijie Hu, Di Wang

Abstract: Multi-hop question answering (MQA) under knowledge editing (KE) has garnered significant attention in the era of large language models. However, existing models for MQA under KE exhibit poor performance when dealing with questions containing explicit temporal contexts. To address this limitation, we propose a novel framework, namely TEMPoral knowLEdge augmented Multi-hop Question Answering (TEMPLE-MQA). Unlike previous methods, TEMPLE-MQA first constructs a time-aware graph (TAG) to store edit knowledge in a structured manner. Then, through our proposed inference path, structural retrieval, and joint reasoning stages, TEMPLE-MQA effectively discerns temporal contexts within the question query. Experiments on benchmark datasets demonstrate that TEMPLE-MQA significantly outperforms baseline models. Additionally, we contribute a new dataset, namely TKEMQA, which serves as the inaugural benchmark tailored specifically for MQA with temporal scopes.

Submitted 30 March, 2024; originally announced April 2024.

Comments: 23 pages

arXiv:2404.00489 [pdf, other]

PROMPT-SAW: Leveraging Relation-Aware Graphs for Textual Prompt Compression

Authors: Muhammad Asif Ali, Zhengping Li, Shu Yang, Keyuan Cheng, Yang Cao, Tianhao Huang, Lijie Hu, Lu Yu, Di Wang

Abstract: Large language models (LLMs) have shown exceptional abilities for multiple different natural language processing tasks. While prompting is a crucial tool for LLM inference, we observe that there is a significant cost associated with exceedingly lengthy prompts. Existing attempts to compress lengthy prompts lead to sub-standard results in terms of readability and interpretability of the compressed prompt, with a detrimental impact on prompt utility. To address this, we propose PROMPT-SAW: Prompt compresSion via Relation AWare graphs, an effective strategy for prompt compression over task-agnostic and task-aware prompts. PROMPT-SAW uses the prompt's textual information to build a graph, later extracts key information elements in the graph to come up with the compressed prompt. We also propose GSM8K-AUG, i.e., an extended version of the existing GSM8k benchmark for task-agnostic prompts in order to provide a comprehensive evaluation platform. Experimental evaluation using benchmark datasets shows that prompts compressed by PROMPT-SAW are not only better in terms of readability, but they also outperform the best-performing baseline models by up to 14.3 and 13.7 respectively for task-aware and task-agnostic settings while compressing the original prompt text by 33.0 and 56.7.

Submitted 30 March, 2024; originally announced April 2024.

arXiv:2404.00486 [pdf, other]

Dialectical Alignment: Resolving the Tension of 3H and Security Threats of LLMs

Authors: Shu Yang, Jiayuan Su, Han Jiang, Mengdi Li, Keyuan Cheng, Muhammad Asif Ali, Lijie Hu, Di Wang

Abstract: With the rise of large language models (LLMs), ensuring they embody the principles of being helpful, honest, and harmless (3H), known as Human Alignment, becomes crucial. While existing alignment methods like RLHF, DPO, etc., effectively fine-tune LLMs to match preferences in the preference dataset, they often lead LLMs to be highly receptive to human input and external evidence, even when this information is poisoned. This leads to a tendency for LLMs to be Adaptive Chameleons when external evidence conflicts with their parametric memory. This exacerbates the risk of LLMs being attacked by external poisoned data, which poses a significant security risk to LLM system applications such as Retrieval-augmented generation (RAG). To address the challenge, we propose a novel framework: Dialectical Alignment (DA), which (1) utilizes AI feedback to identify optimal strategies for LLMs to navigate inter-context conflicts and context-memory conflicts with different external evidence in the context window (i.e., different ratios of poisoned factual contexts); (2) constructs the SFT dataset as well as the preference dataset based on the AI feedback and strategies above; (3) uses the above datasets for LLM alignment to defend against poisoned context attacks while preserving the effectiveness of in-context knowledge editing. Our experiments show that the dialectical alignment model improves poisoned data attack defense by 20 and does not require any additional prompt engineering or prior declaration of "you may be attacked" to the LLMs' context window.

Submitted 30 March, 2024; originally announced April 2024.

arXiv:2403.15979 [pdf, other]

Primordial black holes and secondary gravitational waves from the inflation potential with a tiny bump

Authors: Wei Yang, Yu-Xuan Kang, Arshad Ali, Tao-Tao Sui, Chen-Hao Wu, Ya-Peng Hu

Abstract: This paper explores the generation of primordial black holes (PBHs) and scalar-induced gravitational waves (SIGWs) from the inflation potential with a tiny bump. We propose a Lorentz function that makes a tiny bump characteristic of the inflation model potential. This property makes the scalar field move locally ultra slowly, which not only makes the primordial curvature power spectrum have $\mathcal{O}(10^{-2})$ peaks at a small scale but also satisfies observational constraints of the cosmic microwave background (CMB) on a large scale. Specifically, we calculate the abundances of PBHs for different mass ranges in this model, where PBHs with mass $10^{-12}M_\odot$ can make up almost all dark matter and PBHs with mass $10^{-5}M_\odot$ can explain OGLE ultrashort-timescale microlensing events. Moreover, we find that SIGWs accompanying the PBHs can be tested by the Square Kilometre Array (SKA), TianQin, Taiji, Laser Interferometer Space Antenna (LISA), and DECIGO. As for the parameter set I, the consequent SIGWs can explain the NANOGrav 12.5yrs signal.

Submitted 23 March, 2024; originally announced March 2024.

Comments: 9 pages, 5 figures

arXiv:2403.13514 [pdf, other]

How Gender Interacts with Political Values: A Case Study on Czech BERT Models

Authors: Adnan Al Ali, Jindřich Libovický

Abstract: Neural language models, which reach state-of-the-art results on most natural language processing tasks, are trained on large text corpora that inevitably contain value-burdened content and often capture undesirable biases, which the models reflect. This case study focuses on the political biases of pre-trained encoders in Czech and compares them with a representative value survey. Because Czech is a gendered language, we also measure how the grammatical gender coincides with responses to men and women in the survey. We introduce a novel method for measuring the model's perceived political values. We find that the models do not assign statement probability following value-driven reasoning, and there is no systematic difference between feminine and masculine sentences. We conclude that BERT-sized models do not manifest systematic alignment with political values and that the biases observed in the models are rather due to superficial imitation of training data patterns than systematic value beliefs encoded in the models.

Submitted 20 March, 2024; originally announced March 2024.

Comments: 11 pages, 2 figures; LREC-COLING 2024

arXiv:2403.09987 [pdf, other]

Trusting the Search: Unraveling Human Trust in Health Information from Google and ChatGPT

Authors: Xin Sun, Rongjun Ma, Xiaochang Zhao, Zhuying Li, Janne Lindqvist, Abdallah El Ali, Jos A. Bosch

Abstract: People increasingly rely on online sources for health information seeking due to their convenience and timeliness, traditionally using search engines like Google as the primary search agent. Recently, the emergence of generative Artificial Intelligence (AI) has made Large Language Model (LLM) powered conversational agents such as ChatGPT a viable alternative for health information search. However, while trust is crucial for adopting the online health advice, the factors influencing people's trust judgments in health information provided by LLM-powered conversational agents remain unclear. To address this, we conducted a mixed-methods, within-subjects lab study (N=21) to explore how interactions with different agents (ChatGPT vs. Google) across three health search tasks influence participants' trust judgments of the search results as well as the search agents themselves. Our key findings showed that: (a) participants' trust levels in ChatGPT were significantly higher than Google in the context of health information seeking; (b) there is a significant correlation between trust in health-related information and trust in the search agent, however only for Google; (c) the type of search tasks did not affect participants' perceived trust; and (d) participants' prior knowledge, the style of information presentation, and the interactive manner of using search agents were key determinants of trust in the health-related information. Our study taps into differences in trust perceptions when using traditional search engines compared to LLM-powered conversational agents. We highlight the potential role LLMs play in health-related information-seeking contexts, where they excel as stepping stones for further search. We contribute key factors and considerations for ensuring effective and reliable personal health information seeking in the age of generative AI.

Submitted 14 March, 2024; originally announced March 2024.

Comments: 24 pages

ACM Class: F.2.2, I.2.7

arXiv:2403.08728 [pdf, other]

Ambient Diffusion Posterior Sampling: Solving Inverse Problems with Diffusion Models trained on Corrupted Data

Authors: Asad Aali, Giannis Daras, Brett Levac, Sidharth Kumar, Alexandros G. Dimakis, Jonathan I. Tamir

Abstract: We provide a framework for solving inverse problems with diffusion models learned from linearly corrupted data. Our method, Ambient Diffusion Posterior Sampling (A-DPS), leverages a generative model pre-trained on one type of corruption (e.g. image inpainting) to perform posterior sampling conditioned on measurements from a potentially different forward process (e.g. image blurring). We test the efficacy of our approach on standard natural image datasets (CelebA, FFHQ, and AFHQ) and we show that A-DPS can sometimes outperform models trained on clean data for several image restoration tasks in both speed and performance. We further extend the Ambient Diffusion framework to train MRI models with access only to Fourier subsampled multi-coil MRI measurements at various acceleration factors (R=2, 4, 6, 8). We again observe that models trained on highly subsampled data are better priors for solving inverse problems in the high acceleration regime than models trained on fully sampled data. We open-source our code and the trained Ambient Diffusion MRI models: https://github.com/utcsilab/ambient-diffusion-mri .

Submitted 13 March, 2024; originally announced March 2024.

Comments: Pre-print, work in progress

arXiv:2403.08619 [pdf, other]

Measurements of the charge ratio and polarization of cosmic-ray muons with the Super-Kamiokande detector

Authors: H. Kitagawa, T. Tada, K. Abe, C. Bronner, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, S. Miki, S. Mine, M. Miura, S. Moriyama, Y. Nakano, M. Nakahata, S. Nakayama, Y. Noguchi, K. Okamoto, K. Sato, H. Sekiya, et al. (231 additional authors not shown)

Abstract: We present the results of the charge ratio ($R$) and polarization ($P^μ_{0}$) measurements using the decay electron events collected from 2008 September to 2022 June by the Super-Kamiokande detector. Because of its underground location and long operation, we performed high precision measurements by accumulating cosmic-ray muons. We measured the muon charge ratio to be $R=1.32 \pm 0.02$ $(\mathrm{stat.}{+}\mathrm{syst.})$ at $E_μ\cos θ_{\mathrm{Zenith}}=0.7^{+0.3}_{-0.2}$ $\mathrm{TeV}$, where $E_μ$ is the muon energy and $θ_{\mathrm{Zenith}}$ is the zenith angle of incoming cosmic-ray muons. This result is consistent with the Honda flux model while this suggests a tension with the $πK$ model of $1.9σ$. We also measured the muon polarization at the production location to be $P^μ_{0}=0.52 \pm 0.02$ $(\mathrm{stat.}{+}\mathrm{syst.})$ at the muon momentum of $0.9^{+0.6}_{-0.1}$ $\mathrm{TeV}/c$ at the surface of the mountain; this also suggests a tension with the Honda flux model of $1.5σ$. This is the most precise measurement ever to experimentally determine the cosmic-ray muon polarization near $1~\mathrm{TeV}/c$. These measurement results are useful to improve the atmospheric neutrino simulations.

Submitted 13 March, 2024; originally announced March 2024.

Comments: 29 pages, 45 figures

arXiv:2403.08363 [pdf, other]

doi 10.1145/3613904.3642425

ShareYourReality: Investigating Haptic Feedback and Agency in Virtual Avatar Co-embodiment

Authors: Karthikeya Puttur Venkatraj, Wo Meijer, Monica Perusquía-Hernández, Gijs Huisman, Abdallah El Ali

Abstract: Virtual co-embodiment enables two users to share a single avatar in Virtual Reality (VR). During such experiences, the illusion of shared motion control can break during joint-action activities, highlighting the need for position-aware feedback mechanisms. Drawing on the perceptual crossing paradigm, we explore how haptics can enable non-verbal coordination between co-embodied participants. In a within-subjects study (20 participant pairs), we examined the effects of vibrotactile haptic feedback (None, Present) and avatar control distribution (25-75%, 50-50%, 75-25%) across two VR reaching tasks (Targeted, Free-choice) on participants' Sense of Agency (SoA), co-presence, body ownership, and motion synchrony. We found (a) lower SoA in the free-choice task with haptics than without, (b) higher SoA during the shared targeted task, (c) co-presence and body ownership were significantly higher in the free-choice task, (d) players' hand motions synchronized more in the targeted task. We provide cautionary considerations when including haptic feedback mechanisms for avatar co-embodiment experiences.

Submitted 13 March, 2024; originally announced March 2024.

Comments: Accepted to CHI 2024

ACM Class: H.5.m

arXiv:2403.07833 [pdf, other]

A Science4Peace initiative: Alleviating the consequences of sanctions in international scientific cooperation

Authors: A. Ali, M. Barone, S. Brentjes, D. Britzger, M. Dittmar, T. Ekelöf, J. Ellis, S. Fonseca de Souza, A. Glazov, A. V. Gritsan, R. Hoffmann, H. Jung, M. Klein, V. Klyukhin, V. Korbel, P. Kokkas, P. Kostka, U. Langenegger, J. List, N. Raicevic, A. Rostovtsev, A. Sabio Vera, M. Spiro, G. Tonelli, P. van Mechelen, et al. (1 additional author not shown)

Abstract: The armed invasion of Ukraine by the Russian Federation has adversely affected the relations between Russia and Western countries. Among other aspects, it has put scientific cooperation and collaboration into question and changed the scientific landscape significantly. Cooperation between some Western institutions and their Russian and Belarusian partners was put on hold after February 24, 2022. The CERN Council decided at its meeting in December 2023 to terminate cooperation agreements with Russia and Belarus that date back a decade. CERN is an international institution with UN observer status, and has so far played a role in international cooperation which was independent of national political strategies. We argue that the Science4Peace idea still has great value and that scientific collaboration between scientists must continue, since fundamental science is by its nature an international discipline. A ban on scientists participating in international cooperation and collaboration is against the traditions, requirements and understanding of science. We call for measures to reactivate the peaceful cooperation of individual scientists on fundamental research in order to stimulate international cooperation for a more peaceful world in the future. Specifically, we plead for finding ways to continue this cooperation through international organizations, such as CERN and JINR.

Submitted 12 March, 2024; originally announced March 2024.

Showing 1–50 of 1,078 results for author: Ali, A