Search | arXiv e-print repository

Understanding differences in applying DETR to natural and medical images

Authors: Yanqi Xu, Yiqiu Shen, Carlos Fernandez-Granda, Laura Heacock, Krzysztof J. Geras

Abstract: Transformer-based detectors have shown success in computer vision tasks with natural images. These models, exemplified by the Deformable DETR, are optimized through complex engineering strategies tailored to the typical characteristics of natural scenes. However, medical imaging data presents unique challenges such as extremely large image sizes, fewer and smaller regions of interest, and object classes which can be differentiated only through subtle differences. This study evaluates the applicability of these transformer-based design choices when applied to a screening mammography dataset that represents these distinct medical imaging data characteristics. Our analysis reveals that common design choices from the natural image domain, such as complex encoder architectures, multi-scale feature fusion, query initialization, and iterative bounding box refinement, do not improve and sometimes even impair object detection performance in medical imaging. In contrast, simpler and shallower architectures often achieve equal or superior results. This finding suggests that the adaptation of transformer models for medical imaging data requires a reevaluation of standard practices, potentially leading to more efficient and specialized frameworks for medical diagnosis.

Submitted 27 May, 2024; originally announced May 2024.

arXiv:2405.17562 [pdf, other]

Angular fractals in thermal QFT

Authors: Nathan Benjamin, Jaeha Lee, Sridip Pal, David Simmons-Duffin, Yixin Xu

Abstract: We show that thermal effective field theory controls the long-distance expansion of the partition function of a $d$-dimensional QFT, with an insertion of any finite-order spatial isometry. Consequently, the thermal partition function on a sphere displays a fractal-like structure as a function of angular twist, reminiscent of the behavior of a modular form near the real line. As an example application, we find that for CFTs, the effective free energy of even-spin minus odd-spin operators at high temperature is smaller than the usual free energy by a factor of $1/2^d$. Near certain rational angles, the partition function receives subleading contributions from "Kaluza-Klein vortex defects" in the thermal EFT, which we classify. We illustrate our results with examples in free and holographic theories, and also discuss nonperturbative corrections from worldline instantons.

Submitted 27 May, 2024; originally announced May 2024.

Comments: 45 pages+ appendices, 7 figures

Report number: CALT-TH 2024-021

arXiv:2405.17535 [pdf, other]

Calibrated Dataset Condensation for Faster Hyperparameter Search

Authors: Mucong Ding, Yuancheng Xu, Tahseen Rabbani, Xiaoyu Liu, Brian Gravelle, Teresa Ranadive, Tai-Ching Tuan, Furong Huang

Abstract: Dataset condensation can be used to reduce the computational cost of training multiple models on a large dataset by condensing the training dataset into a small synthetic set. State-of-the-art approaches rely on matching the model gradients between the real and synthetic data. However, there is no theoretical guarantee of the generalizability of the condensed data: data condensation often generalizes poorly across hyperparameters/architectures in practice. This paper considers a different condensation objective specifically geared toward hyperparameter search. We aim to generate a synthetic validation dataset so that the validation-performance rankings of the models, with different hyperparameters, on the condensed and original datasets are comparable. We propose a novel hyperparameter-calibrated dataset condensation (HCDC) algorithm, which obtains the synthetic validation dataset by matching the hyperparameter gradients computed via implicit differentiation and efficient inverse Hessian approximation. Experiments demonstrate that the proposed framework effectively maintains the validation-performance rankings of models and speeds up hyperparameter/architecture search for tasks on both images and graphs.

Submitted 27 May, 2024; originally announced May 2024.
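
Note: the abstract above states the goal of keeping validation-performance rankings comparable between the condensed and original datasets. One common way to quantify such agreement is a rank correlation. The sketch below computes Kendall's tau (the tau-a variant, ties counted as neither concordant nor discordant) in plain Python. It is only an illustration of the idea of ranking agreement, not the HCDC algorithm and not necessarily the metric the authors report; the function name `kendall_tau` and all accuracy values are hypothetical.

```python
from itertools import combinations


def kendall_tau(scores_a, scores_b):
    """Kendall's tau-a between two equally long score lists.

    Returns a value in [-1, 1]; 1 means both lists rank the items identically.
    """
    if len(scores_a) != len(scores_b) or len(scores_a) < 2:
        raise ValueError("need two lists of equal length >= 2")
    concordant = 0
    discordant = 0
    for i, j in combinations(range(len(scores_a)), 2):
        # The sign of the product tells whether the pair is ordered the same way.
        sign = (scores_a[i] - scores_a[j]) * (scores_b[i] - scores_b[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    n_pairs = len(scores_a) * (len(scores_a) - 1) // 2
    return (concordant - discordant) / n_pairs


# Hypothetical validation accuracies of four hyperparameter settings.
full_data_acc = [0.91, 0.88, 0.93, 0.85]   # evaluated on the original validation set
condensed_acc = [0.70, 0.66, 0.74, 0.69]   # evaluated on a synthetic validation set
tau = kendall_tau(full_data_acc, condensed_acc)
```

A tau close to 1 would indicate that a hyperparameter search run on the small synthetic set picks configurations in nearly the same order as a search on the full data.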

arXiv:2405.17414 [pdf, other]

Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control

Authors: Zhengfei Kuang, Shengqu Cai, Hao He, Yinghao Xu, Hongsheng Li, Leonidas Guibas, Gordon Wetzstein

Abstract: Research on video generation has recently made tremendous progress, enabling high-quality videos to be generated from text prompts or images. Adding control to the video generation process is an important goal moving forward and recent approaches that condition video generation models on camera trajectories make strides towards it. Yet, it remains challenging to generate a video of the same scene from multiple different camera trajectories. Solutions to this multi-video generation problem could enable large-scale 3D scene generation with editable camera trajectories, among other applications. We introduce collaborative video diffusion (CVD) as an important step towards this vision. The CVD framework includes a novel cross-video synchronization module that promotes consistency between corresponding frames of the same video rendered from different camera poses using an epipolar attention mechanism. Trained on top of a state-of-the-art camera-control module for video generation, CVD generates multiple videos rendered from different camera trajectories with significantly better consistency than baselines, as shown in extensive experiments. Project page: https://collaborativevideodiffusion.github.io/.

Submitted 27 May, 2024; originally announced May 2024.

arXiv:2405.17279 [pdf]

Socially-Aware Shared Control Navigation for Assistive Mobile Robots in the Built Environment

Authors: Yifan Xu, Qianwei Wang, Vineet Kamat, Carol Menassa

Abstract: As the number of Persons with Disabilities (PWD), particularly those with one or more physical impairments, increases, there is an increasing demand for assistive robotic technologies that can support independent mobility in the built environment and reduce the burden on caregivers. Current assistive mobility platforms (e.g., robotic wheelchairs) often fail to incorporate user preferences and control, leading to reduced trust and efficiency. Existing shared control algorithms do not allow the incorporation of the user control preferences inside the navigation framework or the path planning algorithm. In addition, existing dynamic local planner algorithms for robotic wheelchairs do not take into account the social spaces of people, potentially leading such platforms to infringe upon these areas and cause discomfort. To address these concerns, this work introduces a novel socially-aware shared autonomy-based navigation system for assistive mobile robotic platforms. Our navigation framework comprises a Global Planner and a Local Planner. To implement the Global Planner, the proposed approach introduces a novel User Preference Field (UPF) theory within its global planning framework, explicitly acknowledging user preferences to adeptly navigate away from congested areas. For the Local Planner, we propose a Socially-aware Shared Control-based Model Predictive Control with Dynamic Control Barrier Function (SS-MPC-DCBF) to adjust movements in real-time, integrating user preferences for safer, more autonomous navigation. Evaluation results show that our Global Planner aligns closely with user preferences compared to baselines, and our Local Planner demonstrates enhanced safety and efficiency in dynamic and static scenarios. This integrated approach fosters trust and autonomy, crucial for the acceptance of assistive mobility technologies in the built environment.

Submitted 27 May, 2024; originally announced May 2024.

Comments: 42 pages, 14 figures

arXiv:2405.17031 [pdf, other]

Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning

Authors: Haoxin Lin, Yu-Yan Xu, Yihao Sun, Zhilong Zhang, Yi-Chen Li, Chengxing Jia, Junyin Ye, Jiaji Zhang, Yang Yu

Abstract: Model-based methods in reinforcement learning offer a promising approach to enhance data efficiency by facilitating policy exploration within a dynamics model. However, accurately predicting sequential steps in the dynamics model remains a challenge due to the bootstrapping prediction, which attributes the next state to the prediction of the current state. This leads to accumulated errors during model roll-out. In this paper, we propose the Any-step Dynamics Model (ADM) to mitigate the compounding error by reducing bootstrapping prediction to direct prediction. ADM allows for the use of variable-length plans as inputs for predicting future states without frequent bootstrapping. We design two algorithms, ADMPO-ON and ADMPO-OFF, which apply ADM in online and offline model-based frameworks, respectively. In the online setting, ADMPO-ON demonstrates improved sample efficiency compared to previous state-of-the-art methods. In the offline setting, ADMPO-OFF not only demonstrates superior performance compared to recent state-of-the-art offline approaches but also offers better quantification of model uncertainty using only a single ADM.

Submitted 27 May, 2024; originally announced May 2024.

arXiv:2405.16869 [pdf, other]

Mixture of Modality Knowledge Experts for Robust Multi-modal Knowledge Graph Completion

Authors: Yichi Zhang, Zhuo Chen, Lingbing Guo, Yajing Xu, Binbin Hu, Ziqi Liu, Wen Zhang, Huajun Chen

Abstract: Multi-modal knowledge graph completion (MMKGC) aims to automatically discover new knowledge triples in the given multi-modal knowledge graphs (MMKGs), which is achieved by collaboratively modeling the structural information concealed in massive triples and the multi-modal features of the entities. Existing methods tend to focus on crafting elegant entity-wise multi-modal fusion strategies, yet they overlook the utilization of multi-perspective features concealed within the modalities under diverse relational contexts. To address this issue, we introduce a novel MMKGC framework with Mixture of Modality Knowledge experts (MoMoK for short) to learn adaptive multi-modal embedding under intricate relational contexts. We design relation-guided modality knowledge experts to acquire relation-aware modality embeddings and integrate the predictions from multi-modalities to achieve comprehensive decisions. Additionally, we disentangle the experts by minimizing their mutual information. Experiments on four public MMKG benchmarks demonstrate the outstanding performance of MoMoK under complex scenarios.

Submitted 27 May, 2024; originally announced May 2024.

Comments: Work in progress. Code and data will be released at https://github.com/zjukg/MoMoK

arXiv:2405.16858 [pdf, other]

Estimating Depth of Monocular Panoramic Image with Teacher-Student Model Fusing Equirectangular and Spherical Representations

Authors: Jingguo Liu, Yijun Xu, Shigang Li, Jianfeng Li

Abstract: Disconnectivity and distortion are the two problems which must be coped with when processing 360-degree equirectangular images. In this paper, we propose a method of estimating the depth of a monocular panoramic image with a teacher-student model fusing equirectangular and spherical representations. In contrast with the existing methods fusing an equirectangular representation with a cube map representation or tangent representation, a spherical representation is a better choice because sampling on a sphere is more uniform and can also cope with distortion more effectively. In this processing, a novel spherical convolution kernel computing with sampling points on a sphere is developed to extract features from the spherical representation, and then a Segmentation Feature Fusion (SFF) methodology is utilized to combine the features with ones extracted from the equirectangular representation. In contrast with the existing methods using a teacher-student model to obtain a lighter model of depth estimation, we use a teacher-student model to learn the latent features of depth images. This results in a trained model which estimates the depth map of an equirectangular image using not only the feature maps extracted from an input equirectangular image but also the distilled knowledge learnt from the ground truth of the depth map of a training set. In experiments, the proposed method is tested on several well-known 360 monocular depth estimation benchmark datasets, and outperforms the existing methods on most evaluation indexes.

Submitted 27 May, 2024; originally announced May 2024.

arXiv:2405.16831 [pdf]

Giant anomalous Hall effect and band folding in a Kagome metal with mixed dimensionality

Authors: Erjian Cheng, Kaipu Wang, Simin Nie, Tianping Ying, Zongkai Li, Yiwei Li, Yang Xu, Houke Chen, Ralf Koban, Horst Borrmann, Walter Schnelle, Vicky Hasse, Meixiao Wang, Yulin Chen, Zhongkai Liu, Claudia Felser

Abstract: Magnetic metals with geometric frustration offer a fertile ground for studying novel states of matter with strong quantum fluctuations and unique electromagnetic responses from conduction electrons coupled to spin textures. Recently, TbTi$_3$Bi$_4$ has emerged as such an intriguing platform as it behaves as a quasi-one-dimensional (quasi-1D) Ising magnet with antiferromagnetic orderings at 20.4 K and 3 K, respectively. Magnetic fields along the Tb zigzag-chain direction reveal plateaus at 1/3 and 2/3 of saturated magnetization, respectively. At metamagnetic transition boundaries, a record-high anomalous Hall conductivity of 6.2 $\times$ 10$^5$ $\Omega^{-1}$ cm$^{-1}$ is observed. Within the plateau, noncollinear magnetic texture is suggested. In addition to the characteristic Kagome 2D electronic structure, ARPES unequivocally demonstrates quasi-1D electronic structure from the Tb 5$d$ bands and a quasi-1D hybridization gap in the magnetic state due to band folding with $q$ = (1/3, 0, 0) possibly from the spin-density-wave order along the Tb chain. These findings emphasize the crucial role of mixed dimensionality and the strong coupling between magnetic texture and electronic band structure in regulating physical properties of materials, offering new strategies for designing materials for future spintronics applications.

Submitted 27 May, 2024; originally announced May 2024.

Comments: 22 pages, 4 figures

arXiv:2405.16795 [pdf]

Physics-informed Inverse Design of Multi-bit Programmable Metasurfaces

Authors: Yucheng Xu, Jia-Qi Yang, Kebin Fan, Sheng Wang, Jingbo Wu, Caihong Zhang, De-Chuan Zhan, Willie J. Padilla, Biaobing Jin, Jian Chen, Peiheng Wu

Abstract: Emerging reconfigurable metasurfaces offer various possibilities in programmatically manipulating electromagnetic waves across spatial, spectral, and temporal domains, showcasing great potential for enhancing terahertz applications. However, they are hindered by limited tunability, particularly evident in relatively small phase tuning over 270°, due to the design constraints with time-intensive forward design methodologies. Here, we demonstrate a multi-bit programmable metasurface capable of terahertz beam steering, facilitated by a developed physics-informed inverse design (PIID) approach. Through integrating a modified coupled mode theory (MCMT) into residual neural networks, our PIID algorithm not only significantly increases the design accuracy compared to conventional neural networks but also elucidates the intricate physical relations between the geometry and the modes. Without decreasing the reflection intensity, our method achieves the enhanced phase tuning as large as 300°. Additionally, we experimentally validate the inverse designed programmable beam steering metasurface, which is adaptable across 1-bit, 2-bit, and tri-state coding schemes, yielding a deflection angle up to 68° and broadened steering coverage. Our demonstration provides a promising pathway for rapidly exploring advanced metasurface devices, with potentially great impact on communication and imaging technologies.

Submitted 26 May, 2024; originally announced May 2024.

arXiv:2405.16777 [pdf, other]

Coverage Analysis of Downlink Transmission in Multi-Connectivity Cellular V2X Networks

Authors: Luofang Jiao, Tianqi Zhang, Jiwei Zhao, Yunting Xu, Haibo Zhou

Abstract: With the increasing number of connected vehicles in the fifth-generation mobile communication networks (5G) and beyond 5G (B5G), ensuring reliable and high-speed cellular vehicle-to-everything (C-V2X) communication has posed significant challenges due to the high mobility of vehicles. For improving the network performance and reliability, multi-connectivity technology has emerged as a crucial transmission mode for C-V2X in the 5G era. To this end, this paper proposes a framework for analyzing the performance of multi-connectivity in C-V2X downlink transmission, with a focus on the performance indicators of joint distance distribution and coverage probability. Specifically, we first derive the joint distance distribution of multi-connectivity. By leveraging the tools of stochastic geometry, we then obtain the analytical expressions of coverage probability based on the previous results for general multi-connectivity cases in C-V2X. Subsequently, we evaluate the effect of path loss exponent and downlink base station density on coverage probability based on the proposed analytical framework. Finally, extensive Monte Carlo simulations are conducted to validate the effectiveness of the proposed analytical framework and the simulation results reveal that multi-connectivity technology can significantly enhance the coverage probability in C-V2X.

Submitted 26 May, 2024; originally announced May 2024.

Comments: 6 pages, 5 figures. arXiv admin note: substantial text overlap with arXiv:2404.17823

Journal ref: 2023 International Conference on Wireless Communications and Signal Processing (WCSP). IEEE, 2023: 815-820

arXiv:2405.16420 [pdf, other]

M-RAG: Reinforcing Large Language Model Performance through Retrieval-Augmented Generation with Multiple Partitions

Authors: Zheng Wang, Shu Xian Teo, Jieer Ouyang, Yongjun Xu, Wei Shi

Abstract: Retrieval-Augmented Generation (RAG) enhances Large Language Models (LLMs) by retrieving relevant memories from an external database. However, existing RAG methods typically organize all memories in a whole database, potentially limiting focus on crucial memories and introducing noise. In this paper, we introduce a multiple partition paradigm for RAG (called M-RAG), where each database partition serves as a basic unit for RAG execution. Based on this paradigm, we propose a novel framework that leverages LLMs with Multi-Agent Reinforcement Learning to optimize different language generation tasks explicitly. Through comprehensive experiments conducted on seven datasets, spanning three language generation tasks and involving three distinct language model architectures, we confirm that M-RAG consistently outperforms various baseline methods, achieving improvements of 11%, 8%, and 12% for text summarization, machine translation, and dialogue generation, respectively.

Submitted 26 May, 2024; originally announced May 2024.

Comments: This paper has been accepted by ACL 2024

arXiv:2405.16057 [pdf, other]

SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models

Authors: Xudong Lu, Aojun Zhou, Yuhui Xu, Renrui Zhang, Peng Gao, Hongsheng Li

Abstract: Large Language Models (LLMs) have become pivotal in advancing the field of artificial intelligence, yet their immense sizes pose significant challenges for both fine-tuning and deployment. Current post-training pruning methods, while reducing the sizes of LLMs, often fail to maintain their original performance. To address these challenges, this paper introduces SPP, a Sparsity-Preserved Parameter-efficient fine-tuning method. Different from existing post-training pruning approaches that struggle with performance retention, SPP proposes to employ lightweight learnable column and row matrices to optimize sparse LLM weights, keeping the structure and sparsity of pruned pre-trained models intact. By element-wise multiplication and residual addition, SPP ensures the consistency of model sparsity pattern and ratio during both training and weight-merging processes. We demonstrate the effectiveness of SPP by applying it to the LLaMA and LLaMA-2 model families with recent post-training pruning methods. Our results show that SPP significantly enhances the performance of models with different sparsity patterns (i.e. unstructured and N:M sparsity), especially for those with high sparsity ratios (e.g. 75%), making it a promising solution for the efficient fine-tuning of sparse LLMs. Code will be made available at https://github.com/Lucky-Lance/SPP.

Submitted 25 May, 2024; originally announced May 2024.
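
Note: the abstract above says that element-wise multiplication plus residual addition keeps the sparsity pattern of pruned weights intact. The plain-Python sketch below illustrates why that can hold, under an assumed update form `W_merged = W + W * (col @ row)`, where `*` is element-wise. This exact form, the function names, and all numbers are assumptions made for illustration and are not taken from the SPP paper or its code. Because every update term is multiplied by the original entry, any entry that was pruned to zero stays zero after merging.

```python
def matmul(a, b):
    """Multiply an (m x r) matrix by an (r x n) matrix given as nested lists."""
    inner = len(b)
    return [
        [sum(a[i][k] * b[k][j] for k in range(inner)) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def merge_sparse_update(weights, col, row):
    """Assumed merge rule: W + W * (col @ row), with * taken element-wise.

    `col` is (m x r) and `row` is (r x n); both stand in for small learnable
    matrices. Zero entries of `weights` remain zero in the result.
    """
    scale = matmul(col, row)
    return [
        [w + w * s for w, s in zip(w_row, s_row)]
        for w_row, s_row in zip(weights, scale)
    ]


# Hypothetical pruned 2 x 3 weight matrix (zeros are pruned entries).
pruned = [[0.0, 2.0, 0.0],
          [1.0, 0.0, -3.0]]
col = [[0.5], [-0.25]]        # (2 x 1), hypothetical learned values
row = [[1.0, 2.0, 3.0]]       # (1 x 3), hypothetical learned values
merged = merge_sparse_update(pruned, col, row)

# Every position that was zero before merging is still zero afterwards.
zeros_kept = all(
    m == 0.0
    for p_row, m_row in zip(pruned, merged)
    for p, m in zip(p_row, m_row)
    if p == 0.0
)
```

Only `col` and `row` would be trained in such a scheme, so the number of trainable values grows with `m + n` rather than `m * n` for rank 1.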

arXiv:2405.15517 [pdf, other]

Erase to Enhance: Data-Efficient Machine Unlearning in MRI Reconstruction

Authors: Yuyang Xue, Jingshuai Liu, Steven McDonagh, Sotirios A. Tsaftaris

Abstract: Machine unlearning is a promising paradigm for removing unwanted data samples from a trained model, towards ensuring compliance with privacy regulations and limiting harmful biases. Although unlearning has been shown in, e.g., classification and recommendation systems, its potential in medical image-to-image translation, specifically in image reconstruction, has not been thoroughly investigated. This paper shows that machine unlearning is possible in MRI tasks and has the potential to benefit bias removal. We set up a protocol to study how much shared knowledge exists between datasets of different organs, allowing us to effectively quantify the effect of unlearning. Our study reveals that combining training data can lead to hallucinations and reduced image quality in the reconstructed data. We use unlearning to remove hallucinations as a proxy exemplar of undesired data removal. Indeed, we show that machine unlearning is possible without full retraining. Furthermore, our observations indicate that maintaining high performance is feasible even when using only a subset of retain data. We have made our code publicly accessible.

Submitted 18 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

Comments: The paper is accepted by MIDL 2024

arXiv:2405.15344 [pdf, other]

Adaptive Finite Element Method for a Nonlinear Helmholtz Equation with High Wave Number

Authors: Run Jiang, Haijun Wu, Yifeng Xu, Jun Zou

Abstract: A nonlinear Helmholtz (NLH) equation with high frequencies and corner singularities is discretized by the linear finite element method (FEM). After deriving some wave-number-explicit stability estimates and the singularity decomposition for the NLH problem, a priori stability and error estimates are established for the FEM on shape regular meshes including the case of locally refined meshes. Then a posteriori upper and lower bounds using a new residual-type error estimator, which is equivalent to the standard one, are derived for the FE solutions to the NLH problem. These a posteriori estimates have confirmed a significant fact that is also valid for the NLH problem, namely the residual-type estimator seriously underestimates the error of the FE solution in the preasymptotic regime, which was first observed by Babuška et al. [Int J Numer Methods Eng 40 (1997)] for a one-dimensional linear problem. Based on the new a posteriori error estimator, both the convergence and the quasi-optimality of the resulting adaptive finite element algorithm are proved for the first time for the NLH problem, when the initial mesh size lies in the preasymptotic regime. Finally, numerical examples are presented to validate the theoretical findings and demonstrate that applying the continuous interior penalty (CIP) technique with appropriate penalty parameters can reduce the pollution errors efficiently. In particular, the nonlinear phenomenon of optical bistability with Gaussian incident waves is successfully simulated by the adaptive CIPFEM.

Submitted 27 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

arXiv:2405.15280 [pdf, other]

DFGNN: Dual-frequency Graph Neural Network for Sign-aware Feedback

Authors: Yiqing Wu, Ruobing Xie, Zhao Zhang, Xu Zhang, Fuzhen Zhuang, Leyu Lin, Zhanhui Kang, Yongjun Xu

Abstract: The graph-based recommendation has achieved great success in recent years. However, most existing graph-based recommendations focus on capturing user preference based on positive edges/feedback, while ignoring negative edges/feedback (e.g., dislike, low rating) that widely exist in real-world recommender systems. How to utilize negative feedback in graph-based recommendations still remains underexplored. In this study, we first conducted a comprehensive experimental analysis and found that (1) existing graph neural networks are not well-suited for modeling negative feedback, which acts as a high-frequency signal in a user-item graph. (2) The graph-based recommendation suffers from the representation degeneration problem. Based on the two observations, we propose a novel model that models positive and negative feedback from a frequency filter perspective called Dual-frequency Graph Neural Network for Sign-aware Recommendation (DFGNN). Specifically, in DFGNN, the designed dual-frequency graph filter (DGF) captures both low-frequency and high-frequency signals that contain positive and negative feedback. Furthermore, the proposed signed graph regularization is applied to maintain the user/item embedding uniform in the embedding space to alleviate the representation degeneration problem. Additionally, we conduct extensive experiments on real-world datasets and demonstrate the effectiveness of the proposed model. Codes of our model will be released upon acceptance.

Submitted 24 May, 2024; originally announced May 2024.

Comments: Accepted by KDD 2024 Research Track

arXiv:2405.15277 [pdf]

doi 10.1021/acs.jpcc.3c05692

Inducing ferroelectricity in NH$_4$I and NH$_4$Br via partial replacement of protons by deuterons

Authors: Miao Miao Zhao, Lei Meng, Yi Yang Xu, Na Du, Fei Yen

Abstract: While all of the polymorphs of NH$_4$I and NH$_4$Br are non-polar, a reversible electric polarization is established in the ordered $γ$ phases of (NH$_4$)$_{0.73}$(ND$_4$)$_{0.27}$I and (NH$_4$)$_{0.84}$(ND$_4$)$_{0.16}$Br (where D is $^2$H) via $dc$ electric fields. The presence of two groups of orbital magnetic moments appears to be responsible for the asymmetric lattice distortions. Our findings provide an alternative pathway for hydrogen-based materials to potentially add a ferroelectric functionality.

Submitted 24 May, 2024; originally announced May 2024.

Comments: 14 pages, 3 figures

Journal ref: J. Phys. Chem. C 127, 20951-20955 (2023)

arXiv:2405.15261 [pdf]

doi 10.1016/j.jallcom.2023.170685

Electric Polarization and Magnetic Properties of (NH$_4$)$_{1-x}$K$_x$I (x = 0.05-0.17)

Authors: Yi Yang Xu, Lei Meng, Miao Miao Zhao, Chu Xin Peng, Fei Yen

Abstract: While all of the polymorphs of pure NH$_4$I and KI are non-polar, we identify that (NH$_4$)$_{0.95}$K$_{0.05}$I is ferroelectric and (NH$_4$)$_{0.87}$K$_{0.13}$I and (NH$_4$)$_{0.83}$K$_{0.17}$I are pyroelectric through measurements of their pyroelectric current and complex dielectric constant. The order to disorder phase transitions occur near 245 K. Magnetic susceptibility measurements indicate that the proton orbitals of the NH$_4$$^+$ continue to become ordered in the ground state in the (NH$_4$)$_{1-x}$K$_x$I system up to x <= 0.17. The polar phases are proposed to stem from K$^+$ ions disrupting the symmetry of proton-orbital-lattice interactions between the NH$_4$$^+$ and I$^-$ ions. Our work introduces a new pathway for the ordered phases of ammonium-based compounds to potentially become ferroelectric.

Submitted 24 May, 2024; originally announced May 2024.

Comments: 13 pages, 5 figures

Journal ref: Journal of Alloys and Compounds 960, 170685 (2023)

arXiv:2405.14900 [pdf, other]

doi 10.1016/j.media.2024.103206

Fair Evaluation of Federated Learning Algorithms for Automated Breast Density Classification: The Results of the 2022 ACR-NCI-NVIDIA Federated Learning Challenge

Authors: Kendall Schmidt, Benjamin Bearce, Ken Chang, Laura Coombs, Keyvan Farahani, Marawan Elbatele, Kaouther Mouhebe, Robert Marti, Ruipeng Zhang, Yao Zhang, Yanfeng Wang, Yaojun Hu, Haochao Ying, Yuyang Xu, Conrad Testagrose, Mutlu Demirer, Vikash Gupta, Ünal Akünal, Markus Bujotzek, Klaus H. Maier-Hein, Yi Qin, Xiaomeng Li, Jayashree Kalpathy-Cramer, Holger R. Roth

Abstract: The correct interpretation of breast density is important in the assessment of breast cancer risk. AI has been shown capable of accurately predicting breast density, however, due to the differences in imaging characteristics across mammography systems, models built using data from one system do not generalize well to other systems. Though federated learning (FL) has emerged as a way to improve the generalizability of AI without the need to share data, the best way to preserve features from all training data during FL is an active area of research. To explore FL methodology, the breast density classification FL challenge was hosted in partnership with the American College of Radiology, Harvard Medical School's Mass General Brigham, University of Colorado, NVIDIA, and the National Institutes of Health National Cancer Institute. Challenge participants were able to submit docker containers capable of implementing FL on three simulated medical facilities, each containing a unique large mammography dataset. The breast density FL challenge ran from June 15 to September 5, 2022, attracting seven finalists from around the world. The winning FL submission reached a linear kappa score of 0.653 on the challenge test data and 0.413 on an external testing dataset, scoring comparably to a model trained on the same data in a central location.

Submitted 22 May, 2024; originally announced May 2024.

Comments: 16 pages, 9 figures

Journal ref: Medical Image Analysis Volume 95, July 2024, 103206

arXiv:2405.14854 [pdf, other]

TerDiT: Ternary Diffusion Models with Transformers

Authors: Xudong Lu, Aojun Zhou, Ziyi Lin, Qi Liu, Yuhui Xu, Renrui Zhang, Yafei Wen, Shuai Ren, Peng Gao, Junchi Yan, Hongsheng Li

Abstract: Recent developments in large-scale pre-trained text-to-image diffusion models have significantly improved the generation of high-fidelity images, particularly with the emergence of diffusion models based on transformer architecture (DiTs). Among these diffusion models, diffusion transformers have demonstrated superior image generation capabilities, boosting lower FID scores and higher scalability. However, deploying large-scale DiT models can be expensive due to their extensive parameter numbers. Although existing research has explored efficient deployment techniques for diffusion models such as model quantization, there is still little work concerning DiT-based models. To tackle this research gap, in this paper, we propose TerDiT, a quantization-aware training (QAT) and efficient deployment scheme for ternary diffusion models with transformers. We focus on the ternarization of DiT networks and scale model sizes from 600M to 4.2B. Our work contributes to the exploration of efficient deployment strategies for large-scale DiT models, demonstrating the feasibility of training extremely low-bit diffusion transformer models from scratch while maintaining competitive image generation capacities compared to full-precision models. Code will be available at https://github.com/Lucky-Lance/TerDiT.

Submitted 23 May, 2024; originally announced May 2024.

Comments: 18 pages, 13 figures

arXiv:2405.14770 [pdf, other]

Physics-informed Score-based Diffusion Model for Limited-angle Reconstruction of Cardiac Computed Tomography

Authors: Shuo Han, Yongshun Xu, Dayang Wang, Bahareh Morovati, Li Zhou, Jonathan S. Maltz, Ge Wang, Hengyong Yu

Abstract: Cardiac computed tomography (CT) has emerged as a major imaging modality for the diagnosis and monitoring of cardiovascular diseases. High temporal resolution is essential to ensure diagnostic accuracy. Limited-angle data acquisition can reduce scan time and improve temporal resolution, but typically leads to severe image degradation and motivates for improved reconstruction techniques. In this paper, we propose a novel physics-informed score-based diffusion model (PSDM) for limited-angle reconstruction of cardiac CT. At the sampling time, we combine a data prior from a diffusion model and a model prior obtained via an iterative algorithm and Fourier fusion to further enhance the image quality. Specifically, our approach integrates the primal-dual hybrid gradient (PDHG) algorithm with score-based diffusion models, thereby enabling us to reconstruct high-quality cardiac CT images from limited-angle data. The numerical simulations and real data experiments confirm the effectiveness of our proposed approach.

Submitted 23 May, 2024; originally announced May 2024.

Comments: 12 pages

arXiv:2405.14539 [pdf, other]

VINS-Multi: A Robust Asynchronous Multi-camera-IMU State Estimator

Authors: Luqi Wang, Yang Xu, Shaojie Shen

Abstract: State estimation is a critical foundational module in robotics applications, where robustness and performance are paramount. Although in recent years, many works have been focusing on improving one of the most widely adopted state estimation methods, visual inertial odometry (VIO), by incorporating multiple cameras, these efforts predominantly address synchronous camera systems. Asynchronous cameras, which offer simpler hardware configurations and enhanced resilience, have been largely overlooked. To fill this gap, this paper presents VINS-Multi, a novel multi-camera-IMU state estimator for asynchronous cameras. The estimator comprises parallel front ends, a front end coordinator, and a back end optimization module capable of handling asynchronous input frames. It utilizes the frames effectively through a dynamic feature number allocation and a frame priority coordination strategy. The proposed estimator is integrated into a customized quadrotor platform and tested in multiple realistic and challenging scenarios to validate its practicality. Additionally, comprehensive benchmark results are provided to showcase the robustness and superior performance of the proposed estimator.

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2405.14486 [pdf, other]

RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language Models

Authors: Xiangkun Hu, Dongyu Ru, Lin Qiu, Qipeng Guo, Tianhang Zhang, Yang Xu, Yun Luo, Pengfei Liu, Yue Zhang, Zheng Zhang

Abstract: Large Language Models (LLMs) have shown impressive capabilities but also a concerning tendency to hallucinate. This paper presents RefChecker, a framework that introduces claim-triplets to represent claims in LLM responses, aiming to detect fine-grained hallucinations. In RefChecker, an extractor generates claim-triplets from a response, which are then evaluated by a checker against a reference. We delineate three task settings: Zero, Noisy and Accurate Context, to reflect various real-world use cases. We curated a benchmark spanning various NLP tasks and annotated 11k claim-triplets from 2.1k responses by seven LLMs. RefChecker supports both proprietary and open-source models as the extractor and checker. Experiments demonstrate that claim-triplets enable superior hallucination detection, compared to other granularities such as response, sentence and sub-sentence level claims. RefChecker outperforms prior methods by 6.8 to 26.1 points on our benchmark and the checking results of RefChecker are strongly aligned with human judgments. This work is open sourced at https://github.com/amazon-science/RefChecker

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2405.14398 [pdf, other]

SpGesture: Source-Free Domain-adaptive sEMG-based Gesture Recognition with Jaccard Attentive Spiking Neural Network

Authors: Weiyu Guo, Ying Sun, Yijie Xu, Ziyue Qiao, Yongkui Yang, Hui Xiong

Abstract: Surface electromyography (sEMG) based gesture recognition offers a natural and intuitive interaction modality for wearable devices. Despite significant advancements in sEMG-based gesture-recognition models, existing methods often suffer from high computational latency and increased energy consumption. Additionally, the inherent instability of sEMG signals, combined with their sensitivity to distribution shifts in real-world settings, compromises model robustness. To tackle these challenges, we propose a novel SpGesture framework based on Spiking Neural Networks, which possesses several unique merits compared with existing methods: (1) Robustness: By utilizing membrane potential as a memory list, we pioneer the introduction of Source-Free Domain Adaptation into SNN for the first time. This enables SpGesture to mitigate the accuracy degradation caused by distribution shifts. (2) High Accuracy: With a novel Spiking Jaccard Attention, SpGesture enhances the SNNs' ability to represent sEMG features, leading to a notable rise in system accuracy. To validate SpGesture's performance, we collected a new sEMG gesture dataset which has different forearm postures, where SpGesture achieved the highest accuracy among the baselines ($89.26\%$). Moreover, the actual deployment on the CPU demonstrated a system latency below 100ms, well within real-time requirements. This impressive performance showcases SpGesture's potential to enhance the applicability of sEMG in real-world scenarios. The code is available at https://anonymous.4open.science/r/SpGesture.

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2405.14201 [pdf, other]

FreeTuner: Any Subject in Any Style with Training-free Diffusion

Authors: Youcan Xu, Zhen Wang, Jun Xiao, Wei Liu, Long Chen

Abstract: With the advance of diffusion models, various personalized image generation methods have been proposed. However, almost all existing work only focuses on either subject-driven or style-driven personalization. Meanwhile, state-of-the-art methods face several challenges in realizing compositional personalization, i.e., composing different subject and style concepts, such as concept disentanglement, unified reconstruction paradigm, and insufficient training data. To address these issues, we introduce FreeTuner, a flexible and training-free method for compositional personalization that can generate any user-provided subject in any user-provided style (see Figure 1). Our approach employs a disentanglement strategy that separates the generation process into two stages to effectively mitigate concept entanglement. FreeTuner leverages the intermediate features within the diffusion model for subject concept representation and introduces style guidance to align the synthesized images with the style concept, ensuring the preservation of both the subject's structure and the style's aesthetic features. Extensive experiments have demonstrated the generation ability of FreeTuner across various personalization settings.

Submitted 26 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

arXiv:2405.13715 [pdf, other]

Traffic Scenario Logic: A Spatial-Temporal Logic for Modeling and Reasoning of Urban Traffic Scenarios

Authors: Ruolin Wang, Yuejiao Xu, Jianmin Ji

Abstract: Formal representations of traffic scenarios can be used to generate test cases for the safety verification of autonomous driving. However, most existing methods are limited in highway or highly simplified intersection scenarios due to the intricacy and diversity of traffic scenarios. In response, we propose Traffic Scenario Logic (TSL), which is a spatial-temporal logic designed for modeling and reasoning of urban pedestrian-free traffic scenarios. TSL provides a formal representation of the urban road network that can be derived from OpenDRIVE, i.e., the de facto industry standard of high-definition maps for autonomous driving, enabling the representation of a broad range of traffic scenarios. We implemented the reasoning of TSL using Telingo, i.e., a solver for temporal programs based on the Answer Set Programming, and tested it on different urban road layouts. Demonstrations show the effectiveness of TSL in test scenario generation and its potential value in areas like decision-making and control verification of autonomous driving.

Submitted 22 May, 2024; originally announced May 2024.

Comments: Submitted to KR 2024

arXiv:2405.13704 [pdf, other]

Safe and Personalizable Logical Guidance for Trajectory Planning of Autonomous Driving

Authors: Yuejiao Xu, Ruolin Wang, Chengpeng Xu, Jianmin Ji

Abstract: Autonomous vehicles necessitate a delicate balance between safety, efficiency, and user preferences in trajectory planning. Existing traditional or learning-based methods face challenges in adequately addressing all these aspects. In response, this paper proposes a novel component termed the Logical Guidance Layer (LGL), designed for seamless integration into autonomous driving trajectory planning frameworks, specifically tailored for highway scenarios. The LGL guides the trajectory planning with a local target area determined through scenario reasoning, scenario evaluation, and guidance area calculation. Integrating the Responsibility-Sensitive Safety (RSS) model, the LGL ensures formal safety guarantees while accommodating various user preferences defined by logical formulae. Experimental validation demonstrates the effectiveness of the LGL in achieving a balance between safety and efficiency, and meeting user preferences in autonomous highway driving scenarios.

Submitted 22 May, 2024; originally announced May 2024.

Comments: Submitted to ITSC 2024

arXiv:2405.13671 [pdf, other]

Sun-as-a-star observations of obscuration dimmings caused by filament eruptions

Authors: Yu Xu, Hui Tian, Astrid M. Veronig, Karin Dissauer

Abstract: Filament eruptions often lead to coronal mass ejections (CMEs) on the Sun and are one of the most energetic eruptive phenomena in the atmospheres of other late-type stars. However, the detection of filament eruptions and CMEs on stars beyond the solar system is challenging. Here we present six filament eruption cases on the Sun and show that filament material obscuring part of the solar disk can cause detectable dimming signatures in sun-as-a-star flux curves of He II 304 A. Those filament eruptions have similar morphological features, originating from small filaments inside active regions and subsequently strongly expanding to obscure large areas of the solar disk or the bright flare regions. We have tracked the detailed evolution of six obscuration dimmings and estimated the dimming properties, such as dimming depths, dimming areas, and duration. The largest dimming depth among the six events under study is 6.2% accompanied by the largest dimming area of 5.6% of the solar disk area. Other events have maximum dimming depths in a range of around 1% to 3% with maximum areas varying between about 3% to 4% of the solar disk area. The duration of the dimming spans from around 0.4 hours to 7.0 hours for the six events under study. A positive correlation was found between the dimming depth and area, which may help to set constraint on the filament sizes in stellar observations.

Submitted 22 May, 2024; originally announced May 2024.

Comments: 13 pages, 6 figures; accepted by ApJ

arXiv:2405.13668 [pdf, other]

doi 10.1016/j.physletb.2024.138737

Electromagnetic moments of the odd-mass nickel isotopes $^{59-67}$Ni

Authors: P. Müller, S. Kaufmann, T. Miyagi, J. Billowes, M. L. Bissell, K. Blaum, B. Cheal, R. F. Garcia Ruiz, W. Gins, C. Gorges, H. Heylen, A. Kanellakopoulos, S. Malbrunot-Ettenauer, R. Neugart, G. Neyens, W. Nörtershäuser, T. Ratajczyk, L. V. Rodríguez, R. Sánchez, S. Sailer, A. Schwenk, L. Wehner, C. Wraith, L. Xie, Z. Y. Xu, et al. (2 additional authors not shown)

Abstract: The magnetic dipole and the spectroscopic quadrupole moments of the nuclear ground states in the odd-mass nickel isotopes $^{59-67}$Ni have been determined using collinear laser spectroscopy at the CERN-ISOLDE facility. They are compared to ab initio valence-space in-medium similarity renormalization group (VS-IMSRG) calculations including contributions of two-body currents as well as to shell-model calculations. The two-body-current contributions significantly improve the agreement with experimental data, reducing the mean-square deviation from the experimental moments by a factor of 3 to 5, depending on the employed interaction. For all interactions, the largest contributions are obtained for the $5/2^-$ ($7/2^-$) isotopes $^{65}$Ni ($^{55}$Ni), which is ascribed to the high angular momentum of the $f$ orbitals. Our results demonstrate that the inclusion of two-body-current contributions to the magnetic moment in an isotopic chain of complex nuclei can be handled by the VS-IMSRG and can outperform phenomenological shell-model calculations using effective $g$-factors in the nickel region.

Submitted 30 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

Comments: Published in Physics Letters B, 10 pages, 3 figures

Journal ref: Physics Letters B 854 (2024) 138737

arXiv:2405.13369 [pdf, other]

Realization of a crosstalk-free multi-ion node for long-distance quantum networking

Authors: P. -C. Lai, Y. Wang, J. -X. Shi, Z. -B. Cui, Z. -Q. Wang, S. Zhang, P. -Y. Liu, Z. -C. Tian, Y. -D. Sun, X. -Y. Chang, B. -X. Qi, Y. -Y. Huang, Z. -C. Zhou, Y. -K. Wu, Y. Xu, Y. -F. Pu, L. -M. Duan

Abstract: Trapped atomic ions constitute one of the leading physical platforms for building the quantum repeater nodes to realize large-scale quantum networks. In a long-distance trapped-ion quantum network, it is essential to have crosstalk-free dual-type qubits: one type, called the communication qubit, to establish entangling interface with telecom photons; and the other type, called the memory qubit, to store quantum information immune from photon scattering under entangling attempts. Here, we report the first experimental implementation of a telecom-compatible and crosstalk-free quantum network node based on two trapped $^{40}$Ca$^{+}$ ions. The memory qubit is encoded on a long-lived metastable level to avoid crosstalk with the communication qubit encoded in another subspace of the same ion species, and a quantum wavelength conversion module is employed to generate ion-photon entanglement over a $12\,$km fiber in a heralded style. Our work therefore constitutes an important step towards the realization of quantum repeaters and long-distance quantum networks.

Submitted 22 May, 2024; originally announced May 2024.

Comments: 12 pages, 12 figures

arXiv:2405.13315 [pdf, other]

Study of the decays $χ_{cJ}\toΛ\barΛω$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (638 additional authors not shown)

Abstract: Using $(27.12\pm 0.14)\times10^{8}$ $ψ(3686)$ events collected with the BESIII detector, we present the first observation of the decays $χ_{cJ}\toΛ\barΛω$, where $J=0, 1, 2$, with statistical significances of $11.7 σ, 11.2 σ$, and $11.8 σ$. The branching fractions of these decays are determined to be $\mathcal{B}(χ_{c0}\toΛ\barΛω)=({2.37 \pm 0.22 \pm 0.23}) \times 10^{-4}$, $\mathcal{B}(χ_{c1}\toΛ\barΛω)=({1.01 \pm 0.10 \pm 0.11}) \times 10^{-4}$, and $\mathcal{B}(χ_{c2}\toΛ\barΛω)=({1.40 \pm 0.13 \pm 0.17}) \times 10^{-4}$, where the first uncertainties are statistical and the second are systematic. We observe no clear intermediate structures.

Submitted 21 May, 2024; originally announced May 2024.

Comments: 11 pages, 10 figures

arXiv:2405.12872 [pdf, other]

Spatial-aware Attention Generative Adversarial Network for Semi-supervised Anomaly Detection in Medical Image

Authors: Zerui Zhang, Zhichao Sun, Zelong Liu, Bo Du, Rui Yu, Zhou Zhao, Yongchao Xu

Abstract: Medical anomaly detection is a critical research area aimed at recognizing abnormal images to aid in diagnosis. Most existing methods adopt synthetic anomalies and image restoration on normal samples to detect anomaly. The unlabeled data consisting of both normal and abnormal data is not well explored. We introduce a novel Spatial-aware Attention Generative Adversarial Network (SAGAN) for one-class semi-supervised generation of health images. Our core insight is the utilization of position encoding and attention to accurately focus on restoring abnormal regions and preserving normal regions. To fully utilize the unlabelled data, SAGAN relaxes the cyclic consistency requirement of the existing unpaired image-to-image conversion methods, and generates high-quality health images corresponding to unlabeled data, guided by the reconstruction of normal images and restoration of pseudo-anomaly images. Subsequently, the discrepancy between the generated healthy image and the original image is utilized as an anomaly score. Extensive experiments on three medical datasets demonstrate that the proposed SAGAN outperforms the state-of-the-art methods.

Submitted 21 May, 2024; originally announced May 2024.

Comments: Early Accept by MICCAI 2024

arXiv:2405.12838 [pdf, ps, other]

Quantum Non-Identical Mean Estimation: Efficient Algorithms and Fundamental Limits

Authors: Jiachen Hu, Tongyang Li, Xinzhao Wang, Yecheng Xue, Chenyi Zhang, Han Zhong

Abstract: We systematically investigate quantum algorithms and lower bounds for mean estimation given query access to non-identically distributed samples. On the one hand, we give quantum mean estimators with quadratic quantum speed-up given samples from different bounded or sub-Gaussian random variables. On the other hand, we prove that, in general, it is impossible for any quantum algorithm to achieve quadratic speed-up over the number of classical samples needed to estimate the mean $μ$, where the samples come from different random variables with mean close to $μ$. Technically, our quantum algorithms reduce bounded and sub-Gaussian random variables to the Bernoulli case, and use an uncomputation trick to overcome the challenge that direct amplitude estimation does not work with non-identical query access. Our quantum query lower bounds are established by simulating non-identical oracles by parallel oracles, and also by an adversarial method with non-identical oracles. Both results pave the way for proving quantum query lower bounds with non-identical oracles in general, which may be of independent interest.

Submitted 21 May, 2024; originally announced May 2024.

Comments: 31 pages, 0 figure. To appear in the 19th Theory of Quantum Computation, Communication and Cryptography (TQC 2024)

arXiv:2405.12809 [pdf, other]

Precision measurement of the branching fraction of $J/ψ\rightarrow K^+K^-$ via $ψ(2S)\rightarrow π^+π^-J/ψ$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (604 additional authors not shown)

Abstract: Using a sample of $448.1 \times 10^6$ $ψ(2S)$ events collected with the BESIII detector, we perform a study of the decay $J/ψ\rightarrow K^+K^-$ via $ψ(2S)\rightarrow π^+π^-J/ψ$. The branching fraction of $J/ψ\rightarrow K^+K^-$ is determined to be $\mathcal{B}_{K^+K^-}=(3.072\pm 0.023({\rm stat.})\pm 0.050({\rm syst.}))\times 10^{-4}$, which is consistent with previous measurements but with significantly improved precision.

Submitted 21 May, 2024; originally announced May 2024.

Comments: to be submitted to PRD

arXiv:2405.12629 [pdf, ps, other]

A Local Gaussian Process Regression Approach to Frequency Response Function Estimation

Authors: Xiaozhu Fang, Yu Xu, Tianshi Chen

Abstract: Frequency response function (FRF) estimation is a classical subject in system identification. In the past two decades, there have been remarkable advances in developing local methods for this subject, e.g., the local polynomial method, local rational method, and iterative local rational method. Recent work on local methods concentrates on two issues: model order selection and the identification of lightly damped systems. To address these two issues, we propose a new local method called local Gaussian process regression (LGPR). We show that the frequency response function locally is either analytic or resonant, and this prior knowledge can be embedded into a kernel-based regularized estimate through a dot-product kernel plus a resonance kernel induced by a second-order resonant system. The LGPR provides a new route to tackle the aforementioned issues. In the numerical simulations, the LGPR shows the best FRF estimation accuracy compared with the existing local methods, and moreover, the LGPR is more robust with respect to sample size and noise level.

Submitted 21 May, 2024; originally announced May 2024.

Comments: the IFAC Symposium on System Identification, Boston, USA, July 17-18, 2024

arXiv:2405.12575 [pdf, other]

Three-dimensional mapping and electronic origin of large altermagnetic splitting near Fermi level in CrSb

Authors: Guowei Yang, Zhanghuan Li, Sai Yang, Jiyuan Li, Hao Zheng, Weifan Zhu, Saizheng Cao, Wenxuan Zhao, Jiawen Zhang, Mao Ye, Yu Song, Lun-Hui Hu, Lexian Yang, Ming Shi, Huiqiu Yuan, Yongjun Zhang, Yuanfeng Xu, Yang Liu

Abstract: Recently, a new kind of collinear magnetism, dubbed altermagnetism, has attracted considerable interest. A key characteristic of an altermagnet is the momentum-dependent band and spin splitting without net magnetization. However, finding altermagnetic materials with large splitting near the Fermi level, which necessarily requires three-dimensional k-space mapping and is crucial for spintronic applications and emergent phenomena, remains challenging. Here, by employing synchrotron-based angle-resolved photoemission spectroscopy (ARPES) and model calculations, we uncover a large altermagnetic splitting, up to ~1.0 eV, near the Fermi level in CrSb. We verify its bulk-type g-wave altermagnetism through systematic three-dimensional k-space mapping, which unambiguously reveals the altermagnetic symmetry and associated nodal planes. The ARPES results are well captured by density functional theory calculations. In addition, tight-binding model analysis indicates that the large altermagnetic splitting arises from strong third-nearest-neighbor hopping mediated by Sb ions, which breaks both the space-time reversal symmetry and the translational spin-rotation symmetry. The large band/spin splitting near the Fermi level in metallic CrSb, together with its high TN (up to 705 K) and simple spin configuration, paves the way for exploring emergent phenomena and spintronic applications based on altermagnets.

Submitted 21 May, 2024; originally announced May 2024.

Comments: 16 pages, 4 figures and 1 table

arXiv:2405.12488 [pdf, other]

First joint oscillation analysis of Super-Kamiokande atmospheric and T2K accelerator neutrino data

Authors: Super-Kamiokande, T2K collaborations: S. Abe, K. Abe, N. Akhlaq, R. Akutsu, H. Alarakia-Charles, A. Ali, Y. I. Alj Hakim, S. Alonso Monsalve, S. Amanai, C. Andreopoulos, L. H. V. Anthony, M. Antonova, S. Aoki, K. A. Apte, T. Arai, T. Arihara, S. Arimoto, Y. Asada, R. Asaka, Y. Ashida, E. T. Atkin, N. Babu, et al. (524 additional authors not shown)

Abstract: The Super-Kamiokande and T2K collaborations present a joint measurement of neutrino oscillation parameters from their atmospheric and beam neutrino data. It uses a common interaction model for events overlapping in neutrino energy and correlated detector systematic uncertainties between the two datasets, which are found to be compatible. Using 3244.4 days of atmospheric data and a beam exposure of $19.7(16.3) \times 10^{20}$ protons on target in (anti)neutrino mode, the analysis finds a 1.9$σ$ exclusion of CP-conservation (defined as $J_{CP}=0$) and a preference for the normal mass ordering.

Submitted 21 May, 2024; originally announced May 2024.

Comments: 10 pages, 3 figures

arXiv:2405.12425 [pdf, other]

Continuous wave driving elucidates the desynchronisation dynamics of ultrashort dissipative Raman solitons generated in dispersive Kerr resonators

Authors: Zongda Li, Yiqing Xu, Stéphane Coen, Stuart G. Murdoch, Miro Erkintalo

Abstract: Phase-coherent pulsed driving of passive optical fiber resonators enables the generation of ultrashort dissipative Raman solitons with durations well below 100 fs. The existence and characteristics of such solitons critically depend on the desynchronisation between the pulsed driving source and the resonator roundtrip time, yet the full mechanism through which these dependencies arise remains unclear. Here, we numerically demonstrate that Raman solitons can exist even under conditions of continuous wave driving, and by numerically examining the existence and characteristics of Raman solitons under such conditions, we elucidate the role of desynchronisation in pulse-driven systems. In addition to providing new insights on the existence and characteristics of ultrashort Raman solitons, our analysis yields a qualitative explanation for the range of desynchronisations over which the solitons can exist.

Submitted 20 May, 2024; originally announced May 2024.

arXiv:2405.12294 [pdf, other]

Stabilizing fractional Chern insulators via exchange interaction in moiré systems

Authors: Xiaoyang Shen, Chonghao Wang, Ruiping Guo, Zhiming Xu, Wenhui Duan, Yong Xu

Abstract: The recent experimental discovery of fractional Chern insulators in moiré Chern bands in twisted transition metal dichalcogenide homobilayers has sparked intensive interest in exploring ways of engineering band topology and correlated states in moiré systems. In this letter, we demonstrate that, with an additional exchange interaction induced by the proximity effect, the topology and bandwidth of the moiré minibands of twisted $\mathrm{MoTe_2}$ homobilayers can be easily tuned. Fractional Chern insulators at -2/3 filling are found to appear at enlarged twist angles over a large range of twist angles with enhanced many-body gaps. We further discover a topological phase transition between the fractional Chern insulator, quantum anomalous Hall crystal, and charge density wave. Our results shed light on the interplay between topology and correlation physics.

Submitted 20 May, 2024; originally announced May 2024.

Comments: 7 pages, 4 figures

arXiv:2405.11976 [pdf, other]

Position-Guided Prompt Learning for Anomaly Detection in Chest X-Rays

Authors: Zhichao Sun, Yuliang Gu, Yepeng Liu, Zerui Zhang, Zhou Zhao, Yongchao Xu

Abstract: Anomaly detection in chest X-rays is a critical task. Most methods mainly model the distribution of normal images, and then regard significant deviation from normal distribution as anomaly. Recently, CLIP-based methods, pre-trained on a large number of medical images, have shown impressive performance on zero/few-shot downstream tasks. In this paper, we aim to explore the potential of CLIP-based methods for anomaly detection in chest X-rays. Considering the discrepancy between the CLIP pre-training data and the task-specific data, we propose a position-guided prompt learning method. Specifically, inspired by the fact that experts diagnose chest X-rays by carefully examining distinct lung regions, we propose learnable position-guided text and image prompts to adapt the task data to the frozen pre-trained CLIP-based model. To enhance the model's discriminative capability, we propose a novel structure-preserving anomaly synthesis method within chest X-rays during the training process. Extensive experiments on three datasets demonstrate that our proposed method outperforms some state-of-the-art methods. The code of our implementation is available at https://github.com/sunzc-sunny/PPAD.

Submitted 19 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

Comments: MICCAI 2024 Early Accept

arXiv:2405.11928 [pdf, other]

"Set It Up!": Functional Object Arrangement with Compositional Generative Models

Authors: Yiqing Xu, Jiayuan Mao, Yilun Du, Tomás Lozano-Pérez, Leslie Pack Kaelbling, David Hsu

Abstract: This paper studies the challenge of developing robots capable of understanding under-specified instructions for creating functional object arrangements, such as "set up a dining table for two"; previous arrangement approaches have focused on much more explicit instructions, such as "put object A on the table." We introduce a framework, SetItUp, for learning to interpret under-specified instructions. SetItUp takes a small number of training examples and a human-crafted program sketch to uncover arrangement rules for specific scene types. By leveraging an intermediate graph-like representation of abstract spatial relationships among objects, SetItUp decomposes the arrangement problem into two subproblems: i) learning the arrangement patterns from limited data and ii) grounding these abstract relationships into object poses. SetItUp leverages large language models (LLMs) to propose the abstract spatial relationships among objects in novel scenes as the constraints to be satisfied; then, it composes a library of diffusion models associated with these abstract relationships to find object poses that satisfy the constraints. We validate our framework on a dataset comprising study desks, dining tables, and coffee tables, with the results showing superior performance in generating physically plausible, functional, and aesthetically pleasing object arrangements compared to existing models.

Submitted 20 May, 2024; originally announced May 2024.

Comments: 10 pages main paper, 21 pages appendix, RSS 2024

arXiv:2405.11856 [pdf, other]

Modeling and simulation of a mechanism for suppressing the flipping problem of a jumping robot

Authors: Qi Li, Liang Peng, Zhiyuan Wu, Pengda Ye, Weitao Zhang, Yi Xu, Qing Shi

Abstract: In order to solve the problem of stable jumping of micro robots, we design a special mechanism: the elastic passive joint (EPJ). EPJ can assist in achieving smooth jumping through the opening-closing process when the robot jumps. First, we introduce the composition and operation principle of EPJ, and perform a dynamic modeling of the robot's jumping process. Then, in order to verify the effectiveness of EPJ in controlling the robot's smooth jump, we design a simulation experiment based on MATLAB. Through comparative experiments, it was proved that EPJ can greatly adjust the angular velocity of the robot and increase the jump distance of the robot. Finally, we analyze each parameter in EPJ and perform parameter optimization. After optimization, EPJ achieves a completely flip-free jump of the robot, laying an important foundation for improving the mobility of micro robots.

Submitted 20 May, 2024; originally announced May 2024.

arXiv:2405.11770 [pdf, other]

Learning Spatial Similarity Distribution for Few-shot Object Counting

Authors: Yuanwu Xu, Feifan Song, Haofeng Zhang

Abstract: Few-shot object counting aims to count the number of objects in a query image that belong to the same class as the given exemplar images. Existing methods compute the similarity between the query image and exemplars in the 2D spatial domain and perform regression to obtain the counting number. However, these methods overlook the rich information about the spatial distribution of similarity on the exemplar images, leading to significant impact on matching accuracy. To address this issue, we propose a network learning Spatial Similarity Distribution (SSD) for few-shot object counting, which preserves the spatial structure of exemplar features and calculates a 4D similarity pyramid point-to-point between the query features and exemplar features, capturing the complete distribution information for each point in the 4D similarity space. We propose a Similarity Learning Module (SLM) which applies the efficient center-pivot 4D convolutions on the similarity pyramid to map different similarity distributions to distinct predicted density values, thereby obtaining an accurate count. Furthermore, we also introduce a Feature Cross Enhancement (FCE) module that enhances query and exemplar features mutually to improve the accuracy of feature matching. Our approach outperforms state-of-the-art methods on multiple datasets, including FSC-147 and CARPK. Code is available at https://github.com/CBalance/SSD.

Submitted 20 May, 2024; originally announced May 2024.

Comments: Accepted to IJCAI2024
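The SSD abstract above describes a point-to-point 4D similarity between query features and exemplar features. The sketch below shows one level of such a tensor using cosine similarity. The feature shapes, the cosine measure, and the name `similarity_4d` are assumptions for illustration; it does not reproduce the paper's pyramid, its center-pivot 4D convolutions, or the code in the linked repository.

```python
import numpy as np


def similarity_4d(query_feat, exemplar_feat, eps=1e-8):
    """Cosine similarity between every query location and every exemplar location.

    query_feat:    array of shape (C, H, W)
    exemplar_feat: array of shape (C, h, w)
    returns:       array of shape (H, W, h, w)
    """
    # L2-normalise along the channel axis so the dot product is a cosine.
    q = query_feat / (np.linalg.norm(query_feat, axis=0, keepdims=True) + eps)
    e = exemplar_feat / (np.linalg.norm(exemplar_feat, axis=0, keepdims=True) + eps)
    # Contract the channel axis and keep both spatial grids intact.
    return np.einsum("cij,ckl->ijkl", q, e)


rng = np.random.default_rng(0)
query = rng.normal(size=(8, 5, 6))     # hypothetical query feature map
exemplar = rng.normal(size=(8, 3, 2))  # hypothetical exemplar feature map
sim = similarity_4d(query, exemplar)
self_sim = similarity_4d(exemplar, exemplar)
```

Keeping the exemplar's spatial axes, rather than pooling the exemplar to a single vector, is what preserves the spatial distribution of similarity that the abstract emphasises.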

arXiv:2405.11632 [pdf, other]

Attention to Quantum Complexity

Authors: Hyejin Kim, Yiqing Zhou, Yichen Xu, Kaarthik Varma, Amir H. Karamlou, Ilan T. Rosen, Jesse C. Hoke, Chao Wan, Jin Peng Zhou, William D. Oliver, Yuri D. Lensky, Kilian Q. Weinberger, Eun-Ah Kim

Abstract: The imminent era of error-corrected quantum computing urgently demands robust methods to characterize complex quantum states, even from limited and noisy measurements. We introduce the Quantum Attention Network (QuAN), a versatile classical AI framework leveraging the power of attention mechanisms specifically tailored to address the unique challenges of learning quantum complexity. Inspired by large language models, QuAN treats measurement snapshots as tokens while respecting their permutation invariance. Combined with a novel parameter-efficient mini-set self-attention block (MSSAB), such data structure enables QuAN to access high-order moments of the bit-string distribution and preferentially attend to less noisy snapshots. We rigorously test QuAN across three distinct quantum simulation settings: driven hard-core Bose-Hubbard model, random quantum circuits, and the toric code under coherent and incoherent noise. QuAN directly learns the growth in entanglement and state complexity from experimentally obtained computational basis measurements. In particular, it learns the growth in complexity of random circuit data upon increasing depth from noisy experimental data. Taken to a regime inaccessible by existing theory, QuAN unveils the complete phase diagram for noisy toric code data as a function of both noise types. This breakthrough highlights the transformative potential of using purposefully designed AI-driven solutions to assist quantum hardware.

Submitted 19 May, 2024; originally announced May 2024.

arXiv:2405.11585 [pdf, other]

Improved measurement of the branching fraction of $h_{c}\rightarrowγη^\prime/η$ and search for $h_{c}\rightarrowγπ^0$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (645 additional authors not shown)

Abstract: The processes $h_c\rightarrowγP$ $(P = η^\prime,~η,~π^{0})$ are studied with a sample of $(27.12\pm0.14)\times10^{8}$ $ψ(3686)$ events collected by the BESIII detector at the BEPCII collider. The branching fractions of $h_c\rightarrowγη^\prime$ and $h_c\rightarrowγη$ are measured to be $(1.40\pm0.11\pm0.04\pm0.10)\times10^{-3}$ and $(3.77\pm0.55\pm0.13\pm0.26)\times10^{-4}$, respectively, where the first uncertainties are statistical, the second systematic, and the third from the branching fraction of $ψ(3686)\rightarrowπ^{0}h_c$. The ratio $R_{h_c}=\frac{\mathscr{B}(h_c\rightarrowγη)}{\mathscr{B}(h_c\rightarrowγη^\prime)}$ is calculated to be $(27.0\pm4.4\pm1.0)\%$. The measurements are consistent with the previous results with improved precision by a factor of 2. The results are valuable for gaining a deeper understanding of $η-η^\prime$ mixing, and its manifestation within quantum chromodynamics. No significant signal is found for the decay $h_c\rightarrowγπ^{0}$, and an upper limit is placed on its branching fraction of $\mathscr{B}(h_c\rightarrowγπ^{0})<5.0\times10^{-5}$, at the 90\% confidence level.

Submitted 19 May, 2024; originally announced May 2024.
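The ratio $R_{h_c}$ quoted in the BESIII abstract above can be cross-checked from the two branching fractions given there. The sketch below uses only the rounded central values and statistical uncertainties from the abstract and assumes the two statistical errors are uncorrelated, which is an assumption of this example rather than a statement about the collaboration's procedure; the helper name `ratio_with_uncertainty` is hypothetical. Small differences from the published $(27.0\pm4.4)\%$ are expected from rounding.

```python
import math


def ratio_with_uncertainty(num, num_err, den, den_err):
    """Ratio num/den with first-order error propagation for uncorrelated inputs."""
    ratio = num / den
    relative_err = math.sqrt((num_err / num) ** 2 + (den_err / den) ** 2)
    return ratio, ratio * relative_err


# Central values and statistical uncertainties as quoted in the abstract.
b_eta, b_eta_stat = 3.77e-4, 0.55e-4            # B(h_c -> gamma eta)
b_eta_prime, b_eta_prime_stat = 1.40e-3, 0.11e-3  # B(h_c -> gamma eta')

r_hc, r_hc_stat = ratio_with_uncertainty(
    b_eta, b_eta_stat, b_eta_prime, b_eta_prime_stat
)
print(f"R_hc = ({100 * r_hc:.1f} +/- {100 * r_hc_stat:.1f}) %")
```

The result lands close to the quoted ratio and its statistical uncertainty, which suggests the statistical error on the ratio is dominated by the $h_c\rightarrowγη$ measurement.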

arXiv:2405.11560 [pdf]

High Discrimination Ratio, Broadband Circularly Polarized Light Photodetector Using Dielectric Achiral Nanostructures

Authors: Guanyu Zhang, Xiaying Lyu, Yulu Qin, Yaolong Li, Zipu Fan, Xianghan Meng, Yuqing Cheng, Zini Cao, Yixuan Xu, Dong Sun, Yunan Gao, Qihuang Gong, Guowei Lu

Abstract: The on-chip measurement of polarization states plays an increasingly crucial role in modern sensing and imaging applications. While high-performance monolithic linearly polarized photodetectors have been extensively studied, integrated circularly polarized light (CPL) photodetectors are still hindered by inadequate discrimination capability. In this study, we employ achiral all-dielectric nanostructures to develop a broadband CPL photodetector with an impressive discrimination ratio of ~107 at the wavelength of 405 nm, significantly surpassing its counterparts by two orders of magnitude. Our device shows outstanding CPL discrimination capability across the visible band without requiring intensity calibration. Its function mechanism is based on the CPL-dependent near-field modes within achiral structures: under left or right CPL illumination, distinct near-field modes are excited, resulting in asymmetric irradiation of the two electrodes and generating a photovoltage with directions determined by the chirality of the incident light field. The proposed design strategy facilitates the realization of ultra-compact CPL detection across diverse materials, structures, and spectral ranges, presenting a novel avenue for achieving high-performance monolithic CPL detection.

Submitted 19 May, 2024; originally announced May 2024.

Comments: 20 pages, 4 figures

arXiv:2405.11523 [pdf, other]

Diffusion-Based Hierarchical Image Steganography

Authors: Youmin Xu, Xuanyu Zhang, Jiwen Yu, Chong Mou, Xiandong Meng, Jian Zhang

Abstract: This paper introduces Hierarchical Image Steganography (HIS), a novel method that enhances the security and capacity of embedding multiple images into a single container using diffusion models. HIS assigns varying levels of robustness to images based on their importance, ensuring enhanced protection against manipulation. It adaptively exploits the robustness of the Diffusion Model alongside the reversibility of the Flow Model. The integration of Embed-Flow and Enhance-Flow improves embedding efficiency and image recovery quality, respectively, setting HIS apart from conventional multi-image steganography techniques. This innovative structure can autonomously generate a container image, thereby securely and efficiently concealing multiple images and text. Rigorous subjective and objective evaluations underscore our advantage in analytical resistance, robustness, and capacity, illustrating its expansive applicability in content safeguarding and privacy fortification.

Submitted 19 May, 2024; originally announced May 2024.

Comments: arXiv admin note: text overlap with arXiv:2305.16936

Report number: A-01

arXiv:2405.11502 [pdf, other]

CTGNN: Crystal Transformer Graph Neural Network for Crystal Material Property Prediction

Authors: Zijian Du, Luozhijie Jin, Le Shu, Yan Cen, Yuanfeng Xu, Yongfeng Mei, Hao Zhang

Abstract: The combination of deep learning algorithms and materials science has made significant progress in predicting novel materials and understanding various behaviours of materials. Here, we introduce a new model called the Crystal Transformer Graph Neural Network (CTGNN), which combines the advantages of the Transformer model and graph neural networks to address the complexity of structure-property relations in material data. Compared to the state-of-the-art models, CTGNN incorporates the graph network structure for capturing local atomic interactions and the dual-Transformer structures to model intra-crystal and inter-atomic relationships comprehensively. The benchmark carried out with the proposed CTGNN indicates that CTGNN significantly outperforms existing models like CGCNN and MEGNET in the prediction of formation energy and bandgap properties. Our work highlights the potential of CTGNN to enhance the performance of property prediction and accelerate the discovery of new materials, particularly for perovskite materials.

Submitted 19 May, 2024; originally announced May 2024.

Comments: 17 pages

arXiv:2405.11439 [pdf, other]

doi 10.3847/1538-3881/ad4030

On the Structure of the Sagittarius Spiral Arm in the Inner Milky Way

Authors: S. B. Bian, Y. W. Wu, Y. Xu, M. J. Reid, J. J. Li, B. Zhang, K. M. Menten, L. Moscadelli, A. Brunthaler

Abstract: We report measurements of trigonometric parallax and proper motion for two 6.7 GHz methanol and two 22 GHz water masers located in the far portion of the Sagittarius spiral arm as part of the BeSSeL Survey. Distances for these sources are estimated from parallax measurements combined with 3-dimensional kinematic distances. The distances of G033.64$-$00.22, G035.57$-$00.03, G041.15$-$00.20, and G043.89$-$00.78 are $9.9\pm0.5$, $10.2\pm0.6$, $7.6\pm0.5$, and $7.5\pm0.3$ kpc, respectively. Based on these measurements, we suggest that the Sagittarius arm segment beyond about 8 kpc from the Sun in the first Galactic quadrant should be adjusted radially outward relative to previous models. This supports the suggestion of Xu et al. (2023) that the Sagittarius and Perseus spiral arms might merge in the first quadrant before spiraling inward to the far end of the Galactic bar.

Submitted 19 May, 2024; originally announced May 2024.

Comments: 14 pages, 5 figures, accepted to AJ

Journal ref: 2024 AJ 167:267

arXiv:2405.11359 [pdf, other]

Optimizing Layerwise Microservice Management in Heterogeneous Wireless Networks

Authors: Haojie Yan, Yuedong Xu, Lianggui Dai

Abstract: Small cells with edge computing are densely deployed in 5G mobile networks to provide high throughput communication and low-latency computation. The flexibility of edge computation is empowered by the deployment of lightweight container-based microservices. In this paper, we take the first step toward optimizing the microservice management in small-cell networks. The prominent feature is that each microservice consists of multiple image layers and different microservices may share some basic layers, thus bringing deep coupling in their placement and service provision. Our objective is to minimize the expected total latency of microservice requests under the storage, communication and computing constraints of the sparsely interconnected small cell nodes. We formulate a binary quadratic program (BQP) with the multi-dimensional strategy of the image layer placement, the access selection and the task assignment. The BQP problem is then transformed into an ILP problem, and is solved by use of a novel sphere-box alternating direction multipliers method (ADMM) with reasonable complexity $O(q^{4})$, where $q$ is the number of variables in the transformed problem. Trace-driven experiments show that the gap between our proposed algorithm and the optimal is reduced by 35% compared with benchmark algorithms.

Submitted 18 May, 2024; originally announced May 2024.

Showing 151–200 of 6,207 results for author: Xu, Y