Skip to main content

Showing 1–50 of 1,292 results for author: cao, X

.
  1. arXiv:2407.08509  [pdf, other

    eess.IV cs.CV

    Haar Nuclear Norms with Applications to Remote Sensing Imagery Restoration

    Authors: Shuang Xu, Chang Yu, Jiangjun Peng, Xiangyong Cao

    Abstract: Remote sensing image restoration aims to reconstruct missing or corrupted areas within images. To date, low-rank based models have garnered significant interest in this field. This paper proposes a novel low-rank regularization term, named the Haar nuclear norm (HNN), for efficient and effective remote sensing image restoration. It leverages the low-rank properties of wavelet coefficients derive… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.08457  [pdf, other

    cs.CV

    Neural Poisson Solver: A Universal and Continuous Framework for Natural Signal Blending

    Authors: Delong Wu, Hao Zhu, Qi Zhang, You Li, Zhan Ma, Xun Cao

    Abstract: Implicit Neural Representation (INR) has become a popular method for representing visual signals (e.g., 2D images and 3D scenes), demonstrating promising results in various downstream applications. Given its potential as a medium for visual signals, exploring the development of a neural blending method that utilizes INRs is a natural progression. Neural blending involves merging two INRs to create… ▽ More

    Submitted 11 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: accepted by ECCV 2024

  3. arXiv:2407.07513  [pdf, other

    quant-ph

    High-rate quantum digital signatures network with integrated silicon photonics

    Authors: Yongqiang Du, Bing-Hong Li, Xin Hua, Xiao-Yu Cao, Zhengeng Zhao, Feng Xie, Zhenrong Zhang, Hua-Lei Yin, Xi Xiao, Ke** Wei

    Abstract: The development of quantum networks is paramount towards practical and secure communications. Quantum digital signatures (QDS) offer an information-theoretically secure solution for ensuring data integrity, authenticity, and non-repudiation, rapidly growing from proof-of-concept to robust demonstrations. However, previous QDS systems relied on expensive and bulky optical equipment, limiting large-… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 11 pages, 6 figures

  4. Top-K Pairwise Ranking: Bridging the Gap Among Ranking-Based Measures for Multi-Label Classification

    Authors: Zitai Wang, Qianqian Xu, Zhiyong Yang, Peisong Wen, Yuan He, Xiaochun Cao, Qingming Huang

    Abstract: Multi-label ranking, which returns multiple top-ranked labels for each instance, has a wide range of applications for visual tasks. Due to its complicated setting, prior arts have proposed various measures to evaluate model performances. However, both theoretical analysis and empirical observations show that a model might perform inconsistently on different measures. To bridge this gap, this paper… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  5. arXiv:2407.06633  [pdf, other

    eess.IV cs.CV

    Variational Zero-shot Multispectral Pansharpening

    Authors: Xiangyu Rui, Xiangyong Cao, Yining Li, Deyu Meng

    Abstract: Pansharpening aims to generate a high spatial resolution multispectral image (HRMS) by fusing a low spatial resolution multispectral image (LRMS) and a panchromatic image (PAN). The most challenging issue for this task is that only the to-be-fused LRMS and PAN are available, and the existing deep learning-based methods are unsuitable since they rely on many training pairs. Traditional variational… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  6. arXiv:2407.06064  [pdf, other

    eess.IV cs.CV

    Pan-denoising: Guided Hyperspectral Image Denoising via Weighted Represent Coefficient Total Variation

    Authors: Shuang Xu, Qiao Ke, Jiangjun Peng, Xiangyong Cao, Zixiang Zhao

    Abstract: This paper introduces a novel paradigm for hyperspectral image (HSI) denoising, which is termed \textit{pan-denoising}. In a given scene, panchromatic (PAN) images capture similar structures and textures to HSIs but with less noise. This enables the utilization of PAN images to guide the HSI denoising process. Consequently, pan-denoising, which incorporates an additional prior, has the potential t… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  7. arXiv:2407.05023  [pdf, other

    cs.CV

    SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction

    Authors: Weixing Xie, Junfeng Yao, Xianpeng Cao, Qiqin Lin, Zerui Tang, Xiao Dong, Xiaohu Guo

    Abstract: Dynamic reconstruction of deformable tissues in endoscopic video is a key technology for robot-assisted surgery. Recent reconstruction methods based on neural radiance fields (NeRFs) have achieved remarkable results in the reconstruction of surgical scenes. However, based on implicit representation, NeRFs struggle to capture the intricate details of objects in the scene and cannot achieve real-tim… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  8. arXiv:2407.04928  [pdf, other

    cs.CV eess.IV

    CLIPVQA:Video Quality Assessment via CLIP

    Authors: Fengchuang Xing, Mingjie Li, Yuan-Gen Wang, Guopu Zhu, Xiaochun Cao

    Abstract: In learning vision-language representations from web-scale data, the contrastive language-image pre-training (CLIP) mechanism has demonstrated a remarkable performance in many vision tasks. However, its application to the widely studied video quality assessment (VQA) task is still an open issue. In this paper, we propose an efficient and effective CLIP-based Transformer method for the VQA problem… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  9. arXiv:2407.03335  [pdf, other

    math.NA cs.CV eess.IV

    Dual-Domain Deep D-bar Method for Solving Electrical Impedance Tomography

    Authors: Xiang Cao, Qiaoqiao Ding, Xiaoqun Zhang

    Abstract: The regularized D-bar method is one of the most prominent methods for solving Electrical Impedance Tomography (EIT) problems due to its efficiency and simplicity. It provides a direct approach by applying low-pass filtering to the scattering data in the non-linear Fourier domain, thereby yielding a smoothed conductivity approximation. However, D-bar images often present low contrast and low resolu… ▽ More

    Submitted 12 May, 2024; originally announced July 2024.

    Comments: 15 pages, 7 figures

  10. arXiv:2407.02961  [pdf, other

    cs.LG cs.AI

    Towards a Scalable Reference-Free Evaluation of Generative Models

    Authors: Azim Ospanov, **gwei Zhang, Mohammad Jalali, Xuenan Cao, Andrej Bogdanov, Farzan Farnia

    Abstract: While standard evaluation scores for generative models are mostly reference-based, a reference-dependent assessment of generative models could be generally difficult due to the unavailability of applicable reference datasets. Recently, the reference-free entropy scores, VENDI and RKE, have been proposed to evaluate the diversity of generated data. However, estimating these scores from data leads t… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  11. arXiv:2407.02565  [pdf, other

    hep-th

    Orientifold Calabi-Yau Threefolds: Divisor Exchanges and Multi-Reflections

    Authors: Xu Cao, Hongfei Gao, Xin Gao

    Abstract: Using the Kreuzer-Skarke database of 4-dimensional reflexive polytopes, we systematically constructed a new database of orientifold Calabi-Yau threefolds with $h^{1,1}(X) \leq 12$. Our approach involved non-trivial $\mathbb{Z}_2$ involutions, incorporating both divisor exchanges and multi-divisor reflections acting on the Calabi-Yau threefolds. Each proper involution results in an orientifold Cala… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 55 pages, 2 figures

  12. Sequential Manipulation Against Rank Aggregation: Theory and Algorithm

    Authors: Ke Ma, Qianqian Xu, **shan Zeng, Wei Liu, Xiaochun Cao, Yingfei Sun, Qingming Huang

    Abstract: Rank aggregation with pairwise comparisons is widely encountered in sociology, politics, economics, psychology, sports, etc . Given the enormous social impact and the consequent incentives, the potential adversary has a strong motivation to manipulate the ranking list. However, the ideal attack opportunity and the excessive adversarial capability cause the existing methods to be impractical. To fu… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Accepted by IEEE TPAMI URL: https://ieeexplore.ieee.org/document/10564181

  13. arXiv:2407.00933  [pdf, other

    cs.DC eess.SP

    Reconfigurable Intelligent Computational Surfaces for MEC-Assisted Autonomous Driving Networks: Design Optimization and Analysis

    Authors: Xueyao Zhang, Bo Yang, Zhiwen Yu, Xuelin Cao, George C. Alexandropoulos, Yan Zhang, Merouane Debbah, Chau Yuen

    Abstract: This paper investigates autonomous driving safety improvement via task offloading from cellular vehicles (CVs) to a multi-access edge computing (MEC) server using vehicle-to-infrastructure (V2I) links. Considering that the latter links can be reused by vehicle-to-vehicle (V2V) communications to improve spectrum utilization, the receiver of the V2I link may suffer from severe interference that can… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  14. arXiv:2407.00631  [pdf, other

    cs.LG cs.AI

    TrialBench: Multi-Modal Artificial Intelligence-Ready Clinical Trial Datasets

    Authors: **tai Chen, Yaojun Hu, Yue Wang, Yingzhou Lu, Xu Cao, Miao Lin, Hongxia Xu, Jian Wu, Cao Xiao, Jimeng Sun, Lucas Glass, Kexin Huang, Marinka Zitnik, Tianfan Fu

    Abstract: Clinical trials are pivotal for develo** new medical treatments, yet they typically pose some risks such as patient mortality, adverse events, and enrollment failure that waste immense efforts spanning over a decade. Applying artificial intelligence (AI) to forecast or simulate key events in clinical trials holds great potential for providing insights to guide trial designs. However, complex dat… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  15. arXiv:2406.18844  [pdf, other

    cs.CV

    Revisiting Backdoor Attacks against Large Vision-Language Models

    Authors: Siyuan Liang, Jiawei Liang, Tianyu Pang, Chao Du, Aishan Liu, Ee-Chien Chang, Xiaochun Cao

    Abstract: Instruction tuning enhances large vision-language models (LVLMs) but raises security risks through potential backdoor attacks due to their openness. Previous backdoor studies focus on enclosed scenarios with consistent training and testing instructions, neglecting the practical domain gaps that could affect attack effectiveness. This paper empirically examines the generalizability of backdoor atta… ▽ More

    Submitted 1 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: 24 pages, 8 figures

  16. arXiv:2406.18599  [pdf, other

    physics.ins-det nucl-ex nucl-th

    Fudan Multi-purpose Active TArget Time Projection Chamber (fMeta-TPC) for Photonnuclear Reaction Experiments

    Authors: Huang-Kai Wu, Xi-Yang Wang, Yu-Miao Wang, You-**g Wang, De-Qing Fang, Wan-Bing He, Wei-Hu Ma, Xi-Guang Cao, Chang-Bo Fu, Xian-Gai Deng, Yu-Gang Ma

    Abstract: Active Target Time Projection Chambers (AT-TPCs) are state-of-the-art tools in the field of low-energy nuclear physics, particularly suitable for experiments using low-intensity radioactive ion beams or gamma rays. The Fudan Multi-purpose Active Target Time Projection Chamber (fMeta-TPC) with 2048 channels has been developed to study $α$-clustering nuclei. {\fcb In this work, the focus is on the s… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 10 pages, 12 figures

  17. arXiv:2406.18055  [pdf, other

    cs.IT eess.SP

    Filtering Reconfigurable Intelligent Computational Surface for RF Spectrum Purification

    Authors: Kaining Wang, Bo Yang, Zhiwen Yu, Xuelin Cao, Mérouane Debbah, Chau Yuen

    Abstract: The increasing demand for communication is degrading the electromagnetic (EM) transmission environment due to severe EM interference, significantly reducing the efficiency of the radio frequency (RF) spectrum. Metasurfaces, a promising technology for controlling desired EM waves, have recently received significant attention from both academia and industry. However, the potential impact of out-of-b… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  18. Efficient source-independent quantum conference key agreement

    Authors: Yu Bao, Yi-Ran Xiao, Yu-Chen Song, Yao Fu, Xiao-Yu Cao, Hua-Lei Yin, Zeng-Bing Chen

    Abstract: Quantum conference key agreement (QCKA) enables the unconditional secure distribution of conference keys among multiple participants. Due to challenges in high-fidelity preparation and long-distance distribution of multi-photon entanglement, entanglement-based QCKA is facing severe limitations in both key rate and scalability. Here, we propose a source-independent QCKA scheme utilizing the post-ma… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 10 pages, 6 figures

    Journal ref: Optics Express 32, 24629 (2024)

  19. arXiv:2406.17248  [pdf, other

    quant-ph

    MindSpore Quantum: A User-Friendly, High-Performance, and AI-Compatible Quantum Computing Framework

    Authors: Xusheng Xu, Jiangyu Cui, Zidong Cui, Runhong He, Qingyu Li, Xiaowei Li, Yanling Lin, Jiale Liu, Wuxin Liu, Jiale Lu, Maolin Luo, Chufan Lyu, Shijie Pan, Mosharev Pavel, Runqiu Shu, Jialiang Tang, Ruoqian Xu, Shu Xu, Kang Yang, Fan Yu, Qingguo Zeng, Haiying Zhao, Qiang Zheng, Junyuan Zhou, Xu Zhou , et al. (14 additional authors not shown)

    Abstract: We introduce MindSpore Quantum, a pioneering hybrid quantum-classical framework with a primary focus on the design and implementation of noisy intermediate-scale quantum (NISQ) algorithms. Leveraging the robust support of MindSpore, an advanced open-source deep learning training/inference framework, MindSpore Quantum exhibits exceptional efficiency in the design and training of variational quantum… ▽ More

    Submitted 10 July, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  20. arXiv:2406.17126  [pdf, other

    cs.CV cs.LG

    MM-SpuBench: Towards Better Understanding of Spurious Biases in Multimodal LLMs

    Authors: Wenqian Ye, Guangtao Zheng, Yunsheng Ma, Xu Cao, Bolin Lai, James M. Rehg, Aidong Zhang

    Abstract: Spurious bias, a tendency to use spurious correlations between non-essential input attributes and target variables for predictions, has revealed a severe robustness pitfall in deep learning models trained on single modality data. Multimodal Large Language Models (MLLMs), which integrate both vision and language models, have demonstrated strong capability in joint vision-language understanding. How… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  21. arXiv:2406.15994  [pdf, other

    astro-ph.HE astro-ph.CO

    The delayed radio emission in the black hole X-ray binary MAXI J1348$-$630

    Authors: Bei You, Shuai-kang Yang, Zhen Yan, Xinwu Cao, Andrzej A. Zdziarski

    Abstract: We explore the coupling between the accretion flow and the jet in black hole X-ray binary (BHXRB) MAXI J1348-630 by analyzing the X-ray and radio observations during its 2019 outburst. We measure the time delay between the radio and Comptonization fluxes with the interpolated cross-correlation function. For the first time, we find that the radio emission lags behind the X-ray Comptonization emissi… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 10 pages, 4 figures, Accepted for publication in ApJ Letters

  22. arXiv:2406.14194  [pdf, other

    cs.CV cs.AI

    VLBiasBench: A Comprehensive Benchmark for Evaluating Bias in Large Vision-Language Model

    Authors: Jie Zhang, Sibo Wang, Xiangkui Cao, Zheng Yuan, Shiguang Shan, Xilin Chen, Wen Gao

    Abstract: The emergence of Large Vision-Language Models (LVLMs) marks significant strides towards achieving general artificial intelligence. However, these advancements are tempered by the outputs that often reflect biases, a concern not yet extensively investigated. Existing benchmarks are not sufficiently comprehensive in evaluating biases due to their limited data scale, single questioning format and nar… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  23. arXiv:2406.13499  [pdf, other

    cs.SI cs.LG

    GraphMU: Repairing Robustness of Graph Neural Networks via Machine Unlearning

    Authors: Tao Wu, Xinwen Cao, Chao Wang, Shaojie Qiao, Lin Yuan, Canyixing Cui, Yanbing Liu

    Abstract: Graph Neural Networks (GNNs) have demonstrated significant application potential in various fields. However, GNNs are still vulnerable to adversarial attacks. Numerous adversarial defense methods on GNNs are proposed to address the problem of adversarial attacks. However, these methods can only serve as a defense before poisoning, but cannot repair poisoned GNN. Therefore, there is an urgent need… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  24. arXiv:2406.13335  [pdf, other

    cs.NI eess.SP

    AI-Empowered Multiple Access for 6G: A Survey of Spectrum Sensing, Protocol Designs, and Optimizations

    Authors: Xuelin Cao, Bo Yang, Kaining Wang, Xinghua Li, Zhiwen Yu, Chau Yuen, Yan Zhang, Zhu Han

    Abstract: With the rapidly increasing number of bandwidth-intensive terminals capable of intelligent computing and communication, such as smart devices equipped with shallow neural network models, the complexity of multiple access for these intelligent terminals is increasing due to the dynamic network environment and ubiquitous connectivity in 6G systems. Traditional multiple access (MA) design and optimiz… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  25. arXiv:2406.13169  [pdf, other

    astro-ph.GA astro-ph.HE

    A surprising excess of radio emission in extremely stable quasars: a unique clue to jet launching?

    Authors: Wen-Yong Kang, Jun-Xian Wang, Zhen-Yi Cai, Hao-Chen Wang, Wen-Ke Ren, Mai Liao, Feng Yuan, Andrzej Zdziarski, Xinwu Cao

    Abstract: Quasars are generally divided into jetted radio-loud and non-jetted radio-quiet ones, but why only 10% quasars are radio loud has been puzzling for decades. Other than jet-induced-phenomena, black hole mass, or Eddington ratio, prominent difference between jetted and non-jetted quasars has scarcely been detected. Here we show a unique distinction between them and the mystery of jet launching could… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 11 pages, 16 figures, Accepted by ApJ

  26. arXiv:2406.10424  [pdf, other

    cs.CV cs.AI

    What is the Visual Cognition Gap between Humans and Multimodal LLMs?

    Authors: Xu Cao, Bolin Lai, Wenqian Ye, Yunsheng Ma, Joerg Heintz, **tai Chen, Jianguo Cao, James M. Rehg

    Abstract: Recently, Multimodal Large Language Models (MLLMs) have shown great promise in language-guided perceptual tasks such as recognition, segmentation, and object detection. However, their effectiveness in addressing visual cognition problems that require high-level reasoning is not well-established. One such challenge is abstract visual reasoning (AVR) -- the cognitive ability to discern relationships… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 14 pages, 4 figures, the appendix will be updated soon

    MSC Class: 68T01

  27. arXiv:2406.09102  [pdf, ps, other

    math.AP

    Analytic smoothing effect of the Cauchy problem for a class of ultra-parabolic equations

    Authors: Xiao-Dong Cao, Chao-Jiang Xu

    Abstract: In this paper, we study a class of strongly degenerate ultraparabolic equations with analytic coefficients. We demonstrate that the Cauchy problem exhibits an analytic smoothing effect. This means that, with an initial datum belonging to the Sobolev space $H^s$ (of real index s), the associated Cauchy problem admits a unique solution that is analytic in all spatial variables for any strictly posit… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  28. arXiv:2406.08165  [pdf, other

    hep-ph

    Double pion photoproduction off nucleons in covariant chiral perturbation theory

    Authors: Kai-Ge Kang, Xiong-Hui Cao, De-Liang Yao, Han-Qing Zheng

    Abstract: The double pion photoproduction off nucleons near threshold is analyzed in a covariant baryon chiral perturbation theory up to next to leading order, where the $Δ(1232)$, $N^*(1400)$ and $ρ(770)$ resonances are included as explicit degrees of freedom. For the process $γp \to π^+ π^0 n$, the chiral results of total cross sections, invariant-mass distributions and beam-helicity asymmetry are in good… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 27 pages, 11 figures, 1 table

  29. arXiv:2406.06089  [pdf, other

    cs.CV

    Texture Re-scalable Universal Adversarial Perturbation

    Authors: Yihao Huang, Qing Guo, Felix Juefei-Xu, Ming Hu, Xiaojun Jia, Xiaochun Cao, Geguang Pu, Yang Liu

    Abstract: Universal adversarial perturbation (UAP), also known as image-agnostic perturbation, is a fixed perturbation map that can fool the classifier with high probabilities on arbitrary images, making it more practical for attacking deep models in the real world. Previous UAP methods generate a scale-fixed and texture-fixed perturbation map for all images, which ignores the multi-scale objects in images… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 14 pages (accepted by TIFS2024)

  30. arXiv:2406.05982  [pdf

    eess.IV cs.LG physics.med-ph

    Artificial Intelligence for Neuro MRI Acquisition: A Review

    Authors: Hongjia Yang, Guanhua Wang, Ziyu Li, Haoxiang Li, Jialan Zheng, Yuxin Hu, Xiaozhi Cao, Congyu Liao, Huihui Ye, Qiyuan Tian

    Abstract: Magnetic resonance imaging (MRI) has significantly benefited from the resurgence of artificial intelligence (AI). By leveraging AI's capabilities in large-scale optimization and pattern recognition, innovative methods are transforming the MRI acquisition workflow, including planning, sequence design, and correction of acquisition artifacts. These emerging algorithms demonstrate substantial potenti… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Magn Reson Mater Phy (2024)

  31. arXiv:2406.04612  [pdf, other

    cs.LG cs.AI cs.IT cs.NE cs.SI

    Revisiting Attention Weights as Interpretations of Message-Passing Neural Networks

    Authors: Yong-Min Shin, Siqing Li, Xin Cao, Won-Yong Shin

    Abstract: The self-attention mechanism has been adopted in several widely-used message-passing neural networks (MPNNs) (e.g., GATs), which adaptively controls the amount of information that flows along the edges of the underlying graph. This usage of attention has made such models a baseline for studies on explainable AI (XAI) since interpretations via attention have been popularized in various domains (e.g… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 11 pages, 3 figures, 5 tables

  32. arXiv:2405.21018  [pdf, other

    cs.LG cs.CL cs.CR

    Improved Techniques for Optimization-Based Jailbreaking on Large Language Models

    Authors: Xiaojun Jia, Tianyu Pang, Chao Du, Yihao Huang, **dong Gu, Yang Liu, Xiaochun Cao, Min Lin

    Abstract: Large language models (LLMs) are being rapidly developed, and a key component of their widespread deployment is their safety-related alignment. Many red-teaming efforts aim to jailbreak LLMs, where among these efforts, the Greedy Coordinate Gradient (GCG) attack's success has led to a growing interest in the study of optimization-based jailbreaking techniques. Although GCG is a significant milesto… ▽ More

    Submitted 5 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

  33. arXiv:2405.18044  [pdf, other

    cs.MA cs.AI

    Cognitive Insights and Stable Coalition Matching for Fostering Multi-Agent Cooperation

    Authors: Jiaqi Shao, Tianjun Yuan, Tao Lin, Xuanyu Cao, Bing Luo

    Abstract: Cognitive abilities, such as Theory of Mind (ToM), play a vital role in facilitating cooperation in human social interactions. However, our study reveals that agents with higher ToM abilities may not necessarily exhibit better cooperative behavior compared to those with lower ToM abilities. To address this challenge, we propose a novel matching coalition mechanism that leverages the strengths of a… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  34. arXiv:2405.16766  [pdf, other

    cs.CV cs.AI cs.LG

    Reframing the Relationship in Out-of-Distribution Detection

    Authors: YuXiao Lee, Xiaofeng Cao

    Abstract: The remarkable achievements of Large Language Models (LLMs) have captivated the attention of both academia and industry, transcending their initial role in dialogue generation. The utilization of LLMs as intermediary agents in various tasks has yielded promising results, sparking a wave of innovation in artificial intelligence. Building on these breakthroughs, we introduce a novel approach that in… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  35. arXiv:2405.16073  [pdf

    quant-ph

    Unveiling the 3D Morphology of Epitaxial GaAs/AlGaAs Quantum Dots

    Authors: Yiteng Zhang, Lukas Gruenewald, Xin Cao, Doaa Abdelbarey, Xian Zheng, Eddy P. Rugeramigabo, Johan Verbeeck, Michael Zopf, Fei Ding

    Abstract: Strain-free GaAs/AlGaAs semiconductor quantum dots (QDs) grown by droplet etching and nanohole infilling (DENI) are highly promising candidates for the on-demand generation of indistinguishable and entangled photon sources. The spectroscopic fingerprint and quantum optical properties of QDs are significantly influenced by their morphology. The effects of nanohole geometry and infilled material on… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  36. arXiv:2405.16058  [pdf, other

    math.OC

    A Novel Privacy Enhancement Scheme with Dynamic Quantization for Federated Learning

    Authors: Yifan Wang, Xianghui Cao, Shi **, Mo-Yuen Chow

    Abstract: Federated learning (FL) has been widely regarded as a promising paradigm for privacy preservation of raw data in machine learning. Although, the data privacy in FL is locally protected to some extent, it is still a desideratum to enhance privacy and alleviate communication overhead caused by repetitively transmitting model parameters. Typically, these challenges are addressed separately, or jointl… ▽ More

    Submitted 27 May, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

  37. arXiv:2405.15873  [pdf, other

    hep-th cond-mat.stat-mech

    Ramp from Replica Trick

    Authors: Xuchen Cao, Thomas Faulkner

    Abstract: We compute the spectral form factor of the modular Hamiltonian $K=-\lnρ_A$ associated to the reduced density matrix of a Haar random state. A ramp is demonstrated and we find an analytic expression for its slope. Our method involves an application of the replica trick, where we first calculate the correlator $<\text{tr}ρ_A^n\;\text{tr}ρ_A^m>$ at large bond dimension and then analytically continue… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 35 pages, 13 figures

  38. arXiv:2405.15086  [pdf, other

    quant-ph

    Parametrically controlled chiral interface for superconducting quantum devices

    Authors: Xi Cao, Abdullah Irfan, Michael Mollenhauer, Kaushik Singirikonda, Wolfgang Pfaff

    Abstract: Nonreciprocal microwave routing plays a crucial role for measuring quantum circuits, and allows for realizing cascaded quantum systems for generating and stabilizing entanglement between non-interacting qubits. The most commonly used tools for implementing directionality are ferrite-based circulators. These devices are versatile, but suffer from excess loss, a large footprint, and fixed directiona… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 20 pages, 13 figures

  39. arXiv:2405.14832  [pdf, other

    cs.CV

    Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer

    Authors: Shuang Wu, Youtian Lin, Feihu Zhang, Yifei Zeng, **gxi Xu, Philip Torr, Xun Cao, Yao Yao

    Abstract: Generating high-quality 3D assets from text and images has long been challenging, primarily due to the absence of scalable 3D representations capable of capturing intricate geometry distributions. In this work, we introduce Direct3D, a native 3D generative model scalable to in-the-wild input images, without requiring a multiview diffusion model or SDS optimization. Our approach comprises two prima… ▽ More

    Submitted 1 June, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  40. arXiv:2405.11733  [pdf, other

    quant-ph

    Simulating a Chern Insulator with C = $\pm$2 on Synthetic Floquet Lattice

    Authors: Lingxiao Lei, Weichen Wang, Guangyao Huang, Shun Hu, Xi Cao, Xinfang Zhang, Mingtang Deng, **xing Chen

    Abstract: The synthetic Floquet lattice, generated by multiple strong drives with mutually incommensurate frequencies, provides a powerful platform for the quantum simulation of topological phenomena. In this study, we propose a 4-band tight-binding model of the Chern insulator with a Chern number C = $\pm$2 by coupling two layers of the half-BHZ lattice and subsequently map** it onto the Floquet lattice… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  41. arXiv:2405.10663  [pdf, ps, other

    astro-ph.GA astro-ph.CO

    Instability of Circumnuclear Gas Supply as An Origin of "Changing-look" Phenomenon of Supermassive Blackholes

    Authors: J. Wang, D. W. Xu, Xinwu Cao, C. Gao, C. H. Xie, J. Y. Wei

    Abstract: The origin of the "Changing-look" (CL) phenomenon in supermassive black holes (SMBHs) remains an open issue. This study aims to shed light on this phenomenon by focusing on a sample that encompasses all known repeating CL active galactic nuclei (AGNs). Through the identification of a characteristic time scale for the CL phenomenon, it was observed that larger SMBHs possess shorter characteristic t… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 14 pages, 4 figures and 2 tables, accepted by ApJ

  42. arXiv:2405.10513  [pdf, other

    cs.LG eess.SP

    Federated Learning With Energy Harvesting Devices: An MDP Framework

    Authors: Kai Zhang, Xuanyu Cao

    Abstract: Federated learning (FL) requires edge devices to perform local training and exchange information with a parameter server, leading to substantial energy consumption. A critical challenge in practical FL systems is the rapid energy depletion of battery-limited edge devices, which curtails their operational lifespan and affects the learning performance. To address this issue, we apply energy harvesti… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  43. arXiv:2405.09782  [pdf, other

    cs.CV

    Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection

    Authors: Feiran Li, Qianqian Xu, Shilong Bao, Zhiyong Yang, Runmin Cong, Xiaochun Cao, Qingming Huang

    Abstract: This paper explores the size-invariance of evaluation metrics in Salient Object Detection (SOD), especially when multiple targets of diverse sizes co-exist in the same image. We observe that current metrics are size-sensitive, where larger objects are focused, and smaller ones tend to be ignored. We argue that the evaluation should be size-invariant because bias based on size is unjustified withou… ▽ More

    Submitted 27 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

    Comments: This paper has been accepted by ICML2024

  44. arXiv:2405.08816  [pdf, other

    cs.CV cs.RO

    The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition

    Authors: Lingdong Kong, Shaoyuan Xie, Hanjiang Hu, Yaru Niu, Wei Tsang Ooi, Benoit R. Cottereau, Lai Xing Ng, Yuexin Ma, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu, Weichao Qiu, Wei Zhang, Xu Cao, Hao Lu, Ying-Cong Chen, Caixin Kang, Xinning Zhou, Chengyang Ying, Wentao Shang, Xingxing Wei, Yinpeng Dong, Bo Yang, Shengyin Jiang , et al. (66 additional authors not shown)

    Abstract: In the realm of autonomous driving, robust perception under out-of-distribution conditions is paramount for the safe deployment of vehicles. Challenges such as adverse weather, sensor malfunctions, and environmental unpredictability can severely impact the performance of autonomous systems. The 2024 RoboDrive Challenge was crafted to propel the development of driving perception technologies that c… ▽ More

    Submitted 29 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: ICRA 2024; 32 pages, 24 figures, 5 tables; Code at https://robodrive-24.github.io/

  45. arXiv:2405.07780  [pdf, other

    cs.LG cs.AI cs.CV

    Harnessing Hierarchical Label Distribution Variations in Test Agnostic Long-tail Recognition

    Authors: Zhiyong Yang, Qianqian Xu, Zitai Wang, Sicong Li, Boyu Han, Shilong Bao, Xiaochun Cao, Qingming Huang

    Abstract: This paper explores test-agnostic long-tail recognition, a challenging long-tail task where the test label distributions are unknown and arbitrarily imbalanced. We argue that the variation in these distributions can be broken down hierarchically into global and local levels. The global ones reflect a broad range of diversity, while the local ones typically arise from milder changes, often focused… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  46. arXiv:2405.06948  [pdf, other

    cs.CV

    Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation

    Authors: Shengyuan Liu, Bo Wang, Ye Ma, Te Yang, Xipeng Cao, Quan Chen, Han Li, Di Dong, Peng Jiang

    Abstract: Existing subject-driven text-to-image generation models suffer from tedious fine-tuning steps and struggle to maintain both text-image alignment and subject fidelity. For generating compositional subjects, it often encounters problems such as object missing and attribute mixing, where some subjects in the input prompt are not generated or their attributes are incorrectly combined. To address these… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: 26 pages, 13 figures

  47. arXiv:2405.06896  [pdf, other

    hep-ph hep-th nucl-th

    Energy momentum tensor on and off the light cone: exposition with scalar Yukawa theory

    Authors: Xianghui Cao, Siqi Xu, Yang Li, Guangyao Chen, Xingbo Zhao, Vladimir A. Karmanov, James P. Vary

    Abstract: We compute the gravitational form factors $A_i$, $D_i$ and $\bar c_i$ of the scalar Yukawa theory using both the light-cone and covariant perturbation theory at the one-loop level. The light-cone formalism provides a potential approach to access these form factors beyond the perturbative regime. However, unlike the covariant formulation, the Poincaré symmetry on the light cone is not manifest. In… ▽ More

    Submitted 11 July, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

    Comments: 13 pages, 2 figures. Published in JHEP

    Journal ref: JHEP 07, 095 (2024)

  48. arXiv:2405.06598  [pdf, other

    cs.CV

    A Lightweight Transformer for Remote Sensing Image Change Captioning

    Authors: Dongwei Sun, Yajie Bao, Xiangyong Cao

    Abstract: Remote sensing image change captioning (RSICC) aims to automatically generate sentences that describe content differences in remote sensing bitemporal images. Recently, attention-based transformers have become a prevalent idea for capturing the features of global change. However, existing transformer-based RSICC methods face challenges, e.g., high parameters and high computational complexity cause… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  49. arXiv:2405.05545  [pdf, other

    cs.LG stat.ML

    Deep Hierarchical Graph Alignment Kernels

    Authors: Shuhao Tang, Hao Tian, Xiaofeng Cao, Wei Ye

    Abstract: Typical R-convolution graph kernels invoke the kernel functions that decompose graphs into non-isomorphic substructures and compare them. However, overlooking implicit similarities and topological position information between those substructures limits their performances. In this paper, we introduce Deep Hierarchical Graph Alignment Kernels (DHGAK) to resolve this problem. Specifically, the relati… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  50. arXiv:2405.04788  [pdf, other

    cs.CV

    DiffMatch: Visual-Language Guidance Makes Better Semi-supervised Change Detector

    Authors: Kaiyu Li, Xiangyong Cao, Yupeng Deng, Junmin Liu, Deyu Meng, Zhi Wang

    Abstract: Change Detection (CD) aims to identify pixels with semantic changes between images. However, annotating massive numbers of pixel-level images is labor-intensive and costly, especially for multi-temporal images, which require pixel-wise comparisons by human experts. Considering the excellent performance of visual language models (VLMs) for zero-shot, open-vocabulary, etc. with prompt-based reasonin… ▽ More

    Submitted 22 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: 13 pages, 5 figures