
Showing 201–250 of 1,564 results for author: Xue, R

  1. arXiv:2401.01715  [pdf, other]

    cond-mat.str-el cond-mat.stat-mech quant-ph

    HEOM-QUICK2: a general-purpose simulator for fermionic many-body open quantum systems -- An Update

    Authors: Daochi Zhang, Lyuzhou Ye, Jiaan Cao, Yao Wang, Rui-Xue Xu, Xiao Zheng, YiJing Yan

    Abstract: Many-body open quantum systems (OQS) have a profound impact on various subdisciplines of physics, chemistry, and biology. Thus, the development of a computer program capable of accurately, efficiently, and versatilely simulating many-body OQS is highly desirable. In recent years, we have focused on the advancement of numerical algorithms based on the fermionic hierarchical equations of motion (HEO…

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: 22 pages; 9 figures

  2. arXiv:2401.01507  [pdf]

    cond-mat.str-el cond-mat.mtrl-sci

    Real-space hole-doping titration and manipulation of correlated charge density wave state in 1T-TaS2

    Authors: Haoyu Dong, Yanyan Geng, Jianfeng Guo, Le Lei, Yan Li, Li Huang, Fei Pang, Rui Xu, Weiqiang Yu, Wei Ji, Hong-Jun Gao, Weichang Zhou, Zhihai Cheng

    Abstract: The complex correlated charge density wave (CDW) phases of 1T-TaS2 have attracted great attention due to their emergent quantum states, such as intricate CDW phase, Mott-Hubbard state, superconductivity and quantum spin liquid. The delicate interplay among the complex intra-/inter-layer electron-electron and electron-lattice interactions is the fundamental prerequisite of these exotic quantum stat…

    Submitted 21 January, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

  3. arXiv:2401.00921  [pdf, other]

    cs.CV

    Skeleton2vec: A Self-supervised Learning Framework with Contextualized Target Representations for Skeleton Sequence

    Authors: Ruizhuo Xu, Linzhi Huang, Mei Wang, Jiani Hu, Weihong Deng

    Abstract: Self-supervised pre-training paradigms have been extensively explored in the field of skeleton-based action recognition. In particular, methods based on masked prediction have pushed the performance of pre-training to a new height. However, these methods take low-level features, such as raw joint coordinates or temporal motion, as prediction targets for the masked regions, which is suboptimal. In…

    Submitted 1 January, 2024; originally announced January 2024.

    Comments: Submitted to CVPR 2024

  4. arXiv:2401.00719  [pdf, other]

    cs.CV cs.AI

    Depth Map Denoising Network and Lightweight Fusion Network for Enhanced 3D Face Recognition

    Authors: Ruizhuo Xu, Ke Wang, Chao Deng, Mei Wang, Xi Chen, Wenhui Huang, Junlan Feng, Weihong Deng

    Abstract: With the increasing availability of consumer depth sensors, 3D face recognition (FR) has attracted more and more attention. However, the data acquired by these sensors are often coarse and noisy, making them impractical to use directly. In this paper, we introduce an innovative Depth map denoising network (DMDNet) based on the Denoising Implicit Image Function (DIIF) to reduce noise and enhance th…

    Submitted 1 January, 2024; originally announced January 2024.

    Comments: Accepted by Pattern Recognition

  5. arXiv:2401.00569  [pdf, other]

    math.OC

    Decision Making under Costly Sequential Information Acquisition: the Paradigm of Reversible and Irreversible Decisions

    Authors: Renyuan Xu, Thaleia Zariphopoulou, Luhao Zhang

    Abstract: Decision making in modern stochastic systems, including e-commerce platforms, financial markets, and healthcare systems, has evolved into a multifaceted process that involves information acquisition and adaptive information sources. This paper initiates a study on this integrated process, where these elements are not only fundamental but also interact in a complex and dynamically intertwined manne…

    Submitted 10 January, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

  6. arXiv:2401.00424  [pdf, other]

    cs.CL

    SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent Detection

    Authors: Shijue Huang, Libo Qin, Bingbing Wang, Geng Tu, Ruifeng Xu

    Abstract: Multi-modal intent detection aims to utilize various modalities to understand the user's intentions, which is essential for the deployment of dialogue systems in real-world scenarios. The two core challenges for multi-modal intent detection are (1) how to effectively align and fuse different features of modalities and (2) the limited labeled multi-modal intent training data. In this work, we intro…

    Submitted 31 December, 2023; originally announced January 2024.

    Comments: Accepted by ICASSP 2024

  7. arXiv:2312.17694  [pdf, other]

    quant-ph cond-mat.mes-hall

    Mapping of valley-splitting by conveyor-mode spin-coherent electron shuttling

    Authors: Mats Volmer, Tom Struck, Arnau Sala, Bingjie Chen, Max Oberländer, Tobias Offermann, Ran Xue, Lino Visser, Jhih-Sian Tu, Stefan Trellenkamp, Łukasz Cywiński, Hendrik Bluhm, Lars R. Schreiber

    Abstract: In Si/SiGe heterostructures, the low-lying excited valley state seriously limit operability and scalability of electron spin qubits. For characterizing and understanding the local variations in valley splitting, fast probing methods with high spatial and energy resolution are lacking. Leveraging the spatial control granted by conveyor-mode spin-coherent electron shuttling, we introduce a method fo…

    Submitted 29 December, 2023; originally announced December 2023.

    Comments: 17 pages, 11 Figures

  8. arXiv:2312.16170  [pdf, other]

    cs.CV cs.AI cs.RO

    EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI

    Authors: Tai Wang, Xiaohan Mao, Chenming Zhu, Runsen Xu, Ruiyuan Lyu, Peisen Li, Xiao Chen, Wenwei Zhang, Kai Chen, Tianfan Xue, Xihui Liu, Cewu Lu, Dahua Lin, Jiangmiao Pang

    Abstract: In the realm of computer vision and robotics, embodied agents are expected to explore their environment and carry out human instructions. This necessitates the ability to fully understand 3D scenes given their first-person observations and contextualize them into language for interaction. However, traditional research focuses more on scene-level input and output setups from a global view. To addre…

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: A multi-modal, ego-centric 3D perception dataset and benchmark for holistic 3D scene understanding. Project page: http://tai-wang.github.io/embodiedscan

  9. arXiv:2312.15918  [pdf, other]

    cs.CL cs.AI

    Supervised Knowledge Makes Large Language Models Better In-context Learners

    Authors: Linyi Yang, Shuibai Zhang, Zhuohao Yu, Guangsheng Bao, Yidong Wang, Jindong Wang, Ruochen Xu, Wei Ye, Xing Xie, Weizhu Chen, Yue Zhang

    Abstract: Large Language Models (LLMs) exhibit emerging in-context learning abilities through prompt engineering. The recent progress in large-scale generative models has further expanded their use in real-world language applications. However, the critical challenge of improving the generalizability and factuality of LLMs in natural language understanding and question answering remains under-explored. While…

    Submitted 11 April, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: Accepted to ICLR 2024

  10. arXiv:2312.13618  [pdf, other]

    quant-ph physics.chem-ph

    Generalized system-bath entanglement theorem for Gaussian environments

    Authors: Yu Su, Yao Wang, Rui-Xue Xu, YiJing Yan

    Abstract: A system-bath entanglement theorem (SBET) with Gaussian environments was established previously in J. Chem. Phys. 152, 034102 (2020) in terms of linear response functions. This theorem connects the system-bath entanglement responses to the local system and bare bath ones. In this work, we generalize it to correlation functions. Key steps in derivation are the generalized Langevin dynamics for the…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 9 pages, 3 figures

  11. arXiv:2312.12754  [pdf, other]

    cs.CV cs.CL

    Spectral Prompt Tuning:Unveiling Unseen Classes for Zero-Shot Semantic Segmentation

    Authors: Wenhao Xu, Rongtao Xu, Changwei Wang, Shibiao Xu, Li Guo, Man Zhang, Xiaopeng Zhang

    Abstract: Recently, CLIP has found practical utility in the domain of pixel-level zero-shot segmentation tasks. The present landscape features two-stage methodologies beset by issues such as intricate pipelines and elevated computational costs. While current one-stage approaches alleviate these concerns and incorporate Visual Prompt Training (VPT) to uphold CLIP's generalization capacity, they still fall sh…

    Submitted 2 June, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: AAAI2024 Accepted

  12. arXiv:2312.12480  [pdf, other]

    cs.CV

    Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation

    Authors: Jiaming Liu, Ran Xu, Senqiao Yang, Renrui Zhang, Qizhe Zhang, Zehui Chen, Yandong Guo, Shanghang Zhang

    Abstract: Continual Test-Time Adaptation (CTTA) is proposed to migrate a source pre-trained model to continually changing target distributions, addressing real-world dynamism. Existing CTTA methods mainly rely on entropy minimization or teacher-student pseudo-labeling schemes for knowledge extraction in unlabeled target domains. However, dynamic data distributions cause miscalibrated predictions and noisy p…

    Submitted 27 March, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted by CVPR2024

  13. arXiv:2312.11984  [pdf, other]

    astro-ph.HE

    PSR B0943+10: Mode Switch, Polar Cap Geometry, and Orthogonally Polarized Radiation

    Authors: Shunshun Cao, Jinchen Jiang, Jaroslaw Dyks, Longfei Hao, Kejia Lee, Zhixuan Li, Jiguang Lu, Zhichen Pan, Weiyang Wang, Zhengli Wang, Jiangwei Xu, Heng Xu, Renxin Xu

    Abstract: As one of the paradigm examples to probe into pulsar magnetospheric dynamics, PSR B0943+10 (J0946+0951) manifests representatively, showing mode switch, orthogonal polarization and subpulse drifting. Both integrated and single pulses are studied with the Five-hundred-meter Aperture Spherical radio Telescope (FAST). The mode switch phenomenon of this pulsar is studied using an eigen-mode searching…

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 27 pages, 28 figures, 2 tables, submitted to ApJ

  14. arXiv:2312.09085  [pdf, other]

    cs.CL cs.AI cs.CR cs.CY

    The Earth is Flat because...: Investigating LLMs' Belief towards Misinformation via Persuasive Conversation

    Authors: Rongwu Xu, Brian S. Lin, Shujian Yang, Tianqi Zhang, Weiyan Shi, Tianwei Zhang, Zhixuan Fang, Wei Xu, Han Qiu

    Abstract: Large language models (LLMs) encapsulate vast amounts of knowledge but still remain vulnerable to external misinformation. Existing research mainly studied this susceptibility behavior in a single-turn setting. However, belief can change during a multi-turn conversation, especially a persuasive one. Therefore, in this study, we delve into LLMs' susceptibility to persuasive conversations, particula…

    Submitted 31 May, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted to ACL'24 (Main). Camera-ready version

  15. arXiv:2312.08935  [pdf, other]

    cs.AI cs.CL cs.LG

    Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations

    Authors: Peiyi Wang, Lei Li, Zhihong Shao, R. X. Xu, Damai Dai, Yifei Li, Deli Chen, Y. Wu, Zhifang Sui

    Abstract: In this paper, we present an innovative process-oriented math process reward model called Math-Shepherd, which assigns a reward score to each step of math problem solutions. The training of Math-Shepherd is achieved using automatically constructed process-wise supervision data, breaking the bottleneck of heavy reliance on manual annotation in existing work. We explore the effectiveness of…

    Submitted 19 February, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Add Step-by-Step reinforcement learning results

  16. arXiv:2312.06722  [pdf, other]

    cs.CV cs.CL cs.RO

    EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning

    Authors: Yi Chen, Yuying Ge, Yixiao Ge, Mingyu Ding, Bohao Li, Rui Wang, Ruifeng Xu, Ying Shan, Xihui Liu

    Abstract: The pursuit of artificial general intelligence (AGI) has been accelerated by Multimodal Large Language Models (MLLMs), which exhibit superior reasoning, generalization capabilities, and proficiency in processing multimodal inputs. A crucial milestone in the evolution of AGI is the attainment of human-level planning, a fundamental ability for making informed decisions in complex environments, and s…

    Submitted 11 June, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

    Comments: Project released at: https://github.com/ChenYi99/EgoPlan

  17. Repeating FRBs reveal the secret of pulsar magnetospheric activity

    Authors: Renxin Xu, Weiyang Wang

    Abstract: The puzzling mechanism of coherent radio emission remains unknown, but fortunately, repeating fast radio bursts (FRBs) provide a precious opportunity, with extremely bright subpulses created in a clear and vacuum-like pulsar magnetosphere. FRBs are millisecond-duration signals that are highly dispersed at distant galaxies but with uncertain physical origin(s). Coherent curvature radiation by bunch…

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: 7 pages, 2 figures, published in AN

  18. arXiv:2312.04737  [pdf, other]

    cs.LG cs.AI cs.CL

    Efficient Large Language Models Fine-Tuning On Graphs

    Authors: Rui Xue, Xipeng Shen, Ruozhou Yu, Xiaorui Liu

    Abstract: Learning from Text-Attributed Graphs (TAGs) has attracted significant attention due to its wide range of real-world applications. The rapid evolution of large language models (LLMs) has revolutionized the way we process textual data, which indicates a strong potential to replace shallow text embedding generally used in Graph Neural Networks (GNNs). However, we find that existing LLM approaches tha…

    Submitted 7 December, 2023; originally announced December 2023.

  19. arXiv:2312.04418  [pdf, other]

    cs.NI eess.SY

    MIST: An Efficient Approach for Software-Defined Multicast in Wireless Mesh Networks

    Authors: Rupei Xu, Yuming Jiang, Jason P. Jue

    Abstract: Multicasting is a vital information dissemination technique in Software-Defined Networking (SDN). With SDN, a multicast service can incorporate network functions implemented at different nodes, which is referred to as software-defined multicast. Emerging ubiquitous wireless networks for 5G and Beyond (B5G) inherently support multicast. However, the broadcast nature of wireless channels, especially…

    Submitted 7 December, 2023; originally announced December 2023.

  20. arXiv:2312.03814  [pdf, other]

    cs.LG cs.AI

    Pearl: A Production-ready Reinforcement Learning Agent

    Authors: Zheqing Zhu, Rodrigo de Salvo Braz, Jalaj Bhandari, Daniel Jiang, Yi Wan, Yonathan Efroni, Liyuan Wang, Ruiyang Xu, Hongbo Guo, Alex Nikulkov, Dmytro Korenkevych, Urun Dogan, Frank Cheng, Zheng Wu, Wanqiao Xu

    Abstract: Reinforcement Learning (RL) offers a versatile framework for achieving long-term goals. Its generality allows us to formalize a wide range of problems that real-world intelligent systems encounter, such as dealing with delayed rewards, handling partial observability, addressing the exploration and exploitation dilemma, utilizing offline data to improve online performance, and ensuring safety const…

    Submitted 6 December, 2023; originally announced December 2023.

  21. arXiv:2312.03200  [pdf, other]

    math.DS math.CA

    Dynamics of a $2$-dimensional slow-fast Belousov-Zabotinsky model

    Authors: Ruihan Xu, Ming Sun, Xiang Zhang

    Abstract: For the reduced two-dimensional Belousov-Zhabotinsky slow-fast differential system, the known results are the existence of one limit cycle and its stability for particular values of the parameters. Here, we characterize all dynamics of this system except one degenerate case. The results include global stability of the positive equilibrium, supercritical and subcritical Hopf bifurcations, the exist…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 16 pages, 3 figures, 2 tables

    MSC Class: 37N25; 34D23; 37C75; 34C26; 34C60

  22. Probing the vector charge of Sagittarius A* with pulsar timing

    Authors: Zexin Hu, Lijing Shao, Rui Xu, Dicong Liang, Zhan-Feng Mai

    Abstract: Timing a pulsar orbiting around Sagittarius A* (Sgr A*) can provide us with a unique opportunity of testing gravity theories. We investigate the detectability of a vector charge carried by the Sgr A* black hole (BH) in the bumblebee gravity model with simulated future pulsar timing observations. The spacetime of a bumblebee BH introduces characteristic changes to the orbital dynamics of the pulsar…

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 18 pages, 6 figures

    Journal ref: JCAP 04 (2024) 087

  23. arXiv:2312.01406  [pdf, other]

    gr-qc

    Can a star be smaller than a black hole of the same mass?

    Authors: Shoulong Li, H. Lü, Yong Gao, Rui Xu, Lijing Shao, Hongwei Yu

    Abstract: It is commonly believed that black holes are the smallest self-gravitating objects of the same mass in the Universe. Here, we demonstrate, in a subclass of higher-order pure gravities known as quasi-topological gravity, that by modifying general relativity (GR) to reduce the strength of gravity in strong-field regimes while keeping GR unchanged in weak-field regimes, it is possible for stars to co…

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: 15 pages, 3 figures, submitted in August

  24. arXiv:2311.18799  [pdf, other]

    cs.CV cs.CL

    X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning

    Authors: Artemis Panagopoulou, Le Xue, Ning Yu, Junnan Li, Dongxu Li, Shafiq Joty, Ran Xu, Silvio Savarese, Caiming Xiong, Juan Carlos Niebles

    Abstract: Vision-language pre-training and instruction tuning have demonstrated general-purpose capabilities in 2D visual reasoning tasks by aligning visual encoders with state-of-the-art large language models (LLMs). In this paper, we introduce a simple, yet effective, cross-modality framework built atop frozen LLMs that allows the integration of various modalities without extensive modality-specific custo…

    Submitted 30 November, 2023; originally announced November 2023.

  25. arXiv:2311.18148  [pdf]

    physics.optics

    A universal optical modulator for synthetic topologically tuneable structured matter

    Authors: Chao He, Binguo Chen, Zipei Song, Zimo Zhao, Yifei Ma, Honghui He, Lin Luo, Tade Marozsak, An Wang, Rui Xu, Peixiang Huang, Xuke Qiu, Bangshan Sun, Jiahe Cui, Yuxi Cai, Yun Zhang, Patrick Salter, Julian AJ Fells, Ben Dai, Shaoxiong Liu, Limei Guo, Hui Ma, Steve J Elston, Qiwen Zhan, Chengwei Qiu, et al. (3 additional authors not shown)

    Abstract: Topologically structured matter, such as metasurfaces and metamaterials, have given rise to impressive photonic functionality, fuelling diverse applications from microscopy and holography to encryption and communication. Presently these solutions are limited by their largely static nature and preset functionality, hindering applications that demand dynamic photonic systems with reconfigurable topo…

    Submitted 29 November, 2023; originally announced November 2023.

  26. arXiv:2311.16916  [pdf, other]

    cs.RO

    Stein Variational Belief Propagation for Multi-Robot Coordination

    Authors: Jana Pavlasek, Joshua Jing Zhi Mah, Ruihan Xu, Odest Chadwicke Jenkins, Fabio Ramos

    Abstract: Decentralized coordination for multi-robot systems involves planning in challenging, high-dimensional spaces. The planning problem is particularly challenging in the presence of obstacles and different sources of uncertainty such as inaccurate dynamic models and sensor noise. In this paper, we introduce Stein Variational Belief Propagation (SVBP), a novel algorithm for performing inference over no…

    Submitted 12 March, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: 8 pages, accepted for publication in Robotics and Automation Letters (RA-L); experiment updated, background methodology added

  27. arXiv:2311.14265  [pdf, other]

    cs.CV

    Adaptive Calibration: A Unified Conversion Framework of Spiking Neural Networks

    Authors: Ziqing Wang, Yuetong Fang, Jiahang Cao, Renjing Xu

    Abstract: Spiking Neural Networks (SNNs) have emerged as a promising energy-efficient alternative to traditional Artificial Neural Networks (ANNs). Despite this, bridging the performance gap with ANNs in practical scenarios remains a significant challenge. This paper focuses on addressing the dual objectives of enhancing the performance and efficiency of SNNs through the established SNN Calibration conversi…

    Submitted 16 March, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: Under review

  28. arXiv:2311.14168  [pdf, other]

    math.OC cs.LG

    Fast Policy Learning for Linear Quadratic Control with Entropy Regularization

    Authors: Xin Guo, Xinyu Li, Renyuan Xu

    Abstract: This paper proposes and analyzes two new policy learning methods: regularized policy gradient (RPG) and iterative policy optimization (IPO), for a class of discounted linear-quadratic control (LQC) problems over an infinite time horizon with entropy regularization. Assuming access to the exact policy evaluation, both proposed approaches are proven to converge linearly in finding optimal policies o…

    Submitted 11 December, 2023; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: 33 pages, 3 figures

  29. arXiv:2311.14097  [pdf, other]

    cs.CV

    ACT-Diffusion: Efficient Adversarial Consistency Training for One-step Diffusion Models

    Authors: Fei Kong, Jinhao Duan, Lichao Sun, Hao Cheng, Renjing Xu, Hengtao Shen, Xiaofeng Zhu, Xiaoshuang Shi, Kaidi Xu

    Abstract: Though diffusion models excel in image generation, their step-by-step denoising leads to slow generation speeds. Consistency training addresses this issue with single-step sampling but often produces lower-quality generations and requires high training costs. In this paper, we show that optimizing consistency training loss minimizes the Wasserstein distance between target and generated distributio…

    Submitted 28 March, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: To appear in CVPR 2024

  30. arXiv:2311.13589  [pdf, ps, other]

    cs.LG math.OC

    Risk-sensitive Markov Decision Process and Learning under General Utility Functions

    Authors: Zhengqi Wu, Renyuan Xu

    Abstract: Reinforcement Learning (RL) has gained substantial attention across diverse application domains and theoretical investigations. Existing literature on RL theory largely focuses on risk-neutral settings where the decision-maker learns to maximize the expected cumulative reward. However, in practical scenarios such as portfolio management and e-commerce recommendations, decision-makers often persist…

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: 36 pages

  31. Narrow spectra of repeating fast radio bursts: A magnetospheric origin

    Authors: Wei-Yang Wang, Yuan-Pei Yang, Hong-Bo Li, Jifeng Liu, Renxin Xu

    Abstract: Fast radio bursts (FRBs) can present a variety of polarization properties, and some of them have narrow spectra. We study spectral properties from perspectives of intrinsic radiation mechanisms and absorption during the waves propagating in the magnetosphere. The intrinsic radiation mechanisms are considered by invoking quasi-periodic bunch distribution and perturbations on charged bunches moving…

    Submitted 22 February, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: 19 pages, 11 figures, A&A accepted

    Journal ref: A&A 685, A87 (2024)

  32. arXiv:2311.12060  [pdf, other]

    cs.NE

    Pursing the Sparse Limitation of Spiking Deep Learning Structures

    Authors: Hao Cheng, Jiahang Cao, Erjia Xiao, Mengshu Sun, Le Yang, Jize Zhang, Xue Lin, Bhavya Kailkhura, Kaidi Xu, Renjing Xu

    Abstract: Spiking Neural Networks (SNNs), a novel brain-inspired algorithm, are garnering increased attention for their superior computation and energy efficiency over traditional artificial neural networks (ANNs). To facilitate deployment on memory-constrained devices, numerous studies have explored SNN pruning. However, these efforts are hindered by challenges such as scalability challenges in more comple…

    Submitted 18 November, 2023; originally announced November 2023.

  33. arXiv:2311.10700  [pdf, ps, other]

    cs.MS

    Deriving Algorithms for Triangular Tridiagonalization a (Skew-)Symmetric Matrix

    Authors: Robert van de Geijn, Maggie Myers, RuQing G. Xu, Devin Matthews

    Abstract: We apply the FLAME methodology to derive algorithms hand in hand with their proofs of correctness for the computation of the $ L T L^T $ decomposition (with and without pivoting) of a skew-symmetric matrix. The approach yields known as well as new algorithms, presented using the FLAME notation. A number of BLAS-like primitives are exposed at the core of blocked algorithms that can attain high perf…

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 28 pages

  34. arXiv:2311.08608  [pdf, other]

    cs.RO

    Multi-Radar Inertial Odometry for 3D State Estimation using mmWave Imaging Radar

    Authors: Jui-Te Huang, Ruoyang Xu, Akshay Hinduja, Michael Kaess

    Abstract: State estimation is a crucial component for the successful implementation of robotic systems, relying on sensors such as cameras, LiDAR, and IMUs. However, in real-world scenarios, the performance of these sensors is degraded by challenging environments, e.g. adverse weather conditions and low-light scenarios. The emerging 4D imaging radar technology is capable of providing robust perception in ad…

    Submitted 14 March, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: Accepted to ICRA 2024

  35. arXiv:2311.08425  [pdf]

    cs.SD eess.AS math.NA physics.ao-ph physics.app-ph

    Research and experimental verification on low-frequency long-range underwater sound propagation dispersion characteristics under dual-channel sound speed profiles in the Chukchi Plateau

    Authors: Jinbao Weng, Yubo Qi, Yanming Yang, Hongtao Wen, Hongtao Zhou, Ruichao Xue

    Abstract: The dual-channel sound speed profiles of the Chukchi Plateau and the Canadian Basin have become current research hotspots due to their excellent low-frequency sound signal propagation ability. Previous research has mainly focused on using sound propagation theory to explain the changes in sound signal energy. This article is mainly based on the theory of normal modes to study the fine structure of…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 30 pages, 18 figures

  36. arXiv:2311.08011  [pdf, other]

    cs.CL

    Forgetting before Learning: Utilizing Parametric Arithmetic for Knowledge Updating in Large Language Models

    Authors: Shiwen Ni, Dingwei Chen, Chengming Li, Xiping Hu, Ruifeng Xu, Min Yang

    Abstract: Recent advancements in Large Language Models (LLMs) have showcased their remarkable capabilities in text understanding and generation. However, even stronger LLMs are susceptible to acquiring erroneous or obsolete information from the training corpus. Direct secondary fine-tuning with data containing new knowledge may be ineffective in updating knowledge due to the conflict between old and new kno…

    Submitted 16 February, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

  37. arXiv:2311.07752  [pdf, other]

    stat.ME

    Doubly Robust Estimation under Possibly Misspecified Marginal Structural Cox Model

    Authors: Jiyu Luo, Denise Rava, Jelena Bradic, Ronghui Xu

    Abstract: In this paper we address the challenges posed by non-proportional hazards and informative censoring, offering a path toward more meaningful causal inference conclusions. We start from the marginal structural Cox model, which has been widely used for analyzing observational studies with survival outcomes, and typically relies on the inverse probability weighting method. The latter hinges upon a pro…

    Submitted 13 November, 2023; originally announced November 2023.

  38. arXiv:2311.07175  [pdf]

    cs.SD math.NA physics.ao-ph physics.app-ph

    Research and experimental verification on low-frequency long-range sound propagation characteristics under ice-covered and range-dependent marine environment in the Arctic

    Authors: Jinbao Weng, Yubo Qi, Yanming Yang, Hongtao Wen, Hongtao Zhou, Ruichao Xue

    Abstract: At present, research on sound propagation under the Arctic ice mainly focuses on modeling and experimental verification of sound propagation under sea ice cover and unique sound velocity profiles. Among them, the main research object of concern is sound transmission loss, and this article will delve into the time-domain waveform and fine dispersion structure of low-frequency broadband acoustic sig…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 46 pages, 35 figures

  39. arXiv:2311.06761  [pdf, other]

    cs.CL

    Learning Knowledge-Enhanced Contextual Language Representations for Domain Natural Language Understanding

    Authors: Ruyao Xu, Taolin Zhang, Chengyu Wang, Zhongjie Duan, Cen Chen, Minghui Qiu, Dawei Cheng, Xiaofeng He, Weining Qian

    Abstract: Knowledge-Enhanced Pre-trained Language Models (KEPLMs) improve the performance of various downstream NLP tasks by injecting knowledge facts from large-scale Knowledge Graphs (KGs). However, existing methods for pre-training KEPLMs with relational triples are difficult to be adapted to close domains due to the lack of sufficient domain graph semantics. In this paper, we propose a Knowledge-enhance…

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: emnlp 2023

  40. arXiv:2311.06158  [pdf, other]

    cs.CL cs.AI

    Language Models can be Logical Solvers

    Authors: Jiazhan Feng, Ruochen Xu, Junheng Hao, Hiteshi Sharma, Yelong Shen, Dongyan Zhao, Weizhu Chen

    Abstract: Logical reasoning is a fundamental aspect of human intelligence and a key component of tasks like problem-solving and decision-making. Recent advancements have enabled Large Language Models (LLMs) to potentially exhibit reasoning capabilities, but complex logical reasoning remains a challenge. The state-of-the-art, solver-augmented language models, use LLMs to parse natural language logical questi…

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: Preprint

  41. arXiv:2311.05890  [pdf, ps, other]

    cs.CC

    Improved bounds on the Product Rank of the Permanent

    Authors: Rongyu Xu, Edinah Gnang

    Abstract: We unify Ryser's and Glynn's formulas for computing the permanent into a single framework. We then show via an orbital bound argument that the product rank of the permanent is asymptotically upper bounded by $ \frac{\exp\left(\pi\sqrt{\frac{2n}{3}}\right)}{4\sqrt{3}n} $.

    Submitted 10 November, 2023; originally announced November 2023.

  42. arXiv:2311.05298  [pdf, other]

    cs.CV

    Improving Vision-and-Language Reasoning via Spatial Relations Modeling

    Authors: Cheng Yang, Rui Xu, Ye Guo, Peixiang Huang, Yiru Chen, Wenkui Ding, Zhongyuan Wang, Hong Zhou

    Abstract: Visual commonsense reasoning (VCR) is a challenging multi-modal task, which requires high-level cognition and commonsense reasoning ability about the real world. In recent years, large-scale pre-training approaches have been developed and promoted the state-of-the-art performance of VCR. However, the existing approaches almost employ the BERT-like objectives to learn multi-modal representations. T…

    Submitted 9 November, 2023; originally announced November 2023.

  43. arXiv:2311.05143  [pdf, other]

    cs.CV

    SCAAT: Improving Neural Network Interpretability via Saliency Constrained Adaptive Adversarial Training

    Authors: Rui Xu, Wenkang Qin, Peixiang Huang, Hao Wang, Lin Luo

    Abstract: Deep Neural Networks (DNNs) are expected to provide explanation for users to understand their black-box predictions. Saliency map is a common form of explanation illustrating the heatmap of feature attributions, but it suffers from noise in distinguishing important features. In this paper, we propose a model-agnostic learning method called Saliency Constrained Adaptive Adversarial Training (SCAAT)…

    Submitted 10 November, 2023; v1 submitted 8 November, 2023; originally announced November 2023.

  44. arXiv:2311.03774  [pdf, other]

    cs.CV

    Meta-Adapter: An Online Few-shot Learner for Vision-Language Model

    Authors: Cheng Cheng, Lin Song, Ruoyi Xue, Hang Wang, Hongbin Sun, Yixiao Ge, Ying Shan

    Abstract: The contrastive vision-language pre-training, known as CLIP, demonstrates remarkable potential in perceiving open-world visual concepts, enabling effective zero-shot image recognition. Nevertheless, few-shot learning methods based on CLIP typically require offline fine-tuning of the parameters on few-shot samples, resulting in longer inference time and the risk of over-fitting in certain domains.…

    Submitted 11 January, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: Accepted by NeurIPS 2023

  45. arXiv:2311.02893  [pdf]

    cond-mat.mtrl-sci

    Topological electronic structure and spin texture of quasi-one-dimensional higher-order topological insulator Bi4Br4

    Authors: W. X. Zhao, M. Yang, R. Z. Xu, X. Du, Y. D. Li, K. Y. Zhai, C. Peng, D. Pei, H. Gao, Y. W. Li, L. X. Xu, J. F. Han, Y. Huang, Z. K. Liu, Y. G. Yao, J. C. Zhuang, Y. Du, J. J. Zhou, Y. L. Chen, L. X. Yang

    Abstract: The notion of topological insulators (TIs), characterized by an insulating bulk and conducting topological surface states, can be extended to higher-order topological insulators (HOTIs) hosting gapless modes localized at the boundaries of two or more dimensions lower than the insulating bulk1-5. In this work, by performing high-resolution angle-resolved photoemission spectroscopy (ARPES) measureme…

    Submitted 6 November, 2023; originally announced November 2023.

  46. arXiv:2311.00530  [pdf, other]

    cs.AI

    Advances in Embodied Navigation Using Large Language Models: A Survey

    Authors: Jinzhou Lin, Han Gao, Xuxiang Feng, Rongtao Xu, Changwei Wang, Man Zhang, Li Guo, Shibiao Xu

    Abstract: In recent years, the rapid advancement of Large Language Models (LLMs) such as the Generative Pre-trained Transformer (GPT) has attracted increasing attention due to their potential in a variety of practical applications. The application of LLMs with Embodied Intelligence has emerged as a significant area of focus. Among the myriad applications of LLMs, navigation tasks are particularly noteworthy…

    Submitted 7 June, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

  47. arXiv:2311.00287  [pdf, other]

    cs.CL cs.AI cs.LG q-bio.QM

    Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models

    Authors: Ran Xu, Hejie Cui, Yue Yu, Xuan Kan, Wenqi Shi, Yuchen Zhuang, Wei Jin, Joyce Ho, Carl Yang

    Abstract: Clinical natural language processing requires methods that can address domain-specific challenges, such as complex medical terminology and clinical contexts. Recently, large language models (LLMs) have shown promise in this domain. Yet, their direct deployment can lead to privacy issues and are constrained by resources. To address this challenge, we delve into synthetic clinical text generation us…

    Submitted 1 November, 2023; originally announced November 2023.

  48. arXiv:2310.20607  [pdf, other]

    cs.CV cs.AI

    What a Whole Slide Image Can Tell? Subtype-guided Masked Transformer for Pathological Image Captioning

    Authors: Wenkang Qin, Rui Xu, Peixiang Huang, Xiaomin Wu, Heyu Zhang, Lin Luo

    Abstract: Pathological captioning of Whole Slide Images (WSIs), though is essential in computer-aided pathological diagnosis, has rarely been studied due to the limitations in datasets and model training efficacy. In this paper, we propose a new paradigm Subtype-guided Masked Transformer (SGMT) for pathological captioning based on Transformers, which treats a WSI as a sequence of sparse patches and generate…

    Submitted 31 October, 2023; originally announced October 2023.

  49. arXiv:2310.20427  [pdf, other]

    eess.IV cs.CV cs.LG

    Assessing and Enhancing Robustness of Deep Learning Models with Corruption Emulation in Digital Pathology

    Authors: Peixiang Huang, Songtao Zhang, Yulu Gan, Rui Xu, Rongqi Zhu, Wenkang Qin, Limei Guo, Shan Jiang, Lin Luo

    Abstract: Deep learning in digital pathology brings intelligence and automation as substantial enhancements to pathological analysis, the gold standard of clinical diagnosis. However, multiple steps from tissue preparation to slide imaging introduce various image corruptions, making it difficult for deep neural network (DNN) models to achieve stable diagnostic results for clinical use. In order to assess an…

    Submitted 31 October, 2023; originally announced October 2023.

  50. arXiv:2310.18804  [pdf, other]

    cs.CL cs.AI cs.CV cs.LG

    Open Visual Knowledge Extraction via Relation-Oriented Multimodality Model Prompting

    Authors: Hejie Cui, Xinyu Fang, Zihan Zhang, Ran Xu, Xuan Kan, Xin Liu, Yue Yu, Manling Li, Yangqiu Song, Carl Yang

    Abstract: Images contain rich relational knowledge that can help machines understand the world. Existing methods on visual knowledge extraction often rely on the pre-defined format (e.g., sub-verb-obj tuples) or vocabulary (e.g., relation types), restricting the expressiveness of the extracted knowledge. In this work, we take a first exploration to a new paradigm of open visual knowledge extraction. To achi…

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023