Skip to main content

Showing 201–250 of 11,892 results for author: Li, X

.
  1. arXiv:2406.04100  [pdf, other

    cs.CV cs.RO

    Class-Aware Cartilage Segmentation for Autonomous US-CT Registration in Robotic Intercostal Ultrasound Imaging

    Authors: Zhongliang Jiang, Yunfeng Kang, Yuan Bi, Xuesong Li, Chenyang Li, Nassir Navab

    Abstract: Ultrasound imaging has been widely used in clinical examinations owing to the advantages of being portable, real-time, and radiation-free. Considering the potential of extensive deployment of autonomous examination systems in hospitals, robotic US imaging has attracted increased attention. However, due to the inter-patient variations, it is still challenging to have an optimal path for each patien… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2406.03902  [pdf, other

    eess.IV cs.CV

    C^2RV: Cross-Regional and Cross-View Learning for Sparse-View CBCT Reconstruction

    Authors: Yiqun Lin, Jiewen Yang, Hualiang Wang, Xinpeng Ding, Wei Zhao, Xiaomeng Li

    Abstract: Cone beam computed tomography (CBCT) is an important imaging technology widely used in medical scenarios, such as diagnosis and preoperative planning. Using fewer projection views to reconstruct CT, also known as sparse-view reconstruction, can reduce ionizing radiation and further benefit interventional radiology. Compared with sparse-view reconstruction for traditional parallel/fan-beam CT, CBCT… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted to CVPR 2024

  3. arXiv:2406.03867  [pdf, other

    quant-ph cs.ET

    A Comprehensive Study of Quantum Arithmetic Circuits

    Authors: Siyi Wang, Xiufan Li, Wei Jie Bryan Lee, Suman Deb, Eugene Lim, Anupam Chattopadhyay

    Abstract: In recent decades, the field of quantum computing has experienced remarkable progress. This progress is marked by the superior performance of many quantum algorithms compared to their classical counterparts, with Shor's algorithm serving as a prominent illustration. Quantum arithmetic circuits, which are the fundamental building blocks in numerous quantum algorithms, have attracted much attention.… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Under review at the Royal Society's Philosophical Transactions A

  4. arXiv:2406.03809  [pdf, other

    astro-ph.HE astro-ph.CO

    Time delay of fast radio burst population with respect to the star formation history

    Authors: Hai-Nan Lin, Xin-Yi Li, Rui Zou

    Abstract: In spite of significant progress in the research of fast radio bursts (FRBs) in recent decade, their origin is still under extensive debate. Investigation on the population of FRBs can provide new insight into this interesting problem. In this paper, based on the first CHIME/FRB catalog, we construct a Bayesian framework to analyze the FRB population, with the selection effect of the CHIME telesco… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 12 pages, 2 figures, 2 tables. Accepted by Astrophysical Journal

  5. arXiv:2406.03763  [pdf, other

    cs.SI physics.soc-ph

    The impact of nodes of information dissemination on epidemic spreading in dynamic multiplex networks

    Authors: Minyu Feng, Xiangxi Li, Yuhan Li, Qin Li

    Abstract: Epidemic spreading processes on dynamic multiplex networks provide a more accurate description of natural spreading processes than those on single layered networks. To describe the influence of different individuals in the awareness layer on epidemic spreading, we propose a two-layer network-based epidemic spreading model, including some individuals who neglect the epidemic, and we explore how ind… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 11 pages, 10 figures

  6. arXiv:2406.03420  [pdf, other

    math.DS

    Dynamic properties of a class of van der Pol-Duffing oscillators

    Authors: Yelei Kuang, Xuemei Li

    Abstract: In this paper, we study the existence of bifurcation of a van der Pol-Duffing oscillator with quintic terms and its quasi-periodic solutions by means of qualitative and bifurcation theories. Firstly, we analyze the autonomous system and find that it has two kinds of local bifurcations and a global bifurcation: pitchfork bifurcation, Hopf bifurcation, homoclinic bifurcation. It is worth noting that… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  7. arXiv:2406.03409  [pdf, other

    cs.LG cs.AI

    Robust Knowledge Distillation Based on Feature Variance Against Backdoored Teacher Model

    Authors: **yin Chen, Xiaoming Zhao, Haibin Zheng, Xiao Li, Sheng Xiang, Haifeng Guo

    Abstract: Benefiting from well-trained deep neural networks (DNNs), model compression have captured special attention for computing resource limited equipment, especially edge devices. Knowledge distillation (KD) is one of the widely used compression techniques for edge deployment, by obtaining a lightweight student model from a well-trained teacher model released on public platforms. However, it has been e… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  8. arXiv:2406.03394  [pdf, other

    cs.CV

    Gaussian Representation for Deformable Image Registration

    Authors: Jihe Li, Fabian Zhang, Xia Li, Tianhao Zhang, Ye Zhang, Joachim Buhmann

    Abstract: Deformable image registration (DIR) is a fundamental task in radiotherapy, with existing methods often struggling to balance computational efficiency, registration accuracy, and speed effectively. We introduce a novel DIR approach employing parametric 3D Gaussian control points achieving a better tradeoff. It provides an explicit and flexible representation for spatial deformation fields between 3… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  9. arXiv:2406.03228  [pdf, other

    eess.AS

    Reference Channel Selection by Multi-Channel Masking for End-to-End Multi-Channel Speech Enhancement

    Authors: Wang Dai, Xiaofei Li, Archontis Politis, Tuomas Virtanen

    Abstract: In end-to-end multi-channel speech enhancement, the traditional approach of designating one microphone signal as the reference for processing may not always yield optimal results. The limitation is particularly in scenarios with large distributed microphone arrays with varying speaker-to-microphone distances or compact, highly directional microphone arrays where speaker or microphone positions cha… ▽ More

    Submitted 11 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted by EUSIPCO 2024

  10. arXiv:2406.03081  [pdf, other

    quant-ph

    A Quantum Neural Network-Based Approach to Power Quality Disturbances Detection and Recognition

    Authors: Guo-Dong Li, Hai-Yan He, Yue Li, Xin-Hao Li, Hao Liu, Qing-Le Wang, Long Cheng

    Abstract: Power quality disturbances (PQDs) significantly impact the stability and reliability of power systems, necessitating accurate and efficient detection and recognition methods. While numerous classical algorithms for PQDs detection and recognition have been extensively studied and applied, related work in the quantum domain is still in its infancy. In this paper, an improved quantum neural networks… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  11. arXiv:2406.02939  [pdf, ps, other

    math.OC cs.DC cs.LG

    Achieving Near-Optimal Convergence for Distributed Minimax Optimization with Adaptive Stepsizes

    Authors: Yan Huang, Xiang Li, Yipeng Shen, Niao He, **ming Xu

    Abstract: In this paper, we show that applying adaptive methods directly to distributed minimax problems can result in non-convergence due to inconsistency in locally computed adaptive stepsizes. To address this challenge, we propose D-AdaST, a Distributed Adaptive minimax method with Stepsize Tracking. The key strategy is to employ an adaptive stepsize tracking protocol involving the transmission of two ex… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  12. arXiv:2406.02931  [pdf, other

    hep-ex

    Measurements of the branching fractions of the $P$-wave charmonium spin-singlet state $h_c(^1P_1) \to h^+ h^-π^0/η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

    Abstract: Based on $(2712.4\pm 14.3)\times10^{6}$ $ψ(3686)$ events, we investigate four hadronic decay modes of the $P$-wave charmonium spin-singlet state $h_c(^1P_1) \to h^+ h^- π^0/η$ ($h=π$ or $K$) via the process $ψ(3686) \to π^{0}h_c$ at BESIII. The $h_c \to π^+ π^- π^0$ decay is observed with a significance of 9.6$σ$ after taking into account systematic uncertainties. Evidences for… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 9 pages, 7 figures

  13. arXiv:2406.02833  [pdf, other

    cs.CV

    DenoDet: Attention as Deformable Multi-Subspace Feature Denoising for Target Detection in SAR Images

    Authors: Yimian Dai, Minrui Zou, Yuxuan Li, Xiang Li, Kang Ni, Jian Yang

    Abstract: Synthetic Aperture Radar (SAR) target detection has long been impeded by inherent speckle noise and the prevalence of diminutive, ambiguous targets. While deep neural networks have advanced SAR target detection, their intrinsic low-frequency bias and static post-training weights falter with coherent noise and preserving subtle details across heterogeneous terrains. Motivated by traditional SAR ima… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  14. arXiv:2406.02635  [pdf, other

    cs.LG cs.AI

    Evidentially Calibrated Source-Free Time-Series Domain Adaptation with Temporal Imputation

    Authors: Mohamed Ragab, Peiliang Gong, Emadeldeen Eldele, Wenyu Zhang, Min Wu, Chuan-Sheng Foo, Daoqiang Zhang, Xiaoli Li, Zhenghua Chen

    Abstract: Source-free domain adaptation (SFDA) aims to adapt a model pre-trained on a labeled source domain to an unlabeled target domain without access to source data, preserving the source domain's privacy. While SFDA is prevalent in computer vision, it remains largely unexplored in time series analysis. Existing SFDA methods, designed for visual data, struggle to capture the inherent temporal dynamics of… ▽ More

    Submitted 12 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  15. arXiv:2406.02430  [pdf, other

    eess.AS cs.SD

    Seed-TTS: A Family of High-Quality Versatile Speech Generation Models

    Authors: Philip Anastassiou, Jiawei Chen, Jitong Chen, Yuanzhe Chen, Zhuo Chen, Ziyi Chen, Jian Cong, Lelai Deng, Chuang Ding, Lu Gao, Mingqing Gong, Peisong Huang, Qingqing Huang, Zhiying Huang, Yuanyuan Huo, Dongya Jia, Chumin Li, Feiya Li, Hui Li, Jiaxin Li, Xiaoyang Li, Xingxing Li, Lin Liu, Shouda Liu, Sichao Liu , et al. (21 additional authors not shown)

    Abstract: We introduce Seed-TTS, a family of large-scale autoregressive text-to-speech (TTS) models capable of generating speech that is virtually indistinguishable from human speech. Seed-TTS serves as a foundation model for speech generation and excels in speech in-context learning, achieving performance in speaker similarity and naturalness that matches ground truth human speech in both objective and sub… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  16. arXiv:2406.02395  [pdf, other

    cs.LG cs.CV

    GrootVL: Tree Topology is All You Need in State Space Model

    Authors: Yicheng Xiao, Lin Song, Shaoli Huang, Jiangshan Wang, Siyu Song, Yixiao Ge, Xiu Li, Ying Shan

    Abstract: The state space models, employing recursively propagated features, demonstrate strong representation capabilities comparable to Transformer models and superior efficiency. However, constrained by the inherent geometric constraints of sequences, it still falls short in modeling long-range dependencies. To address this issue, we propose the GrootVL network, which first dynamically generates a tree t… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: The code is available at https://github.com/EasonXiao-888/GrootVL

  17. arXiv:2406.02273  [pdf, ps, other

    math.OC cs.LG

    A KL-based Analysis Framework with Applications to Non-Descent Optimization Methods

    Authors: Junwen Qiu, Bohao Ma, Xiao Li, Andre Milzarek

    Abstract: We propose a novel analysis framework for non-descent-type optimization methodologies in nonconvex scenarios based on the Kurdyka-Lojasiewicz property. Our framework allows covering a broad class of algorithms, including those commonly employed in stochastic and distributed optimization. Specifically, it enables the analysis of first-order methods that lack a sufficient descent property and do not… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 29 pages

    MSC Class: 90C06; 90C26; 90C30

  18. arXiv:2406.02092  [pdf, other

    cs.SD cs.AI cs.LG eess.AS eess.SP

    MaskSR: Masked Language Model for Full-band Speech Restoration

    Authors: Xu Li, Qirui Wang, Xiaoyu Liu

    Abstract: Speech restoration aims at restoring high quality speech in the presence of a diverse set of distortions. Although several deep learning paradigms have been studied for this task, the power of the recently emerging language models has not been fully explored. In this paper, we propose MaskSR, a masked language model capable of restoring full-band 44.1 kHz speech jointly considering noise, reverb,… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted by INTERSPEECH 2024. Demo page: https://masksr.github.io/MaskSR/

  19. arXiv:2406.01839  [pdf, other

    physics.ins-det hep-ex

    Simulation of DAMPE silicon microstrip detectors in the $\rm Allpix^{2}$ framework

    Authors: Yu-Xin Cui, Xiang Li, Shen Wang, Chuan Yue, Qiang Wan, Shi-Jun Lei, Guan-Wen Yuan, Yi-Ming Hu, Jia-Ju Wei, Jian-Hua Guo

    Abstract: Silicon strip detectors have been widely utilized in space experiments for gamma-ray and cosmic-ray detections thanks to their high spatial resolution and stable performance. For a silicon micro-strip detector, the Monte Carlo simulation is recognized as a practical and cost-effective approach to verify the detector performance. In this study, a technique for the simulation of the silicon micro-st… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Journal ref: Nuclear Instruments and Methods in Physics Research A 1057 (2023) 168685

  20. arXiv:2406.01653  [pdf, other

    stat.ML cs.LG math.PR stat.AP stat.ME

    An efficient Wasserstein-distance approach for reconstructing jump-diffusion processes using parameterized neural networks

    Authors: Mingtao Xia, Xiangting Li, Qi**g Shen, Tom Chou

    Abstract: We analyze the Wasserstein distance ($W$-distance) between two probability distributions associated with two multidimensional jump-diffusion processes. Specifically, we analyze a temporally decoupled squared $W_2$-distance, which provides both upper and lower bounds associated with the discrepancies in the drift, diffusion, and jump amplitude functions between the two jump-diffusion processes. The… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    MSC Class: 60G07; 60J76

  21. arXiv:2406.01489  [pdf, other

    cs.CV

    DA-HFNet: Progressive Fine-Grained Forgery Image Detection and Localization Based on Dual Attention

    Authors: Yang Liu, Xiaofei Li, Jun Zhang, Shengze Hu, Jun Lei

    Abstract: The increasing difficulty in accurately detecting forged images generated by AIGC(Artificial Intelligence Generative Content) poses many risks, necessitating the development of effective methods to identify and further locate forged areas. In this paper, to facilitate research efforts, we construct a DA-HFNet forged image dataset guided by text or image-assisted GAN and Diffusion model. Our goal i… ▽ More

    Submitted 4 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  22. arXiv:2406.01432  [pdf, other

    cs.CV

    ED-SAM: An Efficient Diffusion Sampling Approach to Domain Generalization in Vision-Language Foundation Models

    Authors: Thanh-Dat Truong, Xin Li, Bhiksha Raj, Jackson Cothren, Khoa Luu

    Abstract: The Vision-Language Foundation Model has recently shown outstanding performance in various perception learning tasks. The outstanding performance of the vision-language model mainly relies on large-scale pre-training datasets and different data augmentation techniques. However, the domain generalization problem of the vision-language foundation model needs to be addressed. This problem has limited… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  23. arXiv:2406.01332  [pdf, ps, other

    hep-ex

    Measurements of the branching fractions of semileptonic $D^{+}_s$ decays via $e^+e^-\to D_s^{*+}D_s^{*-}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: We measure the absolute branching fractions of semileptonic $D^+_s$ decays via the $e^+e^-\to D_s^{*+}D_s^{*-}$ process using $e^+e^-$ collision data corresponding to an integrated luminosity of $10.64~\mathrm{fb}^{-1}$ collected by the BESIII detector at center-of-mass energies between 4.237 and 4.699 GeV. The branching fractions are… ▽ More

    Submitted 4 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 14 pages, 3 figures

  24. arXiv:2406.01222  [pdf, other

    physics.chem-ph physics.comp-ph

    Symmetry enforced solution of the many-body Schrödinger equation with deep neural network

    Authors: Zhe Li, Zixiang Lu, Ruichen Li, Xuelan Wen, Xiang Li, Liwei Wang, Ji Chen, Weiluo Ren

    Abstract: The integration of deep neural networks with the Variational Monte Carlo (VMC) method has marked a significant advancement in solving the Schrödinger equation. In this work, we enforce spin symmetry in the neural network-based VMC calculation with modified optimization target. Our method is designed to solve for the ground state and multiple excited states with target spin symmetry at a low comput… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  25. arXiv:2406.01112  [pdf, other

    cs.CV

    BACON: Bayesian Optimal Condensation Framework for Dataset Distillation

    Authors: Zheng Zhou, Hongbo Zhao, Guangliang Cheng, Xiangtai Li, Shuchang Lyu, Wenquan Feng, Qi Zhao

    Abstract: Dataset Distillation (DD) aims to distill knowledge from extensive datasets into more compact ones while preserving performance on the test set, thereby reducing storage costs and training expenses. However, existing methods often suffer from computational intensity, particularly exhibiting suboptimal performance with large dataset sizes due to the lack of a robust theoretical framework for analyz… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 22 pages, 10 figures

  26. arXiv:2406.01069  [pdf, other

    cs.CV

    UniQA: Unified Vision-Language Pre-training for Image Quality and Aesthetic Assessment

    Authors: Hantao Zhou, Longxiang Tang, Rui Yang, Guanyi Qin, Yan Zhang, Runze Hu, Xiu Li

    Abstract: Image Quality Assessment (IQA) and Image Aesthetic Assessment (IAA) aim to simulate human subjective perception of image visual quality and aesthetic appeal. Existing methods typically address these tasks independently due to distinct learning objectives. However, they neglect the underlying interconnectedness of both tasks, which hinders the learning of task-agnostic shared representations for hu… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  27. arXiv:2406.01065  [pdf, other

    cs.LG cs.AI

    Causal prompting model-based offline reinforcement learning

    Authors: Xuehui Yu, Yi Guan, Rujia Shen, Xin Li, Chen Tang, **gchi Jiang

    Abstract: Model-based offline Reinforcement Learning (RL) allows agents to fully utilise pre-collected datasets without requiring additional or unethical explorations. However, applying model-based offline RL to online systems presents challenges, primarily due to the highly suboptimal (noise-filled) and diverse nature of datasets generated by online systems. To tackle these issues, we introduce the Causal… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  28. arXiv:2406.01029  [pdf, other

    cs.CV

    CYCLO: Cyclic Graph Transformer Approach to Multi-Object Relationship Modeling in Aerial Videos

    Authors: Trong-Thuan Nguyen, Pha Nguyen, Xin Li, Jackson Cothren, Alper Yilmaz, Khoa Luu

    Abstract: Video scene graph generation (VidSGG) has emerged as a transformative approach to capturing and interpreting the intricate relationships among objects and their temporal dynamics in video sequences. In this paper, we introduce the new AeroEye dataset that focuses on multi-object relationship modeling in aerial videos. Our AeroEye dataset features various drone scenes and includes a visually compre… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  29. arXiv:2406.01013  [pdf, other

    cs.LG cs.CL

    Scalable Ensembling For Mitigating Reward Overoptimisation

    Authors: Ahmed M. Ahmed, Rafael Rafailov, Stepan Sharkov, Xuechen Li, Sanmi Koyejo

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has enabled significant advancements within language modeling for powerful, instruction-following models. However, the alignment of these models remains a pressing challenge as the policy tends to overfit the learned ``proxy" reward model past an inflection point of utility as measured by a ``gold" reward model that is more performant -- a phenomen… ▽ More

    Submitted 18 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  30. arXiv:2406.01007  [pdf, other

    hep-ex

    Measurement of Electron Antineutrino Oscillation Amplitude and Frequency via Neutron Capture on Hydrogen at Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, J. Cheng, Y. -C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng , et al. (177 additional authors not shown)

    Abstract: This Letter reports the first measurement of the oscillation amplitude and frequency of reactor antineutrinos at Daya Bay via neutron capture on hydrogen using 1958 days of data. With over 3.6 million signal candidates, an optimized candidate selection, improved treatment of backgrounds and efficiencies, refined energy calibration, and an energy response model for the capture-on-hydrogen sensitive… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  31. arXiv:2406.00993  [pdf

    eess.SP cs.HC q-bio.OT

    Detection of Acetone as a Gas Biomarker for Diabetes Based on Gas Sensor Technology

    Authors: Jiaming Wei, Tong Liu, Jipeng Huang, Xiaowei Li, Yurui Qi, Gangyin Luo

    Abstract: With the continuous development and improvement of medical services, there is a growing demand for improving diabetes diagnosis. Exhaled breath analysis, characterized by its speed, convenience, and non-invasive nature, is leading the trend in diagnostic development. Studies have shown that the acetone levels in the breath of diabetes patients are higher than normal, making acetone a basis for dia… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 9 pages, 14 figures

  32. All-sky Guide Star Catalog for CSST

    Authors: Hui-Mei Feng, Zi-Huang Cao, Man I Lam, Ran Li, Hao Tian, Da-Yi Yin, Yuan-Yu Yang, Xin Zhang, Dong-Wei Fan, Yi-Qiao Dong, Xin-Feng Li, Wei Wang, Long Li, Hugh R. A. Jones, Yi-Han Tao, Jia-Lu Nie, Pei-Pei Wang, Mao-Yuan Liu, He-jun Yang, Chao Liu

    Abstract: The China Space Station Telescope (CSST) is a two-meter space telescope with multiple back-end instruments. The Fine Guidance Sensor (FGS) is an essential subsystem of the CSST Precision Image Stability System to ensure the required absolute pointing accuracy and line-of-sight stabilization. In this study, we construct the Main Guide Star Catalog for FGS. To accomplish this, we utilize the informa… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: published on RAA

  33. arXiv:2406.00918  [pdf, other

    cs.CR cs.CV cs.LG

    Assessing the Adversarial Security of Perceptual Hashing Algorithms

    Authors: Jordan Madden, Moxanki Bhavsar, Lhamo Dorje, Xiaohua Li

    Abstract: Perceptual hashing algorithms (PHAs) are utilized extensively for identifying illegal online content. Given their crucial role in sensitive applications, understanding their security strengths and weaknesses is critical. This paper compares three major PHAs deployed widely in practice: PhotoDNA, PDQ, and NeuralHash, and assesses their robustness against three typical attacks: normal image editing… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  34. arXiv:2406.00899  [pdf, other

    cs.CL cs.SD eess.AS

    YODAS: Youtube-Oriented Dataset for Audio and Speech

    Authors: Xinjian Li, Shinnosuke Takamichi, Takaaki Saeki, William Chen, Sayaka Shiota, Shinji Watanabe

    Abstract: In this study, we introduce YODAS (YouTube-Oriented Dataset for Audio and Speech), a large-scale, multilingual dataset comprising currently over 500k hours of speech data in more than 100 languages, sourced from both labeled and unlabeled YouTube speech datasets. The labeled subsets, including manual or automatic subtitles, facilitate supervised model training. Conversely, the unlabeled subsets ar… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: ASRU 2023

  35. arXiv:2406.00800  [pdf, other

    cs.LG cs.AI

    MagR: Weight Magnitude Reduction for Enhancing Post-Training Quantization

    Authors: Aozhong Zhang, Naigang Wang, Yanxia Deng, Xin Li, Zi Yang, Penghang Yin

    Abstract: In this paper, we present a simple optimization-based preprocessing technique called Weight Magnitude Reduction (MagR) to improve the performance of post-training quantization. For each linear layer, we adjust the pre-trained floating-point weights by solving an $\ell_\infty$-regularized optimization problem. This process greatly diminishes the maximum magnitude of the weights and smooths out outl… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  36. arXiv:2406.00755  [pdf, other

    cs.CL cs.AI cs.LG

    Evaluating Mathematical Reasoning of Large Language Models: A Focus on Error Identification and Correction

    Authors: Xiaoyuan Li, Wenjie Wang, Moxin Li, Junrong Guo, Yang Zhang, Fuli Feng

    Abstract: The rapid advancement of Large Language Models (LLMs) in the realm of mathematical reasoning necessitates comprehensive evaluations to gauge progress and inspire future directions. Existing assessments predominantly focus on problem-solving from the examinee perspective, overlooking a dual perspective of examiner regarding error identification and correction. From the examiner perspective, we defi… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: ACL Findings 2024

  37. arXiv:2406.00594  [pdf

    cs.IT

    Artificial General Intelligence (AGI) for the oil and gas industry: a review

    Authors: Jimmy Xuekai Li, Tiancheng Zhang, Yiran Zhu, Zhongwei Chen

    Abstract: Artificial General Intelligence (AGI) is set to profoundly impact the oil and gas industry by introducing unprecedented efficiencies and innovations. This paper explores AGI's foundational principles and its transformative applications, particularly focusing on the advancements brought about by large language models (LLMs) and extensive computer vision systems in the upstream sectors of the indust… ▽ More

    Submitted 11 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

    Comments: 20 Pages, Review paper, 15 Figures

  38. arXiv:2406.00490  [pdf, other

    cs.CV cs.AI

    Research on the Application of Computer Vision Based on Deep Learning in Autonomous Driving Technology

    Authors: **gyu Zhang, ** Cao, **ghao Chang, Xin** Li, Houze Liu, Zhenglin Li

    Abstract: This research aims to explore the application of deep learning in autonomous driving computer vision technology and its impact on improving system performance. By using advanced technologies such as convolutional neural networks (CNN), multi-task joint learning methods, and deep reinforcement learning, this article analyzes in detail the application of deep learning in image recognition, real-time… ▽ More

    Submitted 3 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

  39. arXiv:2406.00488  [pdf, other

    cs.LG cs.DC

    Federated Model Heterogeneous Matryoshka Representation Learning

    Authors: Li** Yi, Han Yu, Chao Ren, Gang Wang, Xiaoguang Liu, Xiaoxiao Li

    Abstract: Model heterogeneous federated learning (MHeteroFL) enables FL clients to collaboratively train models with heterogeneous structures in a distributed fashion. However, existing MHeteroFL methods rely on training loss to transfer knowledge between the client model and the server model, resulting in limited knowledge exchange. To address this limitation, we propose the Federated model heterogeneous M… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  40. arXiv:2406.00440  [pdf, other

    cs.CV

    Topo4D: Topology-Preserving Gaussian Splatting for High-Fidelity 4D Head Capture

    Authors: Xuanchen Li, Yuhao Cheng, Xingyu Ren, Haozhe Jia, Di Xu, Wenhan Zhu, Yichao Yan

    Abstract: 4D head capture aims to generate dynamic topological meshes and corresponding texture maps from videos, which is widely utilized in movies and games for its ability to simulate facial muscle movements and recover dynamic textures in pore-squeezing. The industry often adopts the method involving multi-view stereo and non-rigid alignment. However, this approach is prone to errors and heavily reliant… ▽ More

    Submitted 1 July, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

  41. arXiv:2406.00439  [pdf, other

    cs.RO cs.CV

    Learning Manipulation by Predicting Interaction

    Authors: Jia Zeng, Qingwen Bu, Bangjun Wang, Wenke Xia, Li Chen, Hao Dong, Haoming Song, Dong Wang, Di Hu, ** Luo, Heming Cui, Bin Zhao, Xuelong Li, Yu Qiao, Hongyang Li

    Abstract: Representation learning approaches for robotic manipulation have boomed in recent years. Due to the scarcity of in-domain robot data, prevailing methodologies tend to leverage large-scale human video datasets to extract generalizable features for visuomotor policy learning. Despite the progress achieved, prior endeavors disregard the interactive dynamics that capture behavior patterns and physical… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Accepted to RSS 2024. Project page: https://github.com/OpenDriveLab/MPI

  42. arXiv:2406.00262  [pdf, other

    cs.LG cs.AI

    Contrastive Learning Via Equivariant Representation

    Authors: Sifan Song, **feng Wang, Qiaochu Zhao, Xiang Li, Dufan Wu, Angelos Stefanidis, Jionglong Su, S. Kevin Zhou, Quanzheng Li

    Abstract: Invariant-based Contrastive Learning (ICL) methods have achieved impressive performance across various domains. However, the absence of latent space representation for distortion (augmentation)-related information in the latent space makes ICL sub-optimal regarding training efficiency and robustness in downstream tasks. Recent studies suggest that introducing equivariance into Contrastive Learning… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: Preprint. Under review

  43. arXiv:2406.00109  [pdf, other

    astro-ph.SR astro-ph.HE physics.space-ph

    Energetic Electrons Accelerated and Trapped in a Magnetic Bottle above a Solar Flare Arcade

    Authors: Bin Chen, Xiangliang Kong, Sijie Yu, Chengcai Shen, Xiaocan Li, Fan Guo, Yixian Zhang, Lindsay Glesener, Säm Krucker

    Abstract: Where and how flares efficiently accelerate charged particles remains an unresolved question. Recent studies revealed that a "magnetic bottle" structure, which forms near the bottom of a large-scale reconnection current sheet above the flare arcade, is an excellent candidate for confining and accelerating charged particles. However, further understanding its role requires linking the various obser… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: 20 pages, 13 figures (12 pages and 10 figures for main text). Accepted for publication in The Astrophysical Journal

  44. arXiv:2406.00032  [pdf, other

    cs.CL cs.AI cs.IR

    Paths of A Million People: Extracting Life Trajectories from Wikipedia

    Authors: Ying Zhang, Xiaofeng Li, Zhaoyang Liu, Haipeng Zhang

    Abstract: Notable people's life trajectories have been a focus of study -- the locations and times of various activities, such as birth, death, education, marriage, competition, work, delivering a speech, making a scientific discovery, finishing a masterpiece, and fighting a battle, and how these people interact with others, carry important messages for the broad research related to human dynamics. However,… ▽ More

    Submitted 25 May, 2024; originally announced June 2024.

    Comments: Preprint, under review. 15 pages

  45. arXiv:2406.00017  [pdf, other

    cs.CL cs.AI cs.MM

    PTA: Enhancing Multimodal Sentiment Analysis through Pipelined Prediction and Translation-based Alignment

    Authors: Shezheng Song, Shasha Li, Shan Zhao, Chengyu Wang, Xiaopeng Li, Jie Yu, Qian Wan, Jun Ma, Tianwei Yan, Wentao Ma, Xiaoguang Mao

    Abstract: Multimodal aspect-based sentiment analysis (MABSA) aims to understand opinions in a granular manner, advancing human-computer interaction and other fields. Traditionally, MABSA methods use a joint prediction approach to identify aspects and sentiments simultaneously. However, we argue that joint models are not always superior. Our analysis shows that joint models struggle to align relevant text to… ▽ More

    Submitted 13 June, 2024; v1 submitted 22 May, 2024; originally announced June 2024.

    Comments: Code will be released upon publication

  46. arXiv:2406.00009  [pdf, other

    cs.RO

    ULTra-AV: A Unified Longitudinal Trajectory Dataset for Automated Vehicle

    Authors: Hang Zhou, Ke Ma, Shixiao Liang, Xiaopeng Li, Xiaobo Qu

    Abstract: Automated Vehicles (AVs) promise significant advances in transportation. Critical to these improvements is understanding AVs' longitudinal behavior, relying heavily on real-world trajectory data. Existing open-source trajectory datasets of AV, however, often fall short in refinement, reliability, and completeness, hindering effective performance metrics analysis and model development. This study a… ▽ More

    Submitted 16 May, 2024; originally announced June 2024.

    Comments: NA

  47. arXiv:2405.20754  [pdf, ps, other

    math.AP

    Non-uniqueness of weak solutions to 2D generalized Navier-Stokes equations

    Authors: Xinliang Li, Zhong Tan

    Abstract: We study the non-uniqueness of weak solutions for the two-dimensional hyper-dissipative Navier-Stokes equations in the super-critical spaces $L_{t}^γL_{x}^{p}$ when $α\in[1,\frac{3}{2})$, and obtain the conclusion that the non-uniqueness of the weak solutions at the endpoint $(γ,p)=(\infty, \frac{2}{2α-1})$ is sharp in view of the generalized Ladyženskaja-Prodi-Serrin condition by using a differen… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 30 pages

    MSC Class: 35A02; 35Q30; 76D05

  48. arXiv:2405.20702  [pdf, other

    q-bio.PE physics.soc-ph

    Effect of antibody levels on the spread of disease in multiple infections

    Authors: Xiangxi Li, Yuhan Li, Minyu Feng, Jürgen Kurths

    Abstract: There are complex interactions between antibody levels and epidemic propagation, the antibody level of an individual influences the probability of infection, and the spread of the virus influences the antibody level of each individual. There exist some viruses that, in their natural state, cause antibody levels in an infected individual to gradually decay. When these antibody levels decay to a cer… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 14 pages, 9 figures

  49. arXiv:2405.20680  [pdf, other

    cs.AI cs.CL

    Unraveling and Mitigating Retriever Inconsistencies in Retrieval-Augmented Large Language Models

    Authors: Mingda Li, Xinyu Li, Yifan Chen, Wenfeng Xuan, Weinan Zhang

    Abstract: Although Retrieval-Augmented Large Language Models (RALMs) demonstrate their superiority in terms of factuality, they do not consistently outperform the original retrieval-free Language Models (LMs). Our experiments reveal that this example-level performance inconsistency exists not only between retrieval-augmented and retrieval-free LM but also among different retrievers. To understand this pheno… ▽ More

    Submitted 4 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: ACL 2024 (findings)

  50. arXiv:2405.20676  [pdf, other

    hep-ex

    Search for $e^{+}e^{-}\toη'ψ(2S)$ at center-of-mass energies from 4.66 to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Using data samples with an integrated luminosity of $4.67~\mathrm{fb}^{-1}$ collected by the BESIII detector operating at the BEPCII collider, we search for the process $e^+e^- \rightarrow η' ψ(2S)$ at center-of-mass energies from $4.66$ to $4.95~\mathrm{GeV}$. No significant signal is observed, and upper limits for the Born cross sections $σ^B(e^+e^-\rightarrowη'ψ(2S))$ at the 90\% confidence lev… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.