Skip to main content

Showing 1–50 of 7,307 results for author: li, C

.
  1. arXiv:2407.05671  [pdf, other

    cs.CV cs.AI

    MSTF: Multiscale Transformer for Incomplete Trajectory Prediction

    Authors: Zhanwen Liu, Chao Li, Nan Yang, Yang Wang, Jiaqi Ma, Guangliang Cheng, Xiangmo Zhao

    Abstract: Motion forecasting plays a pivotal role in autonomous driving systems, enabling vehicles to execute collision warnings and rational local-path planning based on predictions of the surrounding vehicles. However, prevalent methods often assume complete observed trajectories, neglecting the potential impact of missing values induced by object occlusion, scope limitation, and sensor failures. Such ove… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  2. arXiv:2407.05552  [pdf, other

    cs.CV

    Ada-adapter:Fast Few-shot Style Personlization of Diffusion Model with Pre-trained Image Encoder

    Authors: Jia Liu, Changlin Li, Qirui Sun, Jiahui Ming, Chen Fang, Jue Wang, Bing Zeng, Shuaicheng Liu

    Abstract: Fine-tuning advanced diffusion models for high-quality image stylization usually requires large training datasets and substantial computational resources, hindering their practical applicability. We propose Ada-Adapter, a novel framework for few-shot style personalization of diffusion models. Ada-Adapter leverages off-the-shelf diffusion models and pre-trained image feature encoders to learn a com… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 16 pages, 11 figures

    MSC Class: 68T07 ACM Class: I.4.0

  3. arXiv:2407.05540  [pdf, other

    cs.CV

    GTP-4o: Modality-prompted Heterogeneous Graph Learning for Omni-modal Biomedical Representation

    Authors: Chenxin Li, Xinyu Liu, Cheng Wang, Yifan Liu, Weihao Yu, **g Shao, Yixuan Yuan

    Abstract: Recent advances in learning multi-modal representation have witnessed the success in biomedical domains. While established techniques enable handling multi-modal information, the challenges are posed when extended to various clinical modalities and practical modalitymissing setting due to the inherent modality gaps. To tackle these, we propose an innovative Modality-prompted Heterogeneous Graph fo… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV2024

  4. arXiv:2407.05376  [pdf, other

    cs.RO

    Rethinking Closed-loop Planning Framework for Imitation-based Model Integrating Prediction and Planning

    Authors: Jiayu Guo, Mingyue Feng, Pengfei Zhu, Chengjun Li, Jian Pu

    Abstract: In recent years, the integration of prediction and planning through neural networks has received substantial attention. Despite extensive studies on it, there is a noticeable gap in understanding the operation of such models within a closed-loop planning setting. To bridge this gap, we propose a novel closed-loop planning framework compatible with neural networks engaged in joint prediction and pl… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 7 pages,5 figures

  5. arXiv:2407.05332  [pdf, other

    quant-ph

    Experimental investigation of direct non-Hermitian measurement and uncertainty relation towards high-dimensional quantum domain

    Authors: Yi-Tao Wang, Zhao-An Wang, Zhi-Peng Li, Xiao-Dong Zeng, Jia-Ming Ren, Wei Liu, Yuan-Ze Yang, Nai-Jie Guo, Lin-Ke Xie, Jun-You Liu, Yu-Hang Ma, Jian-Shun Tang, Chengjie Zhang, Chuan-Feng Li, Guang-Can Guo

    Abstract: Non-Hermitian dynamics in quantum systems have unveiled novel phenomena, yet the implementation of valid non-Hermitian quantum measurement remains a challenge, because a universal quantum projective mechanism on the complete but skewed non-Hermitian eigenstates is not explicit in experiment. This limitation hinders the direct acquisition of non-Hermitian observable statistics (e.g., non-Hermitian… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 6 pages, 4 figures

  6. arXiv:2407.05117  [pdf, ps, other

    hep-ex

    Search for the baryon number and lepton number violating decays $τ^-\to Λπ^-$ and $τ^-\to \barΛπ^-$ at Belle II

    Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Ahmed, H. Aihara, N. Akopov, A. Aloisio, N. Althubiti, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien , et al. (349 additional authors not shown)

    Abstract: We present a search for the baryon number $B$ and lepton number $L$ violating decays $τ^- \rightarrow Λπ^-$ and $τ^- \rightarrow \barΛ π^-$ produced from the $e^+e^-\to τ^+τ^-$ process, using a 364 fb$^{-1}$ data sample collected by the Belle~II experiment at the SuperKEKB collider. No evidence of signal is found in either decay mode, which have $|Δ(B-L)|$ equal to $2$ and $0$, respectively. Upper… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: 8 pages, 4 figures

    Report number: Belle II Preprint 2024-020; KEK Preprint 2024-17

  7. arXiv:2407.04995  [pdf

    physics.optics

    A Broadband Algorithm for Adiabatic Mode Evolution and An Application on Polarization Splitter-Rotator on LNOI Platform

    Authors: Geng Chen, Chijun Li, Xuanhao Wang, Yuankang Huang, Siyu Lu, Yiqi Dai, Xiangyu Meng, Cheng Zeng, **song Xia

    Abstract: Adiabatic mode evolution waveguides (AMEWs) are widely utilized in integrated photonics, including tapered waveguides, edge couplers, mode converters, splitters, etc. An analytical theory and a novel AMEW design algorithm are developed to create shortcuts to adiabaticity (STA). With the new algorithm, we demonstrate a broadband and highly efficient polarization splitter-rotator (PSR) on a lithium-… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: 9 pages, 6 figures, 2 tables

  8. arXiv:2407.04643  [pdf

    cond-mat.supr-con cond-mat.mtrl-sci

    Granular Ta-Te nanowire superconductivity violating the Pauli limit

    Authors: Lingxiao Zhao, Yi Zhao, Cuiying Pei, Changhua Li, Qi Wang, Juefei Wu, Weizheng Cao, Lin Xiong, Haiyin Zhu, Tian** Ying, Yanpeng Qi

    Abstract: Strategies to achieve higher upper-critical-field superconductors (μ0Hc2(0)) are of great interest for both fundamental science and practical applications. While reducing the thickness of two-dimensional (2D) materials to a few layers significantly enhances μ0Hc2(0) with accompanied potential unconventional pairing mechanisms, further dimensional reduction to 1D compounds rarely exceeds the expect… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 12 pages,4 figures

  9. arXiv:2407.04281  [pdf, other

    cs.RO

    WOMD-Reasoning: A Large-Scale Language Dataset for Interaction and Driving Intentions Reasoning

    Authors: Yiheng Li, Chongjian Ge, Chenran Li, Chenfeng Xu, Masayoshi Tomizuka, Chen Tang, Mingyu Ding, Wei Zhan

    Abstract: We propose Waymo Open Motion Dataset-Reasoning (WOMD-Reasoning), a language annotation dataset built on WOMD, with a focus on describing and reasoning interactions and intentions in driving scenarios. Previous language datasets primarily captured interactions caused by close distances. However, interactions induced by traffic rules and human intentions, which can occur over long distances, are yet… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  10. Exploration of Class Center for Fine-Grained Visual Classification

    Authors: Hang Yao, Qiguang Miao, Peipei Zhao, Chaoneng Li, Xin Li, Guanwen Feng, Ruyi Liu

    Abstract: Different from large-scale classification tasks, fine-grained visual classification is a challenging task due to two critical problems: 1) evident intra-class variances and subtle inter-class differences, and 2) overfitting owing to fewer training samples in datasets. Most existing methods extract key features to reduce intra-class variances, but pay no attention to subtle inter-class differences… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Accpeted by TCSVT. Code and trained models are here:https://github.com/hyao1/ECC

  11. arXiv:2407.04078  [pdf, other

    cs.CL cs.AI cs.LG

    DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

    Authors: Chengpeng Li, Guanting Dong, Mingfeng Xue, Ru Peng, Xiang Wang, Dayiheng Liu

    Abstract: Large language models (LLMs) have made impressive progress in handling simple math problems, yet they still struggle with more challenging and complex mathematical tasks. In this paper, we introduce a series of LLMs that employs the Decomposition of thought with code assistance and self-correction for mathematical reasoning, dubbed as DotaMath. DotaMath models tackle complex mathematical tasks by… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Work in progress

  12. arXiv:2407.03995  [pdf, other

    cs.LG cs.AI cs.RO

    ROER: Regularized Optimal Experience Replay

    Authors: Changling Li, Zhang-Wei Hong, Pulkit Agrawal, Divyansh Garg, Joni Pajarinen

    Abstract: Experience replay serves as a key component in the success of online reinforcement learning (RL). Prioritized experience replay (PER) reweights experiences by the temporal difference (TD) error empirically enhancing the performance. However, few works have explored the motivation of using TD error. In this work, we provide an alternative perspective on TD-error-based reweighting. We show the conne… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  13. arXiv:2407.03963  [pdf, other

    cs.CL cs.AI

    LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs

    Authors: LLM-jp, :, Akiko Aizawa, Eiji Aramaki, Bowen Chen, Fei Cheng, Hiroyuki Deguchi, Rintaro Enomoto, Kazuki Fujii, Kensuke Fukumoto, Takuya Fukushima, Namgi Han, Yuto Harada, Chikara Hashimoto, Tatsuya Hiraoka, Shohei Hisada, Sosuke Hosokawa, Lu Jie, Keisuke Kamata, Teruhito Kanazawa, Hiroki Kanezashi, Hiroshi Kataoka, Satoru Katsumata, Daisuke Kawahara, Seiya Kawano , et al. (57 additional authors not shown)

    Abstract: This paper introduces LLM-jp, a cross-organizational project for the research and development of Japanese large language models (LLMs). LLM-jp aims to develop open-source and strong Japanese LLMs, and as of this writing, more than 1,500 participants from academia and industry are working together for this purpose. This paper presents the background of the establishment of LLM-jp, summaries of its… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  14. arXiv:2407.03816  [pdf

    physics.optics

    Compact ultra-broadband light coupling on chip via nonadiabatic pum**

    Authors: Weiwei Liu, Chijun Li, Bing Wang, Tianyan Chai, Lingzhi Zheng, Zhuoxiong Liu, Haoru Zhang, Shuaifei Ren, Xiaohong Li, Cheng Zeng, **song Xia, Peixiang Lu

    Abstract: Enlarging bandwidth capacity of the integrated photonic systems demands efficient and broadband light coupling among optical elements, which has been a vital issue in integrated photonics. Here, we have developed a compact ultra-broadband light coupling strategy based on nonadiabatic pum** in coupled optical waveguides, and experimentally demonstrated the designs in thin-film lithium niobate on… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  15. arXiv:2407.03757  [pdf, other

    cs.CV

    DiffRetouch: Using Diffusion to Retouch on the Shoulder of Experts

    Authors: Zheng-Peng Duan, Jiawei zhang, Zheng Lin, Xin **, Dongqing Zou, Chunle Guo, Chongyi Li

    Abstract: Image retouching aims to enhance the visual quality of photos. Considering the different aesthetic preferences of users, the target of retouching is subjective. However, current retouching methods mostly adopt deterministic models, which not only neglects the style diversity in the expert-retouched results and tends to learn an average style during training, but also lacks sample diversity during… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  16. arXiv:2407.03717  [pdf, ps, other

    math.AP

    The existence of solutions for a Schrodinger equation with jum** nonlinearities crossing the essential spectrum

    Authors: Chong Li, Xinyu Li

    Abstract: In this paper, we establish the existence of one solution for a Schrödinger equation with jum** nonlinearities: $-Δu+V(x)u=f(x,u)$, $x\in \mathbb {R}^N$, and $u(x)\to 0$, $|x|\to +\infty$, where $V$ is a potential function on which we make hypotheses, and in particular allow $V$ which is unbounded below, and $f(x,u)=au^-+bu^++g(x,u)$. No restriction on $b$ is required, which implies that… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  17. arXiv:2407.03037  [pdf, other

    cs.SE

    Vision-driven Automated Mobile GUI Testing via Multimodal Large Language Model

    Authors: Zhe Liu, Cheng Li, Chunyang Chen, Junjie Wang, Boyu Wu, Yawen Wang, Jun Hu, Qing Wang

    Abstract: With the advancement of software rendering techniques, GUI pages in mobile apps now encompass a wealth of visual information, where the visual semantics of each page contribute to the overall app logic, presenting new challenges to software testing. Despite the progress in automated Graphical User Interface (GUI) testing, the absence of testing oracles has constrained its efficacy to identify only… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  18. arXiv:2407.03033  [pdf

    cs.CV

    ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation

    Authors: Chang Li, Pengfei Zhang, Yu Wang

    Abstract: Currently the semantic segmentation task of multispectral remotely sensed imagery (MSRSI) faces the following problems: 1) Usually, only single domain feature (i.e., space domain or frequency domain) is considered; 2) downsampling operation in encoder generally leads to the accuracy loss of edge extraction; 3) multichannel features of MSRSI are not fully considered; and 4) prior knowledge of remot… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  19. arXiv:2407.02899  [pdf, other

    hep-ex

    Measurement of the branching fraction of the decay $J/ψ\to p \bar{p} η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: A high precision measurement of the branching fraction of the decay $J/ψ\to p \bar{p} η$ is performed using $(10 087 \pm 44) \times 10^6$ $J/ψ$ events recorded by the {BESIII} detector at the {BEPCII} storage ring. The branching fractions of the two decays $J/ψ\to p \bar{p} η(η\to γγ)$ and $J/ψ\to p \bar{p} η(η\to π^+ π^- π^0)$ are measured individually to be… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  20. arXiv:2407.02328  [pdf, other

    cs.CL

    Efficient Sparse Attention needs Adaptive Token Release

    Authors: Chaoran Zhang, Lixin Zou, Dan Luo, Min Tang, Xiangyang Luo, Zihao Li, Chenliang Li

    Abstract: In recent years, Large Language Models (LLMs) have demonstrated remarkable capabilities across a wide array of text-centric tasks. However, their `large' scale introduces significant computational and storage challenges, particularly in managing the key-value states of the transformer, which limits their wider applicability. Therefore, we propose to adaptively release resources from caches and reb… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted at ACL 2024(Findings)

  21. arXiv:2407.02265  [pdf, other

    cs.LG q-bio.BM

    DrugCLIP: Contrastive Drug-Disease Interaction For Drug Repurposing

    Authors: Yingzhou Lu, Yaojun Hu, Chenhao Li

    Abstract: Bringing a novel drug from the original idea to market typically requires more than ten years and billions of dollars. To alleviate the heavy burden, a natural idea is to reuse the approved drug to treat new diseases. The process is also known as drug repurposing or drug repositioning. Machine learning methods exhibited huge potential in automating drug repurposing. However, it still encounter som… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  22. arXiv:2407.02235  [pdf

    cs.CL

    Towards a Holistic Framework for Multimodal Large Language Models in Three-dimensional Brain CT Report Generation

    Authors: Cheng-Yi Li, Kao-Jung Chang, Cheng-Fu Yang, Hsin-Yu Wu, Wenting Chen, Hritik Bansal, Ling Chen, Yi-** Yang, Yu-Chun Chen, Shih-Pin Chen, Jiing-Feng Lirng, Kai-Wei Chang, Shih-Hwa Chiou

    Abstract: Multi-modal large language models (MLLMs) have been given free rein to explore exciting medical applications with a primary focus on radiology report generation. Nevertheless, the preliminary success in 2D radiology captioning is incompetent to reflect the real-world diagnostic challenge in the volumetric 3D anatomy. To mitigate three crucial limitation aspects in the existing literature, includin… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 6 figures, 5 supplementary figures, 8 supplementary tables

  23. arXiv:2407.02207  [pdf, other

    quant-ph physics.app-ph physics.optics

    Global calibration of large-scale photonic integrated circuits

    Authors: **-Hao Zheng, Qin-Qin Wang, Lan-Tian Feng, Yu-Yang Ding, Xiao-Ye Xu, Xi-Feng Ren, Chuan-Feng Li, Guang-Can Guo

    Abstract: The advancing maturity of photonic integrated circuit (PIC) fabrication technology enables the high integration of an increasing number of optical components onto a single chip. With the incremental circuit complexity, the calibration of active phase shifters in a large-scale PIC becomes a crucially important issue. The traditional one-by-one calibration techniques encounter significant hurdles wi… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 9 pages, 5 figures, and comments are welcome

  24. arXiv:2407.01900  [pdf, other

    nlin.SI math-ph math.CO

    Non-crossing permutations for the KP solitons under the Gel'fand-Dickey reductions and the vertex operators

    Authors: Shilong Huang, Yuji Kodama, Chuanzhong Li

    Abstract: We give a classification of the $regular$ soliton solutions of the KP hierarchy, referred to as the $KP solitons$, under the Gel'fand-Dickey $\ell$-reductions in terms of the permutation of the symmetric group. As an example, we show that the regular soliton solutions of the (good) Boussinesq equation as the 3-reduction can have $at ~most$ one resonant soliton in addition to two sets of solitons p… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 24 pages

  25. arXiv:2407.01584  [pdf, ps, other

    math.CA math.AP math.PR

    The map** properties of fractional derivatives in weighted fractional Sobolev space

    Authors: Cailing Li

    Abstract: We study the map** behavior of the Marchaud fractional derivative with different extensions in the scale of fractional weighted Sobolev spaces. In particular we show that the $α$--order Riemann--Liouville fractional derivative maps $W^{p,s}_0(Ω)$ to $W^{p,s-α}(Ω)$, for all $0<α<s<1$ and the $α$--order Marchaud fractional derivative with even extension maps the fractional Sobolev space… ▽ More

    Submitted 18 June, 2024; originally announced July 2024.

  26. arXiv:2407.01470  [pdf, other

    cs.CL

    DogeRM: Equip** Reward Models with Domain Knowledge through Model Merging

    Authors: Tzu-Han Lin, Chen-An Li, Hung-yi Lee, Yun-Nung Chen

    Abstract: Reinforcement learning from human feedback (RLHF) is a popular strategy for aligning large language models (LLMs) with desired behaviors. Reward modeling is a crucial step in RLHF. However, collecting paired preference data for training reward models is often costly and time-consuming, especially for domain-specific preferences requiring expert annotation. To address this challenge, we propose the… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Preprint. Code will be released after the review results

  27. arXiv:2407.01306  [pdf, other

    cs.LG cs.CR

    Unveiling the Unseen: Exploring Whitebox Membership Inference through the Lens of Explainability

    Authors: Chenxi Li, Abhinav Kumar, Zhen Guo, Jie Hou, Reza Tourani

    Abstract: The increasing prominence of deep learning applications and reliance on personalized data underscore the urgent need to address privacy vulnerabilities, particularly Membership Inference Attacks (MIAs). Despite numerous MIA studies, significant knowledge gaps persist, particularly regarding the impact of hidden features (in isolation) on attack efficacy and insufficient justification for the root… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 20 pages, 10 figures, 4 tables

  28. arXiv:2407.01301  [pdf, other

    cs.CV

    GaussianStego: A Generalizable Stenography Pipeline for Generative 3D Gaussians Splatting

    Authors: Chenxin Li, Hengyu Liu, Zhiwen Fan, Wuyang Li, Yifan Liu, Panwang Pan, Yixuan Yuan

    Abstract: Recent advancements in large generative models and real-time neural rendering using point-based techniques pave the way for a future of widespread visual data distribution through sharing synthesized 3D assets. However, while standardized methods for embedding proprietary or copyright information, either overtly or subtly, exist for conventional visual content such as images and videos, this issue… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Project website: https://gaussian-stego.github.io/

  29. arXiv:2407.01284  [pdf, other

    cs.AI cs.CL cs.CV cs.LG cs.SC

    We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?

    Authors: Runqi Qiao, Qiuna Tan, Guanting Dong, Minhui Wu, Chong Sun, Xiaoshuai Song, Zhuoma GongQue, Shanglin Lei, Zhe Wei, Miaoxuan Zhang, Runfeng Qiao, Yifan Zhang, Xiao Zong, Yida Xu, Muxi Diao, Zhimin Bao, Chen Li, Honggang Zhang

    Abstract: Visual mathematical reasoning, as a fundamental visual reasoning ability, has received widespread attention from the Large Multimodal Models (LMMs) community. Existing benchmarks, such as MathVista and MathVerse, focus more on the result-oriented performance but neglect the underlying principles in knowledge acquisition and generalization. Inspired by human-like mathematical reasoning, we introduc… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Work in progress

  30. arXiv:2407.01244  [pdf, other

    cs.CV

    CLHOP: Combined Audio-Video Learning for Horse 3D Pose and Shape Estimation

    Authors: Ci Li, Elin Hernlund, Hedvig Kjellström, Silvia Zuffi

    Abstract: In the monocular setting, predicting 3D pose and shape of animals typically relies solely on visual information, which is highly under-constrained. In this work, we explore using audio to enhance 3D shape and motion recovery of horses from monocular video. We test our approach on two datasets: an indoor treadmill dataset for 3D evaluation and an outdoor dataset capturing diverse horse movements, t… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: CVPR CV4Animals Workshop 2024

  31. arXiv:2407.01029  [pdf, other

    cs.CV

    EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting

    Authors: Chenxin Li, Brandon Y. Feng, Yifan Liu, Hengyu Liu, Cheng Wang, Weihao Yu, Yixuan Yuan

    Abstract: 3D reconstruction of biological tissues from a collection of endoscopic images is a key to unlock various important downstream surgical applications with 3D capabilities. Existing methods employ various advanced neural rendering techniques for photorealistic view synthesis, but they often struggle to recover accurate 3D representations when only sparse observations are available, which is usually… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Accpeted by MICCAI2024

  32. arXiv:2407.01018  [pdf, other

    cs.IT eess.SP

    Experimental Comparison of Average-Power Constrained and Peak-Power Constrained 64QAM under Optimal Clip** in 400Gbps Unamplified Coherent Links

    Authors: Wing-Chau Ng, Chuandong Li

    Abstract: We experimentally demonstrated an end-to-end link budget optimization over clip** in 400Gbps unamplified links, showing that the clipped MB distribution outperforms the peak-power constrained 64QAM by 1dB link budget.

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Submitted to European Conference on Optical Communications (ECOC) 2024

  33. arXiv:2407.00984  [pdf

    q-bio.NC cs.AI

    Individual brain parcellation: Review of methods, validations and applications

    Authors: Chengyi Li, Shan Yu, Yue Cui

    Abstract: Individual brains vary greatly in morphology, connectivity and organization. The applicability of group-level parcellations is limited by the rapid development of precision medicine today because they do not take into account the variation of parcels at the individual level. Accurate map** of brain functional regions at the individual level is pivotal for a comprehensive understanding of the var… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 15 pages, 2 figures

  34. arXiv:2407.00965  [pdf, other

    hep-ex

    Measurement of the integrated luminosity of data samples collected during 2019-2022 by the Belle II experiment

    Authors: The Belle II Collaboration, I. Adachi, L. Aggarwal, H. Ahmed, J. K. Ahn, H. Aihara, N. Akopov, A. Aloisio, N. Althubiti, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, M. Barrett, J. Baudot, A. Baur, A. Beaubien , et al. (382 additional authors not shown)

    Abstract: A series of data samples was collected with the Belle II detector at the SuperKEKB collider from March 2019 to June 2022. We determine the integrated luminosities of these data samples using three distinct methodologies involving Bhabha ($e^+e^- \to e^+e^-(nγ)$), digamma ($e^+e^- \to γγ(nγ)$), and dimuon ($e^+e^- \to μ^+ μ^- (nγ)$) events. The total integrated luminosity obtained with Bhabha, diga… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 12 pages, 3 figures

    Report number: Belle II Preprint 2024-019; KEK Preprint 2024-16

  35. arXiv:2407.00898  [pdf, other

    cs.RO

    Residual-MPPI: Online Policy Customization for Continuous Control

    Authors: Pengcheng Wang, Chenran Li, Catherine Weaver, Kenta Kawamoto, Masayoshi Tomizuka, Chen Tang, Wei Zhan

    Abstract: Policies learned through Reinforcement Learning (RL) and Imitation Learning (IL) have demonstrated significant potential in achieving advanced performance in continuous control tasks. However, in real-world environments, it is often necessary to further customize a trained policy when there are additional requirements that were unforeseen during the original training phase. It is possible to fine-… ▽ More

    Submitted 3 July, 2024; v1 submitted 30 June, 2024; originally announced July 2024.

  36. arXiv:2407.00488  [pdf, other

    cs.CL cs.AI

    PFME: A Modular Approach for Fine-grained Hallucination Detection and Editing of Large Language Models

    Authors: Kunquan Deng, Zeyu Huang, Chen Li, Chenghua Lin, Min Gao, Wenge Rong

    Abstract: Large Language Models (LLMs) excel in fluency but risk producing inaccurate content, called "hallucinations." This paper outlines a standardized process for categorizing fine-grained hallucination types and proposes an innovative framework--the Progressive Fine-grained Model Editor (PFME)--specifically designed to detect and correct fine-grained hallucinations in LLMs. PFME consists of two collabo… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  37. arXiv:2407.00474  [pdf, other

    cs.LG cs.AI

    MH-pFLGB: Model Heterogeneous personalized Federated Learning via Global Bypass for Medical Image Analysis

    Authors: Luyuan Xie, Manqing Lin, ChenMing Xu, Tianyu Luan, Zhipeng Zeng, Wenjun Qian, Cong Li, Yuejian Fang, Qingni Shen, Zhonghai Wu

    Abstract: In the evolving application of medical artificial intelligence, federated learning is notable for its ability to protect training data privacy. Federated learning facilitates collaborative model development without the need to share local data from healthcare institutions. Yet, the statistical and system heterogeneity among these institutions poses substantial challenges, which affects the effecti… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2405.06822

  38. arXiv:2407.00462  [pdf, other

    cs.CV cs.AI

    pFLFE: Cross-silo Personalized Federated Learning via Feature Enhancement on Medical Image Segmentation

    Authors: Luyuan Xie, Manqing Lin, Siyuan Liu, ChenMing Xu, Tianyu Luan, Cong Li, Yuejian Fang, Qingni Shen, Zhonghai Wu

    Abstract: In medical image segmentation, personalized cross-silo federated learning (FL) is becoming popular for utilizing varied data across healthcare settings to overcome data scarcity and privacy concerns. However, existing methods often suffer from client drift, leading to inconsistent performance and delayed training. We propose a new framework, Personalized Federated Learning via Feature Enhancement… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  39. arXiv:2407.00421  [pdf

    physics.optics

    Multi-wavelength switchable single-frequency hyper Raman microlasers

    Authors: Chuntao Li, Ni Yao, **tian Lin, Renhong Gao, Jianglin Guan, Guanghui Zhao, Minghui Li, Min Wang, Lingling Qiao, Ya Cheng

    Abstract: Multi-wavelength switchable single-frequency microlasers in a broad spectral range are highly desirable for integrated photonic applications due to their dynamic switching functionality, narrow linewidth, and high side-mode-suppression-ratio (SMSR). Here, a strategy based on highly efficient successive excitation of different stimulated multi-photon hyper-Raman scattering (SMPHRS) processes is pro… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 17 pages,5 figures, and 1 table

  40. arXiv:2407.00397  [pdf, other

    cs.LG stat.ML

    Markovian Gaussian Process: A Universal State-Space Representation for Stationary Temporal Gaussian Process

    Authors: Weihan Li, Yule Wang, Chengrui Li, Anqi Wu

    Abstract: Gaussian Processes (GPs) and Linear Dynamical Systems (LDSs) are essential time series and dynamic system modeling tools. GPs can handle complex, nonlinear dynamics but are computationally demanding, while LDSs offer efficient computation but lack the expressive power of GPs. To combine their benefits, we introduce a universal method that allows an LDS to mirror stationary temporal GPs. This state… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  41. arXiv:2407.00136  [pdf, other

    hep-ex

    Observation of the Electromagnetic Dalitz Transition $h_c \rightarrow e^+e^-η_c$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, S. Ahmed, M. Albrecht, R. Aliberti, A. Amoroso, M. R. An, Q. An, X. H. Bai, Y. Bai, O. Bakina, R. Baldini Ferroli, I. Balossino, Y. Ban, K. Begzsuren, N. Berger, M. Bertani, D. Bettoni, F. Bianchi, J. Bloms, A. Bortone, I. Boyko, R. A. Briere , et al. (495 additional authors not shown)

    Abstract: Using $(27.12\pm 0.14)\times10^8$ $ψ(3686)$ decays and data samples of $e^+e^-$ collisions with $\sqrt{s}$ from 4.130 to 4.780~GeV collected with the BESIII detector, we report the first observation of the electromagnetic Dalitz transition $h_c\to e^+e^-η_c$ with a statistical significance of $5.4σ$. We measure the ratio of the branching fractions… ▽ More

    Submitted 2 July, 2024; v1 submitted 28 June, 2024; originally announced July 2024.

  42. arXiv:2407.00068  [pdf, other

    cs.DC cs.DM math.OC

    D&A: Resource Optimisation in Personalised PageRank Computations Using Multi-Core Machines

    Authors: Kai Siong Yow, Chunbo Li

    Abstract: Resource optimisation is commonly used in workload management, ensuring efficient and timely task completion utilising available resources. It serves to minimise costs, prompting the development of numerous algorithms tailored to this end. The majority of these techniques focus on scheduling and executing workloads effectively within the provided resource constraints. In this paper, we tackle this… ▽ More

    Submitted 18 June, 2024; originally announced July 2024.

    Comments: v1: 6 pages

    MSC Class: 68R10; 68W10; 91B32

  43. arXiv:2407.00029  [pdf, other

    cs.DC

    Distributed Inference Performance Optimization for LLMs on CPUs

    Authors: Pujiang He, Shan Zhou, Changqing Li, Wenhuan Huang, Weifei Yu, Duyi Wang, Chen Meng, Sheng Gui

    Abstract: Large language models (LLMs) hold tremendous potential for addressing numerous real-world challenges, yet they typically demand significant computational resources and memory. Deploying LLMs onto a resource-limited hardware device with restricted memory capacity presents considerable challenges. Distributed computing emerges as a prevalent strategy to mitigate single-node memory constraints and ex… ▽ More

    Submitted 16 May, 2024; originally announced July 2024.

    Comments: 4 pages, 3 figures, Practical ML for Low Resource Settings Workshop @ ICLR 2024

  44. arXiv:2406.20099  [pdf, other

    cs.CV

    Odd-One-Out: Anomaly Detection by Comparing with Neighbors

    Authors: Ankan Bhunia, Changjian Li, Hakan Bilen

    Abstract: This paper introduces a novel anomaly detection (AD) problem that focuses on identifying `odd-looking' objects relative to the other instances within a scene. Unlike the traditional AD benchmarks, in our setting, anomalies in this context are scene-specific, defined by the regular instances that make up the majority. Since object instances are often partly visible from a single viewpoint, our sett… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: Codes & Dataset at https://github.com/VICO-UoE/OddOneOutAD

  45. arXiv:2406.19964  [pdf, other

    cs.CR

    Secure Outsourced Decryption for HE-based Privacy-preserving Cloud Computing System

    Authors: Xirong Ma, Chuan Li, Yuchang Hu, Yunting Tao, Yali Jiang, Yanbin Li, Fanyu Kong, Chunpeng Ge

    Abstract: The demand for processing vast volumes of data has surged dramatically due to the advancement of machine learning technology. Large-scale data processing necessitates substantial computational resources, prompting individuals and enterprises to turn to cloud services. Accompanying this trend is a growing concern regarding data leakage and misuse. Homomorphic encryption (HE) is one solution for saf… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  46. arXiv:2406.19859  [pdf, other

    cs.AI cs.HC cs.MM

    MetaDesigner: Advancing Artistic Typography through AI-Driven, User-Centric, and Multilingual WordArt Synthesis

    Authors: Jun-Yan He, Zhi-Qi Cheng, Chenyang Li, **gdong Sun, Qi He, Wangmeng Xiang, Hanyuan Chen, **-Peng Lan, Xianhui Lin, Kang Zhu, Bin Luo, Yifeng Geng, Xuansong Xie, Alexander G. Hauptmann

    Abstract: MetaDesigner revolutionizes artistic typography synthesis by leveraging the strengths of Large Language Models (LLMs) to drive a design paradigm centered around user engagement. At the core of this framework lies a multi-agent system comprising the Pipeline, Glyph, and Texture agents, which collectively enable the creation of customized WordArt, ranging from semantic enhancements to the imposition… ▽ More

    Submitted 4 July, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

    Comments: 18 pages, 16 figures, Project: https://modelscope.cn/studios/WordArt/WordArt

  47. arXiv:2406.19393  [pdf, other

    cs.CV

    Looking 3D: Anomaly Detection with 2D-3D Alignment

    Authors: Ankan Bhunia, Changjian Li, Hakan Bilen

    Abstract: Automatic anomaly detection based on visual cues holds practical significance in various domains, such as manufacturing and product quality assessment. This paper introduces a new conditional anomaly detection problem, which involves identifying anomalies in a query image by comparing it to a reference shape. To address this challenge, we have created a large dataset, BrokenChairs-180K, consisting… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Accepted at CVPR'24. Codes & dataset available at https://github.com/VICO-UoE/Looking3D

  48. arXiv:2406.19310  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Imaging semiconductor-to-metal transition and topological flat bands of twisted bilayer MoTe2

    Authors: Yufeng Liu, Yu Gu, Ting Bao, Ning Mao, Can Li, Shudan Jiang, Liang Liu, Dandan Guan, Yaoyi Li, Hao Zheng, Canhua Liu, Kenji Watanabe, Takashi Taniguchi, Wenhui Duan, **feng Jia, Xiaoxue Liu, Yang Zhang, Tingxin Li, Shiyong Wang

    Abstract: Two-dimensional (2D) moiré materials have emerged as a highly tunable platform for investigating novel quantum states of matter arising from strong electronic correlations and nontrivial band topology. Recently, topological flat bands formed in 2D semiconducting moiré superlattices have attracted great interests. In particular, a series of topological quantum phases, including the long-sought frac… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  49. arXiv:2406.19190  [pdf, ps, other

    hep-ex

    Improved measurement of the semileptonic decay $D^+_{s}\to K^0 e^+ν_e$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

    Abstract: Analyzing $e^+e^-$ collision data corresponding to an integrated luminosity of $7.33~\mathrm{fb}^{-1}$ collected at center-of-mass energies between 4.128 and 4.226~GeV with the BESIII detector, we measure the branching fraction of the semileptonic decay $D^+_{s}\to K^0 e^+ν_e$ to be $(2.98\pm0.23\pm0.12)\times10^{-3}$. The $D_s^+\to K^0$ hadronic form factor is determined from the differential dec… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 13 pages, 6 figures

  50. arXiv:2406.18582  [pdf, other

    cs.CV cs.GR

    Canonical Consolidation Fields: Reconstructing Dynamic Shapes from Point Clouds

    Authors: Miaowei Wang, Changjian Li, Amir Vaxman

    Abstract: We present Canonical Consolidation Fields (CanFields): a method for reconstructing a time series of independently-sampled point clouds into a single deforming coherent shape. Such input often comes from motion capture. Existing methods either couple the geometry and the deformation, where by doing so they smooth fine details and lose the ability to track moving points, or they track the deformatio… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.