
Showing 201–250 of 8,727 results for author: Su

  1. arXiv:2406.03156  [pdf, other]

    hep-ex

    Observation of new charmonium(-like) states in $B^+ \to D^{*\pm} D^{\mp} K^+$ decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1062 additional authors not shown)

    Abstract: A study of resonant structures in $B^{+}\rightarrow{D^{\ast+}D^{-}K^{+}}$ and $B^{+}\rightarrow{D^{\ast-}D^{+}K^{+}}$ decays is performed, using proton-proton collision data at centre-of-mass energies of $\sqrt{s}=7, 8$, and $13$ TeV recorded by the LHCb experiment, corresponding to an integrated luminosity of 9 fb$^{-1}$. A simultaneous amplitude fit is performed to the two channels with contribu…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-047.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-047, CERN-EP-2024-096

  2. arXiv:2406.03136  [pdf, ps, other]

    cs.LG cs.AI cs.CC stat.ML

    Computational Limits of Low-Rank Adaptation (LoRA) for Transformer-Based Models

    Authors: Jerry Yao-Chieh Hu, Maojiang Su, En-Jui Kuo, Zhao Song, Han Liu

    Abstract: We study the computational limits of Low-Rank Adaptation (LoRA) update for finetuning transformer-based models using fine-grained complexity theory. Our key observation is that the existence of low-rank decompositions within the gradient computation of LoRA adaptation leads to possible algorithmic speedup. This allows us to (i) identify a phase transition behavior and (ii) prove the existence of n…

    Submitted 5 June, 2024; originally announced June 2024.

  3. arXiv:2406.02931  [pdf, other]

    hep-ex

    Measurements of the branching fractions of the $P$-wave charmonium spin-singlet state $h_c(^1P_1) \to h^+ h^-π^0/η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

    Abstract: Based on $(2712.4\pm 14.3)\times10^{6}$ $ψ(3686)$ events, we investigate four hadronic decay modes of the $P$-wave charmonium spin-singlet state $h_c(^1P_1) \to h^+ h^- π^0/η$ ($h=π$ or $K$) via the process $ψ(3686) \to π^{0}h_c$ at BESIII. The $h_c \to π^+ π^- π^0$ decay is observed with a significance of 9.6$σ$ after taking into account systematic uncertainties. Evidences for…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 9 pages, 7 figures

  4. arXiv:2406.02925  [pdf, other]

    eess.AS cs.AI cs.CL cs.LG cs.SD

    Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition

    Authors: Hsuan Su, Hua Farn, Fan-Yun Sun, Shang-Tse Chen, Hung-yi Lee

    Abstract: Synthetic data is widely used in speech recognition due to the availability of text-to-speech models, which facilitate adapting models to previously unseen text domains. However, existing methods suffer in performance when they fine-tune an automatic speech recognition (ASR) model on synthetic data as they suffer from the distributional shift commonly referred to as the synthetic-to-real gap. In t…

    Submitted 15 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  5. arXiv:2406.02825  [pdf, ps, other]

    math.CO math.LO

    Continuous Edge Chromatic Numbers of Abelian Group Actions

    Authors: Su Gao, Ruijun Wang, Tianhao Wang

    Abstract: We prove that for any generating set $S$ of $Γ=\mathbb {Z}^n$, the continuous edge chromatic number of the Schreier graph of the Bernoulli shift action $G=F(S,2^Γ)$ is $χ'_c(G)=χ'(G)+1$. In particular, for the standard generating set, the continuous edge chromatic number of $F(2^{\mathbb {Z}^n})$ is $2n+1$.

    Submitted 4 June, 2024; originally announced June 2024.

    MSC Class: 03E15

  6. arXiv:2406.02783  [pdf, other]

    astro-ph.SR

    High-resolution Observation of Blowout Jets Regulated by Sunspot Rotation

    Authors: Tingyu Gou, Rui Liu, Yang Su, Astrid M. Veronig, Hanya Pan, Runbin Luo, Weiqun Gan

    Abstract: Coronal jets are believed to be the miniature version of large-scale solar eruptions. In particular, the eruption of a mini-filament inside the base arch is suggested to be the trigger and even driver of blowout jets. Here we propose an alternative triggering mechanism, based on high-resolution H-alpha observations of a blowout jet associated with a mini-filament and an M1.2-class flare. The mini-…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 16 pages, 10 figures, accepted in Solar Physics

  7. arXiv:2406.02668  [pdf, other]

    quant-ph

    Latent Style-based Quantum GAN for high-quality Image Generation

    Authors: Su Yeon Chang, Supanut Thanasilp, Bertrand Le Saux, Sofia Vallecorsa, Michele Grossi

    Abstract: Quantum generative modeling is among the promising candidates for achieving a practical advantage in data analysis. Nevertheless, one key challenge is to generate large-size images comparable to those generated by their classical counterparts. In this work, we take an initial step in this direction and introduce the Latent Style-based Quantum GAN (LaSt-QGAN), which employs a hybrid classical-quant…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 28 pages, 22 figures

  8. arXiv:2406.02376  [pdf, other]

    cs.CL

    Retaining Key Information under High Compression Ratios: Query-Guided Compressor for LLMs

    Authors: Zhiwei Cao, Qian Cao, Yu Lu, Ningxin Peng, Luyang Huang, Shanbo Cheng, Jinsong Su

    Abstract: The growing popularity of Large Language Models has sparked interest in context compression for Large Language Models (LLMs). However, the performance of previous methods degrades dramatically as compression ratios increase, sometimes even falling to the closed-book level. This decline can be attributed to the loss of key information during the compression process. Our preliminary study supports t…

    Submitted 17 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024

  9. arXiv:2406.01658  [pdf, other]

    cs.CV

    Proxy Denoising for Source-Free Domain Adaptation

    Authors: Song Tang, Wenxin Su, Mao Ye, Jianwei Zhang, Xiatian Zhu

    Abstract: Source-free Domain Adaptation (SFDA) aims to adapt a pre-trained source model to an unlabeled target domain with no access to the source data. Inspired by the success of pre-trained large vision-language (ViL) models in many other applications, the latest SFDA methods have also validated the benefit of ViL models by leveraging their predictions as pseudo supervision. However, we observe that ViL's…

    Submitted 3 June, 2024; originally announced June 2024.

  10. arXiv:2406.01591  [pdf, other]

    cs.CV

    DeNVeR: Deformable Neural Vessel Representations for Unsupervised Video Vessel Segmentation

    Authors: Chun-Hung Wu, Shih-Hong Chen, Chih-Yao Hu, Hsin-Yu Wu, Kai-Hsin Chen, Yu-You Chen, Chih-Hai Su, Chih-Kuo Lee, Yu-Lun Liu

    Abstract: This paper presents Deformable Neural Vessel Representations (DeNVeR), an unsupervised approach for vessel segmentation in X-ray videos without annotated ground truth. DeNVeR uses optical flow and layer separation, enhancing segmentation accuracy and adaptability through test-time training. A key component of our research is the introduction of the XACV dataset, the first X-ray angiography coronar…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Project page: https://kirito878.github.io/DeNVeR/

  11. arXiv:2406.01481  [pdf, other]

    cs.LG cs.AI

    Learning from Streaming Data when Users Choose

    Authors: Jinyan Su, Sarah Dean

    Abstract: In digital markets comprised of many competing services, each user chooses between multiple service providers according to their preferences, and the chosen service makes use of the user data to incrementally improve its model. The service providers' models influence which service the user will choose at the next time step, and the user's choice, in return, influences the model update, leading to…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted by ICML24

  12. arXiv:2406.01392  [pdf, other]

    cs.CL

    Sparsity-Accelerated Training for Large Language Models

    Authors: Da Ma, Lu Chen, Pengyu Wang, Hongshen Xu, Hanqi Li, Liangtai Sun, Su Zhu, Shuai Fan, Kai Yu

    Abstract: Large language models (LLMs) have demonstrated proficiency across various natural language processing (NLP) tasks but often require additional training, such as continual pre-training and supervised fine-tuning. However, the costs associated with this, primarily due to their large parameter count, remain high. This paper proposes leveraging sparsity in pre-trained LLMs to expedite this trai…

    Submitted 6 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 Findings

  13. arXiv:2406.01375  [pdf, other]

    cs.CL

    D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models

    Authors: Haoran Que, Jiaheng Liu, Ge Zhang, Chenchen Zhang, Xingwei Qu, Yinghao Ma, Feiyu Duan, Zhiqi Bai, Jiakai Wang, Yuanxing Zhang, Xu Tan, Jie Fu, Wenbo Su, Jiamang Wang, Lin Qu, Bo Zheng

    Abstract: Continual Pre-Training (CPT) on Large Language Models (LLMs) has been widely used to expand the model's fundamental understanding of specific downstream domains (e.g., math and code). For the CPT on domain-specific LLMs, one important question is how to choose the optimal mixture ratio between the general-corpus (e.g., Dolma, Slim-pajama) and the downstream domain-corpus. Existing methods usually…

    Submitted 3 June, 2024; originally announced June 2024.

  14. arXiv:2406.01359  [pdf, other]

    cs.CL cs.SE

    R2C2-Coder: Enhancing and Benchmarking Real-world Repository-level Code Completion Abilities of Code Large Language Models

    Authors: Ken Deng, Jiaheng Liu, He Zhu, Congnan Liu, Jingxin Li, Jiakai Wang, Peng Zhao, Chenchen Zhang, Yanan Wu, Xueqiao Yin, Yuanxing Zhang, Wenbo Su, Bangyu Xiang, Tiezheng Ge, Bo Zheng

    Abstract: Code completion models have made significant progress in recent years. Recently, repository-level code completion has drawn more attention in modern software development, and several baseline methods and benchmarks have been proposed. However, existing repository-level code completion methods often fall short of fully using the extensive context of a project repository, such as the intricacies of…

    Submitted 3 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  15. arXiv:2406.01332  [pdf, ps, other]

    hep-ex

    Measurements of the branching fractions of semileptonic $D^{+}_s$ decays via $e^+e^-\to D_s^{*+}D_s^{*-}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: We measure the absolute branching fractions of semileptonic $D^+_s$ decays via the $e^+e^-\to D_s^{*+}D_s^{*-}$ process using $e^+e^-$ collision data corresponding to an integrated luminosity of $10.64~\mathrm{fb}^{-1}$ collected by the BESIII detector at center-of-mass energies between 4.237 and 4.699 GeV. The branching fractions are…

    Submitted 4 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 14 pages, 3 figures

  16. arXiv:2406.01224  [pdf, other]

    cs.CL

    Demonstration Augmentation for Zero-shot In-context Learning

    Authors: Yi Su, Yunpeng Tai, Yixin Ji, Juntao Li, Bowen Yan, Min Zhang

    Abstract: Large Language Models (LLMs) have demonstrated an impressive capability known as In-context Learning (ICL), which enables them to acquire knowledge from textual demonstrations without the need for parameter updates. However, many studies have highlighted that the model's performance is sensitive to the choice of demonstrations, presenting a significant challenge for practical applications where we…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 Findings

  17. arXiv:2406.00976  [pdf, other]

    cs.CL cs.SD eess.AS

    Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer

    Authors: Yongxin Zhu, Dan Su, Liqiang He, Linli Xu, Dong Yu

    Abstract: While recent advancements in speech language models have achieved significant progress, they face remarkable challenges in modeling the long acoustic sequences of neural audio codecs. In this paper, we introduce Generative Pre-trained Speech Transformer (GPST), a hierarchical transformer designed for efficient speech language modeling. GPST quantizes audio wavef…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accept in ACL2024-main

  18. arXiv:2406.00941  [pdf, ps, other]

    econ.EM

    A Robust Residual-Based Test for Structural Changes in Factor Models

    Authors: Bin Peng, Liangjun Su, Yayi Yan

    Abstract: In this paper, we propose an easy-to-implement residual-based specification testing procedure for detecting structural changes in factor models, which is powerful against both smooth and abrupt structural changes with unknown break dates. The proposed test is robust against the over-specified number of factors, and serially and cross-sectionally correlated error processes. A new central limit theo…

    Submitted 2 June, 2024; originally announced June 2024.

  19. arXiv:2406.00781  [pdf, other]

    astro-ph.HE astro-ph.GA

    ViCTORIA project: The LOFAR-MeerKAT view of AGN in Virgo cluster early-type galaxies

    Authors: A. Spasic, H. W. Edler, Y. Su, M. Brüggen, F. de Gasperin, T. Pasini, V. Heesen, M. Simonte, A. Boselli, H. J. A. Röttgering, M. Fossati

    Abstract: The evolution of Active Galactic Nuclei (AGN) is closely connected to their host galaxies and surroundings. Via feedback processes, AGN can counteract the cooling of the intracluster medium (ICM) and suppress star formation in their host galaxies. Radio observations at low frequencies provide a glimpse into the history of AGN activity. The Virgo cluster is a substantial reservoir of nearby galaxie…

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: 17 pages, 10 figures

  20. arXiv:2406.00769  [pdf, other]

    astro-ph.SR

    Filament eruption by multiple reconnections

    Authors: Y. Liu, G. P. Ruan, B. Schmieder, J. H. Guo, Y. Chen, R. S. Zheng, J. T. Su, B. Wang

    Abstract: Filament eruption is a common phenomenon in solar activity, but the triggering mechanism is not well understood. We focus our study on a filament eruption located in a complex nest of three active regions close to a coronal hole. The filament eruption is observed at multiple wavelengths: by the GONG, the STEREO, the SUTRI, and the AIA and Helioseismic and Magnetic Imager (HMI) on board the SDO. Th…

    Submitted 2 June, 2024; originally announced June 2024.

  21. arXiv:2406.00756  [pdf, other]

    hep-ph

    An AI-Inspired Numerical Method in the Quark Model: Application to Finding the Wave Functions for Heavy Tetraquark States

    Authors: Daeho Park, Su Houng Lee

    Abstract: The current ongoing advancements in AI have shed light on the landscape of numerical analysis in science. Inspired by the path of achievement of AI, we have developed a method to construct accurate ground state wave functions of multiquark configurations within a quark model. We successfully tested our method through comparisons with meson-type two-body systems with analytic and numerical solution…

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: 7 pages, 5 figures

  22. arXiv:2406.00644  [pdf, other]

    cs.CV

    Ultrasound Report Generation with Cross-Modality Feature Alignment via Unsupervised Guidance

    Authors: Jun Li, Tongkun Su, Baoliang Zhao, Faqin Lv, Qiong Wang, Nassir Navab, Ying Hu, Zhongliang Jiang

    Abstract: Automatic report generation has arisen as a significant research area in computer-aided diagnosis, aiming to alleviate the burden on clinicians by generating reports automatically based on medical images. In this work, we propose a novel framework for automatic ultrasound report generation, leveraging a combination of unsupervised and supervised learning methods to aid the report generation proces…

    Submitted 2 June, 2024; originally announced June 2024.

  23. arXiv:2406.00446  [pdf, other]

    cs.CV cs.AI

    GLCAN: Global-Local Collaborative Auxiliary Network for Local Learning

    Authors: Feiyu Zhu, Yuming Zhang, Changpeng Cai, Guinan Guo, Jiao Li, Xiuyuan Guo, Quanwei Zhang, Peizhe Wang, Chenghao He, Junhao Su

    Abstract: Traditional deep neural networks typically use end-to-end backpropagation, which often places a big burden on GPU memory. Another promising training method is local learning, which involves splitting the network into blocks and training them in parallel with the help of an auxiliary network. Local learning has been widely studied and applied to image classification tasks, and its performance is co…

    Submitted 1 June, 2024; originally announced June 2024.

  24. arXiv:2406.00341  [pdf, other]

    eess.IV cs.CV

    DSCA: A Digital Subtraction Angiography Sequence Dataset and Spatio-Temporal Model for Cerebral Artery Segmentation

    Authors: Qihang Xie, Mengguo Guo, Lei Mou, Dan Zhang, Da Chen, Caifeng Shan, Yitian Zhao, Ruisheng Su, Jiong Zhang

    Abstract: Cerebrovascular diseases (CVDs) remain a leading cause of global disability and mortality. Digital Subtraction Angiography (DSA) sequences, recognized as the golden standard for diagnosing CVDs, can clearly visualize the dynamic flow and reveal pathological conditions within the cerebrovasculature. Therefore, precise segmentation of cerebral arteries (CAs) and classification between their main tru…

    Submitted 1 June, 2024; originally announced June 2024.

  25. arXiv:2406.00276  [pdf]

    cs.LG cs.AI cs.CE physics.data-an

    Non-destructive Degradation Pattern Decoupling for Ultra-early Battery Prototype Verification Using Physics-informed Machine Learning

    Authors: Shengyu Tao, Mengtian Zhang, Zixi Zhao, Haoyang Li, Ruifei Ma, Yunhong Che, Xin Sun, Lin Su, Xiangyu Chen, Zihao Zhou, Heng Chang, Tingwei Cao, Xiao Xiao, Yaojun Liu, Wenjun Yu, Zhongling Xu, Yang Li, Han Hao, Xuan Zhang, Xiaosong Hu, Guangmin Zhou

    Abstract: Manufacturing complexities and uncertainties have impeded the transition from material prototypes to commercial batteries, making prototype verification critical to quality assessment. A fundamental challenge involves deciphering intertwined chemical processes to characterize degradation patterns and their quantitative relationship with battery performance. Here we show that a physics-informed mac…

    Submitted 31 May, 2024; originally announced June 2024.

    ACM Class: J.2; G.3

  26. arXiv:2406.00262  [pdf, other]

    cs.LG cs.AI

    Contrastive Learning Via Equivariant Representation

    Authors: Sifan Song, Jinfeng Wang, Qiaochu Zhao, Xiang Li, Dufan Wu, Angelos Stefanidis, Jionglong Su, S. Kevin Zhou, Quanzheng Li

    Abstract: Invariant-based Contrastive Learning (ICL) methods have achieved impressive performance across various domains. However, the absence of latent space representation for distortion (augmentation)-related information in the latent space makes ICL sub-optimal regarding training efficiency and robustness in downstream tasks. Recent studies suggest that introducing equivariance into Contrastive Learning…

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: Preprint. Under review

  27. arXiv:2406.00252  [pdf, other]

    cs.AI cs.CL cs.CV cs.MA

    Multi-Modal and Multi-Agent Systems Meet Rationality: A Survey

    Authors: Bowen Jiang, Yangxinyu Xie, Xiaomeng Wang, Weijie J. Su, Camillo J. Taylor, Tanwi Mallick

    Abstract: Rationality is the quality of being guided by reason, characterized by logical thinking and decision-making that align with evidence and logical rules. This quality is essential for effective problem-solving, as it ensures that solutions are well-founded and systematically derived. Despite the advancements of large language models (LLMs) in generating human-like text with remarkable accuracy, they…

    Submitted 18 June, 2024; v1 submitted 31 May, 2024; originally announced June 2024.

  28. arXiv:2406.00235  [pdf, other]

    hep-ex

    Amplitude analysis of the radiative decay $B^0_s\to K^+K^-γ$

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1061 additional authors not shown)

    Abstract: A search for radiative decay of $B^0_s$ mesons to orbitally excited $K^+K^-$ states is performed using proton-proton collisions recorded by the LHCb experiment, corresponding to an integrated luminosity of 9 fb$^{-1}$. The dikaon spectrum in the mass range $m_{KK}<2400$ MeV/$c^2$ is dominated by the $φ(1020)$ resonance that accounts for alm…

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2024-002.html (LHCb public pages)

    Report number: LHCb-PAPER-2024-002, CERN-EP-2024-115

  29. arXiv:2405.21001  [pdf]

    physics.ins-det

    Progress in patterned wax stamp for prototyping of paper-based microfluidic analytical devices via injection molding

    Authors: Zhizhi Zhou, Jiahuan Jiang, Yuanyuan Sun, Qing Qin, Sitong Yuan, Xilin Wang, Jianhua Jiang, Yifeng Su, Xing Hu, Mingying Liu, Feng Yang

    Abstract: In this study, we successfully developed two-dimensional paper-based analytical devices using a hybrid technique of injection molding and embossing. This innovative approach involves passive or active delivery of molten wax onto a glass substrate through a sealed chip, facilitating wax stamp creation.

    Submitted 31 May, 2024; originally announced May 2024.

  30. arXiv:2405.20810  [pdf, other]

    cs.CV

    Context-aware Difference Distilling for Multi-change Captioning

    Authors: Yunbin Tu, Liang Li, Li Su, Zheng-Jun Zha, Chenggang Yan, Qingming Huang

    Abstract: Multi-change captioning aims to describe complex and coupled changes within an image pair in natural language. Compared with single-change captioning, this task requires the model to have higher-level cognition ability to reason an arbitrary number of changes. In this paper, we propose a novel context-aware difference distilling (CARD) network to capture all genuine changes for yielding sentences.…

    Submitted 7 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: Accepted by ACL 2024 main conference (long paper)

  31. arXiv:2405.20727  [pdf, other]

    cs.CR cs.AI cs.DC

    GANcrop: A Contrastive Defense Against Backdoor Attacks in Federated Learning

    Authors: Xiaoyun Gan, Shanyu Gan, Taizhi Su, Peng Liu

    Abstract: With heightened awareness of data privacy protection, Federated Learning (FL) has attracted widespread attention as a privacy-preserving distributed machine learning method. However, the distributed nature of federated learning also provides opportunities for backdoor attacks, where attackers can guide the model to produce incorrect predictions without affecting the global model training process.…

    Submitted 31 May, 2024; originally announced May 2024.

  32. arXiv:2405.20676  [pdf, other]

    hep-ex

    Search for $e^{+}e^{-}\toη'ψ(2S)$ at center-of-mass energies from 4.66 to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Using data samples with an integrated luminosity of $4.67~\mathrm{fb}^{-1}$ collected by the BESIII detector operating at the BEPCII collider, we search for the process $e^+e^- \rightarrow η' ψ(2S)$ at center-of-mass energies from $4.66$ to $4.95~\mathrm{GeV}$. No significant signal is observed, and upper limits for the Born cross sections $σ^B(e^+e^-\rightarrowη'ψ(2S))$ at the 90% confidence lev…

    Submitted 31 May, 2024; originally announced May 2024.

  33. arXiv:2405.20638  [pdf, other]

    hep-ex

    Study of the decays $χ_{cJ} \rightarrow Λ\barΛφ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (637 additional authors not shown)

    Abstract: Based on $(2712.4 \pm 14.3) \times 10^{6}$ $ e^{+}e^{-}\toψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, we report the first evidence of $χ_{c0}\to Λ\bar Λφ$ decays and the first observation of $χ_{c1,2}\to Λ\bar Λφ$ decays, with significances of $4.5σ$, $11.3σ$ and $13.0σ$, respectively. The decay branching fractions of $χ_{c0,1,2}\to Λ\bar Λφ$ are measured t…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 10 pages, 9 figures

  34. arXiv:2405.20195  [pdf, other]

    cs.HC

    Using Large Language Models for Humanitarian Frontline Negotiation: Opportunities and Considerations

    Authors: Zilin Ma, Susannah Su, Nathan Zhao, Linn Bieske, Blake Bullwinkel, Yanyi Zhang, Sophia Yang, Ziqing Luo, Siyao Li, Gekai Liao, Boxiang Wang, Jinglun Gao, Zihan Wen, Claude Bruderlein, Weiwei Pan

    Abstract: Humanitarian negotiations in conflict zones, called frontline negotiation, are often highly adversarial, complex, and high-risk. Several best-practices have emerged over the years that help negotiators extract insights from large datasets to navigate nuanced and rapidly evolving scenarios. Recent advances in large language models (LLMs) have sparked interest in the potential for AI to aid d…

    Submitted 30 May, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  35. arXiv:2405.20071  [pdf]

    physics.med-ph cs.LG

    A Staged Approach using Machine Learning and Uncertainty Quantification to Predict the Risk of Hip Fracture

    Authors: Anjum Shaik, Kristoffer Larsen, Nancy E. Lane, Chen Zhao, Kuan-Jui Su, Joyce H. Keyak, Qing Tian, Qiuying Sha, Hui Shen, Hong-Wen Deng, Weihua Zhou

    Abstract: Despite advancements in medical care, hip fractures impose a significant burden on individuals and healthcare systems. This paper focuses on the prediction of hip fracture risk in older and middle-aged adults, where falls and compromised bone quality are predominant factors. We propose a novel staged model that combines advanced imaging and clinical data to improve predictive performance. By using…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 29 pages, 5 figures, 6 tables

  36. arXiv:2405.19885  [pdf, other]

    cs.LG cs.RO

    Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning

    Authors: Hengkai Tan, Songming Liu, Kai Ma, Chengyang Ying, Xingxing Zhang, Hang Su, Jun Zhu

    Abstract: Transformer has shown promise in reinforcement learning to model time-varying features for obtaining generalized low-level robot policies on diverse robotics datasets in embodied learning. However, it still suffers from the issues of low data efficiency and high inference latency. In this paper, we propose to investigate the task from a new perspective of the frequency domain. We first observe tha…

    Submitted 5 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  37. arXiv:2405.19813  [pdf, other]

    cs.RO

    SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization

    Authors: Jiang Wang, Yuanzheng He, Daobilige Su, Katsutoshi Itoyama, Kazuhiro Nakadai, Junfeng Wu, Shoudong Huang, Youfu Li, He Kong

    Abstract: Robot audition systems with multiple microphone arrays have many applications in practice. However, accurate calibration of multiple microphone arrays remains challenging because there are many unknown parameters to be identified, including the relative transforms (i.e., orientation, translation) and asynchronous factors (i.e., initial time offset and sampling clock difference) between microphone…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: This paper was accepted to and going to appear in the IEEE Transactions on Robotics

  38. arXiv:2405.19802  [pdf, other]

    cs.MM

    Exploring the Robustness of Decision-Level Through Adversarial Attacks on LLM-Based Embodied Models

    Authors: Shuyuan Liu, Jiawei Chen, Shouwei Ruan, Hang Su, Zhaoxia Yin

    Abstract: Embodied intelligence empowers agents with a profound sense of perception, enabling them to respond in a manner closely aligned with real-world situations. Large Language Models (LLMs) delve into language instructions with depth, serving a crucial role in generating plans for intricate tasks. Thus, LLM-based embodied models further enhance the agent's capacity to comprehend and process information…

    Submitted 3 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  39. arXiv:2405.19791  [pdf, other]

    cond-mat.mes-hall physics.optics

    Trembling Motion of Exciton-Polaritons Close to the Rashba-Dresselhaus Regime

    Authors: Wen Wen, Jie Liang, Huawen Xu, Feng Jin, Yuri G. Rubo, Timothy C. H. Liew, Rui Su

    Abstract: We report the experimental emulation of trembling quantum motion, or Zitterbewegung, of exciton polaritons in a perovskite microcavity at room temperature. By introducing liquid crystal molecules into the microcavity, we achieve spinor states with synthetic Rashba-Dresselhaus spin-orbit coupling and tunable energy splitting. Under a resonant excitation, the polariton fluid exhibits clear trembling…

    Submitted 30 May, 2024; originally announced May 2024.

  40. arXiv:2405.19789  [pdf, other]

    cs.LG cs.DC

    Estimating before Debiasing: A Bayesian Approach to Detaching Prior Bias in Federated Semi-Supervised Learning

    Authors: Guogang Zhu, Xuefeng Liu, Xinghao Wu, Shaojie Tang, Chao Tang, Jianwei Niu, Hao Su

    Abstract: Federated Semi-Supervised Learning (FSSL) leverages both labeled and unlabeled data on clients to collaboratively train a model. In FSSL, the heterogeneous data can introduce prediction bias into the model, causing the model's prediction to skew towards certain classes. Existing FSSL methods primarily tackle this issue by enhancing consistency in model parameters or outputs. However, as the mo…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted by IJCAI 2024

  41. arXiv:2405.19734  [pdf, other]

    hep-ex

    Search for the decay $B^{0} \to \gamma\gamma$ using Belle and Belle II data

    Authors: Belle and Belle II Collaborations: I. Adachi, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, S. Al Said, N. Althubiti, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, et al. (385 additional authors not shown)

    Abstract: We report the result of a search for the rare decay $B^{0} \to \gamma\gamma$ using a combined dataset of $753\times10^{6}$ $B\bar{B}$ pairs collected by the Belle experiment and $387\times10^{6}$ $B\bar{B}$ pairs collected by the Belle II experiment from decays of the $\Upsilon(4S)$ resonance produced in $e^{+}e^{-}$ collisions. A simultaneous fit to the Belle and Belle II data sets yields…

    Submitted 30 May, 2024; originally announced May 2024.

    Report number: Belle II Preprint: 2024-017, KEK Preprint: 2024-13

  42. arXiv:2405.19668  [pdf, other]

    cs.CV

    AutoBreach: Universal and Adaptive Jailbreaking with Efficient Wordplay-Guided Optimization

    Authors: Jiawei Chen, Xiao Yang, Zhengwei Fang, Yu Tian, Yinpeng Dong, Zhaoxia Yin, Hang Su

    Abstract: Despite the widespread application of large language models (LLMs) across various tasks, recent studies indicate that they are susceptible to jailbreak attacks, which can render their defense mechanisms ineffective. However, previous jailbreak research has frequently been constrained by limited universality, suboptimal efficiency, and a reliance on manual crafting. In response, we rethink the appr…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Under review

  43. arXiv:2405.19665  [pdf]

    eess.SY cs.AI cs.LG

    A novel fault localization with data refinement for hydroelectric units

    Authors: Jialong Huang, Junlin Song, Penglong Lian, Mengjie Gan, Zhiheng Su, Benhao Wang, Wenji Zhu, Xiaomin Pu, Jianxiao Zou, Shicai Fan

    Abstract: Due to the scarcity of fault samples and the complexity of non-linear and non-smooth characteristics data in hydroelectric units, most traditional hydroelectric unit fault localization methods struggle to achieve accurate localization. To address these problems, a sparse autoencoder (SAE)-generative adversarial network (GAN)-wavelet noise reduction (WNR)-manifold-boosted deep learni…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 6 pages, 4 figures, Conference on Decision and Control (CDC)

  44. arXiv:2405.19654  [pdf, other]

    cs.AI

    Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training

    Authors: Jinxia Yang, Bing Su, Wayne Xin Zhao, Ji-Rong Wen

    Abstract: Medical vision-language pre-training methods mainly leverage the correspondence between paired medical images and radiological reports. Although multi-view spatial images and temporal sequences of image-report pairs are available in off-the-shelf multi-modal medical datasets, most existing methods have not thoroughly tapped into such extensive supervision signals. In this paper, we introduce the M…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted at ICML 2024

  45. arXiv:2405.19642  [pdf]

    cs.AI

    Few-shot fault diagnosis based on multi-scale graph convolution filtering for industry

    Authors: Mengjie Gan, Penglong Lian, Zhiheng Su, Jiyang Zhang, Jialong Huang, Benhao Wang, Jianxiao Zou, Shicai Fan

    Abstract: Industrial equipment fault diagnosis often encounters challenges such as the scarcity of fault data, complex operating conditions, and varied types of failures. Signal analysis, data statistical learning, and conventional deep learning techniques face constraints under these conditions due to their substantial data requirements and the necessity for transfer learning to accommodate new failure mode…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 6 pages, 2 figures, 2 tables, 63rd IEEE Conference on Decision and Control

  46. arXiv:2405.19581  [pdf, other]

    cs.SE cs.AI

    Source Code Foundation Models are Transferable Binary Analysis Knowledge Bases

    Authors: Zian Su, Xiangzhe Xu, Ziyang Huang, Kaiyuan Zhang, Xiangyu Zhang

    Abstract: Human-Oriented Binary Reverse Engineering (HOBRE) lies at the intersection of binary and source code, aiming to lift binary code to human-readable content relevant to source code, thereby bridging the binary-source semantic gap. Recent advancements in uni-modal code model pre-training, particularly in generative Source Code Foundation Models (SCFMs) and binary understanding models, have laid the g…

    Submitted 29 May, 2024; originally announced May 2024.

  47. arXiv:2405.19564  [pdf, ps, other]

    quant-ph

    Quantum error detection with noise-resilient parity-controlled gate in two-dimensional Rydberg atom arrays

    Authors: F. Q. Guo, S. L. Su, Weibin Li, X. Q. Shao

    Abstract: Quantum error detection relies primarily on precise measurement of qubit parity, a fundamental operation in quantum information processing. Here, we introduce a resilient parity-controlled gate tailored for detecting quantum errors within a 2D Rydberg atom array. Our method enables the discrimination between even and odd parities of virtually excited control atoms by tracking the dynamic evolution…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 17 pages, 17 figures, comments are welcome

  48. arXiv:2405.19524  [pdf, other]

    cs.CR cs.AI

    AI Risk Management Should Incorporate Both Safety and Security

    Authors: Xiangyu Qi, Yangsibo Huang, Yi Zeng, Edoardo Debenedetti, Jonas Geiping, Luxi He, Kaixuan Huang, Udari Madhushani, Vikash Sehwag, Weijia Shi, Boyi Wei, Tinghao Xie, Danqi Chen, Pin-Yu Chen, Jeffrey Ding, Ruoxi Jia, Jiaqi Ma, Arvind Narayanan, Weijie J Su, Mengdi Wang, Chaowei Xiao, Bo Li, Dawn Song, Peter Henderson, Prateek Mittal

    Abstract: The exposure of security vulnerabilities in safety-aligned language models, e.g., susceptibility to adversarial attacks, has shed light on the intricate interplay between AI safety and AI security. Although the two disciplines now come together under the overarching goal of AI risk management, they have historically evolved separately, giving rise to differing perspectives. Therefore, in this pape…

    Submitted 29 May, 2024; originally announced May 2024.

  49. arXiv:2405.19434  [pdf, other]

    astro-ph.HE

    The Pulsar Science Collaboratory: Multi-Epoch Scintillation Studies of Pulsars

    Authors: Jacob E. Turner, Juan G. Lebron Medina, Zachary Zelensky, Kathleen A. Gustavso, Jeffrey Marx, Manvith Kothapalli, Luis D. Cruz Vega, Alexander Lee, Caryelis B. Figueroa, Daniel E. Reichart, Joshua B. Haislip, Vladimir V. Kouprianov, Steve White, Frank Ghigo, Sue Ann Heatherly, Maura A. McLaughlin

    Abstract: We report on findings from scintillation analyses using high-cadence observations of nine canonical pulsars with observing baselines ranging from one to three years. We obtain scintillation bandwidth and timescale measurements for all pulsars in our survey and obtain scintillation arc curvature measurements for four pulsars, detecting multiple arcs for two of them. Using updated pulsar distance es…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Submitted to ApJ

  50. arXiv:2405.19207  [pdf]

    cs.IR cs.AI

    A Multi-Source Retrieval Question Answering Framework Based on RAG

    Authors: Ridong Wu, Shuhong Chen, Xiangbiao Su, Yuankai Zhu, Yifei Liao, Jianming Wu

    Abstract: With the rapid development of large-scale language models, Retrieval-Augmented Generation (RAG) has been widely adopted. However, existing RAG paradigms are inevitably influenced by erroneous retrieval information, thereby reducing the reliability and correctness of generated results. Therefore, to improve the relevance of retrieval information, this study proposes a method that replaces tradition…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 4 pages, 3 figures