Showing 101–150 of 1,510 results for author: Guo, Q

  1. arXiv:2403.19091  [pdf, other]

    hep-ex

    Observation of the semileptonic decays $D^0\rightarrow K_S^0π^-π^0 e^+ ν_e$ and $D^+\rightarrow K_S^0π^+π^- e^+ ν_e$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (600 additional authors not shown)

    Abstract: By analyzing $e^+e^-$ annihilation data corresponding to an integrated luminosity of 2.93 $\rm fb^{-1}$ collected at a center-of-mass energy of 3.773 GeV with the \text{BESIII} detector, the first observation of the semileptonic decays $D^0\rightarrow K_S^0π^-π^0 e^+ ν_e$ and $D^+\rightarrow K_S^0π^+π^- e^+ ν_e$ is reported. With a dominant hadronic contribution from $K_1(1270)$, the branching fra…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 19 pages

  2. arXiv:2403.19066  [pdf, other]

    cs.CV cs.AI

    Generative Quanta Color Imaging

    Authors: Vishal Purohit, Junjie Luo, Yiheng Chi, Qi Guo, Stanley H. Chan, Qiang Qiu

    Abstract: The astonishing development of single-photon cameras has created an unprecedented opportunity for scientific and industrial imaging. However, the high data throughput generated by these 1-bit sensors creates a significant bottleneck for low-power applications. In this paper, we explore the possibility of generating a color image from a single binary frame of a single-photon camera. We evidently fi…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted at IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  3. arXiv:2403.18554  [pdf, other]

    cs.CV

    CosalPure: Learning Concept from Group Images for Robust Co-Saliency Detection

    Authors: Jiayi Zhu, Qing Guo, Felix Juefei-Xu, Yihao Huang, Yang Liu, Geguang Pu

    Abstract: Co-salient object detection (CoSOD) aims to identify the common and salient (usually in the foreground) regions across a given group of images. Although achieving significant progress, state-of-the-art CoSODs could be easily affected by some adversarial perturbations, leading to substantial accuracy reduction. The adversarial perturbations can mislead CoSODs but do not change the high-level semant…

    Submitted 11 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: This paper is accepted by CVPR 2024

  4. arXiv:2403.17372  [pdf, other]

    cs.IR

    An Empirical Study of Training ID-Agnostic Multi-modal Sequential Recommenders

    Authors: Youhua Li, Hanwen Du, Yongxin Ni, Yuanqi He, Junchen Fu, Xiangyan Liu, Qi Guo

    Abstract: Sequential Recommendation (SR) aims to predict future user-item interactions based on historical interactions. While many SR approaches concentrate on user IDs and item IDs, the human perception of the world through multi-modal signals, like text and images, has inspired researchers to delve into constructing SR from multi-modal information without using IDs. However, the complexity of multi-modal…

    Submitted 30 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: An Empirical Study of Training ID-Agnostic Multi-modal Sequential Recommenders

  5. arXiv:2403.17297  [pdf, other]

    cs.CL cs.AI

    InternLM2 Technical Report

    Authors: Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang , et al. (75 additional authors not shown)

    Abstract: The evolution of Large Language Models (LLMs) like ChatGPT and GPT-4 has sparked discussions on the advent of Artificial General Intelligence (AGI). However, replicating such advancements in open-source models has been challenging. This paper introduces InternLM2, an open-source LLM that outperforms its predecessors in comprehensive evaluations across 6 dimensions and 30 benchmarks, long-context m…

    Submitted 25 March, 2024; originally announced March 2024.

  6. arXiv:2403.16935  [pdf, other]

    quant-ph

    Measuring Spectral Form Factor in Many-Body Chaotic and Localized Phases of Quantum Processors

    Authors: Hang Dong, Pengfei Zhang, Ceren B. Dag, Yu Gao, Ning Wang, Jinfeng Deng, Xu Zhang, Jiachen Chen, Shibo Xu, Ke Wang, Yaozu Wu, Chuanyu Zhang, Feitong Jin, Xuhao Zhu, Aosai Zhang, Yiren Zou, Ziqi Tan, Zhengyi Cui, Zitian Zhu, Fanhao Shen, Tingting Li, Jiarun Zhong, Zehang Bao, Hekang Li, Zhen Wang , et al. (6 additional authors not shown)

    Abstract: The spectral form factor (SFF) captures universal spectral fluctuations as signatures of quantum chaos, and has been instrumental in advancing multiple frontiers of physics including the studies of black holes and quantum many-body systems. However, the measurement of SFF in many-body systems is challenging due to the difficulty in resolving level spacings that become exponentially small with incr…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 12 pages, 9 figures

  7. arXiv:2403.16811  [pdf, ps, other]

    hep-ex

    Cross section measurement of $e^+e^-\to ηψ(2S)$ and search for $e^+e^-\toη\tilde{X}(3872)$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: The energy-dependent cross section for $e^+e^-\to ηψ(2S)$ is measured at eighteen center of mass energies from 4.288 GeV to 4.951 GeV using the BESIII detector. Using the same data samples, we also perform the first search for the reaction $e^+e^-\toη\tilde{X}(3872)$, but no evidence is found for the $\tilde{X}(3872)$ in the $π^+π^- J/ψ$ mass distribution. At each of the eighteen center of mass en…

    Submitted 25 March, 2024; originally announced March 2024.

  8. arXiv:2403.16494  [pdf, other]

    cs.CV

    CT-Bound: Robust Boundary Detection From Noisy Images Via Hybrid Convolution and Transformer Neural Networks

    Authors: Wei Xu, Junjie Luo, Qi Guo

    Abstract: We present CT-Bound, a robust and fast boundary detection method for very noisy images using a hybrid Convolution and Transformer neural network. The proposed architecture decomposes boundary estimation into two tasks: local detection and global regularization. During the local detection, the model uses a convolutional architecture to predict the boundary structure of each image patch in the form…

    Submitted 25 June, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  9. arXiv:2403.14998  [pdf, other]

    hep-ex

    Precise measurement of the $e^+e^-\to D_s^+D_s^-$ cross sections at center-of-mass energies from threshold to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Using the $e^+e^-$ collision data collected with the BESIII detector operating at the BEPCII collider, at center-of-mass energies from the threshold to $4.95$~GeV, we present precise measurements of the cross sections for the process $e^+e^-\to D_s^+D_s^-$ using a single tag method. The resulting cross section lineshape exhibits several new structures, thereby offering an input for coupled channel…

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 9 pages, 4 figures, published to PRL

  10. arXiv:2403.14778  [pdf, other]

    cs.CV eess.IV

    Diffusion Attack: Leveraging Stable Diffusion for Naturalistic Image Attacking

    Authors: Qianyu Guo, Jiaming Fu, Yawen Lu, Dongming Gan

    Abstract: In Virtual Reality (VR), adversarial attack remains a significant security threat. Most deep learning-based methods for physical and digital adversarial attacks focus on enhancing attack performance by crafting adversarial examples that contain large printable distortions that are easy for human observers to identify. However, attackers rarely impose limitations on the naturalness and comfort of t…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted to IEEE VRW

  11. arXiv:2403.14734  [pdf, other]

    cs.SE cs.AI cs.CL cs.PL

    A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond

    Authors: Qiushi Sun, Zhirui Chen, Fangzhi Xu, Kanzhi Cheng, Chang Ma, Zhangyue Yin, Jianing Wang, Chengcheng Han, Renyu Zhu, Shuai Yuan, Qipeng Guo, Xipeng Qiu, Pengcheng Yin, Xiaoli Li, Fei Yuan, Lingpeng Kong, Xiang Li, Zhiyong Wu

    Abstract: Neural Code Intelligence -- leveraging deep learning to understand, generate, and optimize code -- holds immense potential for transformative impacts on the whole society. Bridging the gap between Natural Language and Programming Language, this domain has drawn significant attention from researchers in both research communities over the past few years. This survey presents a systematic and chronol…

    Submitted 23 June, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 64 pages, 6 figures, 10 tables, 692 references

  12. arXiv:2403.13437  [pdf, other]

    hep-ex

    Search for $ΔS=2$ nonleptonic hyperon decays $Ω^-\toΣ^{0}π^{-}$ and $Ω^-\to nK^{-}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Using $(27.12 \pm 0.14) \times 10^{8}$ $ψ(3686)$ events collected by the BESIII detector at the center-of-mass energy of $\sqrt{s} = 3.686$ GeV, we search for the first time for two nonleptonic hyperon decays that change strangeness by two units, $Ω^-\toΣ^{0}π^-$ and $Ω^-\to nK^{-}$. No significant signal is observed. The upper limits on their decay branching fractions are determined to be…

    Submitted 14 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  13. arXiv:2403.12766  [pdf, other]

    cs.CL

    NovelQA: Benchmarking Question Answering on Documents Exceeding 200K Tokens

    Authors: Cunxiang Wang, Ruoxi Ning, Boqi Pan, Tonghui Wu, Qipeng Guo, Cheng Deng, Guangsheng Bao, Xiangkun Hu, Zheng Zhang, Qian Wang, Yue Zhang

    Abstract: The rapid advancement of Large Language Models (LLMs) has introduced a new frontier in natural language processing, particularly in understanding and processing long-context information. However, the evaluation of these models' long-context abilities remains a challenge due to the limitations of current benchmarks. To address this gap, we introduce NovelQA, a benchmark specifically designed to tes…

    Submitted 17 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  14. arXiv:2403.12445  [pdf, other]

    cs.CV

    Boosting Transferability in Vision-Language Attacks via Diversification along the Intersection Region of Adversarial Trajectory

    Authors: Sensen Gao, Xiaojun Jia, Xuhong Ren, Ivor Tsang, Qing Guo

    Abstract: Vision-language pre-training (VLP) models exhibit remarkable capabilities in comprehending both images and text, yet they remain susceptible to multimodal adversarial examples (AEs). Strengthening adversarial attacks and uncovering vulnerabilities, especially common issues in VLP models (e.g., high transferable AEs), can stimulate further research on constructing reliable and practical VLP models.…

    Submitted 19 March, 2024; originally announced March 2024.

  15. arXiv:2403.11935  [pdf, other]

    cs.CV eess.IV

    HyperColorization: Propagating spatially sparse noisy spectral clues for reconstructing hyperspectral images

    Authors: M. Kerem Aydin, Qi Guo, Emma Alexander

    Abstract: Hyperspectral cameras face challenging spatial-spectral resolution trade-offs and are more affected by shot noise than RGB photos taken over the same total exposure time. Here, we present a colorization algorithm to reconstruct hyperspectral images from a grayscale guide image and spatially sparse spectral clues. We demonstrate that our algorithm generalizes to varying spectral dimensions for hype…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 16 Pages, 13 Figures, 3 Tables, for more information: https://mehmetkeremaydin.github.io/hypercolorization/

    ACM Class: I.4.5

    Journal ref: Optics Express, Vol:7, year:2024, p:10761-10776

  16. arXiv:2403.10877  [pdf, ps, other]

    hep-ex hep-ph

    Test of lepton universality and measurement of the form factors of $D^0\to K^{*}(892)^-μ^+ν_μ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (637 additional authors not shown)

    Abstract: We report a first study of the semileptonic decay $D^0\rightarrow K^-π^0μ^{+}ν_μ$ by analyzing an $e^+e^-$ annihilation data sample of $7.9~\mathrm{fb}^{-1}$ collected at the center-of-mass energy of 3.773 GeV with the BESIII detector. The absolute branching fraction of $D^0\to K^-π^0μ^{+}ν_μ$ is measured for the first time to be $(0.729 \pm 0.014_{\rm stat} \pm 0.011_{\rm syst})\%$. Based on an a…

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 9 pages, 3 figures

  17. Measurements of All-Particle Energy Spectrum and Mean Logarithmic Mass of Cosmic Rays from 0.3 to 30 PeV with LHAASO-KM2A

    Authors: The LHAASO Collaboration, Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen , et al. (256 additional authors not shown)

    Abstract: We present the measurements of all-particle energy spectrum and mean logarithmic mass of cosmic rays in the energy range of 0.3-30 PeV using data collected from LHAASO-KM2A between September 2021 and December 2022, which is based on a nearly composition-independent energy reconstruction method, achieving unprecedented accuracy. Our analysis reveals the position of the knee at…

    Submitted 26 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures

    Journal ref: Physical Review Letters 132, 131002 (2024)

  18. arXiv:2403.06766  [pdf, other]

    hep-ex

    Determination of the number of $ψ(3686)$ events taken at BESIII

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: The number of $ψ(3686)$ events collected by the BESIII detector during the 2021 run period is determined to be $(2259.3\pm 11.1)\times 10^6$ by counting inclusive $ψ(3686)$ hadronic events. The uncertainty is systematic and the statistical uncertainty is negligible. Meanwhile, the numbers of $ψ(3686)$ events collected during the 2009 and 2012 run periods are updated to be…

    Submitted 28 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  19. JWST's first glimpse of a z > 2 forming cluster reveals a top-heavy stellar mass function

    Authors: Hanwen Sun, Tao Wang, Ke Xu, Emanuele Daddi, Qing Gu, Tadayuki Kodama, Anita Zanella, David Elbaz, Ichi Tanaka, Raphael Gobat, Qi Guo, Jiaxin Han, Shiying Lu, Luwenjia Zhou

    Abstract: Clusters and their progenitors (protoclusters) at z = 2-4, the peak epoch of star formation, are ideal laboratories to study the formation process of both the clusters themselves and their member galaxies. However, a complete census of their member galaxies has been challenging due to observational difficulties. Here we present new JWST/NIRCam observations targeting the distant cluster CLJ1001 at…

    Submitted 29 May, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: 14 pages, 9 figures, 1 table, published by ApJL

    Journal ref: ApJL, 967, L34 (2024)

  20. arXiv:2403.03579  [pdf, other]

    quant-ph

    Testing the unified bounds of quantum speed limit

    Authors: Yaozu Wu, Jiale Yuan, Chuanyu Zhang, Zitian Zhu, Jinfeng Deng, Xu Zhang, Pengfei Zhang, Qiujiang Guo, Zhen Wang, Jiehui Huang, Chao Song, Hekang Li, Da-Wei Wang, H. Wang, Girish S. Agarwal

    Abstract: Quantum speed limits (QSLs) impose fundamental constraints on the evolution speed of quantum systems. Traditionally, the Mandelstam-Tamm (MT) and Margolus-Levitin (ML) bounds have been widely employed, relying on the standard deviation and mean of energy distribution to define the QSLs. However, these universal bounds only offer loose restrictions on the quantum evolution. Here we introduce the ge…

    Submitted 6 March, 2024; originally announced March 2024.

  21. arXiv:2403.03558  [pdf, other]

    cs.CL

    Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem

    Authors: Yuhong Sun, Zhangyue Yin, Qipeng Guo, Jiawen Wu, Xipeng Qiu, Hui Zhao

    Abstract: Large language models (LLMs) are highly effective in various natural language processing (NLP) tasks. However, they are susceptible to producing unreliable conjectures in ambiguous contexts called hallucination. This paper presents a new method for evaluating LLM hallucination in Question Answering (QA) based on the unanswerable math word problem (MWP). To support this approach, we innovatively de…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 11 pages, 8 figures, accepted by LREC-Coling 2024

  22. arXiv:2403.03500  [pdf, other]

    hep-ex

    Observation of the decay $h_{c}\to3(π^{+}π^{-})π^{0}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: Based on $(2712.4\pm14.1)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector, we study the decays $h_{c}\to3(π^{+}π^{-})π^{0}$, $h_{c}\to2(π^{+}π^{-})ω$, $h_{c}\to2(π^{+}π^{-})π^{0}η$, $h_{c}\to2(π^{+}π^{-})η$, and $h_{c}\to p\bar{p}$ via $ψ(3686)\toπ^{0}h_{c}$. The decay channel $h_{c}\to3(π^{+}π^{-})π^{0}$ is observed for the first time, and its branching fraction is determined to…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 11 pages, 3 figures

  23. arXiv:2403.02330  [pdf, other]

    cs.CV

    RegionGPT: Towards Region Understanding Vision Language Model

    Authors: Qiushan Guo, Shalini De Mello, Hongxu Yin, Wonmin Byeon, Ka Chun Cheung, Yizhou Yu, Ping Luo, Sifei Liu

    Abstract: Vision language models (VLMs) have experienced rapid advancements through the integration of large language models (LLMs) with image-text pairs, yet they struggle with detailed regional visual understanding due to limited spatial awareness of the vision encoder, and the use of coarse-grained training data that lacks detailed, region-specific captions. To address this, we introduce RegionGPT (short…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  24. arXiv:2403.01971  [pdf, other]

    cs.SE

    ContrastRepair: Enhancing Conversation-Based Automated Program Repair via Contrastive Test Case Pairs

    Authors: Jiaolong Kong, Mingfei Cheng, Xiaofei Xie, Shangqing Liu, Xiaoning Du, Qi Guo

    Abstract: Automated Program Repair (APR) aims to automatically generate patches for rectifying software bugs. Recent strides in Large Language Models (LLM), such as ChatGPT, have yielded encouraging outcomes in APR, especially within the conversation-driven APR framework. Nevertheless, the efficacy of conversation-driven APR is contingent on the quality of the feedback information. In this paper, we propose…

    Submitted 7 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  25. arXiv:2403.01761  [pdf, other]

    hep-ex

    Observation of $ψ(3686)\to 3φ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (645 additional authors not shown)

    Abstract: Using $(2.712\pm0.014)\times 10^9$ $ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, we report the first observation of $ψ(3686)\to 3φ$ decay with a significance larger than 10$σ$. The branching fraction of this decay is determined to be $(1.46\pm0.05\pm0.17)\times10^{-5}$, where the first uncertainty is statistical and the second is systematic. No significant str…

    Submitted 4 March, 2024; originally announced March 2024.

  26. arXiv:2403.00758  [pdf, other]

    cs.CL cs.AI cs.LG

    Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training

    Authors: Qingyan Guo, Rui Wang, Junliang Guo, Xu Tan, Jiang Bian, Yujiu Yang

    Abstract: While large language models (LLMs) have achieved impressive performance across diverse tasks, recent studies showcase that causal LLMs suffer from the "reversal curse". It is a typical example that the model knows "A's father is B", but is unable to reason "B's child is A". This limitation poses a challenge to the advancement of artificial general intelligence (AGI), as it suggests a gap in the mo…

    Submitted 20 March, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

  27. arXiv:2402.16319  [pdf, other]

    cs.CL

    Data-free Weight Compress and Denoise for Large Language Models

    Authors: Runyu Peng, Yunhua Zhou, Qipeng Guo, Yang Gao, Hang Yan, Xipeng Qiu, Dahua Lin

    Abstract: Large Language Models (LLMs) are reshaping the research landscape in artificial intelligence, particularly as model parameters scale up significantly, unlocking remarkable capabilities across various domains. Nevertheless, the scalability of model parameters faces constraints due to limitations in GPU memory and computational speed. To address these constraints, various weight compression methods…

    Submitted 26 February, 2024; originally announced February 2024.

  28. arXiv:2402.16088  [pdf]

    cond-mat.mes-hall

    Origin of giant magnetoresistance in layered nodal-line semimetal TaNiTe5 nanoflakes

    Authors: Ding-Bang Zhou, Kuang-Hong Gao, Meng-Fan Zhao, Zhi-Yan Jia, Xiao-Xia Hu, Qian-** Guo, Hai-Yan Du, Xiao-** Chen, Zhi-Qing Li

    Abstract: Layered transition metal chalcogenides have stimulated a wide research interest due to their many exotic physical properties. In this paper, we studied the magnetotransport properties of the exfoliated TaNiTe5, a recently discovered Dirac nodal-line semimetal. A giant positive magnetoresistance (MR) is observed when the current is parallel to the crystallographic c axis, while it is strongly dimin…

    Submitted 18 May, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: 21 pages, 7 figures, 1 table

  29. arXiv:2402.15283  [pdf, other]

    cs.LG cs.AI

    When in Doubt, Think Slow: Iterative Reasoning with Latent Imagination

    Authors: Martin Benfeghoul, Umais Zahid, Qinghai Guo, Zafeirios Fountas

    Abstract: In an unfamiliar setting, a model-based reinforcement learning agent can be limited by the accuracy of its world model. In this work, we present a novel, training-free approach to improving the performance of such agents separately from planning and learning. We do so by applying iterative inference at decision-time, to fine-tune the inferred agent states based on the coherence of future state rep…

    Submitted 23 February, 2024; originally announced February 2024.

    ACM Class: I.2.0; I.2.8; I.2.10; I.4.5; I.4.10

  30. arXiv:2402.14845  [pdf, other]

    cs.CL cs.AI cs.LG

    Purifying Large Language Models by Ensembling a Small Language Model

    Authors: Tianlin Li, Qian Liu, Tianyu Pang, Chao Du, Qing Guo, Yang Liu, Min Lin

    Abstract: The emerging success of large language models (LLMs) heavily relies on collecting abundant training data from external (untrusted) sources. Despite substantial efforts devoted to data cleaning and curation, well-constructed LLMs have been reported to suffer from copyright infringement, data poisoning, and/or privacy violations, which would impede practical deployment of LLMs. In this study, we pro…

    Submitted 19 February, 2024; originally announced February 2024.

    ACM Class: I.2

  31. arXiv:2402.13583  [pdf, other]

    cs.CL

    LongWanjuan: Towards Systematic Measurement for Long Text Quality

    Authors: Kai Lv, Xiaoran Liu, Qipeng Guo, Hang Yan, Conghui He, Xipeng Qiu, Dahua Lin

    Abstract: The quality of training data are crucial for enhancing the long-text capabilities of foundation models. Despite existing efforts to refine data quality through heuristic rules and evaluations based on data diversity and difficulty, there's a lack of systematic approaches specifically tailored for assessing long texts. Addressing this gap, our work systematically measures the quality of long texts…

    Submitted 21 February, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: Update Figures

  32. arXiv:2402.13435  [pdf, other]

    cs.IR cs.LG

    Learning to Retrieve for Job Matching

    Authors: Jianqiang Shen, Yuchin Juan, Shaobo Zhang, Ping Liu, Wen Pu, Sriram Vasudevan, Qingquan Song, Fedor Borisyuk, Kay Qianqi Shen, Haichao Wei, Yunxiang Ren, Yeou S. Chiou, Sicong Kuang, Yuan Yin, Ben Zheng, Muchen Wu, Shaghayegh Gharghabi, Xiaoqing Wang, Huichao Xue, Qi Guo, Daniel Hewlett, Luke Simon, Liangjie Hong, Wenjing Zhang

    Abstract: Web-scale search systems typically tackle the scalability challenge with a two-step paradigm: retrieval and ranking. The retrieval step, also known as candidate selection, often involves extracting standardized entities, creating an inverted index, and performing term matching for retrieval. Such traditional methods require manual and time-consuming development of query models. In this paper, we d…

    Submitted 20 February, 2024; originally announced February 2024.

  33. arXiv:2402.13055  [pdf, other]

    cs.CL cs.AI

    Identifying Semantic Induction Heads to Understand In-Context Learning

    Authors: Jie Ren, Qipeng Guo, Hang Yan, Dongrui Liu, Xipeng Qiu, Dahua Lin

    Abstract: Although large language models (LLMs) have demonstrated remarkable performance, the lack of transparency in their inference logic raises concerns about their trustworthiness. To gain a better understanding of LLMs, we conduct a detailed analysis of the operations of attention heads and aim to better understand the in-context learning of LLMs. Specifically, we investigate whether attention heads en…

    Submitted 20 February, 2024; originally announced February 2024.

  34. arXiv:2402.13013  [pdf, other]

    cs.CL

    Code Needs Comments: Enhancing Code LLMs with Comment Augmentation

    Authors: Demin Song, Honglin Guo, Yunhua Zhou, Shuhao Xing, Yudong Wang, Zifan Song, Wenwei Zhang, Qipeng Guo, Hang Yan, Xipeng Qiu, Dahua Lin

    Abstract: The programming skill is one crucial ability for Large Language Models (LLMs), necessitating a deep understanding of programming languages (PLs) and their correlation with natural languages (NLs). We examine the impact of pre-training data on code-focused LLMs' performance by assessing the comment density as a measure of PL-NL alignment. Given the scarcity of code-comment aligned data in pre-train…

    Submitted 20 February, 2024; originally announced February 2024.

  35. arXiv:2402.12760  [pdf, other]

    cs.MM cs.AI cs.CV

    A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image Synthesis

    Authors: Nailei Hei, Qianyu Guo, Zihao Wang, Yan Wang, Haofen Wang, Wenqiang Zhang

    Abstract: Well-designed prompts have demonstrated the potential to guide text-to-image models in generating amazing images. Although existing prompt engineering methods can provide high-level guidance, it is challenging for novice users to achieve the desired results by manually entering prompts due to a discrepancy between novice-user-input prompts and the model-preferred prompts. To bridge the distributio…

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted by The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)

  36. arXiv:2402.12399  [pdf, other]

    cs.LG cs.AI cs.CL

    Turn Waste into Worth: Rectifying Top-$k$ Router of MoE

    Authors: Zhiyuan Zeng, Qipeng Guo, Zhaoye Fei, Zhangyue Yin, Yunhua Zhou, Linyang Li, Tianxiang Sun, Hang Yan, Dahua Lin, Xipeng Qiu

    Abstract: Sparse Mixture of Experts (MoE) models are popular for training large language models due to their computational efficiency. However, the commonly used top-$k$ routing mechanism suffers from redundancy computation and memory costs due to the unbalanced routing. Some experts are overflow, where the exceeding tokens are dropped. While some experts are vacant, which are padded with zeros, negatively…

    Submitted 21 February, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  37. arXiv:2402.12150  [pdf, other]

    cs.CL cs.AI

    Your Large Language Model is Secretly a Fairness Proponent and You Should Prompt it Like One

    Authors: Tianlin Li, Xiaoyu Zhang, Chao Du, Tianyu Pang, Qian Liu, Qing Guo, Chao Shen, Yang Liu

    Abstract: The widespread adoption of large language models (LLMs) underscores the urgent need to ensure their fairness. However, LLMs frequently present dominant viewpoints while ignoring alternative perspectives from minority parties, resulting in potential biases. We hypothesize that these fairness-violating behaviors occur because LLMs express their viewpoints using a human personality that represents th…

    Submitted 19 February, 2024; originally announced February 2024.

    ACM Class: I.2; J.4

  38. arXiv:2402.11678  [pdf, other]

    astro-ph.CO astro-ph.GA

    The boundary of cosmic filaments

    Authors: Wei Wang, Peng Wang, Hong Guo, Xi Kang, Noam I. Libeskind, Daniela Galarraga-Espinosa, Volker Springel, Rahul Kannan, Lars Hernquist, Rudiger Pakmor, Haoran Yu, Sownak Bose, Quan Guo, Luo Yu, Cesar Hernandez-Aguayo

    Abstract: For decades, the boundary of cosmic filaments has been a subject of debate. In this work, we determine the physically motivated radii of filaments by constructing stacked galaxy number density profiles around the filament spines. We find that the slope of the profile changes with distance to the filament spine, reaching its minimum at approximately 1 Mpc at z = 0 in both state-of-the-art hydrodyn…

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: 11 pages, 6 figures, submitted

  39. arXiv:2402.11207  [pdf, ps, other]

    hep-ex

    Search for the production of deuterons and antideuterons in $e^+e^-$ annihilation at center-of-mass energies between 4.13 and 4.70 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, et al. (593 additional authors not shown)

    Abstract: Using a data sample of $e^+e^-$ collision data corresponding to an integrated luminosity of 19 fb$^{-1}$ collected with the BESIII detector at the BEPCII collider, we search for the production of deuterons and antideuterons via $e^+e^-\to ppπ^-\bar{d}+c.c.$ for the first time at center-of-mass energies between 4.13 and 4.70 GeV. No significant signal is observed and the upper limit of the…

    Submitted 17 February, 2024; originally announced February 2024.

  40. arXiv:2402.10594  [pdf, other]

    hep-ph

    Strong decays of the $Λ_{c}(2910)$ and $Λ_{c}(2940)$ in the $ND^{*}$ molecular frame

    Authors: Zi-Li Yue, Quan-Yun Guo, Dian-Yong Chen

    Abstract: Stimulated by the observation of a new structure, named $Λ_{c}(2910)$, in the $Σ_{c}(2455)^{0,++}π^{+,-}$ decay channel from B meson decay process by the Belle Collaboration, and the similarity of the $Λ_{c}(2910)/Λ_{c}(2940)$ and $P_{c}$ states, we investigate the decay behavior of the $Λ_{c}(2910)$ and $Λ_{c}(2940)$ in the $N D^{*}$ molecular frame with the possible $J^{P}$ quantum numbers to be…

    Submitted 26 May, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: 12 pages, 13 figures, 3 tables, accepted for publication in PRD

  41. arXiv:2402.10593  [pdf, other]

    cs.IT eess.SP

    Bayesian Learning for Double-RIS Aided ISAC Systems with Superimposed Pilots and Data

    Authors: Xu Gan, Chongwen Huang, Zhaohui Yang, Caijun Zhong, Xiaoming Chen, Zhaoyang Zhang, Qinghua Guo, Chau Yuen, Merouane Debbah

    Abstract: Reconfigurable intelligent surface (RIS) has great potential to improve the performance of integrated sensing and communication (ISAC) systems, especially in scenarios where line-of-sight paths between the base station and users are blocked. However, the spectral efficiency (SE) of RIS-aided ISAC uplink transmissions may be drastically reduced by the heavy burden of pilot overhead for realizing se…

    Submitted 16 February, 2024; originally announced February 2024.

  42. arXiv:2402.09263  [pdf]

    eess.SY

    Uncertainty-Aware Transient Stability-Constrained Preventive Redispatch: A Distributional Reinforcement Learning Approach

    Authors: Zhengcheng Wang, Fei Teng, Yanzhen Zhou, Qinglai Guo, Hongbin Sun

    Abstract: Transient stability-constrained preventive redispatch plays a crucial role in ensuring power system security and stability. Since redispatch strategies need to simultaneously satisfy complex transient constraints and the economic need, model-based formulation and optimization become extremely challenging. In addition, the increasing uncertainty and variability introduced by renewable sources start…

    Submitted 29 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: 13 pages, 11 figures, accepted by IEEE Transactions on Power Systems on 24-Jun-2024

  43. arXiv:2402.06341  [pdf, other]

    cs.CL

    RareBench: Can LLMs Serve as Rare Diseases Specialists?

    Authors: Xuanzhong Chen, Xiaohao Mao, Qihan Guo, Lun Wang, Shuyang Zhang, Ting Chen

    Abstract: Generalist Large Language Models (LLMs), such as GPT-4, have shown considerable promise in various domains, including medical diagnosis. Rare diseases, affecting approximately 300 million people worldwide, often have unsatisfactory clinical diagnosis rates primarily due to a lack of experienced physicians and the complexity of differentiating among many rare diseases. In this context, recent news…

    Submitted 9 February, 2024; originally announced February 2024.

  44. arXiv:2402.06329  [pdf]

    cs.CV eess.IV

    A Network for structural dense displacement based on 3D deformable mesh model and optical flow

    Authors: Peimian Du, Qicheng Guo, Yanru Li

    Abstract: This study proposes a network to recognize the displacement of an RC frame structure from a video taken by a monocular camera. The proposed network consists of two modules, FlowNet2 and POFRN-Net. FlowNet2 is used to generate dense optical flow, while POFRN-Net extracts the pose parameter H. FlowNet2 converts two video frames into dense optical flow. POFRN-Net takes as input dense optical flow from…

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: Paper for the 3rd International Competition for Structural Health Monitoring (IC-SHM 2022): 15 pages, 13 figures

  45. arXiv:2402.05492  [pdf, other]

    astro-ph.CO

    Cosmological Forecast of the Void Size Function Measurement from the CSST Spectroscopic Survey

    Authors: Yingxiao Song, Qi Xiong, Yan Gong, Furen Deng, Kwan Chuen Chan, Xuelei Chen, Qi Guo, Jiaxin Han, Guoliang Li, Ming Li, Yun Liu, Yu Luo, Wenxiang Pei, Chengliang Wei

    Abstract: Void size function (VSF) contains information of the cosmic large-scale structure (LSS), and can be used to derive the properties of dark energy and dark matter. We predict the VSFs measured from the spectroscopic galaxy survey operated by the China Space Station Telescope (CSST), and study the strength of cosmological constraint. We employ a high-resolution Jiutian simulation to get CSST galaxy m…

    Submitted 24 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 10 pages, 7 figures, 3 tables. Accepted for publication in MNRAS

  46. arXiv:2402.04546  [pdf, other]

    cs.RO

    LiDAR-Forest Dataset: LiDAR Point Cloud Simulation Dataset for Forestry Application

    Authors: Yawen Lu, Zhuoyang Sun, **yuan Shao, Qianyu Guo, Yunhan Huang, Songlin Fei, Yingjie Chen

    Abstract: The popularity of LiDAR devices and sensor technology has gradually empowered users from autonomous driving to forest monitoring, and research on 3D LiDAR has made remarkable progress over the years. Unlike 2D images, whose focused area is visible and rich in texture information, understanding the point distribution can help companies and researchers find better ways to develop point-based 3D appl…

    Submitted 15 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: 5 pages

  47. arXiv:2402.03829  [pdf, ps, other]

    hep-ex

    Precise Measurement of Born Cross Sections for $e^+e^-\to D\bar{D}$ and Observation of One Structure between $\sqrt{s} = 3.80-4.95$ GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, et al. (604 additional authors not shown)

    Abstract: Using data samples collected with the BESIII detector at the BEPCII collider at center-of-mass energies ranging from 3.80 to 4.95 GeV, corresponding to an integrated luminosity of 20 fb$^{-1}$, a measurement of Born cross sections for the $e^+e^-\to D^{0}\bar{D}^{0}$ and $D^{+}D^{-}$ processes is presented with unprecedented precision. By performing a simultaneous fit to the dressed cross sections…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 9 pages, 4 figures, 1 table, 1 Supplemental_Material

  48. arXiv:2402.03705  [pdf, other]

    cs.CV cs.CR

    FoolSDEdit: Deceptively Steering Your Edits Towards Targeted Attribute-aware Distribution

    Authors: Qi Zhou, Dongxia Wang, Tianlin Li, Zhihong Xu, Yang Liu, Kui Ren, Wenhai Wang, Qing Guo

    Abstract: Guided image synthesis methods, like SDEdit based on the diffusion model, excel at creating realistic images from user inputs such as stroke paintings. However, existing efforts mainly focus on image quality, often overlooking a key point: the diffusion model represents a data distribution, not individual images. This introduces a low but critical chance of generating images that contradict user i…

    Submitted 5 February, 2024; originally announced February 2024.

  49. arXiv:2402.03661  [pdf, other]

    cs.LG cs.AI

    Transductive Reward Inference on Graph

    Authors: Bohao Qu, Xiaofeng Cao, Qing Guo, Yi Chang, Ivor W. Tsang, Chengqi Zhang

    Abstract: In this study, we present a transductive inference approach on a reward information propagation graph, which enables the effective estimation of rewards for unlabelled data in offline reinforcement learning. Reward inference is the key to learning effective policies in practical scenarios, while direct environmental interactions are either too costly or unethical and the reward functions are ra…

    Submitted 5 February, 2024; originally announced February 2024.

  50. arXiv:2402.02738  [pdf, other]

    cs.CV cs.LG

    Improving Robustness of LiDAR-Camera Fusion Model against Weather Corruption from Fusion Strategy Perspective

    Authors: Yihao Huang, Kaiyuan Yu, Qing Guo, Felix Juefei-Xu, Xiaojun Jia, Tianlin Li, Geguang Pu, Yang Liu

    Abstract: In recent years, LiDAR-camera fusion models have markedly advanced 3D object detection tasks in autonomous driving. However, their robustness against common weather corruption such as fog, rain, snow, and sunlight in the intricate physical world remains underexplored. In this paper, we evaluate the robustness of fusion models from the perspective of fusion strategies on the corrupted dataset. Base…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 17 pages