Skip to main content

Showing 1–50 of 368 results for author: Zhong, W

.
  1. arXiv:2407.02043  [pdf, other

    cs.CL

    Concise and Precise Context Compression for Tool-Using Language Models

    Authors: Yang Xu, Yunlong Feng, Honglin Mu, Yutai Hou, Yitong Li, Xinghao Wang, Wanjun Zhong, Zhongyang Li, Dandan Tu, Qingfu Zhu, Min Zhang, Wanxiang Che

    Abstract: Through reading the documentation in the context, tool-using language models can dynamically extend their capability using external tools. The cost is that we have to input lengthy documentation every time the model needs to use the tool, occupying the input window as well as slowing down the decoding process. Given the progress in general-purpose compression, soft context compression is a suita… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2407.00569  [pdf, other

    cs.CV cs.AI cs.CL

    Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models

    Authors: Weihong Zhong, Xiaocheng Feng, Liang Zhao, Qiming Li, Lei Huang, Yuxuan Gu, Weitao Ma, Yuan Xu, Bing Qin

    Abstract: Though advanced in understanding visual information with human languages, Large Vision-Language Models (LVLMs) still suffer from multimodal hallucinations. A natural concern is that during multimodal interaction, the generated hallucinations could influence the LVLMs' subsequent generation. Thus, we raise a question: When presented with a query relevant to the previously generated hallucination, w… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: Accepted to ACL 2024 Main Conference. 21 pages, 20 figures

  3. arXiv:2406.19827  [pdf, other

    cs.LG

    Towards Stable and Storage-efficient Dataset Distillation: Matching Convexified Trajectory

    Authors: Wenliang Zhong, Haoyu Tang, Qinghai Zheng, Mingzhu Xu, Yupeng Hu, Liqiang Nie

    Abstract: The rapid evolution of deep learning and large language models has led to an exponential growth in the demand for training data, prompting the development of Dataset Distillation methods to address the challenges of managing large datasets. Among these, Matching Training Trajectories (MTT) has been a prominent approach, which replicates the training trajectory of an expert network on real data wit… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 11 pages

  4. arXiv:2406.15796  [pdf, other

    cs.CL

    Rethinking Entity-level Unlearning for Large Language Models

    Authors: Weitao Ma, Xiaocheng Feng, Weihong Zhong, Lei Huang, Yangfan Ye, Bing Qin

    Abstract: Large language model unlearning has gained increasing attention due to its potential to mitigate security and privacy concerns. Current research predominantly focuses on Instance-level unlearning, specifically aiming at forgetting predefined instances of sensitive content. However, a notable gap still exists in exploring the deletion of complete entity-related information, which is crucial in many… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Work in progress

  5. arXiv:2406.12278  [pdf, other

    econ.TH

    Persuasion and Optimal Stop**

    Authors: Andrew Koh, Sivakorn Sanguanmoo, Weijie Zhong

    Abstract: We provide a unified analysis of how dynamic information should be designed in optimal stop** problems: a principal controls the flow of information about a payoff relevant state to persuade an agent to stop at the right time, in the right state, and choose the right action. We further show that for arbitrary preferences, intertemporal commitment is unnecessary: optimal dynamic information desig… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  6. arXiv:2406.08790  [pdf, ps, other

    quant-ph

    Direct generation of multi-photon hyperentanglement

    Authors: Peng Zhao, Jia-Wei Ying, Meng-Ying Yang, Wei Zhong, Ming-Ming Du, Shu-Ting Shen, Yun-Xi Li, An-Lei Zhang, Lan Zhou, Yu-Bo Sheng

    Abstract: Multi-photon hyperentangement is of fundamental importance in optical quantum information processing. Existing theory and experiment producing multi-photon hyperentangled states have until now relied on the outcome post-selection, a procedure where only the measurement results corresponding to the desired state are considered. Such approach severely limits the usefulness of the resulting hyperenta… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  7. arXiv:2406.02987  [pdf, other

    cs.CV

    Enhancing Multimodal Large Language Models with Multi-instance Visual Prompt Generator for Visual Representation Enrichment

    Authors: Wenliang Zhong, Wenyi Wu, Qi Li, Rob Barton, Boxin Du, Shioulin Sam, Karim Bouyarmane, Ismail Tutar, Junzhou Huang

    Abstract: Multimodal Large Language Models (MLLMs) have achieved SOTA performance in various visual language tasks by fusing the visual representations with LLMs leveraging some visual adapters. In this paper, we first establish that adapters using query-based Transformers such as Q-former is a simplified Multi-instance Learning method without considering instance heterogeneity/correlation. We then propose… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  8. arXiv:2405.20314  [pdf, ps, other

    cs.CL

    S3D: A Simple and Cost-Effective Self-Speculative Decoding Scheme for Low-Memory GPUs

    Authors: Wei Zhong, Manasa Bharadwaj

    Abstract: Speculative decoding (SD) has attracted a significant amount of research attention due to the substantial speedup it can achieve for LLM inference. However, despite the high speedups they offer, speculative decoding methods often achieve optimal performance on high-end devices or with a substantial GPU memory overhead. Given limited memory and the necessity of quantization, a high-performing model… ▽ More

    Submitted 1 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  9. arXiv:2405.17132  [pdf, other

    cs.LG

    Your decision path does matter in pre-training industrial recommenders with multi-source behaviors

    Authors: Chun**g Gan, Binbin Hu, Bo Huang, Ziqi Liu, Jian Ma, Zhiqiang Zhang, Wenliang Zhong, Jun Zhou

    Abstract: Online service platforms offering a wide range of services through miniapps have become crucial for users who visit these platforms with clear intentions to find services they are interested in. Aiming at effective content delivery, cross-domain recommendation are introduced to learn high-quality representations by transferring behaviors from data-rich scenarios. However, these methods overlook th… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  10. arXiv:2405.16970  [pdf, ps, other

    quant-ph

    Memory-assisted measurement-device-independent quantum secret sharing

    Authors: Cheng Zhang, Qi Zhang, Wei Zhong, Ming-Ming Du, Shu-Ting Shen, Xi-Yun Li, An-Lei Zhang, Lan Zhou, Yu-Bo Sheng

    Abstract: Measurement-device-independent quantum secret sharing (MDI-QSS) can eliminate all the security loopholes associated with imperfect measurement devices and greatly enhance QS's security under practical experimental condition. MDI-QSS requires each communication user to send single photon to the measurement party for the coincident measurement. However, the unsynchronization of the transmitted photo… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 11 pages, 6 figures

  11. arXiv:2405.15600  [pdf, ps, other

    stat.ML cs.LG econ.EM stat.ME

    Transfer Learning for Spatial Autoregressive Models

    Authors: Hao Zeng, Wei Zhong, Xingbai Xu

    Abstract: The spatial autoregressive (SAR) model has been widely applied in various empirical economic studies to characterize the spatial dependence among subjects. However, the precision of estimating the SAR model diminishes when the sample size of the target data is limited. In this paper, we propose a new transfer learning framework for the SAR model to borrow the information from similar source data t… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  12. arXiv:2405.11826  [pdf, other

    astro-ph.IM hep-ex physics.ins-det

    Data quality control system and long-term performance monitor of the LHAASO-KM2A

    Authors: Zhen Cao, F. Aharonian, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen , et al. (263 additional authors not shown)

    Abstract: The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To… ▽ More

    Submitted 13 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 15 pages, 9 figures

  13. arXiv:2405.11273  [pdf, other

    cs.AI cs.CL cs.CV cs.MM

    Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts

    Authors: Yunxin Li, Shenyuan Jiang, Baotian Hu, Longyue Wang, Wanqi Zhong, Wenhan Luo, Lin Ma, Min Zhang

    Abstract: Recent advancements in Multimodal Large Language Models (MLLMs) underscore the significance of scalable models and data to boost performance, yet this often incurs substantial computational costs. Although the Mixture of Experts (MoE) architecture has been employed to efficiently scale large language and image-text models, these efforts typically involve fewer experts and limited modalities. To ad… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: 22 pages, 13 figures. Project Website: https://uni-moe.github.io/. Working in progress

  14. arXiv:2405.11221  [pdf, other

    physics.plasm-ph

    Real-time equilibrium reconstruction by neural network based on HL-3 tokamak

    Authors: Guohui Zheng, Songfen Liu, Zongyu Yang, Rui Ma, Xinwen Gong, Ao Wang, Shuo Wang, Wulyu Zhong

    Abstract: A neural network model, EFITNN, has been developed capable of real-time magnetic equilibrium reconstruction based on HL-3 tokamak magnetic measurement signals. The model processes inputs from 68 channels of magnetic measurement data gathered from 1159 HL-3 experimental discharges, including plasma current, loop voltage, and the poloidal magnetic fields measured by equilibrium probes. The outputs o… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  15. arXiv:2405.10676  [pdf, other

    physics.plasm-ph

    Identifying L-H transition in HL-2A through deep learning

    Authors: Meihuizi He, Songfen Liu, Fan Xia, Zongyu Yang, Wulyu Zhong

    Abstract: During the operation of tokamak devices, addressing the thermal load issues caused by Edge Localized Modes (ELMs) eruption is crucial. Ideally, mitigation and suppression measures for ELMs should be promptly initiated as soon as the first low-to-high confinement (L-H) transition occurs, which necessitates the real-time monitoring and accurate identification of the L-H transition process. Motivated… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  16. arXiv:2405.05739  [pdf

    physics.plasm-ph

    Preliminary Exploration on the Low-Pressure Ar-O2 Plasma Generated by Low-Frequency Alternating Current (AC) Power Supply

    Authors: Niaz Wali, W. W. Xiao, Q. U. Din, N. U. Rehman, C. Y. Wang, J. T. Ma, W. J. Zhong, Q. W. Yang

    Abstract: This study reports a low-frequency alternating current (AC) power supply as a novel approach for generating low-pressure capacitively coupled Ar-O2 plasma, offering advantages in cost, compactness, and operational simplicity, which are crucial for both material science and biological applications. The effectiveness of low-frequency AC-generated plasma against traditional RF systems by examining ke… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 16 pages, 7 figures

  17. arXiv:2404.13646  [pdf, other

    math.NA cs.LG

    Physics-informed Mesh-independent Deep Compositional Operator Network

    Authors: Weiheng Zhong, Hadi Meidani

    Abstract: Solving parametric Partial Differential Equations (PDEs) for a broad range of parameters is a critical challenge in scientific computing. To this end, neural operators, which learn map**s from parameters to solutions, have been successfully used. However, the training of neural operators typically demands large training datasets, the acquisition of which can be prohibitively expensive. To addres… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  18. arXiv:2404.12602  [pdf

    cs.CV cs.LG

    A visualization method for data domain changes in CNN networks and the optimization method for selecting thresholds in classification tasks

    Authors: Minzhe Huang, Changwei Nie, Weihong Zhong

    Abstract: In recent years, Face Anti-Spoofing (FAS) has played a crucial role in preserving the security of face recognition technology. With the rise of counterfeit face generation techniques, the challenge posed by digitally edited faces to face anti-spoofing is escalating. Existing FAS technologies primarily focus on intercepting physically forged faces and lack a robust solution for cross-domain FAS cha… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  19. arXiv:2404.03333  [pdf, other

    astro-ph.EP

    The Impact-driven Atmospheric Loss of Super-Earths around Different Spectral Type Host Stars

    Authors: Wei Zhong, Cong Yu, Shi Jia, Shang-Fei Liu

    Abstract: The planet's mass loss is important for the planet's formation and evolution. The radius valley (RV) is believed to be triggered by evaporation-induced mass loss. As an alternative mechanism for the RV, the mass loss of post-impact planets is thoroughly investigated in this work. The impact energy is converted to the planet's internal energy, enhancing its core energy and accelerating mass loss an… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 19 pages, 12 figures; ApJ Accepted

  20. arXiv:2404.01735  [pdf, other

    cs.IR cs.MM

    CIRP: Cross-Item Relational Pre-training for Multimodal Product Bundling

    Authors: Yunshan Ma, Yingzhi He, Wenjun Zhong, Xiang Wang, Roger Zimmermann, Tat-Seng Chua

    Abstract: Product bundling has been a prevailing marketing strategy that is beneficial in the online shop** scenario. Effective product bundling methods depend on high-quality item representations, which need to capture both the individual items' semantics and cross-item relations. However, previous item representation learning methods, either feature fusion or graph learning, suffer from inadequate cross… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: arXiv preprint, 10 pages, 4 figures, 6 tables

    ACM Class: H.3.0

  21. The Physical Origin of the Mass-Size Relation and Its Scatter of Disk Galaxies

    Authors: Min Du, Hong-Chuan Ma, Wen-Yu Zhong, Luis C. Ho, Shihong Liao, Yingjie Peng

    Abstract: Utilizing a kinematic decomposition of simulated galaxies, we focus on galaxies with tiny kinematically inferred stellar halos, indicative of weak external influences. We investigate the intricate interplay between internal (natural) and external (nurture) processes in sha** the scaling relationships of specific angular momentum ($j_\star$), stellar mass ($M_\star$), and size of disk galaxies wi… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 11 pages, 9 figures, accepted for publication in A&A

    Journal ref: A&A 686, A168 (2024)

  22. arXiv:2403.12524  [pdf, other

    astro-ph.HE astro-ph.SR

    Search for GeV gamma-ray emission from the possible TeV-bright red dwarfs with Fermi-LAT

    Authors: Chen Huang, Xiao Zhang, Yang Chen, Wen-Juan Zhong

    Abstract: Red dwarfs have been suggested to be among the possible astrophysical species accelerating particles and emitting TeV $γ$-rays. As an effort to search for the GeV $γ$-ray counterparts of the suggested TeV emission from eight red dwarfs, we analyse the 0.2--500 GeV $γ$-ray emission of the regions covering them exploiting the $\sim$13.6 yr Pass 8 data of the Fermi Large Area Telescope. A GeV $γ$-ray… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 11 pages, 6 figures

  23. arXiv:2403.11880  [pdf, other

    cond-mat.str-el cond-mat.stat-mech

    Topological edge modes and phase transition in the critical fermionic chain with long-range interaction

    Authors: Wen-Hao Zhong, Wei-Lin Li, Yong-Chang Chen, Xue-Jia Yu

    Abstract: The long-range interaction can fundamentally alter properties in gapped topological phases such as emergent massive edge modes. However, recent research has shifted attention to topological nontrivial critical points or phases, and it is natural to explore how long-range interaction influences them. In this work, we investigate the topological behavior and phase transition of extended Kitaev chain… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 13 pages, 11 figures. Any comments or suggestions are welcome !

  24. arXiv:2403.11211  [pdf

    cs.CV

    RCdpia: A Renal Carcinoma Digital Pathology Image Annotation dataset based on pathologists

    Authors: Qingrong Sun, Weixiang Zhong, Jie Zhou, Chong Lai, Xiaodong Teng, Maode Lai

    Abstract: The annotation of digital pathological slide data for renal cell carcinoma is of paramount importance for correct diagnosis of artificial intelligence models due to the heterogeneous nature of the tumor. This process not only facilitates a deeper understanding of renal cell cancer heterogeneity but also aims to minimize noise in the data for more accurate studies. To enhance the applicability of t… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures, 1 table

  25. arXiv:2403.10137  [pdf, ps, other

    quant-ph

    Device-independent quantum secret sharing with noise pre-processing and post-selection

    Authors: Qi Zhang, Wei Zhong, Ming-Ming Du, Shu-Ting Shen, Xi-Yun Li, An-Lei Zhang, Lan Zhou, Yu-Bo Sheng

    Abstract: Quantum secret sharing (QSS) is a fundamental quantum secure communication primitive, which enables a dealer to distribute secret keys to a set of players. Device-independent (DI) QSS can relax the security assumptions about the devices' internal working, and effectively enhance QSS's security under practical experimental conditions. Here, we propose a DI-QSS protocol based on Greenberger-Horne-Ze… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 13 pages, 6 figures

  26. arXiv:2403.09957  [pdf, other

    astro-ph.GA

    Suppression of Star Formation in Galaxy Pairs

    Authors: Shuai Feng, Shi-Yin Shen, Fang-Ting Yuan, Wen-Xin Zhong, Wen-Yuan Cui, Lin-Lin Li

    Abstract: We investigate the suppression of star formation in galaxy pairs based on the isolated galaxy pair sample derived from the SDSS survey. By comparing the star formation rate between late-type galaxies in galaxy pairs and those in the isolated environment, we detect the signal of star formation suppression in galaxy pairs at $d_p < 100$kpc and $200$kpc$ < d_p < 350$kpc. The occurrence of star format… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 10 pages, 4 figures, accepted for publication in ApJ

  27. arXiv:2403.08967  [pdf, other

    cs.CV cs.AI

    PathM3: A Multimodal Multi-Task Multiple Instance Learning Framework for Whole Slide Image Classification and Captioning

    Authors: Qifeng Zhou, Wenliang Zhong, Yuzhi Guo, Michael Xiao, Hehuan Ma, Junzhou Huang

    Abstract: In the field of computational histopathology, both whole slide images (WSIs) and diagnostic captions provide valuable insights for making diagnostic decisions. However, aligning WSIs with diagnostic captions presents a significant challenge. This difficulty arises from two main factors: 1) Gigapixel WSIs are unsuitable for direct input into deep learning models, and the redundancy and correlation… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  28. arXiv:2403.05890  [pdf, other

    cs.LG cs.DC

    Towards Efficient Replay in Federated Incremental Learning

    Authors: Yichen Li, Qunwei Li, Haozhao Wang, Ruixuan Li, Wenliang Zhong, Guannan Zhang

    Abstract: In Federated Learning (FL), the data in each client is typically assumed fixed or static. However, data often comes in an incremental manner in real-world applications, where the data domain may increase dynamically. In this work, we study catastrophic forgetting with data heterogeneity in Federated Incremental Learning (FIL) scenarios where edge clients may lack enough storage space to retain ful… ▽ More

    Submitted 3 June, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  29. arXiv:2403.03514  [pdf, other

    cs.CL

    CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models

    Authors: Zexuan Qiu, **g**g Li, Shijue Huang, Wanjun Zhong, Irwin King

    Abstract: Develo** Large Language Models (LLMs) with robust long-context capabilities has been the recent research focus, resulting in the emergence of long-context LLMs proficient in Chinese. However, the evaluation of these models remains underdeveloped due to a lack of benchmarks. To address this gap, we present CLongEval, a comprehensive Chinese benchmark for evaluating long-context LLMs. CLongEval is… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 19 pages, 4 figures

  30. Improving Visual Perception of a Social Robot for Controlled and In-the-wild Human-robot Interaction

    Authors: Wangjie Zhong, Leimin Tian, Duy Tho Le, Hamid Rezatofighi

    Abstract: Social robots often rely on visual perception to understand their users and the environment. Recent advancements in data-driven approaches for computer vision have demonstrated great potentials for applying deep-learning models to enhance a social robot's visual perception. However, the high computational demands of deep-learning methods, as opposed to the more resource-efficient shallow-learning… ▽ More

    Submitted 5 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: accepted to HRI 2024 (LBR track)

  31. arXiv:2402.16427  [pdf

    cond-mat.supr-con cond-mat.mtrl-sci

    Electronic phase transitions and superconductivity in ferroelectric Sn$_2$P$_2$Se$_6$ under pressure

    Authors: He Zhang, Wei Zhong, Xiaohui Yu, Binbin Yue, Fang Hong

    Abstract: Since there is both strong electron-phonon coupling during a ferroelectric/FE transition and superconducting/SC transition, it has been an important topic to explore superconductivity from the FE instability. Sn$_2$P$_2$Se$_6$ arouses broad attention due to its unique FE properties. Here, we reported the electronic phase transitions and superconductivity in this compound based on high-pressure ele… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 13 pages, 5 figures

  32. arXiv:2402.16288  [pdf, other

    cs.CL cs.AI cs.IR

    PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering

    Authors: Yiming Du, Hongru Wang, Zhengyi Zhao, Bin Liang, Baojun Wang, Wanjun Zhong, Zezhong Wang, Kam-Fai Wong

    Abstract: Long-term memory plays a critical role in personal interaction, considering long-term memory can better leverage world knowledge, historical information, and preferences in dialogues. Our research introduces PerLTQA, an innovative QA dataset that combines semantic and episodic memories, including world knowledge, profiles, social relationships, events, and dialogues. This dataset is collected to i… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  33. arXiv:2402.11905  [pdf, other

    cs.CL

    Learning to Edit: Aligning LLMs with Knowledge Editing

    Authors: Yuxin Jiang, Yufei Wang, Chuhan Wu, Wanjun Zhong, Xingshan Zeng, Jiahui Gao, Liangyou Li, Xin Jiang, Lifeng Shang, Ruiming Tang, Qun Liu, Wei Wang

    Abstract: Knowledge editing techniques, aiming to efficiently modify a minor proportion of knowledge in large language models (LLMs) without negatively impacting performance across other inputs, have garnered widespread attention. However, existing methods predominantly rely on memorizing the updated knowledge, impeding LLMs from effectively combining the new knowledge with their inherent knowledge when ans… ▽ More

    Submitted 5 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: 17 pages, 8 figures, 9 tables. ACL 2024 main camera-ready version

  34. arXiv:2402.02718  [pdf, other

    cs.IR cs.AI

    Denoising Time Cycle Modeling for Recommendation

    Authors: Sicong Xie, Qunwei Li, Weidi Xu, Kaiming Shen, Shaohu Chen, Wenliang Zhong

    Abstract: Recently, modeling temporal patterns of user-item interactions have attracted much attention in recommender systems. We argue that existing methods ignore the variety of temporal patterns of user behaviors. We define the subset of user behaviors that are irrelevant to the target item as noises, which limits the performance of target-related time cycle modeling and affect the recommendation perform… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  35. arXiv:2402.02709  [pdf, ps, other

    quant-ph

    Passive decoy-state quantum secure direct communication with heralded single-photon source

    Authors: Jia-Wei Ying, Peng Zhao, Wei Zhong, Ming-Ming Du, Xi-Yun Li, Shu-Ting Shen, An-Lei Zhang, Lan Zhou, Yu-Bo Sheng

    Abstract: Quantum secure direct communications (QSDC) can directly transmit secret messages through quantum channel without keys. The imperfect photon source is a major obstacle for QSDC's practical implementation. The unwanted vacuum state and multi-photon components emitted from imperfect photon source largely reduce QSDC's secrecy message capacity and even threaten its security. In the paper, we propose… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 11 pages, 3 figures

  36. arXiv:2402.00578  [pdf, other

    astro-ph.HE

    Discovery and timing of pulsar J2016$+$3711 in supernova remnant CTB 87 with FAST

    Authors: Qian-Cheng Liu, Wen-Juan Zhong, Yang Chen, Pei Wang, ** Zhou, You-Ling Yue, Di Li

    Abstract: We report on our discovery of the radio pulsar, PSR J2016$+$3711, in supernova remnant (SNR) CTB 87, with a $\sim10.8σ$ significance of pulses, which confirms the compact nature of the X-ray point source in CTB 87. It is the first pulsar discovered in SNRs using Five-hundred-meter Aperture Spherical radio Telescope (FAST). Its integrated radio pulse profile can be well described by a single compon… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 7 pages, 5 figures, accepted for publication in MNRAS

  37. arXiv:2401.17167  [pdf, other

    cs.CL

    Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios

    Authors: Shijue Huang, Wanjun Zhong, Jianqiao Lu, Qi Zhu, Jiahui Gao, Weiwen Liu, Yutai Hou, Xingshan Zeng, Yasheng Wang, Lifeng Shang, Xin Jiang, Ruifeng Xu, Qun Liu

    Abstract: The recent trend of using Large Language Models (LLMs) as tool agents in real-world applications underscores the necessity for comprehensive evaluations of their capabilities, particularly in complex scenarios involving planning, creating, and using tools. However, existing benchmarks typically focus on simple synthesized queries that do not reflect real-world complexity, thereby offering limited… ▽ More

    Submitted 3 June, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted by ACL2024 Findings

  38. arXiv:2401.15670  [pdf, other

    cs.CL cs.AI cs.LG

    YODA: Teacher-Student Progressive Learning for Language Models

    Authors: Jianqiao Lu, Wanjun Zhong, Yufei Wang, Zhijiang Guo, Qi Zhu, Wenyong Huang, Yanlin Wang, Fei Mi, Baojun Wang, Yasheng Wang, Lifeng Shang, Xin Jiang, Qun Liu

    Abstract: Although large language models (LLMs) have demonstrated adeptness in a range of tasks, they still lag behind human learning efficiency. This disparity is often linked to the inherent human capacity to learn from basic examples, gradually generalize and handle more complex problems, and refine their skills with continuous feedback. Inspired by this, this paper introduces YODA, a novel teacher-stude… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: 14 pages, 4 figures, 3 tables

  39. arXiv:2401.11657  [pdf

    physics.optics physics.med-ph

    A photon-level broadband dual-comb interferometer for turbulent open-air trace gases detection application

    Authors: Wei Zhong, Yingyu Liu, Qin Yin, Ruocan Zhao, Yiwei Ding, Chong Wang, Tindi Chen, Xiankang Dou, Xianghui Xue

    Abstract: Open-path dual-comb spectroscopy (DCS) significantly enhances our understanding of regional trace gases. However, due to technical challenges, cost considerations, and eye-safety regulations, its sensing range and flexibility remain limited. The photon-counting DCS demonstrated recently heralds potential innovations over open-path DCS. Nevertheless, a major challenge in open-air applications of th… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: 24 pages, 10 figures

  40. arXiv:2401.06300  [pdf, other

    quant-ph cond-mat.dis-nn cs.AI cs.LG

    Advantage of Quantum Neural Networks as Quantum Information Decoders

    Authors: Weishun Zhong, Oles Shtanko, Ramis Movassagh

    Abstract: A promising strategy to protect quantum information from noise-induced errors is to encode it into the low-energy states of a topological quantum memory device. However, readout errors from such memory under realistic settings is less understood. We study the problem of decoding quantum information encoded in the groundspaces of topological stabilizer Hamiltonians in the presence of generic pertur… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: 25 pages, 5 figures

  41. arXiv:2401.05975  [pdf, other

    cs.IR cs.AI

    End-to-end Learnable Clustering for Intent Learning in Recommendation

    Authors: Yue Liu, Shihao Zhu, Jun Xia, Yingwei Ma, Jian Ma, Wenliang Zhong, Xinwang Liu, Guannan Zhang, Kejun Zhang

    Abstract: Intent learning, which aims to learn users' intents for user understanding and item recommendation, has become a hot research spot in recent years. However, the existing methods suffer from complex and cumbersome alternating optimization, limiting the performance and scalability. To this end, we propose a novel intent learning method termed \underline{ELCRec}, by unifying behavior representation l… ▽ More

    Submitted 2 February, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: 24 pages

  42. arXiv:2401.03906  [pdf, other

    math.CO

    Reconstruction of hypermatrices from subhypermatrices

    Authors: Xiande Zhang, Wenjie Zhong

    Abstract: For a given $n$, what is the smallest number $k$ such that every sequence of length $n$ is determined by the multiset of all its $k$-subsequences? This is called the $k$-deck problem for sequence reconstruction, and has been generalized to the two-dimensional case -- reconstruction of $n\times n$-matrices from submatrices. Previous works show that the smallest $k$ is at most $O(n^\frac{1}{2})$ for… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: 25 pages, 4 figures

  43. arXiv:2401.02035  [pdf, ps, other

    cs.IT

    Efficient Information Geometry Approach for Massive MIMO-OFDM Channel Estimation

    Authors: Jiyuan Yang, Yan Chen, Mingrui Fan, An-An Lu, Wen Zhong, Xiqi Gao, Xiaohu You, Xiang-Gen Xia, Dirk Slock

    Abstract: We investigate the channel estimation for massive multiple-input multiple-output orthogonal frequency division multiplexing (MIMO-OFDM) systems. We revisit the information geometry approach (IGA) for massive MIMO-OFDM channel estimation. By using the constant magnitude property of the entries of the measurement matrix, we find that the second-order natural parameters of the distributions on all th… ▽ More

    Submitted 3 June, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

  44. arXiv:2312.17266  [pdf

    eess.IV cs.AI cs.CV cs.RO

    Automatic laminectomy cutting plane planning based on artificial intelligence in robot assisted laminectomy surgery

    Authors: Zhuofu Li, Yonghong Zhang, Chengxia Wang, Shanshan Liu, Xiongkang Song, Xuquan Ji, Shuai Jiang, Woquan Zhong, Lei Hu, Weishi Li

    Abstract: Objective: This study aims to use artificial intelligence to realize the automatic planning of laminectomy, and verify the method. Methods: We propose a two-stage approach for automatic laminectomy cutting plane planning. The first stage was the identification of key points. 7 key points were manually marked on each CT image. The Spatial Pyramid Upsampling Network (SPU-Net) algorithm developed by… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

  45. arXiv:2312.17109  [pdf, other

    cs.CV cs.AI cs.CL

    MIVC: Multiple Instance Visual Component for Visual-Language Models

    Authors: Wenyi Wu, Qi Li, Wenliang Zhong, Junzhou Huang

    Abstract: Vision-language models have been widely explored across a wide range of tasks and achieve satisfactory performance. However, it's under-explored how to consolidate entity understanding through a varying number of images and to align it with the pre-trained language models for generative tasks. In this paper, we propose MIVC, a general multiple instance visual component to bridge the gap between va… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: Accepted at WACV 2024

  46. arXiv:2312.16484  [pdf

    cond-mat.supr-con cond-mat.mtrl-sci

    Emergence of superconductivity near 11 K by suppressing the 3-fold helical-chain structure in noncentrosymmetric HgS

    Authors: He Zhang, Wei Zhong, Yanghao Meng, Bowen Tang, Binbin Yue, Xiaohui Yu, Fang Hong

    Abstract: The trigonal $α$-HgS has a 3-fold helical chain structure, and is in form of a noncentrosymmetric $P3_121$ phase, known as the cinnabar phase. However, under pressure, the helical chains gradually approach and connect with each other, finally reconstructing into a centrosymmetric NaCl structure at 21 GPa. Superconductivity emerges just after this helical-nonhelical structural transition. The maxim… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: 16 pages, 6 figures

  47. arXiv:2312.11370  [pdf, other

    cs.CL

    G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model

    Authors: Jiahui Gao, Renjie Pi, Jipeng Zhang, Jiacheng Ye, Wanjun Zhong, Yufei Wang, Lanqing Hong, Jianhua Han, Hang Xu, Zhenguo Li, Lingpeng Kong

    Abstract: Large language models (LLMs) have shown remarkable proficiency in human-level reasoning and generation capabilities, which encourages extensive research on their application in mathematical problem solving. However, current work has been largely focused on text-based mathematical problems, with limited investigation in problems involving geometric information. Addressing this gap, we aim to enable… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 10 pages

  48. arXiv:2312.01916  [pdf, other

    cs.IR

    PEACE: Prototype lEarning Augmented transferable framework for Cross-domain rEcommendation

    Authors: Chun**g Gan, Bo Huang, Binbin Hu, Jian Ma, Ziqi Liu, Zhiqiang Zhang, Jun Zhou, Guannan Zhang, Wenliang Zhong

    Abstract: To help merchants/customers to provide/access a variety of services through miniapps, online service platforms have occupied a critical position in the effective content delivery, in which how to recommend items in the new domain launched by the service provider for customers has become more urgent. However, the non-negligible gap between the source and diversified target domains poses a considera… ▽ More

    Submitted 17 December, 2023; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Accepted by WSDM 2024

  49. arXiv:2312.01700  [pdf, other

    cs.CL cs.AI

    Data Management For Large Language Models: A Survey

    Authors: Zige Wang, Wanjun Zhong, Yufei Wang, Qi Zhu, Fei Mi, Baojun Wang, Lifeng Shang, Xin Jiang, Qun Liu

    Abstract: Data plays a fundamental role in the training of Large Language Models (LLMs). Effective data management, particularly in the formulation of a well-suited training dataset, holds significance for enhancing model performance and improving training efficiency during pretraining and supervised fine-tuning phases. Despite the considerable importance of data management, the current research community s… ▽ More

    Submitted 25 December, 2023; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Work in progress

  50. arXiv:2312.00553  [pdf

    cs.HC eess.SP

    A Spatio-Temporal Graph Convolutional Network for Gesture Recognition from High-Density Electromyography

    Authors: Wenjuan Zhong, Yuyang Zhang, Peiwen Fu, Wenxuan Xiong, Mingming Zhang

    Abstract: Accurate hand gesture prediction is crucial for effective upper-limb prosthetic limbs control. As the high flexibility and multiple degrees of freedom exhibited by human hands, there has been a growing interest in integrating deep networks with high-density surface electromyography (HD-sEMG) grids to enhance gesture recognition capabilities. However, many existing methods fall short in fully explo… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.