Skip to main content

Showing 151–200 of 10,323 results for author: Zhang, Z

.
  1. arXiv:2406.06567  [pdf, other

    cs.LG cs.AI cs.CL

    DHA: Learning Decoupled-Head Attention from Transformer Checkpoints via Adaptive Heads Fusion

    Authors: Yilong Chen, Linhao Zhang, Junyuan Shang, Zhenyu Zhang, Tingwen Liu, Shuohuan Wang, Yu Sun

    Abstract: Large language models (LLMs) with billions of parameters demonstrate impressive performance. However, the widely used Multi-Head Attention (MHA) in LLMs incurs substantial computational and memory costs during inference. While some efforts have optimized attention mechanisms by pruning heads or sharing parameters among heads, these methods often lead to performance degradation or necessitate subst… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 pages, 9 figures, 3 tables

  2. arXiv:2406.06518  [pdf, other

    cs.LG

    Data Augmentation for Multivariate Time Series Classification: An Experimental Study

    Authors: Romain Ilbert, Thai V. Hoang, Zonghua Zhang

    Abstract: Our study investigates the impact of data augmentation on the performance of multivariate time series models, focusing on datasets from the UCR archive. Despite the limited size of these datasets, we achieved classification accuracy improvements in 10 out of 13 datasets using the Rocket and InceptionTime models. This highlights the essential role of sufficient data in training effective models, pa… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Workshop on Multivariate Time Series Analytics (MulTiSA), ICDE Workshop

  3. arXiv:2406.06118  [pdf, other

    hep-ex

    Strong and weak $CP$ tests in sequential decays of polarized $Σ^0$ hyperons

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

    Abstract: The $J/ψ, ψ(3686) \to Σ^0 \barΣ^{0}$ processes and subsequent decays are studied using the world's largest $J/ψ$ and $ψ(3686)$ data samples collected with the BESIII detector. The strong-$CP$ symmetry is tested in the decays of the $Σ^0$ hyperons for the first time by measuring the decay parameters, $α_{Σ^0} = -0.0017 \pm 0.0021 \pm 0.0018$ and $\barα_{Σ^0} = 0.0021 \pm 0.0020 \pm 0.0022$. The wea… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  4. arXiv:2406.06087  [pdf, other

    cs.CV

    GAIA: Rethinking Action Quality Assessment for AI-Generated Videos

    Authors: Zijian Chen, Wei Sun, Yuan Tian, Jun Jia, Zicheng Zhang, Jiarui Wang, Ru Huang, Xiongkuo Min, Guangtao Zhai, Wenjun Zhang

    Abstract: Assessing action quality is both imperative and challenging due to its significant impact on the quality of AI-generated videos, further complicated by the inherently ambiguous nature of actions within AI-generated video (AIGV). Current action quality assessment (AQA) algorithms predominantly focus on actions from real specific scenarios and are pre-trained with normative action features, thus ren… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 28 pages, 13 figures

  5. Modeling User Retention through Generative Flow Networks

    Authors: Ziru Liu, Shuchang Liu, Bin Yang, Zhenghai Xue, Qingpeng Cai, Xiangyu Zhao, Zijian Zhang, Lantao Hu, Han Li, Peng Jiang

    Abstract: Recommender systems aim to fulfill the user's daily demands. While most existing research focuses on maximizing the user's engagement with the system, it has recently been pointed out that how frequently the users come back for the service also reflects the quality and stability of recommendations. However, optimizing this user retention behavior is non-trivial and poses several challenges includi… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: KDD-ADS 2024

  6. arXiv:2406.06039  [pdf, other

    cs.CV

    Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset

    Authors: Shijie Lian, Ziyi Zhang, Hua Li, Wenjie Li, Laurence Tianruo Yang, Sam Kwong, Runmin Cong

    Abstract: With the breakthrough of large models, Segment Anything Model (SAM) and its extensions have been attempted to apply in diverse tasks of computer vision. Underwater salient instance segmentation is a foundational and vital step for various underwater vision tasks, which often suffer from low segmentation accuracy due to the complex underwater circumstances and the adaptive ability of models. Moreov… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2024, Code released at: https://github.com/LiamLian0727/USIS10K

  7. arXiv:2406.05961  [pdf, other

    eess.AS

    BS-PLCNet 2: Two-stage Band-split Packet Loss Concealment Network with Intra-model Knowledge Distillation

    Authors: Zihan Zhang, Xianjun Xia, Chuanzeng Huang, Yijian Xiao, Lei Xie

    Abstract: Audio packet loss is an inevitable problem in real-time speech communication. A band-split packet loss concealment network (BS-PLCNet) targeting full-band signals was recently proposed. Although it performs superiorly in the ICASSP 2024 PLC Challenge, BS-PLCNet is a large model with high computational complexity of 8.95G FLOPS. This paper presents its updated version, BS-PLCNet 2, to reduce comput… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  8. arXiv:2406.05955  [pdf, other

    cs.LG cs.CL

    Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters

    Authors: Yixin Song, Haotong Xie, Zhengyan Zhang, Bo Wen, Li Ma, Zeyu Mi, Haibo Chen

    Abstract: Exploiting activation sparsity is a promising approach to significantly accelerating the inference process of large language models (LLMs) without compromising performance. However, activation sparsity is determined by activation functions, and commonly used ones like SwiGLU and GeGLU exhibit limited sparsity. Simply replacing these functions with ReLU fails to achieve sufficient sparsity. Moreove… ▽ More

    Submitted 10 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  9. arXiv:2406.05898  [pdf, other

    cs.IR cs.AI cs.LG

    Async Learned User Embeddings for Ads Delivery Optimization

    Authors: Mingwei Tang, Meng Liu, Hong Li, Junjie Yang, Chenglin Wei, Boyang Li, Dai Li, Rengan Xu, Yifan Xu, Zehua Zhang, Xiangyu Wang, Linfeng Liu, Yuelei Xie, Chengye Liu, Labib Fawaz, Li Li, Hongnan Wang, Bill Zhu, Sri Reddy

    Abstract: In recommendation systems, high-quality user embeddings can capture subtle preferences, enable precise similarity calculations, and adapt to changing preferences over time to maintain relevance. The effectiveness of recommendation systems depends on the quality of user embedding. We propose to asynchronously learn high fidelity user embeddings for billions of users each day from sequence based mul… ▽ More

    Submitted 23 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: Accepted by workshop on Multimodal Representation and Retrieval at SIGIR 2024, Washington DC

  10. arXiv:2406.05827  [pdf, ps, other

    hep-ex

    Measurement of the integrated luminosity of the data collected at 3.773 GeV by BESIII from 2021 to 2024

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (634 additional authors not shown)

    Abstract: We present a measurement of the integrated luminosity of $e^+e^-$ collision data collected with the BESIII detector at the BEPCII collider at a center-of-mass energy of $E_{\rm cm} = 3.773$~GeV. The integrated luminosities of the data sets taken from December 2021 to June 2022, from November 2022 to June 2023, and from October 2023 to February 2024 are determined to be $4.995 \pm 0.019$~fb$^{-1}$,… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  11. arXiv:2406.05746  [pdf

    cs.AI cs.HC cs.LG

    Methodology and Real-World Applications of Dynamic Uncertain Causality Graph for Clinical Diagnosis with Explainability and Invariance

    Authors: Zhan Zhang, Qin Zhang, Yang Jiao, Lin Lu, Lin Ma, Aihua Liu, Xiao Liu, Juan Zhao, Yajun Xue, Bing Wei, Mingxia Zhang, Ru Gao, Hong Zhao, Jie Lu, Fan Li, Yang Zhang, Yiming Wang, Lei Zhang, Fengwei Tian, Jie Hu, Xin Gou

    Abstract: AI-aided clinical diagnosis is desired in medical care. Existing deep learning models lack explainability and mainly focus on image analysis. The recently developed Dynamic Uncertain Causality Graph (DUCG) approach is causality-driven, explainable, and invariant across different application scenarios, without problems of data collection, labeling, fitting, privacy, bias, generalization, high cost… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Journal ref: Artificaial Intelligence Review, (2024) 57:151

  12. arXiv:2406.05657  [pdf, other

    physics.ins-det

    Single channel PICOSEC Micromegas detector with improved time resolution

    Authors: A. Utrobicic, R. Aleksan, Y. Angelis, J. Bortfeldt, F. Brunbauer, M. Brunoldi, E. Chatzianagnostou, J. Datta, K. Dehmelt, G. Fanourakis, D. Fiorina, K. J. Floethner, M. Gallinaro, F. Garcia, I. Giomataris, K. Gnanvo, F. J. Iguaz, D. Janssens, A. Kallitsopoulou, M. Kovacic, B. Kross, P. Legou, M. Lisowska, J. Liu, M. Lupberger , et al. (25 additional authors not shown)

    Abstract: This paper presents design guidelines and experimental verification of a single-channel PICOSEC Micromegas (MM) detector with an improved time resolution. The design encompasses the detector board, vessel, auxiliary mechanical parts, and electrical connectivity for high voltage (HV) and signals, focusing on improving stability, reducing noise, and ensuring signal integrity to optimize timing perfo… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  13. arXiv:2406.05625  [pdf, other

    cs.CL

    ATLAS: Improving Lay Summarisation with Attribute-based Control

    Authors: Zhihao Zhang, Tomas Goldsack, Carolina Scarton, Chenghua Lin

    Abstract: Lay summarisation aims to produce summaries of scientific articles that are comprehensible to non-expert audiences. However, previous work assumes a one-size-fits-all approach, where the content and style of the produced summary are entirely dependent on the data used to train the model. In practice, audiences with different levels of expertise will have specific needs, impacting what content shou… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  14. arXiv:2406.05513  [pdf, ps, other

    cs.CV

    A Two-Stage Adverse Weather Semantic Segmentation Method for WeatherProof Challenge CVPR 2024 Workshop UG2+

    Authors: Jianzhao Wang, Yanyan Wei, Dehua Hu, Yilin Zhang, Shengeng Tang, Dan Guo, Zhao Zhang

    Abstract: This technical report presents our team's solution for the WeatherProof Dataset Challenge: Semantic Segmentation in Adverse Weather at CVPR'24 UG2+. We propose a two-stage deep learning framework for this task. In the first stage, we preprocess the provided dataset by concatenating images into video sequences. Subsequently, we leverage a low-rank video deraining method to generate high-fidelity ps… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  15. arXiv:2406.05452  [pdf, other

    eess.SP cs.IT

    Near-Field Channel Estimation for Extremely Large-Scale Terahertz Communications

    Authors: Songjie Yang, Yizhou Peng, Wanting Lyu, Ya Li, Hongjun He, Zhongpei Zhang, Chau Yuen

    Abstract: Future Terahertz communications exhibit significant potential in accommodating ultra-high-rate services. Employing extremely large-scale array antennas is a key approach to realize this potential, as they can harness substantial beamforming gains to overcome the severe path loss and leverage the electromagnetic advantages in the near field. This paper proposes novel estimation methods designed to… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  16. arXiv:2406.05318  [pdf

    cs.CV cs.AI

    Integrating Text and Image Pre-training for Multi-modal Algorithmic Reasoning

    Authors: Zijian Zhang, Wei Liu

    Abstract: In this paper, we present our solution for SMART-101 Challenge of CVPR Multi-modal Algorithmic Reasoning Task 2024. Unlike traditional visual questions and answer tasks, this challenge evaluates abstraction, deduction and generalization ability of neural network in solving visuo-linguistic puzzles designed for specially children in the 6-8 age group. Our model is based on two pre-trained models, d… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  17. arXiv:2406.05168  [pdf, other

    cond-mat.mes-hall physics.optics

    Topological photonic alloy

    Authors: Tiantao Qu, Mudi Wang, Xiaoyu Cheng, Xiaohan Cui, Ruo-Yang Zhang, Zhao-Qing Zhang, Lei Zhang, Jun Chen, C. T. Chan

    Abstract: We present the new concept of photonic alloy as a non-periodic topological material. By mixing non-magnetized and magnetized rods in a non-periodic 2D photonic crystal configuration, we realized photonic alloys in the microwave regime. Our experimental findings reveal that the photonic alloy sustains non-reciprocal chiral edge states (CESs) even at very low concentration of magnetized rods. The no… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 6 pages, 4 figures

    Journal ref: Phys. Rev. Lett. 132, 223802 (2024)

  18. arXiv:2406.04906  [pdf, other

    cs.CV cs.AI

    RU-AI: A Large Multimodal Dataset for Machine Generated Content Detection

    Authors: Liting Huang, Zhihao Zhang, Yiran Zhang, Xiyue Zhou, Shou** Wang

    Abstract: The recent advancements in generative AI models, which can create realistic and human-like content, are significantly transforming how people communicate, create, and work. While the appropriate use of generative AI models can benefit the society, their misuse poses significant threats to data reliability and authentication. However, due to a lack of aligned multimodal datasets, effective and robu… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  19. Magnetism of $\mathrm{NaYbS_2}$: From finite temperatures to ground state

    Authors: Weizhen Zhuo, Zheng Zhang, Mingtai Xie, Anmin Zhang, Jianting Ji, Feng **, Qingming Zhang

    Abstract: Rare-earth chalcogenide compounds $\mathrm{ARECh_2}$ (A = alkali or monovalent metal, RE = rare earth, Ch = O, S, Se, Te) are a large family of quantum spin liquid (QSL) candidate materials. $\mathrm{NaYbS_2}$ is a representative member of the family. Several key issues on $\mathrm{NaYbS_2}$, particularly how to determine the highly anisotropic spin Hamiltonian and describe the magnetism at finite… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 13 pages, 6 figures

    Journal ref: Science China Physics, Mechanics & Astronomy (2024)

  20. arXiv:2406.04504  [pdf, other

    math.NA math-ph

    Mixed Finite Element Method for Multi-layer Elastic Contact Systems

    Authors: Zhizhuo Zhang, Mikaël Barboteu, Xiaobing Nie, Serge Dumont, Mahmoud Abdel-Aty, **de Cao

    Abstract: With the development of multi-layer elastic systems in the field of engineering mechanics, the corresponding variational inequality theory and algorithm design have received more attention and research. In this study, a class of equivalent saddle point problems with interlayer Tresca friction conditions and the mixed finite element method are proposed and analyzed. Then, the convergence of the num… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  21. arXiv:2406.04499  [pdf, other

    math.NA math-ph

    A layer decomposition method for multi-layer elastic contact systems with interlayer Tresca friction

    Authors: Zhizhuo Zhang, Xiaobing Nie, Mikaël Barboteu, **de Cao

    Abstract: With the increasing demand for the accuracy of numerical simulation of pavement mechanics, the variational inequality model and its induced finite element method which can simulate the interlayer contact state becomes a potential solution. In this paper, a layer decomposition algorithm for solving variational inequality models of multi-layer elastic contact systems with interlayer Tresca friction… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  22. arXiv:2406.04324  [pdf, other

    cs.CV eess.IV

    SF-V: Single Forward Video Generation Model

    Authors: Zhixing Zhang, Yanyu Li, Yushu Wu, Yanwu Xu, Anil Kag, Ivan Skorokhodov, Willi Menapace, Aliaksandr Siarohin, Junli Cao, Dimitris Metaxas, Sergey Tulyakov, Jian Ren

    Abstract: Diffusion-based video generation models have demonstrated remarkable success in obtaining high-fidelity videos through the iterative denoising process. However, these models require multiple denoising steps during sampling, resulting in high computational costs. In this work, we propose a novel approach to obtain single-step video generation models by leveraging adversarial training to fine-tune p… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Project Page: https://snap-research.github.io/SF-V

  23. arXiv:2406.03877  [pdf, other

    cs.RO cs.CV

    Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous Driving

    Authors: Xiaosong Jia, Zhenjie Yang, Qifeng Li, Zhiyuan Zhang, Junchi Yan

    Abstract: In an era marked by the rapid scaling of foundation models, autonomous driving technologies are approaching a transformative threshold where end-to-end autonomous driving (E2E-AD) emerges due to its potential of scaling up in the data-driven manner. However, existing E2E-AD methods are mostly evaluated under the open-loop log-replay manner with L2 errors and collision rate as metrics (e.g., in nuS… ▽ More

    Submitted 11 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: Fix typos in text and Table 4. More reference

  24. arXiv:2406.03876  [pdf

    physics.optics cond-mat.mes-hall

    Time-resolved optical assessment of exciton formation in mixed two-dimensional perovskite films

    Authors: Zheng Zhang, Jianan Wang, Yijie Shi, Xi Wang, Zhong Wang, Xiangyu Zhu, Chunlong Hu, Zonghao Liu, Wei Chen, Wenxi Liang

    Abstract: We report the observation of exciton formation from the cooled band-edge carriers in mixed two-dimensional hybrid organic-inorganic perovskites using femtosecond transient absorption spectroscopy. By monitoring the changes of bleach signal upon excitations with various photon energy, we are able to extract the values of exciton binding energy and the occupancies of carriers of free and bound state… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Main text: 15 pages, 4 figures. Supplementary Information: 16 pages, 16 figures, 10 tables

  25. arXiv:2406.03875  [pdf, other

    eess.SY

    Energy-storing analysis and fishtail stiffness optimization for a wire-driven elastic robotic fish

    Authors: Xiaocun Liao, Chao Zhou, Junfeng Fan, Zhuoliang Zhang, Zhaoran Yin, Liangwei Deng

    Abstract: The robotic fish with high propulsion efficiency and good maneuverability achieves underwater fishlike propulsion by commonly adopting the motor to drive the fishtail, causing the significant fluctuations of the motor power due to the uneven swing speed of the fishtail in one swing cycle. Hence, we propose a wire-driven robotic fish with a spring-steel-based active-segment elastic spine. This bion… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 14 pages, 19 figures

  26. arXiv:2406.03848  [pdf, other

    physics.ao-ph cs.AI cs.LG

    OceanCastNet: A Deep Learning Ocean Wave Model with Energy Conservation

    Authors: Ziliang Zhang, Huaming Yu, Danqin Ren

    Abstract: Traditional wave forecasting models, although based on energy conservation equations, are computationally expensive. On the other hand, existing deep learning geophysical fluid models, while computationally efficient, often suffer from issues such as energy dissipation in long-term forecasts. This paper proposes a novel energy-balanced deep learning wave forecasting model called OceanCastNet (OCN)… ▽ More

    Submitted 9 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  27. arXiv:2406.03844  [pdf, other

    nucl-th astro-ph.HE hep-ph nucl-ex

    PREX and CREX: Evidence for Strong Isovector Spin-Orbit Interaction

    Authors: Tong-Gang Yue, Zhen Zhang, Lie-Wen Chen

    Abstract: The recent PREX-2 and CREX data on the model-independent extraction of the charge-weak form factor difference $ΔF_{\rm CW}$ in $^{208}$Pb and $^{48}$Ca challenge modern nuclear energy density functionals (EDFs) as well as our present understanding on the neutron skin and nuclear symmetry energy. Within the Skyrme-like EDFs, we demonstrate that the isovector spin-orbit interaction can strongly chan… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 14 pages, 6 figures (including Supplemental Material)

  28. arXiv:2406.03842  [pdf, ps, other

    math.AP

    Blow-up of cylindrically symmetric solutions for Fractional NLS

    Authors: Tianxiang Gou, Vicentiu D. Radulescu, Zhitao Zhang

    Abstract: In this paper, we consider blow-up of solutions to the Cauchy problem for the following fractional NLS, $$ \textnormal{i} \, \partial_t u=(-Δ)^s u-|u|^{2 σ} u \quad \text{in} \,\, \R \times \R^N, $$ where $N \geq 2$, $1/2 <s<1$ and $0<σ<2s/(N-2s)$. In the mass critical and supercritical cases, we establish a criterion for blow-up of solutions to the problem for cylindrically symmetric data. And we… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 9 pages

    MSC Class: 35R11; 35B44

  29. arXiv:2406.03840  [pdf, ps, other

    hep-ph

    Global tensor polarization of spin $3/2$ hadrons and quark spin correlations in relativistic heavy ion collisions

    Authors: Zhe Zhang, Ji-peng Lv, Zi-han Yu, Zuo-tang Liang

    Abstract: We study the global polarization of spin-$3/2$ hadrons in relativistic heavy ion collisions. We show in particular that the global tensor polarizations of rank two or three for spin-$3/2$ hadrons are sensitive to the local two or three quark spin correlations respectively in the quark gluon plasma produced in the collision processes. We present the relationships between these measurable tensor pol… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 11 pages

  30. arXiv:2406.03817  [pdf, other

    cond-mat.stat-mech math-ph

    Field Theory of Active Brownian Particles with Dry Friction

    Authors: Ziluo Zhang, Shurui Yuan, Shigeyuki Komura

    Abstract: We present a field theoretic approach to capture the motion of a particle with dry friction for one- and two-dimensional diffusive particles, and further expand the framework for two-dimensional active Brownian particles. Starting with the Fokker-Planck equation and introducing the Hermite polynomials as the corresponding eigen-functions, we obtain the actions and propagators. Using a perturbation… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  31. arXiv:2406.03758  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Phonon heat conduction across slippery interfaces in twisted graphite

    Authors: Fuwei Yang, Wenjiang Zhou, Zhibin Zhang, Xuanyu Huang, **gwen Zhang, Nianjie Liang, Wujuan Yan, Yuxi Wang, Mingchao Ding, Quanlin Guo, Yu Han, Te-Huan Liu, Kaihui Liu, Quanshui Zheng, Bai Song

    Abstract: Interlayer rotation in van der Waals (vdW) materials offers great potential for manipulating phonon dynamics and heat flow in advanced electronics with ever higher compactness and power density. However, despite extensive theoretical efforts in recent years, experimental measurements remain scarce especially due to the critical challenges of preparing single-crystalline twisted interfaces and prob… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  32. arXiv:2406.03746  [pdf, other

    cs.CL cs.AI

    Efficient Knowledge Infusion via KG-LLM Alignment

    Authors: Zhouyu Jiang, Ling Zhong, Mengshu Sun, Jun Xu, Rui Sun, Hui Cai, Shuhan Luo, Zhiqiang Zhang

    Abstract: To tackle the problem of domain-specific knowledge scarcity within large language models (LLMs), knowledge graph-retrievalaugmented method has been proven to be an effective and efficient technique for knowledge infusion. However, existing approaches face two primary challenges: knowledge mismatch between public available knowledge graphs and the specific domain of the task at hand, and poor infor… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: ACL2024 Findings

  33. arXiv:2406.03712  [pdf, other

    cs.CL cs.LG

    A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions

    Authors: Lei Liu, Xiaoyan Yang, Junchi Lei, Xiaoyang Liu, Yue Shen, Zhiqiang Zhang, Peng Wei, **jie Gu, Zhixuan Chu, Zhan Qin, Kui Ren

    Abstract: Large language models (LLMs), such as GPT series models, have received substantial attention due to their impressive capabilities for generating and understanding human-level language. More recently, LLMs have emerged as an innovative and powerful adjunct in the medical field, transforming traditional practices and heralding a new era of enhanced healthcare services. This survey provides a compreh… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  34. arXiv:2406.03700  [pdf

    cond-mat.supr-con cond-mat.mes-hall

    Ferroelectricity-tuned band topology and superconductivity in two-dimensional materials and related heterostructures

    Authors: Jianyong Chen, ** Cui, Zhenyu Zhang

    Abstract: Ferroelectricity, band topology, and superconductivity are respectively local, global, and macroscopic properties of quantum materials, and understanding their mutual couplings offers unique opportunities for exploring rich physics and enhanced functionalities. In this mini-review, we attempt to highlight some of the latest advances in this vibrant area, focusing in particular on ferroelectricity-… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Invited Review for Adv.Funct.Mater.,comments are welcome

  35. arXiv:2406.03684  [pdf, other

    cs.CV cs.CR

    Principles of Designing Robust Remote Face Anti-Spoofing Systems

    Authors: Xiang Xu, Tianchen Zhao, Zheng Zhang, Zhihua Li, Jon Wu, Alessandro Achille, Mani Srivastava

    Abstract: Protecting digital identities of human face from various attack vectors is paramount, and face anti-spoofing plays a crucial role in this endeavor. Current approaches primarily focus on detecting spoofing attempts within individual frames to detect presentation attacks. However, the emergence of hyper-realistic generative models capable of real-time operation has heightened the risk of digitally g… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Under review

  36. Refactoring to Pythonic Idioms: A Hybrid Knowledge-Driven Approach Leveraging Large Language Models

    Authors: Zejun Zhang, Zhenchang Xing, Xiaoxue Ren, Qinghua Lu, Xiwei Xu

    Abstract: Pythonic idioms are highly valued and widely used in the Python programming community. However, many Python users find it challenging to use Pythonic idioms. Adopting a rule-based approach or LLM-only approach is not sufficient to overcome three persistent challenges of code idiomatization including code miss, wrong detection and wrong refactoring. Motivated by the determinism of rules and adaptab… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted by FSE 2024,22 pages

  37. arXiv:2406.03474  [pdf, other

    cs.CV

    AD-H: Autonomous Driving with Hierarchical Agents

    Authors: Zaibin Zhang, Shiyu Tang, Yuanhang Zhang, Talas Fu, Yifan Wang, Yang Liu, Dong Wang, **g Shao, Lijun Wang, Huchuan Lu

    Abstract: Due to the impressive capabilities of multimodal large language models (MLLMs), recent works have focused on employing MLLM-based agents for autonomous driving in large-scale and dynamic environments. However, prevalent approaches often directly translate high-level instructions into low-level vehicle control signals, which deviates from the inherent language generation paradigm of MLLMs and fails… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  38. arXiv:2406.03403  [pdf, other

    cs.LG cs.AI q-bio.QM

    Structure-based Drug Design Benchmark: Do 3D Methods Really Dominate?

    Authors: Kangyu Zheng, Yingzhou Lu, Zaixi Zhang, Zhongwei Wan, Yao Ma, Marinka Zitnik, Tianfan Fu

    Abstract: Currently, the field of structure-based drug design is dominated by three main types of algorithms: search-based algorithms, deep generative models, and reinforcement learning. While existing works have typically focused on comparing models within a single algorithmic category, cross-algorithm comparisons remain scarce. In this paper, to fill the gap, we establish a benchmark to evaluate the perfo… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  39. arXiv:2406.03387  [pdf, other

    hep-ex

    Measurement of the branching fraction ratios $R(D^{+})$ and $R(D^{*+})$ using muonic $τ$ decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1063 additional authors not shown)

    Abstract: The branching fraction ratios of $\overline{B}^0\to D^+τ^-\overlineν_τ$ and $\overline{B}^0\to D^{*+}τ^-\overlineν_τ$ decays are measured with respect to their muonic counterparts, using a data sample corresponding to an integrated luminosity of 2.0 fb$^{-1}$ collected by the LHCb experiment in proton-proton collisions at $\sqrt{s} = 13$ TeV. The reconstructed final states are formed by combining… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lhcbproject.web.cern.ch/Publications/LHCbProjectPublic/LHCb-PAPER-2024-007.html (LHCb public pages)

    Report number: LHCb-PAPER-2024-007, CERN-EP-2024-125

  40. arXiv:2406.03287  [pdf, other

    cs.NE cs.CL cs.LG

    SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms

    Authors: Xingrun Xing, Zheng Zhang, Ziyi Ni, Shitao Xiao, Yiming Ju, Siqi Fan, Yequan Wang, Jiajun Zhang, Guoqi Li

    Abstract: Towards energy-efficient artificial intelligence similar to the human brain, the bio-inspired spiking neural networks (SNNs) have advantages of biological plausibility, event-driven sparsity, and binary activation. Recently, large-scale language models exhibit promising generalization capability, making it a valuable issue to explore more general spike-driven models. However, the binary spikes in… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  41. arXiv:2406.03250  [pdf, other

    cs.CV cs.AI

    Prompt-based Visual Alignment for Zero-shot Policy Transfer

    Authors: Haihan Gao, Rui Zhang, Qi Yi, Hantao Yao, Haochen Li, Jiaming Guo, Shaohui Peng, Yunkai Gao, QiCheng Wang, Xing Hu, Yuanbo Wen, Zihao Zhang, Zidong Du, Ling Li, Qi Guo, Yunji Chen

    Abstract: Overfitting in RL has become one of the main obstacles to applications in reinforcement learning(RL). Existing methods do not provide explicit semantic constrain for the feature extractor, hindering the agent from learning a unified cross-domain representation and resulting in performance degradation on unseen domains. Besides, abundant data from multiple domains are needed. To address these issue… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: This paper has been accepted by ICML2024

  42. arXiv:2406.03199  [pdf, other

    cs.CL cs.AI cs.LG

    Bayesian WeakS-to-Strong from Text Classification to Generation

    Authors: Ziyun Cui, Ziyang Zhang, Wen Wu, Guangzhi Sun, Chao Zhang

    Abstract: Advances in large language models raise the question of how alignment techniques will adapt as models become increasingly complex and humans will only be able to supervise them weakly. Weak-to-Strong mimics such a scenario where weak model supervision attempts to harness the full capabilities of a much stronger model. This work extends Weak-to-Strong to WeakS-to-Strong by exploring an ensemble of… ▽ More

    Submitted 24 May, 2024; originally announced June 2024.

  43. arXiv:2406.03156  [pdf, other

    hep-ex

    Observation of new charmonium(-like) states in $B^+ \to D^{*\pm} D^{\mp} K^+$ decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1062 additional authors not shown)

    Abstract: A study of resonant structures in $B^{+}\rightarrow{D^{\ast+}D^{-}K^{+}}$ and $B^{+}\rightarrow{D^{\ast-}D^{+}K^{+}}$ decays is performed, using proton-proton collision data at centre-of-mass energies of $\sqrt{s}=7, 8$, and $13$ TeV recorded by the LHCb experiment, corresponding to an integrated luminosity of 9 fb$^{-1}$. A simultaneous amplitude fit is performed to the two channels with contribu… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-047.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-047, CERN-EP-2024-096

  44. arXiv:2406.03070  [pdf, other

    cs.CV cs.AI

    A-Bench: Are LMMs Masters at Evaluating AI-generated Images?

    Authors: Zicheng Zhang, Haoning Wu, Chunyi Li, Yingjie Zhou, Wei Sun, Xiongkuo Min, Zijian Chen, Xiaohong Liu, Weisi Lin, Guangtao Zhai

    Abstract: How to accurately and efficiently assess AI-generated images (AIGIs) remains a critical challenge for generative models. Given the high costs and extensive time commitments required for user studies, many researchers have turned towards employing large multi-modal models (LMMs) as AIGI evaluators, the precision and validity of which are still questionable. Furthermore, traditional benchmarks often… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  45. arXiv:2406.02931  [pdf, other

    hep-ex

    Measurements of the branching fractions of the $P$-wave charmonium spin-singlet state $h_c(^1P_1) \to h^+ h^-π^0/η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

    Abstract: Based on $(2712.4\pm 14.3)\times10^{6}$ $ψ(3686)$ events, we investigate four hadronic decay modes of the $P$-wave charmonium spin-singlet state $h_c(^1P_1) \to h^+ h^- π^0/η$ ($h=π$ or $K$) via the process $ψ(3686) \to π^{0}h_c$ at BESIII. The $h_c \to π^+ π^- π^0$ decay is observed with a significance of 9.6$σ$ after taking into account systematic uncertainties. Evidences for… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 9 pages, 7 figures

  46. arXiv:2406.02928  [pdf, other

    cond-mat.mtrl-sci

    Unveiling a Family of Dimerized Quantum Magnets in Ternary Metal Borides

    Authors: Zhen Zhang, Andrew P. Porter, Yang Sun, Kirill D. Belashchenko, Gayatri Viswanathan, Arka Sarkar, Kirill Kovnir, Kai-Ming Ho, Vladimir Antropov

    Abstract: Dimerized quantum magnets are exotic crystalline materials where Bose-Einstein condensation of magnetic excitations can happen. However, known dimerized quantum magnets are limited to only a few oxides and halides. Here, we unveil 9 dimerized quantum magnets and 11 conventional antiferromagnets in ternary metal borides MTB$_4$ (M = Sc, Y, La, Ce, Lu, Mg, Ca, Al; T = V, Cr, Mn, Fe, Co, Ni). In this… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  47. arXiv:2406.02854  [pdf

    eess.SY

    Development of an underwater inductive coupling communication system with power carrier technology

    Authors: Zhongxing Zhang

    Abstract: Inductive coupling communication is one of the main methods of underwater communication systems due to its excellent comprehensive performance. However, the data transmission distance and operational power consumption need to be further enhanced. In this paper, an underwater induction coupling communication scheme based on power carrier technology is proposed to improve the transmission speed and… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  48. arXiv:2406.02669  [pdf, other

    quant-ph

    A generalized cycle benchmarking algorithm for characterizing mid-circuit measurements

    Authors: Zhihan Zhang, Senrui Chen, Yunchao Liu, Liang Jiang

    Abstract: Mid-circuit measurement (MCM) is a crucial ingredient in the development of fault-tolerant quantum computation. While there have been rapid experimental progresses in realizing MCM, a systematic method for characterizing noisy MCM is still under exploration. In this work we develop an algorithm to characterize noisy MCM, via a generalization of cycle benchmarking -- a standard approach for charact… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 27 pages, 9 figures

  49. arXiv:2406.02609  [pdf, other

    cs.LG cs.AI

    Less is More: Pseudo-Label Filtering for Continual Test-Time Adaptation

    Authors: Jiayao Tan, Fan Lyu, Chenggong Ni, Tingliang Feng, Fuyuan Hu, Zhang Zhang, Shaochuang Zhao, Liang Wang

    Abstract: Continual Test-Time Adaptation (CTTA) aims to adapt a pre-trained model to a sequence of target domains during the test phase without accessing the source data. To adapt to unlabeled data from unknown domains, existing methods rely on constructing pseudo-labels for all samples and updating the model through self-training. However, these pseudo-labels often involve noise, leading to insufficient ad… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.03335 by other authors

  50. arXiv:2406.02472  [pdf, other

    cs.CL

    Analyzing Temporal Complex Events with Large Language Models? A Benchmark towards Temporal, Long Context Understanding

    Authors: Zhihan Zhang, Yixin Cao, Chenchen Ye, Yunshan Ma, Lizi Liao, Tat-Seng Chua

    Abstract: The digital landscape is rapidly evolving with an ever-increasing volume of online news, emphasizing the need for swift and precise analysis of complex events. We refer to the complex events composed of many news articles over an extended period as Temporal Complex Event (TCE). This paper proposes a novel approach using Large Language Models (LLMs) to systematically extract and analyze the event c… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024