Skip to main content

Showing 151–200 of 9,507 results for author: Zhang, H

.
  1. arXiv:2406.07162  [pdf, other

    cs.SD cs.AI cs.CL cs.MM eess.AS

    EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark

    Authors: Ziyang Ma, Mingjie Chen, Hezhao Zhang, Zhisheng Zheng, Wenxi Chen, Xiquan Li, Jiaxin Ye, Xie Chen, Thomas Hain

    Abstract: Speech emotion recognition (SER) is an important part of human-computer interaction, receiving extensive attention from both industry and academia. However, the current research field of SER has long suffered from the following problems: 1) There are few reasonable and universal splits of the datasets, making comparing different models and methods difficult. 2) No commonly used benchmark covers nu… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted by INTERSPEECH 2024. GitHub Repository: https://github.com/emo-box/EmoBox

  2. arXiv:2406.07077  [pdf, other

    eess.SY

    Meta-Backscatter: A New ISAC Paradigm for Battery-Free Internet of Things

    Authors: Xu Liu, Hongliang Zhang, Kaigui Bian, Xi Weng, Lingyang Song

    Abstract: The meta-material sensor has been regarded as a next-generation sensing technology for the battery-free Internet of Things (IoT) due to its battery-free characteristic and improved sensing performance. The meta-material sensors function as backscatter tags that change their reflection coefficients with the conditions of sensing targets such as temperature and gas concentration, allowing transceive… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  3. arXiv:2406.07048  [pdf, other

    cs.RO

    GPU-Accelerated Optimization-Based Collision Avoidance

    Authors: Zeming Wu, Zhu** Wang, Hao Zhang

    Abstract: This paper proposes a GPU-accelerated optimization framework for collision avoidance problems where the controlled objects and the obstacles can be modeled as the finite union of convex polyhedra. A novel collision avoidance constraint is proposed based on scale-based collision detection and the strong duality of convex optimization. Under this constraint, the high-dimensional non-convex optimizat… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  4. arXiv:2406.07036  [pdf, other

    cs.CL cs.AI

    Paying More Attention to Source Context: Mitigating Unfaithful Translations from Large Language Model

    Authors: Hongbin Zhang, Kehai Chen, Xuefeng Bai, Yang Xiang, Min Zhang

    Abstract: Large language models (LLMs) have showcased impressive multilingual machine translation ability. However, unlike encoder-decoder style models, decoder-only LLMs lack an explicit alignment between source and target contexts. Analyzing contribution scores during generation processes revealed that LLMs can be biased towards previously generated tokens over corresponding source tokens, leading to unfa… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL2024 Findings

  5. arXiv:2406.06999  [pdf, other

    cs.CV

    Teaching with Uncertainty: Unleashing the Potential of Knowledge Distillation in Object Detection

    Authors: Junfei Yi, Jianxu Mao, Tengfei Liu, Mingjie Li, Hanyu Gu, Hui Zhang, Xiaojun Chang, Yaonan Wang

    Abstract: Knowledge distillation (KD) is a widely adopted and effective method for compressing models in object detection tasks. Particularly, feature-based distillation methods have shown remarkable performance. Existing approaches often ignore the uncertainty in the teacher model's knowledge, which stems from data noise and imperfect training. This limits the student model's ability to learn latent knowle… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  6. arXiv:2406.06918  [pdf, other

    cs.SE

    Towards more realistic evaluation of LLM-based code generation: an experimental study and beyond

    Authors: Dewu Zheng, Yanlin Wang, Ensheng Shi, Ruikai Zhang, Yuchi Ma, Hongyu Zhang, Zibin Zheng

    Abstract: To evaluate the code generation capabilities of Large Language Models (LLMs) in complex real-world software development scenarios, many evaluation approaches have been developed. They typically leverage contextual code from the latest version of a project to facilitate LLMs in accurately generating the desired function. However, such evaluation approaches fail to consider the dynamic evolution of… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  7. arXiv:2406.06697  [pdf, other

    astro-ph.GA

    A quasar-galaxy merger at $z\sim 6.2$: rapid host growth via accretion of two massive satellite galaxies

    Authors: Roberto Decarli, Federica Loiacono, Emanuele Paolo Farina, Massimo Dotti, Alessandro Lupi, Romain A. Meyer, Marco Mignoli, Antonio Pensabene, Michael A. Strauss, Bram Venemans, **yi Yang, Fabian Walter, Julien Wolf, Eduardo Bañados, Laura Blecha, Sarah Bosman, Chris L. Carilli, Andrea Comastri, Thomas Connor, Tiago Costa, Anna-Christina Eilers, Xiaohui Fan, Roberto Gilli, Hyunsung D. Jun, Weizhe Liu , et al. (16 additional authors not shown)

    Abstract: We present JWST/NIRSpec Integral Field Spectroscopy in the rest-frame optical bands of the system PJ308-21, a quasar at $z=6.2342$ caught as its host galaxy interacts with companion galaxies. We detect spatially extended emission of several emission lines (H$α$, H$β$, [OIII], [NII], [SII], HeII), which we use to study the properties of the ionized phase of the interstellar medium: the source and h… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 15 pages, 16 figures. Accepted for publication in A&A

  8. arXiv:2406.06678  [pdf, other

    astro-ph.GA

    Kinematics and Dynamics of the Galactic Bar revealed by Gaia Long Period Variables

    Authors: Han-Yuan Zhang, Vasily Belokurov, N. Wyn Evans, Sarah G. Kane, Jason L. Sanders

    Abstract: We take low-amplitude, long period variable (LA-LPV) candidates in Gaia DR3 as tracers of the kinematics and dynamics of the Milky Way bar. LA-LPVs, like other LPVs, have high luminosities and follow a tight period-luminosity relation, but unlike e.g. Mira variables, their radial velocity measurements are reliable due to their smaller pulsation amplitudes. We supplement the Gaia astrometric and ra… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 20 pages, 11 figures, submitted to MNRAS. Comments welcome

  9. arXiv:2406.06517  [pdf, other

    cs.CV

    Genomics-guided Representation Learning for Pathologic Pan-cancer Tumor Microenvironment Subtype Prediction

    Authors: Fangliangzi Meng, Hongrun Zhang, Ruodan Yan, Guohui Chuai, Chao Li, Qi Liu

    Abstract: The characterization of Tumor MicroEnvironment (TME) is challenging due to its complexity and heterogeneity. Relatively consistent TME characteristics embedded within highly specific tissue features, render them difficult to predict. The capability to accurately classify TME subtypes is of critical significance for clinical tumor diagnosis and precision medicine. Based on the observation that tumo… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  10. arXiv:2406.06367  [pdf, other

    cs.CV

    MVGamba: Unify 3D Content Generation as State Space Sequence Modeling

    Authors: Xuanyu Yi, Zike Wu, Qiuhong Shen, Qingshan Xu, Pan Zhou, Joo-Hwee Lim, Shuicheng Yan, Xinchao Wang, Hanwang Zhang

    Abstract: Recent 3D large reconstruction models (LRMs) can generate high-quality 3D content in sub-seconds by integrating multi-view diffusion models with scalable multi-view reconstructors. Current works further leverage 3D Gaussian Splatting as 3D representation for improved visual quality and rendering efficiency. However, we observe that existing Gaussian reconstruction models often suffer from multi-vi… ▽ More

    Submitted 20 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  11. arXiv:2406.06156  [pdf, other

    cs.SE

    Stronger, Cheaper and Demonstration-Free Log Parsing with LLMs

    Authors: Yi Xiao, Van-Hoang Le, Hongyu Zhang

    Abstract: Log parsing, the process of converting raw log messages into structured formats, is an important initial step for automated analysis of logs of large-scale software systems. Traditional log parsers often rely on heuristics or handcrafted features, which may not generalize well across diverse log sources or require extensive model tuning. Recently, some log parsers have utilized powerful generative… ▽ More

    Submitted 12 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  12. arXiv:2406.06118  [pdf, other

    hep-ex

    Strong and weak $CP$ tests in sequential decays of polarized $Σ^0$ hyperons

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

    Abstract: The $J/ψ, ψ(3686) \to Σ^0 \barΣ^{0}$ processes and subsequent decays are studied using the world's largest $J/ψ$ and $ψ(3686)$ data samples collected with the BESIII detector. The strong-$CP$ symmetry is tested in the decays of the $Σ^0$ hyperons for the first time by measuring the decay parameters, $α_{Σ^0} = -0.0017 \pm 0.0021 \pm 0.0018$ and $\barα_{Σ^0} = 0.0021 \pm 0.0020 \pm 0.0022$. The wea… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  13. arXiv:2406.06063  [pdf, other

    physics.comp-ph quant-ph

    Enabling Large-Scale and High-Precision Fluid Simulations on Near-Term Quantum Computers

    Authors: Zhao-Yun Chen, Teng-Yang Ma, Chuang-Chao Ye, Liang Xu, Ming-Yang Tan, Xi-Ning Zhuang, Xiao-Fan Xu, Yun-Jie Wang, Tai-** Sun, Yong Chen, Lei Du, Liang-Liang Guo, Hai-Feng Zhang, Hao-Ran Tao, Tian-Le Wang, Xiao-Yan Yang, Ze-An Zhao, Peng Wang, Sheng Zhang, Chi Zhang, Ren-Ze Zhao, Zhi-Long Jia, Wei-Cheng Kong, Meng-Han Dou, Jun-Chao Wang , et al. (7 additional authors not shown)

    Abstract: Quantum computational fluid dynamics (QCFD) offers a promising alternative to classical computational fluid dynamics (CFD) by leveraging quantum algorithms for higher efficiency. This paper introduces a comprehensive QCFD method, including an iterative method "Iterative-QLS" that suppresses error in quantum linear solver, and a subspace method to scale the solution to a larger size. We implement o… ▽ More

    Submitted 19 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: 31 pages, 10 figures

  14. arXiv:2406.06040  [pdf, other

    cs.CV

    Vript: A Video Is Worth Thousands of Words

    Authors: Dongjie Yang, Suyuan Huang, Chengqiang Lu, Xiaodong Han, Haoxin Zhang, Yan Gao, Yao Hu, Hai Zhao

    Abstract: Advancements in multimodal learning, particularly in video understanding and generation, require high-quality video-text datasets for improved model performance. Vript addresses this issue with a meticulously annotated corpus of 12K high-resolution videos, offering detailed, dense, and script-like captions for over 420K clips. Each clip has a caption of ~145 words, which is over 10x longer than mo… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: submitted to NeurIPS Dataset & Benchmark track

  15. arXiv:2406.06022  [pdf, other

    cs.LG cs.DC

    GraphStorm: all-in-one graph machine learning framework for industry applications

    Authors: Da Zheng, Xiang Song, Qi Zhu, Jian Zhang, Theodore Vasiloudis, Runjie Ma, Houyu Zhang, Zichen Wang, Soji Adeshina, Israt Nisa, Alejandro Mottini, Qingjun Cui, Huzefa Rangwala, Belinda Zeng, Christos Faloutsos, George Karypis

    Abstract: Graph machine learning (GML) is effective in many business applications. However, making GML easy to use and applicable to industry applications with massive datasets remain challenging. We developed GraphStorm, which provides an end-to-end solution for scalable graph construction, graph model training and inference. GraphStorm has the following desirable properties: (a) Easy to use: it can perfor… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Journal ref: KDD 2024

  16. arXiv:2406.05827  [pdf, ps, other

    hep-ex

    Measurement of the integrated luminosity of the data collected at 3.773 GeV by BESIII from 2021 to 2024

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (634 additional authors not shown)

    Abstract: We present a measurement of the integrated luminosity of $e^+e^-$ collision data collected with the BESIII detector at the BEPCII collider at a center-of-mass energy of $E_{\rm cm} = 3.773$~GeV. The integrated luminosities of the data sets taken from December 2021 to June 2022, from November 2022 to June 2023, and from October 2023 to February 2024 are determined to be $4.995 \pm 0.019$~fb$^{-1}$,… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  17. arXiv:2406.05780  [pdf, ps, other

    eess.SP

    Two-Stage Resource Allocation in Reconfigurable Intelligent Surface Assisted Hybrid Networks via Multi-Player Bandits

    Authors: **gwen Tong, Hongliang Zhang, Liqun Fu, Amir Leshem, Zhu Han

    Abstract: This paper considers a resource allocation problem where several Internet-of-Things (IoT) devices send data to a base station (BS) with or without the help of the reconfigurable intelligent surface (RIS) assisted cellular network. The objective is to maximize the sum rate of all IoT devices by finding the optimal RIS and spreading factor (SF) for each device. Since these IoT devices lack prior inf… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: This paper was published in IEEE Transcation on Communications

  18. arXiv:2406.05685  [pdf, other

    cs.SE

    Understanding Open Source Contributor Profiles in Popular Machine Learning Libraries

    Authors: Jiawen Liu, Haoxiang Zhang, Ying Zou

    Abstract: With the increasing popularity of machine learning (ML), many open-source software (OSS) contributors are attracted to develo** and adopting ML approaches. Comprehensive understanding of ML contributors is crucial for successful ML OSS development and maintenance. Without such knowledge, there is a risk of inefficient resource allocation and hindered collaboration in ML OSS projects. Existing re… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  19. arXiv:2406.05678  [pdf, other

    cs.CL

    SinkLoRA: Enhanced Efficiency and Chat Capabilities for Long-Context Large Language Models

    Authors: Hengyu Zhang

    Abstract: Extending the functionality of the Transformer model to accommodate longer sequence lengths has become a critical challenge. This extension is crucial not only for improving tasks such as language translation and long-context processing but also for enabling novel applications like chatbots, code generation, and multimedia content creation. The primary obstacle is the self-attention mechanism, whi… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: A rethinking of Short Shifted Attention

  20. OAM-SWIPT for IoE-Driven 6G

    Authors: Runyu Lyu, Wenchi Cheng, Bazhong Shen, Zhiyuan Ren, Hailin Zhang

    Abstract: Simultaneous wireless information and power transfer (SWIPT), which achieves both wireless energy transfer (WET) and information transfer, is an attractive technique for future Internet of Everything (IoE) in the sixth-generation (6G) mobile communications. With SWIPT, battery-less IoE devices can be powered while communicating with other devices. Line-of-sight (LOS) RF transmission and near-field… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 7 pages, 6 figures

    Journal ref: in IEEE Communications Magazine, vol. 60, no. 3, pp. 19-25, March 2022

  21. arXiv:2406.05514  [pdf, other

    cs.SE

    RAG-Enhanced Commit Message Generation

    Authors: Linghao Zhang, Hongyi Zhang, Chong Wang, Peng Liang

    Abstract: Commit message is one of the most important textual information in software development and maintenance. However, it is time-consuming and labor-intensive to write commit messages manually. Commit Message Generation (CMG) has become a research hotspot in automated software engineering. Researchers have proposed several methods for CMG and achieved great results. In recent years, CodeBERT, CodeT5,… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

  22. arXiv:2406.05412  [pdf

    cs.CV

    Select-Mosaic: Data Augmentation Method for Dense Small Object Scenes

    Authors: Hao Zhang, Shuaijie Zhang, Renbin Zou

    Abstract: Data augmentation refers to the process of applying a series of transformations or expansions to original data to generate new samples, thereby increasing the diversity and quantity of the data, effectively improving the performance and robustness of models. As a common data augmentation method, Mosaic data augmentation technique stitches multiple images together to increase the diversity and comp… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  23. arXiv:2406.05391  [pdf, other

    cs.LG

    DUPLEX: Dual GAT for Complex Embedding of Directed Graphs

    Authors: Zhaoru Ke, Hang Yu, Jianguo Li, Haipeng Zhang

    Abstract: Current directed graph embedding methods build upon undirected techniques but often inadequately capture directed edge information, leading to challenges such as: (1) Suboptimal representations for nodes with low in/out-degrees, due to the insufficient neighbor interactions; (2) Limited inductive ability for representing new nodes post-training; (3) Narrow generalizability, as training is overly c… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  24. arXiv:2406.05261  [pdf, other

    cs.CV cs.GR

    Split-and-Fit: Learning B-Reps via Structure-Aware Voronoi Partitioning

    Authors: Yilin Liu, Jiale Chen, Shanshan Pan, Daniel Cohen-Or, Hao Zhang, Hui Huang

    Abstract: We introduce a novel method for acquiring boundary representations (B-Reps) of 3D CAD models which involves a two-step process: it first applies a spatial partitioning, referred to as the ``split``, followed by a ``fit`` operation to derive a single primitive within each partition. Specifically, our partitioning aims to produce the classical Voronoi diagram of the set of ground-truth (GT) B-Rep pr… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: ACM Transactions on Graphics (SIGGRAPH 2024); Project page: https://vcc.tech/research/2024/BRepVP; Code: https://github.com/yilinliu77/NVDNet

  25. arXiv:2406.05167  [pdf, other

    cond-mat.stat-mech hep-th quant-ph

    Universal Critical Holography and Domain Wall Formation

    Authors: Tian-Chi Ma, Han-Qing Shi, Hai-Qing Zhang, Adolfo del Campo

    Abstract: Using holography, we study the universal scaling laws governing the coarsening dynamics of strongly coupled domain walls. Specifically, we studied the universal dependence of the length of the domain wall interfaces on the quench rate. The relation satisfies the Kibble-Zurek scaling shortly after the critical point. However, as time goes by, the coarsening dynamics suppresses the Kibble-Zurek scal… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 9 pages, 9 figures

  26. arXiv:2406.05127  [pdf, other

    cs.CV

    Towards Semantic Equivalence of Tokenization in Multimodal LLM

    Authors: Shengqiong Wu, Hao Fei, Xiangtai Li, Jiayi Ji, Hanwang Zhang, Tat-Seng Chua, Shuicheng Yan

    Abstract: Multimodal Large Language Models (MLLMs) have demonstrated exceptional capabilities in processing vision-language tasks. One of the crux of MLLMs lies in vision tokenization, which involves efficiently transforming input visual signals into feature representations that are most beneficial for LLMs. However, existing vision tokenizers, essential for semantic alignment between vision and language, r… ▽ More

    Submitted 27 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: Technical Report. The project page: https://chocowu.github.io/SeTok-web/

  27. arXiv:2406.05031  [pdf, other

    hep-ph astro-ph.CO nlin.PS

    Unified view of scalar and vector dark matter solitons

    Authors: Hong-Yi Zhang

    Abstract: The existence of solitons -- stable, long-lived, and localized field configurations -- is a generic prediction for ultralight dark matter. These solitons, known by various names such as boson stars, axion stars, oscillons, and Q-balls depending on the context, are typically treated as distinct entities in the literature. This study aims to provide a unified perspective on these solitonic objects f… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 17+3 pages, 5 figures

  28. arXiv:2406.04949  [pdf, other

    cs.CV cs.AI cs.LG

    Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment

    Authors: Venkanna Babu Guthula, Stefan Oehmcke, Remigio Chilaule, Hui Zhang, Nico Lang, Ankit Kariryaa, Johan Mottelson, Christian Igel

    Abstract: As low-quality housing and in particular certain roof characteristics are associated with an increased risk of malaria, classification of roof types based on remote sensing imagery can support the assessment of malaria risk and thereby help prevent the disease. To support research in this area, we release the Nacala-Roof-Material dataset, which contains high-resolution drone images from Mozambique… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  29. arXiv:2406.04762  [pdf, other

    eess.SP

    Holographic Intelligence Surface Assisted Integrated Sensing and Communication

    Authors: Zhuoyang Liu, Yuchen Zhang, Haiyang Zhang, Feng Xu, Yonina C. Eldar

    Abstract: Traditional discrete-array-based systems fail to exploit interactions between closely spaced antennas, resulting in inadequate utilization of the aperture resource. In this paper, we propose a holographic intelligence surface (HIS) assisted integrated sensing and communication (HISAC) system, wherein both the transmitter and receiver are fabricated using a continuous-aperture array. A continuous-d… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  30. arXiv:2406.04584  [pdf, other

    cs.LG cs.AI cs.CV

    CLoG: Benchmarking Continual Learning of Image Generation Models

    Authors: Haotian Zhang, Junting Zhou, Haowei Lin, Hang Ye, Jianhua Zhu, Zihao Wang, Liangcai Gao, Yizhou Wang, Yitao Liang

    Abstract: Continual Learning (CL) poses a significant challenge in Artificial Intelligence, aiming to mirror the human ability to incrementally acquire knowledge and skills. While extensive research has focused on CL within the context of classification tasks, the advent of increasingly powerful generative models necessitates the exploration of Continual Learning of Generative models (CLoG). This paper advo… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  31. arXiv:2406.04558  [pdf, other

    cs.LG math.OC

    On PI Controllers for Updating Lagrange Multipliers in Constrained Optimization

    Authors: Motahareh Sohrabi, Juan Ramirez, Tianyue H. Zhang, Simon Lacoste-Julien, Jose Gallego-Posada

    Abstract: Constrained optimization offers a powerful framework to prescribe desired behaviors in neural network models. Typically, constrained problems are solved via their min-max Lagrangian formulations, which exhibit unstable oscillatory dynamics when optimized using gradient descent-ascent. The adoption of constrained optimization techniques in the machine learning community is currently limited by the… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Published at ICML 2024. Code available at https://github.com/motahareh-sohrabi/nuPI

  32. arXiv:2406.04520  [pdf, other

    cs.CL cs.AI

    NATURAL PLAN: Benchmarking LLMs on Natural Language Planning

    Authors: Huaixiu Steven Zheng, Swaroop Mishra, Hugh Zhang, Xinyun Chen, Minmin Chen, Azade Nova, Le Hou, Heng-Tze Cheng, Quoc V. Le, Ed H. Chi, Denny Zhou

    Abstract: We introduce NATURAL PLAN, a realistic planning benchmark in natural language containing 3 key tasks: Trip Planning, Meeting Planning, and Calendar Scheduling. We focus our evaluation on the planning capabilities of LLMs with full information on the task, by providing outputs from tools such as Google Flights, Google Maps, and Google Calendar as contexts to the models. This eliminates the need for… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  33. arXiv:2406.04276  [pdf, other

    cs.LG cs.AI

    Generative AI-in-the-loop: Integrating LLMs and GPTs into the Next Generation Networks

    Authors: Han Zhang, Akram Bin Sediq, Ali Afana, Melike Erol-Kantarci

    Abstract: In recent years, machine learning (ML) techniques have created numerous opportunities for intelligent mobile networks and have accelerated the automation of network operations. However, complex network tasks may involve variables and considerations even beyond the capacity of traditional ML algorithms. On the other hand, large language models (LLMs) have recently emerged, demonstrating near-human-… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  34. arXiv:2406.04207  [pdf, other

    cs.CV

    CDMamba: Remote Sensing Image Change Detection with Mamba

    Authors: Haotian Zhang, Keyan Chen, Chenyang Liu, Hao Chen, Zhengxia Zou, Zhenwei Shi

    Abstract: Recently, the Mamba architecture based on state space models has demonstrated remarkable performance in a series of natural language processing tasks and has been rapidly applied to remote sensing change detection (CD) tasks. However, most methods enhance the global receptive field by directly modifying the scanning mode of Mamba, neglecting the crucial role that local information plays in dense p… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  35. arXiv:2406.04129  [pdf, other

    cs.CV

    LenslessFace: An End-to-End Optimized Lensless System for Privacy-Preserving Face Verification

    Authors: Xin Cai, Hailong Zhang, Chenchen Wang, Wentao Liu, **wei Gu, Tianfan Xue

    Abstract: Lensless cameras, innovatively replacing traditional lenses for ultra-thin, flat optics, encode light directly onto sensors, producing images that are not immediately recognizable. This compact, lightweight, and cost-effective imaging solution offers inherent privacy advantages, making it attractive for privacy-sensitive applications like face verification. Typical lensless face verification adopt… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: under review

  36. arXiv:2406.03933  [pdf, other

    cs.CR cs.IR

    Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation

    Authors: Honglei Zhang, Haoxuan Li, Jundong Chen, Sen Cui, Kunda Yan, Abudukelimu Wuerkaixi, Xin Zhou, Zhiqi Shen, Yidong Li

    Abstract: Federated recommendation aims to collect global knowledge by aggregating local models from massive devices, to provide recommendations while ensuring privacy. Current methods mainly leverage aggregation functions invented by federated vision community to aggregate parameters from similar clients, e.g., clustering aggregation. Despite considerable performance, we argue that it is suboptimal to appl… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  37. arXiv:2406.03794  [pdf, other

    cs.LG

    Infusing Self-Consistency into Density Functional Theory Hamiltonian Prediction via Deep Equilibrium Models

    Authors: Zun Wang, Chang Liu, Nianlong Zou, He Zhang, Xinran Wei, Lin Huang, Lijun Wu, Bin Shao

    Abstract: In this study, we introduce a unified neural network architecture, the Deep Equilibrium Density Functional Theory Hamiltonian (DEQH) model, which incorporates Deep Equilibrium Models (DEQs) for predicting Density Functional Theory (DFT) Hamiltonians. The DEQH model inherently captures the self-consistency nature of Hamiltonian, a critical aspect often overlooked by traditional machine learning app… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  38. arXiv:2406.03725  [pdf, other

    cs.CL

    LLMEmbed: Rethinking Lightweight LLM's Genuine Function in Text Classification

    Authors: Chun Liu, Hongguang Zhang, Kainan Zhao, Xinghai Ju, Lin Yang

    Abstract: With the booming of Large Language Models (LLMs), prompt-learning has become a promising method mainly researched in various research areas. Recently, many attempts based on prompt-learning have been made to improve the performance of text classification. However, most of these methods are based on heuristic Chain-of-Thought (CoT), and tend to be more complex but less efficient. In this paper, we… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: ACL 2024 main conference

  39. arXiv:2406.03108  [pdf, ps, other

    hep-ph

    Lepton flavor violating decays $Z\rightarrow l^{\pm}_{i}l^{\mp}_{j}$ in the B-L Supersymmetric Standard Model

    Authors: Jia-Peng Huo, Xing-Xing Dong, Jiao Ma, Shu-Min Zhao, Cai Guo, Hai-Bin Zhang, **-Lei Yang, Tai-Fu Feng

    Abstract: Lepton flavor violation (LFV) represents a clear new physics (NP) signal beyond the standard model (SM). In this paper, we study LFV decays $Z\rightarrow l^{\pm}_{i}l^{\mp}_{j}$ in the B-L Supersymmetric Standard Model(B-LSSM). We calculate these processes separately in the mass eigenstate basis and the electroweak interaction basis, and the latter adopt the mass insertion approximation (MIA) meth… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  40. CAMEL. II. A 3D Coronal Mass Ejection Catalog Based on Coronal Mass Ejection Automatic Detection with Deep Learning

    Authors: Jiahui Shan, Huapeng Zhang, Lei Lu, Yan Zhang, Li Feng, Yunyi Ge, Jianchao Xue, Shuting Li

    Abstract: Coronal mass ejections (CMEs) are major drivers of geomagnetic storms, which may cause severe space weather effects. Automating the detection, tracking, and three-dimensional (3D) reconstruction of CMEs is important for operational predictions of CME arrivals. The COR1 coronagraphs on board the Solar Terrestrial Relations Observatory spacecraft have facilitated extensive polarization observations,… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  41. arXiv:2406.02931  [pdf, other

    hep-ex

    Measurements of the branching fractions of the $P$-wave charmonium spin-singlet state $h_c(^1P_1) \to h^+ h^-π^0/η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

    Abstract: Based on $(2712.4\pm 14.3)\times10^{6}$ $ψ(3686)$ events, we investigate four hadronic decay modes of the $P$-wave charmonium spin-singlet state $h_c(^1P_1) \to h^+ h^- π^0/η$ ($h=π$ or $K$) via the process $ψ(3686) \to π^{0}h_c$ at BESIII. The $h_c \to π^+ π^- π^0$ decay is observed with a significance of 9.6$σ$ after taking into account systematic uncertainties. Evidences for… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 9 pages, 7 figures

  42. arXiv:2406.02616  [pdf, other

    cs.LG cs.AI

    Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A Model-Based Reinforcement Learning Approach

    Authors: Yuxuan Chen, Rongpeng Li, Xiaoxue Yu, Zhifeng Zhao, Honggang Zhang

    Abstract: Optimizing the deployment of large language models (LLMs) in edge computing environments is critical for enhancing privacy and computational efficiency. Toward efficient wireless LLM inference in edge computing, this study comprehensively analyzes the impact of different splitting points in mainstream open-source LLMs. On this basis, this study introduces a framework taking inspiration from model-… ▽ More

    Submitted 8 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  43. arXiv:2406.02603  [pdf, other

    cs.CR cs.LG

    Distortion-free Watermarks are not Truly Distortion-free under Watermark Key Collisions

    Authors: Yihan Wu, Ruibo Chen, Zhengmian Hu, Yanshuo Chen, Junfeng Guo, Hongyang Zhang, Heng Huang

    Abstract: Language model (LM) watermarking techniques inject a statistical signal into LM-generated content by substituting the random sampling process with pseudo-random sampling, using watermark keys as the random seed. Among these statistical watermarking approaches, distortion-free watermarks are particularly crucial because they embed watermarks into LM-generated content without compromising generation… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  44. arXiv:2406.02374  [pdf

    cond-mat.soft

    Direct measurement of the viscocapillary lift force near a liquid interface

    Authors: Hao Zhang, Zaicheng Zhang, Aditya Jha, Yacine Amarouchene, Thomas Salez, Thomas Guérin, Chaouqi Misbah, Abdelhamid Maali

    Abstract: Lift force of viscous origin is widespread across disciplines, from mechanics to biology. Here, we present the first direct measurement of the lift force acting on a particle moving in a viscous fluid along the liquid interface that separates two liquids. The force arises from the coupling between the viscous flow induced by the particle motion and the capillary deformation of the interface. The m… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  45. arXiv:2406.02212  [pdf, other

    cs.CE

    Generative Pre-Trained Diffusion Paradigm for Zero-Shot Time Series Forecasting

    Authors: Jiarui Yang, Tao Dai, Naiqi Li, Junxi Wu, Peiyuan Liu, **min Li, Jigang Bao, Haigang Zhang, Shutao Xia

    Abstract: In recent years, generative pre-trained paradigms such as Large Language Models (LLMs) and Large Vision Models (LVMs) have achieved revolutionary advancements and widespread real-world applications. Particularly, the emergence of pre-trained LLMs-based temporal works, compared to previous deep model approaches, has demonstrated superior generalization and robustness, showcasing the potential of ge… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  46. arXiv:2406.02125  [pdf, other

    cs.CV

    Domain Game: Disentangle Anatomical Feature for Single Domain Generalized Segmentation

    Authors: Hao Chen, Hongrun Zhang, U Wang Chan, Rui Yin, Xiaofei Wang, Chao Li

    Abstract: Single domain generalization aims to address the challenge of out-of-distribution generalization problem with only one source domain available. Feature distanglement is a classic solution to this purpose, where the extracted task-related feature is presumed to be resilient to domain shift. However, the absence of references from other domains in a single-domain scenario poses significant uncertain… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  47. arXiv:2406.01926  [pdf, ps, other

    hep-ph

    Explaining the possible 95 GeV excesses in the $B-L$ symmetric SSM

    Authors: **-Lei Yang, Ming-Hui Guo, Wen-Hui Zhang, Hai-Bin Zhang, Tai-Fu Feng

    Abstract: This study investigates the excesses observed in the diphoton and $b\bar b$ data around $95\;{\rm GeV}$ within the framework of the $B-L$ supersymmetric model (B-LSSM). Comparing with the minimal supersymmetric standard model, the B-LSSM incorporates two singlet chiral Higgs bosons which mix with the SM-like Higgs boson due to the gauge kinetic mixing effect. The richer Higgs sector indicates that… ▽ More

    Submitted 17 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 28 pages, 5 figures. arXiv admin note: text overlap with arXiv:2405.07243

  48. arXiv:2406.01879  [pdf, other

    cs.CL

    Bi-DCSpell: A Bi-directional Detector-Corrector Interactive Framework for Chinese Spelling Check

    Authors: Haiming Wu, Hanqing Zhang, Richeng Xuan, Dawei Song

    Abstract: Chinese Spelling Check (CSC) aims to detect and correct potentially misspelled characters in Chinese sentences. Naturally, it involves the detection and correction subtasks, which interact with each other dynamically. Such interactions are bi-directional, i.e., the detection result would help reduce the risk of over-correction and under-correction while the knowledge learnt from correction would h… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 12 pages, 6 figures

  49. arXiv:2406.01334  [pdf, other

    cs.CV

    HHMR: Holistic Hand Mesh Recovery by Enhancing the Multimodal Controllability of Graph Diffusion Models

    Authors: Mengcheng Li, Hongwen Zhang, Yuxiang Zhang, Ruizhi Shao, Tao Yu, Yebin Liu

    Abstract: Recent years have witnessed a trend of the deep integration of the generation and reconstruction paradigms. In this paper, we extend the ability of controllable generative models for a more comprehensive hand mesh recovery task: direct hand mesh generation, inpainting, reconstruction, and fitting in a single framework, which we name as Holistic Hand Mesh Recovery (HHMR). Our key observation is tha… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: accepted in CVPR2024, project page: https://dw1010.github.io/project/HHMR/HHMR.html

  50. arXiv:2406.01332  [pdf, ps, other

    hep-ex

    Measurements of the branching fractions of semileptonic $D^{+}_s$ decays via $e^+e^-\to D_s^{*+}D_s^{*-}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: We measure the absolute branching fractions of semileptonic $D^+_s$ decays via the $e^+e^-\to D_s^{*+}D_s^{*-}$ process using $e^+e^-$ collision data corresponding to an integrated luminosity of $10.64~\mathrm{fb}^{-1}$ collected by the BESIII detector at center-of-mass energies between 4.237 and 4.699 GeV. The branching fractions are… ▽ More

    Submitted 4 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 14 pages, 3 figures